TY - JOUR
T1 - Statistical cue integration in DAG deformable models
AU - Goldenstein, Siome Klein
AU - Vogler, Christian
AU - Metaxas, Dimitris
N1 - Funding Information:
This work was supported in part by an ONR N-0014-02-1-0931, an US National Science Foundation Career Award NSF-9624604, NSF EIA-98-09209, AFOSR F49620-98-1-0434, and NSBRI-NBPF00202. S. Goldenstein was partially supported by CNPq—Brazilian’s “Conselho Nacional de Desenvolvimento Científico e Tecnológico.”
PY - 2003/7
Y1 - 2003/7
N2 - Deformable models are a useful modeling paradigm in computer vision. A deformable model is a curve, a surface, or a volume, whose shape, position, and orientation are controlled through a set of parameters. They can represent manufactured objects, human faces and skeletons, and even bodies of fluid. With low-level computer vision and image processing techniques, such as optical flow, we extract relevant information from images. Then, we use this information to change the parameters of the model iteratively until we find a good approximation of the object in the images. When we have multiple computer vision algorithms providing distinct sources of information (cues), we have to deal with the difficult problem of combining these, sometimes conflicting contributions in a sensible way. In this paper, we introduce the use of a directed acyclic graph (DAG) to describe the position and Jacobian of each point of deformable models. This representation is dynamic, flexible, and allows computational optimizations that would be difficult to do otherwise. We then describe a new method for statistical cue integration method for tracking deformable models that scales well with the dimension of the parameter space. We use affine forms and affine arithmetic to represent and propagate the cues and their regions of confidence. We show that we can apply the Lindeberg theorem to approximate each cue with a Gaussian distribution, and can use a maximum-likelihood estimator to integrate them. Finally, we demonstrate the technique at work in a 3D deformable face tracking system on monocular image sequences with thousands of frames.
AB - Deformable models are a useful modeling paradigm in computer vision. A deformable model is a curve, a surface, or a volume, whose shape, position, and orientation are controlled through a set of parameters. They can represent manufactured objects, human faces and skeletons, and even bodies of fluid. With low-level computer vision and image processing techniques, such as optical flow, we extract relevant information from images. Then, we use this information to change the parameters of the model iteratively until we find a good approximation of the object in the images. When we have multiple computer vision algorithms providing distinct sources of information (cues), we have to deal with the difficult problem of combining these, sometimes conflicting contributions in a sensible way. In this paper, we introduce the use of a directed acyclic graph (DAG) to describe the position and Jacobian of each point of deformable models. This representation is dynamic, flexible, and allows computational optimizations that would be difficult to do otherwise. We then describe a new method for statistical cue integration method for tracking deformable models that scales well with the dimension of the parameter space. We use affine forms and affine arithmetic to represent and propagate the cues and their regions of confidence. We show that we can apply the Lindeberg theorem to approximate each cue with a Gaussian distribution, and can use a maximum-likelihood estimator to integrate them. Finally, we demonstrate the technique at work in a 3D deformable face tracking system on monocular image sequences with thousands of frames.
KW - Affine arithmetic
KW - Deformable model representation
KW - Deformable model tracking
KW - Directed acyclic graphs
KW - Face tracking
KW - Statistical cue integration
UR - http://www.scopus.com/inward/record.url?scp=0042349411&partnerID=8YFLogxK
UR - http://www.scopus.com/inward/citedby.url?scp=0042349411&partnerID=8YFLogxK
U2 - 10.1109/TPAMI.2003.1206510
DO - 10.1109/TPAMI.2003.1206510
M3 - Article
AN - SCOPUS:0042349411
SN - 0162-8828
VL - 25
SP - 801
EP - 813
JO - IEEE Transactions on Pattern Analysis and Machine Intelligence
JF - IEEE Transactions on Pattern Analysis and Machine Intelligence
IS - 7
ER -