|
The key requirement in building an appearance model from a set of
images, is the existence of a dense correspondence across the set.
This is often defined by interpolating between the correspondences
of a limited number of user-defined landmarks. Shape variation is
then represented in terms of the motions of these sets of landmark
points. Using the notation of Cootes et
al [9], the shape (configuration of landmark
points) of a single example can be represented as a vector
formed by concatenating the coordinates of the
positions of all the landmark points for that example. The texture
is represented by a vector
, formed by concatenating
the image values for the shape-free texture sampled from the
image.
In the simplest case, we model the variation of shape and texture
in terms of multivariate gaussian distributions, using Principal
Component Analysis (PCA) [15], obtaining linear
statistical models of the form:
![]() |
![]() |
![]() |
|
![]() |
(2) | ||
![]() |
![]() |
![]() |
(3) |
In generative mode, the input shape (
) and texture
(
) parameters can be varied continuously, allowing
the generation of sets of images whose statistical distribution
matches that of the training set.
In many cases, the variations of shape and texture are correlated.
If this correlation is taken into account, we then obtain a
combined statistical model of the more general form:
![]() |
![]() |
![]() |
|
![]() |
(4) | ||
![]() |
![]() |
![]() |
(5) |
Generally, we wish to distinguish between the meaningful shape
variation of the objects under consideration, and the apparent
variation in shape that is due to the positioning of the object
within the image (the pose of the imaged object). In this case,
the appearance model is generated from an (affinely) aligned set
of images. Point positions
in the original image
frame are then obtained by applying the relevant pose
transformation
:
![]() |
(6) |
In an analogous manner, we can also normalise the image set with respect to the mean image intensities and image variance,
![]() |
(7) |
As noted above, a meaningful dense groupwise correspondence is required before an appearance model can be built. NRR provides a natural method of obtaining such a correspondence, as noted by Frangi and Rueckert [11,12]. It is this link that forms the basis of our new approach to NRR evaluation.
The link between registration and modelling is further exploited in the Minimum Description Length (MDL) [16] approach to groupwise NRR, where modelling becomes an integral part of the registration process. This is of one of the registration strategies evaluated later in the paper.