A 2-level framework is employed for integrating lower-level and higher-level
vision features.
Given the isotropic feature vectors
and
and anisotropic feature
extracted from
a query image, and
,
and
extracted from the
image in the database, the first
level of the framework maps the feature vectors to a discriminant value within
each of the 3 categories, structure, histogram and texture.
The respective mappings
:
,
:
and
:
, where
,
and
,
are selected as
norms:
,
and
.
At the second level a supra discriminant is generated by utilizing the mapping
:
that is given as:
![]() |
(16) |
![]() |
|||
![]() |
|||
![]() |
(18) |
The above integration framework has the following advantages over a simple
concatenation of vectors
,
and
.
First, the different lengths of these three vectors preclude the proper
construction of a concatenated vector that is equally sensitive to all
of its components. The 3-dimensional vector output by
is equally sensitive to all of its three 1-dimensional components.
Second, the size of the corresponding weight vector for the concatenated
vector will be large, making the selection of proper weights difficult and
unfeasible. Third, in our proposed integration, weights are assigned at the
module level, i.e., structure, histogram and texture, whereas weights
in a concatenated vector are assigned at the vector component level without
particular regard to the modular structure of the system.
The weight vector
plays an important role in controlling the content of images retrieved. For
a given image query, different weights can be assigned to structure,
histogram and texture according to user specification to control
the images retrieved.