Color and Texture-Based Segmentation using EM and its Application to Content-Based Image Retrieval

Aus de_evolutionary_art_org
Wechseln zu: Navigation, Suche


Referenz

Serge Belongie, Chad Carson, Hayit Greenspan and Jitendra Malik: Color and Texture-Based Segmentation using EM and its Application to Content-Based Image Retrieval. In Proceedings of the Sixth International Conference on Computer Vision, Vol. 10, pp. 675-682, Jan 1998.

DOI

http://dx.doi.org/10.1109/ICCV.1998.710790

Abstract

Retrieving images from large and varied collections using image content as a key is a challenging and important problem. In this paper we present a new image representation which provides a transformation from the raw pixel data to a small set of image regions which are coherent in color and texture space. This so-called “blobworld” representation is based on segmentation using the expectation-maximization algorithm on combined color and texture features. The texture features we use for the segmentation arise from a new approach to texture description and scale selection. We describe a system that uses the blobworld representation to retrieve images. An important and unique aspect of the system is that, in the context of similarity-based querying, the user is allowed to view the internal representation of the submitted image and the query results. Similar systems do not offer the user this view into the workings of the system; consequently, the outcome of many queries on these systems can be quite inexplicable, despite the availability of knobs for adjusting the similarity metric

Extended Abstract

Bibtex

@INPROCEEDINGS{710790,
author={S. Belongie and C. Carson and H. Greenspan and J. Malik},
booktitle={Computer Vision, 1998. Sixth International Conference on},
title={Color- and texture-based image segmentation using EM and its application to content-based image retrieval},
year={1998},
pages={675-682},
keywords={image representation;image segmentation;information retrieval;query processing;content-based image retrieval;expectation-maximization algorithm;image regions;image representation;image segmentation;scale selection;similarity-based querying;texture description;texture features;Application software;Computer science;Content based retrieval;Expectation-maximization algorithms;Image databases;Image representation;Image retrieval;Image segmentation;Information retrieval;Pixel},
doi={10.1109/ICCV.1998.710790},
url={http://dx.doi.org/10.1109/ICCV.1998.710790 http://de.evo-art.org/index.php?title=Color_and_Texture-Based_Segmentation_using_EM_and_its_Application_to_Content-Based_Image_Retrieval},
month={Jan},
}

Used References

S. Ayer and H. Sawhney, "Layered representation of motion video using robust maximum-likelihood estimation of mixture models and MDL encoding", Proc. Int. Conf. Comp. Vis., pp. 777-784, 1995 http://dx.doi.org/10.1109/ICCV.1995.466859

J. Bigÿn, Local symmetry features in image processing, 1988

A. Dempster , N. Laird and D. Rubin, "Maximum likelihood from incomplete data via the EM algorithm", J. Royal Statistical Soc., vol. 39, no. 1, pp. 1-38, 1977

P. Enser, "Query analysis in a visual information retrieval context", J. Doc. and Text Management, vol. 1, no. 1, pp. 25-52, 1993

W. Förstner, "A framework for low level feature extraction", Proc. Eur. Conf. Comp. Vis., pp. 383-394, 1994 http://dx.doi.org/10.1007/BFb0028370

D. Forsyth , J. Malik and R. Wilensky, "Searching for digital pictures", Scientific American, vol. 276, no. 6, pp. 72-77, 1997 http://dx.doi.org/10.1038/scientificamerican0697-88

W. T. Freeman and E. H. Adelson, "The design and use of steerable filters", IEEE Trans. Pattern Analysis and Machine Intelligence, vol. 13, no. 9, pp. 891-906, 1991 http://dx.doi.org/10.1109/34.93808

Jonas Gårding, Tony Lindeberg, "Direct computation of shape cues using scale-adapted spatial derivative operators", Int. J. Comp. Vis., vol. 17, no. 2, pp. 163-191, 1996 http://dx.doi.org/10.1007/BF00058750

G. H. Granlund and H. Knutsson, Signal Processing for Computer Vision, 1995, Kluwer Academic Publishers http://dx.doi.org/10.1007/978-1-4757-2377-9

A. Gupta and R. Jain, "Visual information retrieval", Comm. Assoc. Comp. Mach., vol. 40, no. 5, pp. 70-79, 1997 http://dx.doi.org/10.1145/253769.253798

T. Hofmann , J. Puzicha and J. M. Buhmann, "Deterministic annealing for unsupervised texture segmentation", Proc. Int. Workshop on Energy Min. Methods in Comp. Vis. and Patt. Rec., pp. 213-228, 1997 http://dx.doi.org/10.1007/3-540-62909-2_82

A. K. Jain and F. Farrokhnia, "Unsupervised texture segmentation using Gabor filters", Pattern Recognition, vol. 24, no. 12, pp. 1167-1186, 1991 http://dx.doi.org/10.1016/0031-3203(91)90143-S

J.-S. Jang , C.-T. Sun and E. Mizutani, Neuro-Fuzzy and Soft Computing, 1997, Prentice Hall

P. Kelly , M. Cannon and D. Hush, "Query by image example: the CANDID approach", SPIE Proc. Storage and Retrieval for Image and Video Databases, pp. 238-248, 1995 http://dx.doi.org/10.1117/12.205289

T. Leung and J. Malik, "Detecting, localizing and grouping repeated scene elements from an image", Proc. Eur. Conf. Comp. Vis., pp. 546-555, 1996 http://dx.doi.org/10.1007/BFb0015565

P. Lipson , E. Grimson and P. Sinha, "Configuration based scene classification and image indexing", Proc. IEEE Comp. Soc. Conf. Comp. Vis. and Pattern Recogn., pp. 1007-1013, 1997 http://dx.doi.org/10.1109/CVPR.1997.609453

J. Malik and P. Perona, "Preattentive texture discrimination with early vision mechanisms", J. Opt. Soc. Am. A, vol. 7, no. 5, pp. 923-932, 1990 http://dx.doi.org/10.1364/JOSAA.7.000923

W. Niblack, "The QBIC project; querying images by content using colour, texture and shape", In SPIE Proc. Storage and Retrieval for Image and Video Databases, pp. 173-187, 1993

V. Ogle and M. Stonebraker, "Chabot: Retrieval from a relational database of images", IEEE Computer, vol. 28, no. 9, pp. 40-48, 1995 http://dx.doi.org/10.1109/2.410150

A. Pentland , R. Picard and S. Sclaroff, "Photobook: Content-based manipulation of image databases", Int. J. Comp. Vis., vol. 18, no. 3, pp. 233-254, 1996 http://dx.doi.org/10.1007/BF00123143

J. Ponce , A. Zisserman and M. Hebert, Object Representation in Computer Vision II, vol. 1144, 1996, Springer

J. Rissanen, "Modeling by shortest data description", Automatica, vol. 14, pp. 465-471, 1978 http://dx.doi.org/10.1016/0005-1098(78)90005-5

U. Shaft and R. Ramakrishnan, "Data modeling and querying in the PIQ image DBMS", IEEE Data Engineering Bulletin, vol. 19, no. 4, pp. 28-36, 1996

M. Swain and D. Ballard, "Color indexing", Int. J. Comp. Vis., vol. 7, no. 1, pp. 11-32, 1991 http://dx.doi.org/10.1007/BF00130487

Y. Weiss and E. Adelson, "A unified mixture framework for motion segmentation: Incorporating spatial coherence and estimating the number of models", Proc. IEEE Comp. Soc. Conf. Comp. Vis. and Pattern Recogn., pp. 321-326, 1996 http://dx.doi.org/10.1109/CVPR.1996.517092

W. Wells , R. Kikinis , W. Grimson and F. Jolesz, "Adaptive segmentation of MRI data", Int. Conf. on Comp. Vis., Virtual Reality and Robotics in Medicine, pp. 59-69, 1995 http://dx.doi.org/10.1007/BFb0034933

D. Yarowsky, "Word-sense disambiguation using statistical models of Roget\'s categories trained on large corpora", Proc. Int. Conf. Comp. Linguistics, pp. 454-460, 1992 http://dx.doi.org/10.3115/992133.992140

Links

Full Text

http://vision.cornell.edu/se3/wp-content/uploads/2014/09/00710790.pdf

internal file


Sonstige Links