Relative attributes

Aus de_evolutionary_art_org
Wechseln zu: Navigation, Suche


Referenz

D. Parikh and K. Grauman: Relative attributes. In International Conference on Computer Vision (ICCV), 2011.

DOI

http://dx.doi.org/10.1109/ICCV.2011.6126281

Abstract

Human-nameable visual “attributes” can benefit various recognition tasks. However, existing techniques restrict these properties to categorical labels (for example, a person is `smiling' or not, a scene is `dry' or not), and thus fail to capture more general semantic relationships. We propose to model relative attributes. Given training data stating how object/scene categories relate according to different attributes, we learn a ranking function per attribute. The learned ranking functions predict the relative strength of each property in novel images. We then build a generative model over the joint space of attribute ranking outputs, and propose a novel form of zero-shot learning in which the supervisor relates the unseen object category to previously seen objects via attributes (for example, `bears are furrier than giraffes'). We further show how the proposed relative attributes enable richer textual descriptions for new images, which in practice are more precise for human interpretation. We demonstrate the approach on datasets of faces and natural scenes, and show its clear advantages over traditional binary attribute prediction for these new tasks.

Extended Abstract

Bibtex

@INPROCEEDINGS{6126281,
author={D. Parikh and K. Grauman},
booktitle={2011 International Conference on Computer Vision},
title={Relative attributes},
year={2011},
pages={503-510},
keywords={face recognition;learning (artificial intelligence);natural scenes;binary attribute prediction;categorical label;face dataset;human interpretation;human-name visual attribute;image textual description;learned ranking function;natural scene;object category;ranking function per attribute;recognition task;scene category;training data;zero-shot learning;Humans;Image recognition;Machine learning;Support vector machines;Training;Visualization;Vocabulary},
doi={10.1109/ICCV.2011.6126281},
url={http://dx.doi.org/10.1109/ICCV.2011.6126281 http://www.cs.utexas.edu/~grauman/papers/ParikhGrauman_ICCV2011_relative.pdf http://de.evo-art.org/index.php?title=Relative_attributes },
ISSN={1550-5499},
month={Nov},
}

Used References

A. Farhadi, I. Endres, D. Hoiem, and D. A. Forsyth. Describing Objects by their Attributes. CVPR, 2009. http://dx.doi.org/10.1109/CVPR.2009.5206772

C. H. Lampert, H. Nickisch, and S. Harmeling. Learning To Detect Unseen Object Classes by Between-Class Attribute Transfer. CVPR, 2009. http://dx.doi.org/10.1109/CVPR.2009.5206594

M. Rohrbach, M. Stark, G. Szarvas, I. Gurevych and B. Schiele. What Helps Where And Why? Semantic Relatedness for Knowledge Transfer. CVPR, 2010. http://dx.doi.org/10.1109/CVPR.2010.5540121

T. L. Berg, A. C. Berg and J. Shih. Automatic Attribute Discovery and Characterization from Noisy Web Data. ECCV, 2010. http://dx.doi.org/10.1007/978-3-642-15549-9_48

N. Kumar, A. Berg, P. Belhumeur and S. K. Nayar. Attribute and Simile Classifiers for Face Verification. ICCV, 2009. http://dx.doi.org/10.1109/ICCV.2009.5459250

J.Wang, K. Markert and M. Everingham. Learning Models for Object Recognition from Natural Language Descriptions. BMVC, 2009. http://dx.doi.org/10.5244/C.23.2

G. Wang and D. Forsyth. Joint Learning of Visual Attributes, Object Classes and Visual Saliency. ICCV, 2009. http://dx.doi.org/10.1109/ICCV.2009.5459194

Y. Wang and G. Mori. A Discriminative Latent Model of Object Classes and Attributes. ECCV, 2010. http://dx.doi.org/10.1007/978-3-642-15555-0_12

D. Parikh and K. Grauman. Interactively Building a Discriminative Vocabulary of Nameable Attributes. CVPR, 2011. http://dx.doi.org/10.1109/CVPR.2011.5995451

M. Palatucci, D. Pomerleau, G. Hinton and T. Mitchell. Zero-Shot Learning with Semantic Output Codes. NIPS, 2009.

A. Oliva and A. Torralba. Modeling the Shape of the Scene: A Holistic Representation of the Spatial Envelope. IJCV, 2001

N. Kumar, A. C. Berg, P. N. Belhumeur and S. K. Nayar. Attribute and Simile Classifiers for Face Verification. ICCV, 2009. http://dx.doi.org/10.1109/ICCV.2009.5459250

V. Ferrari and A. Zisserman. Learning Visual Attributes. NIPS, 2007.

O. Russakovsky and L. Fei-Fei. Attribute Learning in Large-scale Datasets. Workshop on Parts and Attributes, ECCV, 2010.

A. Farhadi, I. Endres and D. Hoiem. Attribute-centric Recognition for Crosscategory Generalization. CVPR, 2010. http://dx.doi.org/10.1109/CVPR.2010.5539924

S. Branson, C. Wah, B. Babenko, F. Schroff, P. Welinder, P. Perona P and S. Belongie. Visual Recognition with Humans in the Loop. ECCV, 2010.

R. Fergus, H. Bernal, Y. Weiss and A. Torralba. Semantic Label Sharing for Learning with Many Categories. ECCV, 2010. http://dx.doi.org/10.1007/978-3-642-15549-9_55

J. Deng, A. C. Berg, K. Li and L. Fei-Fei. What Does Classifying More Than 10,000 Image Categories Tell Us? ECCV, 2010.

G. Wang, D. Forsyth and D. Hoiem. Comparative Object Similarity for Improved Recognition with Few or No Examples. CVPR, 2010. http://dx.doi.org/10.1109/CVPR.2010.5539955

A. Gupta and L. S. Davis. Beyond Nouns: Exploiting Prepositions and Comparative Adjectives for Learning Visual Classifiers. ECCV, 2008. http://dx.doi.org/10.1007/978-3-540-88682-2_3

B. Siddiquie and A. Gupta. Beyond Active Noun Tagging: Modeling Contextual Interactions for Multi-Class Active Learning. CVPR, 2010. http://dx.doi.org/10.1109/CVPR.2010.5540044

T. Joachims. Optimizing Search Engines using Clickthrough Data. KDD, 2002. http://dx.doi.org/10.1145/775047.775067

Z. Cao, T. Qin, T. Liu, M. Tsai and H. Li. Learning to Rank: From Pairwise Approach to Listwise Approach. ICML, 2007. http://dx.doi.org/10.1145/1273496.1273513

T. Liu. Learning to Rank for Information Retrieval. Foundations and Trends in Information Retrieval, 2009. http://dx.doi.org/10.1561/1500000016

V. Jain and M. Varma. Learning to Re-Rank: Query-Dependent Image Re-Ranking Using Click Data. WWW, 2011. http://dx.doi.org/10.1145/1963405.1963447

Y. Hu, M. Li and N. Yu. Multiple-Instance Ranking: Learning to Rank Images for Image Retrieval. CVPR, 2008. http://ieeexplore.ieee.org/xpl/articleDetails.jsp?arnumber=4587352&navigation=1

E. P. Xing, A. Y. Ng, M. I. Jordan and S. Russell. Distance Metric Learning, with Application to Clustering with Side-Information. NIPS, 2002.

A. Frome, Y. Singer, F. Sha, and J. Malik. Learning Globally-Consistent Local Distance Functions for Shape-based Image retrieval and Classification. ICCV, 2007. http://dx.doi.org/10.1109/ICCV.2007.4408839

O. Chapelle. Training a Support Vector Machine in the Primal. Neural Computation, 2007. http://dx.doi.org/10.1162/neco.2007.19.5.1155

L. Fei-Fei, R. Fergus and P. Perona. A Bayesian Approach to Unsupervised One-Shot Learning of Object Categories ICCV, 2003.

M. Stark, M. Goesele and B. Schiele. A Shape-Based Object Class Model for Knowledge Transfer. ICCV, 2009. http://dx.doi.org/10.1109/ICCV.2009.5459231

A. Farhadi, M. Hejrati, M. Sadeghi, P. Young, C. Rashtchian, J. Hockenmaier, and D. Forsyth. Every Picture Tells a Story: Generating Sentences from Images. ECCV, 2010.

C. Dwork, R. Kumar, M. Naor and D. Sivakumar. Rank Aggregation Methods for the Web. WWW, 2001. http://dx.doi.org/10.1145/371920.372165

Links

Full Text

http://www.cs.utexas.edu/~grauman/papers/ParikhGrauman_ICCV2011_relative.pdf

internal file


Sonstige Links