S. Banerjee and T. Pedersen, Extended gloss overlaps as a measure of semantic relatedness, International Joint Conference on Artificial Intelligence (IJCAI'03), 2003.

H. Bannour, Une approche sémantique basée sur l'apprentissage pour la recherche d'image par contenu, COnférence en Recherche d'Informations et Applications, pp.471-478, 2009.

H. Bannour and C. Hudelot, Towards ontologies for image interpretation and annotation, 2011 9th International Workshop on Content-Based Multimedia Indexing (CBMI), pp.211-216, 2011.
DOI : 10.1109/CBMI.2011.5972547

URL : https://hal.archives-ouvertes.fr/hal-00825255

H. Bannour and C. Hudelot, Building Semantic Hierarchies Faithful to Image Semantics, Advances in Multimedia Modeling (MMM'12), pp.4-15, 2012.

URL : https://hal.archives-ouvertes.fr/hal-00740144

H. Bannour and C. Hudelot, Combinaison d'information visuelle, conceptuelle, et contextuelle pour la construction automatique de hiérarchies sémantiques adaptées à l'annotation d'images, actes de la conférence Reconnaissance des Formes et Intelligence Artificielle (RFIA'12), pp.462-469, 2012.

H. Bannour and C. Hudelot, Hierarchical image annotation using semantic hierarchies, Proceedings of the 21st ACM international conference on Information and knowledge management, CIKM '12, pp.2431-2434, 2012.
DOI : 10.1145/2396761.2398659

URL : https://hal.archives-ouvertes.fr/hal-00825214

K. Barnard, P. Duygulu, D. Forsyth, N. de Freitas et al., Matching words and pictures, Journal of Machine Learning Research, vol.3, pp.1107-1135, 2003.

E. Bart, I. Porteous, P. Perona, and M. Welling, Unsupervised learning of visual taxonomies, 2008 IEEE Conference on Computer Vision and Pattern Recognition, 2008.
DOI : 10.1109/CVPR.2008.4587620

D. M. Blei, T. L. Griffiths, M. I. Jordan, and J. B. Tenenbaum, Hierarchical topic models and the nested Chinese restaurant process, Neural Information Processing Systems, 2004.

A. Budanitsky and G. Hirst, Evaluating WordNet-based Measures of Lexical Semantic Relatedness, Computational Linguistics, vol.32, issue.1, pp.13-47, 2006.

G. Carneiro, A. B. Chan, P. J. Moreno, and N. Vasconcelos, Supervised Learning of Semantic Classes for Image Annotation and Retrieval, IEEE Transactions on Pattern Analysis and Machine Intelligence, vol.29, issue.3, pp.394-410, 2007.
DOI : 10.1109/TPAMI.2007.61

H. Cevikalp, New clustering algorithms for the support vector machine based hierarchical classification, Pattern Recognition Letters, vol.31, issue.11, pp.1285-1291, 2010.
DOI : 10.1016/j.patrec.2010.03.009

K. W. Church and P. Hanks, Word association norms, mutual information, and lexicography, Proceedings of the 27th annual meeting on Association for Computational Linguistics, pp.22-29, 1990.
DOI : 10.3115/981623.981633

C. Cortes and V. Vapnik, Support-vector networks, Machine Learning, 1995.
DOI : 10.1007/BF00994018

J. Deng, A. C. Berg, K. Li, and L. Fei-Fei, What Does Classifying More Than 10,000 Image Categories Tell Us?, European Conference on Computer Vision (ECCV'10), 2010.
DOI : 10.1007/978-3-642-15555-0_6

J. Deng, W. Dong, R. Socher, L. Li, K. Li et al., ImageNet: A large-scale hierarchical image database, Computer Vision and Pattern Recognition (CVPR'09), 2009.

T. Deselaers and V. Ferrari, Visual and semantic similarity in ImageNet, CVPR 2011, pp.1777-1784, 2011.
DOI : 10.1109/CVPR.2011.5995474

J. Fan, Y. Gao, and H. Luo, Hierarchical classification for automatic image annotation, Proceedings of the 30th annual international ACM SIGIR conference on Research and development in information retrieval, SIGIR '07, pp.111-118, 2007.
DOI : 10.1145/1277741.1277763

J. Fan, Y. Gao, and H. Luo, Integrating Concept Ontology and Multitask Learning to Achieve More Effective Classifier Training for Multilevel Image Annotation, IEEE Transactions on Image Processing, vol.17, issue.3, 2008.
DOI : 10.1109/TIP.2008.916999

J. Fan, H. Luo, Y. Shen, and C. Yang, Integrating visual and semantic contexts for topic network generation and word sense disambiguation, Proceeding of the ACM International Conference on Image and Video Retrieval, CIVR '09, 2009.
DOI : 10.1145/1646396.1646440

C. Fellbaum, WordNet: An electronic lexical database, 1998.

T. Gao and D. Koller, Discriminative learning of relaxed hierarchy for large-scale visual recognition, International Conference on Computer Vision (ICCV'11), pp.2072-2079, 2011.

G. Griffin and P. Perona, Learning and using taxonomies for fast visual categorization, 2008 IEEE Conference on Computer Vision and Pattern Recognition, 2008.
DOI : 10.1109/CVPR.2008.4587410

A. Hauptmann, R. Yan, and W. Lin, How many high-level concepts will fill the semantic gap in news video retrieval?, Proceedings of the 6th ACM international conference on Image and video retrieval, CIVR '07, pp.627-634, 2007.
DOI : 10.1145/1282280.1282369

V. Lavrenko, R. Manmatha, and J. Jeon, A model for learning the semantics of pictures, Neural Information Processing Systems (NIPS'03), 2003.

F. Li and P. Perona, A Bayesian hierarchical model for learning natural scene categories, Computer Vision and Pattern Recognition (CVPR'05), pp.524-531, 2005.

L. Li, C. Wang, Y. Lim, D. M. Blei, and F. Li, Building and using a semantivisual image hierarchy, 2010 IEEE Computer Society Conference on Computer Vision and Pattern Recognition, 2010.
DOI : 10.1109/CVPR.2010.5540027

Y. Liu, D. Zhang, G. Lu, and W. Ma, A survey of content-based image retrieval with high-level semantics, Pattern Recognition, vol.40, issue.1, pp.262-282, 2007.
DOI : 10.1016/j.patcog.2006.04.045

D. G. Lowe, Object recognition from local scale-invariant features, Proceedings of the Seventh IEEE International Conference on Computer Vision, 1999.
DOI : 10.1109/ICCV.1999.790410

M. Marszalek and C. Schmid, Semantic Hierarchies for Visual Object Recognition, 2007 IEEE Conference on Computer Vision and Pattern Recognition, pp.1-7, 2007.
DOI : 10.1109/CVPR.2007.383272

URL : https://hal.archives-ouvertes.fr/inria-00548680

M. Marszalek and C. Schmid, Constructing Category Hierarchies for Visual Recognition, European Conference on Computer Vision (ECCV'08), pp.479-491, 2008.
DOI : 10.1007/978-3-540-88693-8_35

URL : https://hal.archives-ouvertes.fr/inria-00548656

M. Naphade, J. R. Smith, J. Tesic, S. Chang, W. Hsu et al., Large-Scale Concept Ontology for Multimedia, IEEE Multimedia, vol.13, issue.3, pp.86-91, 2006.
DOI : 10.1109/MMUL.2006.63

S. Patwardhan and T. Pedersen, Using wordnet-based context vectors to estimate the semantic relatedness of concepts, Proceedings of the EACL 2006 Workshop on Making Sense of Sense: Bringing Computational Linguistics and Psycholinguistics Together, pp.1-8, 2006.

J. C. Platt, N. Cristianini, and J. Shawe-Taylor, Large margin DAGs for multiclass classification, Advances in Neural Information Processing Systems (NIPS'00), 2000.

P. Resnik, Using information content to evaluate semantic similarity in a taxonomy, International Joint Conference on Artificial Intelligence (IJCAI'95), 1995.

B. C. Russell, A. Torralba, K. P. Murphy, and W. T. Freeman, LabelMe: A Database and Web-Based Tool for Image Annotation, International Journal of Computer Vision, vol.77, issue.1-3, pp.157-173, 2008.
DOI : 10.1007/s11263-007-0090-8

J. Sivic, B. C. Russell, A. Zisserman, W. T. Freeman, and A. A. Efros, Unsupervised discovery of visual object class hierarchies, 2008 IEEE Conference on Computer Vision and Pattern Recognition, 2008.
DOI : 10.1109/CVPR.2008.4587622

A. W. Smeulders, M. Worring, S. Santini, A. Gupta, and R. Jain, Content-based image retrieval at the end of the early years, IEEE Transactions on Pattern Analysis and Machine Intelligence, vol.22, issue.12, pp.1349-1380, 2000.
DOI : 10.1109/34.895972

A. Tousch, S. Herbin, and J. Audibert, Semantic hierarchies for image annotation: A survey, Pattern Recognition, vol.45, issue.1, pp.333-345, 2012.
DOI : 10.1016/j.patcog.2011.05.017

URL : https://hal.archives-ouvertes.fr/hal-00624460

X. Wei and C. Ngo, Ontology-enriched semantic space for video search, Proceedings of the 15th international conference on Multimedia, MULTIMEDIA '07, pp.981-990, 2007.
DOI : 10.1145/1291233.1291447

L. Wu, X. Hua, N. Yu, W. Ma, and S. Li, Flickr distance, Proceeding of the 16th ACM international conference on Multimedia, MM '08, pp.31-40, 2008.
DOI : 10.1145/1459359.1459364

B. Yao, X. Yang, L. Lin, M. W. Lee, and S. C. Zhu, I2T: Image Parsing to Text Description, Proceedings of the IEEE, 2009.
DOI : 10.1109/JPROC.2010.2050411