Using complexity estimates in aesthetic image classification

Aus de_evolutionary_art_org
Wechseln zu: Navigation, Suche


Romero, J., Machado, P., Carballal, A., Santos, A.: Using complexity estimates in aesthetic image classification. Journal of Mathematics and the Arts 6(2-3), 125–136 (2012)



In recent years, the search for computational systems that classify images based on aesthetic properties has gained momentum. Such systems have a wide range of potential applications, including image search, organization, acquisition and generation. This work explores the use of complexity estimates to predict the aesthetic merit of photographs. We use a set of image metrics and two different classifiers. Our approach classifies images gathered from a photography web site, attempting to reproduce the evaluation made by a group of users. For this purpose, we use complexity estimate metrics based on the encoding size and compression error of JPEG and fractal compression, which are applied to the original value channel and to the images resulting from applying Sobel and Canny filters to this channel. By employing these estimates, in conjunction with the average and standard deviation of the value channel, i.e., 20 features, a success rate of 74.59% was attained. Using the three most influential features yields a success rate of 71.34%, which is competitive with the best results reported in the literature, 71.44%, using the same dataset.

Extended Abstract


Used References

[1] R. Arnheim, Art and visual perception, a psychology of the creative eye, Faber and Faber, London 1956.

[2] S. Baluja, D. Pomerleau, and T. Jochem, Towards automated artificial evolution for computer-generated images, Connection Science 6 (1994), pp. 325–354.

[3] G.D. Birkhoff, Aesthetic measure, Harvard University Press 1933.

[4] J. Canny, A computational approach to edge detection, IEEE Trans. Pattern Anal. Mach. Intell. 8 (1986), pp. 679–698.

[5] C.C. Chang and C.J. Lin (2001), software available at

[6] R. Datta and J.Z. Wang, ACQUINE: aesthetic quality inference engine - real-time automatic rating of photo aesthetics., in Multimedia Information Retrieval, URL, ACM, 2010, pp. 421–424.

[7] R. Datta, D. Joshi, J. Li, and J.Z. Wang, Studying Aesthetics in Photographic Images Using a Computational Ap- proach., in Computer Vision – ECCV 2006, 9th European Conference on Computer Vision, Part III, LNCS, Springer, Graz, Austria, 2006, pp. 288–301.

[8] R. Datta, D. Joshi, J. Li, and J.Z. Wang, Image retrieval: Ideas, influences, and trends of the new age, ACM Comput. Surv. 40 (2008), pp. 5:1–5:60, URL

[9] H.J. Eysenck, The empirical determination of an aesthetic formula, Psychological Review 48 (1941), pp. 83–92.

[10] H.J. Eysenck, The experimental study of the ’Good Gestalt’ - A new approach, Psychological Review 49 (1942), pp. 344–363.

[11] A. Forsythe, M. Nadal, N. Sheehy, C.J. Cela-Conde, and M. Sawey, Predicting beauty: Fractal dimension and vi- sual complexity in art, British Journal of Psychology 102 (2011), pp. 49–70, URL

[12] M. Graves, Design judgement test, manual, The Psychological Corporation, New York 1948.

[13] I. Guyon and A. Elisseeff, An introduction to variable and feature selection, J. Mach. Learn. Res. 3 (2003), pp. 1157– 1182, URL

[14] Y. Ke, X. Tang, and F. Jing, The Design of High-Level Features for Photo Quality Assessment, Computer Vision and Pattern Recognition, IEEE Computer Society Conference on 1 (2006), pp. 419–426.

[15] Y. Luo and X. Tang, Photo and Video Quality Evaluation: Focusing on the Subject, in ECCV (3), Lecture Notes in Computer Science, Vol. 5304, Springer, 2008, pp. 386–399.

[16] P. Machado and A. Cardoso, Computing Aesthetics, in Proceedings of the XIVth Brazilian Symposium on Artificial Intelligence: Advances in Artificial Intelligence, LNCS, Vol. 1515, Springer, Porto Alegre, Brazil, 1998, pp. 219–229.

[17] P. Machado, J. Romero, and B. Manaris, Experiments in computational aesthetics: An iterative approach to stylistic change in evolutionary art, in The Art of Artificial Evolution: A Handbook on Evolutionary Art and Music, J. Romero and P. Machado, eds., Springer Berlin Heidelberg, 2007, pp. 381–415.

[18] P. Machado, J. Romero, A. Santos, A. Cardoso, and B. Manaris, Adaptive Critics for Evolutionary Artists, in Applica- tions of Evolutionary Computing, EvoWorkshops 2004: EvoBIO, EvoCOMNET, EvoHOT, EvoIASP, EvoMUSART, EvoSTOC, LNCS, Vol. 3005, 5-7 Apr., Springer, Coimbra, Portugal, 2004, pp. 435–444.

[19] N.C. Meier, Art in human affairs, McGraw-Hill 1942.

[20] A. Moles, Thorie de l’information et perception esthtique, Denoel 1958.

[21] J. Romero, P. Machado, A. Carballal, and O. Osorio, Aesthetic Classification and Sorting Based on Image Compression, in EvoApplications (2), Lecture Notes in Computer Science, Vol. 6625, Springer, 2011, pp. 394–403.

[22] R. Saunders and J.S. Gero, Curious agents and situated design evaluations, AI EDAM 18 (2004), pp. 153–161.

[23] I. Sobel, An isotropic 3 x 3 image gradient operator, Machine Vision for Three-Dimensional Scenes (1990), pp. 376–379.

[24] N. Svang ̊ard and P. Nordin, Automated Aesthetic Selection of Evolutionary Art by Distance Based Classification of Genomes and Phenomes using the Universal Similarity Metric, in Applications of Evolutionary Computing, EvoWork- shops 2004: EvoBIO, EvoCOMNET, EvoHOT, EvoIASP, EvoMUSART, EvoSTOC, LNCS, Vol. 3005, Springer, Coim- bra, Portugal, 2004, pp. 445–454.

[25] H. Tong, M. Li, H. Zhang, J. He, and C. Zhang, Classification of Digital Photos Taken by Photographers or Home Users, in PCM (1), Lecture Notes in Computer Science, Vol. 3332, Springer, 2004, pp. 198–205.

[26] V. Vapnik, An overview of statistical learning theory, IEEE Transactions on Neural Networks 10 (1999), pp. 988–999. [27] I.H. Witten and E. Frank, Data mining: practical machine learning tools and techniques with java implementations, SIGMOD Rec. 31 (2002), pp. 76–77.

[28] A. Zell, G. Mamier, M. Vogt, N. Mache, R. H ̈ubner, S. D ̈oring, K.U. Herrmann, T. Soyez, M. Schmalzl, T. Sommer, et al., SNNS: Stuttgart Neural Network Simulator User Manual, version 4.2, Tech. Rep. 3/92, University of Stuttgart, Stuttgart, 2003.


Full Text

intern file

Sonstige Links