TY - GEN
T1 - Rapid
T2 - 2014 ACM Conference on Multimedia, MM 2014
AU - Lu, Xin
AU - Lin, Zhe
AU - Jin, Hailin
AU - Yang, Jianchao
AU - Wang, James Z.
PY - 2014/11/3
Y1 - 2014/11/3
N2 - Effective visual features are essential for computational aesthetic quality rating systems. Existing methods used machine learning and statistical modeling techniques on handcrafted features or generic image descriptors. A recentlypublished large-scale dataset, the AVA dataset, has further empowered machine learning based approaches. We present the RAPID (RAting PIctorial aesthetics using Deep learning) system, which adopts a novel deep neural network approach to enable automatic feature learning. The central idea is to incorporate heterogeneous inputs generated from the image, which include a global view and a local view, and to unify the feature learning and classifier training using a double-column deep convolutional neural network. In addition, we utilize the style attributes of images to help improve the aesthetic quality categorization accuracy. Experimental results show that our approach significantly outperforms the state of the art on the AVA dataset.
AB - Effective visual features are essential for computational aesthetic quality rating systems. Existing methods used machine learning and statistical modeling techniques on handcrafted features or generic image descriptors. A recentlypublished large-scale dataset, the AVA dataset, has further empowered machine learning based approaches. We present the RAPID (RAting PIctorial aesthetics using Deep learning) system, which adopts a novel deep neural network approach to enable automatic feature learning. The central idea is to incorporate heterogeneous inputs generated from the image, which include a global view and a local view, and to unify the feature learning and classifier training using a double-column deep convolutional neural network. In addition, we utilize the style attributes of images to help improve the aesthetic quality categorization accuracy. Experimental results show that our approach significantly outperforms the state of the art on the AVA dataset.
UR - https://www.scopus.com/pages/publications/84913588188
UR - https://www.scopus.com/pages/publications/84913588188#tab=citedBy
U2 - 10.1145/2647868.2654927
DO - 10.1145/2647868.2654927
M3 - Conference contribution
AN - SCOPUS:84913588188
T3 - MM 2014 - Proceedings of the 2014 ACM Conference on Multimedia
SP - 457
EP - 466
BT - MM 2014 - Proceedings of the 2014 ACM Conference on Multimedia
PB - Association for Computing Machinery
Y2 - 3 November 2014 through 7 November 2014
ER -