no code implementations • 25 May 2016 • Yanxiang Chen, Yuxing Hu, Luming Zhang, Ping Li, Chao Zhang
To remedy these problems, we develop a deep architecture to learn aesthetically-relevant visual attributes from Flickr1, which are localized by multiple textual attributes in a weakly-supervised setting.