1 code implementation • 5 Sep 2022 • Asish Bera, Zachary Wharton, Yonghuai Liu, Nik Bessis, Ardhendu Behera
Over the past few years, significant progress has been made in image recognition based on deep convolutional neural networks (CNNs).
Ranked #1 on Fine-Grained Image Classification on Stanford Dogs
1 code implementation • 23 Oct 2021 • Asish Bera, Zachary Wharton, Yonghuai Liu, Nik Bessis, Ardhendu Behera
We address this by proposing an end-to-end CNN model, which learns meaningful features linking fine-grained changes using our novel attention mechanism.
Ranked #1 on Image Classification on Caltech-256
no code implementations • 23 Oct 2021 • Zachary Wharton, Ardhendu Behera, Asish Bera
Convolutional Neural Networks (CNNs) have revolutionized the understanding of visual content.
no code implementations • 17 Jan 2021 • Zachary Wharton, Ardhendu Behera, Yonghuai Liu, Nik Bessis
Our model is named Coarse Temporal Attention Network (CTA-Net), in which coarse temporal branches are introduced in a trainable glimpse network.
1 code implementation • 17 Jan 2021 • Ardhendu Behera, Zachary Wharton, Pradeep Hewage, Asish Bera
We evaluate our approach using six state-of-the-art (SotA) backbone networks and eight benchmark datasets.
Ranked #1 on Fine-Grained Image Classification on Food-101
no code implementations • 17 Jan 2021 • Ardhendu Behera, Zachary Wharton, Morteza Ghahremani, Swagat Kumar, Nik Bessis
Affect is often expressed via non-verbal body language such as actions and gestures, which are vital indicators of human behavior.
Facial Expression Recognition (FER)