no code implementations • 22 Jan 2016 • Amr Bakry, Ahmed Elgammal
Embedding the visual units on a manifold and using manifold kernels is one way to measure these distances.
no code implementations • 16 Nov 2015 • Mohamed Elhoseiny, Tarek El-Gaaly, Amr Bakry, Ahmed Elgammal
In the task of Object Recognition, there exists a dichotomy between the categorization of objects and estimating object pose, where the former necessitates a view-invariant representation, while the latter requires a representation capable of capturing pose information over different categories of objects.
no code implementations • 9 Aug 2015 • Amr Bakry, Mohamed Elhoseiny, Tarek El-Gaaly, Ahmed Elgammal
How does fine-tuning of a pre-trained CNN on a multi-view dataset affect the representation at each layer of the network?
no code implementations • CVPR 2013 • Amr Bakry, Ahmed Elgammal
Our approach outperforms for the speaker semi-dependent setting by at least 15% of the baseline, and competes in the other two settings.