no code implementations • CVPR 2023 • Cong Wei, Brendan Duke, Ruowei Jiang, Parham Aarabi, Graham W. Taylor, Florian Shkurti
Equipped with the learned unstructured attention pattern, sparse attention ViT (Sparsifiner) produces a superior Pareto-optimal trade-off between FLOPs and top-1 accuracy on ImageNet compared to token sparsity.
1 code implementation • 1 Sep 2022 • Zikun Chen, Ruowei Jiang, Brendan Duke, Han Zhao, Parham Aarabi
Generative Adversarial Networks (GANs) have been widely applied in modeling diverse image distributions.
no code implementations • 12 May 2022 • Robin Kips, Ruowei Jiang, Sileye Ba, Brendan Duke, Matthieu Perrot, Pietro Gori, Isabelle Bloch
In this paper we propose a novel framework based on deep learning to build a real-time inverse graphics encoder that learns to map a single example image into the parameter space of a given augmented reality rendering engine.
no code implementations • 31 Mar 2021 • Eu Wern Teh, Terrance DeVries, Brendan Duke, Ruowei Jiang, Parham Aarabi, Graham W. Taylor
We further show that GIST and RIST can be combined with existing semi-supervised learning methods to boost performance.
1 code implementation • CVPR 2021 • Rohit Saha, Brendan Duke, Florian Shkurti, Graham W. Taylor, Parham Aarabi
Therefore, we propose Latent Optimization of Hairstyles via Orthogonalization (LOHO), an optimization-based approach using GAN inversion to infill missing hair structure details in latent space during hairstyle transfer.
1 code implementation • CVPR 2021 • Brendan Duke, Abdalla Ahmed, Christian Wolf, Parham Aarabi, Graham W. Taylor
SST extracts per-pixel representations for each object in a video using sparse attention over spatiotemporal features.
no code implementations • 5 Jun 2019 • TianXing Li, Zhi Yu, Edmund Phung, Brendan Duke, Irina Kezele, Parham Aarabi
Recent works on convolutional neural networks (CNNs) for facial alignment have demonstrated unprecedented accuracy on a variety of large, publicly available datasets.
no code implementations • 5 Jun 2019 • Brendan Duke, Abdalla Ahmed, Edmund Phung, Irina Kezele, Parham Aarabi
We also provide a postprocessing and rendering algorithm for nail polish try-on, which integrates with our semantic segmentation and fingernail base-tip direction predictions.
no code implementations • 26 Mar 2018 • Brendan Duke, Graham W. Taylor
We propose a generalized class of multimodal fusion operators for the task of visual question answering (VQA).