no code implementations • 5 Jul 2022 • Gary Leung, Jun Gao, Xiaohui Zeng, Sanja Fidler
HILA extends hierarchical vision transformer architectures by adding local connections between higher-level and lower-level features in the backbone encoder.
no code implementations • 4 Nov 2019 • Alborz Rezazadeh Sereshkeh, Gary Leung, Krish Perumal, Caleb Phillips, Minfan Zhang, Afsaneh Fazly, Iqbal Mohomed
We present VASTA, a novel vision- and language-assisted Programming By Demonstration (PBD) system for smartphone task automation.