1 code implementation • 30 Nov 2023 • Dominick Reilly, Srijan Das
To facilitate the adoption of video transformers for ADL, we hypothesize that the augmentation of RGB with human pose information, known for its sensitivity to fine-grained motion and multiple viewpoints, is essential.
Ranked #1 on Action Classification on Toyota Smarthome dataset (using extra training data)
1 code implementation • 31 Oct 2023 • Srijan Das, Tanmay Jain, Dominick Reilly, Pranav Balaji, Soumyajit Karmakar, Shyam Marjit, Xiang Li, Abhijit Das, Michael S. Ryoo
We explore the appropriate SSL tasks that can be optimized alongside the primary task, the training schemes for these tasks, and the data scale at which they can be most effective.
1 code implementation • 15 Jun 2023 • Dominick Reilly, Aman Chadha, Srijan Das
Both PAAT and PAAB surpass their respective backbone Transformers by up to 9. 8% in real-world action recognition and 21. 8% in multi-view robotic video alignment.