no code implementations • 2 Nov 2023 • Anqi Joyce Yang, Sergio Casas, Nikita Dvornik, Sean Segal, Yuwen Xiong, Jordan Sir Kwang Hu, Carter Fang, Raquel Urtasun
Auto-labels are most commonly generated via a two-stage approach -- first objects are detected and tracked over time, and then each object trajectory is passed to a learned refinement model to improve accuracy.
1 code implementation • ICCV 2023 • Mohamed Ashraf Abdelsalam, Samrudhdhi B. Rangrej, Isma Hadji, Nikita Dvornik, Konstantinos G. Derpanis, Afsaneh Fazly
While most previous work focus on the problem of data scarcity in procedural video datasets, another core challenge of future anticipation is how to account for multiple plausible future realizations in natural settings.
no code implementations • 27 May 2023 • Liquan Wang, Nikita Dvornik, Rafael Dubeau, Mayank Mittal, Animesh Garg
We show in the experiments that such affordance learning predicts interaction which covers most modes of interaction for the querying articulated object and can be fine-tuned to a goal-conditional model.
no code implementations • CVPR 2023 • Nikita Dvornik, Isma Hadji, Ran Zhang, Konstantinos G. Derpanis, Animesh Garg, Richard P. Wildes, Allan D. Jepson
This motivates the need to temporally localize the instruction steps in such videos, i. e. the task called key-step localization.
1 code implementation • 31 Oct 2022 • Avery Ma, Nikita Dvornik, Ran Zhang, Leila Pishdad, Konstantinos G. Derpanis, Afsaneh Fazly
For image classification, the most popular data augmentation techniques range from simple photometric and geometrical transformations, to more complex methods that use visual saliency to craft new training examples.
1 code implementation • 12 Oct 2022 • Ziyi Wu, Nikita Dvornik, Klaus Greff, Thomas Kipf, Animesh Garg
While recent object-centric models can successfully decompose a scene into objects, modeling their dynamics effectively still remains a challenge.
1 code implementation • 10 Oct 2022 • Nikita Dvornik, Isma Hadji, Hai Pham, Dhaivat Bhatt, Brais Martinez, Afsaneh Fazly, Allan D. Jepson
In this setup, we seek the optimal step ordering consistent with the procedure flow graph and a given video.
1 code implementation • CVPR 2022 • He Zhao, Isma Hadji, Nikita Dvornik, Konstantinos G. Derpanis, Richard P. Wildes, Allan D. Jepson
Our model is based on a transformer equipped with a memory module, which maps the start and goal observations to a sequence of plausible actions.
no code implementations • NeurIPS 2021 • Nikita Dvornik, Isma Hadji, Konstantinos G. Derpanis, Animesh Garg, Allan D. Jepson
In our experiments, we show that Drop-DTW is a robust similarity measure for sequence retrieval and demonstrate its effectiveness as a training loss on diverse applications.
1 code implementation • ECCV 2020 • Nikita Dvornik, Cordelia Schmid, Julien Mairal
Popular approaches for few-shot classification consist of first learning a generic data representation based on a large annotated dataset, before adapting the representation to new classes given only a few labeled samples.
Ranked #4 on Few-Shot Image Classification on Meta-Dataset Rank
1 code implementation • ICCV 2019 • Nikita Dvornik, Cordelia Schmid, Julien Mairal
Few-shot classification consists of learning a predictive model that is able to effectively adapt to a new class, given only a few annotated samples.
no code implementations • 6 Sep 2018 • Nikita Dvornik, Julien Mairal, Cordelia Schmid
In this work, we consider object detection, semantic and instance segmentation and augment the training images by blending objects in existing scenes, using instance segmentation annotations.
2 code implementations • ECCV 2018 • Nikita Dvornik, Julien Mairal, Cordelia Schmid
For this approach to be successful, we show that modeling appropriately the visual context surrounding objects is crucial to place them in the right environment.
2 code implementations • ICCV 2017 • Nikita Dvornik, Konstantin Shmelkov, Julien Mairal, Cordelia Schmid
Real-time scene understanding has become crucial in many applications such as autonomous driving.
Ranked #2 on Real-Time Object Detection on PASCAL VOC 2007