no code implementations • ICCV 2021 • Mohammadreza Zolfaghari, Yi Zhu, Peter Gehler, Thomas Brox
Contrastive learning allows us to flexibly define powerful losses by contrasting positive pairs from sets of negative samples.
1 code implementation • 11 Dec 2020 • Yi Zhu, Xinyu Li, Chunhui Liu, Mohammadreza Zolfaghari, Yuanjun Xiong, Chongruo wu, Zhi Zhang, Joseph Tighe, R. Manmatha, Mu Li
Video action recognition is one of the representative tasks for video understanding.
1 code implementation • NeurIPS 2020 • Simon Ging, Mohammadreza Zolfaghari, Hamed Pirsiavash, Thomas Brox
Many real-world video-text tasks involve different levels of granularity, such as frames and words, clip and sentences or videos and paragraphs, each with distinct semantics.
Ranked #4 on Video Captioning on ActivityNet Captions
1 code implementation • 4 Apr 2020 • Andres Munoz, Mohammadreza Zolfaghari, Max Argus, Thomas Brox
In this paper, we present a network architecture for video generation that models spatio-temporal consistency without resorting to costly 3D architectures.
1 code implementation • 9 May 2019 • Mohammadreza Zolfaghari, Özgün Çiçek, Syed Mohsin Ali, Farzaneh Mahdisoltani, Can Zhang, Thomas Brox
Foreseeing the future is one of the key factors of intelligence.
6 code implementations • ECCV 2018 • Mohammadreza Zolfaghari, Kamaljeet Singh, Thomas Brox
In this paper, we introduce a network architecture that takes long-term content into account and enables fast per-video processing at the same time.
Ranked #66 on Action Recognition on Something-Something V1
1 code implementation • ICCV 2017 • Mohammadreza Zolfaghari, Gabriel L. Oliveira, Nima Sedaghat, Thomas Brox
In this paper, we propose a network architecture that computes and integrates the most important visual cues for action recognition: pose, motion, and the raw images.
no code implementations • 12 Dec 2016 • Nima Sedaghat, Mohammadreza Zolfaghari, Thomas Brox
With the help of a sample-variant multi-tasking architecture, the network is trained on different tasks depending on the availability of ground-truth.
no code implementations • 12 Apr 2016 • Nima Sedaghat, Mohammadreza Zolfaghari, Ehsan Amiri, Thomas Brox
In this paper, we show that the object orientation plays an important role in 3D recognition.
Ranked #2 on 3D Object Classification on ModelNet10