no code implementations • 22 Sep 2024 • Yuxiao Chen, Kai Li, Wentao Bao, Deep Patel, Yu Kong, Martin Renqiang Min, Dimitris N. Metaxas
Learning to localize temporal boundaries of procedure steps in instructional videos is challenging due to the limited availability of annotated large-scale training videos.
no code implementations • 23 Aug 2024 • Alexandru Niculescu-Mizil, Deep Patel, Iain Melvin
MCTR leverages end-to-end detectors like DEtector TRansformer (DETR) to produce detections and detection embeddings independently for each camera view.
no code implementations • CVPR 2024 • Che-Jui Chang, Danrui Li, Deep Patel, Parth Goel, Honglu Zhou, Seonghyeon Moon, Samuel S. Sohn, Sejong Yoon, Vladimir Pavlovic, Mubbasir Kapadia
The study of complex human interactions and group activities has become a focal point in human-centric computer vision.
1 code implementation • 13 Sep 2023 • Christoph Reich, Biplob Debnath, Deep Patel, Srimat Chakradhar
the input image, the JPEG quality, the quantization tables, and the color conversion parameters.
no code implementations • 30 Aug 2023 • Christoph Reich, Biplob Debnath, Deep Patel, Tim Prangemeier, Daniel Cremers, Srimat Chakradhar
To overcome the deterioration of vision performance, this paper presents the first end-to-end learnable deep video codec control that considers both bandwidth constraints and downstream deep vision performance, while adhering to existing standardization.
no code implementations • 29 Jun 2023 • Che-Jui Chang, Danrui Li, Deep Patel, Parth Goel, Honglu Zhou, Seonghyeon Moon, Samuel S. Sohn, Sejong Yoon, Vladimir Pavlovic, Mubbasir Kapadia
The study of complex human interactions and group activities has become a focal point in human-centric computer vision.
no code implementations • 16 May 2023 • Yi Huang, Asim Kadav, Farley Lai, Deep Patel, Hans Peter Graf
Specifically, KeyNet introduces the use of object based keypoint information to capture context in the scene.
no code implementations • CVPR 2023 • Kai Li, Deep Patel, Erik Kruus, Martin Renqiang Min
Source-free domain adaptation (SFDA) is an emerging research topic that studies how to adapt a pretrained source model using unlabeled target data.
Source-Free Domain Adaptation Unsupervised Domain Adaptation
1 code implementation • 21 Jul 2021 • Deep Patel, P. S. Sastry
Deep Neural Networks, often owing to the overparameterization, are shown to be capable of exactly memorizing even randomly labelled data.
no code implementations • 7 Jul 2021 • Deep Patel, Erin Gao, Anirudh Koul, Siddha Ganju, Meher Anand Kasam
Collecting fully annotated datasets is challenging, especially for large scale satellite systems such as the unlabeled NASA's 35 PB Earth Imagery dataset.
1 code implementation • 29 Jun 2021 • Deep Patel, P. S. Sastry
Deep Neural Networks (DNNs) have been shown to be susceptible to memorization or overfitting in the presence of noisily-labelled data.
Ranked #38 on Image Classification on Clothing1M
Image Classification Image Classification with Label Noise +1