Search Results for author: Bharath Hariharan

Found 71 papers, 50 papers with code

Feature Pyramid Networks for Object Detection

84 code implementations • CVPR 2017 • Tsung-Yi Lin, Piotr Dollár, Ross Girshick, Kaiming He, Bharath Hariharan, Serge Belongie

Feature pyramids are a basic component in recognition systems for detecting objects at different scales.

Ranked #3 on Pedestrian Detection on TJU-Ped-campus

Object Object Detection +1

38,291

Paper
Code

Fashionpedia: Ontology, Segmentation, and an Attribute Localization Dataset

5 code implementations • ECCV 2020 • Menglin Jia, Mengyun Shi, Mikhail Sirotenko, Yin Cui, Claire Cardie, Bharath Hariharan, Hartwig Adam, Serge Belongie

In this work we explore the task of instance segmentation with attribute localization, which unifies instance segmentation (detect and segment each object instance) and fine-grained visual attribute categorization (recognize one or multiple attributes).

Attribute Fine-Grained Visual Categorization +5

5,176

Paper
Code

PointFlow: 3D Point Cloud Generation with Continuous Normalizing Flows

12 code implementations • ICCV 2019 • Guandao Yang, Xun Huang, Zekun Hao, Ming-Yu Liu, Serge Belongie, Bharath Hariharan

Specifically, we learn a two-level hierarchy of distributions where the first level is the distribution of shapes and the second level is the distribution of points given a shape.

Ranked #4 on Point Cloud Generation on ShapeNet Car

Point Cloud Generation Variational Inference

2,392

Paper
Code

Tracking Everything Everywhere All at Once

1 code implementation • ICCV 2023 • Qianqian Wang, Yen-Yu Chang, Ruojin Cai, Zhengqi Li, Bharath Hariharan, Aleksander Holynski, Noah Snavely

We present a new test-time optimization method for estimating dense and long-range motion from a video sequence.

Motion Estimation Optical Flow Estimation

2,026

Paper
Code

Pseudo-LiDAR from Visual Depth Estimation: Bridging the Gap in 3D Object Detection for Autonomous Driving

2 code implementations • CVPR 2019 • Yan Wang, Wei-Lun Chao, Divyansh Garg, Bharath Hariharan, Mark Campbell, Kilian Q. Weinberger

However, in this paper we argue that it is not the quality of the data but its representation that accounts for the majority of the difference.

Ranked #10 on 3D Object Detection From Stereo Images on KITTI Cars Moderate

3D Object Detection From Stereo Images Autonomous Driving +2

951

Paper
Code

Visual Prompt Tuning

6 code implementations • 23 Mar 2022 • Menglin Jia, Luming Tang, Bor-Chun Chen, Claire Cardie, Serge Belongie, Bharath Hariharan, Ser-Nam Lim

The current modus operandi in adapting pre-trained models involves updating all the backbone parameters, ie, full fine-tuning.

Ranked #2 on Prompt Engineering on ImageNet-21k

Image Classification Long-tail Learning +2

904

Paper
Code

Inferring and Executing Programs for Visual Reasoning

5 code implementations • ICCV 2017 • Justin Johnson, Bharath Hariharan, Laurens van der Maaten, Judy Hoffman, Li Fei-Fei, C. Lawrence Zitnick, Ross Girshick

Existing methods for visual reasoning attempt to directly map inputs to outputs using black-box architectures without explicitly modeling the underlying reasoning processes.

Ranked #5 on Visual Question Answering (VQA) on CLEVR-Humans

Visual Question Answering (VQA) Visual Reasoning

792

Paper
Code

Unsupervised Semantic Segmentation by Distilling Feature Correspondences

3 code implementations • ICLR 2022 • Mark Hamilton, Zhoutong Zhang, Bharath Hariharan, Noah Snavely, William T. Freeman

Unsupervised semantic segmentation aims to discover and localize semantically meaningful categories within image corpora without any form of annotation.

Ranked #4 on Unsupervised Semantic Segmentation on Potsdam-3

Unsupervised Semantic Segmentation

686

Paper
Code

Pseudo-LiDAR++: Accurate Depth for 3D Object Detection in Autonomous Driving

1 code implementation • ICLR 2020 • Yurong You, Yan Wang, Wei-Lun Chao, Divyansh Garg, Geoff Pleiss, Bharath Hariharan, Mark Campbell, Kilian Q. Weinberger

In this paper we provide substantial advances to the pseudo-LiDAR framework through improvements in stereo depth estimation.

Ranked #7 on 3D Object Detection From Stereo Images on KITTI Cars Moderate

3D Object Detection From Stereo Images Autonomous Driving +2

584

Paper
Code

Emergent Correspondence from Image Diffusion

1 code implementation • NeurIPS 2023 • Luming Tang, Menglin Jia, Qianqian Wang, Cheng Perng Phoo, Bharath Hariharan

We propose a simple strategy to extract this implicit knowledge out of diffusion networks as image features, namely DIffusion FeaTures (DIFT), and use them to establish correspondences between real images.

Semantic correspondence

471

Paper
Code

Low-shot Visual Recognition by Shrinking and Hallucinating Features

4 code implementations • ICCV 2017 • Bharath Hariharan, Ross Girshick

Low-shot visual learning---the ability to recognize novel object categories from very few examples---is a hallmark of human visual intelligence.

Ranked #5 on Few-Shot Image Classification on ImageNet-FS (5-shot, all)

BIG-bench Machine Learning Few-Shot Image Classification

307

Paper
Code

CLEVR: A Diagnostic Dataset for Compositional Language and Elementary Visual Reasoning

5 code implementations • CVPR 2017 • Justin Johnson, Bharath Hariharan, Laurens van der Maaten, Li Fei-Fei, C. Lawrence Zitnick, Ross Girshick

When building artificial intelligence systems that can reason and answer questions about visual data, we need diagnostic tests to analyze our progress and discover shortcomings.

Question Answering Visual Question Answering +1

296

Paper
Code

Learning Features by Watching Objects Move

1 code implementation • CVPR 2017 • Deepak Pathak, Ross Girshick, Piotr Dollár, Trevor Darrell, Bharath Hariharan

Given the extensive evidence that motion plays a key role in the development of the human visual system, we hope that this straightforward approach to unsupervised learning will be more effective than cleverly designed 'pretext' tasks studied in the literature.

object-detection Object Detection +1

259

Paper
Code

PiCIE: Unsupervised Semantic Segmentation using Invariance and Equivariance in Clustering

2 code implementations • CVPR 2021 • Jang Hyun Cho, Utkarsh Mall, Kavita Bala, Bharath Hariharan

With our novel learning objective, our framework can learn high-level semantic concepts.

Ranked #3 on Unsupervised Semantic Segmentation on COCO-Stuff-171

Clustering Inductive Bias +1

190

Paper
Code

End-to-End Pseudo-LiDAR for Image-Based 3D Object Detection

1 code implementation • CVPR 2020 • Rui Qian, Divyansh Garg, Yan Wang, Yurong You, Serge Belongie, Bharath Hariharan, Mark Campbell, Kilian Q. Weinberger, Wei-Lun Chao

Reliable and accurate 3D object detection is a necessity for safe autonomous driving.

3D Depth Estimation 3D Object Detection +3

186

Paper
Code

Geometry Processing with Neural Fields

1 code implementation • NeurIPS 2021 • Guandao Yang, Serge Belongie, Bharath Hariharan, Vladlen Koltun

Most existing geometry processing algorithms use meshes as the default shape representation.

185

Paper
Code

Learning Gradient Fields for Shape Generation

1 code implementation • ECCV 2020 • Ruojin Cai, Guandao Yang, Hadar Averbuch-Elor, Zekun Hao, Serge Belongie, Noah Snavely, Bharath Hariharan

Point cloud generation thus amounts to moving randomly sampled points to high-density areas.

Point Cloud Generation

182

Paper
Code

Learning Feature Descriptors using Camera Pose Supervision

1 code implementation • ECCV 2020 • Qianqian Wang, Xiaowei Zhou, Bharath Hariharan, Noah Snavely

Recent research on learned visual descriptors has shown promising improvements in correspondence estimation, a key component of many 3D vision tasks.

178

Paper
Code

Hypercolumns for Object Segmentation and Fine-grained Localization

6 code implementations • CVPR 2015 • Bharath Hariharan, Pablo Arbeláez, Ross Girshick, Jitendra Malik

Recognition algorithms based on convolutional networks (CNNs) typically use the output of the last layer as feature representation.

Object Semantic Segmentation

156

Paper
Code

Doppelgangers: Learning to Disambiguate Images of Similar Structures

1 code implementation • ICCV 2023 • Ruojin Cai, Joseph Tung, Qianqian Wang, Hadar Averbuch-Elor, Bharath Hariharan, Noah Snavely

Our evaluation shows that our method can distinguish illusory matches in difficult cases, and can be integrated into SfM pipelines to produce correct, disambiguated 3D reconstructions.

3D Reconstruction Binary Classification

146

Paper
Code

Better Monocular 3D Detectors with LiDAR from the Past

2 code implementations • 8 Apr 2024 • Yurong You, Cheng Perng Phoo, Carlos Andres Diaz-Ruiz, Katie Z Luo, Wei-Lun Chao, Mark Campbell, Bharath Hariharan, Kilian Q Weinberger

Accurate 3D object detection is crucial to autonomous driving.

3D Object Detection Autonomous Driving +1

131

Paper
Code

DeepBox: Learning Objectness with Convolutional Networks

1 code implementation • ICCV 2015 • Wei-cheng Kuo, Bharath Hariharan, Jitendra Malik

Existing object proposal approaches use primarily bottom-up cues to rank proposals, while we believe that objectness is in fact a high level construct.

128

Paper
Code

Train in Germany, Test in The USA: Making 3D Object Detectors Generalize

1 code implementation • CVPR 2020 • Yan Wang, Xiangyu Chen, Yurong You, Li Erran, Bharath Hariharan, Mark Campbell, Kilian Q. Weinberger, Wei-Lun Chao

In the domain of autonomous driving, deep learning has substantially improved the 3D object detection accuracy for LiDAR and stereo camera data alike.

3D Object Detection Autonomous Driving +2

122

Paper
Code

Wasserstein Distances for Stereo Disparity Estimation

1 code implementation • NeurIPS 2020 • Divyansh Garg, Yan Wang, Bharath Hariharan, Mark Campbell, Kilian Q. Weinberger, Wei-Lun Chao

Existing approaches to depth or disparity estimation output a distribution over a set of pre-defined discrete values.

Ranked #2 on Stereo Depth Estimation on KITTI2015 (three pixel error metric)

3D Object Detection From Stereo Images Autonomous Driving +5

103

Paper
Code

Few-Shot Classification with Feature Map Reconstruction Networks

1 code implementation • CVPR 2021 • Davis Wertheimer, Luming Tang, Bharath Hariharan

In this paper we reformulate few-shot classification as a reconstruction problem in latent space.

Classification Cross-Domain Few-Shot +1

Paper
Code

Learning to Detect Mobile Objects from LiDAR Scans Without Labels

2 code implementations • CVPR 2022 • Yurong You, Katie Z Luo, Cheng Perng Phoo, Wei-Lun Chao, Wen Sun, Bharath Hariharan, Mark Campbell, Kilian Q. Weinberger

Current 3D object detectors for autonomous driving are almost entirely trained on human-annotated data.

Autonomous Driving Common Sense Reasoning +1

Paper
Code

Low-Shot Learning from Imaginary Data

1 code implementation • CVPR 2018 • Yu-Xiong Wang, Ross Girshick, Martial Hebert, Bharath Hariharan

Humans can quickly learn new visual concepts, perhaps because they can easily visualize or imagine what novel objects look like from different views.

General Classification

Paper
Code

Resource Aware Person Re-identification across Multiple Resolutions

1 code implementation • CVPR 2018 • Yan Wang, Lequn Wang, Yurong You, Xu Zou, Vincent Chen, Serena Li, Gao Huang, Bharath Hariharan, Kilian Q. Weinberger

Not all people are equally easy to identify: color statistics might be enough for some cases while others might require careful reasoning about high- and low-level details.

Ranked #12 on Person Re-Identification on CUHK03 detected

Person Re-Identification

Paper
Code

Learning Single-View 3D Reconstruction with Limited Pose Supervision

1 code implementation • ECCV 2018 • Guandao Yang, Yin Cui, Serge Belongie, Bharath Hariharan

It is expensive to label images with 3D structure or precise camera pose.

3D Reconstruction Single-View 3D Reconstruction +1

Paper
Code

LDLS: 3-D Object Segmentation Through Label Diffusion From 2-D Images

1 code implementation • 30 Oct 2019 • Brian H. Wang, Wei-Lun Chao, Yan Wang, Bharath Hariharan, Kilian Q. Weinberger, Mark Campbell

We obtain 2-D segmentation predictions by applying Mask-RCNN to the RGB image, and then link this image to a 3-D lidar point cloud by building a graph of connections among 3-D points and 2-D pixels.

Image Segmentation Point Cloud Segmentation +2

Paper
Code

Polynomial Neural Fields for Subband Decomposition and Manipulation

1 code implementation • 9 Feb 2023 • Guandao Yang, Sagie Benaim, Varun Jampani, Kyle Genova, Jonathan T. Barron, Thomas Funkhouser, Bharath Hariharan, Serge Belongie

We use this framework to design Fourier PNFs, which match state-of-the-art performance in signal representation tasks that use neural fields.

Paper
Code

Low-shot learning with large-scale diffusion

1 code implementation • CVPR 2018 • Matthijs Douze, Arthur Szlam, Bharath Hariharan, Hervé Jégou

This paper considers the problem of inferring image labels from images when only a few annotated examples are available at training time.

Ranked #6 on Few-Shot Image Classification on ImageNet-FS (1-shot, novel)

Few-Shot Image Classification graph construction

Paper
Code

Few-Shot Learning with Localization in Realistic Settings

1 code implementation • CVPR 2019 • Davis Wertheimer, Bharath Hariharan

Traditional recognition methods typically require large, artificially-balanced training classes, while few-shot learning methods are tested on artificially small ones.

Few-Shot Learning

Paper
Code

Revisiting Pose-Normalization for Fine-Grained Few-Shot Recognition

1 code implementation • CVPR 2020 • Luming Tang, Davis Wertheimer, Bharath Hariharan

Few-shot, fine-grained classification requires a model to learn subtle, fine-grained distinctions between different classes (e. g., birds) based on a few images alone.

Classification General Classification

Paper
Code

Extreme Rotation Estimation using Dense Correlation Volumes

1 code implementation • CVPR 2021 • Ruojin Cai, Bharath Hariharan, Noah Snavely, Hadar Averbuch-Elor

We present a technique for estimating the relative 3D rotation of an RGB image pair in an extreme setting, where the images have little or no overlap.

Feature Correlation

Paper
Code

Self-training for Few-shot Transfer Across Extreme Task Differences

1 code implementation • ICLR 2021 • Cheng Perng Phoo, Bharath Hariharan

Most few-shot learning techniques are pre-trained on a large, labeled "base dataset".

Few-Shot Learning Transfer Learning

Paper
Code

Hindsight is 20/20: Leveraging Past Traversals to Aid 3D Perception

1 code implementation • ICLR 2022 • Yurong You, Katie Z Luo, Xiangyu Chen, Junan Chen, Wei-Lun Chao, Wen Sun, Bharath Hariharan, Mark Campbell, Kilian Q. Weinberger

Self-driving cars must detect vehicles, pedestrians, and other traffic participants accurately to operate safely.

3D Object Detection Autonomous Driving +2

Paper
Code

When Does Self-supervision Improve Few-shot Learning?

2 code implementations • ECCV 2020 • Jong-Chyi Su, Subhransu Maji, Bharath Hariharan

We investigate the role of self-supervised learning (SSL) in the context of few-shot learning.

Few-Shot Learning Self-Supervised Learning

Paper
Code

Stay Positive: Non-Negative Image Synthesis for Augmented Reality

1 code implementation • CVPR 2021 • Katie Luo, Guandao Yang, Wenqi Xian, Harald Haraldsson, Bharath Hariharan, Serge Belongie

In applications such as optical see-through and projector augmented reality, producing images amounts to solving non-negative image generation, where one can only add light to an existing image.

Image-to-Image Translation Style Transfer

Paper
Code

GeoStyle: Discovering Fashion Trends and Events

1 code implementation • ICCV 2019 • Utkarsh Mall, Kevin Matzen, Bharath Hariharan, Noah Snavely, Kavita Bala

Understanding fashion styles and trends is of great potential interest to retailers and consumers alike.

Paper
Code

Accurate Differential Operators for Hybrid Neural Fields

1 code implementation • 10 Dec 2023 • Aditya Chetan, Guandao Yang, Zichen Wang, Steve Marschner, Bharath Hariharan

Yet in many applications like rendering and simulation, hybrid neural fields can cause noticeable and unreasonable artifacts.

Paper
Code

Can We Characterize Tasks Without Labels or Features?

1 code implementation • CVPR 2021 • Bram Wallace, Ziyang Wu, Bharath Hariharan

The problem of expert model selection deals with choosing the appropriate pretrained network ("expert") to transfer to a target task.

Model Selection

Paper
Code

Extending and Analyzing Self-Supervised Learning Across Domains

1 code implementation • ECCV 2020 • Bram Wallace, Bharath Hariharan

There has been little to no work with these methods on other smaller domains, such as satellite, textural, or biological imagery.

Representation Learning Self-Supervised Learning

Paper
Code

Unsupervised Adaptation from Repeated Traversals for Autonomous Driving

1 code implementation • 27 Mar 2023 • Yurong You, Cheng Perng Phoo, Katie Z Luo, Travis Zhang, Wei-Lun Chao, Bharath Hariharan, Mark Campbell, Kilian Q. Weinberger

For a self-driving car to operate reliably, its perceptual system must generalize to the end-user's environment -- ideally without additional annotation efforts.

3D Object Detection Autonomous Driving +2

Paper
Code

Coarsely-Labeled Data for Better Few-Shot Transfer

1 code implementation • ICCV 2021 • Cheng Perng Phoo, Bharath Hariharan

Few-shot learning is based on the premise that labels are expensive, especially when they are fine-grained and require expertise.

Few-Shot Learning Representation Learning

Paper
Code

Field-Guide-Inspired Zero-Shot Learning

1 code implementation • ICCV 2021 • Utkarsh Mall, Bharath Hariharan, Kavita Bala

Annotating the full set of attributes for a novel category proves to be a tedious and expensive task in deployment.

Attribute Zero-Shot Learning

Paper
Code

Distilling from Similar Tasks for Transfer Learning on a Budget

1 code implementation • ICCV 2023 • Kenneth Borup, Cheng Perng Phoo, Bharath Hariharan

To alleviate this, we propose a weighted multi-source distillation method to distill multiple source models trained on different domains weighted by their relevance for the target task into a single efficient model (named DistillWeighted).

Transfer Learning

Paper
Code

Pre-Training LiDAR-Based 3D Object Detectors Through Colorization

1 code implementation • 23 Oct 2023 • Tai-Yu Pan, Chenyang Ma, Tianle Chen, Cheng Perng Phoo, Katie Z Luo, Yurong You, Mark Campbell, Kilian Q. Weinberger, Bharath Hariharan, Wei-Lun Chao

Accurate 3D object detection and understanding for self-driving cars heavily relies on LiDAR point clouds, necessitating large amounts of labeled data to train.

3D Object Detection Colorization +4

Paper
Code

Iterative Instance Segmentation

no code implementations • CVPR 2016 • Ke Li, Bharath Hariharan, Jitendra Malik

Existing methods for pixel-wise labelling tasks generally disregard the underlying structure of labellings, often leading to predictions that are visually implausible.

Instance Segmentation Segmentation +2

Paper
Add Code

Exploring Person Context and Local Scene Context for Object Detection

no code implementations • 25 Nov 2015 • Saurabh Gupta, Bharath Hariharan, Jitendra Malik

In this paper we explore two ways of using context for object detection.

Object object-detection +1

Paper
Add Code

Simultaneous Detection and Segmentation

no code implementations • 7 Jul 2014 • Bharath Hariharan, Pablo Arbeláez, Ross Girshick, Jitendra Malik

Unlike classical semantic segmentation, we require individual object instances.

Ranked #5 on Object Detection on PASCAL VOC 2012

object-detection Object Detection +2

Paper
Add Code

R-CNNs for Pose Estimation and Action Detection

no code implementations • 19 Jun 2014 • Georgia Gkioxari, Bharath Hariharan, Ross Girshick, Jitendra Malik

We present convolutional neural networks for the tasks of keypoint (pose) prediction and action classification of people in unconstrained images.

Action Classification Action Detection +3

Paper
Add Code

Deep Fundamental Matrix Estimation without Correspondences

no code implementations • 3 Oct 2018 • Omid Poursaeed, Guandao Yang, Aditya Prakash, Qiuren Fang, Hanqing Jiang, Bharath Hariharan, Serge Belongie

Estimating fundamental matrices is a classic problem in computer vision.

Paper
Add Code

A Deep-Learning-Based Fashion Attributes Detection Model

1 code implementation • 24 Oct 2018 • Menglin Jia, Yichen Zhou, Mengyun Shi, Bharath Hariharan

Such information analyzing process is called abstracting, which recognize similarities or differences across all the garments and collections.

Marketing

Paper
Code

Detecting Objects using Deformation Dictionaries

no code implementations • CVPR 2014 • Bharath Hariharan, C. L. Zitnick, Piotr Dollar

Several popular and effective object detectors separately model intra-class variations arising from deformations and appearance changes.

Object

Paper
Add Code

Using k-Poselets for Detecting People and Localizing Their Keypoints

no code implementations • CVPR 2014 • Georgia Gkioxari, Bharath Hariharan, Ross Girshick, Jitendra Malik

A k-poselet is a deformable part model (DPM) with k parts, where each of the parts is a poselet, aligned to a specific configuration of keypoints based on ground-truth annotations.

Human Detection

Paper
Add Code

Boosting Supervision with Self-Supervision for Few-shot Learning

no code implementations • 17 Jun 2019 • Jong-Chyi Su, Subhransu Maji, Bharath Hariharan

We present a technique to improve the transferability of deep representations learned on small labeled datasets by introducing self-supervised tasks as auxiliary loss functions.

Few-Shot Learning Self-Supervised Learning

Paper
Add Code

Few-Shot Generalization for Single-Image 3D Reconstruction via Priors

no code implementations • ICCV 2019 • Bram Wallace, Bharath Hariharan

To address this problem, we present a new model architecture that reframes single-view 3D reconstruction as learnt, category agnostic refinement of a provided, category-specific prior.

3D Reconstruction Single-View 3D Reconstruction

Paper
Add Code

On the Efficacy of Knowledge Distillation

no code implementations • ICCV 2019 • Jang Hyun Cho, Bharath Hariharan

In this paper, we present a thorough evaluation of the efficacy of knowledge distillation and its dependence on student and teacher architectures.

Knowledge Distillation

Paper
Add Code

Augmentation-Interpolative AutoEncoders for Unsupervised Few-Shot Image Generation

no code implementations • 25 Nov 2020 • Davis Wertheimer, Omid Poursaeed, Bharath Hariharan

We aim to build image generation models that generalize to new domains from few examples.

Image Generation

Paper
Add Code

Exploiting Playbacks in Unsupervised Domain Adaptation for 3D Object Detection

no code implementations • 26 Mar 2021 • Yurong You, Carlos Andres Diaz-Ruiz, Yan Wang, Wei-Lun Chao, Bharath Hariharan, Mark Campbell, Kilian Q Weinberger

Self-driving cars must detect other vehicles and pedestrians in 3D to plan safe routes and avoid collisions.

3D Object Detection Autonomous Driving +3

Paper
Add Code

A theoretically grounded characterization of feature representations

no code implementations • 29 Sep 2021 • Bharath Hariharan, Cheng Perng Phoo

We present theoretical results showing how these measurements can be used to bound the error of the downstream classifiers, and show empirically that these bounds correlate well with actual downstream performance.

Few-Shot Learning Self-Supervised Learning +1

Paper
Add Code

Orientation-Discriminative Feature Representation for Decentralized Pedestrian Tracking

no code implementations • 26 Feb 2022 • Vikram Shree, Carlos Diaz-Ruiz, Chang Liu, Bharath Hariharan, Mark Campbell

This paper focuses on the problem of decentralized pedestrian tracking using a sensor network.

Paper
Add Code

Activation Regression for Continuous Domain Generalization with Applications to Crop Classification

1 code implementation • 14 Apr 2022 • Samar Khanna, Bram Wallace, Kavita Bala, Bharath Hariharan

Geographic variance in satellite imagery impacts the ability of machine learning models to generalise to new regions.

Crop Classification Domain Generalization +1

Paper
Code

Diagnosing and Remedying Shot Sensitivity with Cosine Few-Shot Learners

no code implementations • 7 Jul 2022 • Davis Wertheimer, Luming Tang, Bharath Hariharan

Existing approaches generally assume that the shot number at test time is known in advance.

Novel Concepts

Paper
Add Code

Ithaca365: Dataset and Driving Perception under Repeated and Challenging Weather Conditions

no code implementations • CVPR 2022 • Carlos A. Diaz-Ruiz, Youya Xia, Yurong You, Jose Nino, Junan Chen, Josephine Monica, Xiangyu Chen, Katie Luo, Yan Wang, Marc Emond, Wei-Lun Chao, Bharath Hariharan, Kilian Q. Weinberger, Mark Campbell

Advances in perception for self-driving cars have accelerated in recent years due to the availability of large-scale datasets, typically collected at specific locations and under nice weather conditions.

3D Object Detection Anomaly Detection +7

Paper
Add Code

Image-to-Image Translation for Autonomous Driving from Coarsely-Aligned Image Pairs

no code implementations • 23 Sep 2022 • Youya Xia, Josephine Monica, Wei-Lun Chao, Bharath Hariharan, Kilian Q Weinberger, Mark Campbell

In this paper, we investigate the idea of turning sensor inputs (i. e., images) captured in an adverse condition into a benign one (i. e., sunny), upon which the downstream tasks (e. g., semantic segmentation) can attain high accuracy.

Autonomous Driving Image-to-Image Translation +4

Paper
Add Code

Change-Aware Sampling and Contrastive Learning for Satellite Images

no code implementations • CVPR 2023 • Utkarsh Mall, Bharath Hariharan, Kavita Bala

While a vast amount of spatio-temporal satellite image data is readily available, most of it remains unlabelled.

Change Detection Contrastive Learning +3

Paper
Add Code

Unsupervised Domain Adaptation for Self-Driving from Past Traversal Features

no code implementations • 21 Sep 2023 • Travis Zhang, Katie Luo, Cheng Perng Phoo, Yurong You, Wei-Lun Chao, Bharath Hariharan, Mark Campbell, Kilian Q. Weinberger

Additionally, we leverage the statistics for a novel self-training process to stabilize the training.

3D Object Detection object-detection +2

Paper
Add Code

RealFill: Reference-Driven Generation for Authentic Image Completion

no code implementations • 28 Sep 2023 • Luming Tang, Nataniel Ruiz, Qinghao Chu, Yuanzhen Li, Aleksander Holynski, David E. Jacobs, Bharath Hariharan, Yael Pritch, Neal Wadhwa, Kfir Aberman, Michael Rubinstein

Once personalized, RealFill is able to complete a target image with visually compelling contents that are faithful to the original scene.

Paper
Add Code

Remote Sensing Vision-Language Foundation Models without Annotations via Ground Remote Alignment

no code implementations • 12 Dec 2023 • Utkarsh Mall, Cheng Perng Phoo, Meilin Kelsey Liu, Carl Vondrick, Bharath Hariharan, Kavita Bala

We introduce a method to train vision-language models for remote-sensing images without using any textual annotations.

Image Classification Language Modelling +4

Paper
Add Code

Cannot find the paper you are looking for? You can Submit a new open access paper.