Search Results for author: Haochen Wang

Found 23 papers, 14 papers with code

Few-Shot Semantic Segmentation with Democratic Attention Networks

no code implementations ECCV 2020 Haochen Wang, Xu-Dong Zhang, Yutao Hu, Yandan Yang, Xian-Bin Cao, Xian-Tong Zhen

The crux of few-shot segmentation is to extract object information from the support image and then propagate it to guide the segmentation of query images.

Few-Shot Semantic Segmentation Graph Attention +2

Alpha Invariance: On Inverse Scaling Between Distance and Volume Density in Neural Radiance Fields

no code implementations2 Apr 2024 Joshua Ahn, Haochen Wang, Raymond A. Yeh, Greg Shakhnarovich

Scale-ambiguity in 3D scene dimensions leads to magnitude-ambiguity of volumetric densities in neural radiance fields, i. e., the densities double when scene size is halved, and vice versa.

Dynamic Prototype Adaptation with Distillation for Few-shot Point Cloud Segmentation

no code implementations29 Jan 2024 Jie Liu, Wenzhe Yin, Haochen Wang, Yunlu Chen, Jan-Jakob Sonke, Efstratios Gavves

Existing prototype-based methods rely on support prototypes to guide the segmentation of query point clouds, but they encounter challenges when significant object variations exist between the support prototypes and query features.

Point Cloud Segmentation Transfer Learning

Bootstrap Masked Visual Modeling via Hard Patches Mining

1 code implementation21 Dec 2023 Haochen Wang, Junsong Fan, Yuxi Wang, Kaiyou Song, Tiancai Wang, Xiangyu Zhang, Zhaoxiang Zhang

To empower the model as a teacher, we propose Hard Patches Mining (HPM), predicting patch-wise losses and subsequently determining where to mask.

DropPos: Pre-Training Vision Transformers by Reconstructing Dropped Positions

1 code implementation NeurIPS 2023 Haochen Wang, Junsong Fan, Yuxi Wang, Kaiyou Song, Tong Wang, Zhaoxiang Zhang

As it is empirically observed that Vision Transformers (ViTs) are quite insensitive to the order of input tokens, the need for an appropriate self-supervised pretext task that enhances the location awareness of ViTs is becoming evident.

Position

Using Unreliable Pseudo-Labels for Label-Efficient Semantic Segmentation

no code implementations4 Jun 2023 Haochen Wang, Yuchao Wang, Yujun Shen, Junsong Fan, Yuxi Wang, Zhaoxiang Zhang

A common practice is to select the highly confident predictions as the pseudo-ground-truths for each pixel, but it leads to a problem that most pixels may be left unused due to their unreliability.

Semantic Segmentation

Balancing Logit Variation for Long-tailed Semantic Segmentation

1 code implementation CVPR 2023 Yuchao Wang, Jingjing Fei, Haochen Wang, Wei Li, Tianpeng Bao, Liwei Wu, Rui Zhao, Yujun Shen

In this way, we manage to close the gap between the feature areas of different categories, resulting in a more balanced representation.

Semantic Segmentation

Pulling Target to Source: A New Perspective on Domain Adaptive Semantic Segmentation

no code implementations23 May 2023 Haochen Wang, Yujun Shen, Jingjing Fei, Wei Li, Liwei Wu, Yuxi Wang, Zhaoxiang Zhang

To this end, we propose T2S-DA, which we interpret as a form of pulling Target to Source for Domain Adaptation, encouraging the model in learning similar cross-domain features.

Domain Generalization Semantic Segmentation

PiClick: Picking the desired mask in click-based interactive segmentation

1 code implementation23 Apr 2023 Cilin Yan, Haochen Wang, Jie Liu, XiaoLong Jiang, Yao Hu, Xu Tang, Guoliang Kang, Efstratios Gavves

Click-based interactive segmentation aims to generate target masks via human clicking, which facilitates efficient pixel-level annotation and image editing.

Interactive Segmentation Segmentation

Hard Patches Mining for Masked Image Modeling

1 code implementation CVPR 2023 Haochen Wang, Kaiyou Song, Junsong Fan, Yuxi Wang, Jin Xie, Zhaoxiang Zhang

We observe that the reconstruction loss can naturally be the metric of the difficulty of the pre-training task.

Towards Open-Vocabulary Video Instance Segmentation

1 code implementation ICCV 2023 Haochen Wang, Cilin Yan, Shuai Wang, XiaoLong Jiang, Xu Tang, Yao Hu, Weidi Xie, Efstratios Gavves

Video Instance Segmentation (VIS) aims at segmenting and categorizing objects in videos from a closed set of training categories, lacking the generalization ability to handle novel categories in real-world videos.

Instance Segmentation Segmentation +3

Few-shot Semantic Segmentation with Support-induced Graph Convolutional Network

no code implementations9 Jan 2023 Jie Liu, Yanqi Bao, Wenzhe Yin, Haochen Wang, Yang Gao, Jan-Jakob Sonke, Efstratios Gavves

However, the appearance variations between objects from the same category could be extremely large, leading to unreliable feature matching and query mask prediction.

Few-Shot Semantic Segmentation

LoopDraw: a Loop-Based Autoregressive Model for Shape Synthesis and Editing

no code implementations9 Dec 2022 Nam Anh Dinh, Haochen Wang, Greg Shakhnarovich, Rana Hanocka

There is no settled universal 3D representation for geometry with many alternatives such as point clouds, meshes, implicit functions, and voxels to name a few.

Score Jacobian Chaining: Lifting Pretrained 2D Diffusion Models for 3D Generation

1 code implementation CVPR 2023 Haochen Wang, Xiaodan Du, Jiahao Li, Raymond A. Yeh, Greg Shakhnarovich

We propose to apply chain rule on the learned gradients, and back-propagate the score of a diffusion model through the Jacobian of a differentiable renderer, which we instantiate to be a voxel radiance field.

3D Generation Text to 3D

NFormer: Robust Person Re-identification with Neighbor Transformer

1 code implementation CVPR 2022 Haochen Wang, Jiayi Shen, Yongtuo Liu, Yan Gao, Efstratios Gavves

To tackle this issue, we propose a Neighbor Transformer Network, or NFormer, which explicitly models interactions across all input images, thus suppressing outlier features and leading to more robust representations overall.

Person Re-Identification Representation Learning

Semi-Supervised Semantic Segmentation Using Unreliable Pseudo-Labels

1 code implementation CVPR 2022 Yuchao Wang, Haochen Wang, Yujun Shen, Jingjing Fei, Wei Li, Guoqiang Jin, Liwei Wu, Rui Zhao, Xinyi Le

A common practice is to select the highly confident predictions as the pseudo ground-truth, but it leads to a problem that most pixels may be left unused due to their unreliability.

Semi-Supervised Semantic Segmentation

Decoupled IoU Regression for Object Detection

no code implementations2 Feb 2022 Yan Gao, Qimeng Wang, Xu Tang, Haochen Wang, Fei Ding, Jing Li, Yao Hu

Prior works propose to predict Intersection-over-Union (IoU) between bounding boxes and corresponding ground-truths to improve NMS, while accurately predicting IoU is still a challenging problem.

Object object-detection +2

Attentional Prototype Inference for Few-Shot Segmentation

1 code implementation14 May 2021 Haoliang Sun, Xiankai Lu, Haochen Wang, Yilong Yin, XianTong Zhen, Cees G. M. Snoek, Ling Shao

We define a global latent variable to represent the prototype of each object category, which we model as a probabilistic distribution.

Bayesian Inference Few-Shot Semantic Segmentation +2

SwiftNet: Real-time Video Object Segmentation

1 code implementation CVPR 2021 Haochen Wang, XiaoLong Jiang, Haibing Ren, Yao Hu, Song Bai

In this work we present SwiftNet for real-time semisupervised video object segmentation (one-shot VOS), which reports 77. 8% J &F and 70 FPS on DAVIS 2017 validation dataset, leading all present solutions in overall accuracy and speed performance.

Object Segmentation +3

DIODE: A Dense Indoor and Outdoor DEpth Dataset

2 code implementations1 Aug 2019 Igor Vasiljevic, Nick Kolkin, Shanyi Zhang, Ruotian Luo, Haochen Wang, Falcon Z. Dai, Andrea F. Daniele, Mohammadreza Mostajabi, Steven Basart, Matthew R. Walter, Gregory Shakhnarovich

We introduce DIODE, a dataset that contains thousands of diverse high resolution color images with accurate, dense, long-range depth measurements.

Cannot find the paper you are looking for? You can Submit a new open access paper.