Search Results for author: Chenhang He

Found 15 papers, 12 papers with code

VoxGenesis: Unsupervised Discovery of Latent Speaker Manifold for Speech Synthesis

no code implementations • 1 Mar 2024 • Weiwei Lin, Chenhang He, Man-Wai Mak, Jiachen Lian, Kong Aik Lee

This forces the model to learn a speaker distribution disentangled from the semantic content.

Paper
Add Code

ScatterFormer: Efficient Voxel Transformer with Scattered Linear Attention

1 code implementation • 1 Jan 2024 • Chenhang He, Ruihuang Li, Guowen Zhang, Lei Zhang

Window-based transformers have demonstrated strong ability in large-scale point cloud understanding by capturing context-aware representations with affordable attention computation in a more localized manner.

Blocking

Paper
Code

Motion-Guided Latent Diffusion for Temporally Consistent Real-world Video Super-resolution

1 code implementation • 1 Dec 2023 • Xi Yang, Chenhang He, jianqi ma, Lei Zhang

To ensure the content consistency among adjacent frames, we exploit the temporal dynamics in LR videos to guide the diffusion process by optimizing the latent sampling path with a motion-guided loss, ensuring that the generated HR video maintains a coherent and continuous visual flow.

Image Restoration Video Super-Resolution

Paper
Code

Self-supervised Neural Factor Analysis for Disentangling Utterance-level Speech Representations

no code implementations • 14 May 2023 • Weiwei Lin, Chenhang He, Man-Wai Mak, Youzhi Tu

Self-supervised learning (SSL) speech models such as wav2vec and HuBERT have demonstrated state-of-the-art performance on automatic speech recognition (ASR) and proved to be extremely useful in low label-resource settings.

Automatic Speech Recognition Automatic Speech Recognition (ASR) +2

Paper
Add Code

One-to-Few Label Assignment for End-to-End Dense Detection

1 code implementation • CVPR 2023 • Shuai Li, Minghan Li, Ruihuang Li, Chenhang He, Lei Zhang

The positive and negative weights of these soft anchors are dynamically adjusted during training so that they can contribute more to ``representation learning'' in the early training stage, and contribute more to ``duplicated prediction removal'' in the later stage.

Representation Learning

Paper
Code

MSF: Motion-guided Sequential Fusion for Efficient 3D Object Detection from Point Cloud Sequences

1 code implementation • CVPR 2023 • Chenhang He, Ruihuang Li, Yabin Zhang, Shuai Li, Lei Zhang

Current top-performing multi-frame detectors mostly follow a Detect-and-Fuse framework, which extracts features from each frame of the sequence and fuses them to detect the objects in the current frame.

3D Object Detection Autonomous Driving +1

Paper
Code

SIM: Semantic-aware Instance Mask Generation for Box-Supervised Instance Segmentation

1 code implementation • CVPR 2023 • Ruihuang Li, Chenhang He, Yabin Zhang, Shuai Li, Liyi Chen, Lei Zhang

Weakly supervised instance segmentation using only bounding box annotations has recently attracted much research attention.

Box-supervised Instance Segmentation Segmentation +2

Paper
Code

DynaMask: Dynamic Mask Selection for Instance Segmentation

1 code implementation • CVPR 2023 • Ruihuang Li, Chenhang He, Shuai Li, Yabin Zhang, Lei Zhang

The representative instance segmentation methods mostly segment different object instances with a mask of the fixed resolution, e. g., 28*28 grid.

Instance Segmentation Segmentation +1

Paper
Code

Masked Surfel Prediction for Self-Supervised Point Cloud Learning

1 code implementation • 7 Jul 2022 • Yabin Zhang, Jiehong Lin, Chenhang He, Yongwei Chen, Kui Jia, Lei Zhang

In this work, we make the first attempt, to the best of our knowledge, to consider the local geometry information explicitly into the masked auto-encoding, and propose a novel Masked Surfel Prediction (MaskSurf) method.

Point cloud reconstruction Self-Supervised Learning

Paper
Code

Voxel Set Transformer: A Set-to-Set Approach to 3D Object Detection from Point Clouds

1 code implementation • CVPR 2022 • Chenhang He, Ruihuang Li, Shuai Li, Lei Zhang

VoxSeT is built upon a voxel-based set attention (VSA) module, which reduces the self-attention in each voxel by two cross-attentions and models features in a hidden space induced by a group of latent codes.

3D Object Detection object-detection

184

Paper
Code

A Dual Weighting Label Assignment Scheme for Object Detection

1 code implementation • CVPR 2022 • Shuai Li, Chenhang He, Ruihuang Li, Lei Zhang

Existing LA methods mostly focus on the design of pos weighting function, while the neg weight is directly derived from the pos weight.

Object object-detection +2

136

Paper
Code

Class-Balanced Pixel-Level Self-Labeling for Domain Adaptive Semantic Segmentation

1 code implementation • CVPR 2022 • Ruihuang Li, Shuai Li, Chenhang He, Yabin Zhang, Xu Jia, Lei Zhang

One popular solution to this challenging task is self-training, which selects high-scoring predictions on target samples as pseudo labels for training.

Ranked #9 on Image-to-Image Translation on SYNTHIA-to-Cityscapes

Segmentation Semantic Segmentation +1

Paper
Code

Aug3D-RPN: Improving Monocular 3D Object Detection by Synthetic Images with Virtual Depth

no code implementations • 28 Jul 2021 • Chenhang He, Jianqiang Huang, Xian-Sheng Hua, Lei Zhang

Current geometry-based monocular 3D object detection models can efficiently detect objects by leveraging perspective geometry, but their performance is limited due to the absence of accurate depth information.

Depth Estimation Monocular 3D Object Detection +1

Paper
Add Code

Structure Aware Single-Stage 3D Object Detection From Point Cloud

1 code implementation • CVPR 2020 • Chenhang He, Hui Zeng, Jianqiang Huang, Xian-Sheng Hua, Lei Zhang

The auxiliary network is jointly optimized, by two point-level supervisions, to guide the convolutional features in the backbone network to be aware of the object structure.

3D Object Detection Autonomous Driving +1

486

Paper
Code

VerSe: A Vertebrae Labelling and Segmentation Benchmark for Multi-detector CT Images

2 code implementations • 24 Jan 2020 • Anjany Sekuboyina, Malek E. Husseini, Amirhossein Bayat, Maximilian Löffler, Hans Liebl, Hongwei Li, Giles Tetteh, Jan Kukačka, Christian Payer, Darko Štern, Martin Urschler, Maodong Chen, Dalong Cheng, Nikolas Lessmann, Yujin Hu, Tianfu Wang, Dong Yang, Daguang Xu, Felix Ambellan, Tamaz Amiranashvili, Moritz Ehlke, Hans Lamecker, Sebastian Lehnert, Marilia Lirio, Nicolás Pérez de Olaguer, Heiko Ramm, Manish Sahu, Alexander Tack, Stefan Zachow, Tao Jiang, Xinjun Ma, Christoph Angerman, Xin Wang, Kevin Brown, Alexandre Kirszenberg, Élodie Puybareau, Di Chen, Yiwei Bai, Brandon H. Rapazzo, Timyoas Yeah, Amber Zhang, Shangliang Xu, Feng Hou, Zhiqiang He, Chan Zeng, Zheng Xiangshang, Xu Liming, Tucker J. Netherton, Raymond P. Mumme, Laurence E. Court, Zixun Huang, Chenhang He, Li-Wen Wang, Sai Ho Ling, Lê Duy Huynh, Nicolas Boutry, Roman Jakubicek, Jiri Chmelik, Supriti Mulay, Mohanasankar Sivaprakasam, Johannes C. Paetzold, Suprosanna Shit, Ivan Ezhov, Benedikt Wiestler, Ben Glocker, Alexander Valentinitsch, Markus Rempfler, Björn H. Menze, Jan S. Kirschke

Two datasets containing a total of 374 multi-detector CT scans from 355 patients were prepared and 4505 vertebrae have individually been annotated at voxel-level by a human-machine hybrid algorithm (https://osf. io/nqjyw/, https://osf. io/t98fz/).

Anatomy Segmentation

187

Paper
Code

Cannot find the paper you are looking for? You can Submit a new open access paper.