Search Results for author: Michael Bi Mi

Found 18 papers, 11 papers with code

DepGraph: Towards Any Structural Pruning

1 code implementation • CVPR 2023 • Gongfan Fang, Xinyin Ma, Mingli Song, Michael Bi Mi, Xinchao Wang

Structural pruning enables model acceleration by removing structurally-grouped parameters from neural networks.

Network Pruning Neural Network Compression

2,283

ONCE-3DLanes: Building Monocular 3D Lane Detection

2 code implementations • CVPR 2022 • Fan Yan, Ming Nie, Xinyue Cai, Jianhua Han, Hang Xu, Zhen Yang, Chaoqiang Ye, Yanwei Fu, Michael Bi Mi, Li Zhang

We present ONCE-3DLanes, a real-world autonomous driving dataset with lane layout annotation in 3D space.

3D Lane Detection Autonomous Driving

391

PoseTriplet: Co-evolving 3D Human Pose Estimation, Imitation, and Hallucination under Self-supervision

1 code implementation • CVPR 2022 • Kehong Gong, Bingbing Li, Jianfeng Zhang, Tao Wang, Jing Huang, Michael Bi Mi, Jiashi Feng, Xinchao Wang

Existing self-supervised 3D human pose estimation schemes have largely relied on weak supervisions like consistency loss to guide the learning, which, inevitably, leads to inferior results in real-world scenarios with unseen poses.

Ranked #37 on 3D Human Pose Estimation on MPI-INF-3DHP

3D Human Pose Estimation Hallucination

304

Enhancing Video Super-Resolution via Implicit Resampling-based Alignment

1 code implementation • arXiv 2024 • Kai Xu, Ziwei Yu, Xin Wang, Michael Bi Mi, Angela Yao

We show that bilinear interpolation inherently attenuates high-frequency information while an MLP-based coordinate network can approximate more frequencies.

Ranked #1 on Video Super-Resolution on Vid4 - 4x upscaling

Video Super-Resolution

Point2Seq: Detecting 3D Objects as Sequences

1 code implementation • CVPR 2022 • Yujing Xue, Jiageng Mao, Minzhe Niu, Hang Xu, Michael Bi Mi, Wei Zhang, Xiaogang Wang, Xinchao Wang

We further propose a lightweight scene-to-sequence decoder that can auto-regressively generate words conditioned on features from a 3D scene as well as cues from the preceding words.

3D Object Detection Object +1

TM2D: Bimodality Driven 3D Dance Generation via Music-Text Integration

1 code implementation • ICCV 2023 • Kehong Gong, Dongze Lian, Heng Chang, Chuan Guo, Zihang Jiang, Xinxin Zuo, Michael Bi Mi, Xinchao Wang

We propose a novel task for generating 3D dance movements that simultaneously incorporate both text and music modalities.

Motion Prediction Motion Synthesis

Improving Deep Regression with Ordinal Entropy

1 code implementation • 21 Jan 2023 • Shihao Zhang, Linlin Yang, Michael Bi Mi, Xiaoxu Zheng, Angela Yao

In computer vision, it is often observed that formulating regression problems as classification tasks yields better performance.

Ranked #16 on Crowd Counting on ShanghaiTech B

Classification Crowd Counting +2

Lens-to-lens bokeh effect transformation. NTIRE 2023 challenge report

1 code implementation • CVPRW 2023 • Marcos V. Conde, Manuel Kolmet, Tim Seizinger, Tom E. Bishop, Radu Timofte, Xiangyu Kong, Dafeng Zhang, Jinlong Wu, Fan Wang, Juewen Peng, Zhiyu Pan, Chengxin Liu, Xianrui Luo, Huiqiang Sun, Liao Shen, Zhiguo Cao, Ke Xian, Chaowei Liu, Zigeng Chen, Xingyi Yang, Songhua Liu, Yongcheng Jing, Michael Bi Mi, Xinchao Wang, Zhihao Yang, Wenyi Lian, Siyuan Lai, Haichuan Zhang, Trung Hoang, Amirsaeed Yazdani, Vishal Monga, Ziwei Luo, Fredrik K. Gustafsson, Zheng Zhao, Jens Sjölund, Thomas B. Schön, Yuxuan Zhao, Baoliang Chen, Yiqing Xu, JiXiang Niu

We present the new Bokeh Effect Transformation Dataset (BETD), and review the proposed solutions for this novel task at the NTIRE 2023 Bokeh Effect Transformation Challenge.

Bokeh Effect Rendering

MotionMix: Weakly-Supervised Diffusion for Controllable Motion Generation

1 code implementation • 20 Jan 2024 • Nhat M. Hoang, Kehong Gong, Chuan Guo, Michael Bi Mi

Specifically, we separate the denoising objectives of a diffusion model into two stages: obtaining conditional rough motion approximations in the initial $T-T^*$ steps by learning the noisy annotated motions, followed by the unconditional refinement of these preliminary motions during the last $T^*$ steps using unannotated motions.

Denoising

Object Detection in Foggy Scenes by Embedding Depth and Reconstruction into Domain Adaptation

1 code implementation • 24 Nov 2022 • Xin Yang, Michael Bi Mi, Yuan Yuan, Xin Wang, Robby T. Tan

In our DA framework, we retain the depth and background information during the domain feature alignment.

Domain Adaptation Object +2

PARTNER: Level up the Polar Representation for LiDAR 3D Object Detection

1 code implementation • ICCV 2023 • Ming Nie, Yujing Xue, Chunwei Wang, Chaoqiang Ye, Hang Xu, Xinge Zhu, Qingqiu Huang, Michael Bi Mi, Xinchao Wang, Li Zhang

Recently, polar-based representation has shown promising properties in perceptual tasks.

3D Object Detection Object Detection

Bias-Compensated Integral Regression for Human Pose Estimation

no code implementations • 25 Jan 2023 • Kerui Gu, Linlin Yang, Michael Bi Mi, Angela Yao

Experimental results on both the human body and hand benchmarks show that BCIR is faster to train and more accurate than the original integral regression, making it competitive with state-of-the-art detection methods.

Hand Pose Estimation Regression

Overcoming the Trade-off Between Accuracy and Plausibility in 3D Hand Shape Reconstruction

no code implementations • CVPR 2023 • Ziwei Yu, Chen Li, Linlin Yang, Xiaoxu Zheng, Michael Bi Mi, Gim Hee Lee, Angela Yao

However, the reconstructed meshes are prone to artifacts and do not appear as plausible hand shapes.

Priority-Centric Human Motion Generation in Discrete Latent Space

no code implementations • ICCV 2023 • Hanyang Kong, Kehong Gong, Dongze Lian, Michael Bi Mi, Xinchao Wang

We also present a motion discrete diffusion model that employs an innovative noise schedule, determined by the significance of each motion token within the entire motion sequence.

Learning Unorthogonalized Matrices for Rotation Estimation

no code implementations • 1 Dec 2023 • Kerui Gu, Zhihao Li, Shiyong Liu, Jianzhuang Liu, Songcen Xu, Youliang Yan, Michael Bi Mi, Kenji Kawaguchi, Angela Yao

Estimating 3D rotations is a common procedure for 3D computer vision.

Pose Estimation

DreamDrone

no code implementations • 14 Dec 2023 • Hanyang Kong, Dongze Lian, Michael Bi Mi, Xinchao Wang

We introduce DreamDrone, an innovative method for generating unbounded flythrough scenes from textual prompts.

Perpetual View Generation Scene Generation

HEAP: Unsupervised Object Discovery and Localization with Contrastive Grouping

no code implementations • 29 Dec 2023 • Xin Zhang, Jinheng Xie, Yuan Yuan, Michael Bi Mi, Robby T. Tan

Further, to ensure the distinguishability among various regions, we introduce a region-level contrastive clustering loss to pull closer similar regions across images.

Object Object Discovery +2

Semantic Segmentation in Multiple Adverse Weather Conditions with Domain Knowledge Retention

no code implementations • 15 Jan 2024 • Xin Yang, Wending Yan, Yuan Yuan, Michael Bi Mi, Robby T. Tan

They struggle to acquire new knowledge while also retaining previously learned knowledge. To address these problems, we propose a semantic segmentation method for multiple adverse weather conditions that incorporates adaptive knowledge acquisition, pseudolabel blending, and weather composition replay.

Multi-target Domain Adaptation Semantic Segmentation +1
