Search Results for author: Mengtian Li

Found 20 papers, 7 papers with code

GaussianBody: Clothed Human Reconstruction via 3d Gaussian Splatting

no code implementations • 18 Jan 2024 • Mengtian Li, Shengxiang Yao, Zhifeng Xie, Keyu Chen

In this work, we propose a novel clothed human reconstruction method called GaussianBody, based on 3D Gaussian Splatting.

Paper
Add Code

Hierarchical Fashion Design with Multi-stage Diffusion Models

no code implementations • 15 Jan 2024 • Zhifeng Xie, Hao Li, Huiming Ding, Mengtian Li, Ying Cao

Cross-modal fashion synthesis and editing offer intelligent support to fashion designers by enabling the automatic generation and local modification of design drafts. While current diffusion models demonstrate commendable stability and controllability in image synthesis, they still face significant challenges in generating fashion design from abstract design elements and fine-grained editing. Abstract sensory expressions, \eg office, business, and party, form the high-level design concepts, while measurable aspects like sleeve length, collar type, and pant length are considered the low-level attributes of clothing. Controlling and editing fashion images using lengthy text descriptions poses a difficulty. In this paper, we propose HieraFashDiff, a novel fashion design method using the shared multi-stage diffusion model encompassing high-level design concepts and low-level clothing attributes in a hierarchical structure. Specifically, we categorized the input text into different levels and fed them in different time step to the diffusion model according to the criteria of professional clothing designers. HieraFashDiff allows designers to add low-level attributes after high-level prompts for interactive editing incrementally. In addition, we design a differentiable loss function in the sampling process with a mask to keep non-edit areas. Comprehensive experiments performed on our newly conducted Hierarchical fashion dataset, demonstrate that our proposed method outperforms other state-of-the-art competitors.

Fashion Synthesis Image Generation

Paper
Add Code

Class-Imbalanced Semi-Supervised Learning for Large-Scale Point Cloud Semantic Segmentation via Decoupling Optimization

no code implementations • 13 Jan 2024 • Mengtian Li, Shaohui Lin, Zihan Wang, Yunhang Shen, Baochang Zhang, Lizhuang Ma

Semi-supervised learning (SSL), thanks to the significant reduction of data annotation costs, has been an active research topic for large-scale 3D scene understanding.

Pseudo Label Representation Learning +2

Paper
Add Code

Streaming Motion Forecasting for Autonomous Driving

1 code implementation • 2 Oct 2023 • Ziqi Pang, Deva Ramanan, Mengtian Li, Yu-Xiong Wang

Our benchmark inherently captures the disappearance and re-appearance of agents, presenting the emergent challenge of forecasting for occluded agents, which is a safety-critical problem yet overlooked by snapshot-based benchmarks.

Autonomous Navigation Motion Forecasting +1

Paper
Code

Learning Lightweight Object Detectors via Multi-Teacher Progressive Distillation

no code implementations • 17 Aug 2023 • Shengcao Cao, Mengtian Li, James Hays, Deva Ramanan, Yi-Xiong Wang, Liang-Yan Gui

To distill knowledge from a highly accurate but complex teacher model, we construct a sequence of teachers to help the student gradually adapt.

Edge-computing Instance Segmentation +5

Paper
Add Code

An Empirical Analysis of Range for 3D Object Detection

no code implementations • 8 Aug 2023 • Neehar Peri, Mengtian Li, Benjamin Wilson, Yu-Xiong Wang, James Hays, Deva Ramanan

LiDAR-based 3D detection plays a vital role in autonomous navigation.

3D Object Detection Autonomous Navigation +3

Paper
Add Code

Multi-Modal Face Stylization with a Generative Prior

no code implementations • 29 May 2023 • Mengtian Li, Yi Dong, Minxuan Lin, Haibin Huang, Pengfei Wan, Chongyang Ma

We also introduce a two-stage training strategy, where we train the encoder in the first stage to align the feature maps with StyleGAN and enable a faithful reconstruction of input faces.

Face Generation

Paper
Add Code

Learning to Zoom and Unzoom

no code implementations • CVPR 2023 • Chittesh Thavamani, Mengtian Li, Francesco Ferroni, Deva Ramanan

In this work (LZU), we "learn to zoom" in on the input image, compute spatial features, and then "unzoom" to revert any deformations.

Autonomous Navigation Monocular 3D Object Detection +3

Paper
Add Code

Far3Det: Towards Far-Field 3D Detection

no code implementations • 25 Nov 2022 • Shubham Gupta, Jeet Kanjani, Mengtian Li, Francesco Ferroni, James Hays, Deva Ramanan, Shu Kong

We focus on the task of far-field 3D detection (Far3Det) of objects beyond a certain distance from an observer, e. g., $>$50m.

Autonomous Vehicles Philosophy

Paper
Add Code

Forecasting from LiDAR via Future Object Detection

1 code implementation • CVPR 2022 • Neehar Peri, Jonathon Luiten, Mengtian Li, Aljoša Ošep, Laura Leal-Taixé, Deva Ramanan

Object detection and forecasting are fundamental components of embodied perception.

Motion Forecasting Object +2

114

Paper
Code

HybridCR: Weakly-Supervised 3D Point Cloud Semantic Segmentation via Hybrid Contrastive Regularization

1 code implementation • CVPR 2022 • Mengtian Li, Yuan Xie, Yunhang Shen, Bo Ke, Ruizhi Qiao, Bo Ren, Shaohui Lin, Lizhuang Ma

To address the huge labeling cost in large-scale point cloud semantic segmentation, we propose a novel hybrid contrastive regularization (HybridCR) framework in weakly-supervised setting, which obtains competitive performance compared to its fully-supervised counterpart.

Semantic Segmentation Semantic Similarity +1

Paper
Code

Implicit Neural Deformation for Sparse-View Face Reconstruction

no code implementations • 5 Dec 2021 • Moran Li, Haibin Huang, Yi Zheng, Mengtian Li, Nong Sang, Chongyang Ma

In this work, we present a new method for 3D face reconstruction from sparse-view RGB images.

3D Face Reconstruction

Paper
Add Code

FOVEA: Foveated Image Magnification for Autonomous Navigation

1 code implementation • ICCV 2021 • Chittesh Thavamani, Mengtian Li, Nicolas Cebron, Deva Ramanan

Efficient processing of high-res video streams is safety-critical for many robotics applications such as autonomous driving.

Autonomous Driving Autonomous Navigation +3

Paper
Code

Neighborhood-Aware Neural Architecture Search

no code implementations • 13 May 2021 • Xiaofang Wang, Shengcao Cao, Mengtian Li, Kris M. Kitani

To facilitate the application to gradient-based algorithms, we also propose a differentiable representation for the neighborhood of architectures.

Neural Architecture Search

Paper
Add Code

Towards Streaming Perception

1 code implementation • ECCV 2020 • Mengtian Li, Yu-Xiong Wang, Deva Ramanan

While past work has studied the algorithmic trade-off between latency and accuracy, there has not been a clear metric to compare different methods along the Pareto optimal latency-accuracy curve.

Ranked #2 on Real-Time Object Detection on Argoverse-HD (Detection-Only, Val) (using extra training data)

Instance Segmentation Motion Forecasting +5

Paper
Code

Budgeted Training: Rethinking Deep Neural Network Training Under Resource Constraints

1 code implementation • ICLR 2020 • Mengtian Li, Ersin Yumer, Deva Ramanan

We also revisit existing approaches for fast convergence and show that budget-aware learning schedules readily outperform such approaches under (the practical but under-explored) budgeted training setting.

General Classification Image Classification +6

127

Paper
Code

Photo-Sketching: Inferring Contour Drawings from Images

3 code implementations • 2 Jan 2019 • Mengtian Li, Zhe Lin, Radomir Mech, Ersin Yumer, Deva Ramanan

Edges, boundaries and contours are important subjects of study in both computer graphics and computer vision.

Boundary Detection

Paper
Code

Brute-Force Facial Landmark Analysis With A 140,000-Way Classifier

no code implementations • 6 Feb 2018 • Mengtian Li, Laszlo Jeni, Deva Ramanan

While most prior work treats this as a regression problem, we instead formulate it as a discrete $K$-way classification task, where a classifier is trained to return one of $K$ discrete alignments.

General Classification regression

Paper
Add Code

Guaranteed Parameter Estimation for Discrete Energy Minimization

no code implementations • 11 Jan 2017 • Mengtian Li, Daniel Huber

In this work, we propose a method to overcome this limitation through exploiting the properties of the joint problem of training time inference and learning.

Scene Parsing

Paper
Add Code

Complexity of Discrete Energy Minimization Problems

no code implementations • 29 Jul 2016 • Mengtian Li, Alexander Shekhovtsov, Daniel Huber

Specifically, we show that general energy minimization, even in the 2-label pairwise case, and planar energy minimization with three or more labels are exp-APX-complete.

Paper
Add Code

Cannot find the paper you are looking for? You can Submit a new open access paper.