Search Results for author: Mengtian Li

Found 20 papers, 7 papers with code

GaussianBody: Clothed Human Reconstruction via 3d Gaussian Splatting

no code implementations18 Jan 2024 Mengtian Li, Shengxiang Yao, Zhifeng Xie, Keyu Chen

In this work, we propose a novel clothed human reconstruction method called GaussianBody, based on 3D Gaussian Splatting.

Hierarchical Fashion Design with Multi-stage Diffusion Models

no code implementations15 Jan 2024 Zhifeng Xie, Hao Li, Huiming Ding, Mengtian Li, Ying Cao

Cross-modal fashion synthesis and editing offer intelligent support to fashion designers by enabling the automatic generation and local modification of design drafts. While current diffusion models demonstrate commendable stability and controllability in image synthesis, they still face significant challenges in generating fashion design from abstract design elements and fine-grained editing. Abstract sensory expressions, \eg office, business, and party, form the high-level design concepts, while measurable aspects like sleeve length, collar type, and pant length are considered the low-level attributes of clothing. Controlling and editing fashion images using lengthy text descriptions poses a difficulty. In this paper, we propose HieraFashDiff, a novel fashion design method using the shared multi-stage diffusion model encompassing high-level design concepts and low-level clothing attributes in a hierarchical structure. Specifically, we categorized the input text into different levels and fed them in different time step to the diffusion model according to the criteria of professional clothing designers. HieraFashDiff allows designers to add low-level attributes after high-level prompts for interactive editing incrementally. In addition, we design a differentiable loss function in the sampling process with a mask to keep non-edit areas. Comprehensive experiments performed on our newly conducted Hierarchical fashion dataset, demonstrate that our proposed method outperforms other state-of-the-art competitors.

Fashion Synthesis Image Generation

Class-Imbalanced Semi-Supervised Learning for Large-Scale Point Cloud Semantic Segmentation via Decoupling Optimization

no code implementations13 Jan 2024 Mengtian Li, Shaohui Lin, Zihan Wang, Yunhang Shen, Baochang Zhang, Lizhuang Ma

Semi-supervised learning (SSL), thanks to the significant reduction of data annotation costs, has been an active research topic for large-scale 3D scene understanding.

Pseudo Label Representation Learning +2

Streaming Motion Forecasting for Autonomous Driving

1 code implementation2 Oct 2023 Ziqi Pang, Deva Ramanan, Mengtian Li, Yu-Xiong Wang

Our benchmark inherently captures the disappearance and re-appearance of agents, presenting the emergent challenge of forecasting for occluded agents, which is a safety-critical problem yet overlooked by snapshot-based benchmarks.

Autonomous Navigation Motion Forecasting +1

Learning Lightweight Object Detectors via Multi-Teacher Progressive Distillation

no code implementations17 Aug 2023 Shengcao Cao, Mengtian Li, James Hays, Deva Ramanan, Yi-Xiong Wang, Liang-Yan Gui

To distill knowledge from a highly accurate but complex teacher model, we construct a sequence of teachers to help the student gradually adapt.

Edge-computing Instance Segmentation +5

Multi-Modal Face Stylization with a Generative Prior

no code implementations29 May 2023 Mengtian Li, Yi Dong, Minxuan Lin, Haibin Huang, Pengfei Wan, Chongyang Ma

We also introduce a two-stage training strategy, where we train the encoder in the first stage to align the feature maps with StyleGAN and enable a faithful reconstruction of input faces.

Face Generation

Learning to Zoom and Unzoom

no code implementations CVPR 2023 Chittesh Thavamani, Mengtian Li, Francesco Ferroni, Deva Ramanan

In this work (LZU), we "learn to zoom" in on the input image, compute spatial features, and then "unzoom" to revert any deformations.

Autonomous Navigation Monocular 3D Object Detection +3

Far3Det: Towards Far-Field 3D Detection

no code implementations25 Nov 2022 Shubham Gupta, Jeet Kanjani, Mengtian Li, Francesco Ferroni, James Hays, Deva Ramanan, Shu Kong

We focus on the task of far-field 3D detection (Far3Det) of objects beyond a certain distance from an observer, e. g., $>$50m.

Autonomous Vehicles Philosophy

HybridCR: Weakly-Supervised 3D Point Cloud Semantic Segmentation via Hybrid Contrastive Regularization

1 code implementation CVPR 2022 Mengtian Li, Yuan Xie, Yunhang Shen, Bo Ke, Ruizhi Qiao, Bo Ren, Shaohui Lin, Lizhuang Ma

To address the huge labeling cost in large-scale point cloud semantic segmentation, we propose a novel hybrid contrastive regularization (HybridCR) framework in weakly-supervised setting, which obtains competitive performance compared to its fully-supervised counterpart.

Semantic Segmentation Semantic Similarity +1

Implicit Neural Deformation for Sparse-View Face Reconstruction

no code implementations5 Dec 2021 Moran Li, Haibin Huang, Yi Zheng, Mengtian Li, Nong Sang, Chongyang Ma

In this work, we present a new method for 3D face reconstruction from sparse-view RGB images.

3D Face Reconstruction

Neighborhood-Aware Neural Architecture Search

no code implementations13 May 2021 Xiaofang Wang, Shengcao Cao, Mengtian Li, Kris M. Kitani

To facilitate the application to gradient-based algorithms, we also propose a differentiable representation for the neighborhood of architectures.

Neural Architecture Search

Towards Streaming Perception

1 code implementation ECCV 2020 Mengtian Li, Yu-Xiong Wang, Deva Ramanan

While past work has studied the algorithmic trade-off between latency and accuracy, there has not been a clear metric to compare different methods along the Pareto optimal latency-accuracy curve.

Instance Segmentation Motion Forecasting +5

Budgeted Training: Rethinking Deep Neural Network Training Under Resource Constraints

1 code implementation ICLR 2020 Mengtian Li, Ersin Yumer, Deva Ramanan

We also revisit existing approaches for fast convergence and show that budget-aware learning schedules readily outperform such approaches under (the practical but under-explored) budgeted training setting.

General Classification Image Classification +6

Photo-Sketching: Inferring Contour Drawings from Images

3 code implementations2 Jan 2019 Mengtian Li, Zhe Lin, Radomir Mech, Ersin Yumer, Deva Ramanan

Edges, boundaries and contours are important subjects of study in both computer graphics and computer vision.

Boundary Detection

Brute-Force Facial Landmark Analysis With A 140,000-Way Classifier

no code implementations6 Feb 2018 Mengtian Li, Laszlo Jeni, Deva Ramanan

While most prior work treats this as a regression problem, we instead formulate it as a discrete $K$-way classification task, where a classifier is trained to return one of $K$ discrete alignments.

General Classification regression

Guaranteed Parameter Estimation for Discrete Energy Minimization

no code implementations11 Jan 2017 Mengtian Li, Daniel Huber

In this work, we propose a method to overcome this limitation through exploiting the properties of the joint problem of training time inference and learning.

Scene Parsing

Complexity of Discrete Energy Minimization Problems

no code implementations29 Jul 2016 Mengtian Li, Alexander Shekhovtsov, Daniel Huber

Specifically, we show that general energy minimization, even in the 2-label pairwise case, and planar energy minimization with three or more labels are exp-APX-complete.

Cannot find the paper you are looking for? You can Submit a new open access paper.