Search Results for author: Luoqi Liu

Found 26 papers, 4 papers with code

SPDiffusion: Semantic Protection Diffusion for Multi-concept Text-to-image Generation

no code implementations2 Sep 2024 Yang Zhang, Rui Zhang, Xuecheng Nie, Haochen Li, Jikun Chen, Yifan Hao, Xin Zhang, Luoqi Liu, Ling Li

We found that attribute confusion occurs when a certain region of the latent features attend to multiple or incorrect prompt tokens.

Attribute Text-to-Image Generation

SAM-REF: Rethinking Image-Prompt Synergy for Refinement in Segment Anything

no code implementations21 Aug 2024 Chongkai Yu, Anqi Li, Xiaochao Qu, Luoqi Liu, Ting Liu

Experimentally, we demonstrated the high effectiveness and efficiency of our method in tackling complex cases with multiple interactions.

Interactive Segmentation

Rethinking Video Segmentation with Masked Video Consistency: Did the Model Learn as Intended?

no code implementations20 Aug 2024 Chen Liang, Qiang Guo, Xiaochao Qu, Luoqi Liu, Ting Liu

Video segmentation aims at partitioning video sequences into meaningful segments based on objects or regions of interest within frames.

Image Segmentation Segmentation +3

2nd Place Solution for MOSE Track in CVPR 2024 PVUW workshop: Complex Video Object Segmentation

no code implementations12 Jun 2024 Zhensong Xu, Jiangtao Yao, Chengjing Wu, Ting Liu, Luoqi Liu

Our method ranked 2nd in the MOSE track of PVUW 2024, with a $\mathcal{J}$ of 0. 8007, a $\mathcal{F}$ of 0. 8683 and a $\mathcal{J}$\&$\mathcal{F}$ of 0. 8345.

Instance Segmentation Semantic Segmentation +4

3rd Place Solution for PVUW Challenge 2024: Video Panoptic Segmentation

no code implementations6 Jun 2024 Ruipu Wu, Jifei Che, Han Li, Chengjing Wu, Ting Liu, Luoqi Liu

Video panoptic segmentation is an advanced task that extends panoptic segmentation by applying its concept to video sequences.

Segmentation Video Panoptic Segmentation +1

Customize your NeRF: Adaptive Source Driven 3D Scene Editing via Local-Global Iterative Training

no code implementations CVPR 2024 Runze He, Shaofei Huang, Xuecheng Nie, Tianrui Hui, Luoqi Liu, Jiao Dai, Jizhong Han, Guanbin Li, Si Liu

In this paper, we target the adaptive source driven 3D scene editing task by proposing a CustomNeRF model that unifies a text description or a reference image as the editing prompt.

3D scene Editing

DropKey for Vision Transformer

no code implementations CVPR 2023 Bonan Li, Yinhan Hu, Xuecheng Nie, Congying Han, Xiangjian Jiang, Tiande Guo, Luoqi Liu

Given exploration on the above three questions, we present the novel DropKey method that regards Key as the drop unit and exploits decreasing schedule for drop ratio, improving ViTs in a general way.

Human-Object Interaction Detection Image Classification +2

Multi-view Human Body Mesh Translator

no code implementations4 Oct 2022 Xiangjian Jiang, Xuecheng Nie, Zitian Wang, Luoqi Liu, Si Liu

Existing methods for human mesh recovery mainly focus on single-view frameworks, but they often fail to produce accurate results due to the ill-posed setup.

Human Mesh Recovery

DropKey

no code implementations4 Aug 2022 Bonan Li, Yinhan Hu, Xuecheng Nie, Congying Han, Xiangjian Jiang, Tiande Guo, Luoqi Liu

Given exploration on the above three questions, we present the novel DropKey method that regards Key as the drop unit and exploits decreasing schedule for drop ratio, improving ViTs in a general way.

Human-Object Interaction Detection Image Classification +2

Referring Image Segmentation via Cross-Modal Progressive Comprehension

1 code implementation CVPR 2020 Shaofei Huang, Tianrui Hui, Si Liu, Guanbin Li, Yunchao Wei, Jizhong Han, Luoqi Liu, Bo Li

In addition to the CMPC module, we further leverage a simple yet effective TGFE module to integrate the reasoned multimodal features from different levels with the guidance of textual information.

Attribute Image Segmentation +2

Recurrent 3D-2D Dual Learning for Large-Pose Facial Landmark Detection

no code implementations ICCV 2017 Shengtao Xiao, Jiashi Feng, Luoqi Liu, Xuecheng Nie, Wei Wang, Shuicheng Yan, Ashraf Kassim

To address these challenging issues, we introduce a novel recurrent 3D-2D dual learning model that alternatively performs 2D-based 3D face model refinement and 3D-to-2D projection based 2D landmark refinement to reliably reason about self-occluded landmarks, precisely capture the subtle landmark displacement and accurately detect landmarks even in presence of extremely large poses.

Face Model Facial Landmark Detection

Smart Mirror: Intelligent Makeup Recommendation and Synthesis

no code implementations22 Sep 2017 Tam V. Nguyen, Luoqi Liu

The female facial image beautification usually requires professional editing softwares, which are relatively difficult for common users.

Salient Object Detection with Semantic Priors

no code implementations23 May 2017 Tam V. Nguyen, Luoqi Liu

Salient object detection has increasingly become a popular topic in cognitive and computational sciences, including computer vision and artificial intelligence research.

Object object-detection +3

Video Scene Parsing with Predictive Feature Learning

no code implementations ICCV 2017 Xiaojie Jin, Xin Li, Huaxin Xiao, Xiaohui Shen, Zhe Lin, Jimei Yang, Yunpeng Chen, Jian Dong, Luoqi Liu, Zequn Jie, Jiashi Feng, Shuicheng Yan

In this way, the network can effectively learn to capture video dynamics and temporal context, which are critical clues for video scene parsing, without requiring extra manual annotations.

Representation Learning Scene Parsing

Online Feature Selection with Group Structure Analysis

no code implementations21 Aug 2016 Jing Wang, Meng Wang, Pei-Pei Li, Luoqi Liu, Zhong-Qiu Zhao, Xuegang Hu, Xindong Wu

The problem assumes that features are generated individually but there are group structure in the feature stream.

Face Verification feature selection +1

Peak-Piloted Deep Network for Facial Expression Recognition

no code implementations24 Jul 2016 Xiangyun Zhao, Xiaodan Liang, Luoqi Liu, Teng Li, Yugang Han, Nuno Vasconcelos, Shuicheng Yan

Objective functions for training of deep networks for face-related recognition tasks, such as facial expression recognition (FER), usually consider each sample independently.

Face Recognition Facial Expression Recognition +2

Personalized Age Progression with Aging Dictionary

no code implementations ICCV 2015 Xiangbo Shu, Jinhui Tang, Hanjiang Lai, Luoqi Liu, Shuicheng Yan

Second, it is challenging or even impossible to collect faces of all age groups for a particular subject, yet much easier and more practical to get face pairs from neighboring age groups.

Dictionary Learning Face Verification

Matching-CNN Meets KNN: Quasi-Parametric Human Parsing

no code implementations CVPR 2015 Si Liu, Xiaodan Liang, Luoqi Liu, Xiaohui Shen, Jianchao Yang, Changsheng Xu, Liang Lin, Xiaochun Cao, Shuicheng Yan

Under the classic K Nearest Neighbor (KNN)-based nonparametric framework, the parametric Matching Convolutional Neural Network (M-CNN) is proposed to predict the matching confidence and displacements of the best matched region in the testing image for a particular semantic region in one KNN image.

Human Parsing

Deep Human Parsing with Active Template Regression

1 code implementation9 Mar 2015 Xiaodan Liang, Si Liu, Xiaohui Shen, Jianchao Yang, Luoqi Liu, Jian Dong, Liang Lin, Shuicheng Yan

The first CNN network is with max-pooling, and designed to predict the template coefficients for each label mask, while the second CNN network is without max-pooling to preserve sensitivity to label mask position and accurately predict the active shape parameters.

Human Parsing Position +1

Cannot find the paper you are looking for? You can Submit a new open access paper.