Search Results for author: Daiheng Gao

Found 11 papers, 6 papers with code

MES-RAG: Bringing Multi-modal, Entity-Storage, and Secure Enhancements to RAG

1 code implementation17 Mar 2025 Pingyu Wu, Daiheng Gao, Jing Tang, Huimin Chen, Wenbo Zhou, Weiming Zhang, Nenghai Yu

Retrieval-Augmented Generation (RAG) improves Large Language Models (LLMs) by using external knowledge, but it struggles with precise entity information retrieval.

Information Retrieval Question Answering +2

EraseAnything: Enabling Concept Erasure in Rectified Flow Transformers

1 code implementation29 Dec 2024 Daiheng Gao, Shilin Lu, Shaw Walters, Wenbo Zhou, Jiaming Chu, Jie Zhang, Bang Zhang, Mengxi Jia, Jian Zhao, Zhaoxin Fan, Weiming Zhang

Removing unwanted concepts from large-scale text-to-image (T2I) diffusion models while maintaining their overall generative quality remains an open challenge.

Contrastive Learning

OutfitAnyone: Ultra-high Quality Virtual Try-On for Any Clothing and Any Person

no code implementations23 Jul 2024 Ke Sun, Jian Cao, Qi Wang, Linrui Tian, Xindi Zhang, Lian Zhuo, Bang Zhang, Liefeng Bo, Wenbo Zhou, Weiming Zhang, Daiheng Gao

Specifically, these models struggle to maintain a balance between control and consistency when generating images for virtual clothing trials.

Virtual Try-on

MaTe3D: Mask-guided Text-based 3D-aware Portrait Editing

1 code implementation12 Dec 2023 Kangneng Zhou, Daiheng Gao, Xuan Wang, Jie Zhang, Peng Zhang, Xusen Sun, Longhao Zhang, Shiqi Yang, Bang Zhang, Liefeng Bo, Yaxing Wang, Ming-Ming Cheng

This enhances masked-based editing in local areas; second, we present a novel distillation strategy: Conditional Distillation on Geometry and Texture (CDGT).

VividTalk: One-Shot Audio-Driven Talking Head Generation Based on 3D Hybrid Prior

no code implementations4 Dec 2023 Xusen Sun, Longhao Zhang, Hao Zhu, Peng Zhang, Bang Zhang, Xinya Ji, Kangneng Zhou, Daiheng Gao, Liefeng Bo, Xun Cao

Audio-driven talking head generation has drawn much attention in recent years, and many efforts have been made in lip-sync, expressive facial expressions, natural head pose generation, and high video quality.

Talking Head Generation

HAVE-FUN: Human Avatar Reconstruction from Few-Shot Unconstrained Images

no code implementations CVPR 2024 Xihe Yang, Xingyu Chen, Daiheng Gao, Shaohui Wang, Xiaoguang Han, Baoyuan Wang

As for human avatar reconstruction, contemporary techniques commonly necessitate the acquisition of costly data and struggle to achieve satisfactory results from a small number of casual images.

Cloth2Tex: A Customized Cloth Texture Generation Pipeline for 3D Virtual Try-On

no code implementations8 Aug 2023 Daiheng Gao, Xu Chen, Xindi Zhang, Qi Wang, Ke Sun, Bang Zhang, Liefeng Bo, QiXing Huang

Since traditional warping-based texture generation methods require a significant number of control points to be manually selected for each type of garment, which can be a time-consuming and tedious process.

Texture Synthesis Virtual Try-on

DART: Articulated Hand Model with Diverse Accessories and Rich Textures

1 code implementation14 Oct 2022 Daiheng Gao, Yuliang Xiu, Kailin Li, Lixin Yang, Feng Wang, Peng Zhang, Bang Zhang, Cewu Lu, Ping Tan

Unity GUI is also provided to generate synthetic hand data with user-defined settings, e. g., pose, camera, background, lighting, textures, and accessories.

Diversity Hand Pose Estimation +1

Multi-View Consistent Generative Adversarial Networks for 3D-aware Image Synthesis

1 code implementation CVPR 2022 Xuanmeng Zhang, Zhedong Zheng, Daiheng Gao, Bang Zhang, Pan Pan, Yi Yang

To address this challenge, we propose Multi-View Consistent Generative Adversarial Networks (MVCGAN) for high-quality 3D-aware image synthesis with geometry constraints.

3D-Aware Image Synthesis 3D geometry

A SPIKING SEQUENTIAL MODEL: RECURRENT LEAKY INTEGRATE-AND-FIRE

no code implementations25 Sep 2019 Daiheng Gao, Hongwei Wang, Hehui Zhang, Meng Wang, Zhenzhi Wu

Stemming from neuroscience, Spiking neural networks (SNNs), a brain-inspired neural network that is a versatile solution to fault-tolerant and energy efficient information processing pertains to the ”event-driven” characteristic as the analogy of the behavior of biological neurons.

Text Summarization Video Understanding

Cannot find the paper you are looking for? You can Submit a new open access paper.