Search Results for author: Huaxia Li

Found 15 papers, 9 papers with code

Chain-of-Thought Prompting Obscures Hallucination Cues in Large Language Models: An Empirical Evaluation

no code implementations20 Jun 2025 Jiahao Cheng, Tiancheng Su, Jia Yuan, Guoxiu He, Jiawei Liu, Xinqi Tao, Jingwen Xie, Huaxia Li

Building on this, we evaluate the impact of various CoT prompting methods on mainstream hallucination detection methods across both instruction-tuned and reasoning-oriented LLMs.

Dynamic Pyramid Network for Efficient Multimodal Large Language Model

no code implementations26 Mar 2025 Hao Ai, Kunyi Wang, Zezhou Wang, Hao Lu, Jin Tian, Yaxin Luo, Peng Xing, Jen-Yuan Huang, Huaxia Li, Gen Luo

To maximize the benefit of DPN, we further propose an innovative Dynamic Pooling Experts (DPE) that can dynamically choose the optimal visual compression rate according to input features.

Language Modeling Language Modelling +2

ShapefileGPT: A Multi-Agent Large Language Model Framework for Automated Shapefile Processing

no code implementations16 Oct 2024 Qingming Lin, Rui Hu, Huaxia Li, Sensen Wu, Yadong Li, Kai Fang, Hailin Feng, Zhenhong Du, Liuchang Xu

In comparison to traditional LLMs, ShapefileGPT effectively handles complex vector data analysis tasks, overcoming the limitations of traditional LLMs in spatial analysis.

Language Modeling Language Modelling +1

ControlCity: A Multimodal Diffusion Model Based Approach for Accurate Geospatial Data Generation and Urban Morphology Analysis

1 code implementation25 Sep 2024 Fangshuo Zhou, Huaxia Li, Rui Hu, Sensen Wu, Hailin Feng, Zhenhong Du, Liuchang Xu

This study confirms the effectiveness of our approach in generating urban building footprint data and capturing complex city characteristics.

StoryMaker: Towards Holistic Consistent Characters in Text-to-image Generation

1 code implementation19 Sep 2024 Zhengguang Zhou, Jing Li, Huaxia Li, Nemo Chen, Xu Tang

However, the lack of holistic consistency in scenes with multiple characters hampers these methods' ability to create a cohesive narrative.

Personalized Image Generation Text to Image Generation +1

Target-Driven Distillation: Consistency Distillation with Target Timestep Selection and Decoupled Guidance

1 code implementation2 Sep 2024 Cunzheng Wang, Ziyuan Guo, Yuxuan Duan, Huaxia Li, Nemo Chen, Xu Tang, Yao Hu

Consistency distillation methods have demonstrated significant success in accelerating generative tasks of diffusion models.

GeoFormer: Learning Point Cloud Completion with Tri-Plane Integrated Transformer

1 code implementation13 Aug 2024 Jinpeng Yu, Binbin Huang, Yuxuan Zhang, Huaxia Li, Xu Tang, Shenghua Gao

In this paper, we introduce a GeoFormer that simultaneously enhances the global geometric structure of the points and improves the local details.

Point Cloud Completion

Unified Video-Language Pre-training with Synchronized Audio

no code implementations12 May 2024 Shentong Mo, Haofan Wang, Huaxia Li, Xu Tang

Video-language pre-training is a typical and challenging problem that aims at learning visual and textual representations from large-scale data in a self-supervised way.

StableGarment: Garment-Centric Generation via Stable Diffusion

no code implementations16 Mar 2024 Rui Wang, Hailong Guo, Jiaming Liu, Huaxia Li, Haibo Zhao, Xu Tang, Yao Hu, Hao Tang, Peipei Li

In this paper, we introduce StableGarment, a unified framework to tackle garment-centric(GC) generation tasks, including GC text-to-image, controllable GC text-to-image, stylized GC text-to-image, and robust virtual try-on.

Denoising Image Generation +1

SSR-Encoder: Encoding Selective Subject Representation for Subject-Driven Generation

1 code implementation CVPR 2024 Yuxuan Zhang, Yiren Song, Jiaming Liu, Rui Wang, Jinpeng Yu, Hao Tang, Huaxia Li, Xu Tang, Yao Hu, Han Pan, Zhongliang Jing

Recent advancements in subject-driven image generation have led to zero-shot generation, yet precise selection and focus on crucial subject representations remain challenging.

Image Generation

StreamFlow: Streamlined Multi-Frame Optical Flow Estimation for Video Sequences

1 code implementation28 Nov 2023 Shangkun Sun, Jiaming Liu, Thomas H. Li, Huaxia Li, Guoqing Liu, Wei Gao

To address this issue, multi-frame optical flow methods leverage adjacent frames to mitigate the local ambiguity.

Optical Flow Estimation

STAR Loss: Reducing Semantic Ambiguity in Facial Landmark Detection

1 code implementation CVPR 2023 Zhenglin Zhou, Huaxia Li, Hong Liu, Nanyang Wang, Gang Yu, Rongrong Ji

To solve this problem, we propose a Self-adapTive Ambiguity Reduction (STAR) loss by exploiting the properties of semantic ambiguity.

Face Alignment Facial Landmark Detection

An Empirical Study of Propagation-based Methods for Video Object Segmentation

no code implementations30 Jul 2019 Hengkai Guo, Wenji Wang, Guanjun Guo, Huaxia Li, Jiachen Liu, Qian He, Xuefeng Xiao

While propagation-based approaches have achieved state-of-the-art performance for video object segmentation, the literature lacks a fair comparison of different methods using the same settings.

Object Semantic Segmentation +2

Multi-Task Recurrent Convolutional Network with Correlation Loss for Surgical Video Analysis

1 code implementation13 Jul 2019 Yueming Jin, Huaxia Li, Qi Dou, Hao Chen, Jing Qin, Chi-Wing Fu, Pheng-Ann Heng

Mutually leveraging both low-level feature sharing and high-level prediction correlating, our MTRCNet-CL method can encourage the interactions between the two tasks to a large extent, and hence can bring about benefits to each other.

Surgical phase recognition Surgical tool detection

Cannot find the paper you are looking for? You can Submit a new open access paper.