Search Results for author: Haoxuan Wang

Found 13 papers, 4 papers with code

UniCombine: Unified Multi-Conditional Combination with Diffusion Transformer

no code implementations12 Mar 2025 Haoxuan Wang, Jinlong Peng, Qingdong He, Hao Yang, Ying Jin, Jiafu Wu, Xiaobin Hu, Yanjie Pan, Zhenye Gan, Mingmin Chi, Bo Peng, Yabiao Wang

With the rapid development of diffusion models in image generation, the demand for more powerful and flexible controllable frameworks is increasing.

Image Generation

Understanding Zero-shot Rare Word Recognition Improvements Through LLM Integration

no code implementations22 Feb 2025 Haoxuan Wang

In this study, we investigate the integration of a large language model (LLM) with an automatic speech recognition (ASR) system, specifically focusing on enhancing rare word recognition performance.

Automatic Speech Recognition Automatic Speech Recognition (ASR) +4

ChartKG: A Knowledge-Graph-Based Representation for Chart Images

no code implementations13 Oct 2024 Zhiguang Zhou, Haoxuan Wang, Zhengqing Zhao, Fengling Zheng, Yongheng Wang, Wei Chen, Yong Wang

We present four cases to illustrate how our knowledge-graph-based representation can model the detailed visual elements and semantic relations in charts, and further demonstrate how our approach can benefit downstream applications such as semantic-aware chart retrieval and chart question answering.

Chart Question Answering Knowledge Graph Completion +4

Mamba-YOLO-World: Marrying YOLO-World with Mamba for Open-Vocabulary Detection

1 code implementation13 Sep 2024 Haoxuan Wang, Qingdong He, Jinlong Peng, Hao Yang, Mingmin Chi, Yabiao Wang

However, its performance is hindered by its neck feature fusion mechanism, which causes the quadratic complexity and the limited guided receptive fields.

Mamba Open Vocabulary Object Detection

Distilling Long-tailed Datasets

1 code implementation24 Aug 2024 Zhenghao Zhao, Haoxuan Wang, Yuzhang Shang, Kai Wang, Yan Yan

It reduces the distance between the student and the biased expert trajectories and prevents the tail class bias from being distilled to the synthetic dataset.

Dataset Distillation Efficient Neural Network

PTQ4DiT: Post-training Quantization for Diffusion Transformers

1 code implementation25 May 2024 Junyi Wu, Haoxuan Wang, Yuzhang Shang, Mubarak Shah, Yan Yan

SSC extends this approach by dynamically adjusting the balanced salience to capture the temporal variations in activation.

Image Generation Quantization

Seed Optimization with Frozen Generator for Superior Zero-shot Low-light Enhancement

no code implementations15 Feb 2024 Yuxuan Gu, Yi Jin, Ben Wang, Zhixiang Wei, Xiaoxiao Ma, Pengyang Ling, Haoxuan Wang, Huaian Chen, Enhong Chen

In this work, we observe that the generators, which are pre-trained on massive natural images, inherently hold the promising potential for superior low-light image enhancement against varying scenarios. Specifically, we embed a pre-trained generator to Retinex model to produce reflectance maps with enhanced detail and vividness, thereby recovering features degraded by low-light conditions. Taking one step further, we introduce a novel optimization strategy, which backpropagates the gradients to the input seeds rather than the parameters of the low-light enhancement model, thus intactly retaining the generative knowledge learned from natural images and achieving faster convergence speed.

Low-Light Image Enhancement

QuEST: Low-bit Diffusion Model Quantization via Efficient Selective Finetuning

1 code implementation6 Feb 2024 Haoxuan Wang, Yuzhang Shang, Zhihang Yuan, Junyi Wu, Junchi Yan, Yan Yan

We empirically verify that our approach modifies the activation distribution and provides meaningful temporal information, facilitating easier and more accurate quantization.

Image Generation Model Compression +1

Learning Gradient Fields for Scalable and Generalizable Irregular Packing

no code implementations18 Oct 2023 Tianyang Xue, Mingdong Wu, Lin Lu, Haoxuan Wang, Hao Dong, Baoquan Chen

In this work, we delve deeper into a novel machine learning-based approach that formulates the packing problem as conditional generative modeling.

Collision Avoidance Layout Design +1

Leveraging Angular Information Between Feature and Classifier for Long-tailed Learning: A Prediction Reformulation Approach

no code implementations3 Dec 2022 Haoxuan Wang, Junchi Yan

Deep neural networks still struggle on long-tailed image datasets, and one of the reasons is that the imbalance of training data across categories leads to the imbalance of trained model parameters.

Long-tail Learning

Learning Calibrated Uncertainties for Domain Shift: A Distributionally Robust Learning Approach

no code implementations8 Oct 2020 Haoxuan Wang, Zhiding Yu, Yisong Yue, Anima Anandkumar, Anqi Liu, Junchi Yan

We propose a framework for learning calibrated uncertainties under domain shifts, where the source (training) distribution differs from the target (test) distribution.

Density Ratio Estimation Unsupervised Domain Adaptation

Distributionally Robust Learning for Unsupervised Domain Adaptation

no code implementations28 Sep 2020 Haoxuan Wang, Anqi Liu, Zhiding Yu, Yisong Yue, Anima Anandkumar

This formulation motivates the use of two neural networks that are jointly trained --- a discriminative network between the source and target domains for density-ratio estimation, in addition to the standard classification network.

Density Ratio Estimation Unsupervised Domain Adaptation

Cannot find the paper you are looking for? You can Submit a new open access paper.