Search Results for author: Haotian Wang

Found 34 papers, 14 papers with code

EmotiveTalk: Expressive Talking Head Generation through Audio Information Decoupling and Emotional Video Diffusion

no code implementations23 Nov 2024 Haotian Wang, Yuzhe Weng, Yueyan Li, Zilu Guo, Jun Du, Shutong Niu, Jiefeng Ma, Shan He, Xiaoyan Wu, Qiming Hu, Bing Yin, Cong Liu, Qingfeng Liu

Diffusion models have revolutionized the field of talking head generation, yet still face challenges in expressiveness, controllability, and stability in long-time generation.

Talking Head Generation

InLINE: Inner-Layer Information Exchange for Multi-task Learning on Heterogeneous Graphs

no code implementations29 Oct 2024 Xinyue Feng, Jinquan Hang, Yuequn Zhang, Haotian Wang, Desheng Zhang, Guang Wang

However, MTL introduces the issue of negative transfer, where the training of different tasks interferes with each other as they may focus on different information from the data, resulting in suboptimal performance.

Disentanglement Multi-Task Learning

Scale Propagation Network for Generalizable Depth Completion

1 code implementation24 Oct 2024 Haotian Wang, Meng Yang, Xinhu Zheng, Gang Hua

Although deep learning based methods have made tremendous progress in this problem, these models cannot generalize well across different scenes that are unobserved in training, posing a fundamental limitation that yet to be overcome.

Depth Completion

Enhancing Multimodal Sentiment Analysis for Missing Modality through Self-Distillation and Unified Modality Cross-Attention

1 code implementation19 Oct 2024 Yuzhe Weng, Haotian Wang, Tian Gao, Kewei Li, Shutong Niu, Jun Du

In multimodal sentiment analysis, collecting text data is often more challenging than video or audio due to higher annotation costs and inconsistent automatic speech recognition (ASR) quality.

Automatic Speech Recognition Automatic Speech Recognition (ASR) +2

HypomimiaCoach: An AU-based Digital Therapy System for Hypomimia Detection & Rehabilitation with Parkinson's Disease

no code implementations13 Oct 2024 Yingjing Xu, Xueyan Cai, Zihong Zhou, Mengru Xue, Bo wang, Haotian Wang, Zhengke Li, Chentian Weng, Wei Luo, Cheng Yao, Bo Lin, Jianwei Yin

To investigate this, we developed HypomimaCoach, an Action Unit (AU)-based digital therapy system for hypomimia detection and rehabilitation in Parkinson's disease.

BeamAggR: Beam Aggregation Reasoning over Multi-source Knowledge for Multi-hop Question Answering

no code implementations28 Jun 2024 Zheng Chu, Jingchang Chen, Qianglong Chen, Haotian Wang, Kun Zhu, Xiyuan Du, Weijiang Yu, Ming Liu, Bing Qin

For composite questions, the LLM combines beam candidates, explores multiple reasoning paths through probabilistic aggregation, and prioritizes the most promising trajectory.

Multi-hop Question Answering Question Answering +1

An Information Bottleneck Perspective for Effective Noise Filtering on Retrieval-Augmented Generation

1 code implementation3 Jun 2024 Kun Zhu, Xiaocheng Feng, Xiyuan Du, Yuxuan Gu, Weijiang Yu, Haotian Wang, Qianglong Chen, Zheng Chu, Jingchang Chen, Bing Qin

Retrieval-augmented generation integrates the capabilities of large language models with relevant information retrieved from an extensive corpus, yet encounters challenges when confronted with real-world noisy data.

Answer Generation Question Answering +1

PPA-Game: Characterizing and Learning Competitive Dynamics Among Online Content Creators

no code implementations22 Mar 2024 Renzhe Xu, Haotian Wang, Xingxuan Zhang, Bo Li, Peng Cui

We introduce the Proportional Payoff Allocation Game (PPA-Game) to model how agents, akin to content creators on platforms like YouTube and TikTok, compete for divisible resources and consumers' attention.

Diversity

A Study of Dropout-Induced Modality Bias on Robustness to Missing Video Frames for Audio-Visual Speech Recognition

1 code implementation CVPR 2024 Yusheng Dai, Hang Chen, Jun Du, Ruoyu Wang, Shihao Chen, Jiefeng Ma, Haotian Wang, Chin-Hui Lee

In this paper, we investigate this contrasting phenomenon from the perspective of modality bias and reveal that an excessive modality bias on the audio caused by dropout is the underlying reason.

Audio-Visual Speech Recognition Knowledge Distillation +2

Composite Active Learning: Towards Multi-Domain Active Learning with Theoretical Guarantees

1 code implementation3 Feb 2024 Guang-Yuan Hao, Hengguan Huang, Haotian Wang, Jie Gao, Hao Wang

In this paper, we propose the first general method, dubbed composite active learning (CAL), for multi-domain AL. Our approach explicitly considers the domain-level and instance-level information in the problem; CAL first assigns domain-level budgets according to domain-level importance, which is estimated by optimizing an upper error bound that we develop; with the domain-level budgets, CAL then leverages a certain instance-level query strategy to select samples to label from each domain.

Active Learning

Learning to Break: Knowledge-Enhanced Reasoning in Multi-Agent Debate System

2 code implementations8 Dec 2023 Haotian Wang, Xiyuan Du, Weijiang Yu, Qianglong Chen, Kun Zhu, Zheng Chu, Lian Yan, Yi Guan

First, we involve a shared retrieval knowledge pool in the debate process to solve the problem of limited and different knowledge backgrounds.

Retrieval

TimeBench: A Comprehensive Evaluation of Temporal Reasoning Abilities in Large Language Models

1 code implementation29 Nov 2023 Zheng Chu, Jingchang Chen, Qianglong Chen, Weijiang Yu, Haotian Wang, Ming Liu, Bing Qin

Grasping the concept of time is a fundamental facet of human cognition, indispensable for truly comprehending the intricacies of the world.

Trends in Integration of Knowledge and Large Language Models: A Survey and Taxonomy of Methods, Benchmarks, and Applications

no code implementations10 Nov 2023 Zhangyin Feng, Weitao Ma, Weijiang Yu, Lei Huang, Haotian Wang, Qianglong Chen, Weihua Peng, Xiaocheng Feng, Bing Qin, Ting Liu

In this paper, we propose a review to discuss the trends in integration of knowledge and large language models, including taxonomy of methods, benchmarks, and applications.

knowledge editing Retrieval +1

G2-MonoDepth: A General Framework of Generalized Depth Inference from Monocular RGB+X Data

1 code implementation24 Oct 2023 Haotian Wang, Meng Yang, Nanning Zheng

This paper investigates a unified task of monocular depth inference, which infers high-quality depth maps from all kinds of input raw data from various robots in unseen scenes.

Data Augmentation Depth Completion +1

Breast Ultrasound Tumor Classification Using a Hybrid Multitask CNN-Transformer Network

no code implementations4 Aug 2023 Bryar Shareef, Min Xian, Aleksandar Vakanski, Haotian Wang

Vision Transformers have an improved capability of capturing global contextual information but may distort the local image patterns due to the tokenization operations.

Classification Image Classification

Competing for Shareable Arms in Multi-Player Multi-Armed Bandits

1 code implementation30 May 2023 Renzhe Xu, Haotian Wang, Xingxuan Zhang, Bo Li, Peng Cui

In reality, agents often have to learn and maximize the rewards of the resources at the same time.

Multi-Armed Bandits

E-NER: Evidential Deep Learning for Trustworthy Named Entity Recognition

1 code implementation29 May 2023 Zhen Zhang, Mengting Hu, Shiwan Zhao, Minlie Huang, Haotian Wang, Lemao Liu, Zhirui Zhang, Zhe Liu, Bingzhe Wu

Most named entity recognition (NER) systems focus on improving model performance, ignoring the need to quantify model uncertainty, which is critical to the reliability of NER systems in open environments.

Deep Learning named-entity-recognition +2

Enhanced Sharp-GAN For Histopathology Image Synthesis

no code implementations24 Jan 2023 Sujata Butte, Haotian Wang, Aleksandar Vakanski, Min Xian

To address the challenges, we propose a novel approach that enhances the quality of synthetic images by using nuclei topology and contour regularization.

Image Generation Segmentation

Domain Specified Optimization for Deployment Authorization

no code implementations ICCV 2023 Haotian Wang, Haoang Chi, Wenjing Yang, Zhipeng Lin, Mingyang Geng, Long Lan, Jing Zhang, DaCheng Tao

As a complementary of SDPA, we also propose Target-Combined Deployment Authorization (TPDA), where unauthorized domains are partially accessible, and simplify the DSO method to a perturbation operation on the pseudo predictions, referred to as Target-Dependent Domain-Specified Optimization (TDSO).

SIAN: Style-Guided Instance-Adaptive Normalization for Multi-Organ Histopathology Image Synthesis

no code implementations2 Sep 2022 Haotian Wang, Min Xian, Aleksandar Vakanski, Bryar Shareef

Existing deep neural networks for histopathology image synthesis cannot generate image styles that align with different organs, and cannot produce accurate boundaries of clustered nuclei.

Image Generation Instance Segmentation +2

On Cyclic Solutions to the Min-Max Latency Multi-Robot Patrolling Problem

no code implementations14 Mar 2022 Peyman Afshani, Mark De Berg, Kevin Buchin, Jie Gao, Maarten Loffler, Amir Nayyeri, Benjamin Raichel, Rik Sarkar, Haotian Wang, Hao-Tsung Yang

For the Euclidean version of the problem, for instance, combining our results with known results on Euclidean TSP, yields a PTAS for approximating an optimal cyclic solution, and it yields a $(2(1-1/k)+\varepsilon)$-approximation of the optimal unrestricted solution.

Sharp-GAN: Sharpness Loss Regularized GAN for Histopathology Image Synthesis

no code implementations27 Oct 2021 Sujata Butte, Haotian Wang, Min Xian, Aleksandar Vakanski

Conditional generative adversarial networks have been applied to generate synthetic histopathology images to alleviate this issue, but current approaches fail to generate clear contours for overlapped and touching nuclei.

Generative Adversarial Network Image Generation

Potato Crop Stress Identification in Aerial Images using Deep Learning-based Object Detection

no code implementations14 Jun 2021 Sujata Butte, Aleksandar Vakanski, Kasia Duellman, Haotian Wang, Amin Mirkouei

Recent research on the application of remote sensing and deep learning-based analysis in precision agriculture demonstrated a potential for improved crop management and reduced environmental impacts of agricultural production.

Management object-detection +1

Multi-Slice Low-Rank Tensor Decomposition Based Multi-Atlas Segmentation: Application to Automatic Pathological Liver CT Segmentation

no code implementations24 Feb 2021 Changfa Shi, Min Xian, Xiancheng Zhou, Haotian Wang, Heng-Da Cheng

Both qualitative and quantitative results demonstrate that, in the presence of major pathology, the proposed method is more accurate and robust than state-of-the-art methods.

Image Registration Liver Segmentation +2

Heterogeneous Interventions Reduce the Spread of COVID-19 in Simulations on Real Mobility Data

1 code implementation14 Aug 2020 Haotian Wang, Abhirup Ghosh, Jiaxin Ding, Rik Sarkar, Jie Gao

Major interventions have been introduced worldwide to slow down the spread of the SARS-CoV-2 virus.

Social and Information Networks Physics and Society

Bending Loss Regularized Network for Nuclei Segmentation in Histopathology Images

no code implementations3 Feb 2020 Haotian Wang, Min Xian, Aleksandar Vakanski

Separating overlapped nuclei is a major challenge in histopathology image analysis.

Cannot find the paper you are looking for? You can Submit a new open access paper.