Search Results for author: Yuhao Wang

Found 88 papers, 46 papers with code

Towards Provable (In)Secure Model Weight Release Schemes

no code implementations23 Jun 2025 Xin Yang, Bintao Tang, Yuhao Wang, Zimo Ji, Terry Jingchen Zhang, Wenyuan Jiang

Recent secure weight release schemes claim to enable open-source model distribution while protecting model ownership and preventing misuse.

EfficientVLA: Training-Free Acceleration and Compression for Vision-Language-Action Models

no code implementations11 Jun 2025 Yantai Yang, Yuhao Wang, Zichen Wen, Luo Zhongwei, Chang Zou, Zhipeng Zhang, Chuan Wen, Linfeng Zhang

Vision-Language-Action (VLA) models, particularly diffusion-based architectures, demonstrate transformative potential for embodied intelligence but are severely hampered by high computational and memory demands stemming from extensive inherent and inference-time redundancies.

Vision-Language-Action

SimpleDeepSearcher: Deep Information Seeking via Web-Powered Reasoning Trajectory Synthesis

3 code implementations22 May 2025 Shuang Sun, Huatong Song, Yuhao Wang, Ruiyang Ren, Jinhao Jiang, Junjie Zhang, Fei Bai, Jia Deng, Wayne Xin Zhao, Zheng Liu, Lei Fang, Zhongyuan Wang, Ji-Rong Wen

Retrieval-augmented generation (RAG) systems have advanced large language models (LLMs) in complex deep search scenarios requiring multi-step reasoning and iterative information retrieval.

Diversity Information Retrieval +3

Silent Leaks: Implicit Knowledge Extraction Attack on RAG Systems through Benign Queries

1 code implementation21 May 2025 Yuhao Wang, Wenjie Qu, Yanze Jiang, Zichen Liu, Yue Liu, Shengfang Zhai, Yinpeng Dong, Jiaheng Zhang

Retrieval-Augmented Generation (RAG) systems enhance large language models (LLMs) by incorporating external knowledge bases, but they are vulnerable to privacy risks from data extraction attacks.

RAG Retrieval-augmented Generation

VocalBench: Benchmarking the Vocal Conversational Abilities for Speech Interaction Models

1 code implementation21 May 2025 Heyang Liu, Yuhao Wang, Ziyang Cheng, Ronghua Wu, Qunshan Gu, Yanfeng Wang, Yu Wang

The rapid advancement of large language models (LLMs) has accelerated the development of multi-modal models capable of vocal communication.

Benchmarking

Unveiling Knowledge Utilization Mechanisms in LLM-based Retrieval-Augmented Generation

no code implementations17 May 2025 Yuhao Wang, Ruiyang Ren, Yucheng Wang, Wayne Xin Zhao, Jing Liu, Hua Wu, Haifeng Wang

In this paper, we present a systematic investigation of the intrinsic mechanisms by which LLMs integrate internal (parametric) and external (retrieved) knowledge in RAG scenarios.

Open-Domain Question Answering RAG +2

The Geography of Transportation Cybersecurity: Visitor Flows, Industry Clusters, and Spatial Dynamics

no code implementations12 May 2025 Yuhao Wang, Kailai Wang, Songhua Hu, Yunpeng, Zhang, Gino Lim, Pengyu Zhu

The rapid evolution of the transportation cybersecurity ecosystem, encompassing cybersecurity, automotive, and transportation and logistics sectors, will lead to the formation of distinct spatial clusters and visitor flow patterns across the US.

Clustering

Learning High-dimensional Gaussians from Censored Data

no code implementations28 Apr 2025 Arnab Bhattacharyya, Constantinos Daskalakis, Themis Gouleakis, Yuhao Wang

We design an efficient mean estimation algorithm, assuming that none of the possible missingness patterns is very rare conditioned on the values of the observed coordinates and that any small subset of coordinates is observed with sufficiently high probability.

SD-ReID: View-aware Stable Diffusion for Aerial-Ground Person Re-Identification

no code implementations13 Apr 2025 Xiang Hu, Pingping Zhang, Yuhao Wang, Bin Yan, Huchuan Lu

Furthermore, we propose the View-Refine Decoder (VRD) to obtain additional controllable conditions to generate missing cross-view features.

Person Re-Identification

VocalNet: Speech LLM with Multi-Token Prediction for Faster and High-Quality Generation

1 code implementation5 Apr 2025 Yuhao Wang, Heyang Liu, Ziyang Cheng, Ronghua Wu, Qunshan Gu, Yanfeng Wang, Yu Wang

Speech large language models (LLMs) have emerged as a prominent research focus in speech processing.

ERPO: Advancing Safety Alignment via Ex-Ante Reasoning Preference Optimization

no code implementations3 Apr 2025 Kehua Feng, Keyan Ding, Jing Yu, MengHan Li, Yuhao Wang, Tong Xu, Xinda Wang, Qiang Zhang, Huajun Chen

Recent advancements in large language models (LLMs) have accelerated progress toward artificial general intelligence, yet their potential to generate harmful content poses critical safety challenges.

Safety Alignment

LATex: Leveraging Attribute-based Text Knowledge for Aerial-Ground Person Re-Identification

no code implementations31 Mar 2025 Xiang Hu, Yuhao Wang, Pingping Zhang, Huchuan Lu

Then, with these features, we propose a Prompted Attribute Classifier Group (PACG) to generate person attribute predictions and obtain the encoded representations of predicted attributes.

Attribute Person Re-Identification

Histomorphology-driven multi-instance learning for breast cancer WSI classification

1 code implementation23 Mar 2025 Baizhi Wang, Rui Yan, Wenxin Ma, Xu Zhang, Yuhao Wang, Xiaolong Li, Yunjie Gu, Zihang Jiang, S. Kevin Zhou

With the incorporation of histomorphological information, our framework strengthens the model's ability to capture key and fine-grained pathological patterns, thereby enhancing WSI classification performance.

Classification Diagnostic

ECKGBench: Benchmarking Large Language Models in E-commerce Leveraging Knowledge Graph

no code implementations20 Mar 2025 Langming Liu, Haibin Chen, Yuhao Wang, Yujin Yuan, Shilei Liu, Wenbo Su, Xiangyu Zhao, Bo Zheng

To bridge the evaluation gap, we propose ECKGBench, a dataset specifically designed to evaluate the capacities of LLMs in e-commerce knowledge.

Benchmarking Hallucination +1

GeoRSMLLM: A Multimodal Large Language Model for Vision-Language Tasks in Geoscience and Remote Sensing

no code implementations16 Mar 2025 Zilun Zhang, Haozhan Shen, Tiancheng Zhao, Bin Chen, Zian Guan, Yuhao Wang, Xu Jia, Yuxiang Cai, Yongheng Shang, Jianwei Yin

The application of Vision-Language Models (VLMs) in remote sensing (RS) has demonstrated significant potential in traditional tasks such as scene classification, object detection, and image captioning.

Change Detection Image Captioning +11

Ranking and Selection with Simultaneous Input Data Collection

no code implementations14 Mar 2025 Yuhao Wang, Enlu Zhou

In this paper, we propose a general and novel formulation of ranking and selection with the existence of streaming input data.

Joint Modeling in Recommendations: A Survey

no code implementations28 Feb 2025 Xiangyu Zhao, Yichao Wang, Bo Chen, Jingtong Gao, Yuhao Wang, Xiaopeng Li, Pengyue Jia, Qidong Liu, Huifeng Guo, Ruiming Tang

In today's digital landscape, Deep Recommender Systems (DRS) play a crucial role in navigating and customizing online content for individual preferences.

Recommendation Systems Survey

Toward Universal Laws of Outlier Propagation

no code implementations12 Feb 2025 Aram Ebtekar, Yuhao Wang, Dominik Janzing

We argue that Algorithmic Information Theory (AIT) admits a principled way to quantify outliers in terms of so-called randomness deficiency.

Vevo: Controllable Zero-Shot Voice Imitation with Self-Supervised Disentanglement

no code implementations11 Feb 2025 Xueyao Zhang, Xiaohui Zhang, Kainan Peng, Zhenyu Tang, Vimal Manohar, Yingru Liu, Jeff Hwang, Dangna Li, Yuhao Wang, Julian Chan, Yuan Huang, Zhizheng Wu, Mingbo Ma

However, existing methods rely heavily on annotated data, and struggle with effectively disentangling timbre and style, leading to challenges in achieving controllable generation, especially in zero-shot scenarios.

Disentanglement text-to-speech +2

Holistically Guided Monte Carlo Tree Search for Intricate Information Seeking

no code implementations7 Feb 2025 Ruiyang Ren, Yuhao Wang, Junyi Li, Jinhao Jiang, Wayne Xin Zhao, Wenjie Wang, Tat-Seng Chua

We reformulate the task as a progressive information collection process with a knowledge memory and unite an adaptive checklist with multi-perspective reward modeling in MCTS.

Unity is Strength: Unifying Convolutional and Transformeral Features for Better Person Re-Identification

1 code implementation23 Dec 2024 Yuhao Wang, Pingping Zhang, Xuehu Liu, Zhengzheng Tu, Huchuan Lu

We propose a novel fusion framework called FusionReID to unify the strengths of CNNs and Transformers for image-based person ReID.

Person Re-Identification Unity

Scenario-Wise Rec: A Multi-Scenario Recommendation Benchmark

1 code implementation23 Dec 2024 Xiaopeng Li, Jingtong Gao, Pengyue Jia, Yichao Wang, Wanyu Wang, Yejing Wang, Yuhao Wang, Xiangyu Zhao, Huifeng Guo, Ruiming Tang

Multi Scenario Recommendation (MSR) tasks, referring to building a unified model to enhance performance across all recommendation scenarios, have recently gained much attention.

Drawing the Line: Enhancing Trustworthiness of MLLMs Through the Power of Refusal

no code implementations15 Dec 2024 Yuhao Wang, Zhiyuan Zhu, Heyang Liu, Yusheng Liao, Hongcheng Liu, Yanfeng Wang, Yu Wang

Multimodal large language models (MLLMs) excel at multimodal perception and understanding, yet their tendency to generate hallucinated or inaccurate responses undermines their trustworthiness.

DeMo: Decoupled Feature-Based Mixture of Experts for Multi-Modal Object Re-Identification

1 code implementation14 Dec 2024 Yuhao Wang, Yang Liu, Aihua Zheng, Pingping Zhang

To address these issues, we propose a novel feature learning framework called DeMo for multi-modal object ReID, which adaptively balances decoupled features using a mixture of experts.

Mixture-of-Experts Object

MambaPro: Multi-Modal Object Re-Identification with Mamba Aggregation and Synergistic Prompt

1 code implementation14 Dec 2024 Yuhao Wang, Xuehu Liu, Tianyu Yan, Yang Liu, Aihua Zheng, Pingping Zhang, Huchuan Lu

Furthermore, current multi-modal aggregation methods have obvious limitations in dealing with long sequences from different modalities.

Mamba Object

Bridging Relevance and Reasoning: Rationale Distillation in Retrieval-Augmented Generation

no code implementations11 Dec 2024 Pengyue Jia, Derong Xu, Xiaopeng Li, Zhaocheng Du, Xiangyang Li, Xiangyu Zhao, Yichao Wang, Yuhao Wang, Huifeng Guo, Ruiming Tang

The reranker and generator are two critical components in the Retrieval-Augmented Generation (i. e., RAG) pipeline, responsible for ranking relevant documents and generating responses.

RAG Retrieval +1

Pre-train, Align, and Disentangle: Empowering Sequential Recommendation with Large Language Models

1 code implementation5 Dec 2024 Yuhao Wang, Junwei Pan, Pengyue Jia, Wanyu Wang, Maolin Wang, Zhixiang Feng, Xiaotian Li, Jie Jiang, Xiangyu Zhao

Sequential Recommendation (SR) aims to leverage the sequential patterns in users' historical interactions to accurately track their preferences.

Sequential Recommendation

On the physics of nested Markov models: a generalized probabilistic theory perspective

no code implementations18 Nov 2024 Xingjian Zhang, Yuhao Wang

In this work, we inspect the nested Markov model through the lens of generalized probabilistic theory, an axiomatic framework to describe general physical theories.

valid

ImageRAG: Enhancing Ultra High Resolution Remote Sensing Imagery Analysis with ImageRAG

1 code implementation12 Nov 2024 Zilun Zhang, Haozhan Shen, Tiancheng Zhao, Zian Guan, Bin Chen, Yuhao Wang, Xu Jia, Yuxiang Cai, Yongheng Shang, Jianwei Yin

If choose to resize the UHR image to standard input image size, the extensive spatial and contextual information that UHR images contain will be neglected.

RAG Retrieval +1

Self-Calibrated Listwise Reranking with Large Language Models

no code implementations7 Nov 2024 Ruiyang Ren, Yuhao Wang, Kun Zhou, Wayne Xin Zhao, Wenjie Wang, Jing Liu, Ji-Rong Wen, Tat-Seng Chua

Large language models (LLMs), with advanced linguistic capabilities, have been employed in reranking tasks through a sequence-to-sequence approach.

Reranking

DMT-HI: MOE-based Hyperbolic Interpretable Deep Manifold Transformation for Unspervised Dimensionality Reduction

1 code implementation25 Oct 2024 Zelin Zang, Yuhao Wang, Jinlin Wu, Hong Liu, Yue Shen, Stan. Z Li, Zhen Lei

DMT-HI enhances DR accuracy by leveraging hyperbolic embeddings to represent the hierarchical nature of data, while also improving interpretability by explicitly linking input data, embedding outcomes, and key features through the MOE structure.

Dimensionality Reduction Mixture-of-Experts

TRRG: Towards Truthful Radiology Report Generation With Cross-modal Disease Clue Enhanced Large Language Model

no code implementations22 Aug 2024 Yuhao Wang, Chao Hao, Yawen Cui, Xinqi Su, Weicheng Xie, Tao Tan, Zitong Yu

This significantly enhances the report generation capability and clinical effectiveness of multi-modal large language models in the field of radiology reportgeneration.

Contrastive Learning Language Modeling +2

DAAD: Dynamic Analysis and Adaptive Discriminator for Fake News Detection

1 code implementation20 Aug 2024 Xinqi Su, Yawen Cui, Ajian Liu, Xun Lin, Yuhao Wang, Haochen Liang, Wenhui Li, Zitong Yu

In current web environment, fake news spreads rapidly across online social networks, posing serious threats to society.

Fake News Detection Image Manipulation

Mimicking the Mavens: Agent-based Opinion Synthesis and Emotion Prediction for Social Media Influencers

no code implementations30 Jul 2024 Qinglan Wei, Ruiqi Xue, Yutian Wang, Hongjiang Xiao, Yuhao Wang, Xiaoyan Duan

Predicting influencers' views and public sentiment on social media is crucial for anticipating societal trends and guiding strategic responses.

Language Modelling RAG +1

Decoding Linguistic Representations of Human Brain

no code implementations30 Jul 2024 Yu Wang, Heyang Liu, Yuhao Wang, Chuan Xuan, Yixuan Hou, Sheng Feng, Hongcheng Liu, Yusheng Liao, Yanfeng Wang

Language, as an information medium created by advanced organisms, has always been a concern of neuroscience regarding how it is represented in the brain.

Brain Computer Interface

LLMBox: A Comprehensive Library for Large Language Models

1 code implementation8 Jul 2024 Tianyi Tang, Yiwen Hu, Bingqian Li, Wenyang Luo, Zijing Qin, Haoxiang Sun, Jiapeng Wang, Shiyi Xu, Xiaoxue Cheng, Geyang Guo, Han Peng, Bowen Zheng, Yiru Tang, Yingqian Min, Yushuo Chen, Jie Chen, Yuanqian Zhao, Luran Ding, Yuhao Wang, Zican Dong, Chunxuan Xia, Junyi Li, Kun Zhou, Wayne Xin Zhao, Ji-Rong Wen

To facilitate the research on large language models (LLMs), this paper presents a comprehensive and unified library, LLMBox, to ease the development, use, and evaluation of LLMs.

LLM4MSR: An LLM-Enhanced Paradigm for Multi-Scenario Recommendation

no code implementations18 Jun 2024 Yuhao Wang, Yichao Wang, Zichuan Fu, Xiangyang Li, Xiangyu Zhao, Huifeng Guo, Ruiming Tang

As the demand for more personalized recommendation grows and a dramatic boom in commercial scenarios arises, the study on multi-scenario recommendation (MSR) has attracted much attention, which uses the data from all scenarios to simultaneously improve their recommendation performance.

Language Modelling Large Language Model +1

Sigma: Siamese Mamba Network for Multi-Modal Semantic Segmentation

1 code implementation5 Apr 2024 Zifu Wan, Pingping Zhang, Yuhao Wang, Silong Yong, Simon Stepputtis, Katia Sycara, Yaqi Xie

Multi-modal semantic segmentation significantly enhances AI agents' perception and scene understanding, especially under adverse conditions like low-light or overexposed environments.

Decoder Mamba +5

LLMTreeRec: Unleashing the Power of Large Language Models for Cold-Start Recommendations

1 code implementation31 Mar 2024 Wenlin Zhang, Chuhan Wu, Xiangyang Li, Yuhao Wang, Kuicai Dong, Yichao Wang, Xinyi Dai, Xiangyu Zhao, Huifeng Guo, Ruiming Tang

The lack of training data gives rise to the system cold-start problem in recommendation systems, making them struggle to provide effective recommendations.

Recommendation Systems Re-Ranking +1

Magic Tokens: Select Diverse Tokens for Multi-modal Object Re-Identification

2 code implementations CVPR 2024 Pingping Zhang, Yuhao Wang, Yang Liu, Zhengzheng Tu, Huchuan Lu

To address above issues, we propose a novel learning framework named \textbf{EDITOR} to select diverse tokens from vision Transformers for multi-modal object ReID.

Object

Automatic Interactive Evaluation for Large Language Models with State Aware Patient Simulator

4 code implementations13 Mar 2024 Yusheng Liao, Yutong Meng, Yuhao Wang, Hongcheng Liu, Yanfeng Wang, Yu Wang

Large Language Models (LLMs) have demonstrated remarkable proficiency in human interactions, yet their application within the medical field remains insufficiently explored.

Reusing Historical Trajectories in Natural Policy Gradient via Importance Sampling: Convergence and Convergence Rate

no code implementations1 Mar 2024 Yifan Lin, Yuhao Wang, Enlu Zhou

The efficient utilization of historical trajectories obtained from previous policies is essential for expediting policy optimization.

Policy Gradient Methods

REAR: A Relevance-Aware Retrieval-Augmented Framework for Open-Domain Question Answering

1 code implementation27 Feb 2024 Yuhao Wang, Ruiyang Ren, Junyi Li, Wayne Xin Zhao, Jing Liu, Ji-Rong Wen

By combining the improvements in both architecture and training, our proposed REAR can better utilize external knowledge by effectively perceiving the relevance of retrieved documents.

Open-Domain Question Answering RAG +2

Infrared and visible Image Fusion with Language-driven Loss in CLIP Embedding Space

1 code implementation26 Feb 2024 Yuhao Wang, Lingjuan Miao, Zhiqiang Zhou, Lei Zhang, Yajun Qiao

A language-driven fusion model is then constructed in the embedding space, by establishing the relationship among the embedded vectors to represent the fusion objective and input image modalities.

Infrared And Visible Image Fusion

Optimal estimation of Gaussian (poly)trees

1 code implementation9 Feb 2024 Yuhao Wang, Ming Gao, Wai Ming Tai, Bryon Aragam, Arnab Bhattacharyya

We develop optimal algorithms for learning undirected Gaussian trees and directed Gaussian polytrees from data.

MM-SAP: A Comprehensive Benchmark for Assessing Self-Awareness of Multimodal Large Language Models in Perception

1 code implementation15 Jan 2024 Yuhao Wang, Yusheng Liao, Heyang Liu, Hongcheng Liu, Yu Wang, Yanfeng Wang

We believe that these hallucinations are partially due to the models' struggle with understanding what they can and cannot perceive from images, a capability we refer to as self-awareness in perception.

TOP-ReID: Multi-spectral Object Re-Identification with Token Permutation

1 code implementation15 Dec 2023 Yuhao Wang, Xuehu Liu, Pingping Zhang, Hu Lu, Zhengzheng Tu, Huchuan Lu

In addition, most of current Transformer-based ReID methods only utilize the global feature of class tokens to achieve the holistic retrieval, ignoring the local discriminative ones.

Scenario-Aware Hierarchical Dynamic Network for Multi-Scenario Recommendation

no code implementations5 Sep 2023 Jingtong Gao, Bo Chen, Menghui Zhu, Xiangyu Zhao, Xiaopeng Li, Yuhao Wang, Yichao Wang, Huifeng Guo, Ruiming Tang

To address these limitations, we propose a Scenario-Aware Hierarchical Dynamic Network for Multi-Scenario Recommendations (HierRec), which perceives implicit patterns adaptively and conducts explicit and implicit scenario modeling jointly.

Click-Through Rate Prediction

Investigating the Factual Knowledge Boundary of Large Language Models with Retrieval Augmentation

1 code implementation20 Jul 2023 Ruiyang Ren, Yuhao Wang, Yingqi Qu, Wayne Xin Zhao, Jing Liu, Hao Tian, Hua Wu, Ji-Rong Wen, Haifeng Wang

In this study, we present the first analysis on the factual knowledge boundaries of LLMs and how retrieval augmentation affects LLMs on open-domain question answering (QA), with a bunch of important findings.

Open-Domain Question Answering Retrieval +1

Unified Medical Image-Text-Label Contrastive Learning With Continuous Prompt

no code implementations12 Jul 2023 Yuhao Wang

In this paper, we propose a unified Image-Text-Label contrastive learning framework based on continuous prompts, with three main contributions.

Contrastive Learning

Reading Radiology Imaging Like The Radiologist

no code implementations12 Jul 2023 Yuhao Wang

By referencing the disease-oriented similar report and the visual features, the factual consistency model can generate a more accurate radiology report.

Image Captioning Retrieval +1

SelfEvolve: A Code Evolution Framework via Large Language Models

no code implementations5 Jun 2023 Shuyang Jiang, Yuhao Wang, Yu Wang

However, while various methods have been proposed to augment LLMs with retrieved knowledge and enhance the quality of code generation, the performance of these retrieval-based methods is limited by the strength of the retrievers used.

Code Generation HumanEval +1

Multi-Task Deep Recommender Systems: A Survey

no code implementations7 Feb 2023 Yuhao Wang, Ha Tsz Lam, Yi Wong, Ziru Liu, Xiangyu Zhao, Yichao Wang, Bo Chen, Huifeng Guo, Ruiming Tang

Multi-task learning (MTL) aims at learning related tasks in a unified model to achieve mutual improvement among tasks considering their shared knowledge.

Multi-Task Learning Recommendation Systems +2

Exploration and Regularization of the Latent Action Space in Recommendation

1 code implementation7 Feb 2023 Shuchang Liu, Qingpeng Cai, Bowen Sun, Yuhao Wang, Ji Jiang, Dong Zheng, Kun Gai, Peng Jiang, Xiangyu Zhao, Yongfeng Zhang

To overcome this challenge, we propose a hyper-actor and critic learning framework where the policy decomposes the item list generation process into a hyper-action inference step and an effect-action selection step.

Recommendation Systems

TextBox 2.0: A Text Generation Library with Pre-trained Language Models

1 code implementation26 Dec 2022 Tianyi Tang, Junyi Li, Zhipeng Chen, Yiwen Hu, Zhuohao Yu, Wenxun Dai, Zican Dong, Xiaoxue Cheng, Yuhao Wang, Wayne Xin Zhao, Jian-Yun Nie, Ji-Rong Wen

To facilitate research on text generation, this paper presents a comprehensive and unified library, TextBox 2. 0, focusing on the use of pre-trained language models (PLMs).

Abstractive Text Summarization Data-to-Text Generation +7

Generative Modeling in Sinogram Domain for Sparse-view CT Reconstruction

1 code implementation25 Nov 2022 Bing Guan, Cailian Yang, Liu Zhang, Shanzhou Niu, Minghui Zhang, Yuhao Wang, Weiwen Wu, Qiegen Liu

When the number of projection view changes, the DL network should be retrained with updated sparse-view/full-view CT image pairs.

Computed Tomography (CT) CT Reconstruction

Generative Modeling in Structural-Hankel Domain for Color Image Inpainting

3 code implementations25 Nov 2022 Zihao Li, CHUNHUA WU, Shenglin Wu, Wenbo Wan, Yuhao Wang, Qiegen Liu

To better apply the score-based generative model to learn the internal statistical distribution within patches, the large-scale Hankel matrices are finally folded into the higher dimensional tensors for prior learning.

Diversity Image Inpainting

Self adaptive global-local feature enhancement for radiology report generation

no code implementations21 Nov 2022 Yuhao Wang, Kai Wang, Xiaohong Liu, Tianrun Gao, Jingyue Zhang, Guangyu Wang

Automated radiology report generation aims at automatically generating a detailed description of medical images, which can greatly alleviate the workload of radiologists and provide better medical services to remote areas.

Anatomy

Factorized Blank Thresholding for Improved Runtime Efficiency of Neural Transducers

no code implementations2 Nov 2022 Duc Le, Frank Seide, Yuhao Wang, Yang Li, Kjell Schubert, Ozlem Kalinli, Michael L. Seltzer

We show how factoring the RNN-T's output distribution can significantly reduce the computation cost and power consumption for on-device ASR inference with no loss in accuracy.

Risk-averse Contextual Multi-armed Bandit Problem with Linear Payoffs

no code implementations24 Jun 2022 Yifan Lin, Yuhao Wang, Enlu Zhou

In particular, we consider mean-variance as the risk criterion, and the best arm is the one with the largest mean-variance reward.

Thompson Sampling

Long-term Causal Inference Under Persistent Confounding via Data Combination

1 code implementation15 Feb 2022 Guido Imbens, Nathan Kallus, Xiaojie Mao, Yuhao Wang

In this paper, we uniquely tackle the challenge of persistent unmeasured confounders, i. e., some unmeasured confounders that can simultaneously affect the treatment, short-term outcomes and the long-term outcome, noting that they invalidate identification strategies in previous literature.

Causal Inference

Virtual Coil Augmentation Technology for MR Coil Extrapolation via Deep Learning

no code implementations19 Jan 2022 Cailian Yang, Xianghao Liao, Yuhao Wang, Minghui Zhang, Qiegen Liu

Two main components are incorporated into the network design, namely variable augmentation technology and sum of squares (SOS) objective function.

Image Reconstruction Super-Resolution

Variable Augmented Network for Invertible MR Coil Compression

1 code implementation19 Jan 2022 Xianghao Liao, Shanshan Wang, Lanlan Tu, Yuhao Wang, Dong Liang, Qiegen Liu

Additionally, its performance is not susceptible to different number of virtual coils.

MRI Reconstruction Using Deep Energy-Based Model

1 code implementation7 Sep 2021 Yu Guan, Zongjiang Tu, Shanshan Wang, Qiegen Liu, Yuhao Wang, Dong Liang

In contrast to other generative models for reconstruction, the proposed method utilizes deep energy-based information as the image prior in reconstruction to improve the quality of image.

Image Generation model +1

Variable Augmented Network for Invertible Modality Synthesis-Fusion

1 code implementation2 Sep 2021 Yuhao Wang, Ruirui Liu, Zihao Li, Cailian Yang, Qiegen Liu

As an effective way to integrate the information contained in multiple medical images under different modalities, medical image synthesis and fusion have emerged in various clinical applications such as disease diagnosis and treatment planning.

Image Generation

High-dimensional Assisted Generative Model for Color Image Restoration

1 code implementation14 Aug 2021 Kai Hong, CHUNHUA WU, Cailian Yang, Minghui Zhang, Yancheng Lu, Yuhao Wang, Qiegen Liu

This work presents an unsupervised deep learning scheme that exploiting high-dimensional assisted score-based generative model for color image restoration tasks.

Demosaicking Denoising +2

Learning Sparse Fixed-Structure Gaussian Bayesian Networks

1 code implementation22 Jul 2021 Arnab Bhattacharyya, Davin Choo, Rishikesh Gajjala, Sutanu Gayen, Yuhao Wang

We also study a couple of new algorithms for the problem: - BatchAvgLeastSquares takes the average of several batches of least squares solutions at each node, so that one can interpolate between the batch size and the number of batches.

Wavelet Transform-assisted Adaptive Generative Modeling for Colorization

4 code implementations9 Jul 2021 Jin Li, Wanyun Li, Zichen Xu, Yuhao Wang, Qiegen Liu

Unsupervised deep learning has recently demonstrated the promise of producing high-quality samples.

Colorization Denoising +2

Identifiability of AMP chain graph models

1 code implementation17 Jun 2021 Yuhao Wang, Arnab Bhattacharyya

AMP models are described by DAGs on chain components which themselves are undirected graphs.

Direction-Aggregated Attack for Transferable Adversarial Examples

1 code implementation19 Apr 2021 Tianjin Huang, Vlado Menkovski, Yulong Pei, Yuhao Wang, Mykola Pechenizkiy

Deep neural networks are vulnerable to adversarial examples that are crafted by imposing imperceptible changes to the inputs.

Sparse Code Multiple Access for 6G Wireless Communication Networks: Recent Advances and Future Directions

no code implementations3 Apr 2021 Lisu Yu, Zilong Liu, Miaowen Wen, Donghong Cai, Shuping Dang, Yuhao Wang, Pei Xiao

As 5G networks rolling out in many different countries nowadays, the time has come to investigate how to upgrade and expand them towards 6G, where the latter is expected to realize the interconnection of everything as well as the development of a ubiquitous intelligent mobile world for intelligent life.

Joint Intensity-Gradient Guided Generative Modeling for Colorization

7 code implementations28 Dec 2020 Kai Hong, Jin Li, Wanyun Li, Cailian Yang, Minghui Zhang, Yuhao Wang, Qiegen Liu

Furthermore, the joint intensity-gradient constraint in data-fidelity term is proposed to limit the degree of freedom within generative model at the iterative colorization stage, and it is conducive to edge-preserving.

Colorization

Amortized Variational Deep Q Network

1 code implementation3 Nov 2020 Haotian Zhang, Yuhao Wang, Jianyong Sun, Zongben Xu

Efficient exploration is one of the most important issues in deep reinforcement learning.

Deep Reinforcement Learning Efficient Exploration +2

Joint Inference of Multiple Graphs from Matrix Polynomials

no code implementations16 Oct 2020 Madeline Navarro, Yuhao Wang, Antonio G. Marques, Caroline Uhler, Santiago Segarra

Inferring graph structure from observations on the nodes is an important and popular network science task.

Learning in the Frequency Domain

4 code implementations CVPR 2020 Kai Xu, Minghai Qin, Fei Sun, Yuhao Wang, Yen-Kuang Chen, Fengbo Ren

Experiment results show that learning in the frequency domain with static channel selection can achieve higher accuracy than the conventional spatial downsampling approach and meanwhile further reduce the input data size.

channel selection Instance Segmentation +1

Causal Discovery from Incomplete Data: A Deep Learning Approach

no code implementations15 Jan 2020 Yuhao Wang, Vlado Menkovski, Hao Wang, Xin Du, Mykola Pechenizkiy

As systems are getting more autonomous with the development of artificial intelligence, it is important to discover the causal knowledge from observational sensory inputs.

Causal Discovery Deep Learning +1

DARB: A Density-Aware Regular-Block Pruning for Deep Neural Networks

no code implementations19 Nov 2019 Ao Ren, Tao Zhang, Yuhao Wang, Sheng Lin, Peiyan Dong, Yen-Kuang Chen, Yuan Xie, Yanzhi Wang

As a further optimization, we propose a density-adaptive regular-block (DARB) pruning that outperforms prior structured pruning work with high pruning ratio and decoding efficiency.

Model Compression Network Pruning

Learning Priors in High-frequency Domain for Inverse Imaging Reconstruction

1 code implementation23 Oct 2019 Zhuonan He, Jinjie Zhou, Dong Liang, Yuhao Wang, Qiegen Liu

Ill-posed inverse problems in imaging remain an active research topic in several decades, with new approaches constantly emerging.

Denoising Dictionary Learning +1

Direct Estimation of Differences in Causal Graphs

1 code implementation NeurIPS 2018 Yuhao Wang, Chandler Squires, Anastasiya Belyaeva, Caroline Uhler

We consider the problem of estimating the differences between two causal directed acyclic graph (DAG) models given i. i. d.~samples from each model.

Methodology

Permutation-based Causal Inference Algorithms with Interventions

no code implementations NeurIPS 2017 Yuhao Wang, Liam Solus, Karren Yang, Caroline Uhler

Learning directed acyclic graphs using both observational and interventional data is now a fundamentally important problem due to recent technological developments in genomics that generate such single-cell gene expression data at a very large scale.

Causal Inference

Cannot find the paper you are looking for? You can Submit a new open access paper.