Search Results for author: Qian Yang

Found 46 papers, 18 papers with code

Multimodal RAG-driven Anomaly Detection and Classification in Laser Powder Bed Fusion using Large Language Models

no code implementations20 May 2025 Kiarash Naghavi Khanghah, Zhiling Chen, Lela Romeo, Qian Yang, Rajiv Malhotra, Farhad Imani, Hongyi Xu

This study presents a novel multimodal Retrieval-Augmented Generation-based framework that automates anomaly detection across various Additive Manufacturing processes leveraging retrieved information from literature, including images and descriptive text, rather than training datasets.

Anomaly Detection Descriptive +7

DeepSelective: Feature Gating and Representation Matching for Interpretable Clinical Prediction

no code implementations15 Apr 2025 Ruochi Zhang, Qian Yang, Xiaoyang Wang, Haoran Wu, Qiong Zhou, Yu Wang, Kewei Li, Yueying Wang, Yusi Fan, Jiale Zhang, Lan Huang, Chang Liu, Fengfeng Zhou

The rapid accumulation of Electronic Health Records (EHRs) has transformed healthcare by providing valuable data that enhance clinical predictions and diagnoses.

Data Compression Decision Making +3

My Precious Crash Data: Barriers and Opportunities in Encouraging Autonomous Driving Companies to Share Safety-Critical Data

no code implementations10 Apr 2025 Hauke Sandhaus, Angel Hsing-Chi Hwang, Wendy Ju, Qian Yang

Findings suggest two key, previously unknown barriers to data sharing: (1) Datasets inherently embed salient knowledge that is key to improving AV safety and are resource-intensive.

Autonomous Driving

How Problematic Writer-AI Interactions (Rather than Problematic AI) Hinder Writers' Idea Generation

no code implementations14 Mar 2025 Khonzoda Umarova, Talia Wise, Zhuoer Lyu, Mina Lee, Qian Yang

Through a case study, we demonstrate that the impact of genAI on students' idea development depends not only on the AI but also on the students and, crucially, their interactions in between.

Assessing and Learning Alignment of Unimodal Vision and Language Models

no code implementations CVPR 2025 Le Zhang, Qian Yang, Aishwarya Agrawal

Next, we introduce Swift Alignment of Image and Language (SAIL), a efficient transfer learning framework that aligns pretrained unimodal vision and language models for downstream vision-language tasks.

Semantic Segmentation Transfer Learning

Thoughtful Adoption of NLP for Civic Participation: Understanding Differences Among Policymakers

no code implementations30 Oct 2024 Jose A. Guridi, Cristobal Cheyre, Qian Yang

We interviewed seven politicians (politically appointed officials as heads of government institutions) and thirteen public servants (career government employees who design and administrate policy interventions), inquiring how they choose whether and how to use NLP tools to support civic participation processes.

Fairness

Diabetica: Adapting Large Language Model to Enhance Multiple Medical Tasks in Diabetes Care and Management

1 code implementation20 Sep 2024 Lai Wei, Zhen Ying, Muyang He, Yutong Chen, Qian Yang, Yanzhe Hong, Jiaping Lu, Kaipeng Zheng, Shaoting Zhang, Xiaoying Li, Weiran Huang, Ying Chen

Generally, our introduced framework helps develop diabetes-specific LLMs and highlights their potential to enhance clinical practice and provide personalized, data-driven support for diabetes management across different end users.

Language Modeling Language Modelling +2

WavTokenizer: an Efficient Acoustic Discrete Codec Tokenizer for Audio Language Modeling

1 code implementation29 Aug 2024 Shengpeng Ji, Ziyue Jiang, Wen Wang, Yifu Chen, Minghui Fang, Jialong Zuo, Qian Yang, Xize Cheng, Zehan Wang, RuiQi Li, Ziang Zhang, Xiaoda Yang, Rongjie Huang, Yidi Jiang, Qian Chen, Siqi Zheng, Zhou Zhao

Despite the reduced number of tokens, WavTokenizer achieves state-of-the-art reconstruction quality with outstanding UTMOS scores and inherently contains richer semantic information.

Language Modeling Language Modelling

MSceneSpeech: A Multi-Scene Speech Dataset For Expressive Speech Synthesis

no code implementations19 Jul 2024 Qian Yang, Jialong Zuo, Zhe Su, Ziyue Jiang, Mingze Li, Zhou Zhao, Feiyang Chen, Zhefeng Wang, Baoxing Huai

We introduce an open source high-quality Mandarin TTS dataset MSceneSpeech (Multiple Scene Speech Dataset), which is intended to provide resources for expressive speech synthesis.

Expressive Speech Synthesis

Qwen2-Audio Technical Report

2 code implementations15 Jul 2024 Yunfei Chu, Jin Xu, Qian Yang, Haojie Wei, Xipin Wei, Zhifang Guo, Yichong Leng, YuanJun Lv, Jinzheng He, Junyang Lin, Chang Zhou, Jingren Zhou

We introduce the latest progress of Qwen-Audio, a large-scale audio-language model called Qwen2-Audio, which is capable of accepting various audio signal inputs and performing audio analysis or direct textual responses with regard to speech instructions.

Instruction Following Language Modelling

Decompose and Compare Consistency: Measuring VLMs' Answer Reliability via Task-Decomposition Consistency Comparison

no code implementations10 Jul 2024 Qian Yang, Weixiang Yan, Aishwarya Agrawal

Despite tremendous advancements, current state-of-the-art Vision-Language Models (VLMs) are still far from perfect.

CodeHalu: Investigating Code Hallucinations in LLMs via Execution-based Verification

1 code implementation30 Apr 2024 Yuchen Tian, Weixiang Yan, Qian Yang, Xuandong Zhao, Qian Chen, Wen Wang, Ziyang Luo, Lei Ma, Dawn Song

By evaluating 17 popular LLMs using this benchmark, we reveal significant differences in their accuracy and reliability in code generation, offering detailed insights for further improving the code generation capabilities of LLMs.

Code Generation Hallucination

A Piece of Theatre: Investigating How Teachers Design LLM Chatbots to Assist Adolescent Cyberbullying Education

no code implementations27 Feb 2024 Michael A. Hedderich, Natalie N. Bazarova, Wenting Zou, Ryun Shim, Xinda Ma, Qian Yang

In offering this tool, we explore teachers' distinctive needs when designing chatbots to assist their teaching, and how chatbot design tools might better support them.

Chatbot

AIR-Bench: Benchmarking Large Audio-Language Models via Generative Comprehension

1 code implementation12 Feb 2024 Qian Yang, Jin Xu, Wenrui Liu, Yunfei Chu, Ziyue Jiang, Xiaohuan Zhou, Yichong Leng, YuanJun Lv, Zhou Zhao, Chang Zhou, Jingren Zhou

By revealing the limitations of existing LALMs through evaluation results, AIR-Bench can provide insights into the direction of future research.

2k Automatic Speech Recognition +4

Exploring the Best Practices of Query Expansion with Large Language Models

1 code implementation12 Jan 2024 Le Zhang, Yihong Wu, Qian Yang, Jian-Yun Nie

Large Language Models (LLMs) are foundational in language technologies, particularly in information retrieval (IR).

Information Retrieval Re-Ranking +3

Leveraging Generative AI for Clinical Evidence Summarization Needs to Ensure Trustworthiness

no code implementations19 Nov 2023 Gongbo Zhang, Qiao Jin, Denis Jered McInerney, Yong Chen, Fei Wang, Curtis L. Cole, Qian Yang, Yanshan Wang, Bradley A. Malin, Mor Peleg, Byron C. Wallace, Zhiyong Lu, Chunhua Weng, Yifan Peng

Evidence-based medicine promises to improve the quality of healthcare by empowering medical decisions and practices with the best available evidence.

The Participatory Turn in AI Design: Theoretical Foundations and the Current State of Practice

no code implementations2 Oct 2023 Fernando Delgado, Stephen Yang, Michael Madaio, Qian Yang

Despite the growing consensus that stakeholders affected by AI systems should participate in their design, enormous variation and implicit disagreements exist among current approaches.

Mega-TTS 2: Boosting Prompting Mechanisms for Zero-Shot Speech Synthesis

no code implementations14 Jul 2023 Ziyue Jiang, Jinglin Liu, Yi Ren, Jinzheng He, Zhenhui Ye, Shengpeng Ji, Qian Yang, Chen Zhang, Pengfei Wei, Chunfeng Wang, Xiang Yin, Zejun Ma, Zhou Zhao

However, the prompting mechanisms of zero-shot TTS still face challenges in the following aspects: 1) previous works of zero-shot TTS are typically trained with single-sentence prompts, which significantly restricts their performance when the data is relatively sufficient during the inference stage.

In-Context Learning Language Modelling +5

Mega-TTS: Zero-Shot Text-to-Speech at Scale with Intrinsic Inductive Bias

no code implementations6 Jun 2023 Ziyue Jiang, Yi Ren, Zhenhui Ye, Jinglin Liu, Chen Zhang, Qian Yang, Shengpeng Ji, Rongjie Huang, Chunfeng Wang, Xiang Yin, Zejun Ma, Zhou Zhao

3) We further use a VQGAN-based acoustic model to generate the spectrogram and a latent code language model to fit the distribution of prosody, since prosody changes quickly over time in a sentence, and language models can capture both local and long-range dependencies.

Attribute Inductive Bias +6

Med-EASi: Finely Annotated Dataset and Models for Controllable Simplification of Medical Texts

1 code implementation17 Feb 2023 Chandrayee Basu, Rosni Vasu, Michihiro Yasunaga, Qian Yang

Automatic medical text simplification can assist providers with patient-friendly communication and make medical texts more accessible, thereby improving health literacy.

Position Text Simplification

Semi-Siamese Network for Robust Change Detection Across Different Domains with Applications to 3D Printing

no code implementations16 Dec 2022 Yushuo Niu, Ethan Chadwick, Anson W. K. Ma, Qian Yang

In this work, we approach the defect detection problem using a novel Semi-Siamese deep learning model that directly compares a reference schematic of the desired print and a camera image of the achieved print.

Change Detection Defect Detection +2

Enhancing Multi-modal and Multi-hop Question Answering via Structured Knowledge and Unified Retrieval-Generation

1 code implementation16 Dec 2022 Qian Yang, Qian Chen, Wen Wang, Baotian Hu, Min Zhang

Moreover, the pipelined approaches of retrieval and generation might result in poor generation performance when retrieval performance is low.

Answer Generation Decoder +5

Dict-TTS: Learning to Pronounce with Prior Dictionary Knowledge for Text-to-Speech

1 code implementation5 Jun 2022 Ziyue Jiang, Zhe Su, Zhou Zhao, Qian Yang, Yi Ren, Jinglin Liu, Zhenhui Ye

This paper tackles the polyphone disambiguation problem from a concise and novel perspective: we propose Dict-TTS, a semantic-aware generative text-to-speech model with an online website dictionary (the existing prior information in the natural language).

Polyphone disambiguation text-to-speech +1

CoAuthor: Designing a Human-AI Collaborative Writing Dataset for Exploring Language Model Capabilities

1 code implementation18 Jan 2022 Mina Lee, Percy Liang, Qian Yang

Large language models (LMs) offer unprecedented language generation capabilities and exciting opportunities for interaction design.

Language Modeling Language Modelling +1

Stakeholder Participation in AI: Beyond "Add Diverse Stakeholders and Stir"

no code implementations1 Nov 2021 Fernando Delgado, Stephen Yang, Michael Madaio, Qian Yang

There is a growing consensus in HCI and AI research that the design of AI systems needs to engage and empower stakeholders who will be affected by AI.

Clinical Evidence Engine: Proof-of-Concept For A Clinical-Domain-Agnostic Decision Support Infrastructure

no code implementations31 Oct 2021 BoJian Hou, Hao Zhang, Gur Ladizhinsky, Stephen Yang, Volodymyr Kuleshov, Fei Wang, Qian Yang

As a result, clinicians cannot easily or rapidly scrutinize the CDSS recommendation when facing a difficult diagnosis or treatment decision in practice.

Diagnostic

Sentence-level Online Handwritten Chinese Character Recognition

no code implementations4 Jul 2021 Yunxin Li, Qian Yang, Qingcai Chen, Lin Ma, Baotian Hu, Xiaolong Wang, Yuxin Ding

Single online handwritten Chinese character recognition~(single OLHCCR) has achieved prominent performance.

Sentence Word Embeddings

FLOP: Federated Learning on Medical Datasets using Partial Networks

no code implementations10 Feb 2021 Qian Yang, Jianyi Zhang, Weituo Hao, Gregory Spell, Lawrence Carin

While different data-driven deep learning models have been developed to mitigate the diagnosis of COVID-19, the data itself is still scarce due to patient privacy concerns.

Deep Learning Federated Learning

Fluctuation spectra of large random dynamical systems reveal hidden structure in ecological networks

no code implementations10 Nov 2020 Yvonne Krumbeck, Qian Yang, George W. A. Constable, Tim Rogers

Understanding the relationship between complexity and stability in large dynamical systems -- such as ecosystems -- remains a key open question in complexity theory which has inspired a rich body of work developed over more than fifty years.

Open-Ended Question Answering

Faster On-Device Training Using New Federated Momentum Algorithm

no code implementations6 Feb 2020 Zhouyuan Huo, Qian Yang, Bin Gu, Lawrence Carin. Heng Huang

Mobile crowdsensing has gained significant attention in recent years and has become a critical paradigm for emerging Internet of Things applications.

Federated Learning

Graph-Driven Generative Models for Heterogeneous Multi-Task Learning

no code implementations20 Nov 2019 Wenlin Wang, Hongteng Xu, Zhe Gan, Bai Li, Guoyin Wang, Liqun Chen, Qian Yang, Wenqi Wang, Lawrence Carin

We propose a novel graph-driven generative model, that unifies multiple heterogeneous learning tasks into the same framework.

Multi-Task Learning Type prediction

Two Case Studies of Experience Prototyping Machine Learning Systems in the Wild

no code implementations21 Oct 2019 Qian Yang

When writing with the prototype, however, authors shared that they need to "see where the sentence is going two paragraphs later" in order to decide whether the suggestion aligns with their writing; Some even considered adopting machine suggestions as plagiarism, therefore "is simply wrong".

BIG-bench Machine Learning Decision Making +1

Learning Compressed Sentence Representations for On-Device Text Processing

1 code implementation ACL 2019 Dinghan Shen, Pengyu Cheng, Dhanasekar Sundararaman, Xinyuan Zhang, Qian Yang, Meng Tang, Asli Celikyilmaz, Lawrence Carin

Vector representations of sentences, trained on massive text corpora, are widely used as generic sentence embeddings across a variety of NLP problems.

Retrieval Sentence +1

Decoupled Parallel Backpropagation with Convergence Guarantee

3 code implementations ICML 2018 Zhouyuan Huo, Bin Gu, Qian Yang, Heng Huang

The backward locking in backpropagation algorithm constrains us from updating network layers in parallel and fully leveraging the computing resources.

Neural Machine Translation with Pivot Languages

no code implementations15 Nov 2016 Yong Cheng, Yang Liu, Qian Yang, Maosong Sun, Wei Xu

While recent neural machine translation approaches have delivered state-of-the-art performance for resource-rich language pairs, they suffer from the data scarcity problem for resource-scarce language pairs.

Machine Translation Translation

Cannot find the paper you are looking for? You can Submit a new open access paper.