Search Results for author: Yuhao Zhang

Found 51 papers, 30 papers with code

Overview of the MEDIQA 2021 Shared Task on Summarization in the Medical Domain

no code implementations • NAACL (BioNLP) 2021 • Asma Ben Abacha, Yassine Mrabet, Yuhao Zhang, Chaitanya Shivade, Curtis Langlotz, Dina Demner-Fushman

The MEDIQA 2021 shared tasks at the BioNLP 2021 workshop addressed three tasks on summarization for medical text: (i) a question summarization task aimed at exploring new approaches to understanding complex real-world consumer health queries, (ii) a multi-answer summarization task that targeted aggregation of multiple relevant answers to a biomedical question into one concise and relevant answer, and (iii) a radiology report summarization task addressing the development of clinically relevant impressions from radiology report findings.

Text Summarization

A Contrastive Framework for Learning Sentence Representations from Pairwise and Triple-wise Perspective in Angular Space

no code implementations • ACL 2022 • Yuhao Zhang, Hongji Zhu, Yongliang Wang, Nan Xu, Xiaobo Li, Binqiang Zhao

Learning high-quality sentence representations is a fundamental problem of natural language processing which could benefit a wide range of downstream tasks.

Contrastive Learning Semantic Textual Similarity +2

The NiuTrans’s Submission to the IWSLT22 English-to-Chinese Offline Speech Translation Task

no code implementations • IWSLT (ACL) 2022 • Yuhao Zhang, Canan Huang, Chen Xu, Xiaoqian Liu, Bei Li, Anxiang Ma, Tong Xiao, Jingbo Zhu

This paper describes NiuTrans’s submission to the IWSLT22 English-to-Chinese (En-Zh) offline speech translation task.

Machine Translation Translation

The NiuTrans Machine Translation Systems for WMT20

no code implementations • WMT (EMNLP) 2020 • Yuhao Zhang, Ziyang Wang, Runzhe Cao, Binghao Wei, Weiqiao Shan, Shuhan Zhou, Abudurexiti Reheman, Tao Zhou, Xin Zeng, Laohu Wang, Yongyu Mu, Jingnan Zhang, Xiaoqian Liu, Xuanjun Zhou, Yinqiao Li, Bei Li, Tong Xiao, Jingbo Zhu

This paper describes NiuTrans neural machine translation systems of the WMT20 news translation tasks.

Knowledge Distillation Machine Translation +1

Ensuring Safe and High-Quality Outputs: A Guideline Library Approach for Language Models

1 code implementation • 18 Mar 2024 • Yi Luo, Zhenghao Lin, Yuhao Zhang, Jiashuo Sun, Chen Lin, Chengjin Xu, Xiangdong Su, Yelong Shen, Jian Guo, Yeyun Gong

Subsequently, the retrieval model correlates new inputs with relevant guidelines, which guide LLMs in response generation to ensure safe and high-quality outputs, thereby aligning with human values.

Response Generation Retrieval

Verified Training for Counterfactual Explanation Robustness under Data Shift

no code implementations • 6 Mar 2024 • Anna P. Meyer, Yuhao Zhang, Aws Albarghouthi, Loris D'Antoni

Our empirical evaluation demonstrates that VeriTraCER generates CEs that (1) are verifiably robust to small model updates and (2) display competitive robustness to state-of-the-art approaches in handling empirical model updates including random initialization, leave-one-out, and distribution shifts.

counterfactual Counterfactual Explanation

Soft Alignment of Modality Space for End-to-end Speech Translation

no code implementations • 18 Dec 2023 • Yuhao Zhang, Kaiqi Kou, Bei Li, Chen Xu, Chunliang Zhang, Tong Xiao, Jingbo Zhu

End-to-end Speech Translation (ST) aims to convert speech into target text within a unified model.

Cross-Lingual Transfer Translation

DragVideo: Interactive Drag-style Video Editing

1 code implementation • 3 Dec 2023 • Yufan Deng, Ruida Wang, Yuhao Zhang, Yu-Wing Tai, Chi-Keung Tang

The main issues are: 1) how to perform direct and accurate user control in editing; 2) how to execute editings like changing shape, expression, and layout without unsightly distortion and artifacts to the edited content; and 3) how to maintain spatio-temporal consistency of video after editing.

Video Editing Video Generation

Semantic-Aware Frame-Event Fusion based Pattern Recognition via Large Vision-Language Models

1 code implementation • 30 Nov 2023 • Dong Li, Jiandong Jin, Yuhao Zhang, Yanlin Zhong, Yaoyang Wu, Lan Chen, Xiao Wang, Bin Luo

Current methods typically employ backbone networks to individually extract the features of RGB frames and event streams, and subsequently fuse these features for pattern recognition.

Language Modelling Prompt Engineering

TrainerAgent: Customizable and Efficient Model Training through LLM-Powered Multi-Agent System

no code implementations • 11 Nov 2023 • Haoyuan Li, Hao Jiang, Tianke Zhang, Zhelun Yu, Aoxiong Yin, Hao Cheng, Siming Fu, Yuhao Zhang, Wanggui He

We anticipate that our work will contribute to the advancement of research on TrainerAgent in both academic and industry communities, potentially establishing it as a new paradigm for model development in the field of AI.

Decision Making Language Modelling +1

Rethinking and Improving Multi-task Learning for End-to-end Speech Translation

1 code implementation • 7 Nov 2023 • Yuhao Zhang, Chen Xu, Bei Li, Hao Chen, Tong Xiao, Chunliang Zhang, Jingbo Zhu

Significant improvements in end-to-end speech translation (ST) have been achieved through the application of multi-task learning.

Multi-Task Learning

Bridging the Gaps of Both Modality and Language: Synchronous Bilingual CTC for Speech Translation and Speech Recognition

1 code implementation • 21 Sep 2023 • Chen Xu, Xiaoqian Liu, Erfeng He, Yuhao Zhang, Qianqian Dong, Tong Xiao, Jingbo Zhu, Dapeng Man, Wu Yang

In this study, we present synchronous bilingual Connectionist Temporal Classification (CTC), an innovative framework that leverages dual CTC to bridge the gaps of both modality and language in the speech translation (ST) task.

Speech Recognition +1

Channel sensing for holographic interference surfaces based on the principle of interferometry

no code implementations • 20 Aug 2023 • Jindiao Huang, Yuyao Wu, Haifan Yin, Yuhao Zhang, Ruikun Zhang

In this paper, we derive the principles of holographic interference theory for electromagnetic wave reception and transmission, whereby the optical holography is extended to communication holography and a channel sensing architecture for holographic interference surfaces is established.

Generate then Select: Open-ended Visual Question Answering Guided by World Knowledge

no code implementations • 30 May 2023 • Xingyu Fu, Sheng Zhang, Gukyeong Kwon, Pramuditha Perera, Henghui Zhu, Yuhao Zhang, Alexander Hanbo Li, William Yang Wang, Zhiguo Wang, Vittorio Castelli, Patrick Ng, Dan Roth, Bing Xiang

The open-ended Visual Question Answering (VQA) task requires AI models to jointly reason over visual and natural language inputs using world knowledge.

Answer Selection Visual Question Answering +1

CTC-based Non-autoregressive Speech Translation

1 code implementation • 27 May 2023 • Chen Xu, Xiaoqian Liu, Xiaowen Liu, Qingxuan Sun, Yuhao Zhang, Murun Yang, Qianqian Dong, Tom Ko, Mingxuan Wang, Tong Xiao, Anxiang Ma, Jingbo Zhu

Combining end-to-end speech translation (ST) and non-autoregressive (NAR) generation is promising in language and speech processing for their advantages of less error propagation and low latency.

Translation

Bridging the Granularity Gap for Acoustic Modeling

1 code implementation • 27 May 2023 • Chen Xu, Yuhao Zhang, Chengbo Jiao, Xiaoqian Liu, Chi Hu, Xin Zeng, Tong Xiao, Anxiang Ma, Huizhen Wang, Jingbo Zhu

While Transformer has become the de-facto standard for speech, modeling upon the fine-grained frame-level features remains an open challenge of capturing long-distance dependencies and distributing the attention weights.

Speech Recognition

A multi-functional simulation platform for on-demand ride service operations

1 code implementation • 22 Mar 2023 • Siyuan Feng, Taijie Chen, Yuhao Zhang, Jintao Ke, Zhengfei Zheng, Hai Yang

In addition, the existing simulators still face many challenges, ranging from their closeness to real environments of ride-sourcing systems, to the completeness of different tasks they can implement.

Reliability Assurance for Deep Neural Network Architectures Against Numerical Defects

1 code implementation • 13 Feb 2023 • Linyi Li, Yuhao Zhang, Luyao Ren, Yingfei Xiong, Tao Xie

To assure high reliability against numerical defects, in this paper, we propose the RANUM approach including novel techniques for three reliability assurance tasks: detection of potential numerical defects, confirmation of potential-defect feasibility, and suggestion of defect fixes.

PECAN: A Deterministic Certified Defense Against Backdoor Attacks

no code implementations • 27 Jan 2023 • Yuhao Zhang, Aws Albarghouthi, Loris D'Antoni

Neural networks are vulnerable to backdoor poisoning attacks, where the attackers maliciously poison the training set and insert triggers into the test input to change the prediction of the victim model.

backdoor defense Image Classification +1

Tokenization Consistency Matters for Generative Models on Extractive NLP Tasks

1 code implementation • 19 Dec 2022 • Kaiser Sun, Peng Qi, Yuhao Zhang, Lan Liu, William Yang Wang, Zhiheng Huang

We show that, with consistent tokenization, the model performs better in both in-domain and out-of-domain datasets, with a notable average of +1.7 F2 gain when a BART model is trained on SQuAD and evaluated on 8 QA datasets.

Extractive Question-Answering Hallucination +1

Improving Cross-task Generalization of Unified Table-to-text Models with Compositional Task Configurations

no code implementations • 17 Dec 2022 • Jifan Chen, Yuhao Zhang, Lan Liu, Rui Dong, Xinchi Chen, Patrick Ng, William Yang Wang, Zhiheng Huang

There has been great progress in unifying various table-to-text tasks using a single encoder-decoder model trained via multi-task learning (Xie et al., 2022).

Multi-Task Learning

Improving End-to-end Speech Translation by Leveraging Auxiliary Speech and Text Data

no code implementations • 4 Dec 2022 • Yuhao Zhang, Chen Xu, Bojie Hu, Chunliang Zhang, Tong Xiao, Jingbo Zhu

We present a method for introducing a text encoder into pre-trained end-to-end speech translation systems.

Denoising Translation

Overwatch: Learning Patterns in Code Edit Sequences

no code implementations • 25 Jul 2022 • Yuhao Zhang, Yasharth Bajpai, Priyanshu Gupta, Ameya Ketkar, Miltiadis Allamanis, Titus Barik, Sumit Gulwani, Arjun Radhakrishna, Mohammad Raza, Gustavo Soares, Ashish Tiwari

Our experiments show that Overwatch has 78% precision and that Overwatch not only completed edits when developers missed the opportunity to use the IDE tool support but also predicted new edits that have no tool support in the IDE.

Robustar: Interactive Toolbox Supporting Precise Data Annotation for Robust Vision Learning

1 code implementation • 18 Jul 2022 • Chonghan Chen, Haohan Wang, Leyang Hu, Yuhao Zhang, Shuguang Lyu, Jingcheng Wu, Xinnuo Li, Linjing Sun, Eric P. Xing

We introduce the initial release of our software Robustar, which aims to improve the robustness of vision classification machine learning models through a data-driven perspective.

BIG-bench Machine Learning Image Classification

Vertical GaN Diode BV Maximization through Rapid TCAD Simulation and ML-enabled Surrogate Model

no code implementations • 18 Jul 2022 • Albert Lu, Jordan Marshall, Yifan Wang, Ming Xiao, Yuhao Zhang, Hiu Yung Wong

In this paper, two methodologies are used to speed up the maximization of the breakdown voltage (BV) of a vertical GaN diode that has a theoretical maximum BV of ~2100V.

An Ultra-low Power TinyML System for Real-time Visual Processing at Edge

1 code implementation • 11 Jul 2022 • Kunran Xu, Huawei Zhang, Yishi Li, Yuhao Zhang, Rui Lai, Yi Liu

Tiny machine learning (TinyML), executing AI workloads on resource and power strictly restricted systems, is an important and challenging topic.

Object Detection

BagFlip: A Certified Defense against Data Poisoning

1 code implementation • 26 May 2022 • Yuhao Zhang, Aws Albarghouthi, Loris D'Antoni

Machine learning models are vulnerable to data-poisoning attacks, in which an attacker maliciously modifies the training set to change the prediction of a learned model.

Backdoor Attack Data Poisoning +2

Towards Lossless ANN-SNN Conversion under Ultra-Low Latency with Dual-Phase Optimization

1 code implementation • 16 May 2022 • ZiMing Wang, Shuang Lian, Yuhao Zhang, Xiaoxin Cui, Rui Yan, Huajin Tang

By evaluating on challenging datasets including CIFAR-10, CIFAR-100 and ImageNet, the proposed method demonstrates the state-of-the-art performance in terms of accuracy, latency and energy preservation.

Object Detection +1

RadGraph: Extracting Clinical Entities and Relations from Radiology Reports

1 code implementation • 28 Jun 2021 • Saahil Jain, Ashwin Agrawal, Adriel Saporta, Steven QH Truong, Du Nguyen Duong, Tan Bui, Pierre Chambon, Yuhao Zhang, Matthew P. Lungren, Andrew Y. Ng, Curtis P. Langlotz, Pranav Rajpurkar

We release a development dataset, which contains board-certified radiologist annotations for 500 radiology reports from the MIMIC-CXR dataset (14,579 entities and 10,889 relations), and a test dataset, which contains two independent sets of board-certified radiologist annotations for 100 radiology reports split equally across the MIMIC-CXR and CheXpert datasets.

Relation Extraction

Stacked Acoustic-and-Textual Encoding: Integrating the Pre-trained Models into Speech Translation Encoders

no code implementations • ACL 2021 • Chen Xu, Bojie Hu, Yanyang Li, Yuhao Zhang, Shen Huang, Qi Ju, Tong Xiao, Jingbo Zhu

To our knowledge, we are the first to develop an end-to-end ST system that achieves comparable or even better BLEU performance than the cascaded ST counterpart when large-scale ASR and MT data is available.

Automatic Speech Recognition (ASR) +4

Brain Tumors Classification for MR images based on Attention Guided Deep Learning Model

no code implementations • 6 Apr 2021 • Yuhao Zhang, Shuhang Wang, Haoxiang Wu, Kejia Hu, Shufan Ji

In the clinical diagnosis and treatment of brain tumors, manual image reading consumes a lot of energy and time.

General Classification

Certified Robustness to Programmable Transformations in LSTMs

1 code implementation • EMNLP 2021 • Yuhao Zhang, Aws Albarghouthi, Loris D'Antoni

Deep neural networks for natural language processing are fragile in the face of adversarial examples -- small input perturbations, like synonym substitution or word duplication, which cause a neural network to change its prediction.

Improving Factual Completeness and Consistency of Image-to-Text Radiology Report Generation

3 code implementations • NAACL 2021 • Yasuhide Miura, Yuhao Zhang, Emily Bao Tsai, Curtis P. Langlotz, Dan Jurafsky

We further show via a human evaluation and a qualitative analysis that our system leads to generations that are more factually complete and consistent compared to the baselines.

Natural Language Inference Text Generation

Contrastive Learning of Medical Visual Representations from Paired Images and Text

7 code implementations • 2 Oct 2020 • Yuhao Zhang, Hang Jiang, Yasuhide Miura, Christopher D. Manning, Curtis P. Langlotz

Existing work commonly relies on fine-tuning weights transferred from ImageNet pretraining, which is suboptimal due to drastically different image characteristics, or rule-based label extraction from the textual report data paired with medical images, which is inaccurate and hard to generalize.

Contrastive Learning Descriptive +3

Do Syntax Trees Help Pre-trained Transformers Extract Information?

1 code implementation • EACL 2021 • Devendra Singh Sachan, Yuhao Zhang, Peng Qi, William Hamilton

Our empirical analysis demonstrates that these syntax-infused transformers obtain state-of-the-art results on SRL and relation extraction tasks.

Named Entity Recognition +4

Biomedical and Clinical English Model Packages in the Stanza Python NLP Library

5 code implementations • 29 Jul 2020 • Yuhao Zhang, Yuhui Zhang, Peng Qi, Christopher D. Manning, Curtis P. Langlotz

We introduce biomedical and clinical English model packages for the Stanza Python NLP library.

Named Entity Recognition (NER)

Learning Architectures from an Extended Search Space for Language Modeling

no code implementations • ACL 2020 • Yinqiao Li, Chi Hu, Yuhao Zhang, Nuo Xu, Yufan Jiang, Tong Xiao, Jingbo Zhu, Tongran Liu, Changliang Li

Neural architecture search (NAS) has advanced significantly in recent years but most NAS systems restrict search to learning architectures of a recurrent or convolutional cell.

Chunking Language Modelling +4

Stay Hungry, Stay Focused: Generating Informative and Specific Questions in Information-Seeking Conversations

1 code implementation • Findings of the Association for Computational Linguistics 2020 • Peng Qi, Yuhao Zhang, Christopher D. Manning

We investigate the problem of generating informative questions in information-asymmetric conversations.

Informativeness Question Generation +2

Stanza: A Python Natural Language Processing Toolkit for Many Human Languages

5 code implementations • ACL 2020 • Peng Qi, Yuhao Zhang, Yuhui Zhang, Jason Bolton, Christopher D. Manning

We introduce Stanza, an open-source Python natural language processing toolkit supporting 66 human languages.

Dependency Parsing Lemmatization +3

Robustness to Programmable String Transformations via Augmented Abstract Training

1 code implementation • ICML 2020 • Yuhao Zhang, Aws Albarghouthi, Loris D'Antoni

We then present an approach to adversarially training models that are robust to such user-defined string transformations.

Optimizing the Factual Correctness of a Summary: A Study of Summarizing Radiology Reports

no code implementations • ACL 2020 • Yuhao Zhang, Derek Merck, Emily Bao Tsai, Christopher D. Manning, Curtis P. Langlotz

Neural abstractive summarization models are able to generate summaries which have high overlap with human references.

Abstractive Text Summarization Fact Checking +1

The NiuTrans Machine Translation Systems for WMT19

no code implementations • WS 2019 • Bei Li, Yinqiao Li, Chen Xu, Ye Lin, Jiqiang Liu, Hui Liu, Ziyang Wang, Yuhao Zhang, Nuo Xu, Zeyang Wang, Kai Feng, Hexuan Chen, Tengbo Liu, Yanyang Li, Qiang Wang, Tong Xiao, Jingbo Zhu

We participated in 13 translation directions, including 11 supervised tasks, namely EN↔{ZH, DE, RU, KK, LT}, GU→EN and the unsupervised DE↔CS sub-track.

Knowledge Distillation Machine Translation +2

Universal Dependency Parsing from Scratch

1 code implementation • CONLL 2018 • Peng Qi, Timothy Dozat, Yuhao Zhang, Christopher D. Manning

This paper describes Stanford's system at the CoNLL 2018 UD Shared Task.

Ranked #4 on Dependency Parsing on Universal Dependencies

Dependency Parsing POS +3

Graph Convolution over Pruned Dependency Trees Improves Relation Extraction

1 code implementation • EMNLP 2018 • Yuhao Zhang, Peng Qi, Christopher D. Manning

Dependency trees help relation extraction models capture long-range relations between words.

Ranked #5 on Relation Classification on TACRED

Negation Relation +1

Learning to Summarize Radiology Findings

2 code implementations • WS 2018 • Yuhao Zhang, Daisy Yi Ding, Tianpei Qian, Christopher D. Manning, Curtis P. Langlotz

The Impression section of a radiology report summarizes crucial radiology findings in natural language and plays a central role in communicating these findings to physicians.

MULDEF: Multi-model-based Defense Against Adversarial Examples for Neural Networks

no code implementations • 31 Aug 2018 • Siwakorn Srisakaokul, Yuhao Zhang, Zexuan Zhong, Wei Yang, Tao Xie, Bo Li

In particular, given a target model, our framework includes multiple models (constructed from the target model) to form a model family.

Multi-Perspective Relevance Matching with Hierarchical ConvNets for Social Media Search

3 code implementations • 21 May 2018 • Jinfeng Rao, Wei Yang, Yuhao Zhang, Ferhan Ture, Jimmy Lin

To our best knowledge, this paper presents the first substantial work tackling search over social media posts using neural ranking models.

Information Retrieval Retrieval

Cross-type Biomedical Named Entity Recognition with Deep Multi-Task Learning

2 code implementations • 30 Jan 2018 • Xuan Wang, Yu Zhang, Xiang Ren, Yuhao Zhang, Marinka Zitnik, Jingbo Shang, Curtis Langlotz, Jiawei Han

Motivation: State-of-the-art biomedical named entity recognition (BioNER) systems often require handcrafted features specific to each entity type, such as genes, chemicals and diseases.

Feature Engineering Multi-Task Learning +4

Position-aware Attention and Supervised Data Improve Slot Filling

2 code implementations • EMNLP 2017 • Yuhao Zhang, Victor Zhong, Danqi Chen, Gabor Angeli, Christopher D. Manning

The combination of better supervised data and a more appropriate high-capacity model enables much better relation extraction performance.

Ranked #7 on Relation Extraction on Re-TACRED

Knowledge Base Population Knowledge Graphs +5

Segmental Convolutional Neural Networks for Detection of Cardiac Abnormality With Noisy Heart Sound Recordings

no code implementations • 6 Dec 2016 • Yuhao Zhang, Sandeep Ayyar, Long-Huei Chen, Ethan J. Li

Heart diseases constitute a global health burden, and the problem is exacerbated by the error-prone nature of listening to and interpreting heart sounds.

Classification General Classification

Deep Convolutional Network for Handwritten Chinese Character Recognition

1 code implementation • stanford.edu 2015 • Yuhao Zhang

In this project we explored the performance of deep convolutional neural network on recognizing handwritten Chinese characters.
