Search Results for author: Hong Yu

Found 98 papers, 38 papers with code

Identifying Key Concepts from EHR Notes Using Domain Adaptation

no code implementations • WS 2015 • Jiaping Zheng, Hong Yu

Domain Adaptation Information Retrieval +1

Key Concept Identification for Medical Information Retrieval

no code implementations • EMNLP 2015 • Jiaping Zheng, Hong Yu

Information Retrieval Language Modelling +1

Heuristic algorithms for finding distribution reducts in probabilistic rough set model

no code implementations • 22 Dec 2015 • Xi'ao Ma, Guoyin Wang, Hong Yu

This is partly due to the fact that there are no monotonic fitness functions that are used to design heuristic attribute reduction algorithms in probabilistic rough set model.

Attribute

Building an Evaluation Scale using Item Response Theory

no code implementations • EMNLP 2016 • John P. Lalor, Hao Wu, Hong Yu

Evaluation of NLP methods requires testing against a previously vetted gold-standard test set and reporting standard metrics (accuracy/precision/recall/F1).

Natural Language Inference

Bidirectional RNN for Medical Event Detection in Electronic Health Records

no code implementations • NAACL 2016 • Abhyuday N. Jagannatha, Hong Yu

Event Detection Intrusion Detection +1

Bidirectional Recurrent Neural Networks for Medical Event Detection in Electronic Health Records

1 code implementation • 25 Jun 2016 • Abhyuday Jagannatha, Hong Yu

Sequence labeling for extraction of medical events and their attributes from unstructured text in Electronic Health Record (EHR) notes is a key step towards semantic understanding of EHRs.

BIG-bench Machine Learning Event Detection

Learning for Biomedical Information Extraction: Methodological Review of Recent Advances

no code implementations • 26 Jun 2016 • Feifan Liu, Jinying Chen, Abhyuday Jagannatha, Hong Yu

Biomedical information extraction (BioIE) is important to many applications, including clinical decision support, integrative biology, and pharmacovigilance, and therefore it has been an active area of research.

Open Information Extraction

Neural Semantic Encoders

3 code implementations • EACL 2017 • Tsendsuren Munkhdalai, Hong Yu

We present a memory augmented neural network for natural language understanding: Neural Semantic Encoders.

Ranked #18 on Question Answering on WikiQA

General Classification Machine Translation +7

Neural Tree Indexers for Text Understanding

1 code implementation • EACL 2017 • Tsendsuren Munkhdalai, Hong Yu

NTI constructs a full n-ary tree by processing the input text with its node function in a bottom-up fashion.

Ranked #46 on Natural Language Inference on SNLI

Natural Language Inference Sentence +1

Structured prediction models for RNN based sequence labeling in clinical text

1 code implementation • EMNLP 2016 • Abhyuday Jagannatha, Hong Yu

In this work we experimented with various CRF based structured learning models with Recurrent Neural Networks.

named-entity-recognition Named Entity Recognition +2

Reasoning with Memory Augmented Neural Networks for Language Comprehension

no code implementations • 20 Oct 2016 • Tsendsuren Munkhdalai, Hong Yu

Hypothesis testing is an important cognitive process that supports human reasoning.

Reading Comprehension Two-sample testing

Citation Analysis with Neural Attention Models

no code implementations • WS 2016 • Tsendsuren Munkhdalai, John P. Lalor, Hong Yu

Information Retrieval Question Answering +1

Learning to Rank Scientific Documents from the Crowd

no code implementations • 4 Nov 2016 • Jesse M Lingeman, Hong Yu

Finding related published articles is an important task in any science, but with the explosion of new work in the biomedical domain it has become especially challenging.

Document Ranking Learning-To-Rank +1

Ranking medical jargon in electronic health record notes by adapted distant supervision

no code implementations • 14 Nov 2016 • Jinying Chen, Abhyuday N. Jagannatha, Samah J. Jarad, Hong Yu

Methods: We developed an innovative adapted distant supervision (ADS) model based on support vector machines to rank medical jargon from EHRs.

DNN Filter Bank Cepstral Coefficients for Spoofing Detection

no code implementations • 13 Feb 2017 • Hong Yu, Zheng-Hua Tan, Zhanyu Ma, Jun Guo

In order to improve the reliability of speaker verification systems, we develop a new filter bank based cepstral feature, deep neural network filter bank cepstral coefficients (DNN-FBCC), to distinguish between natural and spoofed speech.

Speaker Verification Speech Synthesis

Understanding Deep Learning Performance through an Examination of Test Set Difficulty: A Psychometric Case Study

no code implementations • EMNLP 2018 • John P. Lalor, Hao Wu, Tsendsuren Munkhdalai, Hong Yu

We examine the impact of a test set question's difficulty to determine if there is a relationship between difficulty and performance.

Natural Language Inference Sentiment Analysis

Soft Label Memorization-Generalization for Natural Language Inference

no code implementations • 27 Feb 2017 • John P. Lalor, Hao Wu, Hong Yu

Often when multiple labels are obtained for a training example it is assumed that there is an element of noise that must be accounted for.

Memorization Natural Language Inference

Unsupervised Ensemble Ranking of Terms in Electronic Health Record Notes Based on Their Importance to Patients

no code implementations • 1 Mar 2017 • Jinying Chen, Hong Yu

Objective: The aim of this work was to develop FIT (Finding Important Terms for patients), an unsupervised natural language processing (NLP) system that ranks medical terms in EHR notes based on their importance to patients.

Meta Networks

1 code implementation • ICML 2017 • Tsendsuren Munkhdalai, Hong Yu

Neural networks have been successfully applied in applications with a large amount of labeled data.

Continual Learning Meta-Learning

Sentence Simplification with Memory-Augmented Neural Networks

no code implementations • NAACL 2018 • Tu Vu, Baotian Hu, Tsendsuren Munkhdalai, Hong Yu

Sentence simplification aims to simplify the content and structure of complex sentences, and thus make them easier to interpret for human readers, and easier to process for downstream NLP applications.

Ranked #1 on Text Simplification on PWKP / WikiSmall

Machine Translation Sentence +2

Histogram Transform-based Speaker Identification

no code implementations • 2 Aug 2018 • Zhanyu Ma, Hong Yu

A novel text-independent speaker identification (SI) method is proposed.

Speaker Identification

Deep Neural Network for Analysis of DNA Methylation Data

no code implementations • 2 Aug 2018 • Hong Yu, Zhanyu Ma

Many studies have demonstrated that DNA methylation, which occurs in the context of a CpG, is strongly correlated with diseases, including cancer.

Language Identification with Deep Bottleneck Features

no code implementations • 18 Sep 2018 • Zhanyu Ma, Hong Yu

In order to improve the SLD accuracy of short utterances, a phase vocoder based time-scale modification (TSM) method is used to reduce and increase the speech rate of the test utterance.

Language Identification Transfer Learning

HYPE: A High Performing NLP System for Automatically Detecting Hypoglycemia Events from Electronic Health Record Notes

no code implementations • 29 Nov 2018 • Yonghao Jin, Fei Li, Hong Yu

We used this annotated dataset to train and evaluate HYPE, supervised NLP systems for hypoglycemia detection.

Learning Latent Parameters without Human Response Patterns: Item Response Theory with Artificial Crowds

1 code implementation • IJCNLP 2019 • John P. Lalor, Hao Wu, Hong Yu

We demonstrate a use-case for latent difficulty item parameters, namely training set filtering, and show that using difficulty to sample training data outperforms baseline methods.

Natural Language Inference Sentiment Analysis

Generating Classical Chinese Poems from Vernacular Chinese

1 code implementation • IJCNLP 2019 • Zhichao Yang, Pengshan Cai, Yansong Feng, Fei Li, Weijiang Feng, Elena Suet-Ying Chiu, Hong Yu

According to experiments, our approach significantly improves the perplexity and BLEU compared with typical UMT models.

Cultural Vocal Bursts Intensity Prediction Reinforcement Learning (RL) +2

Large-scale Gastric Cancer Screening and Localization Using Multi-task Deep Neural Network

no code implementations • 9 Oct 2019 • Hong Yu, Xiaofan Zhang, Lingjun Song, Liren Jiang, Xiaodi Huang, Wen Chen, Chenbin Zhang, Jiahui Li, Jiji Yang, Zhiqiang Hu, Qi Duan, Wanyuan Chen, Xianglei He, Jinshuang Fan, Weihai Jiang, Li Zhang, Chengmin Qiu, Minmin Gu, Weiwei Sun, Yangqiong Zhang, Guangyin Peng, Weiwei Shen, Guohui Fu

Gastric cancer is one of the most common cancers, which ranks third among the leading causes of cancer death.

Specificity whole slide images

Bacteria Biotope Relation Extraction via Lexical Chains and Dependency Graphs

no code implementations • WS 2019 • Wuti Xiong, Fei Li, Ming Cheng, Hong Yu, Donghong Ji

In this article, we describe our approach for the Bacteria Biotopes relation extraction (BB-rel) subtask in the BioNLP Shared Task 2019.

graph construction Relation +2

ICD Coding from Clinical Text Using Multi-Filter Residual Convolutional Neural Network

3 code implementations • 25 Nov 2019 • Fei Li, Hong Yu

The innovations of our model are twofold: it utilizes a multi-filter convolutional layer to capture various text patterns with different lengths and a residual convolutional layer to enlarge the receptive field.

Ranked #9 on Medical Code Prediction on MIMIC-III

Medical Code Prediction

MetaMT, a MetaLearning Method Leveraging Multiple Domain Data for Low Resource Machine Translation

no code implementations • 11 Dec 2019 • Rumeng Li, Xun Wang, Hong Yu

Manipulating training data leads to robust neural models for MT.

Machine Translation Translation

Continual Domain-Tuning for Pretrained Language Models

no code implementations • 5 Apr 2020 • Subendhu Rongali, Abhyuday Jagannatha, Bhanu Pratap Singh Rawat, Hong Yu

Pre-trained language models (LM) such as BERT, DistilBERT, and RoBERTa can be tuned for different domains (domain-tuning) by continuing the pre-training phase on a new target domain corpus.

Continual Learning

Calibrating Structured Output Predictors for Natural Language Processing

no code implementations • ACL 2020 • Abhyuday Jagannatha, Hong Yu

Additionally, we show that our calibration method can also be used as an uncertainty-aware, entity-specific decoding step to improve the performance of the underlying model at no additional training cost or data requirements.

named-entity-recognition Named Entity Recognition +3

Neural Data-to-Text Generation with Dynamic Content Planning

no code implementations • 16 Apr 2020 • Kai Chen, Fayuan Li, Baotian Hu, Weihua Peng, Qingcai Chen, Hong Yu

We further design a reconstruction mechanism with a novel objective function that can reconstruct the whole entry of the used data sequentially from the hidden states of the decoder, which aids the accuracy of the generated text.

Data-to-Text Generation

Ontology-based systematic classification and analysis of coronaviruses, hosts, and host-coronavirus interactions towards deep understanding of COVID-19

1 code implementation • 31 May 2020 • Hong Yu, Li Li, Hsin-hui Huang, Yang Wang, Yingtong Liu, Edison Ong, Anthony Huffman, Tao Zeng, Jingsong Zhang, Pengpai Li, Zhiping Liu, Xiangyan Zhang, Xianwei Ye, Samuel K. Handelman, Gerry Higgins, Gilbert S. Omenn, Brian Athey, Junguk Hur, Luonan Chen, Yongqun He

We hypothesized that ontology can be used as an integrative platform to classify and analyze HCI and disease outcomes.

Other Quantitative Biology

Conversational Machine Comprehension: a Literature Review

no code implementations • COLING 2020 • Somil Gupta, Bhanu Pratap Singh Rawat, Hong Yu

Conversational Machine Comprehension (CMC), a research track in conversational AI, expects the machine to understand an open-domain natural language text and thereafter engage in a multi-turn conversation to answer questions related to the text.

Machine Reading Comprehension Natural Language Understanding +1

BENTO: A Visual Platform for Building Clinical NLP Pipelines Based on CodaLab

no code implementations • ACL 2020 • Yonghao Jin, Fei Li, Hong Yu

In addition, the GUI interface enables researchers with limited computer background to compose tools into NLP pipelines and then apply the pipelines on their own datasets in a "what you see is what you get" (WYSIWYG) way.

Management named-entity-recognition +2

Ontology-based annotation and analysis of COVID-19 phenotypes

no code implementations • 5 Aug 2020 • Yang Wang, Fengwei Zhang, Hong Yu, Xianwei Ye, Yongqun He

The commonly occurring 17 phenotypes were classified into different groups based on the Human Phenotype Ontology (HPO).

Conversational Semantic Parsing for Dialog State Tracking

1 code implementation • EMNLP 2020 • Jianpeng Cheng, Devang Agrawal, Hector Martinez Alonso, Shruti Bhargava, Joris Driesen, Federico Flego, Shaona Ghosh, Dain Kaplan, Dimitri Kartsaklis, Lin Li, Dhivya Piraviperumal, Jason D Williams, Hong Yu, Diarmuid O Seaghdha, Anders Johannsen

We consider a new perspective on dialog state tracking (DST), the task of estimating a user's goal through the course of a dialog.

dialog state tracking Semantic Parsing

Dynamic Data Selection for Curriculum Learning via Ability Estimation

no code implementations • Findings of the Association for Computational Linguistics 2020 • John P. Lalor, Hong Yu

Curriculum learning methods typically rely on heuristics to estimate the difficulty of training examples or the ability of the model.

Generating Accurate Electronic Health Assessment from Medical Graph

no code implementations • Findings of the Association for Computational Linguistics 2020 • Zhichao Yang, Hong Yu

One of the fundamental goals of artificial intelligence is to build computer-based expert systems.

Clinical Knowledge

A prognostic dynamic model applicable to infectious diseases providing easily visualized guides -- A case study of COVID-19 in the UK

1 code implementation • 14 Dec 2020 • Yuxuan Zhang, Chen Gong, Dawei Li, Zhi-Wei Wang, Shengda D Pu, Alex W Robertson, Hong Yu, John Parrington

A reasonable prediction of infectious diseases transmission process under different disease control strategies is an important reference point for policy makers.

TransBTS: Multimodal Brain Tumor Segmentation Using Transformer

2 code implementations • 7 Mar 2021 • Wenxuan Wang, Chen Chen, Meng Ding, Jiangyun Li, Hong Yu, Sen Zha

To capture the local 3D context information, the encoder first utilizes 3D CNN to extract the volumetric spatial feature maps.

Brain Tumor Segmentation Image Classification +3

A Dual-Questioning Attention Network for Emotion-Cause Pair Extraction with Context Awareness

1 code implementation • 15 Apr 2021 • Qixuan Sun, Yaqi Yin, Hong Yu

Existing work follows a two-stage pipeline which identifies emotions and causes at the first step and pairs them at the second step.

Emotion Cause Extraction Emotion-Cause Pair Extraction +1

Membership Inference Attack Susceptibility of Clinical Language Models

no code implementations • 16 Apr 2021 • Abhyuday Jagannatha, Bhanu Pratap Singh Rawat, Hong Yu

We show that membership inference attacks on CLMs lead to non-trivial privacy leakages of up to 7%.

Inference Attack Membership Inference Attack

CREAD: Combined Resolution of Ellipses and Anaphora in Dialogues

1 code implementation • NAACL 2021 • Bo-Hsiang Tseng, Shruti Bhargava, Jiarui Lu, Joel Ruben Antony Moniz, Dhivya Piraviperumal, Lin Li, Hong Yu

In this work, we propose a novel joint learning framework of modeling coreference resolution and query rewriting for complex, multi-turn dialogue understanding.

coreference-resolution Dialogue Understanding

Improving Formality Style Transfer with Context-Aware Rule Injection

no code implementations • ACL 2021 • Zonghai Yao, Hong Yu

Models pre-trained on large-scale regular text corpora often do not work well for user-generated data where the language styles differ significantly from the mainstream text.

Formality Style Transfer Sentiment Analysis +1

TransBTSV2: Towards Better and More Efficient Volumetric Segmentation of Medical Images

1 code implementation • 30 Jan 2022 • Jiangyun Li, Wenxuan Wang, Chen Chen, Tianxiang Zhang, Sen Zha, Jing Wang, Hong Yu

Different from TransBTS, the proposed TransBTSV2 is not limited to brain tumor segmentation (BTS) but focuses on general medical image segmentation, providing a stronger and more efficient 3D baseline for volumetric segmentation of medical images.

Brain Tumor Segmentation Image Segmentation +3

Category Guided Attention Network for Brain Tumor Segmentation in MRI

1 code implementation • 29 Mar 2022 • Jiangyun Li, Hong Yu, Chen Chen, Meng Ding, Sen Zha

In this model, we design a Supervised Attention Module (SAM) based on the attention mechanism, which can capture more accurate and stable long-range dependency in feature maps without introducing much computational cost.

Brain Tumor Segmentation Segmentation +1

Attention guided global enhancement and local refinement network for semantic segmentation

1 code implementation • 9 Apr 2022 • Jiangyun Li, Sen Zha, Chen Chen, Meng Ding, Tianxiang Zhang, Hong Yu

First, commonly used upsampling methods in the decoder such as interpolation and deconvolution suffer from a local receptive field, unable to encode global contexts.

Semantic Segmentation

Caption Feature Space Regularization for Audio Captioning

1 code implementation • 18 Apr 2022 • Yiming Zhang, Hong Yu, Ruoyi Du, Zhanyu Ma, Yuan Dong

To eliminate this negative effect, in this paper, we propose a two-stage framework for audio captioning: (i) in the first stage, via the contrastive learning, we construct a proxy feature space to reduce the distances between captions correlated to the same audio, and (ii) in the second stage, the proxy feature space is utilized as additional supervision to encourage the model to be optimized in the direction that benefits all the correlated captions.

Audio captioning Contrastive Learning

ScAN: Suicide Attempt and Ideation Events Dataset

1 code implementation • NAACL 2022 • Bhanu Pratap Singh Rawat, Samuel Kovaly, Wilfred R. Pigeon, Hong Yu

In this study, we first built the Suicide Attempt and Ideation Events (ScAN) dataset, a subset of the publicly available MIMIC III dataset spanning 12k+ EHR notes with 19k+ annotated SA and SI events.

Retrieval

Learning as Conversation: Dialogue Systems Reinforced for Information Acquisition

1 code implementation • NAACL 2022 • Pengshan Cai, Hui Wan, Fei Liu, Mo Yu, Hong Yu, Sachindra Joshi

We propose novel AI-empowered chat bots for learning as conversation where a user does not read a passage but gains information and knowledge through conversation with a teacher bot.

A Simple Meta-learning Paradigm for Zero-shot Intent Classification with Mixture Attention Mechanism

no code implementations • 5 Jun 2022 • Han Liu, Siyang Zhao, Xiaotong Zhang, Feng Zhang, Junjie Sun, Hong Yu, Xianchao Zhang

Zero-shot intent classification is a vital and challenging task in dialogue systems, which aims to deal with numerous fast-emerging unacquainted intents without annotated training data.

Classification intent-classification +4

Label-enhanced Prototypical Network with Contrastive Learning for Multi-label Few-shot Aspect Category Detection

no code implementations • 14 Jun 2022 • Han Liu, Feng Zhang, Xiaotong Zhang, Siyang Zhao, Junjie Sun, Hong Yu, Xianchao Zhang

Multi-label aspect category detection allows a given review sentence to contain multiple aspect categories, which is shown to be more practical in sentiment analysis and attracting increasing attention.

Aspect Category Detection Contrastive Learning +2

Hyperspectral image reconstruction for spectral camera based on ghost imaging via sparsity constraints using V-DUnet

no code implementations • 28 Jun 2022 • Ziyan Chen, Zhentao Liu, Chenyu Hu, Heng Wu, Jianrong Wu, Jinda Lin, Zhishen Tong, Hong Yu, Shensheng Han

When applying deep learning to the GISC spectral camera, several challenges need to be solved: 1) how to deal with the large amount of 3D hyperspectral data, 2) how to reduce the influence caused by the uncertainty of the random reference measurements, and 3) how to improve the reconstructed image quality as far as possible.

Compressive Sensing Image Reconstruction

Advanced Conditional Variational Autoencoders (A-CVAE): Towards interpreting open-domain conversation generation via disentangling latent feature representation

no code implementations • 26 Jul 2022 • Ye Wang, Jingbo Liao, Hong Yu, Guoyin Wang, Xiaoxia Zhang, Li Liu

Particularly, the model integrates the macro-level guided-category knowledge and micro-level open-domain dialogue data for the training, leveraging the priori knowledge into the latent space, which enables the model to disentangle the latent variables within the mesoscopic scale.

Disentanglement

Extracting Biomedical Factual Knowledge Using Pretrained Language Model and Electronic Health Record Context

no code implementations • 26 Aug 2022 • Zonghai Yao, Yi Cao, Zhichao Yang, Vijeta Deshpande, Hong Yu

In order to make LMs as KBs more in line with the actual application scenarios of the biomedical domain, we specifically add EHR notes as context to the prompt to improve the lower bound in the biomedical domain.

Language Modelling

Knowledge Injected Prompt Based Fine-tuning for Multi-label Few-shot ICD Coding

1 code implementation • 7 Oct 2022 • Zhichao Yang, Shufan Wang, Bhanu Pratap Singh Rawat, Avijit Mitra, Hong Yu

Automatic International Classification of Diseases (ICD) coding aims to assign multiple ICD codes to a medical note with an average length of 3,000+ tokens.

Ranked #1 on Medical Code Prediction on MIMIC-III

Contrastive Learning Medical Code Prediction

MedJEx: A Medical Jargon Extraction Model with Wiki's Hyperlink Span and Contextualized Masked Language Model Score

1 code implementation • 12 Oct 2022 • Sunjae Kwon, Zonghai Yao, Harmon S. Jordan, David A. Levy, Brian Corner, Hong Yu

We first present a novel and publicly available dataset with expert-annotated medical jargon terms from 18K+ EHR note sentences (MedJ).

Language Modelling named-entity-recognition +2

Context Variance Evaluation of Pretrained Language Models for Prompt-based Biomedical Knowledge Probing

no code implementations • 18 Nov 2022 • Zonghai Yao, Yi Cao, Zhichao Yang, Hong Yu

Different from the previous known-unknown evaluation criteria, we propose the concept of "Misunderstand" in LAMA for the first time.

Knowledge Probing

Multi-label Few-shot ICD Coding as Autoregressive Generation with Prompt

1 code implementation • 24 Nov 2022 • Zhichao Yang, Sunjae Kwon, Zonghai Yao, Hong Yu

This task is challenging due to the high-dimensional space of multi-label assignment (155,000+ ICD code candidates) and the long-tail challenge: many ICD codes are infrequently assigned, yet infrequent ICD codes are important clinically.

Multi-Label Classification

An Automatic SOAP Classification System Using Weakly Supervision And Transfer Learning

no code implementations • 26 Nov 2022 • Sunjae Kwon, Zhichao Yang, Hong Yu

The transfer learning framework helps SOAP classification model's inter-hospital migration with a minimal size of the manually annotated dataset.

Classification Language Modelling +1

Automated Identification of Eviction Status from Electronic Health Record Notes

1 code implementation • 6 Dec 2022 • Zonghai Yao, Jack Tsai, Weisong Liu, David A. Levy, Emily Druhl, Joel I Reisman, Hong Yu

Materials and Methods: We first defined eviction status (eviction presence and eviction period) and then annotated eviction status in 5000 EHR notes from the Veterans Health Administration (VHA).

Associations Between Natural Language Processing (NLP) Enriched Social Determinants of Health and Suicide Death among US Veterans

1 code implementation • 11 Dec 2022 • Avijit Mitra, Richeek Pradhan, Rachel D Melamed, Kun Chen, David C Hoaglin, Katherine L Tucker, Joel I Reisman, Zhichao Yang, Weisong Liu, Jack Tsai, Hong Yu

All SDOH, measured by structured data and NLP, were significantly associated with increased risk of suicide.

Enhancing the prediction of disease outcomes using electronic health records and pretrained deep learning models

no code implementations • 22 Dec 2022 • Zhichao Yang, Weisong Liu, Dan Berlowitz, Hong Yu

Question: Can an encoder-decoder architecture pretrained on a large dataset of longitudinal electronic health records improve patient outcome predictions?

Denoising

Boosting Few-Shot Text Classification via Distribution Estimation

no code implementations • 26 Mar 2023 • Han Liu, Feng Zhang, Xiaotong Zhang, Siyang Zhao, Fenglong Ma, Xiao-Ming Wu, Hongyang Chen, Hong Yu, Xianchao Zhang

Distribution estimation has been demonstrated as one of the most effective approaches in dealing with few-shot image classification, as the low-level patterns and underlying representations can be easily transferred across different tasks in computer vision domain.

Few-Shot Image Classification Few-Shot Text Classification +1

Vision Meets Definitions: Unsupervised Visual Word Sense Disambiguation Incorporating Gloss Information

1 code implementation • 2 May 2023 • Sunjae Kwon, Rishabh Garodia, Minhwa Lee, Zhichao Yang, Hong Yu

Specifically, we suggest employing Bayesian inference to incorporate the sense definitions when sense information of the answer is not provided.

Bayesian Inference Image-text matching +2

Revisiting the Architectures like Pointer Networks to Efficiently Improve the Next Word Distribution, Summarization Factuality, and Beyond

1 code implementation • 20 May 2023 • Haw-Shiuan Chang, Zonghai Yao, Alolika Gon, Hong Yu, Andrew McCallum

Is the output softmax layer, which is adopted by most language models (LMs), always the best way to compute the next word probability?

5IDER: Unified Query Rewriting for Steering, Intent Carryover, Disfluencies, Entity Carryover and Repair

no code implementations • 2 Jun 2023 • Jiarui Lu, Bo-Hsiang Tseng, Joel Ruben Antony Moniz, Site Li, Xueyun Zhu, Hong Yu, Murat Akbacak

Providing voice assistants the ability to navigate multi-turn conversations is a challenging problem.

Navigate

Referring to Screen Texts with Voice Assistants

no code implementations • 10 Jun 2023 • Shruti Bhargava, Anand Dhoot, Ing-Marie Jonsson, Hoang Long Nguyen, Alkesh Patel, Hong Yu, Vincent Renkens

We collect a dataset and propose a lightweight general-purpose model for this novel experience.

Navigate Visual Grounding

UMASS_BioNLP at MEDIQA-Chat 2023: Can LLMs generate high-quality synthetic note-oriented doctor-patient conversations?

1 code implementation • 29 Jun 2023 • Junda Wang, Zonghai Yao, Avijit Mitra, Samuel Osebe, Zhichao Yang, Hong Yu

This paper presents the UMASS_BioNLP team's participation in the MEDIQA-Chat 2023 shared task for Task-A and Task-C. We focus especially on Task-C and propose a novel LLM cooperation system, named a doctor-patient loop, to generate high-quality conversation data sets.

ODD: A Benchmark Dataset for the Natural Language Processing based Opioid Related Aberrant Behavior Detection

1 code implementation • 5 Jul 2023 • Sunjae Kwon, Xun Wang, Weisong Liu, Emily Druhl, Minhee L. Sung, Joel I. Reisman, Wenjun Li, Robert D. Kerns, William Becker, Hong Yu

Experimental results show that the prompt-tuning models outperformed the fine-tuning models in most categories and the gains were especially higher among uncommon categories (Suggested Aberrant Behavior, Confirmed Aberrant Behaviors, Diagnosed Opioid Dependence, and Medication Change).

Early Prediction of Alzheimers Disease Leveraging Symptom Occurrences from Longitudinal Electronic Health Records of US Military Veterans

no code implementations • 23 Jul 2023 • Rumeng Li, Xun Wang, Dan Berlowitz, Brian Silver, Wen Hu, Heather Keating, Raelene Goodwin, Weisong Liu, Honghuang Lin, Hong Yu

We used a panel of AD-related keywords and their occurrences over time in a patient's longitudinal EHRs as predictors for AD prediction with four machine learning models.

Mental-LLM: Leveraging Large Language Models for Mental Health Prediction via Online Text Data

1 code implementation • 26 Jul 2023 • Xuhai Xu, Bingsheng Yao, Yuanzhe Dong, Saadia Gabriel, Hong Yu, James Hendler, Marzyeh Ghassemi, Anind K. Dey, Dakuo Wang

More importantly, our experiments show that instruction finetuning can significantly boost the performance of LLMs for all tasks simultaneously.

Language Modelling

Intelligent Assistant Language Understanding On Device

no code implementations • 7 Aug 2023 • Cecilia Aas, Hisham Abdelsalam, Irina Belousova, Shruti Bhargava, Jianpeng Cheng, Robert Daland, Joris Driesen, Federico Flego, Tristan Guigue, Anders Johannsen, Partha Lal, Jiarui Lu, Joel Ruben Antony Moniz, Nathan Perkins, Dhivya Piraviperumal, Stephen Pulman, Diarmuid Ó Séaghdha, David Q. Sun, John Torr, Marco Del Vecchio, Jay Wacker, Jason D. Williams, Hong Yu

It has recently become feasible to run personal digital assistants on phones and other personal devices.

Natural Language Understanding

PaniniQA: Enhancing Patient Education Through Interactive Question Answering

1 code implementation • 7 Aug 2023 • Pengshan Cai, Zonghai Yao, Fei Liu, Dakuo Wang, Meghan Reilly, Huixue Zhou, Lingxi Li, Yi Cao, Alok Kapoor, Adarsha Bajracharya, Dan Berlowitz, Hong Yu

Patient portal allows discharged patients to access their personalized discharge instructions in electronic health records (EHRs).

Question Answering

NoteChat: A Dataset of Synthetic Doctor-Patient Conversations Conditioned on Clinical Notes

1 code implementation • 24 Oct 2023 • Junda Wang, Zonghai Yao, Zhichao Yang, Huixue Zhou, Rumeng Li, Xun Wang, Yucheng Xu, Hong Yu

We introduce NoteChat, a novel cooperative multi-agent framework leveraging Large Language Models (LLMs) to generate patient-physician dialogues.

Dialogue Generation

STEER: Semantic Turn Extension-Expansion Recognition for Voice Assistants

no code implementations • 25 Oct 2023 • Leon Liyang Zhang, Jiarui Lu, Joel Ruben Antony Moniz, Aditya Kulkarni, Dhivya Piraviperumal, Tien Dung Tran, Nicholas Tzou, Hong Yu

In the context of a voice assistant system, steering refers to the phenomenon in which a user issues a follow-up command attempting to direct or clarify a previous turn.

Sentence

Boosting Decision-Based Black-Box Adversarial Attack with Gradient Priors

no code implementations • 29 Oct 2023 • Han Liu, Xingshuo Huang, Xiaotong Zhang, Qimai Li, Fenglong Ma, Wei Wang, Hongyang Chen, Hong Yu, Xianchao Zhang

Decision-based methods have been shown to be effective in black-box adversarial attacks, as they can obtain satisfactory performance and only require access to the final model prediction.

Adversarial Attack

Paper
Add Code

BioInstruct: Instruction Tuning of Large Language Models for Biomedical Natural Language Processing

no code implementations • 30 Oct 2023 • Hieu Tran, Zhichao Yang, Zonghai Yao, Hong Yu

We also examined whether categories (e.g., QA, IE, and generation) of instructions impact model performance.

Language Modelling Multi-Task Learning +2

Paper
Add Code

EHRTutor: Enhancing Patient Understanding of Discharge Instructions

no code implementations • 30 Oct 2023 • Zihao Zhang, Zonghai Yao, Huixue Zhou, Feiyun Ouyang, Hong Yu

This paper presents EHRTutor, an innovative multi-component framework leveraging the Large Language Model (LLM) for patient education through conversational question-answering.

Conversational Question Answering Language Modelling +1

Paper
Add Code

Synthetic Imitation Edit Feedback for Factual Alignment in Clinical Summarization

1 code implementation • 30 Oct 2023 • Prakamya Mishra, Zonghai Yao, Shuwei Chen, Beining Wang, Rohan Mittal, Hong Yu

In this work, we propose a new pipeline using ChatGPT instead of human experts to generate high-quality feedback data for improving factual consistency in the clinical note summarization task.

Hallucination

Paper
Code

MARRS: Multimodal Reference Resolution System

no code implementations • 3 Nov 2023 • Halim Cagri Ates, Shruti Bhargava, Site Li, Jiarui Lu, Siddhardha Maddula, Joel Ruben Antony Moniz, Anil Kumar Nalamalapu, Roman Hoang Nguyen, Melis Ozyildirim, Alkesh Patel, Dhivya Piraviperumal, Vincent Renkens, Ankit Samal, Thy Tran, Bo-Hsiang Tseng, Hong Yu, Yuan Zhang, Rong Zou

Successfully handling context is essential for any dialog understanding task.

Natural Language Understanding

Paper
Add Code

SELF-EXPLAIN: Teaching Large Language Models to Reason Complex Questions by Themselves

no code implementations • 12 Nov 2023 • Jiachen Zhao, Zonghai Yao, Zhichao Yang, Hong Yu

Large language models (LLMs) can generate intermediate reasoning steps.

Question Answering Retrieval +1

Paper
Add Code

Do Physicians Know How to Prompt? The Need for Automatic Prompt Optimization Help in Clinical Note Generation

no code implementations • 16 Nov 2023 • Zonghai Yao, Ahmed Jaafar, Beining Wang, Zhichao Yang, Hong Yu

We recommend a two-phase optimization process, leveraging APO-GPT4 for consistency and expert input for personalization.

Prompt Engineering

Paper
Add Code

Two Directions for Clinical Data Generation with Large Language Models: Data-to-Label and Label-to-Data

no code implementations • 9 Dec 2023 • Rumeng Li, Xun Wang, Hong Yu

We train a system to detect AD-related signs and symptoms from EHRs, using three datasets: (1) a gold dataset annotated by human experts on longitudinal EHRs of AD patients; (2) a silver dataset created by the data-to-label method; and (3) a bronze dataset created by the label-to-data method.

Paper
Add Code

README: Bridging Medical Jargon and Lay Understanding for Patient Education through Data-Centric NLP

1 code implementation • 24 Dec 2023 • Zonghai Yao, Nandyala Siddharth Kantu, Guanghao Wei, Hieu Tran, Zhangqi Duan, Sunjae Kwon, Zhichao Yang, README annotation team, Hong Yu

The advancement in healthcare has shifted focus toward patient-centric approaches, particularly in self-care and patient education, facilitated by access to Electronic Health Records (EHR).

Paper
Code

EHR Interaction Between Patients and AI: NoteAid EHR Interaction

no code implementations • 29 Dec 2023 • Xiaocheng Zhang, Zonghai Yao, Hong Yu

Through a comprehensive evaluation of the entire dataset using LLM assessment and a rigorous manual evaluation of 64 instances, we showcase the potential of LLMs in patient education.

Paper
Add Code

Can Large Language Models Understand Context?

no code implementations • 1 Feb 2024 • Yilun Zhu, Joel Ruben Antony Moniz, Shruti Bhargava, Jiarui Lu, Dhivya Piraviperumal, Site Li, Yuan Zhang, Hong Yu, Bo-Hsiang Tseng

Understanding context is key to understanding human language, an ability which Large Language Models (LLMs) have been increasingly seen to demonstrate to an impressive extent.

In-Context Learning Quantization

Paper
Add Code

HQA-Attack: Toward High Quality Black-Box Hard-Label Adversarial Attack on Text

1 code implementation • NeurIPS 2023 • Han Liu, Zhi Xu, Xiaotong Zhang, Feng Zhang, Fenglong Ma, Hongyang Chen, Hong Yu, Xianchao Zhang

Black-box hard-label adversarial attack on text is a practical and challenging task, as the text data space is inherently discrete and non-differentiable, and only the predicted label is accessible.

Adversarial Attack Hard-label Attack +5

Paper
Code

SynthDST: Synthetic Data is All You Need for Few-Shot Dialog State Tracking

no code implementations • 3 Feb 2024 • Atharva Kulkarni, Bo-Hsiang Tseng, Joel Ruben Antony Moniz, Dhivya Piraviperumal, Hong Yu, Shruti Bhargava

Remarkably, our few-shot learning approach recovers nearly 98% of the performance compared to the few-shot setup using human-annotated training data.

dialog state tracking Few-Shot Learning +2

Paper
Add Code

LocalTweets to LocalHealth: A Mental Health Surveillance Framework Based on Twitter Data

no code implementations • 21 Feb 2024 • Vijeta Deshpande, Minhwa Lee, Zonghai Yao, Zihao Zhang, Jason Brian Gibbons, Hong Yu

Prior research on Twitter (now X) data has provided positive evidence of its utility in developing supplementary health surveillance systems.

Paper
Add Code

SYNFAC-EDIT: Synthetic Imitation Edit Feedback for Factual Alignment in Clinical Summarization

1 code implementation • 21 Feb 2024 • Prakamya Mishra, Zonghai Yao, Parth Vashisht, Feiyun Ouyang, Beining Wang, Vidhi Dhaval Mody, Hong Yu

Large Language Models (LLMs) such as GPT & Llama have demonstrated significant achievements in summarization tasks but struggle with factual inaccuracies, a critical issue in clinical NLP applications where errors could lead to serious consequences.

Paper
Code

JMLR: Joint Medical LLM and Retrieval Training for Enhancing Reasoning and Professional Question Answering Capability

no code implementations • 27 Feb 2024 • Junda Wang, Zhichao Yang, Zonghai Yao, Hong Yu

Unlike previous RAG methods, where the retrieval model was trained separately from the LLM, we introduce JMLR (Joint Medical LLM and Retrieval Training), which jointly trains the LLM and the information retrieval (IR) model during the fine-tuning phase.

Information Retrieval Question Answering +1

Paper
Add Code

ClinicalMamba: A Generative Clinical Language Model on Longitudinal Clinical Notes

1 code implementation • 9 Mar 2024 • Zhichao Yang, Avijit Mitra, Sunjae Kwon, Hong Yu

The advancement of natural language processing (NLP) systems in healthcare hinges on language model ability to interpret the intricate information contained within clinical notes.

Few-Shot Learning Language Modelling

Paper
Code

ReALM: Reference Resolution As Language Modeling

no code implementations • 29 Mar 2024 • Joel Ruben Antony Moniz, Soundarya Krishnan, Melis Ozyildirim, Prathamesh Saraf, Halim Cagri Ates, Yuan Zhang, Hong Yu, Nidhi Rajshree

Reference resolution is an important problem, one that is essential to understand and successfully handle context of different kinds.

Language Modelling

Paper
Add Code

Generation of Patient After-Visit Summaries to Support Physicians

1 code implementation • COLING 2022 • Pengshan Cai, Fei Liu, Adarsha Bajracharya, Joe Sills, Alok Kapoor, Weisong Liu, Dan Berlowitz, David Levy, Richeek Pradhan, Hong Yu

Crucially, we introduce a feedback mechanism that alerts physicians when an automatic summary fails to capture the important details of the clinical notes or when it contains hallucinated facts that are potentially detrimental to the summary quality.

Management

Paper
Code
