Search Results for author: Hua Xu

Found 71 papers, 30 papers with code

Conversational Bots for Psychotherapy: A Study of Generative Transformer Models Using Domain-specific Dialogues

no code implementations BioNLP (ACL) 2022 Avisha Das, Salih Selek, Alia R. Warner, Xu Zuo, Yan Hu, Vipina Kuttichi Keloth, Jianfu Li, W. Jim Zheng, Hua Xu

Through quantitative evaluation of the linguistic quality, we observe that the dialog generation model - DialoGPT (345M) with transfer learning on video data attains scores similar to a human response baseline.

Response Generation Transfer Learning

Map2Text: New Content Generation from Low-Dimensional Visualizations

no code implementations24 Dec 2024 Xingjian Zhang, Ziyang Xiong, Shixuan Liu, Yutong Xie, Tolga Ergen, Dongsub Shim, Hua Xu, Honglak Lee, Qiaozhu Me

Low-dimensional visualizations, or "projection maps" of datasets, are widely used across scientific research and creative industries as effective tools for interpreting large-scale and complex information.

Navigate

Align Anything: Training All-Modality Models to Follow Instructions with Language Feedback

1 code implementation20 Dec 2024 Jiaming Ji, Jiayi Zhou, Hantao Lou, Boyuan Chen, Donghai Hong, Xuyao Wang, Wenqi Chen, Kaile Wang, Rui Pan, Jiahao Li, Mohan Wang, Josef Dai, Tianyi Qiu, Hua Xu, Dong Li, WeiPeng Chen, Jun Song, Bo Zheng, Yaodong Yang

In this work, we make the first attempt to fine-tune all-modality models (i. e. input and output with any modality, also named any-to-any models) using human preference data across all modalities (including text, image, audio, and video), ensuring its behavior aligns with human intentions.

Instruction Following

Bridging AI and Science: Implications from a Large-Scale Literature Analysis of AI4Science

no code implementations27 Nov 2024 Yutong Xie, Yijun Pan, Hua Xu, Qiaozhu Mei

Artificial Intelligence has proven to be a transformative tool for advancing scientific research across a wide range of disciplines.

Link Prediction scientific discovery

A Comparative Study of Recent Large Language Models on Generating Hospital Discharge Summaries for Lung Cancer Patients

no code implementations6 Nov 2024 Yiming Li, Fang Li, Kirk Roberts, Licong Cui, Cui Tao, Hua Xu

Evaluation metrics included token-level analysis (BLEU, ROUGE-1, ROUGE-2, ROUGE-L) and semantic similarity scores between model-generated summaries and physician-written gold standards.

Semantic Similarity Semantic Textual Similarity

GPTON: Generative Pre-trained Transformers enhanced with Ontology Narration for accurate annotation of biological data

no code implementations12 Oct 2024 Rongbin Li, Wenbo Chen, Jinbo Li, Hanwen Xing, Hua Xu, Zhao Li, W. Jim Zheng

By leveraging GPT-4 for ontology narration, we developed GPTON to infuse structured knowledge into LLMs through verbalized ontology terms, achieving accurate text and ontology annotations for over 68% of gene sets in the top five predictions.

A practical applicable quantum-classical hybrid ant colony algorithm for the NISQ era

no code implementations8 Oct 2024 Qian Qiu, Liang Zhang, Mohan Wu, Qichun Sun, Xiaogang Li, Da-Chuang Li, Hua Xu

Quantum ant colony optimization (QACO) has drew much attention since it combines the advantages of quantum computing and ant colony optimization (ACO) algorithm overcoming some limitations of the traditional ACO algorithm.

Clustering

Language Enhanced Model for Eye (LEME): An Open-Source Ophthalmology-Specific Large Language Model

no code implementations1 Oct 2024 Aidan Gilson, Xuguang Ai, Qianqian Xie, Sahana Srinivasan, Krithi Pushpanathan, Maxwell B. Singer, Jimin Huang, Hyunjae Kim, Erping Long, Peixing Wan, Luciano V. Del Priore, Lucila Ohno-Machado, Hua Xu, Dianbo Liu, Ron A. Adelman, Yih-Chung Tham, Qingyu Chen

In external validations, LEME excelled in long-form QA with a Rouge-L of 0. 19 (all p<0. 0001), ranked second in MCQ accuracy (0. 68; all p<0. 0001), and scored highest in EHR summarization and clinical QA (ranging from 4. 24 to 4. 83 out of 5 for correctness, completeness, and readability).

Language Modeling Language Modelling +2

NeuralQP: A General Hypergraph-based Optimization Framework for Large-scale QCQPs

no code implementations28 Sep 2024 Zhixiao Xiong, Fangyu Zong, Huigen Ye, Hua Xu

Machine Learning (ML) optimization frameworks have gained attention for their ability to accelerate the optimization of large-scale Quadratically Constrained Quadratic Programs (QCQPs) by learning shared problem structures.

Suicide Phenotyping from Clinical Notes in Safety-Net Psychiatric Hospital Using Multi-Label Classification with Pre-Trained Language Models

no code implementations27 Sep 2024 Zehan Li, Yan Hu, Scott Lane, Salih Selek, Lokesh Shahani, Rodrigo Machado-Vieira, Jair Soares, Hua Xu, Hongfang Liu, Ming Huang

We evaluated the performance of four BERT-based models using two fine-tuning strategies (multiple single-label and single multi-label) for detecting coexisting suicidal events from 500 annotated psychiatric evaluation notes.

Classification Model Optimization +2

Quantum Long Short-Term Memory for Drug Discovery

no code implementations29 Jul 2024 Liang Zhang, Yin Xu, Mohan Wu, Liang Wang, Hua Xu

Quantum computing combined with machine learning (ML) is an extremely promising research area, with numerous studies demonstrating that quantum machine learning (QML) is expected to solve scientific problems more effectively than classical ML.

Drug Discovery Quantum Machine Learning

Generative AI for Health Technology Assessment: Opportunities, Challenges, and Policy Considerations

no code implementations9 Jul 2024 Rachael Fleurence, Jiang Bian, Xiaoyan Wang, Hua Xu, Dalia Dawoud, Mitch Higashi, Jagpreet Chhatwal

This review introduces the transformative potential of generative Artificial Intelligence (AI) and foundation models, including large language models (LLMs), for health technology assessment (HTA).

Improving Entity Recognition Using Ensembles of Deep Learning and Fine-tuned Large Language Models: A Case Study on Adverse Event Extraction from Multiple Sources

no code implementations26 Jun 2024 Yiming Li, Deepthi Viswaroopan, William He, Jianfu Li, Xu Zuo, Hua Xu, Cui Tao

This study aims to evaluate the effectiveness of LLMs and traditional deep learning models in AE extraction, and to assess the impact of ensembling these models on performance.

Deep Learning Event Extraction +3

Geneverse: A collection of Open-source Multimodal Large Language Models for Genomic and Proteomic Research

1 code implementation21 Jun 2024 Tianyu Liu, Yijia Xiao, Xiao Luo, Hua Xu, W. Jim Zheng, Hongyu Zhao

The applications of large language models (LLMs) are promising for biomedical and healthcare research.

CancerLLM: A Large Language Model in Cancer Domain

no code implementations15 Jun 2024 Mingchen Li, Jiatan Huang, Jeremy Yeung, Anne Blaes, Steven Johnson, Hongfang Liu, Hua Xu, Rui Zhang

Medical Large Language Models (LLMs) such as ClinicalCamel 70B, Llama3-OpenBioLLM 70B have demonstrated impressive performance on a wide variety of medical NLP task. However, there still lacks a large language model (LLM) specifically designed for cancer domain.

Language Modeling Language Modelling +1

Augmenting Biomedical Named Entity Recognition with General-domain Resources

1 code implementation15 Jun 2024 Yu Yin, Hyunjae Kim, Xiao Xiao, Chih Hsuan Wei, Jaewoo Kang, Zhiyong Lu, Hua Xu, Meng Fang, Qingyu Chen

Specifically, our models consistently outperformed the baseline models in six out of eight entity types, achieving an average improvement of 0. 9% over the best baseline performance across eight entities.

Language Modelling Multi-Task Learning +3

XL3M: A Training-free Framework for LLM Length Extension Based on Segment-wise Inference

no code implementations28 May 2024 Shengnan Wang, Youhui Bai, Lin Zhang, Pingyi Zhou, Shixiong Zhao, Gong Zhang, Sen Wang, Renhai Chen, Hua Xu, Hongwei Sun

Under the XL3M framework, the input context will be firstly decomposed into multiple short sub-contexts, where each sub-context contains an independent segment and a common ``question'' which is a few tokens from the end of the original context.

Language Modeling Language Modelling +1

Unsupervised Multimodal Clustering for Semantics Discovery in Multimodal Utterances

1 code implementation21 May 2024 Hanlei Zhang, Hua Xu, Fei Long, Xin Wang, Kai Gao

UMC shows remarkable improvements of 2-6\% scores in clustering metrics over state-of-the-art methods, marking the first successful endeavor in this domain.

Clustering Representation Learning

BiomedRAG: A Retrieval Augmented Large Language Model for Biomedicine

1 code implementation1 May 2024 Mingchen Li, Halil Kilicoglu, Hua Xu, Rui Zhang

Large Language Models (LLMs) have swiftly emerged as vital resources for different applications in the biomedical and healthcare domains; however, these models encounter issues such as generating inaccurate information or hallucinations.

Language Modeling Language Modelling +7

Relation Extraction Using Large Language Models: A Case Study on Acupuncture Point Locations

no code implementations8 Apr 2024 Yiming Li, Xueqing Peng, Jianfu Li, Xu Zuo, Suyuan Peng, Donghong Pei, Cui Tao, Hua Xu, Na Hong

This study underscores the effectiveness of LLMs like GPT in extracting relations related to acupoint locations, with implications for accurately modeling acupuncture knowledge and promoting standard implementation in acupuncture training and practice.

Relation Relation Extraction

MIntRec2.0: A Large-scale Benchmark Dataset for Multimodal Intent Recognition and Out-of-scope Detection in Conversations

1 code implementation16 Mar 2024 Hanlei Zhang, Xin Wang, Hua Xu, Qianrui Zhou, Kai Gao, Jianhua Su, jinyue Zhao, Wenrui Li, Yanting Chen

We believe that MIntRec2. 0 will serve as a valuable resource, providing a pioneering foundation for research in human-machine conversational interactions, and significantly facilitating related applications.

Multimodal Intent Recognition

Me LLaMA: Foundation Large Language Models for Medical Applications

1 code implementation20 Feb 2024 Qianqian Xie, Qingyu Chen, Aokun Chen, Cheng Peng, Yan Hu, Fongci Lin, Xueqing Peng, Jimin Huang, Jeffrey Zhang, Vipina Keloth, Xinyu Zhou, Lingfei Qian, Huan He, Dennis Shung, Lucila Ohno-Machado, Yonghui Wu, Hua Xu, Jiang Bian

This work underscores the importance of domain-specific data in developing medical LLMs and addresses the high computational costs involved in training, highlighting a balance between pre-training and fine-tuning strategies.

Few-Shot Learning

A Span-based Model for Extracting Overlapping PICO Entities from RCT Publications

no code implementations8 Jan 2024 Gongbo Zhang, Yiliang Zhou, Yan Hu, Hua Xu, Chunhua Weng, Yifan Peng

On the PICO-Corpus, PICOX obtained higher recall and F1 scores than the baseline and improved the micro recall score from 56. 66 to 67. 33.

Data Augmentation PICO

Token-Level Contrastive Learning with Modality-Aware Prompting for Multimodal Intent Recognition

1 code implementation22 Dec 2023 Qianrui Zhou, Hua Xu, Hao Li, Hanlei Zhang, Xiaohan Zhang, Yifan Wang, Kai Gao

To establish an optimal multimodal semantic environment for text modality, we develop a modality-aware prompting module (MAP), which effectively aligns and fuses features from text, video and audio modalities with similarity-based modality alignment and cross-modality attention mechanism.

Contrastive Learning Multimodal Intent Recognition

AI Alignment: A Comprehensive Survey

no code implementations30 Oct 2023 Jiaming Ji, Tianyi Qiu, Boyuan Chen, Borong Zhang, Hantao Lou, Kaile Wang, Yawen Duan, Zhonghao He, Jiayi Zhou, Zhaowei Zhang, Fanzhi Zeng, Kwan Yee Ng, Juntao Dai, Xuehai Pan, Aidan O'Gara, Yingshan Lei, Hua Xu, Brian Tse, Jie Fu, Stephen Mcaleer, Yaodong Yang, Yizhou Wang, Song-Chun Zhu, Yike Guo, Wen Gao

The former aims to make AI systems aligned via alignment training, while the latter aims to gain evidence about the systems' alignment and govern them appropriately to avoid exacerbating misalignment risks.

Survey

Identifying and Extracting Rare Disease Phenotypes with Large Language Models

1 code implementation22 Jun 2023 Cathy Shyr, Yan Hu, Paul A. Harris, Hua Xu

Despite this, ChatGPT achieved similar or higher accuracy for certain entities (i. e., rare diseases and signs) in the one-shot setting (F1 of 0. 776 and 0. 725).

Language Modelling Large Language Model +4

A Clustering Framework for Unsupervised and Semi-supervised New Intent Discovery

1 code implementation16 Apr 2023 Hanlei Zhang, Hua Xu, Xin Wang, Fei Long, Kai Gao

New intent discovery is of great value to natural language processing, allowing for a better understanding of user needs and providing friendly services.

Clustering Intent Discovery +3

Adaptive Constraint Partition based Optimization Framework for Large-scale Integer Linear Programming(Student Abstract)

no code implementations18 Nov 2022 Huigen Ye, Hongyan Wang, Hua Xu, Chengming Wang, Yu Jiang

Integer programming problems (IPs) are challenging to be solved efficiently due to the NP-hardness, especially for large-scale IPs.

A Self-Adjusting Fusion Representation Learning Model for Unaligned Text-Audio Sequences

no code implementations12 Nov 2022 Kaicheng Yang, Ruxuan Zhang, Hua Xu, Kai Gao

In this paper, a Self-Adjusting Fusion Representation Learning Model (SA-FRLM) is proposed to learn robust crossmodal fusion representations directly from the unaligned text and audio sequences.

Multimodal Sentiment Analysis Representation Learning

Make Acoustic and Visual Cues Matter: CH-SIMS v2.0 Dataset and AV-Mixup Consistent Module

1 code implementation22 Aug 2022 Yihe Liu, Ziqi Yuan, Huisheng Mao, Zhiyun Liang, Wanqiuyue Yang, Yuanzhe Qiu, Tie Cheng, Xiaoteng Li, Hua Xu, Kai Gao

The designed modality mixup module can be regarded as an augmentation, which mixes the acoustic and visual modalities from different videos.

Multimodal Sentiment Analysis

M-SENA: An Integrated Platform for Multimodal Sentiment Analysis

3 code implementations ACL 2022 Huisheng Mao, Ziqi Yuan, Hua Xu, Wenmeng Yu, Yihe Liu, Kai Gao

The platform features a fully modular video sentiment analysis framework consisting of data management, feature extraction, model training, and result analysis modules.

Management Multimodal Sentiment Analysis

Learning Discriminative Representations and Decision Boundaries for Open Intent Detection

1 code implementation11 Mar 2022 Hanlei Zhang, Hua Xu, Shaojie Zhao, Qianrui Zhou

To address these issues, this paper presents an original framework called DA-ADB, which successively learns distance-aware intent representations and adaptive decision boundaries for open intent detection.

Natural Language Understanding Open Intent Detection

Consistent Representation Learning for Continual Relation Extraction

1 code implementation Findings (ACL) 2022 Kang Zhao, Hua Xu, Jiangong Yang, Kai Gao

Specifically, supervised contrastive learning based on a memory bank is first used to train each new task so that the model can effectively learn the relation representation.

Continual Relation Extraction Contrastive Learning +3

Simple Recurrent Neural Networks is all we need for clinical events predictions using EHR data

1 code implementation3 Oct 2021 Laila Rasmy, Jie Zhu, Zhiheng Li, Xin Hao, Hong Thoai Tran, Yujia Zhou, Firat Tiryaki, Yang Xiang, Hua Xu, Degui Zhi

As a result, deep learning models developed for sequence modeling, like recurrent neural networks (RNNs) are common architecture for EHR-based clinical events predictive models.

Bayesian Optimization

An Empirical Study of UMLS Concept Extraction from Clinical Notes using Boolean Combination Ensembles

no code implementations4 Aug 2021 Greg M. Silverman, Raymond L. Finzel, Michael V. Heinz, Jake Vasilakes, Jacob C. Solinsky, Reed McEwan, Benjamin C. Knoll, Christopher J. Tignanelli, Hongfang Liu, Hua Xu, Xiaoqian Jiang, Genevieve B. Melton, Serguei VS Pakhomov

Our objective in this study is to investigate the behavior of Boolean operators on combining annotation output from multiple Natural Language Processing (NLP) systems across multiple corpora and to assess how filtering by aggregation of Unified Medical Language System (UMLS) Metathesaurus concepts affects system performance for Named Entity Recognition (NER) of UMLS concepts.

named-entity-recognition Named Entity Recognition +1

RGB Image Classification with Quantum Convolutional Ansaetze

no code implementations23 Jul 2021 Yu Jing, Xiaogang Li, Yang Yang, Chonghang Wu, Wenbing Fu, Wei Hu, Yuanyuan Li, Hua Xu

With the rapid growth of qubit numbers and coherence times in quantum hardware technology, implementing shallow neural networks on the so-called Noisy Intermediate-Scale Quantum (NISQ) devices has attracted a lot of interest.

Classification Image Classification

Discovering novel drug-supplement interactions using a dietary supplements knowledge graph generated from the biomedical literature

no code implementations24 Jun 2021 Dalton Schutte, Jake Vasilakes, Anu Bompelli, Yuqi Zhou, Marcelo Fiszman, Hua Xu, Halil Kilicoglu, Jeffrey R. Bishop, Terrence Adam, Rui Zhang

MATERIALS AND METHODS: We created SemRepDS (an extension of SemRep), capable of extracting semantic relations from abstracts by leveraging a DS-specific terminology (iDISK) containing 28, 884 DS terms not found in the UMLS.

Deep Open Intent Classification with Adaptive Decision Boundary

1 code implementation18 Dec 2020 Hanlei Zhang, Hua Xu, Ting-En Lin

In this paper, we propose a post-processing method to learn the adaptive decision boundary (ADB) for open intent classification.

Classification General Classification +3

Discovering New Intents with Deep Aligned Clustering

2 code implementations16 Dec 2020 Hanlei Zhang, Hua Xu, Ting-En Lin, Rui Lyu

In this work, we propose an effective method, Deep Aligned Clustering, to discover new intents with the aid of the limited known intent data.

Clustering Open Intent Discovery +1

Cross-Modal BERT for Text-Audio Sentiment Analysis

1 code implementation ACM Multimedia 2020 Kaicheng Yang, Hua Xu, Kai Gao

In this paper, we propose the Cross-Modal BERT (CM-BERT), which relies on the interaction of text and audio modality to fine-tune the pre-trained BERT model.

Multimodal Sentiment Analysis Natural Language Inference +1

COVID-19 SignSym: a fast adaptation of a general clinical NLP tool to identify and normalize COVID-19 signs and symptoms to OMOP common data model

no code implementations13 Jul 2020 Jingqi Wang, Noor Abu-el-rub, Josh Gray, Huy Anh Pham, Yujia Zhou, Frank Manion, Mei Liu, Xing Song, Hua Xu, Masoud Rouhizadeh, Yaoyun Zhang

To this end, this study aims at adapting the existing CLAMP natural language processing tool to quickly build COVID-19 SignSym, which can extract COVID-19 signs/symptoms and their 8 attributes (body location, severity, temporal expression, subject, condition, uncertainty, negation, and course) from clinical text.

Negation

Finding structural hole spanners based on community forest model and diminishing marginal utility in large scale social networks

no code implementations Knowledge-Based Systems, 105916. 2020 Yan Zhang, Hua Xu, Yunfeng Xu, Junhui Deng, Juan Gu, Rui Ma, Jie Lai, Jiangtao Hu, Xiaoshuai Yu, Lei Hou, Lidong Gu, Yanling Wei, Yichao Xiao, Junhao Lu

In this paper, we try to give a more visual and detailed definition of structural hole spanner based on the existing work, and propose a novel algorithm to identify structural hole spanner based on community forest model and diminishing marginal utility.

Community Detection Link Prediction +2

Robustly Pre-trained Neural Model for Direct Temporal Relation Extraction

no code implementations13 Apr 2020 Hong Guan, Jianfu Li, Hua Xu, Murthy Devarakonda

Background: Identifying relationships between clinical events and temporal expressions is a key challenge in meaningfully analyzing clinical text for use in advanced AI applications.

Language Modeling Language Modelling +2

A Post-processing Method for Detecting Unknown Intent of Dialogue System via Pre-trained Deep Neural Network Classifier

1 code implementation7 Mar 2020 Ting-En Lin, Hua Xu

In this paper, we propose SofterMax and deep novelty detection (SMDN), a simple yet effective post-processing method for detecting unknown intent in dialogue systems based on pre-trained deep neural network classifiers.

Intent Detection Novelty Detection

BERT-based Ranking for Biomedical Entity Normalization

no code implementations9 Aug 2019 Zongcheng Ji, Qiang Wei, Hua Xu

Developing high-performance entity normalization algorithms that can alleviate the term variation problem is of great interest to the biomedical community.

Word Embeddings

Deep Unknown Intent Detection with Margin Loss

1 code implementation ACL 2019 Ting-En Lin, Hua Xu

With margin loss, we can learn discriminative deep features by forcing the network to maximize inter-class variance and to minimize intra-class variance.

Novelty Detection Open Intent Detection

Enhancing Clinical Concept Extraction with Contextual Embeddings

no code implementations22 Feb 2019 Yuqi Si, Jingqi Wang, Hua Xu, Kirk Roberts

We explore a battery of embedding methods consisting of traditional word embeddings and contextual embeddings, and compare these on four concept extraction corpora: i2b2 2010, i2b2 2012, SemEval 2014, and SemEval 2015.

Clinical Concept Extraction Language Modelling +2

Cannot find the paper you are looking for? You can Submit a new open access paper.