Search Results for author: Zhichao Yang

Found 24 papers, 13 papers with code

AesExpert: Towards Multi-modality Foundation Model for Image Aesthetics Perception

1 code implementation • 15 Apr 2024 • Yipo Huang, Xiangfei Sheng, Zhichao Yang, Quan Yuan, Zhichao Duan, Pengfei Chen, Leida Li, Weisi Lin, Guangming Shi

To address the above challenge, we first introduce a comprehensively annotated Aesthetic Multi-Modality Instruction Tuning (AesMMIT) dataset, which serves as the footstone for building multi-modality aesthetics foundation models.

ClinicalMamba: A Generative Clinical Language Model on Longitudinal Clinical Notes

1 code implementation • 9 Mar 2024 • Zhichao Yang, Avijit Mitra, Sunjae Kwon, Hong Yu

The advancement of natural language processing (NLP) systems in healthcare hinges on the ability of language models to interpret the intricate information contained within clinical notes.

Few-Shot Learning, Language Modelling

JMLR: Joint Medical LLM and Retrieval Training for Enhancing Reasoning and Professional Question Answering Capability

1 code implementation • 27 Feb 2024 • Junda Wang, Zhichao Yang, Zonghai Yao, Hong Yu

Unlike previous RAG methods, where the retrieval model was trained separately from the LLM, we introduce JMLR, which jointly trains the LLM and the information retrieval (IR) model during the fine-tuning phase.

Information Retrieval, Question Answering, +1

README: Bridging Medical Jargon and Lay Understanding for Patient Education through Data-Centric NLP

1 code implementation • 24 Dec 2023 • Zonghai Yao, Nandyala Siddharth Kantu, Guanghao Wei, Hieu Tran, Zhangqi Duan, Sunjae Kwon, Zhichao Yang, README annotation team, Hong Yu

The advancement in healthcare has shifted focus toward patient-centric approaches, particularly in self-care and patient education, facilitated by access to Electronic Health Records (EHR).

Surpassing GPT-4 Medical Coding with a Two-Stage Approach

no code implementations • 22 Nov 2023 • Zhichao Yang, Sanjit Singh Batra, Joel Stremmel, Eran Halperin

Recent advances in large language models (LLMs) show potential for clinical applications, such as clinical decision support and trial recommendations.

Sentence

Do Physicians Know How to Prompt? The Need for Automatic Prompt Optimization Help in Clinical Note Generation

no code implementations • 16 Nov 2023 • Zonghai Yao, Ahmed Jaafar, Beining Wang, Zhichao Yang, Hong Yu

We recommend a two-phase optimization process, leveraging APO-GPT4 for consistency and expert input for personalization.

Prompt Engineering

NoteChat: A Dataset of Synthetic Doctor-Patient Conversations Conditioned on Clinical Notes

1 code implementation • 24 Oct 2023 • Junda Wang, Zonghai Yao, Zhichao Yang, Huixue Zhou, Rumeng Li, Xun Wang, Yucheng Xu, Hong Yu

We introduce NoteChat, a novel cooperative multi-agent framework leveraging Large Language Models (LLMs) to generate patient-physician dialogues.

Dialogue Generation

UMASS_BioNLP at MEDIQA-Chat 2023: Can LLMs generate high-quality synthetic note-oriented doctor-patient conversations?

1 code implementation • 29 Jun 2023 • Junda Wang, Zonghai Yao, Avijit Mitra, Samuel Osebe, Zhichao Yang, Hong Yu

This paper presents the UMASS_BioNLP team's participation in the MEDIQA-Chat 2023 shared task for Task-A and Task-C. We focus especially on Task-C and propose a novel LLM cooperation system, named the doctor-patient loop, to generate high-quality conversation datasets.

Interpretable Math Word Problem Solution Generation Via Step-by-step Planning

no code implementations • 1 Jun 2023 • Mengxue Zhang, Zichao Wang, Zhichao Yang, Weiqi Feng, Andrew Lan

We propose a step-by-step planning approach for intermediate solution generation, which strategically plans the generation of the next solution step based on the MWP and the previous solution steps.

GSM8K, Language Modelling, +1

Vision Meets Definitions: Unsupervised Visual Word Sense Disambiguation Incorporating Gloss Information

1 code implementation • 2 May 2023 • Sunjae Kwon, Rishabh Garodia, Minhwa Lee, Zhichao Yang, Hong Yu

Specifically, we suggest employing Bayesian inference to incorporate the sense definitions when sense information of the answer is not provided.

Bayesian Inference, Image-text matching, +2

Copula-based transferable models for synthetic population generation

no code implementations • 17 Feb 2023 • Pascal Jutras-Dubé, Mohammad B. Al-Khasawneh, Zhichao Yang, Javier Bas, Fabian Bastin, Cinzia Cirillo

Population synthesis involves generating synthetic yet realistic representations of a target population of micro-agents for behavioral modeling and simulation.

Enhancing the prediction of disease outcomes using electronic health records and pretrained deep learning models

no code implementations • 22 Dec 2022 • Zhichao Yang, Weisong Liu, Dan Berlowitz, Hong Yu

Question: Can an encoder-decoder architecture pretrained on a large dataset of longitudinal electronic health records improve patient outcome predictions?

Denoising

An Automatic SOAP Classification System Using Weakly Supervision And Transfer Learning

no code implementations • 26 Nov 2022 • Sunjae Kwon, Zhichao Yang, Hong Yu

The transfer learning framework helps migrate the SOAP classification model between hospitals using only a minimal manually annotated dataset.

Classification, Language Modelling, +1

Multi-label Few-shot ICD Coding as Autoregressive Generation with Prompt

1 code implementation • 24 Nov 2022 • Zhichao Yang, Sunjae Kwon, Zonghai Yao, Hong Yu

This task is challenging due to the high-dimensional space of multi-label assignment (155,000+ ICD code candidates) and the long-tail challenge: many ICD codes are infrequently assigned, yet infrequent ICD codes are clinically important.

Multi-Label Classification

Context Variance Evaluation of Pretrained Language Models for Prompt-based Biomedical Knowledge Probing

no code implementations • 18 Nov 2022 • Zonghai Yao, Yi Cao, Zhichao Yang, Hong Yu

Different from the previous known-unknown evaluation criteria, we propose the concept of "Misunderstand" in LAMA for the first time.

Knowledge Probing

Knowledge Injected Prompt Based Fine-tuning for Multi-label Few-shot ICD Coding

1 code implementation • 7 Oct 2022 • Zhichao Yang, Shufan Wang, Bhanu Pratap Singh Rawat, Avijit Mitra, Hong Yu

Automatic International Classification of Diseases (ICD) coding aims to assign multiple ICD codes to a medical note with an average length of 3,000+ tokens.

Contrastive Learning, Medical Code Prediction

Extracting Biomedical Factual Knowledge Using Pretrained Language Model and Electronic Health Record Context

no code implementations • 26 Aug 2022 • Zonghai Yao, Yi Cao, Zhichao Yang, Vijeta Deshpande, Hong Yu

To bring the use of LMs as KBs more in line with actual application scenarios in the biomedical domain, we specifically add EHR notes as context to the prompt to improve the lower bound in the biomedical domain.

Language Modelling

Word Embedding Perturbation for Sentence Classification

1 code implementation • 22 Apr 2018 • Dongxu Zhang, Zhichao Yang

In this technical report, we aim to mitigate the overfitting problem of natural language by applying data augmentation methods.

Classification, Data Augmentation, +3
