Search Results for author: Hoifung Poon

Found 54 papers, 16 papers with code

Offset Unlearning for Large Language Models

no code implementations 17 Apr 2024 James Y. Huang, Wenxuan Zhou, Fei Wang, Fred Morstatter, Sheng Zhang, Hoifung Poon, Muhao Chen

Despite the strong capabilities of Large Language Models (LLMs) to acquire knowledge from their training corpora, the memorization of sensitive information in those corpora, such as copyrighted, harmful, and private content, has led to ethical and legal concerns.

Memorization

T-Rex: Text-assisted Retrosynthesis Prediction

1 code implementation 26 Jan 2024 Yifeng Liu, Hanwen Xu, Tangqi Fang, Haocheng Xi, Zixuan Liu, Sheng Zhang, Hoifung Poon, Sheng Wang

As a fundamental task in computational chemistry, retrosynthesis prediction aims to identify a set of reactants to synthesize a target molecule.

Re-Ranking Retrosynthesis

BiomedJourney: Counterfactual Biomedical Image Generation by Instruction-Learning from Multimodal Patient Journeys

no code implementations 16 Oct 2023 Yu Gu, Jianwei Yang, Naoto Usuyama, Chunyuan Li, Sheng Zhang, Matthew P. Lungren, Jianfeng Gao, Hoifung Poon

In a comprehensive battery of tests on counterfactual medical image generation, BiomedJourney substantially outperforms prior state-of-the-art methods for instruction-based image editing and medical image generation, such as InstructPix2Pix and RoentGen.

counterfactual Denoising +2

Distilling Large Language Models for Biomedical Knowledge Extraction: A Case Study on Adverse Drug Events

no code implementations 12 Jul 2023 Yu Gu, Sheng Zhang, Naoto Usuyama, Yonas Woldesenbet, Cliff Wong, Praneeth Sanapathi, Mu Wei, Naveen Valluri, Erika Strandberg, Tristan Naumann, Hoifung Poon

We find that while LLMs already possess decent competency in structuring biomedical text, distilling them into a task-specific student model through self-supervised learning yields substantial gains over out-of-the-box LLMs, along with additional advantages such as lower cost, greater efficiency, and white-box model access.

Self-Supervised Learning
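To make the distillation recipe in this entry concrete, here is a minimal sketch under a generic set of assumptions: an LLM "teacher" annotates unlabeled biomedical sentences, and a small, white-box student model is trained on the resulting pseudo-labels. The `teacher_label` stub, the toy sentences, and the bag-of-words student are illustrative stand-ins, not the paper's pipeline.

```python
# Hypothetical sketch of LLM-to-student distillation for adverse-drug-event (ADE)
# detection. The teacher call is stubbed out; in practice it would prompt an LLM
# and parse its answer, and the student would be a biomedical transformer.
from dataclasses import dataclass

from sklearn.feature_extraction.text import CountVectorizer
from sklearn.linear_model import LogisticRegression


@dataclass
class Example:
    text: str
    label: int  # 1 = sentence reports an adverse drug event, 0 = it does not


def teacher_label(sentence: str) -> int:
    """Stand-in for asking an LLM whether the sentence reports an ADE."""
    return int(any(w in sentence.lower() for w in ("rash", "nausea", "dizziness")))


unlabeled = [
    "Patient developed a rash after starting amoxicillin.",
    "Blood pressure was stable throughout the visit.",
    "She reported nausea two days after the new statin was added.",
    "Follow-up imaging showed no change.",
]

# Step 1: self-supervision; let the teacher annotate the unlabeled text.
pseudo_labeled = [Example(s, teacher_label(s)) for s in unlabeled]

# Step 2: train a compact, white-box student on the pseudo-labels.
vec = CountVectorizer()
X = vec.fit_transform([e.text for e in pseudo_labeled])
student = LogisticRegression().fit(X, [e.label for e in pseudo_labeled])

print(student.predict(vec.transform(["He experienced dizziness after the infusion."])))
```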

Automatic Calibration and Error Correction for Generative Large Language Models via Pareto Optimal Self-Supervision

no code implementations 28 Jun 2023 Theodore Zhao, Mu Wei, J. Samuel Preston, Hoifung Poon

Generative large language models (LLMs) have demonstrated remarkable capabilities for a wide range of applications, but reducing ungrounded or erroneous responses remains a major challenge.

Relation Extraction

LLaVA-Med: Training a Large Language-and-Vision Assistant for Biomedicine in One Day

no code implementations NeurIPS 2023 Chunyuan Li, Cliff Wong, Sheng Zhang, Naoto Usuyama, Haotian Liu, Jianwei Yang, Tristan Naumann, Hoifung Poon, Jianfeng Gao

In this paper, we propose a cost-efficient approach for training a vision-language conversational assistant that can answer open-ended research questions of biomedical images.

Instruction Following Language Modelling +2

Self-Verification Improves Few-Shot Clinical Information Extraction

1 code implementation 30 May 2023 Zelalem Gero, Chandan Singh, Hao Cheng, Tristan Naumann, Michel Galley, Jianfeng Gao, Hoifung Poon

Extracting patient information from unstructured text is a critical task in health decision-support and clinical research.

In-Context Learning
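As a rough illustration of the self-verification idea in this entry, the sketch below runs an extract step followed by a verify step that asks the model to ground each candidate in the source note. The `call_llm` placeholder and both prompt templates are assumptions standing in for a real LLM API, not the paper's prompts.

```python
# Hypothetical extract-then-verify loop for clinical information extraction.
# `call_llm` is a placeholder for any chat/completion API; prompts are illustrative.
from typing import List


def call_llm(prompt: str) -> str:
    """Placeholder LLM call; wire this up to a real API or local model."""
    raise NotImplementedError


def extract_medications(note: str) -> List[str]:
    answer = call_llm(
        "List every medication mentioned in the clinical note below, one per line.\n\n"
        f"Note:\n{note}\n\nMedications:"
    )
    return [line.strip() for line in answer.splitlines() if line.strip()]


def verify(note: str, candidate: str) -> bool:
    answer = call_llm(
        f'Does the note explicitly mention the medication "{candidate}"? '
        "Quote the supporting sentence, or answer NO.\n\n"
        f"Note:\n{note}"
    )
    return not answer.strip().upper().startswith("NO")


def extract_with_self_verification(note: str) -> List[str]:
    # Keep only candidates the model can ground in the note itself.
    return [m for m in extract_medications(note) if verify(note, m)]
```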

Context-faithful Prompting for Large Language Models

1 code implementation 20 Mar 2023 Wenxuan Zhou, Sheng Zhang, Hoifung Poon, Muhao Chen

However, their reliance on parametric knowledge may cause them to overlook contextual cues, leading to incorrect predictions in context-sensitive NLP tasks (e.g., knowledge acquisition tasks).

counterfactual Machine Reading Comprehension +1
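One way to make the context-faithful idea in this entry tangible is to reformulate the prompt so the model must answer according to the provided passage rather than its parametric memory. The templates below, including the narrator framing, are illustrative assumptions, not the paper's exact prompts.

```python
# Illustrative prompt templates that push a model to answer from the given
# context rather than from memorized world knowledge.

def base_prompt(context: str, question: str) -> str:
    return f"Context: {context}\nQuestion: {question}\nAnswer:"


def narrator_prompt(context: str, question: str) -> str:
    # Attribute the context to a narrator and ask for the answer according to
    # that narrator, which discourages falling back on memorized facts.
    return (
        f'Bob said, "{context}"\n'
        f"Question: {question.rstrip('?')} according to Bob's statement?\n"
        "Answer:"
    )


print(narrator_prompt(
    "The capital of Atlantis is Poseidonia.",   # a fact that exists only in the context
    "What is the capital of Atlantis?",
))
```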

BLIAM: Literature-based Data Synthesis for Synergistic Drug Combination Prediction

no code implementations 14 Feb 2023 Cai Yang, Addie Woicik, Hoifung Poon, Sheng Wang

Instead of obtaining features from language models, we propose BLIAM, a literature-based data synthesis approach to directly generate training data points that are interpretable and model-agnostic to downstream applications.

Data Augmentation Language Modelling

Continual Contrastive Finetuning Improves Low-Resource Relation Extraction

no code implementations 21 Dec 2022 Wenxuan Zhou, Sheng Zhang, Tristan Naumann, Muhao Chen, Hoifung Poon

In this paper, we aim to bridge the gap and propose to pretrain and finetune the RE model using consistent contrastive learning objectives.

Contrastive Learning Relation +3

BioGPT: Generative Pre-trained Transformer for Biomedical Text Generation and Mining

2 code implementations 19 Oct 2022 Renqian Luo, Liai Sun, Yingce Xia, Tao Qin, Sheng Zhang, Hoifung Poon, Tie-Yan Liu

Pre-trained language models have attracted increasing attention in the biomedical domain, inspired by their great success in the general natural language domain.

 Ranked #1 on Document Classification on HOC (Micro F1 metric)

Document Classification Language Modelling +3

Optimizing Bi-Encoder for Named Entity Recognition via Contrastive Learning

1 code implementation 30 Aug 2022 Sheng Zhang, Hao Cheng, Jianfeng Gao, Hoifung Poon

We present a bi-encoder framework for named entity recognition (NER), which applies contrastive learning to map candidate text spans and entity types into the same vector representation space.

Contrastive Learning Metric Learning +5
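The following is a minimal sketch of the bi-encoder-with-contrastive-learning idea in this entry: span and entity-type representations are mapped into a shared space and trained so each span lands closest to its gold type. The linear encoders, random features, and temperature value are placeholders, not the paper's model.

```python
# Minimal contrastive bi-encoder sketch: spans and entity types share one
# embedding space; each span is pulled toward its gold type and pushed away
# from the other types.
import torch
import torch.nn.functional as F

torch.manual_seed(0)
dim = 32

span_encoder = torch.nn.Linear(dim, dim)   # stands in for a span encoder head
type_encoder = torch.nn.Linear(dim, dim)   # stands in for an entity-type encoder

# Toy features for 4 candidate spans and 3 entity types, plus gold type ids.
span_feats = torch.randn(4, dim)
type_feats = torch.randn(3, dim)
gold_types = torch.tensor([0, 2, 1, 0])


def contrastive_loss() -> torch.Tensor:
    s = F.normalize(span_encoder(span_feats), dim=-1)
    t = F.normalize(type_encoder(type_feats), dim=-1)
    logits = s @ t.T / 0.07          # cosine similarities with a temperature
    return F.cross_entropy(logits, gold_types)


params = list(span_encoder.parameters()) + list(type_encoder.parameters())
opt = torch.optim.Adam(params, lr=1e-2)
for _ in range(100):
    opt.zero_grad()
    contrastive_loss().backward()
    opt.step()

# At inference, each span is assigned the nearest entity type in the shared space.
with torch.no_grad():
    sims = F.normalize(span_encoder(span_feats), dim=-1) @ F.normalize(type_encoder(type_feats), dim=-1).T
print(sims.argmax(dim=-1).tolist())   # should recover gold_types after training
```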

Making the Most of Text Semantics to Improve Biomedical Vision–Language Processing

1 code implementation 21 Apr 2022 Benedikt Boecking, Naoto Usuyama, Shruthi Bannur, Daniel C. Castro, Anton Schwaighofer, Stephanie Hyland, Maria Wetscherek, Tristan Naumann, Aditya Nori, Javier Alvarez-Valle, Hoifung Poon, Ozan Oktay

We release a new dataset with locally-aligned phrase grounding annotations by radiologists to facilitate the study of complex semantic modelling in biomedical vision–language processing.

Contrastive Learning Language Modelling +4

Knowledge-Rich Self-Supervision for Biomedical Entity Linking

no code implementations 15 Dec 2021 Sheng Zhang, Hao Cheng, Shikhar Vashishth, Cliff Wong, Jinfeng Xiao, Xiaodong Liu, Tristan Naumann, Jianfeng Gao, Hoifung Poon

Zero-shot entity linking has emerged as a promising direction for generalizing to new entities, but it still requires example gold entity mentions during training and canonical descriptions for all entities, both of which are rarely available outside of Wikipedia.

Contrastive Learning Entity Linking

Modular Self-Supervision for Document-Level Relation Extraction

no code implementations EMNLP 2021 Sheng Zhang, Cliff Wong, Naoto Usuyama, Sarthak Jain, Tristan Naumann, Hoifung Poon

Extracting relations across large text spans has been relatively underexplored in NLP, but it is particularly important for high-value domains such as biomedicine, where obtaining high recall of the latest findings is crucial for practical applications.

Document-level Relation Extraction Reading Comprehension +1

Combining Probabilistic Logic and Deep Learning for Self-Supervised Learning

no code implementations 27 Jul 2021 Hoifung Poon, Hai Wang, Hunter Lang

We first present deep probabilistic logic (DPL), which offers a unifying framework for task-specific self-supervision by composing probabilistic logic with deep learning.

Active Learning Language Modelling +5

Targeted Adversarial Training for Natural Language Understanding

1 code implementation NAACL 2021 Lis Pereira, Xiaodong Liu, Hao Cheng, Hoifung Poon, Jianfeng Gao, Ichiro Kobayashi

We present a simple yet effective Targeted Adversarial Training (TAT) algorithm to improve adversarial training for natural language understanding.

Natural Language Understanding

Domain-Specific Language Model Pretraining for Biomedical Natural Language Processing

1 code implementation 31 Jul 2020 Yu Gu, Robert Tinn, Hao Cheng, Michael Lucas, Naoto Usuyama, Xiaodong Liu, Tristan Naumann, Jianfeng Gao, Hoifung Poon

In this paper, we challenge this assumption by showing that for domains with abundant unlabeled text, such as biomedicine, pretraining language models from scratch results in substantial gains over continual pretraining of general-domain language models.

Continual Pretraining +11

Adversarial Training for Large Neural Language Models

3 code implementations 20 Apr 2020 Xiaodong Liu, Hao Cheng, Pengcheng He, Weizhu Chen, Yu Wang, Hoifung Poon, Jianfeng Gao

In natural language processing (NLP), pre-training large neural language models such as BERT has demonstrated impressive gains in generalization for a variety of tasks, with further improvement from adversarial fine-tuning.

Ranked #6 on Natural Language Inference on ANLI test (using extra training data)

Natural Language Inference Natural Language Understanding
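In the spirit of the adversarial fine-tuning described in this entry, the sketch below perturbs input embeddings in the loss-increasing direction and trains on both clean and perturbed inputs. The tiny classifier, single perturbation step, and plain cross-entropy regularizer are simplifying assumptions for illustration; the paper's exact formulation differs in detail.

```python
# Sketch of adversarial training in embedding space: find a small perturbation
# that increases the loss, then train on clean and perturbed inputs together.
import torch
import torch.nn.functional as F

torch.manual_seed(0)
emb = torch.nn.Embedding(100, 16)
clf = torch.nn.Linear(16, 2)
opt = torch.optim.Adam(list(emb.parameters()) + list(clf.parameters()), lr=1e-3)

tokens = torch.randint(0, 100, (8, 5))   # a toy batch of 8 "sentences"
labels = torch.randint(0, 2, (8,))
eps = 1e-2                               # perturbation radius

for _ in range(10):
    x = emb(tokens).mean(dim=1)          # crude sentence embeddings
    clean_loss = F.cross_entropy(clf(x), labels)

    # Find an embedding-space perturbation that increases the loss.
    delta = torch.zeros_like(x, requires_grad=True)
    adv_loss = F.cross_entropy(clf(x.detach() + delta), labels)
    grad, = torch.autograd.grad(adv_loss, delta)
    delta = eps * grad / (grad.norm(dim=-1, keepdim=True) + 1e-8)

    # Train on clean + adversarial inputs (an embedding-smoothness regularizer).
    total = clean_loss + F.cross_entropy(clf(x + delta.detach()), labels)
    opt.zero_grad()
    total.backward()
    opt.step()

print(float(total))
```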

Deep Probabilistic Logic: A Unifying Framework for Indirect Supervision

no code implementations EMNLP 2018 Hai Wang, Hoifung Poon

In this paper, we propose deep probabilistic logic (DPL) as a general framework for indirect supervision, by composing probabilistic logic with deep learning.

Reading Comprehension Representation Learning

Neural-Symbolic Learning and Reasoning: A Survey and Interpretation

no code implementations 10 Nov 2017 Tarek R. Besold, Artur d'Avila Garcez, Sebastian Bader, Howard Bowman, Pedro Domingos, Pascal Hitzler, Kai-Uwe Kuehnberger, Luis C. Lamb, Daniel Lowd, Priscila Machado Vieira Lima, Leo de Penning, Gadi Pinkas, Hoifung Poon, Gerson Zaverucha

Recent studies in cognitive science, artificial intelligence, and psychology have produced a number of cognitive models of reasoning, learning, and language that are underpinned by computation.

Philosophy

EZLearn: Exploiting Organic Supervision in Large-Scale Data Annotation

no code implementations 25 Sep 2017 Maxim Grechkin, Hoifung Poon, Bill Howe

In science and other high-value domains, large repositories of data samples are often available, together with two sources of organic supervision: a lexicon for the annotation classes, and text descriptions that accompany some data samples.

NLP for Precision Medicine

no code implementations ACL 2017 Hoifung Poon, Chris Quirk, Kristina Toutanova, Wen-tau Yih

We will introduce precision medicine and showcase the vast opportunities for NLP in this burgeoning field with great societal impact.

Decision Making Entity Linking +2

Distant Supervision for Relation Extraction beyond the Sentence Boundary

no code implementations EACL 2017 Chris Quirk, Hoifung Poon

At the core of our approach is a graph representation that can incorporate both standard dependencies and discourse relations, thus providing a unifying way to model relations within and across sentences.

Relation Relation Extraction +1
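To illustrate the document-graph idea in this entry, the sketch below links token nodes with intra-sentence edges plus one cross-sentence edge, then extracts the shortest path between two entity mentions as a cross-sentence relation candidate. The hand-coded edges stand in for a real dependency parser and discourse links, and the sentences are invented.

```python
# Illustrative document graph combining intra-sentence links (standing in for a
# dependency parse) with a cross-sentence link (standing in for discourse or
# sentence adjacency). The shortest path between two mentions becomes the
# candidate structure for cross-sentence relation extraction.
import networkx as nx

sent1 = ["Gefitinib", "inhibits", "EGFR", "signaling", "."]
sent2 = ["This", "mutation", "confers", "resistance", "."]
tokens = sent1 + sent2

g = nx.Graph()
# Intra-sentence edges (a real system would take these from a dependency parser).
for sent, offset in [(sent1, 0), (sent2, len(sent1))]:
    for i in range(len(sent) - 1):
        g.add_edge(offset + i, offset + i + 1)

# Cross-sentence edge linking the two sentences' main predicates.
g.add_edge(sent1.index("inhibits"), len(sent1) + sent2.index("confers"))

drug, mutation = tokens.index("Gefitinib"), tokens.index("mutation")
path = nx.shortest_path(g, drug, mutation)
print([tokens[i] for i in path])   # ['Gefitinib', 'inhibits', 'confers', 'mutation']
```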

Sum-Product Networks: A New Deep Architecture

1 code implementation 14 Feb 2012 Hoifung Poon, Pedro Domingos

Experiments show that inference and learning with SPNs can be both faster and more accurate than with standard deep networks.
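As a concrete illustration of a sum-product network, the sketch below builds a tiny SPN over two binary variables (indicator leaves, product nodes over disjoint scopes, sum nodes as weighted mixtures) and evaluates it in a single bottom-up pass. The structure and weights are invented for the example, not taken from the paper.

```python
# Tiny sum-product network evaluated bottom-up. Leaves are indicators over
# single binary variables, product nodes combine disjoint variable scopes,
# and sum nodes are weighted mixtures over the same scope.
import math


def leaf(var, value):
    # Indicator leaf: 1.0 if variable `var` takes `value` in assignment x, else 0.0.
    return lambda x: 1.0 if x[var] == value else 0.0


def product(*children):
    return lambda x: math.prod(c(x) for c in children)


def weighted_sum(weighted_children):
    return lambda x: sum(w * c(x) for w, c in weighted_children)


# Two binary variables X0, X1; the root mixes two product components.
comp_a = product(weighted_sum([(0.9, leaf(0, 1)), (0.1, leaf(0, 0))]),
                 weighted_sum([(0.8, leaf(1, 1)), (0.2, leaf(1, 0))]))
comp_b = product(weighted_sum([(0.2, leaf(0, 1)), (0.8, leaf(0, 0))]),
                 weighted_sum([(0.3, leaf(1, 1)), (0.7, leaf(1, 0))]))
root = weighted_sum([(0.5, comp_a), (0.5, comp_b)])

# Exact inference is one bottom-up pass, linear in the network size.
print(root({0: 1, 1: 1}))                                        # P(X0=1, X1=1)
print(sum(root({0: a, 1: b}) for a in (0, 1) for b in (0, 1)))   # normalizes to 1.0
```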
