Search Results for author: Yu Gu

Found 51 papers, 19 papers with code

AgentBench: Evaluating LLMs as Agents

1 code implementation • 7 Aug 2023 • Xiao Liu, Hao Yu, Hanchen Zhang, Yifan Xu, Xuanyu Lei, Hanyu Lai, Yu Gu, Hangliang Ding, Kaiwen Men, Kejuan Yang, Shudan Zhang, Xiang Deng, Aohan Zeng, Zhengxiao Du, Chenhui Zhang, Sheng Shen, Tianjun Zhang, Yu Su, Huan Sun, Minlie Huang, Yuxiao Dong, Jie Tang

We present AgentBench, a multi-dimensional evolving benchmark that currently consists of 8 distinct environments to assess LLM-as-Agent's reasoning and decision-making abilities in a multi-turn open-ended generation setting.

Decision Making Instruction Following

1,786

Paper
Code

Mind2Web: Towards a Generalist Agent for the Web

1 code implementation • NeurIPS 2023 • Xiang Deng, Yu Gu, Boyuan Zheng, Shijie Chen, Samuel Stevens, Boshi Wang, Huan Sun, Yu Su

We introduce Mind2Web, the first dataset for developing and evaluating generalist agents for the web that can follow language instructions to complete complex tasks on any website.

558

Paper
Code

Structure-Aware Language Model Pretraining Improves Dense Retrieval on Structured Data

1 code implementation • 31 May 2023 • Xinze Li, Zhenghao Liu, Chenyan Xiong, Shi Yu, Yu Gu, Zhiyuan Liu, Ge Yu

SANTA proposes two pretraining methods to make language models structure-aware and learn effective representations for structured data: 1) Structured Data Alignment, which utilizes the natural alignment relations between structured data and unstructured data for structure-aware pretraining.

Code Search Language Modelling +1

132

Paper
Code

Beyond I.I.D.: Three Levels of Generalization for Question Answering on Knowledge Bases

1 code implementation • 16 Nov 2020 • Yu Gu, Sue Kase, Michelle Vanni, Brian Sadler, Percy Liang, Xifeng Yan, Yu Su

To facilitate the development of KBQA models with stronger generalization, we construct and release a new large-scale, high-quality dataset with 64, 331 questions, GrailQA, and provide evaluation settings for all three levels of generalization.

Knowledge Base Question Answering

Paper
Code

Don't Generate, Discriminate: A Proposal for Grounding Language Models to Real-World Environments

2 code implementations • 19 Dec 2022 • Yu Gu, Xiang Deng, Yu Su

Most existing work for grounded language understanding uses LMs to directly generate plans that can be executed in the environment to achieve the desired effects.

In-Context Learning Knowledge Base Question Answering +1

Paper
Code

FlexKBQA: A Flexible LLM-Powered Framework for Few-Shot Knowledge Base Question Answering

1 code implementation • 23 Aug 2023 • Zhenyu Li, Sunqi Fan, Yu Gu, Xiuxing Li, Zhichao Duan, Bowen Dong, Ning Liu, Jianyong Wang

Knowledge base question answering (KBQA) is a critical yet challenging task due to the vast number of entities within knowledge bases and the diversity of natural language questions posed by users.

Knowledge Base Question Answering

Paper
Code

Few-shot In-context Learning for Knowledge Base Question Answering

1 code implementation • 2 May 2023 • Tianle Li, Xueguang Ma, Alex Zhuang, Yu Gu, Yu Su, Wenhu Chen

On GrailQA and WebQSP, our model is also on par with other fully-trained models.

In-Context Learning Knowledge Base Question Answering

Paper
Code

KoLA: Carefully Benchmarking World Knowledge of Large Language Models

1 code implementation • 15 Jun 2023 • Jifan Yu, Xiaozhi Wang, Shangqing Tu, Shulin Cao, Daniel Zhang-li, Xin Lv, Hao Peng, Zijun Yao, Xiaohan Zhang, Hanming Li, Chunyang Li, Zheyuan Zhang, Yushi Bai, Yantao Liu, Amy Xin, Nianyi Lin, Kaifeng Yun, Linlu Gong, Jianhui Chen, Zhili Wu, Yunjia Qi, Weikai Li, Yong Guan, Kaisheng Zeng, Ji Qi, Hailong Jin, Jinxin Liu, Yu Gu, Yuan YAO, Ning Ding, Lei Hou, Zhiyuan Liu, Bin Xu, Jie Tang, Juanzi Li

The unprecedented performance of large language models (LLMs) necessitates improvements in evaluations.

Benchmarking Hallucination +1

Paper
Code

Text Matching Improves Sequential Recommendation by Reducing Popularity Biases

1 code implementation • 27 Aug 2023 • Zhenghao Liu, Sen Mei, Chenyan Xiong, Xiaohua LI, Shi Yu, Zhiyuan Liu, Yu Gu, Ge Yu

TASTE alleviates the cold start problem by representing long-tail items using full-text modeling and bringing the benefits of pretrained language models to recommendation systems.

Sequential Recommendation Text Matching

Paper
Code

Improved Planetary Rover Inertial Navigation and Wheel Odometry Performance through Periodic Use of Zero-Type Constraints

4 code implementations • 20 Jun 2019 • Cagri Kilic, Jason N. Gross, Nicholas Ohi, Ryan Watson, Jared Strader, Thomas Swiger, Scott Harper, Yu Gu

We present an approach to enhance wheeled planetary rover dead-reckoning localization performance by leveraging the use of zero-type constraint equations in the navigation filter.

Robotics

Paper
Code

Slip-Based Autonomous ZUPT through Gaussian Process to Improve Planetary Rover Localization

1 code implementation • 13 Mar 2021 • Cagri Kilic, Nicholas Ohi, Yu Gu, Jason N. Gross

The zero-velocity update (ZUPT) algorithm provides valuable state information to maintain the inertial navigation system (INS) reliability when stationary conditions are satisfied.

Paper
Code

Clues Before Answers: Generation-Enhanced Multiple-Choice QA

1 code implementation • NAACL 2022 • Zixian Huang, Ao Wu, Jiaying Zhou, Yu Gu, Yue Zhao, Gong Cheng

A trending paradigm for multiple-choice question answering (MCQA) is using a text-to-text framework.

Ranked #10 on Question Answering on OpenBookQA

Multiple-choice Multiple Choice Question Answering (MCQA)

Paper
Code

ArcaneQA: Dynamic Program Induction and Contextualized Encoding for Knowledge Base Question Answering

1 code implementation • COLING 2022 • Yu Gu, Yu Su

Question answering on knowledge bases (KBQA) poses a unique challenge for semantic parsing research due to two intertwined challenges: large search space and ambiguities in schema linking.

Knowledge Base Question Answering Program induction +1

Paper
Code

MARVEL: Unlocking the Multi-Modal Capability of Dense Retrieval via Visual Module Plugin

1 code implementation • 21 Oct 2023 • Tianshuo Zhou, Sen Mei, Xinze Li, Zhenghao Liu, Chenyan Xiong, Zhiyuan Liu, Yu Gu, Ge Yu

To facilitate the multi-modal retrieval tasks, we build the ClueWeb22-MM dataset based on the ClueWeb22 dataset, which regards anchor texts as queries, and exacts the related text and image documents from anchor-linked web pages.

Language Modelling Retrieval +1

Paper
Code

Dimension Reduction for Efficient Dense Retrieval via Conditional Autoencoder

1 code implementation • 6 May 2022 • Zhenghao Liu, Han Zhang, Chenyan Xiong, Zhiyuan Liu, Yu Gu, Xiaohua LI

These embeddings need to be high-dimensional to fit training signals and guarantee the retrieval effectiveness of dense retrievers.

Ranked #1 on Information Retrieval on MS MARCO

Dimensionality Reduction Information Retrieval +1

Paper
Code

A Systematic Investigation of KB-Text Embedding Alignment at Scale

1 code implementation • ACL 2021 • Vardaan Pahuja, Yu Gu, Wenhu Chen, Mehdi Bahrami, Lei Liu, Wei-Peng Chen, Yu Su

Knowledge bases (KBs) and text often contain complementary knowledge: KBs store structured knowledge that can support long range reasoning, while text stores more comprehensive and timely knowledge in an unstructured way.

Link Prediction

Paper
Code

Domain-Specific Language Model Pretraining for Biomedical Natural Language Processing

1 code implementation • 31 Jul 2020 • Yu Gu, Robert Tinn, Hao Cheng, Michael Lucas, Naoto Usuyama, Xiaodong Liu, Tristan Naumann, Jianfeng Gao, Hoifung Poon

In this paper, we challenge this assumption by showing that for domains with abundant unlabeled text, such as biomedicine, pretraining language models from scratch results in substantial gains over continual pretraining of general-domain language models.

Ranked #2 on Participant Intervention Comparison Outcome Extraction on EBM-NLP (using extra training data)

Continual Pretraining +11

Paper
Code

Multichannel AV-wav2vec2: A Framework for Learning Multichannel Multi-Modal Speech Representation

1 code implementation • 7 Jan 2024 • Qiushi Zhu, Jie Zhang, Yu Gu, Yuchen Hu, LiRong Dai

Considering that visual information helps to improve speech recognition performance in noisy scenes, in this work we propose a multichannel multi-modal speech self-supervised learning framework AV-wav2vec2, which utilizes video and multichannel audio data as inputs.

Audio-Visual Speech Recognition Automatic Speech Recognition +7

Paper
Code

Single-Pass PCA of Large High-Dimensional Data

no code implementations • 25 Apr 2017 • Wenjian Yu, Yu Gu, Jian Li, Shenghua Liu, Yaohang Li

Principal component analysis (PCA) is a fundamental dimension reduction tool in statistics and machine learning.

Dimensionality Reduction Vocal Bursts Intensity Prediction

Paper
Add Code

ByteSing: A Chinese Singing Voice Synthesis System Using Duration Allocated Encoder-Decoder Acoustic Models and WaveRNN Vocoders

no code implementations • 23 Apr 2020 • Yu Gu, Xiang Yin, Yonghui Rao, Yuan Wan, Benlai Tang, Yang Zhang, Jitong Chen, Yuxuan Wang, Zejun Ma

This paper presents ByteSing, a Chinese singing voice synthesis (SVS) system based on duration allocated Tacotron-like acoustic models and WaveRNN neural vocoders.

Singing Voice Synthesis

Paper
Add Code

BeSense: Leveraging WiFi Channel Data and Computational Intelligence for Behavior Analysis

no code implementations • 13 Jul 2019 • Yu Gu, Xiang Zhang, Zhi Liu, Fuji Ren

The ever evolving informatics technology has gradually bounded human and computer in a compact way.

Paper
Add Code

Cooperative Extended State Observer Based Control of Vehicle Platoons With Arbitrarily Small Time Headway

no code implementations • 21 Apr 2020 • Anquan Liu, Tao Li, Yu Gu, Haohui Dai

By using the stability theory of perturbed linear systems, we show that the control parameters can be properly designed to ensure the closed-loop and L2 string stabilities for any given positive time headway.

Paper
Add Code

Multi-Classifier Interactive Learning for Ambiguous Speech Emotion Recognition

no code implementations • 10 Dec 2020 • Ying Zhou, Xuefeng Liang, Yu Gu, Yifei Yin, Longshan Yao

In recent years, speech emotion recognition technology is of great significance in industrial applications such as call centers, social robots and health care.

Speech Emotion Recognition speech-recognition +1

Paper
Add Code

Search Planning of a UAV/UGV Team with Localization Uncertainty in a Subterranean Environment

no code implementations • 11 Feb 2021 • Matteo De Petrillo, Jared Beard, Yu Gu, Jason N. Gross

We present a waypoint planning algorithm for an unmanned aerial vehicle (UAV) that is teamed with an unmanned ground vehicle (UGV) for the task of search and rescue in a subterranean environment.

Simultaneous Localization and Mapping Trajectory Planning Robotics

Paper
Add Code

SentiPrompt: Sentiment Knowledge Enhanced Prompt-Tuning for Aspect-Based Sentiment Analysis

no code implementations • 17 Sep 2021 • Chengxi Li, Feiyu Gao, Jiajun Bu, Lu Xu, Xiang Chen, Yu Gu, Zirui Shao, Qi Zheng, Ningyu Zhang, Yongpan Wang, Zhi Yu

We inject sentiment knowledge regarding aspects, opinions, and polarities into prompt and explicitly model term relations via constructing consistency and polarity judgment templates from the ground truth triplets.

Aspect-Based Sentiment Analysis Aspect-Based Sentiment Analysis (ABSA) +3

Paper
Add Code

Fine-Tuning Large Neural Language Models for Biomedical Natural Language Processing

no code implementations • 15 Dec 2021 • Robert Tinn, Hao Cheng, Yu Gu, Naoto Usuyama, Xiaodong Liu, Tristan Naumann, Jianfeng Gao, Hoifung Poon

Overall, domainspecific vocabulary and pretraining facilitate more robust models for fine-tuning.

text similarity Transfer Learning

Paper
Add Code

HTGN-BTW: Heterogeneous Temporal Graph Network with Bi-Time-Window Training Strategy for Temporal Link Prediction

no code implementations • 25 Feb 2022 • Chongjian Yue, Lun Du, Qiang Fu, Wendong Bi, Hengyu Liu, Yu Gu, Di Yao

The Temporal Link Prediction task of WSDM Cup 2022 expects a single model that can work well on two kinds of temporal graphs simultaneously, which have quite different characteristics and data properties, to predict whether a link of a given type will occur between two given nodes within a given time span.

Link Prediction

Paper
Add Code

Combining deep learning and crowdsourcing geo-images to predict housing quality in rural China

no code implementations • 15 Aug 2022 • Weipan Xu, Yu Gu, Yifan Chen, Yongtian Wang, Weihuan Deng, Xun Li

Housing quality is an essential proxy for regional wealth, security and health.

Paper
Add Code

Knowledge Base Question Answering: A Semantic Parsing Perspective

no code implementations • 12 Sep 2022 • Yu Gu, Vardaan Pahuja, Gong Cheng, Yu Su

In this survey, we situate KBQA in the broader literature of semantic parsing and give a comprehensive account of how existing KBQA approaches attempt to address the unique challenges.

Attribute Knowledge Base Question Answering +2

Paper
Add Code

Event-Triggered Extended State Observer Based Distributed Control of Nonlinear Vehicle Platoons

no code implementations • 16 Sep 2022 • Anquan Liu, Tao Li, Yu Gu

Finally, we give the range of the control parameters to ensure the stability of the vehicle platoon system.

Paper
Add Code

CHGNN: A Semi-Supervised Contrastive Hypergraph Learning Network

no code implementations • 10 Mar 2023 • Yumeng Song, Yu Gu, Tianyi Li, Jianzhong Qi, Zhenghao Liu, Christian S. Jensen, Ge Yu

However, recent studies on hypergraph learning that extend graph convolutional networks to hypergraphs cannot learn effectively from features of unlabeled data.

Contrastive Learning Node Classification

Paper
Add Code

Eeg2vec: Self-Supervised Electroencephalographic Representation Learning

no code implementations • 23 May 2023 • Qiushi Zhu, Xiaoying Zhao, Jie Zhang, Yu Gu, Chao Weng, Yuchen Hu

Recently, many efforts have been made to explore how the brain processes speech using electroencephalographic (EEG) signals, where deep learning-based approaches were shown to be applicable in this field.

EEG Representation Learning

Paper
Add Code

ReSup: Reliable Label Noise Suppression for Facial Expression Recognition

1 code implementation • 29 May 2023 • Xiang Zhang, Yan Lu, Huan Yan, Jingyang Huang, Yusheng Ji, Yu Gu

To further enhance the reliability of our noise decision results, ReSup uses two networks to jointly achieve noise suppression.

Facial Expression Recognition Facial Expression Recognition (FER)

Paper
Code

Self-supervised Interest Point Detection and Description for Fisheye and Perspective Images

no code implementations • 2 Jun 2023 • Marcela Mera-Trujillo, Shivang Patel, Yu Gu, Gianfranco Doretto

Keypoint detection and matching is a fundamental task in many computer vision problems, from shape reconstruction, to structure from motion, to AR/VR applications and robotics.

Interest Point Detection Keypoint Detection

Paper
Add Code

Distilling Large Language Models for Biomedical Knowledge Extraction: A Case Study on Adverse Drug Events

no code implementations • 12 Jul 2023 • Yu Gu, Sheng Zhang, Naoto Usuyama, Yonas Woldesenbet, Cliff Wong, Praneeth Sanapathi, Mu Wei, Naveen Valluri, Erika Strandberg, Tristan Naumann, Hoifung Poon

We find that while LLMs already possess decent competency in structuring biomedical text, by distillation into a task-specific student model through self-supervised learning, substantial gains can be attained over out-of-box LLMs, with additional advantages such as cost, efficiency, and white-box model access.

Self-Supervised Learning

Paper
Add Code

Scaling Clinical Trial Matching Using Large Language Models: A Case Study in Oncology

no code implementations • 4 Aug 2023 • Cliff Wong, Sheng Zhang, Yu Gu, Christine Moung, Jacob Abel, Naoto Usuyama, Roshanthi Weerasinghe, Brian Piening, Tristan Naumann, Carlo Bifulco, Hoifung Poon

Clinical trial matching is a key process in health delivery and discovery.

Paper
Add Code

UniversalNER: Targeted Distillation from Large Language Models for Open Named Entity Recognition

no code implementations • 7 Aug 2023 • Wenxuan Zhou, Sheng Zhang, Yu Gu, Muhao Chen, Hoifung Poon

Instruction tuning has proven effective for distilling LLMs into more cost-efficient models such as Alpaca and Vicuna.

Ranked #1 on Named Entity Recognition (NER) on NCBI Disease

named-entity-recognition Named Entity Recognition +2

Paper
Add Code

Rep2wav: Noise Robust text-to-speech Using self-supervised representations

no code implementations • 28 Aug 2023 • Qiushi Zhu, Yu Gu, Rilin Chen, Chao Weng, Yuchen Hu, LiRong Dai, Jie Zhang

Noise-robust TTS models are often trained using the enhanced speech, which thus suffer from speech distortion and background noise that affect the quality of the synthesized speech.

Speech Enhancement

Paper
Add Code

DurIAN-E: Duration Informed Attention Network For Expressive Text-to-Speech Synthesis

no code implementations • 22 Sep 2023 • Yu Gu, Yianrao Bian, Guangzhi Lei, Chao Weng, Dan Su

This paper introduces an improved duration informed attention neural network (DurIAN-E) for expressive and high-fidelity text-to-speech (TTS) synthesis.

Denoising Speech Synthesis +1

Paper
Add Code

ONNXExplainer: an ONNX Based Generic Framework to Explain Neural Networks Using Shapley Values

no code implementations • 29 Sep 2023 • Yong Zhao, Runxin He, Nicholas Kersting, Can Liu, Shubham Agrawal, Chiranjeet Chetia, Yu Gu

SHAP package is a leading implementation of Shapley values to explain neural networks implemented in TensorFlow or PyTorch but lacks cross-platform support, one-shot deployment and is highly inefficient.

Paper
Add Code

BiomedJourney: Counterfactual Biomedical Image Generation by Instruction-Learning from Multimodal Patient Journeys

no code implementations • 16 Oct 2023 • Yu Gu, Jianwei Yang, Naoto Usuyama, Chunyuan Li, Sheng Zhang, Matthew P. Lungren, Jianfeng Gao, Hoifung Poon

In a comprehensive battery of tests on counterfactual medical image generation, BiomedJourney substantially outperforms prior state-of-the-art methods in instruction image editing and medical image generation such as InstructPix2Pix and RoentGen.

counterfactual Denoising +2

Paper
Add Code

Modeling User Viewing Flow Using Large Language Models for Article Recommendation

no code implementations • 12 Nov 2023 • Zhenghao Liu, Zulong Chen, Moufeng Zhang, Shaoyang Duan, Hong Wen, Liangyue Li, Nan Li, Yu Gu, Ge Yu

This paper proposes the User Viewing Flow Modeling (SINGLE) method for the article recommendation task, which models the user constant preference and instant interest from user-clicked articles.

Paper
Add Code

Comprehensive Evaluation of GNN Training Systems: A Data Management Perspective

no code implementations • 22 Nov 2023 • Hao Yuan, Yajiong Liu, Yanfeng Zhang, Xin Ai, Qiange Wang, Chaoyi Chen, Yu Gu, Ge Yu

Many Graph Neural Network (GNN) training systems have emerged recently to support efficient GNN training.

Management

Paper
Add Code

NeutronOrch: Rethinking Sample-based GNN Training under CPU-GPU Heterogeneous Environments

no code implementations • 22 Nov 2023 • Xin Ai, Qiange Wang, Chunyu Cao, Yanfeng Zhang, Chaoyi Chen, Hao Yuan, Yu Gu, Ge Yu

After extensive experiments and analysis, we find that existing task orchestrating methods fail to fully utilize the heterogeneous resources, limited by inefficient CPU processing or GPU resource contention.

Paper
Add Code

NeutronStream: A Dynamic GNN Training Framework with Sliding Window for Graph Streams

no code implementations • 5 Dec 2023 • Chaoyi Chen, Dechao Gao, Yanfeng Zhang, Qiange Wang, Zhenbo Fu, Xuecang Zhang, Junhua Zhu, Yu Gu, Ge Yu

Though many dynamic GNN models have emerged to learn from evolving graphs, the training process of these dynamic GNNs is dramatically different from traditional GNNs in that it captures both the spatial and temporal dependencies of graph updates.

Paper
Add Code

Bringing Back the Context: Camera Trap Species Identification as Link Prediction on Multimodal Knowledge Graphs

no code implementations • 31 Dec 2023 • Vardaan Pahuja, Weidi Luo, Yu Gu, Cheng-Hao Tu, Hong-You Chen, Tanya Berger-Wolf, Charles Stewart, Song Gao, Wei-Lun Chao, Yu Su

In this work, we leverage the structured context associated with the camera trap images to improve out-of-distribution generalization for the task of species identification in camera traps.

Knowledge Graphs Link Prediction +1

Paper
Add Code

Foundation Models for Biomedical Image Segmentation: A Survey

no code implementations • 15 Jan 2024 • Ho Hin Lee, Yu Gu, Theodore Zhao, Yanbo Xu, Jianwei Yang, Naoto Usuyama, Cliff Wong, Mu Wei, Bennett A. Landman, Yuankai Huo, Alberto Santamaria-Pang, Hoifung Poon

This transformative technology, originally developed for general-purpose computer vision, has found rapid application in medical image processing.

Image Segmentation Semantic Segmentation +1

Paper
Add Code

LR-CNN: Lightweight Row-centric Convolutional Neural Network Training for Memory Reduction

no code implementations • 21 Jan 2024 • Zhigang Wang, Hangyu Yang, Ning Wang, Chuanfei Xu, Jie Nie, Zhiqiang Wei, Yu Gu, Ge Yu

However, training its complex network is very space-consuming, since a lot of intermediate data are preserved across layers, especially when processing high-dimension inputs with a big batch size.

Paper
Add Code

LegalDuet: Learning Effective Representations for Legal Judgment Prediction through a Dual-View Legal Clue Reasoning

no code implementations • 27 Jan 2024 • Pengjie Liu, Zhenghao Liu, Xiaoyuan Yi, Liner Yang, Shuo Wang, Yu Gu, Ge Yu, Xing Xie, Shuang-Hua Yang

It proposes a dual-view legal clue reasoning mechanism, which derives from two reasoning chains of judges: 1) Law Case Reasoning, which makes legal judgments according to the judgment experiences learned from analogy/confusing legal cases; 2) Legal Ground Reasoning, which lies in matching the legal clues between criminal cases and legal decisions.

Paper
Add Code

Middleware for LLMs: Tools Are Instrumental for Language Agents in Complex Environments

no code implementations • 22 Feb 2024 • Yu Gu, Yiheng Shu, Hao Yu, Xiao Liu, Yuxiao Dong, Jie Tang, Jayanth Srinivasa, Hugo Latapie, Yu Su

The applications of large language models (LLMs) have expanded well beyond the confines of text processing, signaling a new era where LLMs are envisioned as generalist language agents capable of operating within complex real-world environments.

Paper
Add Code

Training Small Multimodal Models to Bridge Biomedical Competency Gap: A Case Study in Radiology Imaging

no code implementations • 12 Mar 2024 • Juan Manuel Zambrano Chaves, Shih-Cheng Huang, Yanbo Xu, Hanwen Xu, Naoto Usuyama, Sheng Zhang, Fei Wang, Yujia Xie, Mahmoud Khademi, ZiYi Yang, Hany Awadalla, Julia Gong, Houdong Hu, Jianwei Yang, Chunyuan Li, Jianfeng Gao, Yu Gu, Cliff Wong, Mu Wei, Tristan Naumann, Muhao Chen, Matthew P. Lungren, Serena Yeung-Levy, Curtis P. Langlotz, Sheng Wang, Hoifung Poon

Frontier models such as GPT-4V still have major competency gaps in multimodal capabilities for biomedical applications.

Cross-Modal Retrieval

Paper
Add Code

Cannot find the paper you are looking for? You can Submit a new open access paper.