Search Results for author: Sheng Zhang

Found 103 papers, 32 papers with code

BioGPT: Generative Pre-trained Transformer for Biomedical Text Generation and Mining

2 code implementations19 Oct 2022 Renqian Luo, Liai Sun, Yingce Xia, Tao Qin, Sheng Zhang, Hoifung Poon, Tie-Yan Liu

Pre-trained language models have attracted increasing attention in the biomedical domain, inspired by their great success in the general natural language domain.

 Ranked #1 on Document Classification on HOC (Micro F1 metric)

Document Classification Language Modelling +3

DouZero: Mastering DouDizhu with Self-Play Deep Reinforcement Learning

1 code implementation11 Jun 2021 Daochen Zha, Jingru Xie, Wenye Ma, Sheng Zhang, Xiangru Lian, Xia Hu, Ji Liu

Games are abstractions of the real world, where artificial agents learn to compete and cooperate with other agents.

Game of Poker Multi-agent Reinforcement Learning +2

Stacked Filters Stationary Flow For Hardware-Oriented Acceleration Of Deep Convolutional Neural Networks

1 code implementation23 Jan 2018 Yuechao Gao, Nianhong Liu, Sheng Zhang

To address memory and computation resource limitations for hardware-oriented acceleration of deep convolutional neural networks (CNNs), we present a computation flow, stacked filters stationary flow (SFS), and a corresponding data encoding format, relative indexed compressed sparse filter format (CSF), to make the best of data sparsity, and simplify data handling at execution time.

Deep Generalized Canonical Correlation Analysis

3 code implementations WS 2019 Adrian Benton, Huda Khayrallah, Biman Gujral, Dee Ann Reisinger, Sheng Zhang, Raman Arora

We present Deep Generalized Canonical Correlation Analysis (DGCCA) -- a method for learning nonlinear transformations of arbitrarily many views of data, such that the resulting transformations are maximally informative of each other.

Representation Learning Stochastic Optimization

AMR Parsing as Sequence-to-Graph Transduction

1 code implementation ACL 2019 Sheng Zhang, Xutai Ma, Kevin Duh, Benjamin Van Durme

Our experimental results outperform all previously reported SMATCH scores, on both AMR 2. 0 (76. 3% F1 on LDC2017T10) and AMR 1. 0 (70. 2% F1 on LDC2014T12).

AMR Parsing

Optimizing Bi-Encoder for Named Entity Recognition via Contrastive Learning

1 code implementation30 Aug 2022 Sheng Zhang, Hao Cheng, Jianfeng Gao, Hoifung Poon

We present a bi-encoder framework for named entity recognition (NER), which applies contrastive learning to map candidate text spans and entity types into the same vector representation space.

Contrastive Learning Metric Learning +5

UNITE: A Unified Benchmark for Text-to-SQL Evaluation

1 code implementation25 May 2023 Wuwei Lan, Zhiguo Wang, Anuj Chauhan, Henghui Zhu, Alexander Li, Jiang Guo, Sheng Zhang, Chung-Wei Hang, Joseph Lilien, Yiqun Hu, Lin Pan, Mingwen Dong, Jun Wang, Jiarong Jiang, Stephen Ash, Vittorio Castelli, Patrick Ng, Bing Xiang

A practical text-to-SQL system should generalize well on a wide variety of natural language questions, unseen database schemas, and novel SQL query structures.

Text-To-SQL

PromptCAL: Contrastive Affinity Learning via Auxiliary Prompts for Generalized Novel Category Discovery

1 code implementation CVPR 2023 Sheng Zhang, Salman Khan, Zhiqiang Shen, Muzammal Naseer, Guangyi Chen, Fahad Khan

The GNCD setting aims to categorize unlabeled training data coming from known and novel classes by leveraging the information of partially labeled known classes.

Graph Generation

Context-faithful Prompting for Large Language Models

1 code implementation20 Mar 2023 Wenxuan Zhou, Sheng Zhang, Hoifung Poon, Muhao Chen

However, their reliance on parametric knowledge may cause them to overlook contextual cues, leading to incorrect predictions in context-sensitive NLP tasks (e. g., knowledge acquisition tasks).

counterfactual Machine Reading Comprehension +1

Towards Realistic Zero-Shot Classification via Self Structural Semantic Alignment

1 code implementation24 Aug 2023 Sheng Zhang, Muzammal Naseer, Guangyi Chen, Zhiqiang Shen, Salman Khan, Kun Zhang, Fahad Khan

To address this challenge, we propose the Self Structural Semantic Alignment (S^3A) framework, which extracts the structural semantic information from unlabeled data while simultaneously self-learning.

Self-Learning Zero-Shot Learning

DecAF: Joint Decoding of Answers and Logical Forms for Question Answering over Knowledge Bases

1 code implementation30 Sep 2022 Donghan Yu, Sheng Zhang, Patrick Ng, Henghui Zhu, Alexander Hanbo Li, Jun Wang, Yiqun Hu, William Wang, Zhiguo Wang, Bing Xiang

Question answering over knowledge bases (KBs) aims to answer natural language questions with factual information such as entities and relations in KBs.

Entity Linking Question Answering +2

FairLex: A Multilingual Benchmark for Evaluating Fairness in Legal Text Processing

1 code implementation ACL 2022 Ilias Chalkidis, Tommaso Pasini, Sheng Zhang, Letizia Tomada, Sebastian Felix Schwemer, Anders Søgaard

We present a benchmark suite of four datasets for evaluating the fairness of pre-trained language models and the techniques used to fine-tune them for downstream tasks.

Fairness

Neural-Davidsonian Semantic Proto-role Labeling

1 code implementation EMNLP 2018 Rachel Rudinger, Adam Teichert, Ryan Culkin, Sheng Zhang, Benjamin Van Durme

We present a model for semantic proto-role labeling (SPRL) using an adapted bidirectional LSTM encoding strategy that we call "Neural-Davidsonian": predicate-argument structure is represented as pairs of hidden states corresponding to predicate and argument head tokens of the input sequence.

Attribute

Joint Universal Syntactic and Semantic Parsing

1 code implementation12 Apr 2021 Elias Stengel-Eskin, Kenton Murray, Sheng Zhang, Aaron Steven White, Benjamin Van Durme

While numerous attempts have been made to jointly parse syntax and semantics, high performance in one domain typically comes at the price of performance in the other.

Semantic Parsing

T-Rex: Text-assisted Retrosynthesis Prediction

1 code implementation26 Jan 2024 Yifeng Liu, Hanwen Xu, Tangqi Fang, Haocheng Xi, Zixuan Liu, Sheng Zhang, Hoifung Poon, Sheng Wang

As a fundamental task in computational chemistry, retrosynthesis prediction aims to identify a set of reactants to synthesize a target molecule.

Re-Ranking Retrosynthesis

Auction-Based Combinatorial Multi-Armed Bandit Mechanisms with Strategic Arms

1 code implementation IEEE Conference on Computer Communications 2021 Guoju Gao, He Huang, Mingjun Xiao, Jie Wu, Yu-E Sun, Sheng Zhang

The multi-armed bandit (MAB) model has been deeply studied to solve many online learning problems, such as rate allocation in communication networks, Ad recommendation in social networks, etc.

Computational Efficiency

Dynamic Multimodal Information Bottleneck for Multimodality Classification

1 code implementation2 Nov 2023 Yingying Fang, Shuang Wu, Sheng Zhang, Chaoyan Huang, Tieyong Zeng, Xiaodan Xing, Simon Walsh, Guang Yang

Specifically, our information bottleneck module serves to filter out the task-irrelevant information and noises in the fused feature, and we further introduce a sufficiency loss to prevent dropping of task-relevant information, thus explicitly preserving the sufficiency of prediction information in the distilled feature.

Classification Medical Diagnosis +1

Finite-Sample Analysis of Nonlinear Stochastic Approximation with Applications in Reinforcement Learning

1 code implementation27 May 2019 Zaiwei Chen, Sheng Zhang, Thinh T. Doan, John-Paul Clarke, Siva Theja Maguluri

To demonstrate the generality of our theoretical results on Markovian SA, we use it to derive the finite-sample bounds of the popular $Q$-learning with linear function approximation algorithm, under a condition on the behavior policy.

Q-Learning reinforcement-learning +1

Worst-Case-Aware Curriculum Learning for Zero and Few Shot Transfer

1 code implementation23 Sep 2020 Sheng Zhang, Xin Zhang, Weiming Zhang, Anders Søgaard

Multi-task transfer learning based on pre-trained language encoders achieves state-of-the-art performance across a range of tasks.

Transfer Learning

Zero-Shot Dependency Parsing with Worst-Case Aware Automated Curriculum Learning

1 code implementation ACL 2022 Miryam de Lhoneux, Sheng Zhang, Anders Søgaard

Large multilingual pretrained language models such as mBERT and XLM-RoBERTa have been found to be surprisingly effective for cross-lingual transfer of syntactic parsing models (Wu and Dredze 2019), but only between related languages.

Cross-Lingual Transfer Dependency Parsing +1

Halo: Learning Semantics-Aware Representations for Cross-Lingual Information Extraction

no code implementations SEMEVAL 2018 Hongyuan Mei, Sheng Zhang, Kevin Duh, Benjamin Van Durme

Cross-lingual information extraction (CLIE) is an important and challenging task, especially in low resource scenarios.

TAG

Cross-lingual Semantic Parsing

no code implementations21 Apr 2018 Sheng Zhang, Kevin Duh, Benjamin Van Durme

We introduce the task of cross-lingual semantic parsing: mapping content provided in a source language into a meaning representation based on a target language.

Semantic Parsing

FPGA Implementations of 3D-SIMD Processor Architecture for Deep Neural Networks Using Relative Indexed Compressed Sparse Filter Encoding Format and Stacked Filters Stationary Flow

no code implementations28 Mar 2018 Yuechao Gao, Nianhong Liu, Sheng Zhang

It is a challenging task to deploy computationally and memory intensive State-of-the-art deep neural networks (DNNs) on embedded systems with limited hardware resources and power budgets.

Random Euler Complex-Valued Nonlinear Filters

no code implementations2 Jan 2018 Jiashu Zhang, Sheng Zhang, Defang Li

Over the last decade, both the neural network and kernel adaptive filter have successfully been used for nonlinear signal processing.

Feature Enhancement Network: A Refined Scene Text Detector

no code implementations12 Nov 2017 Sheng Zhang, Yuliang Liu, Lianwen Jin, Canjie Luo

In this paper, we propose a refined scene text detector with a \textit{novel} Feature Enhancement Network (FEN) for Region Proposal and Text Detection Refinement.

object-detection Object Detection +3

Ordinal Common-sense Inference

no code implementations TACL 2017 Sheng Zhang, Rachel Rudinger, Kevin Duh, Benjamin Van Durme

Humans have the capacity to draw common-sense inferences from natural language: various things that are likely but not certain to hold based on established discourse, and are rarely stated explicitly.

Common Sense Reasoning Natural Language Inference

Randomized Structural Sparsity based Support Identification with Applications to Locating Activated or Discriminative Brain Areas: A Multi-center Reproducibility Study

no code implementations7 Jun 2015 Yi-Lun Wang, Sheng Zhang, Junjie Zheng, Heng Chen, Huafu Chen

In this paper, we focus on how to locate the relevant or discriminative brain regions related with external stimulus or certain mental decease, which is also called support identification, based on the neuroimaging data.

feature selection

Randomized Structural Sparsity via Constrained Block Subsampling for Improved Sensitivity of Discriminative Voxel Identification

no code implementations17 Oct 2014 Yi-Lun Wang, Junjie Zheng, Sheng Zhang, Xujun Duan, Huafu Chen

In this paper, we consider voxel selection for functional Magnetic Resonance Imaging (fMRI) brain data with the aim of finding a more complete set of probably correlated discriminative voxels, thus improving interpretation of the discovered potential biomarkers.

feature selection

Cross-lingual Decompositional Semantic Parsing

no code implementations EMNLP 2018 Sheng Zhang, Xutai Ma, Rachel Rudinger, Kevin Duh, Benjamin Van Durme

We introduce the task of cross-lingual decompositional semantic parsing: mapping content provided in a source language into a decompositional semantic analysis based on a target language.

Semantic Parsing

Selective Decoding for Cross-lingual Open Information Extraction

no code implementations IJCNLP 2017 Sheng Zhang, Kevin Duh, Benjamin Van Durme

Cross-lingual open information extraction is the task of distilling facts from the source language into representations in the target language.

Machine Translation Open Information Extraction

Neural Machine Reading Comprehension: Methods and Trends

no code implementations2 Jul 2019 Shanshan Liu, Xin Zhang, Sheng Zhang, Hui Wang, Weiming Zhang

Machine reading comprehension (MRC), which requires a machine to answer questions based on a given context, has attracted increasing attention with the incorporation of various deep-learning techniques over the past few years.

Machine Reading Comprehension

SubTSBR to tackle high noise and outliers for data-driven discovery of differential equations

no code implementations17 Jul 2019 Sheng Zhang, Guang Lin

We demonstrate how to use our algorithm step by step and compare our algorithm with threshold sparse Bayesian regression (TSBR) for the discovery of differential equations.

Bayesian Inference regression

Broad-Coverage Semantic Parsing as Transduction

no code implementations IJCNLP 2019 Sheng Zhang, Xutai Ma, Kevin Duh, Benjamin Van Durme

We unify different broad-coverage semantic parsing tasks under a transduction paradigm, and propose an attention-based neural framework that incrementally builds a meaning representation via a sequence of semantic relations.

AMR Parsing UCCA Parsing

Universal Decompositional Semantic Parsing

no code implementations ACL 2020 Elias Stengel-Eskin, Aaron Steven White, Sheng Zhang, Benjamin Van Durme

We introduce a transductive model for parsing into Universal Decompositional Semantics (UDS) representations, which jointly learns to map natural language utterances into UDS graph structures and annotate the graph with decompositional semantic attribute scores.

Attribute Semantic Parsing

Quantum Communication between Multiplexed Atomic Quantum Memories

no code implementations5 Sep 2019 Chang Li, Nan Jiang, Yukai Wu, Wei Chang, Yunfei Pu, Sheng Zhang, Lu-Ming Duan

The use of multiplexed atomic quantum memories (MAQM) can significantly enhance the efficiency to establish entanglement in a quantum network.

Quantum Physics

Discrete Weierstrass Fourier Transform and Experiments

no code implementations12 Feb 2015 Sheng Zhang, Brendan Harding

We established a new method called Discrete Weierstrass Fourier Transform, a faster and more generalized Discrete Fourier Transform, to approximate discrete data.

Numerical Analysis

GraphCGAN: Convolutional Graph Neural Network with Generative Adversarial Networks

no code implementations1 Jan 2021 Sheng Zhang, Rui Song, Wenbin Lu

In a number of experiments on benchmark datasets, we show that the proposed GraphCGAN outperforms the baseline methods by a significant margin.

Augmented Gaussian Random Field: Theory and Computation

no code implementations3 Sep 2020 Sheng Zhang, Xiu Yang, Samy Tindel, Guang Lin

We prove that under certain conditions, the observable and its derivatives of any order are governed by a single Gaussian random field, which is the aforementioned AGRF.

Statistics Theory Probability Statistics Theory

A submetric characterization of Rolewicz's property ($β$)

no code implementations21 Jan 2021 Sheng Zhang

The main result is a submetric characterization of the class of Banach spaces admitting an equivalent norm with Rolewicz's property ($\beta$).

Functional Analysis

Experimental demonstration of memory-enhanced scaling for entanglement connection of quantum repeater segments

no code implementations21 Jan 2021 Yunfei Pu, Sheng Zhang, Yukai Wu, Nan Jiang, Wei Chang, Chang Li, Luming Duan

The experimental realization of entanglement connection of two quantum repeater segments with an efficient memory-enhanced scaling demonstrates a key advantage of the quantum repeater protocol, which makes a cornerstone towards future large-scale quantum networks.

Quantum Physics

L-SNet: from Region Localization to Scale Invariant Medical Image Segmentation

no code implementations11 Feb 2021 Jiahao Xie, Sheng Zhang, Jianwei Lu, Ye Luo

Coarse-to-fine models and cascade segmentation architectures are widely adopted to solve the problem of large scale variations in medical image segmentation.

Image Segmentation Medical Image Segmentation +2

TransHash: Transformer-based Hamming Hashing for Efficient Image Retrieval

no code implementations5 May 2021 Yongbiao Chen, Sheng Zhang, Fangxin Liu, Zhigang Chang, Mang Ye, Zhengwei Qi

Until now, the deep hashing for the image retrieval community has been dominated by convolutional neural network architectures, e. g. \texttt{Resnet}\cite{he2016deep}.

Deep Hashing Image Retrieval

Anomalous phase separation dynamics in a correlated electron system: machine-learning enabled large-scale kinetic Monte Carlo simulations

no code implementations27 May 2021 Sheng Zhang, Puhan Zhang, Gia-Wei Chern

With the aid of modern machine learning methods, we demonstrate the first-ever large-scale kinetic Monte Carlo simulations of the phase separation process for the Falicov-Kimball model, which is one of the canonical strongly correlated electron systems.

BIG-bench Machine Learning

Real-time Keypoints Detection for Autonomous Recovery of the Unmanned Ground Vehicle

no code implementations27 Jul 2021 Jie Li, Sheng Zhang, Kai Han, Xia Yuan, Chunxia Zhao, Yu Liu

UGV-KPNet is computationally efficient with a small number of parameters and provides pixel-level accurate keypoints detection results in real-time.

Keypoint Detection

Modular Self-Supervision for Document-Level Relation Extraction

no code implementations EMNLP 2021 Sheng Zhang, Cliff Wong, Naoto Usuyama, Sarthak Jain, Tristan Naumann, Hoifung Poon

Extracting relations across large text spans has been relatively underexplored in NLP, but it is particularly important for high-value domains such as biomedicine, where obtaining high recall of the latest findings is crucial for practical applications.

Document-level Relation Extraction Reading Comprehension +1

Sociolectal Analysis of Pretrained Language Models

no code implementations EMNLP 2021 Sheng Zhang, Xin Zhang, Weiming Zhang, Anders Søgaard

Using data from English cloze tests, in which subjects also self-reported their gender, age, education, and race, we examine performance differences of pretrained language models across demographic groups, defined by these (protected) attributes.

Finite Sample Analysis of Average-Reward TD Learning and $Q$-Learning

no code implementations NeurIPS 2021 Sheng Zhang, Zhe Zhang, Siva Theja Maguluri

The focus of this paper is on sample complexity guarantees of average-reward reinforcement learning algorithms, which are known to be more challenging to study than their discounted-reward counterparts.

Q-Learning

Can depth-adaptive BERT perform better on binary classification tasks

no code implementations22 Nov 2021 Jing Fan, Xin Zhang, Sheng Zhang, Yan Pan, Lixiang Guo

In light of the success of transferring language models into NLP tasks, we ask whether the full BERT model is always the best and does it exist a simple but effective method to find the winning ticket in state-of-the-art deep neural networks without complex calculations.

Binary Classification Text Classification

A Review on Graph Neural Network Methods in Financial Applications

no code implementations27 Nov 2021 Jianian Wang, Sheng Zhang, Yanghua Xiao, Rui Song

With multiple components and relations, financial data are often presented as graph data, since it could represent both the individual features and the complicated relations.

DVHN: A Deep Hashing Framework for Large-scale Vehicle Re-identification

no code implementations9 Dec 2021 Yongbiao Chen, Sheng Zhang, Fangxin Liu, Chenggang Wu, Kaicheng Guo, Zhengwei Qi

Specifically, we directly constrain the output from the convolutional neural network to be discrete binary codes and ensure the learned binary codes are optimal for classification.

Deep Hashing Playing the Game of 2048 +1

Descriptors for Machine Learning Model of Generalized Force Field in Condensed Matter Systems

no code implementations3 Jan 2022 Puhan Zhang, Sheng Zhang, Gia-Wei Chern

A general theory of the descriptor for the classical fields is formulated, and two types of models are distinguished depending on the presence or absence of an internal symmetry for the classical field.

BIG-bench Machine Learning

Knowledge-Rich Self-Supervision for Biomedical Entity Linking

no code implementations15 Dec 2021 Sheng Zhang, Hao Cheng, Shikhar Vashishth, Cliff Wong, Jinfeng Xiao, Xiaodong Liu, Tristan Naumann, Jianfeng Gao, Hoifung Poon

Zero-shot entity linking has emerged as a promising direction for generalizing to new entities, but it still requires example gold entity mentions during training and canonical descriptions for all entities, both of which are rarely available outside of Wikipedia.

Contrastive Learning Entity Linking

Rule Mining over Knowledge Graphs via Reinforcement Learning

no code implementations21 Feb 2022 Lihan Chen, Sihang Jiang, Jingping Liu, Chao Wang, Sheng Zhang, Chenhao Xie, Jiaqing Liang, Yanghua Xiao, Rui Song

Knowledge graphs (KGs) are an important source repository for a wide range of applications and rule mining from KGs recently attracts wide research interest in the KG-related research community.

Knowledge Graphs reinforcement-learning +1

Machine learning predictions for local electronic properties of disordered correlated electron systems

no code implementations12 Apr 2022 Yi-Hsuan Liu, Sheng Zhang, Puhan Zhang, Ting-Kuo Lee, Gia-Wei Chern

We present a scalable machine learning (ML) model to predict local electronic properties such as on-site electron number and double occupation for disordered correlated electron systems.

BIG-bench Machine Learning Variational Monte Carlo

Locally Aggregated Feature Attribution on Natural Language Model Understanding

no code implementations NAACL 2022 Sheng Zhang, Jin Wang, Haitao Jiang, Rui Song

Some feature attribution methods have shown promising results in computer vision, especially the gradient-based methods where effectively smoothing the gradients with reference data is key to a robust and faithful result.

Language Modelling Sentiment Analysis

REKnow: Enhanced Knowledge for Joint Entity and Relation Extraction

no code implementations10 Jun 2022 Sheng Zhang, Patrick Ng, Zhiguo Wang, Bing Xiang

Our generative model is a unified framework to sequentially generate relational triplets under various relation extraction settings and explicitly utilizes relevant knowledge from Knowledge Graph (KG) to resolve ambiguities.

Joint Entity and Relation Extraction Relation

Importance of Synthesizing High-quality Data for Text-to-SQL Parsing

no code implementations17 Dec 2022 Yiyun Zhao, Jiarong Jiang, Yiqun Hu, Wuwei Lan, Henry Zhu, Anuj Chauhan, Alexander Li, Lin Pan, Jun Wang, Chung-Wei Hang, Sheng Zhang, Marvin Dong, Joe Lilien, Patrick Ng, Zhiguo Wang, Vittorio Castelli, Bing Xiang

In this paper, we first examined the existing synthesized datasets and discovered that state-of-the-art text-to-SQL algorithms did not further improve on popular benchmarks when trained with augmented synthetic data.

SQL Parsing SQL-to-Text +2

Continual Contrastive Finetuning Improves Low-Resource Relation Extraction

no code implementations21 Dec 2022 Wenxuan Zhou, Sheng Zhang, Tristan Naumann, Muhao Chen, Hoifung Poon

In this paper, we aim at bridging the gap and propose to pretrain and finetune the RE model using consistent objectives of contrastive learning.

Contrastive Learning Relation +3

Machine learning for phase ordering dynamics of charge density waves

no code implementations6 Mar 2023 Chen Cheng, Sheng Zhang, Gia-Wei Chern

We present a machine learning (ML) framework for large-scale dynamical simulations of charge density wave (CDW) states.

Regularized Shallow Image Prior for Electrical Impedance Tomography

no code implementations30 Mar 2023 Zhe Liu, Zhou Chen, Qi Wang, Sheng Zhang, Yunjie Yang

The results suggest that combining the shallow image prior and the hand-crafted regularization can achieve similar performance to the Deep Image Prior (DIP) but with less architectural dependency and complexity of the neural network.

Machine learning for structure-property relationships: Scalability and limitations

no code implementations11 Apr 2023 Zhongzheng Tian, Sheng Zhang, Gia-Wei Chern

Based on the locality assumption, ML model is developed for the prediction of intensive properties of a finite-size block.

Computational Efficiency

Semi-Asynchronous Federated Edge Learning Mechanism via Over-the-air Computation

no code implementations6 May 2023 Zhoubin Kou, Yun Ji, Xiaoxiong Zhong, Sheng Zhang

However, existing FEEL systems with AirComp scheme often employ traditional synchronous aggregation mechanisms for local model aggregation in each global round, which suffer from the stragglers issues.

Distilling Large Language Models for Biomedical Knowledge Extraction: A Case Study on Adverse Drug Events

no code implementations12 Jul 2023 Yu Gu, Sheng Zhang, Naoto Usuyama, Yonas Woldesenbet, Cliff Wong, Praneeth Sanapathi, Mu Wei, Naveen Valluri, Erika Strandberg, Tristan Naumann, Hoifung Poon

We find that while LLMs already possess decent competency in structuring biomedical text, by distillation into a task-specific student model through self-supervised learning, substantial gains can be attained over out-of-box LLMs, with additional advantages such as cost, efficiency, and white-box model access.

Self-Supervised Learning

Asynchronous Federated Learning with Incentive Mechanism Based on Contract Theory

no code implementations10 Oct 2023 Danni Yang, Yun Ji, Zhoubin Kou, Xiaoxiong Zhong, Sheng Zhang

To address the challenges posed by the heterogeneity inherent in federated learning (FL) and to attract high-quality clients, various incentive mechanisms have been employed.

Federated Learning

BiomedJourney: Counterfactual Biomedical Image Generation by Instruction-Learning from Multimodal Patient Journeys

no code implementations16 Oct 2023 Yu Gu, Jianwei Yang, Naoto Usuyama, Chunyuan Li, Sheng Zhang, Matthew P. Lungren, Jianfeng Gao, Hoifung Poon

In a comprehensive battery of tests on counterfactual medical image generation, BiomedJourney substantially outperforms prior state-of-the-art methods in instruction image editing and medical image generation such as InstructPix2Pix and RoentGen.

counterfactual Denoising +2

Code Search Debiasing:Improve Search Results beyond Overall Ranking Performance

no code implementations25 Nov 2023 Sheng Zhang, Hui Li, Yanlin Wang, Zhao Wei, Yong Xiu, Juhong Wang, Rongong Ji

To mitigate biases, we develop a general debiasing framework that employs reranking to calibrate search results.

Code Search

Machine learning force-field models for metallic spin glass

no code implementations28 Nov 2023 Menglin Shi, Sheng Zhang, Gia-Wei Chern

Metallic spin glass systems, such as dilute magnetic alloys, are characterized by randomly distributed local moments coupled to each other through a long-range electron-mediated effective interaction.

LLaVA-Med: Training a Large Language-and-Vision Assistant for Biomedicine in One Day

no code implementations NeurIPS 2023 Chunyuan Li, Cliff Wong, Sheng Zhang, Naoto Usuyama, Haotian Liu, Jianwei Yang, Tristan Naumann, Hoifung Poon, Jianfeng Gao

In this paper, we propose a cost-efficient approach for training a vision-language conversational assistant that can answer open-ended research questions of biomedical images.

Instruction Following Language Modelling +2

Aircraft Landing Time Prediction with Deep Learning on Trajectory Images

no code implementations2 Jan 2024 Liping Huang, Sheng Zhang, YiCheng Zhang, Yi Zhang, Yifang Yin

Aircraft landing time (ALT) prediction is crucial for air traffic management, especially for arrival aircraft sequencing on the runway.

SpineCLUE: Automatic Vertebrae Identification Using Contrastive Learning and Uncertainty Estimation

no code implementations14 Jan 2024 Sheng Zhang, Minheng Chen, Junxian Wu, Ziyue Zhang, Tonglong Li, Cheng Xue, Youyong Kong

In this paper, we propose a three-stage method to address the challenges in 3D CT vertebrae identification at vertebrae-level.

Contrastive Learning

Make it more specific: A novel uncertainty based airway segmentation application on 3D U-Net and its variants

no code implementations12 Feb 2024 Shiyi Wang, Yang Nan, Felder Federico N, Sheng Zhang, Walsh Simon L F, Guang Yang

The most popular algorithms in medical segmentation, 3D U-Net and its variants, can directly implement the task of lung trachea segmentation, but its failure to consider the special tree-like structure of the trachea suggests that there is much room for improvement in its segmentation accuracy.

Segmentation

Coarsening of chiral domains in itinerant electron magnets: A machine learning force field approach

no code implementations18 Mar 2024 Yunhao Fan, Sheng Zhang, Gia-Wei Chern

While the chiral phase is described by a broken $Z_2$ Ising-type symmetry, we find that the characteristic size of chiral domains increases linearly with time, in stark contrast to the expected Allen-Cahn domain growth law for a non-conserved Ising order parameter field.

Cannot find the paper you are looking for? You can Submit a new open access paper.