Search Results for author: Zexue He

Found 18 papers, 6 papers with code

Learning Robust Representations by Projecting Superficial Statistics Out

no code implementations ICLR 2019 Haohan Wang, Zexue He, Zachary C. Lipton, Eric P. Xing

We test our method on the battery of standard domain generalization data sets and, interestingly, achieve comparable or better performance as compared to other domain generalization methods that explicitly require samples from the target distribution for training.

Domain Generalization

Weakly Supervised Contrastive Learning for Chest X-Ray Report Generation

1 code implementation Findings (EMNLP) 2021 An Yan, Zexue He, Xing Lu, Jiang Du, Eric Chang, Amilcare Gentili, Julian McAuley, Chun-Nan Hsu

Radiology report generation aims at generating descriptive text from radiology images automatically, which may present an opportunity to improve radiology reporting and interpretation.

Contrastive Learning Descriptive +2

Leashing the Inner Demons: Self-Detoxification for Language Models

no code implementations 6 Mar 2022 Canwen Xu, Zexue He, Zhankui He, Julian McAuley

Language models (LMs) can reproduce (or amplify) toxic language seen during training, which poses a risk to their practical application.

Controlling Bias Exposure for Fair Interpretable Predictions

1 code implementation 14 Oct 2022 Zexue He, Yu Wang, Julian McAuley, Bodhisattwa Prasad Majumder

However, when sensitive information is semantically entangled with the task information of the input, e.g., gender information is predictive for a profession, a fair trade-off between task performance and bias mitigation is difficult to achieve.

Attribute Task 2 +2

InterFair: Debiasing with Natural Language Feedback for Fair Interpretable Predictions

no code implementations 14 Oct 2022 Bodhisattwa Prasad Majumder, Zexue He, Julian McAuley

In the other setup, human feedback was able to disentangle associated bias and predictive information from the input, leading to superior bias mitigation and improved task performance (4-5%) simultaneously.

Attribute

Synthetic Pre-Training Tasks for Neural Machine Translation

no code implementations 19 Dec 2022 Zexue He, Graeme Blackwood, Rameswar Panda, Julian McAuley, Rogerio Feris

Pre-training models with large crawled corpora can lead to issues such as toxicity and bias, as well as copyright and privacy concerns.

Machine Translation NMT +1

"Nothing Abnormal": Disambiguating Medical Reports via Contrastive Knowledge Infusion

no code implementations 15 May 2023 Zexue He, An Yan, Amilcare Gentili, Julian McAuley, Chun-Nan Hsu

Based on our analysis, we define a disambiguation rewriting task to regenerate an input to be unambiguous while preserving information about the original content.

Targeted Data Generation: Finding and Fixing Model Weaknesses

no code implementations 28 May 2023 Zexue He, Marco Tulio Ribeiro, Fereshte Khani

Even when aggregate accuracy is high, state-of-the-art NLP models often fail systematically on specific subgroups of data, resulting in unfair outcomes and eroding user trust.

Data Augmentation Natural Language Inference +2

Learning Concise and Descriptive Attributes for Visual Recognition

1 code implementation ICCV 2023 An Yan, Yu Wang, Yiwu Zhong, Chengyu Dong, Zexue He, Yujie Lu, William Wang, Jingbo Shang, Julian McAuley

Recent advances in foundation models present new opportunities for interpretable visual recognition -- one can first query Large Language Models (LLMs) to obtain a set of attributes that describe each class, then apply vision-language models to classify images via these attributes.

Descriptive

Robust and Interpretable Medical Image Classifiers via Concept Bottleneck Models

no code implementations 4 Oct 2023 An Yan, Yu Wang, Yiwu Zhong, Zexue He, Petros Karypis, Zihan Wang, Chengyu Dong, Amilcare Gentili, Chun-Nan Hsu, Jingbo Shang, Julian McAuley

Medical image classification is a critical problem for healthcare, with the potential to alleviate the workload of doctors and facilitate diagnoses of patients.

Image Classification Language Modelling +1

Farzi Data: Autoregressive Data Distillation

no code implementations 15 Oct 2023 Noveen Sachdeva, Zexue He, Wang-Cheng Kang, Jianmo Ni, Derek Zhiyuan Cheng, Julian McAuley

We study data distillation for auto-regressive machine learning tasks, where the input and output have a strict left-to-right causal structure.

Language Modelling Sequential Recommendation

Deciphering Compatibility Relationships with Textual Descriptions via Extraction and Explanation

1 code implementation 17 Dec 2023 Yu Wang, Zexue He, Zhankui He, Hao Xu, Julian McAuley

This fine-tuning allows the model to generate explanations that convey the compatibility relationships between items.

InfoRank: Unbiased Learning-to-Rank via Conditional Mutual Information Minimization

no code implementations 23 Jan 2024 Jiarui Jin, Zexue He, Mengyue Yang, Weinan Zhang, Yong Yu, Jun Wang, Julian McAuley

Subsequently, we minimize the mutual information between the observation estimation and the relevance estimation conditioned on the input features.

Learning-To-Rank Recommendation Systems

LVCHAT: Facilitating Long Video Comprehension

1 code implementation 19 Feb 2024 Yu Wang, Zeyuan Zhang, Julian McAuley, Zexue He

To address this issue, we propose Long Video Chat (LVChat), where Frame-Scalable Encoding (FSE) is introduced to dynamically adjust the number of embeddings in alignment with the duration of the video to ensure long videos are not overly compressed into a few embeddings.

Video Captioning
