Search Results for author: Weixin Liang

Found 23 papers, 15 papers with code

Mapping the Increasing Use of LLMs in Scientific Papers

no code implementations • 1 Apr 2024 • Weixin Liang, Yaohui Zhang, Zhengxuan Wu, Haley Lepp, Wenlong Ji, Xuandong Zhao, Hancheng Cao, Sheng Liu, Siyu He, Zhi Huang, Diyi Yang, Christopher Potts, Christopher D Manning, James Y. Zou

To address this gap, we conduct the first systematic, large-scale analysis of 950,965 papers published between January 2020 and February 2024 on arXiv and bioRxiv and in Nature portfolio journals, using a population-level statistical framework to measure the prevalence of LLM-modified content over time.
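
The snippet above does not spell the framework out, but the core idea of population-level prevalence estimation can be sketched as a mixture model: the observed document frequency of certain marker words is a blend of their rates in human-written and LLM-modified text, and the mixing weight is the quantity of interest. Below is a minimal, hypothetical sketch of that idea; all word rates, counts, and corpus sizes are invented for illustration.

```python
import numpy as np
from scipy.optimize import minimize_scalar

# Hypothetical per-document occurrence rates of LLM-favored marker words,
# estimated from reference corpora (all numbers here are made up).
p_human = np.array([0.010, 0.004, 0.002])  # rate in pre-LLM (human) documents
p_llm   = np.array([0.120, 0.050, 0.030])  # rate in known LLM-modified documents

def neg_log_likelihood(alpha, counts, n_docs):
    # Each document is LLM-modified with probability alpha, so the observed
    # per-word document frequency is a mixture of the two reference rates.
    p_mix = (1 - alpha) * p_human + alpha * p_llm
    ll = counts * np.log(p_mix) + (n_docs - counts) * np.log(1 - p_mix)
    return -ll.sum()

# Observed: how many of 10,000 target-period documents contain each marker word.
counts = np.array([450, 190, 110])
res = minimize_scalar(neg_log_likelihood, bounds=(1e-6, 1 - 1e-6),
                      args=(counts, 10_000), method="bounded")
print(f"Estimated fraction of LLM-modified documents: {res.x:.1%}")
```

Maximizing the likelihood over the mixing weight yields a corpus-level estimate of the fraction of modified documents without having to classify any individual paper.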

What's documented in AI? Systematic Analysis of 32K AI Model Cards

1 code implementation • 7 Feb 2024 • Weixin Liang, Nazneen Rajani, Xinyu Yang, Ezinwanne Ozoani, Eric Wu, Yiqun Chen, Daniel Scott Smith, James Zou

To evaluate the impact of model cards, we conducted an intervention study, adding detailed model cards to 42 popular models that previously had no or only sparse model cards.

Informativeness

Navigating Dataset Documentations in AI: A Large-Scale Analysis of Dataset Cards on Hugging Face

1 code implementation • 24 Jan 2024 • Xinyu Yang, Weixin Liang, James Zou

By analyzing all 7,433 dataset documentation pages on Hugging Face, our investigation provides an overview of the Hugging Face dataset ecosystem and insights into dataset documentation practices, yielding 5 main findings: (1) the dataset card completion rate shows marked heterogeneity correlated with dataset popularity.

Can large language models provide useful feedback on research papers? A large-scale empirical analysis

1 code implementation • 3 Oct 2023 • Weixin Liang, Yuhui Zhang, Hancheng Cao, Binglu Wang, Daisy Ding, Xinyu Yang, Kailas Vodrahalli, Siyu He, Daniel Smith, Yian Yin, Daniel McFarland, James Zou

We first quantitatively compared GPT-4-generated feedback with human peer-reviewer feedback on papers from 15 Nature family journals (3,096 papers in total) and the ICLR machine learning conference (1,709 papers).

Accuracy on the Curve: On the Nonlinear Correlation of ML Performance Between Data Subpopulations

1 code implementation • 4 May 2023 • Weixin Liang, Yining Mao, Yongchan Kwon, Xinyu Yang, James Zou

Our work highlights the importance of understanding the nonlinear effects of model improvement on performance in different subpopulations, and has the potential to inform the development of more equitable and responsible machine learning models.

Fairness

GPT detectors are biased against non-native English writers

2 code implementations • 6 Apr 2023 • Weixin Liang, Mert Yuksekgonul, Yining Mao, Eric Wu, James Zou

In this study, we evaluate the performance of several widely used GPT detectors using writing samples from native and non-native English writers.

Fairness

SEAL: Interactive Tool for Systematic Error Analysis and Labeling

no code implementations • 11 Oct 2022 • Nazneen Rajani, Weixin Liang, Lingjiao Chen, Meg Mitchell, James Zou

With the advent of Transformers, large language models (LLMs) have saturated well-known NLP benchmarks and leaderboards with high aggregate performance.

Data Budgeting for Machine Learning

no code implementations • 3 Oct 2022 • Xinyi Zhao, Weixin Liang, James Zou

Data is the fuel powering AI and creates tremendous value for many domains.

Disparities in Dermatology AI Performance on a Diverse, Curated Clinical Image Set

no code implementations • 15 Mar 2022 • Roxana Daneshjou, Kailas Vodrahalli, Roberto A Novoa, Melissa Jenkins, Weixin Liang, Veronica Rotemberg, Justin Ko, Susan M Swetter, Elizabeth E Bailey, Olivier Gevaert, Pritam Mukherjee, Michelle Phung, Kiana Yekrang, Bradley Fong, Rachna Sahasrabudhe, Johan A. C. Allerup, Utako Okata-Karigane, James Zou, Albert Chiou

To ascertain potential biases in algorithm performance in this context, we curated the Diverse Dermatology Images (DDI) dataset, the first publicly available, expertly curated, and pathologically confirmed image dataset with diverse skin tones.

Mind the Gap: Understanding the Modality Gap in Multi-modal Contrastive Representation Learning

2 code implementations • 3 Mar 2022 • Weixin Liang, Yuhui Zhang, Yongchan Kwon, Serena Yeung, James Zou

Our systematic analysis demonstrates that this gap is caused by a combination of model initialization and contrastive learning optimization.

Contrastive Learning • Fairness +2
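
A hedged illustration of how such a gap can be measured: take the centroid of each modality's L2-normalized embeddings and compute the distance between them. The sketch below uses synthetic stand-in embeddings; a real measurement would use paired image and text embeddings from a contrastive model such as CLIP.

```python
import numpy as np

rng = np.random.default_rng(0)

# Synthetic stand-ins for paired image/text embeddings (hypothetical data;
# in practice these come from the two encoders of a contrastive model).
image_emb = rng.normal(size=(512, 768)) + 2.0  # constant offsets mimic the
text_emb  = rng.normal(size=(512, 768)) - 2.0  # narrow-cone effect per modality

def normalize(x):
    return x / np.linalg.norm(x, axis=1, keepdims=True)

image_emb, text_emb = normalize(image_emb), normalize(text_emb)

# The modality gap: distance between the two modalities' embedding centroids.
gap = np.linalg.norm(image_emb.mean(axis=0) - text_emb.mean(axis=0))
print(f"Modality gap: {gap:.3f}")
```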

MetaShift: A Dataset of Datasets for Evaluating Contextual Distribution Shifts and Training Conflicts

1 code implementation • ICLR 2022 • Weixin Liang, James Zou

We present MetaShift, a collection of 12,868 sets of natural images across 410 classes, to address this challenge.

Benchmarking

Improving Out-of-Distribution Robustness via Selective Augmentation

2 code implementations • 2 Jan 2022 • Huaxiu Yao, Yu Wang, Sai Li, Linjun Zhang, Weixin Liang, James Zou, Chelsea Finn

Machine learning algorithms typically assume that training and test examples are drawn from the same distribution.

Neural Group Testing to Accelerate Deep Learning

1 code implementation • 21 Nov 2020 • Weixin Liang, James Zou

A key challenge of neural group testing is to modify a deep neural network so that it can test multiple samples in one forward pass, as illustrated in the sketch below.
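
As a hedged sketch of the group-testing idea (not the paper's exact network design), one can merge a group of samples into a single input, for example by element-wise pooling, so that a single forward pass screens the whole group; individual samples are re-tested only when their group screens positive.

```python
import torch
import torch.nn as nn

# Toy binary classifier standing in for a real network (hypothetical setup).
backbone = nn.Sequential(nn.Flatten(), nn.Linear(3 * 32 * 32, 1))

def group_score(samples: torch.Tensor) -> torch.Tensor:
    """Merge a group by element-wise max, then classify in one forward pass."""
    merged = samples.max(dim=0, keepdim=True).values  # (1, 3, 32, 32)
    return torch.sigmoid(backbone(merged)).squeeze()

group = torch.randn(8, 3, 32, 32)  # 8 samples screened with one forward pass
if group_score(group) > 0.5:
    # Only a positive group is unpacked and its members tested individually.
    flags = [torch.sigmoid(backbone(x.unsqueeze(0))).item() > 0.5 for x in group]
    print(flags)
```

When positives are rare, most groups screen negative and the per-sample cost of inference drops well below one forward pass per sample.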

LRTA: A Transparent Neural-Symbolic Reasoning Framework with Modular Supervision for Visual Question Answering

2 code implementations • 21 Nov 2020 • Weixin Liang, Feiyang Niu, Aishwarya Reganti, Govind Thattai, Gokhan Tur

We show that LRTA takes a step toward truly understanding the question, while state-of-the-art models tend to learn superficial correlations from the training data.

Answer Generation • Question Answering +1

ALICE: Active Learning with Contrastive Natural Language Explanations

no code implementations • EMNLP 2020 • Weixin Liang, James Zou, Zhou Yu

We propose Active Learning with Contrastive Explanations (ALICE), an expert-in-the-loop training framework that utilizes contrastive natural language explanations to improve data efficiency in learning.

Active Learning • Classification +1

DAWSON: A Domain Adaptive Few Shot Generation Framework

no code implementations • 2 Jan 2020 • Weixin Liang, Zixuan Liu, Can Liu

Building on DAWSON, we also propose MUSIC MATINEE, the first few-shot music generation model.

Meta-Learning • Music Generation

MOSS: End-to-End Dialog System Framework with Modular Supervision

1 code implementation • 12 Sep 2019 • Weixin Liang, Youzhi Tian, Chengcai Chen, Zhou Yu

To utilize limited training data more efficiently, we propose Modular Supervision Network (MOSS), an encoder-decoder training framework that can incorporate supervision from various intermediate dialog system modules, including natural language understanding, dialog state tracking, dialog policy learning, and natural language generation; a sketch of this idea follows below.

Dialog State Tracking • Natural Language Understanding +1
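
A hedged sketch of what modular supervision can look like in code: a multi-task objective where each dialog module contributes a loss term only when its intermediate annotations exist, so partially labeled dialogs still provide training signal. The module names, weights, and loss choice below are illustrative assumptions, not the paper's exact formulation.

```python
import torch
import torch.nn.functional as F

def modular_supervision_loss(outputs: dict, labels: dict, weights: dict) -> torch.Tensor:
    """Sum per-module losses (NLU, dialog state tracking, policy, NLG) over
    whichever intermediate labels are available for this training example."""
    total = torch.tensor(0.0)
    for module in ("nlu", "dst", "policy", "nlg"):
        if module in labels:  # supervision is optional per module
            ce = F.cross_entropy(outputs[module], labels[module])
            total = total + weights.get(module, 1.0) * ce
    return total
```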
