Search Results for author: Haotian Ye

Found 18 papers, 7 papers with code

DOF: Accelerating High-order Differential Operators with Forward Propagation

no code implementations · 15 Feb 2024 · Ruichen Li, Chuwei Wang, Haotian Ye, Di He, LiWei Wang

Solving partial differential equations (PDEs) efficiently is essential for analyzing complex physical systems.

Selecting Large Language Model to Fine-tune via Rectified Scaling Law

no code implementations · 4 Feb 2024 · Haowei Lin, Baizhou Huang, Haotian Ye, Qinyu Chen, ZiHao Wang, Sujian Li, Jianzhu Ma, Xiaojun Wan, James Zou, Yitao Liang

The ever-growing ecosystem of LLMs has posed a challenge in selecting the most appropriate pre-trained model to fine-tune amidst a sea of options.

Language Modelling · Large Language Model

TransliCo: A Contrastive Learning Framework to Address the Script Barrier in Multilingual Pretrained Language Models

1 code implementation · 12 Jan 2024 · Yihong Liu, Chunlan Ma, Haotian Ye, Hinrich Schütze

As a result, mPLMs present a script barrier: representations from different scripts are located in different subspaces, which is a strong indicator of why crosslingual transfer involving languages of different scripts shows sub-optimal performance.

Contrastive Learning · Transliteration

MoSECroT: Model Stitching with Static Word Embeddings for Crosslingual Zero-shot Transfer

no code implementations · 9 Jan 2024 · Haotian Ye, Yihong Liu, Chunlan Ma, Hinrich Schütze

In this paper, we introduce MoSECroT (Model Stitching with Static Word Embeddings for Crosslingual Zero-shot Transfer), a novel and challenging task that is especially relevant to low-resource languages for which static word embeddings are available.

Word Embeddings

In-context Vectors: Making In Context Learning More Effective and Controllable Through Latent Space Steering

1 code implementation · 11 Nov 2023 · Sheng Liu, Haotian Ye, Lei Xing, James Zou

On a new query, instead of adding demonstrations to the prompt, we shift the latent states of the LLM using the ICV.

In-Context Learning · Style Transfer
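The steering idea in this snippet — shifting latent states with an in-context vector (ICV) instead of prepending demonstrations — can be sketched on toy vectors. Everything below (the toy "hidden states", the `steer` helper, the constant style shift) is an illustrative stand-in, not the paper's implementation:

```python
import numpy as np

rng = np.random.default_rng(0)
D = 16  # hidden dimension of the toy model

# Paired demonstrations: latent states of source-style and target-style texts.
# Here the "target style" is simulated as a constant shift of every coordinate.
h_source = rng.normal(size=(8, D))
h_target = h_source + 2.0

# The ICV summarizes the demonstrations as one mean latent difference.
icv = (h_target - h_source).mean(axis=0)

def steer(h, icv, alpha=1.0):
    """Shift a query's latent state along the ICV rather than adding demos."""
    return h + alpha * icv

h_query = rng.normal(size=(D,))
h_steered = steer(h_query, icv)
```

The appeal of the approach is visible even in this sketch: the demonstrations are compressed into a single vector once, so the prompt for each new query stays short.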

Towards Revealing the Mystery behind Chain of Thought: A Theoretical Perspective

no code implementations · NeurIPS 2023 · Guhao Feng, Bohang Zhang, Yuntian Gu, Haotian Ye, Di He, LiWei Wang

By using circuit complexity theory, we first give impossibility results showing that bounded-depth Transformers are unable to directly produce correct answers for basic arithmetic/equation tasks unless the model size grows super-polynomially with respect to the input length.

Decision Making · Math

A study of conceptual language similarity: comparison and evaluation

no code implementations · 22 May 2023 · Haotian Ye, Yihong Liu, Hinrich Schütze

An interesting line of research in natural language processing (NLP) aims to incorporate linguistic typology to bridge linguistic diversity and assist the research of low-resource languages.

Binary Classification

A Crosslingual Investigation of Conceptualization in 1335 Languages

3 code implementations · 15 May 2023 · Yihong Liu, Haotian Ye, Leonie Weissweiler, Philipp Wicke, Renhao Pei, Robert Zangenfeind, Hinrich Schütze

The resulting measure for the conceptual similarity of two languages is complementary to standard genealogical, typological, and surface similarity measures.

Taxi1500: A Multilingual Dataset for Text Classification in 1500 Languages

no code implementations · 15 May 2023 · Chunlan Ma, Ayyoob ImaniGooghari, Haotian Ye, Ehsaneddin Asgari, Hinrich Schütze

While natural language processing tools have been developed extensively for some of the world's languages, a significant portion of the world's more than 7,000 languages remains neglected.

Text Classification

Discovering Latent Knowledge in Language Models Without Supervision

1 code implementation · 7 Dec 2022 · Collin Burns, Haotian Ye, Dan Klein, Jacob Steinhardt

Existing techniques for training language models can be misaligned with the truth: if we train models with imitation learning, they may reproduce errors that humans make; if we train them to generate text that humans rate highly, they may output errors that human evaluators can't detect.

Imitation Learning · Language Modelling +2
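The unsupervised probing idea behind this paper (contrast-consistent search) can be sketched on synthetic activations: learn a direction whose probabilities on a statement and its negation are consistent (they should sum to one) and confident (away from 0.5). The toy data, loss weighting, and hand-derived gradients below are illustrative assumptions, not the authors' code:

```python
import numpy as np

rng = np.random.default_rng(0)
N, D = 200, 10
truth = rng.integers(0, 2, size=N)          # latent truth values (never observed)
signal = 2 * truth - 1                      # +1 for true statements, -1 for false
direction = rng.normal(size=D)              # hidden "truth direction"
# Activations of each statement and its negation flip sign along the direction.
x_pos = np.outer(signal, direction) + 0.1 * rng.normal(size=(N, D))
x_neg = -np.outer(signal, direction) + 0.1 * rng.normal(size=(N, D))

def sigmoid(z):
    return 1.0 / (1.0 + np.exp(-z))

def ccs_loss(w):
    p_pos, p_neg = sigmoid(x_pos @ w), sigmoid(x_neg @ w)
    cons = p_pos - (1.0 - p_neg)            # consistency: p(x+) should be 1 - p(x-)
    conf = np.minimum(p_pos, p_neg)         # confidence: discourage p near 0.5
    return np.mean(cons ** 2 + conf ** 2)

# Plain gradient descent on the probe direction (no labels used anywhere).
w = 0.01 * rng.normal(size=D)
initial_loss = ccs_loss(w)
lr = 0.5
for _ in range(500):
    p_pos, p_neg = sigmoid(x_pos @ w), sigmoid(x_neg @ w)
    cons = p_pos - (1.0 - p_neg)
    conf = np.minimum(p_pos, p_neg)
    # Gradients of mean(cons^2 + conf^2); the min picks one side per sample.
    g_pos = cons * p_pos * (1 - p_pos) + (p_pos <= p_neg) * conf * p_pos * (1 - p_pos)
    g_neg = cons * p_neg * (1 - p_neg) + (p_neg < p_pos) * conf * p_neg * (1 - p_neg)
    w -= lr * (2.0 / N) * (x_pos.T @ g_pos + x_neg.T @ g_neg)
final_loss = ccs_loss(w)

# The learned direction is only identified up to sign, so score both readings.
pred = (sigmoid(x_pos @ w) > 0.5).astype(int)
acc = max((pred == truth).mean(), (pred != truth).mean())
```

Note that no supervision enters the loop: the probe is held to internal consistency between each statement and its negation, which is exactly the property the abstract argues can surface knowledge that misaligned training signals miss.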

Freeze then Train: Towards Provable Representation Learning under Spurious Correlations and Feature Noise

1 code implementation · 20 Oct 2022 · Haotian Ye, James Zou, Linjun Zhang

This opens a promising strategy to first train a feature learner rather than a classifier, and then perform linear probing (last layer retraining) in the test environment.

Representation Learning
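The "first train a feature learner, then linear-probe (retrain only the last layer) in the test environment" recipe from this snippet can be sketched minimally. The frozen random projection standing in for a trained feature learner, and the least-squares probe, are illustrative choices, not the paper's procedure:

```python
import numpy as np

rng = np.random.default_rng(0)
N, D_in, D_feat = 300, 20, 8

# Stage 1 stand-in: a frozen feature extractor whose weights are never updated.
W_frozen = rng.normal(size=(D_in, D_feat))
def features(x):
    return np.maximum(x @ W_frozen, 0.0)  # frozen ReLU features

# Test-environment data whose labels are linear in the frozen features.
x_env = rng.normal(size=(N, D_in))
true_head = rng.normal(size=D_feat)
y = (features(x_env) @ true_head > 0).astype(float)

# Stage 2: linear probing = fit only a new last layer on test-environment data
# (least squares on +/-1 targets, with a bias column).
F = features(x_env)
Fb = np.hstack([F, np.ones((N, 1))])
head, *_ = np.linalg.lstsq(Fb, 2 * y - 1, rcond=None)
acc = ((Fb @ head > 0).astype(float) == y).mean()
```

The design point the snippet makes is that only `head` is refit in the new environment; the (possibly spurious-correlation-contaminated) feature extractor stays frozen, which keeps the adaptation step cheap and linear.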

On the Power of Pre-training for Generalization in RL: Provable Benefits and Hardness

no code implementations · 19 Oct 2022 · Haotian Ye, Xiaoyu Chen, LiWei Wang, Simon S. Du

Generalization in Reinforcement Learning (RL) aims to learn an agent during training that generalizes to the target environment.

Reinforcement Learning (RL)

Towards a Theoretical Framework of Out-of-Distribution Generalization

no code implementations · NeurIPS 2021 · Haotian Ye, Chuanlong Xie, Tianle Cai, Ruichen Li, Zhenguo Li, LiWei Wang

We also introduce a new concept, the expansion function, which characterizes to what extent the variance is amplified in the test domains over the training domains, and therefore gives a quantitative meaning to invariant features.

Domain Generalization · Model Selection +1
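The variance-amplification idea in the snippet can be written schematically; the inequality below is a hedged paraphrase of the quoted sentence, not the paper's formal definition:

```latex
% Schematic only: for a monotone expansion function $s$ with $s(x) \ge x$,
% the test-domain variance of every feature $\phi$ is bounded by
\operatorname{Var}_{\text{test}}(\phi) \;\le\; s\!\left(\operatorname{Var}_{\text{train}}(\phi)\right),
% so a feature with small training-domain variance cannot have its variance
% amplified arbitrarily at test time -- the quantitative sense in which it is
% "invariant".
```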

Risk Variance Penalization

no code implementations · 13 Jun 2020 · Chuanlong Xie, Haotian Ye, Fei Chen, Yue Liu, Rui Sun, Zhenguo Li

The key to out-of-distribution (OOD) generalization is to generalize invariance from training domains to target domains.
