Search Results for author: Linjun Zhang

Found 41 papers, 14 papers with code

A Unified Combination Framework for Dependent Tests with Applications to Microbiome Association Studies

no code implementations • 14 Apr 2024 • Xiufan Yu, Linjun Zhang, Arun Srinivasan, Min-ge Xie, Lingzhou Xue

Compared to the existing $p$-value combination methods, including the vanilla Cauchy combination method, the proposed combination framework can handle the dependence accurately and utilizes the information efficiently to construct tests with accurate size and enhanced power.

FAIRM: Learning invariant representations for algorithmic fairness and domain generalization with minimax optimality

1 code implementation • 2 Apr 2024 • Sai Li, Linjun Zhang

Machine learning methods often assume that the test data have the same distribution as the training data.

Domain Generalization Fairness

Contrastive Learning on Multimodal Analysis of Electronic Health Records

no code implementations • 22 Mar 2024 • Tianxi Cai, Feiqing Huang, Ryumei Nakada, Linjun Zhang, Doudou Zhou

To accommodate the statistical analysis of multimodal EHR data, in this paper, we propose a novel multimodal feature embedding generative model and design a multimodal contrastive loss to obtain the multimodal EHR feature representation.

Contrastive Learning Privacy Preserving +1

Provable Multi-Party Reinforcement Learning with Diverse Human Feedback

no code implementations • 8 Mar 2024 • Huiying Zhong, Zhun Deng, Weijie J. Su, Zhiwei Steven Wu, Linjun Zhang

Our work initiates the theoretical study of multi-party RLHF that explicitly models the diverse preferences of multiple individuals.

Fairness Meta-Learning +1

Distribution-Free Fair Federated Learning with Small Samples

no code implementations • 25 Feb 2024 • Qichuan Yin, Junzhou Huang, Huaxiu Yao, Linjun Zhang

As federated learning gains increasing importance in real-world applications due to its capacity for decentralized data training, addressing fairness concerns across demographic groups becomes critically important.

Fairness Federated Learning

Selective Learning: Towards Robust Calibration with Dynamic Regularization

no code implementations • 13 Feb 2024 • Zongbo Han, Yifeng Yang, Changqing Zhang, Linjun Zhang, Joey Tianyi Zhou, Qinghua Hu, Huaxiu Yao

The objective can be understood as seeking a model that fits the ground-truth labels by increasing the confidence while also maximizing the entropy of predicted probabilities by decreasing the confidence.

Differentially Private Sliced Inverse Regression: Minimax Optimality and Algorithm

no code implementations • 16 Jan 2024 • Xintao Xia, Linjun Zhang, Zhanrui Cai

Privacy preservation has become a critical concern in high-dimensional data analysis due to the growing prevalence of data-driven applications.

Dimensionality Reduction regression

Can AI Be as Creative as Humans?

no code implementations • 3 Jan 2024 • Haonan Wang, James Zou, Michael Mozer, Anirudh Goyal, Alex Lamb, Linjun Zhang, Weijie J Su, Zhun Deng, Michael Qizhe Xie, Hannah Brown, Kenji Kawaguchi

With the rise of advanced generative AI models capable of tasks once reserved for human creativity, the study of AI's creative potential becomes imperative for its responsible development and application.

Holistic Analysis of Hallucination in GPT-4V(ision): Bias and Interference Challenges

1 code implementation • 6 Nov 2023 • Chenhang Cui, Yiyang Zhou, Xinyu Yang, Shirley Wu, Linjun Zhang, James Zou, Huaxiu Yao

To bridge this gap, we introduce a new benchmark, namely, the Bias and Interference Challenges in Visual Language Models (Bingo).

Hallucination

Conformal Prediction for Deep Classifier via Label Ranking

2 code implementations • 10 Oct 2023 • Jianguo Huang, Huajun Xi, Linjun Zhang, Huaxiu Yao, Yue Qiu, Hongxin Wei

In this paper, we empirically and theoretically show that disregarding the probabilities' value will mitigate the undesirable effect of miscalibrated probability values.

Conformal Prediction

Analyzing and Mitigating Object Hallucination in Large Vision-Language Models

1 code implementation • 1 Oct 2023 • Yiyang Zhou, Chenhang Cui, Jaehong Yoon, Linjun Zhang, Zhun Deng, Chelsea Finn, Mohit Bansal, Huaxiu Yao

Large vision-language models (LVLMs) have shown remarkable abilities in understanding visual information with human languages.

Hallucination Hallucination Evaluation +1

Multi-dimensional domain generalization with low-rank structures

1 code implementation • 18 Sep 2023 • Sai Li, Linjun Zhang

In conventional statistical and machine learning methods, it is typically assumed that the test data are identically distributed with the training data.

Domain Generalization

What Should Data Science Education Do with Large Language Models?

no code implementations • 6 Jul 2023 • Xinming Tu, James Zou, Weijie J. Su, Linjun Zhang

LLMs can also play a significant role in the classroom as interactive teaching and learning tools, contributing to personalized education.

Safeguarding Data in Multimodal AI: A Differentially Private Approach to CLIP Training

1 code implementation • 13 Jun 2023 • Alyssa Huang, Peihan Liu, Ryumei Nakada, Linjun Zhang, Wanrong Zhang

The surge in multimodal AI's success has sparked concerns over data privacy in vision-and-language tasks.

Image Classification Privacy Preserving +2

Discover and Cure: Concept-aware Mitigation of Spurious Correlation

1 code implementation • 1 May 2023 • Shirley Wu, Mert Yuksekgonul, Linjun Zhang, James Zou

Deep neural networks often rely on spurious correlations to make predictions, which hinders generalization beyond training environments.

Lesion Classification Object Recognition +1

Score Attack: A Lower Bound Technique for Optimal Differentially Private Learning

no code implementations • 13 Mar 2023 • T. Tony Cai, Yichen Wang, Linjun Zhang

The score attack method is based on the tracing attack concept in differential privacy and can be applied to any statistical model with a well-defined score statistic.

HappyMap: A Generalized Multi-calibration Method

no code implementations • 8 Mar 2023 • Zhun Deng, Cynthia Dwork, Linjun Zhang

Fairness is captured by incorporating demographic subgroups into the class of functions $\mathcal{C}$.

Conformal Prediction Fairness +1

Understanding Multimodal Contrastive Learning and Incorporating Unpaired Data

1 code implementation • 13 Feb 2023 • Ryumei Nakada, Halil Ibrahim Gulluk, Zhun Deng, Wenlong Ji, James Zou, Linjun Zhang

We show that the algorithm can detect the ground-truth pairs and improve performance by fully exploiting unpaired datasets.

Contrastive Learning

FaiREE: Fair Classification with Finite-Sample and Distribution-Free Guarantee

1 code implementation • 28 Nov 2022 • Puheng Li, James Zou, Linjun Zhang

Several group fairness notions and algorithms have been proposed.

Fairness

Reinforcement Learning with Stepwise Fairness Constraints

no code implementations • 8 Nov 2022 • Zhun Deng, He Sun, Zhiwei Steven Wu, Linjun Zhang, David C. Parkes

AI methods are used in societally important settings, ranging from credit to employment to housing, and it is crucial to provide fairness in regard to algorithmic decision making.

Decision Making Fairness +2

Freeze then Train: Towards Provable Representation Learning under Spurious Correlations and Feature Noise

1 code implementation • 20 Oct 2022 • Haotian Ye, James Zou, Linjun Zhang

This opens a promising strategy to first train a feature learner rather than a classifier, and then perform linear probing (last layer retraining) in the test environment.

Representation Learning

C-Mixup: Improving Generalization in Regression

1 code implementation • 11 Oct 2022 • Huaxiu Yao, Yiping Wang, Linjun Zhang, James Zou, Chelsea Finn

In this paper, we propose a simple yet powerful algorithm, C-Mixup, to improve generalization on regression tasks.

regression

FIFA: Making Fairness More Generalizable in Classifiers Trained on Imbalanced Data

no code implementations • 6 Jun 2022 • Zhun Deng, Jiayao Zhang, Linjun Zhang, Ting Ye, Yates Coley, Weijie J. Su, James Zou

Specifically, FIFA encourages both classification and fairness generalization and can be flexibly combined with many existing fair learning methods with logits-based losses.

Classification Fairness

Improving Out-of-Distribution Robustness via Selective Augmentation

2 code implementations • 2 Jan 2022 • Huaxiu Yao, Yu Wang, Sai Li, Linjun Zhang, Weixin Liang, James Zou, Chelsea Finn

Machine learning algorithms typically assume that training and test examples are drawn from the same distribution.

Scaffolding Sets

no code implementations • 4 Nov 2021 • Maya Burhanpurkar, Zhun Deng, Cynthia Dwork, Linjun Zhang

Predictors map individual instances in a population to the interval $[0, 1]$.

The Power of Contrast for Feature Learning: A Theoretical Analysis

no code implementations • 6 Oct 2021 • Wenlong Ji, Zhun Deng, Ryumei Nakada, James Zou, Linjun Zhang

Contrastive learning has achieved state-of-the-art performance in various self-supervised learning tasks and even outperforms its supervised counterpart.

Contrastive Learning Self-Supervised Learning +1

Understanding Dynamics of Nonlinear Representation Learning and Its Application

no code implementations • 28 Jun 2021 • Kenji Kawaguchi, Linjun Zhang, Zhun Deng

Representation learning allows us to automatically discover suitable representations from raw sensory data.

Representation Learning

Adversarial Training Helps Transfer Learning via Better Representations

no code implementations • NeurIPS 2021 • Zhun Deng, Linjun Zhang, Kailas Vodrahalli, Kenji Kawaguchi, James Zou

Recent works empirically demonstrate that adversarial training in the source data can improve the ability of models to transfer to new domains.

Transfer Learning

Meta-Learning with Fewer Tasks through Task Interpolation

1 code implementation • ICLR 2022 • Huaxiu Yao, Linjun Zhang, Chelsea Finn

Meta-learning enables algorithms to quickly learn a newly encountered task with just a few labeled examples by transferring previously learned knowledge.

Image Classification Medical Image Classification +3

High-Dimensional Differentially-Private EM Algorithm: Methods and Near-Optimal Statistical Guarantees

no code implementations • 1 Apr 2021 • Zhe Zhang, Linjun Zhang

In this paper, we develop a general framework to design differentially private expectation-maximization (EM) algorithms in high-dimensional latent variable models, based on noisy iterative hard thresholding.

regression

A Central Limit Theorem for Differentially Private Query Answering

no code implementations • NeurIPS 2021 • Jinshuo Dong, Weijie J. Su, Linjun Zhang

The central question, therefore, is to understand which noise distribution optimizes the privacy-accuracy trade-off, especially when the dimension of the answer vector is high.

When and How Mixup Improves Calibration

no code implementations • 11 Feb 2021 • Linjun Zhang, Zhun Deng, Kenji Kawaguchi, James Zou

In addition, we study how Mixup improves calibration in semi-supervised learning.

Data Augmentation

The Cost of Privacy in Generalized Linear Models: Algorithms and Minimax Lower Bounds

no code implementations • 8 Nov 2020 • T. Tony Cai, Yichen Wang, Linjun Zhang

We propose differentially private algorithms for parameter estimation in both low-dimensional and high-dimensional sparse generalized linear models (GLMs) by constructing private versions of projected gradient descent.

LEMMA

Estimation, Confidence Intervals, and Large-Scale Hypotheses Testing for High-Dimensional Mixed Linear Regression

no code implementations • 6 Nov 2020 • Linjun Zhang, Rong Ma, T. Tony Cai, Hongzhe Li

Based on the iterative estimators, we further construct debiased estimators and establish their asymptotic normality.

regression

How Does Mixup Help With Robustness and Generalization?

no code implementations • ICLR 2021 • Linjun Zhang, Zhun Deng, Kenji Kawaguchi, Amirata Ghorbani, James Zou

For robustness, we show that minimizing the Mixup loss corresponds to approximately minimizing an upper bound of the adversarial loss.

Data Augmentation

Interpreting Robust Optimization via Adversarial Influence Functions

no code implementations • ICML 2020 • Zhun Deng, Cynthia Dwork, Jialiang Wang, Linjun Zhang

Robust optimization is now widely used in data science, especially in adversarial training.

A Lightweight Algorithm to Uncover Deep Relationships in Data Tables

no code implementations • 7 Sep 2020 • Jin Cao, Yibo Zhao, Linjun Zhang, Jason Li

The key to our approach is a computationally lightweight forward addition algorithm that we developed to recursively extract the functional dependencies between table columns that are scalable to tables with many columns.

Improving Generalization in Meta-learning via Task Augmentation

1 code implementation • 26 Jul 2020 • Huaxiu Yao, Long-Kai Huang, Linjun Zhang, Ying Wei, Li Tian, James Zou, Junzhou Huang, Zhenhui Li

Moreover, both MetaMix and Channel Shuffle outperform state-of-the-art results by a large margin across many datasets and are compatible with existing meta-learning algorithms.

Meta-Learning

Improving Adversarial Robustness via Unlabeled Out-of-Domain Data

no code implementations • 15 Jun 2020 • Zhun Deng, Linjun Zhang, Amirata Ghorbani, James Zou

In this work, we investigate how adversarial robustness can be enhanced by leveraging out-of-domain unlabeled data.

Adversarial Robustness Data Augmentation +2

The Cost of Privacy: Optimal Rates of Convergence for Parameter Estimation with Differential Privacy

no code implementations • 12 Feb 2019 • T. Tony Cai, Yichen Wang, Linjun Zhang

By refining the "tracing adversary" technique for lower bounds in the theoretical computer science literature, we formulate a general lower bound argument for minimax risks with differential privacy constraints, and apply this argument to high-dimensional mean estimation and linear regression problems.

Privacy Preserving regression

A Sparse PCA Approach to Clustering

no code implementations • 16 Feb 2016 • T. Tony Cai, Linjun Zhang

We discuss a clustering method for the Gaussian mixture model based on the sparse principal component analysis (SPCA) method and compare it with the IF-PCA method.

Clustering
