Search Results for author: Sanmi Koyejo

Found 77 papers, 21 papers with code

Optimization and Analysis of the pAp@k Metric for Recommender Systems

no code implementations ICML 2020 Gaurush Hiranandani, Warut Vijitbenjaronk, Sanmi Koyejo, Prateek Jain

Modern recommendation and notification systems must be robust to data imbalance, limitations on the number of recommendations/notifications, and heterogeneous engagement profiles across users.

Recommendation Systems

High-Dimensional Markov-switching Ordinary Differential Processes

no code implementations30 Dec 2024 Katherine Tsai, Mladen Kolar, Sanmi Koyejo

We investigate the parameter recovery of Markov-switching ordinary differential processes from discrete observations, where the differential equations are nonlinear additive models.

Additive models

The Utility and Complexity of In- and Out-of-Distribution Machine Unlearning

no code implementations12 Dec 2024 Youssef Allouah, Joshua Kazdan, Rachid Guerraoui, Sanmi Koyejo

Machine unlearning, the process of selectively removing data from trained models, is increasingly crucial for addressing privacy concerns and knowledge gaps post-deployment.

Machine Unlearning

Context Clues: Evaluating Long Context Models for Clinical Prediction Tasks on EHRs

1 code implementation9 Dec 2024 Michael Wornow, Suhana Bedi, Miguel Angel Fuentes Hernandez, Ethan Steinberg, Jason Alan Fries, Christopher Ré, Sanmi Koyejo, Nigam H. Shah

We find that longer context models improve predictive performance -- our Mamba-based model surpasses the prior state-of-the-art on 9/14 tasks on the EHRSHOT prediction benchmark.

Mamba

MoSH: Modeling Multi-Objective Tradeoffs with Soft and Hard Bounds

no code implementations9 Dec 2024 Edward Chen, Natalie Dullerud, Thomas Niedermayr, Elizabeth Kidd, Ransalu Senanayake, Pang Wei Koh, Sanmi Koyejo, Carlos Guestrin

Leveraging a novel minimax formulation for Pareto frontier sampling, we propose a two-step process for obtaining a compact set of Pareto-optimal points which respect the user-defined soft and hard bounds: (1) densely sample the Pareto frontier using Bayesian optimization, and (2) sparsify the selected set to surface to the user, using robust submodular function optimization.
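
A minimal sketch of the sparsification step, step (2) above. The facility-location objective and all names below are illustrative assumptions, not the paper's exact formulation; the sketch only shows how a greedy algorithm can pick a compact, representative subset of densely sampled frontier points.

    import numpy as np

    def greedy_sparsify(points, k):
        # points: (n, d) dense Pareto-frontier samples; greedily pick k
        # representatives maximizing a facility-location coverage objective.
        n = points.shape[0]
        sim = -((points[:, None, :] - points[None, :, :]) ** 2).sum(-1)
        best_cover = np.full(n, sim.min())   # finite floor keeps sums well-defined
        selected = []
        for _ in range(k):
            cover_if_added = np.maximum(best_cover[:, None], sim).sum(axis=0)
            cover_if_added[selected] = -np.inf   # never re-pick a representative
            j = int(np.argmax(cover_if_added))
            selected.append(j)
            best_cover = np.maximum(best_cover, sim[:, j])
        return selected

    frontier = np.random.rand(200, 2)        # toy stand-in for BO-sampled points
    print(greedy_sparsify(frontier, k=5))

Facility location is monotone submodular, so greedy selection carries the usual (1 - 1/e) approximation guarantee; the paper's robust variant would additionally account for the user's soft and hard bounds.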

Bayesian Optimization Large Language Model

More than Marketing? On the Information Value of AI Benchmarks for Practitioners

no code implementations7 Dec 2024 Amelia Hardy, Anka Reuel, Kiana Jafari Meimandi, Lisa Soder, Allie Griffith, Dylan M. Asmar, Sanmi Koyejo, Michael S. Bernstein, Mykel J. Kochenderfer

Based on analyses of interviews with 19 individuals who have used, or decided against using, benchmarks in their day-to-day work, we find that participants across these settings use benchmarks as a signal of relative performance differences between models.

Marketing

Best-of-N Jailbreaking

1 code implementation4 Dec 2024 John Hughes, Sara Price, Aengus Lynch, Rylan Schaeffer, Fazl Barez, Sanmi Koyejo, Henry Sleight, Erik Jones, Ethan Perez, Mrinank Sharma

We find that BoN Jailbreaking achieves high attack success rates (ASRs) on closed-source language models, such as 89% on GPT-4o and 78% on Claude 3.5 Sonnet when sampling 10,000 augmented prompts.

Shaping AI's Impact on Billions of Lives

no code implementations3 Dec 2024 Mariano-Florentino Cuéllar, Jeff Dean, Finale Doshi-Velez, John Hennessy, Andy Konwinski, Sanmi Koyejo, Pelonomi Moiloa, Emma Pierson, David Patterson

Artificial Intelligence (AI), like any transformative technology, has the potential to be a double-edged sword, leading either toward significant advancements or detrimental outcomes for society as a whole.

The Reality of AI and Biorisk

no code implementations2 Dec 2024 Aidan Peppin, Anka Reuel, Stephen Casper, Elliot Jones, Andrew Strait, Usman Anwar, Anurag Agrawal, Sayash Kapoor, Sanmi Koyejo, Marie Pellat, Rishi Bommasani, Nick Frosst, Sara Hooker

To accurately and confidently answer the question 'could an AI model or system increase biorisk', it is necessary to have both a sound theoretical threat model for how AI models or systems could increase biorisk and a robust method for testing that threat model.

Differentially Private Adaptation of Diffusion Models via Noisy Aggregated Embeddings

no code implementations22 Nov 2024 Pura Peetathawatchai, Wei-Ning Chen, Berivan Isik, Sanmi Koyejo, Albert No

We introduce novel methods for adapting diffusion models under differential privacy (DP) constraints, enabling privacy-preserving style and content transfer without fine-tuning.

Privacy Preserving Style Transfer

ZIP-FIT: Embedding-Free Data Selection via Compression-Based Alignment

no code implementations23 Oct 2024 Elyas Obbad, Iddah Mlauzi, Brando Miranda, Rylan Schaeffer, Kamal Obbad, Suhana Bedi, Sanmi Koyejo

Data selection is crucial for optimizing language model (LM) performance on specific tasks, yet most existing methods fail to effectively consider the target task distribution.

Code Generation Domain Adaptation

Collapse or Thrive? Perils and Promises of Synthetic Data in a Self-Generating World

no code implementations22 Oct 2024 Joshua Kazdan, Rylan Schaeffer, Apratim Dey, Matthias Gerstgrasser, Rafael Rafailov, David L. Donoho, Sanmi Koyejo

Others see collapse as avoidable; in an 'accumulate' scenario, a sequence of models is trained, but each training run uses all real and synthetic data generated so far.
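
A toy sketch contrasting a 'replace' regime (train only on the previous model's outputs) with the 'accumulate' regime described above; train and sample are hypothetical stand-ins for model fitting and synthetic-data generation.

    import numpy as np

    def train(data):
        # hypothetical model fit: a Gaussian summarized by (mean, std)
        return np.mean(data), np.std(data)

    def sample(model, n):
        mu, sigma = model
        return np.random.normal(mu, sigma, n)

    real = np.random.normal(0.0, 1.0, 1000)

    # 'replace': each generation trains only on the previous generation's samples
    data = real
    for _ in range(5):
        data = sample(train(data), 1000)     # real data is discarded each round

    # 'accumulate': each generation trains on all real + synthetic data so far
    pool = real.copy()
    for _ in range(5):
        pool = np.concatenate([pool, sample(train(pool), 1000)])

    # replace: the fitted std tends to shrink across generations (collapse);
    # accumulate: the fitted std stays close to the real value
    print(train(data)[1], train(pool)[1])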

Pantograph: A Machine-to-Machine Interaction Interface for Advanced Theorem Proving, High Level Reasoning, and Data Extraction in Lean 4

1 code implementation21 Oct 2024 Leni Aniva, Chuyue Sun, Brando Miranda, Clark Barrett, Sanmi Koyejo

Machine-assisted theorem proving refers to the process of conducting structured reasoning to automatically generate proofs for mathematical theorems.

Automated Theorem Proving

Optimization and Generalization Guarantees for Weight Normalization

no code implementations13 Sep 2024 Pedro Cisneros-Velarde, Zhijie Chen, Sanmi Koyejo, Arindam Banerjee

For generalization, we use WeightNorm to obtain a uniform-convergence-based generalization bound that is independent of the width and depends sublinearly on the depth.
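
For reference, weight normalization (standard notation, not taken from the paper) reparameterizes each layer's weight vector into a scale and a direction, and optimizes in the new variables:

    \mathbf{w} \;=\; g \, \frac{\mathbf{v}}{\lVert \mathbf{v} \rVert}, \qquad g \in \mathbb{R}, \;\; \mathbf{v} \in \mathbb{R}^{d}

The width-independence and sublinear depth dependence of the bound above are stated for networks trained under this reparameterization.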

Failures to Find Transferable Image Jailbreaks Between Vision-Language Models

no code implementations21 Jul 2024 Rylan Schaeffer, Dan Valentine, Luke Bailey, James Chua, Cristóbal Eyzaguirre, Zane Durante, Joe Benton, Brando Miranda, Henry Sleight, John Hughes, Rajashree Agrawal, Mrinank Sharma, Scott Emmons, Sanmi Koyejo, Ethan Perez

These results stand in stark contrast to existing evidence of universal and transferable text jailbreaks against language models and transferable adversarial attacks against image classifiers, suggesting that VLMs may be more robust to gradient-based transfer attacks.

Instruction Following Language Modelling +1

Learning to (Learn at Test Time): RNNs with Expressive Hidden States

3 code implementations5 Jul 2024 Yu Sun, Xinhao Li, Karan Dalal, Jiarui Xu, Arjun Vikram, Genghan Zhang, Yann Dubois, Xinlei Chen, Xiaolong Wang, Sanmi Koyejo, Tatsunori Hashimoto, Carlos Guestrin

We evaluate our instantiations at the scale of 125M to 1.3B parameters, comparing with a strong Transformer and Mamba, a modern RNN.

16k 8k +2

Lottery Ticket Adaptation: Mitigating Destructive Interference in LLMs

1 code implementation24 Jun 2024 Ashwinee Panda, Berivan Isik, Xiangyu Qi, Sanmi Koyejo, Tsachy Weissman, Prateek Mittal

The resulting effects, such as catastrophic forgetting of earlier tasks, make it challenging to obtain good performance on multiple tasks at the same time.

Instruction Following Math

In-Context Learning of Energy Functions

no code implementations18 Jun 2024 Rylan Schaeffer, Mikail Khona, Sanmi Koyejo

In-context learning is a powerful capability of certain machine learning models that arguably underpins the success of today's frontier AI models.

In-Context Learning Language Modeling +1

Quantifying Variance in Evaluation Benchmarks

no code implementations14 Jun 2024 Lovish Madaan, Aaditya K. Singh, Rylan Schaeffer, Andrew Poulton, Sanmi Koyejo, Pontus Stenetorp, Sharan Narang, Dieuwke Hupkes

Evaluation benchmarks are the cornerstone of measuring capabilities of large language models (LLMs), as well as driving progress in said capabilities.

MMLU

Towards an Improved Understanding and Utilization of Maximum Manifold Capacity Representations

no code implementations13 Jun 2024 Rylan Schaeffer, Victor Lecomte, Dhruv Bhandarkar Pai, Andres Carranza, Berivan Isik, Alyssa Unell, Mikail Khona, Thomas Yerxa, Yann Lecun, SueYeon Chung, Andrey Gromov, Ravid Shwartz-Ziv, Sanmi Koyejo

We then leverage tools from information theory to show that such embeddings maximize a well-known lower bound on mutual information between views, thereby connecting the geometric perspective of MMCR to the information-theoretic perspective commonly discussed in MVSSL.

Self-Supervised Learning

Why Has Predicting Downstream Capabilities of Frontier AI Models with Scale Remained Elusive?

no code implementations6 Jun 2024 Rylan Schaeffer, Hailey Schoelkopf, Brando Miranda, Gabriel Mukobi, Varun Madan, Adam Ibrahim, Herbie Bradley, Stella Biderman, Sanmi Koyejo

We then reveal the mechanism causing this degradation: downstream metrics require comparing the correct choice against a small number of specific incorrect choices, meaning accurately predicting downstream capabilities requires predicting not just how probability mass concentrates on the correct choice with scale, but also how probability mass fluctuates on specific incorrect choices with scale.
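
A worked statement of the mechanism (our notation, not the paper's): multiple-choice accuracy depends on the whole vector of per-choice probabilities, not only the mass on the correct answer,

    \mathrm{Acc} \;=\; \mathbb{E}\Big[\, \mathbf{1}\big\{ p_\theta(y^\star \mid x) \,>\, \max_{y \neq y^\star} p_\theta(y \mid x) \big\} \,\Big]

so even when p_\theta(y^\star \mid x) improves smoothly with scale, accuracy can jump or stall whenever mass on specific incorrect choices fluctuates.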

Multiple-choice Question Answering

Scalable Ensembling For Mitigating Reward Overoptimisation

no code implementations3 Jun 2024 Ahmed M. Ahmed, Rafael Rafailov, Stepan Sharkov, Xuechen Li, Sanmi Koyejo

Reinforcement Learning from Human Feedback (RLHF) has enabled significant advancements within language modeling for powerful, instruction-following models.

Instruction Following Language Modeling +3

On Fairness of Low-Rank Adaptation of Large Models

1 code implementation27 May 2024 Zhoujie Ding, Ken Ziyu Liu, Pura Peetathawatchai, Berivan Isik, Sanmi Koyejo

Low-rank adaptation of large models, particularly LoRA, has gained traction due to its computational efficiency.

Computational Efficiency Fairness

Causally Inspired Regularization Enables Domain General Representations

1 code implementation25 Apr 2024 Olawale Salaudeen, Sanmi Koyejo

Given a causal graph representing the data-generating process shared across different domains/distributions, enforcing sufficient graph-implied conditional independencies can identify domain-general (non-spurious) feature representations.

Is Model Collapse Inevitable? Breaking the Curse of Recursion by Accumulating Real and Synthetic Data

no code implementations1 Apr 2024 Matthias Gerstgrasser, Rylan Schaeffer, Apratim Dey, Rafael Rafailov, Henry Sleight, John Hughes, Tomasz Korbak, Rajashree Agrawal, Dhruv Pai, Andrey Gromov, Daniel A. Roberts, Diyi Yang, David L. Donoho, Sanmi Koyejo

The proliferation of generative models, combined with pretraining on web-scale data, raises a timely question: what happens when these models are trained on their own generated outputs?

Image Generation

Proxy Methods for Domain Adaptation

no code implementations12 Mar 2024 Katherine Tsai, Stephen R. Pfohl, Olawale Salaudeen, Nicole Chiou, Matt J. Kusner, Alexander D'Amour, Sanmi Koyejo, Arthur Gretton

We study the problem of domain adaptation under distribution shift, where the shift is due to a change in the distribution of an unobserved, latent variable that confounds both the covariates and the labels.

Domain Adaptation

Crossing Linguistic Horizons: Finetuning and Comprehensive Evaluation of Vietnamese Large Language Models

1 code implementation5 Mar 2024 Sang T. Truong, Duc Q. Nguyen, Toan Nguyen, Dong D. Le, Nhi N. Truong, Tho Quan, Sanmi Koyejo

Recent advancements in large language models (LLMs) have underscored their importance in the evolution of artificial intelligence.

The Case for Globalizing Fairness: A Mixed Methods Study on Colonialism, AI, and Health in Africa

no code implementations5 Mar 2024 Mercy Asiedu, Awa Dieng, Iskandar Haykel, Negar Rostamzadeh, Stephen Pfohl, Chirag Nagpal, Maria Nagawa, Abigail Oppong, Sanmi Koyejo, Katherine Heller

Whereas experts generally expressed a shared view about the relevance of colonial history for the development and implementation of AI technologies in Africa, the majority of the general population participants surveyed did not think there was a direct link between AI and colonialism.

Attribute Fairness

Robustness to Subpopulation Shift with Domain Label Noise via Regularized Annotation of Domains

no code implementations16 Feb 2024 Nathan Stromberg, Rohan Ayyagari, Monica Welfert, Sanmi Koyejo, Richard Nock, Lalitha Sankar

Existing methods for last layer retraining that aim to optimize worst-group accuracy (WGA) rely heavily on well-annotated groups in the training data.

Bridging Associative Memory and Probabilistic Modeling

no code implementations15 Feb 2024 Rylan Schaeffer, Nika Zahedi, Mikail Khona, Dhruv Pai, Sang Truong, Yilun Du, Mitchell Ostrow, Sarthak Chandra, Andres Carranza, Ila Rani Fiete, Andrey Gromov, Sanmi Koyejo

Based on the observation that associative memory's energy functions can be seen as probabilistic modeling's negative log likelihoods, we build a bridge between the two that enables useful flow of ideas in both directions.
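
The bridging observation can be written in one line (our notation): an energy function E and a probability model p are related by

    E(\mathbf{x}) \;=\; -\log p(\mathbf{x}) + \text{const}, \qquad p(\mathbf{x}) \;=\; \frac{e^{-E(\mathbf{x})}}{Z}

so descending the energy landscape toward a stored memory is the same operation as ascending the model's log likelihood.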

In-Context Learning

Transforming and Combining Rewards for Aligning Large Language Models

no code implementations1 Feb 2024 ZiHao Wang, Chirag Nagpal, Jonathan Berant, Jacob Eisenstein, Alex D'Amour, Sanmi Koyejo, Victor Veitch

A common approach for aligning language models to human preferences is to first learn a reward model from preference data, and then use this reward model to update the language model.

Language Modelling

Disentangling Fact from Grid Cell Fiction in Trained Deep Path Integrators

no code implementations6 Dec 2023 Rylan Schaeffer, Mikail Khona, Sanmi Koyejo, Ila Rani Fiete

Work on deep learning-based models of grid cells suggests that grid cells generically and robustly arise from optimizing networks to path integrate, i.e., track one's spatial position by integrating self-velocity signals.

What Causes Polysemanticity? An Alternative Origin Story of Mixed Selectivity from Incidental Causes

no code implementations5 Dec 2023 Victor Lecomte, Kushal Thaman, Rylan Schaeffer, Naomi Bashkansky, Trevor Chow, Sanmi Koyejo

Using a combination of theory and experiments, we show that incidental polysemanticity can arise for multiple reasons, including regularization and neural noise; this incidental polysemanticity occurs because random initialization can, by chance alone, initially assign multiple features to the same neuron, and the training dynamics then strengthen such overlap.

Testing Assumptions Underlying a Unified Theory for the Origin of Grid Cells

no code implementations27 Nov 2023 Rylan Schaeffer, Mikail Khona, Adrian Bertagnoli, Sanmi Koyejo, Ila Rani Fiete

At both the population and single-cell levels, we find evidence suggesting that neither of the assumptions is likely true in biological neural representations.

Learning to (Learn at Test Time)

1 code implementation20 Oct 2023 Yu Sun, Xinhao Li, Karan Dalal, Chloe Hsu, Sanmi Koyejo, Carlos Guestrin, Xiaolong Wang, Tatsunori Hashimoto, Xinlei Chen

Our inner loop turns out to be equivalent to linear attention when the inner-loop learner is only a linear model, and to self-attention when it is a kernel estimator.
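
A simplified sketch of the linear case (our notation; the paper's exact construction may differ): take the hidden state to be the inner model's weight matrix W_t, updated by one gradient step per token on a reconstruction loss,

    W_t \;=\; W_{t-1} - \eta \,\nabla_W \tfrac{1}{2}\lVert W k_t - v_t \rVert^2 \;=\; W_{t-1} - \eta\,(W_{t-1} k_t - v_t)\, k_t^{\top}

With W_0 = 0 and the W_{t-1} k_t k_t^{\top} term neglected, W_t \approx \eta \sum_{s \le t} v_s k_s^{\top}, so the output z_t = W_t q_t reproduces (unnormalized) linear attention \sum_{s \le t} v_s (k_s^{\top} q_t).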

Representation Engineering: A Top-Down Approach to AI Transparency

5 code implementations2 Oct 2023 Andy Zou, Long Phan, Sarah Chen, James Campbell, Phillip Guo, Richard Ren, Alexander Pan, Xuwang Yin, Mantas Mazeika, Ann-Kathrin Dombrowski, Shashwat Goel, Nathaniel Li, Michael J. Byun, Zifan Wang, Alex Mallen, Steven Basart, Sanmi Koyejo, Dawn Song, Matt Fredrikson, J. Zico Kolter, Dan Hendrycks

In this paper, we identify and characterize the emerging area of representation engineering (RepE), an approach to enhancing the transparency of AI systems that draws on insights from cognitive neuroscience.

Question Answering

FACADE: A Framework for Adversarial Circuit Anomaly Detection and Evaluation

no code implementations20 Jul 2023 Dhruv Pai, Andres Carranza, Rylan Schaeffer, Arnuv Tandon, Sanmi Koyejo

We present FACADE, a novel probabilistic and geometric framework designed for unsupervised mechanistic anomaly detection in deep neural networks.

Anomaly Detection

Invalid Logic, Equivalent Gains: The Bizarreness of Reasoning in Language Model Prompting

no code implementations20 Jul 2023 Rylan Schaeffer, Kateryna Pistunova, Samar Khanna, Sarthak Consul, Sanmi Koyejo

We find that the logically invalid reasoning prompts do indeed achieve similar performance gains on BBH tasks as logically valid reasoning prompts.

Language Modeling Language Modelling +1

Deceptive Alignment Monitoring

no code implementations20 Jul 2023 Andres Carranza, Dhruv Pai, Rylan Schaeffer, Arnuv Tandon, Sanmi Koyejo

As the capabilities of large machine learning models continue to grow, and as the autonomy afforded to such models continues to expand, the spectre of a new adversary looms: the models themselves.

Safety Alignment

Beyond Scale: The Diversity Coefficient as a Data Quality Metric for Variability in Natural Language Data

no code implementations24 Jun 2023 Brando Miranda, Alycia Lee, Sudharsan Sundar, Allison Casasola, Sanmi Koyejo

We conclude that our formal notion of diversity is an important aspect of data quality that captures variability and causally leads to improved evaluation performance.
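
In symbols (our paraphrase of the construction, with assumed notation): the diversity coefficient of a dataset D is the expected distance between embeddings of independently drawn batches,

    \widehat{\mathrm{div}}(D) \;=\; \mathbb{E}_{B_1, B_2 \sim D}\big[\, d\big( f(B_1), f(B_2) \big) \,\big]

where f(B) is a Task2Vec-style embedding of batch B and d is a cosine distance; larger values indicate more variability across batches.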

Diversity

Is Pre-training Truly Better Than Meta-Learning?

no code implementations24 Jun 2023 Brando Miranda, Patrick Yu, Saumya Goyal, Yu-Xiong Wang, Sanmi Koyejo

Using this analysis, we demonstrate the following: (1) when the formal diversity of a dataset is low, PT beats MAML on average, and (2) when the formal diversity is high, MAML beats PT on average.

Diversity Few-Shot Learning

Adaptive Compression in Federated Learning via Side Information

1 code implementation22 Jun 2023 Berivan Isik, Francesco Pase, Deniz Gunduz, Sanmi Koyejo, Tsachy Weissman, Michele Zorzi

The high communication cost of sending model updates from the clients to the server is a significant bottleneck for scalable federated learning (FL).

Federated Learning

DecodingTrust: A Comprehensive Assessment of Trustworthiness in GPT Models

no code implementations NeurIPS 2023 Boxin Wang, Weixin Chen, Hengzhi Pei, Chulin Xie, Mintong Kang, Chenhui Zhang, Chejian Xu, Zidi Xiong, Ritik Dutta, Rylan Schaeffer, Sang T. Truong, Simran Arora, Mantas Mazeika, Dan Hendrycks, Zinan Lin, Yu Cheng, Sanmi Koyejo, Dawn Song, Bo Li

Yet, while the literature on the trustworthiness of GPT models remains limited, practitioners have proposed employing capable GPT models for sensitive applications such as healthcare and finance -- where mistakes can be costly.

Adversarial Robustness Ethics +1

Pairwise Ranking Losses of Click-Through Rates Prediction for Welfare Maximization in Ad Auctions

no code implementations1 Jun 2023 Boxiang Lyu, Zhe Feng, Zachary Robertson, Sanmi Koyejo

We study the design of loss functions for click-through rates (CTR) to optimize (social) welfare in advertising auctions.

Learning-To-Rank

HiFA: High-fidelity Text-to-3D Generation with Advanced Diffusion Guidance

1 code implementation30 May 2023 Junzhe Zhu, Peiye Zhuang, Sanmi Koyejo

To address texture flickering issues in NeRFs, we introduce a kernel smoothing technique that refines importance sampling weights coarse-to-fine, ensuring accurate and thorough sampling in high-density regions.

3D Generation 3D geometry +2

Are Emergent Abilities of Large Language Models a Mirage?

no code implementations NeurIPS 2023 Rylan Schaeffer, Brando Miranda, Sanmi Koyejo

Recent work claims that large language models display emergent abilities, abilities not present in smaller-scale models that are present in larger-scale models.

Globalizing Fairness Attributes in Machine Learning: A Case Study on Health in Africa

no code implementations5 Apr 2023 Mercy Nyamewaa Asiedu, Awa Dieng, Abigail Oppong, Maria Nagawa, Sanmi Koyejo, Katherine Heller

With growing machine learning (ML) applications in healthcare, there have been calls for fairness in ML to understand and mitigate ethical concerns these systems may pose.

Fairness

Finite-sample Guarantees for Nash Q-learning with Linear Function Approximation

no code implementations1 Mar 2023 Pedro Cisneros-Velarde, Sanmi Koyejo

Nash Q-learning may be considered one of the first and best-known algorithms in multi-agent reinforcement learning (MARL) for learning policies that constitute a Nash equilibrium of an underlying general-sum Markov game.

Multi-agent Reinforcement Learning Q-Learning

Principled Federated Domain Adaptation: Gradient Projection and Auto-Weighting

1 code implementation10 Feb 2023 Enyi Jiang, Yibo Jacky Zhang, Sanmi Koyejo

Further, we propose a novel lightweight aggregation rule, Federated Gradient Projection (FedGP), which significantly improves target performance under domain shift and data scarcity.
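
A minimal sketch in the spirit of gradient projection for this setting; the exact FedGP rule may differ, and source_grads/target_grad are hypothetical inputs standing in for client updates.

    import numpy as np

    def projection_aggregate(source_grads, target_grad):
        # Keep, from each source-client gradient, only its component along
        # the (scarce) target gradient's direction; drop conflicting updates.
        t = target_grad / (np.linalg.norm(target_grad) + 1e-12)
        kept = []
        for g in source_grads:
            coef = float(g @ t)
            if coef > 0:                 # negatively aligned sources are discarded
                kept.append(coef * t)
        if not kept:
            return target_grad           # fall back to the target gradient alone
        return np.mean(kept, axis=0)

    # toy usage: 3 source clients, 4-dimensional model
    src = [np.random.randn(4) for _ in range(3)]
    tgt = np.random.randn(4)
    print(projection_aggregate(src, tgt))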

Domain Adaptation Federated Learning

Adapting to Latent Subgroup Shifts via Concepts and Proxies

no code implementations21 Dec 2022 Ibrahim Alabdulmohsin, Nicole Chiou, Alexander D'Amour, Arthur Gretton, Sanmi Koyejo, Matt J. Kusner, Stephen R. Pfohl, Olawale Salaudeen, Jessica Schrouff, Katherine Tsai

We show that the optimal target predictor can be non-parametrically identified with the help of concept and proxy variables available only in the source domain, and unlabeled data from the target.

Unsupervised Domain Adaptation

Latent Multimodal Functional Graphical Model Estimation

no code implementations31 Oct 2022 Katherine Tsai, Boxin Zhao, Sanmi Koyejo, Mladen Kolar

Joint multimodal functional data acquisition, where functional data from multiple modes are measured simultaneously from the same subject, has emerged as an exciting modern approach enabled by recent engineering breakthroughs in the neurological and biological sciences.

Functional Connectivity

Invariant Aggregator for Defending against Federated Backdoor Attacks

no code implementations4 Oct 2022 Xiaoyang Wang, Dimitrios Dimitriadis, Sanmi Koyejo, Shruti Tople

Federated learning enables training high-utility models across several clients without directly sharing their private data.

Federated Learning Model Optimization

Unraveling the Connections between Privacy and Certified Robustness in Federated Learning Against Poisoning Attacks

no code implementations8 Sep 2022 Chulin Xie, Yunhui Long, Pin-Yu Chen, Qinbin Li, Arash Nourian, Sanmi Koyejo, Bo Li

We then provide two robustness certification criteria: certified prediction and certified attack inefficacy for DPFL on both user and instance levels.

Federated Learning

The Curse of Low Task Diversity: On the Failure of Transfer Learning to Outperform MAML and Their Empirical Equivalence

no code implementations2 Aug 2022 Brando Miranda, Patrick Yu, Yu-Xiong Wang, Sanmi Koyejo

This novel insight contextualizes claims that transfer learning solutions are better than meta-learned solutions in the regime of low diversity under a fair comparison.

Diversity Few-Shot Learning +1

One Policy is Enough: Parallel Exploration with a Single Policy is Near-Optimal for Reward-Free Reinforcement Learning

no code implementations31 May 2022 Pedro Cisneros-Velarde, Boxiang Lyu, Sanmi Koyejo, Mladen Kolar

Although parallelism has been extensively used in reinforcement learning (RL), the quantitative effects of parallel exploration are not well understood theoretically.

Reinforcement Learning (RL)

A Word is Worth A Thousand Dollars: Adversarial Attack on Tweets Fools Stock Predictions

1 code implementation NAACL 2022 Yong Xie, Dakuo Wang, Pin-Yu Chen, JinJun Xiong, Sijia Liu, Sanmi Koyejo

More and more investors and machine learning models rely on social media (e.g., Twitter and Reddit) to gather real-time information and sentiment to predict stock price movements.

Adversarial Attack Combinatorial Optimization +1

Fair Wrapping for Black-box Predictions

1 code implementation31 Jan 2022 Alexander Soen, Ibrahim Alabdulmohsin, Sanmi Koyejo, Yishay Mansour, Nyalleng Moorosi, Richard Nock, Ke Sun, Lexing Xie

We introduce a new family of techniques to post-process ("wrap") a black-box classifier in order to reduce its bias.

Fairness

The Curse of Zero Task Diversity: On the Failure of Transfer Learning to Outperform MAML and their Empirical Equivalence

no code implementations24 Dec 2021 Brando Miranda, Yu-Xiong Wang, Sanmi Koyejo

We hypothesize that the diversity coefficient of the few-shot learning benchmark is predictive of whether meta-learning solutions will succeed or not.

Diversity Few-Shot Learning +1

Does MAML Only Work via Feature Re-use? A Data Centric Perspective

1 code implementation24 Dec 2021 Brando Miranda, Yu-Xiong Wang, Sanmi Koyejo

Recent work has suggested that a good embedding is all we need to solve many few-shot learning benchmarks.

Few-Shot Learning

EMIXER: End-to-end Multimodal X-ray Generation via Self-supervision

no code implementations10 Jul 2020 Siddharth Biswal, Peiye Zhuang, Ayis Pyrros, Nasir Siddiqui, Sanmi Koyejo, Jimeng Sun

EMIXER is a conditional generative adversarial model that 1) generates an image based on a label, 2) encodes the image to a hidden embedding, 3) produces the corresponding text via a hierarchical decoder from the image embedding, and 4) uses a joint discriminator to assess both the image and the corresponding text.

Data Augmentation Image Classification

Fairness with Overlapping Groups

no code implementations24 Jun 2020 Forest Yang, Moustapha Cisse, Sanmi Koyejo

In algorithmically fair prediction problems, a standard goal is to ensure the equality of fairness metrics across multiple overlapping groups simultaneously.

Classification Fairness +1

FMRI data augmentation via synthesis

no code implementations13 Jul 2019 Peiye Zhuang, Alexander G. Schwing, Sanmi Koyejo

Thus, our results suggest that data augmentation via synthesis is a promising approach to address the limited availability of fMRI data, and to improve the quality of predictive fMRI models.

Data Augmentation Generative Adversarial Network

Max-Sliced Wasserstein Distance and its use for GANs

no code implementations CVPR 2019 Ishan Deshpande, Yuan-Ting Hu, Ruoyu Sun, Ayis Pyrros, Nasir Siddiqui, Sanmi Koyejo, Zhizhen Zhao, David Forsyth, Alexander Schwing

Generative adversarial nets (GANs) and variational auto-encoders have significantly improved our distribution modeling capabilities, showing promise for dataset augmentation, image-to-image translation and feature learning.
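
The central quantity has a compact definition (standard notation): instead of averaging the one-dimensional Wasserstein distance over random projections, as in the sliced distance, it takes the worst case over projection directions,

    \max\text{-}\mathrm{SW}(P, Q) \;=\; \max_{\lVert \omega \rVert = 1} W\big( \omega^{\top} P,\; \omega^{\top} Q \big)

where \omega^{\top} P denotes the one-dimensional distribution of samples from P projected onto \omega; in practice the maximizing direction is itself optimized during training.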

Image-to-Image Translation Translation

Zeno++: Robust Fully Asynchronous SGD

1 code implementation ICML 2020 Cong Xie, Sanmi Koyejo, Indranil Gupta

We propose Zeno++, a new robust asynchronous Stochastic Gradient Descent~(SGD) procedure which tolerates Byzantine failures of the workers.
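
A sketch of the score-based filtering idea (constants, names, and the toy loss below are illustrative; the paper's exact acceptance rule may differ): the server scores each asynchronously received gradient against a validation gradient and applies it only if it is estimated to decrease the validation loss.

    import numpy as np

    def accept_update(g, v, gamma=1.0, rho=0.1, eps=0.0):
        # Estimated descent score of worker gradient g, judged against a
        # server-side validation gradient v, with a norm penalty on g.
        score = gamma * float(v @ g) - rho * float(g @ g)
        return score >= gamma * eps

    # toy server loop on the quadratic loss f(w) = ||w||^2 / 2
    w, lr = np.full(4, 5.0), 0.1
    for _ in range(100):
        g = w + np.random.randn(4) * 2.0   # noisy (possibly Byzantine) gradient
        v = w                              # validation gradient of f at w
        if accept_update(g, v):
            w -= lr * g
    print(np.linalg.norm(w))               # filtered SGD drives w toward 0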

Fall of Empires: Breaking Byzantine-tolerant SGD by Inner Product Manipulation

4 code implementations10 Mar 2019 Cong Xie, Sanmi Koyejo, Indranil Gupta

Recently, new defense techniques have been developed to tolerate Byzantine failures for distributed machine learning.

BIG-bench Machine Learning

Asynchronous Federated Optimization

1 code implementation10 Mar 2019 Cong Xie, Sanmi Koyejo, Indranil Gupta

Federated learning enables training on a massive number of edge devices.
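
The server-side update can be sketched as staleness-weighted mixing of asynchronously arriving client models (parameter names and the polynomial staleness function are illustrative choices):

    import numpy as np

    def server_mix(x, x_client, t, tau, alpha=0.6, a=0.5):
        # Mix a client model trained from the server state of round tau
        # into the current round-t server model, downweighting stale updates.
        alpha_t = alpha * (t - tau + 1) ** (-a)   # staleness-adaptive weight
        return (1 - alpha_t) * x + alpha_t * x_client

    x = np.zeros(4)
    for t in range(10):
        tau = max(0, t - np.random.randint(0, 3))   # simulated arrival delay
        x_client = x + 0.1 * np.random.randn(4)     # stand-in for local training
        x = server_mix(x, x_client, t, tau)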

Federated Learning

On the Consistency of Top-k Surrogate Losses

no code implementations ICML 2020 Forest Yang, Sanmi Koyejo

Our analysis continues by showing that previously proposed hinge-like top-k surrogate losses are not top-k calibrated, and suggests that no convex hinge loss is top-k calibrated.

General Classification

A Contextual-bandit-based Approach for Informed Decision-making in Clinical Trials

no code implementations1 Sep 2018 Yogatheesan Varatharajah, Brent Berry, Sanmi Koyejo, Ravishankar Iyer

However, those approaches have failed to account for the variability among participants that is becoming increasingly evident as a result of recent clinical-trial-based studies.

Decision Making reinforcement-learning +2
