Search Results for author: Sewoong Oh

Found 93 papers, 34 papers with code

On the Convergence of Differentially-Private Fine-tuning: To Linearly Probe or to Fully Fine-tune?

no code implementations29 Feb 2024 Shuqi Ke, Charlie Hou, Giulia Fanti, Sewoong Oh

We provide theoretical insights into the convergence of DP fine-tuning within an overparameterized neural network and establish a utility curve that determines the allocation of privacy budget between linear probing and full fine-tuning.

Privacy-Preserving Instructions for Aligning Large Language Models

no code implementations21 Feb 2024 Da Yu, Peter Kairouz, Sewoong Oh, Zheng Xu

Service providers of large language model (LLM) applications collect user instructions in the wild and use them in further aligning LLMs with users' intentions.

Language Modelling Large Language Model +1

DeepPolar: Inventing Nonlinear Large-Kernel Polar Codes via Deep Learning

no code implementations14 Feb 2024 S Ashwin Hebbar, Sravan Kumar Ankireddy, Hyeji Kim, Sewoong Oh, Pramod Viswanath

Polar codes, developed on the foundation of Arikan's polarization kernel, represent a breakthrough in coding theory and have emerged as the state-of-the-art error-correcting codes in the short-to-medium block length regime.
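
For context, the linear polar codes that DeepPolar generalizes are built from Kronecker powers of Arikan's 2x2 kernel. A minimal encoding sketch (frozen-bit selection and the bit-reversal permutation are omitted, so this is illustrative rather than a standards-compliant encoder):

```python
import numpy as np

def polar_encode(u):
    """Encode a length-2**n bit vector with G = F^(kron n), F = [[1,0],[1,1]], over GF(2)."""
    N = len(u)
    F = np.array([[1, 0], [1, 1]], dtype=np.uint8)
    G = np.array([[1]], dtype=np.uint8)
    while G.shape[0] < N:              # build F^(kron n) by repeated Kronecker products
        G = np.kron(G, F)
    return (u @ G) % 2                 # mod-2 arithmetic: codeword of length N

print(polar_encode(np.array([1, 0, 1, 1, 0, 0, 1, 0], dtype=np.uint8)))
```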

DPZero: Private Fine-Tuning of Language Models without Backpropagation

no code implementations14 Oct 2023 Liang Zhang, Bingcong Li, Kiran Koshy Thekumparampil, Sewoong Oh, Niao He

The widespread practice of fine-tuning large language models (LLMs) on domain-specific data faces two major challenges in memory and privacy.
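
The title points to zeroth-order (backpropagation-free) optimization. Below is a minimal sketch of the generic two-point gradient estimator such methods build on; it is not DPZero itself, and a private variant would additionally clip and randomize the scalar update (all names here are illustrative):

```python
import torch

def zo_grad(loss_fn, theta, mu=1e-3):
    """Two-point zeroth-order gradient estimate: two forward passes, no backprop."""
    z = torch.randn_like(theta)            # random probe direction
    lp = loss_fn(theta + mu * z)           # loss at theta + mu*z
    lm = loss_fn(theta - mu * z)           # loss at theta - mu*z
    return ((lp - lm) / (2 * mu)) * z      # finite-difference directional derivative

# Toy usage on a quadratic; DPZero-style methods apply this idea to LLM fine-tuning.
theta = torch.zeros(10)
for _ in range(300):
    theta = theta - 0.02 * zo_grad(lambda p: (p - 1.0).pow(2).sum(), theta)
print(theta.mean())                        # approaches 1.0
```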

Profit: Benchmarking Personalization and Robustness Trade-off in Federated Prompt Tuning

no code implementations6 Oct 2023 Liam Collins, Shanshan Wu, Sewoong Oh, Khe Chai Sim

In many applications of federated learning (FL), clients desire models that are personalized using their local data, yet are also robust in the sense that they retain general global knowledge.

Benchmarking Federated Learning +2

Private Federated Learning with Autotuned Compression

1 code implementation20 Jul 2023 Enayat Ullah, Christopher A. Choquette-Choo, Peter Kairouz, Sewoong Oh

We propose new techniques for reducing communication in private federated learning without the need for setting or tuning compression rates.

Federated Learning

Why Is Public Pretraining Necessary for Private Model Training?

no code implementations19 Feb 2023 Arun Ganesh, Mahdi Haghifam, Milad Nasr, Sewoong Oh, Thomas Steinke, Om Thakkar, Abhradeep Thakurta, Lun Wang

To explain this phenomenon, we hypothesize that the non-convex loss landscape of model training necessitates an optimization algorithm that goes through two phases.

Transfer Learning

One-shot Empirical Privacy Estimation for Federated Learning

no code implementations6 Feb 2023 Galen Andrew, Peter Kairouz, Sewoong Oh, Alina Oprea, H. Brendan McMahan, Vinith M. Suriyakumar

Privacy estimation techniques for differentially private (DP) algorithms are useful for comparing against analytical bounds, or for empirically measuring privacy loss in settings where known analytical bounds are not tight.

Federated Learning

Near Optimal Private and Robust Linear Regression

no code implementations30 Jan 2023 Xiyang Liu, Prateek Jain, Weihao Kong, Sewoong Oh, Arun Sai Suggala

Under label-corruption, this is the first efficient linear regression algorithm to guarantee both $(\varepsilon,\delta)$-DP and robustness.

regression

Machine Learning-Aided Efficient Decoding of Reed-Muller Subcodes

no code implementations16 Jan 2023 Mohammad Vahid Jamali, Xiyang Liu, Ashok Vardhan Makkuva, Hessam Mahdavifar, Sewoong Oh, Pramod Viswanath

Next, we derive the soft-decision based version of our algorithm, called soft-subRPA, that not only improves upon the performance of subRPA but also enables a differentiable decoding algorithm.

Sequence-to-sequence translation from mass spectra to peptides with a transformer model

2 code implementations bioRxiv 2023 Melih Yilmaz, William E. Fondrie, Wout Bittremieux, Rowan Nelson, Varun Ananth, Sewoong Oh, William Stafford Noble

A fundamental challenge for any mass spectrometry-based proteomics experiment is the identification of the peptide that generated each acquired tandem mass spectrum.

de novo peptide sequencing

MAUVE Scores for Generative Models: Theory and Practice

1 code implementation30 Dec 2022 Krishna Pillutla, Lang Liu, John Thickstun, Sean Welleck, Swabha Swayamdipta, Rowan Zellers, Sewoong Oh, Yejin Choi, Zaid Harchaoui

We present MAUVE, a family of comparison measures between pairs of distributions such as those encountered in the generative modeling of text or images.

Quantization
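
A minimal sketch of the MAUVE recipe as described above: embed samples from both distributions, quantize jointly (k-means), and summarize the KL divergence frontier between the resulting histograms. The released `mauve-text` package is the reference implementation; the constants below (c, k) are illustrative defaults, not the paper's.

```python
import numpy as np
from sklearn.cluster import KMeans

def mauve_sketch(p_feats, q_feats, k=16, c=5.0, n_lambdas=50):
    """Quantize both embedding sets with joint k-means, then report the area
    under the KL divergence frontier between the two histograms (in [0, 1])."""
    km = KMeans(n_clusters=k, n_init=10).fit(np.vstack([p_feats, q_feats]))
    def hist(feats):
        h = np.bincount(km.predict(feats), minlength=k) + 1e-8   # smoothed counts
        return h / h.sum()
    p, q = hist(p_feats), hist(q_feats)
    kl = lambda a, b: float(np.sum(a * np.log(a / b)))
    pts = []
    for lam in np.linspace(1e-3, 1 - 1e-3, n_lambdas):
        r = lam * p + (1 - lam) * q                              # mixture along the frontier
        pts.append((np.exp(-c * kl(q, r)), np.exp(-c * kl(p, r))))
    pts.sort()
    xs, ys = np.array(pts).T
    return float(np.sum(np.diff(xs) * (ys[1:] + ys[:-1]) / 2))   # trapezoid area

# Toy usage with random "embeddings"; real use embeds text with a language model.
rng = np.random.default_rng(0)
print(mauve_sketch(rng.normal(0, 1, (500, 8)), rng.normal(0.5, 1, (500, 8))))
```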

Learning to Generate Image Embeddings with User-level Differential Privacy

1 code implementation CVPR 2023 Zheng Xu, Maxwell Collins, Yuxiao Wang, Liviu Panait, Sewoong Oh, Sean Augenstein, Ting Liu, Florian Schroff, H. Brendan McMahan

Small on-device models have been successfully trained with user-level differential privacy (DP) for next word prediction and image classification tasks in the past.

Federated Learning Image Classification

Zonotope Domains for Lagrangian Neural Network Verification

no code implementations14 Oct 2022 Matt Jordan, Jonathan Hayase, Alexandros G. Dimakis, Sewoong Oh

Neural network verification aims to provide provable bounds for the output of a neural network for a given input range.

Few-shot Backdoor Attacks via Neural Tangent Kernels

no code implementations12 Oct 2022 Jonathan Hayase, Sewoong Oh

In a backdoor attack, an attacker injects corrupted examples into the training set.

Backdoor Attack Bilevel Optimization

Stochastic optimization on matrices and a graphon McKean-Vlasov limit

no code implementations2 Oct 2022 Zaid Harchaoui, Sewoong Oh, Soumik Pal, Raghav Somani, Raghavendra Tripathi

The limiting curve of graphons is characterized by a family of stochastic differential equations with reflections and can be thought of as an extension of the classical McKean-Vlasov limit for interacting diffusions.

Stochastic Optimization

CRISP: Curriculum based Sequential Neural Decoders for Polar Code Family

1 code implementation1 Oct 2022 S Ashwin Hebbar, Viraj Nadkarni, Ashok Vardhan Makkuva, Suma Bhat, Sewoong Oh, Pramod Viswanath

We design a principled curriculum, guided by information-theoretic insights, to train CRISP and show that it outperforms the successive-cancellation (SC) decoder and attains near-optimal reliability performance on the Polar(32, 16) and Polar(64, 22) codes.

Quality Not Quantity: On the Interaction between Dataset Design and Robustness of CLIP

1 code implementation10 Aug 2022 Thao Nguyen, Gabriel Ilharco, Mitchell Wortsman, Sewoong Oh, Ludwig Schmidt

Web-crawled datasets have enabled remarkable generalization capabilities in recent image-text models such as CLIP (Contrastive Language-Image pre-training) or Flamingo, but little is known about the dataset creation processes.

Bring Your Own Algorithm for Optimal Differentially Private Stochastic Minimax Optimization

no code implementations1 Jun 2022 Liang Zhang, Kiran Koshy Thekumparampil, Sewoong Oh, Niao He

We provide a general framework for solving differentially private stochastic minimax optimization (DP-SMO) problems, which enables practitioners to bring their own base optimization algorithm and use it as a black-box to obtain the near-optimal privacy-loss trade-off.

DP-PCA: Statistically Optimal and Differentially Private PCA

no code implementations27 May 2022 Xiyang Liu, Weihao Kong, Prateek Jain, Sewoong Oh

For sub-Gaussian data, we provide nearly optimal statistical error rates even for $n=\tilde O(d)$.

Towards a Defense Against Federated Backdoor Attacks Under Continuous Training

1 code implementation24 May 2022 Shuaiqi Wang, Jonathan Hayase, Giulia Fanti, Sewoong Oh

We propose shadow learning, a framework for defending against backdoor attacks in the FL setting under long-range training.

Continual Learning Federated Learning

MAML and ANIL Provably Learn Representations

no code implementations7 Feb 2022 Liam Collins, Aryan Mokhtari, Sewoong Oh, Sanjay Shakkottai

Recent empirical evidence has driven conventional wisdom to believe that gradient-based meta-learning (GBML) methods perform well at few-shot learning because they learn an expressive data representation that is shared across tasks.

Few-Shot Learning Representation Learning

Lifted Primal-Dual Method for Bilinearly Coupled Smooth Minimax Optimization

no code implementations19 Jan 2022 Kiran Koshy Thekumparampil, Niao He, Sewoong Oh

We also provide a direct single-loop algorithm, using the LPD method, that achieves the iteration complexity of $O(\sqrt{\frac{L_x}{\varepsilon}} + \frac{\|A\|}{\sqrt{\mu_y \varepsilon}} + \sqrt{\frac{L_y}{\varepsilon}})$.
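
For reference, the smoothness and strong-convexity constants in this rate refer to the standard bilinearly coupled formulation (standard notation, matching the bound above):

```latex
% f is convex and L_x-smooth, g is mu_y-strongly convex and L_y-smooth,
% and A is the bilinear coupling matrix:
\min_{x \in \mathbb{R}^m} \; \max_{y \in \mathbb{R}^n} \;\; f(x) \;+\; \langle A x,\, y \rangle \;-\; g(y)
```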

Statistically and Computationally Efficient Linear Meta-representation Learning

no code implementations NeurIPS 2021 Kiran K. Thekumparampil, Prateek Jain, Praneeth Netrapalli, Sewoong Oh

To cope with such data scarcity, meta-representation learning methods train across many related tasks to find a shared (lower-dimensional) representation of the data where all tasks can be solved accurately.

Few-Shot Learning Representation Learning

Gradient flows on graphons: existence, convergence, continuity equations

no code implementations18 Nov 2021 Sewoong Oh, Soumik Pal, Raghav Somani, Raghavendra Tripathi

Wasserstein gradient flows on probability measures have found a host of applications in various optimization problems.

Differential privacy and robust statistics in high dimensions

no code implementations12 Nov 2021 Xiyang Liu, Weihao Kong, Sewoong Oh

The key insight is that if we design an exponential mechanism that accesses the data only via one-dimensional robust statistics, then the resulting local sensitivity can be dramatically reduced.

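For reference, the exponential mechanism invoked here selects an output with probability weighted by a utility score (the standard definition; notation is not from this paper):

```latex
% Exponential mechanism with utility u(D, theta) and sensitivity Delta u:
\Pr[\mathcal{M}(D) = \theta] \;\propto\; \exp\!\left( \frac{\varepsilon \, u(D, \theta)}{2 \, \Delta u} \right)
% Accessing the data only through one-dimensional robust statistics keeps the
% relevant (local) sensitivity, and hence the noise scale, small.
```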

Gradient Inversion with Generative Image Prior

1 code implementation NeurIPS 2021 Jinwoo Jeon, Jaechang Kim, Kangwook Lee, Sewoong Oh, Jungseul Ok

Federated Learning (FL) is a distributed learning framework in which the local data never leaves clients' devices, to preserve privacy, and the server trains models via accessing only the gradients of those local data.

Federated Learning

KO codes: Inventing Nonlinear Encoding and Decoding for Reliable Wireless Communication via Deep-learning

1 code implementation29 Aug 2021 Ashok Vardhan Makkuva, Xiyang Liu, Mohammad Vahid Jamali, Hessam Mahdavifar, Sewoong Oh, Pramod Viswanath

In this paper, we construct KO codes, a computationally efficient family of deep-learning driven (encoder, decoder) pairs that outperform the state-of-the-art reliability performance on the standardized AWGN channel.

Benchmarking

FedChain: Chained Algorithms for Near-Optimal Communication Cost in Federated Learning

no code implementations ICLR 2022 Charlie Hou, Kiran K. Thekumparampil, Giulia Fanti, Sewoong Oh

We propose FedChain, an algorithmic framework that combines the strengths of local methods and global methods to achieve fast convergence in terms of the number of communication rounds R, while leveraging the similarity between clients.

Federated Learning Image Classification

Sample Efficient Linear Meta-Learning by Alternating Minimization

no code implementations18 May 2021 Kiran Koshy Thekumparampil, Prateek Jain, Praneeth Netrapalli, Sewoong Oh

We show that, for a constant subspace dimension, MLLAM obtains nearly-optimal estimation error despite requiring only $\Omega(\log d)$ samples per task.

Meta-Learning

SPECTRE: Defending Against Backdoor Attacks Using Robust Statistics

1 code implementation22 Apr 2021 Jonathan Hayase, Weihao Kong, Raghav Somani, Sewoong Oh

There have been promising attempts to use the intermediate representations of such a model to separate corrupted examples from clean ones.

Robust and Differentially Private Mean Estimation

1 code implementation NeurIPS 2021 Xiyang Liu, Weihao Kong, Sham Kakade, Sewoong Oh

In statistical learning and analysis from shared data, which is increasingly widely adopted in platforms such as federated learning and meta-learning, there are two major concerns: privacy and robustness.

Federated Learning Meta-Learning

Efficient Algorithms for Federated Saddle Point Optimization

no code implementations12 Feb 2021 Charlie Hou, Kiran K. Thekumparampil, Giulia Fanti, Sewoong Oh

Our goal is to design an algorithm that can harness the benefit of similarity in the clients while recovering the Minibatch Mirror-prox performance under arbitrary heterogeneity (up to log factors).

Eliminating Sharp Minima from SGD with Truncated Heavy-tailed Noise

no code implementations ICLR 2022 Xingyu Wang, Sewoong Oh, Chang-Han Rhee

The empirical success of deep learning is often attributed to SGD's mysterious ability to avoid sharp local minima in the loss landscape, as sharp minima are known to lead to poor generalization.

Reed-Muller Subcodes: Machine Learning-Aided Design of Efficient Soft Recursive Decoding

no code implementations2 Feb 2021 Mohammad Vahid Jamali, Xiyang Liu, Ashok Vardhan Makkuva, Hessam Mahdavifar, Sewoong Oh, Pramod Viswanath

To lower the complexity of our decoding algorithm, referred to as subRPA in this paper, we investigate different ways for pruning the projections.

Information Theory

Projection Efficient Subgradient Method and Optimal Nonsmooth Frank-Wolfe Method

no code implementations NeurIPS 2020 Kiran Koshy Thekumparampil, Prateek Jain, Praneeth Netrapalli, Sewoong Oh

Further, instead of a PO, if we only have a linear minimization oracle (LMO, à la Frank-Wolfe) to access the constraint set, an extension of our method, MOLES, finds a feasible $\epsilon$-suboptimal solution using $O(\epsilon^{-2})$ LMO calls and FO calls; both match known lower bounds, resolving a question left open since White (1993).

Deepcode and Modulo-SK are Designed for Different Settings

no code implementations18 Aug 2020 Hyeji Kim, Yihan Jiang, Sreeram Kannan, Sewoong Oh, Pramod Viswanath

Deepcode is designed and evaluated for the AWGN channel with (potentially delayed) uncoded output feedback.

Robust Meta-learning for Mixed Linear Regression with Small Batches

no code implementations NeurIPS 2020 Weihao Kong, Raghav Somani, Sham Kakade, Sewoong Oh

Together, this approach is robust against outliers and achieves a graceful statistical trade-off; the lack of $\Omega(k^{1/2})$-size tasks can be compensated for with smaller tasks, which can now be as small as $O(\log k)$.

Meta-Learning regression

Meta-learning for mixed linear regression

no code implementations ICML 2020 Weihao Kong, Raghav Somani, Zhao Song, Sham Kakade, Sewoong Oh

In modern supervised learning, there are a large number of tasks, but many of them are associated with only a small amount of labeled data.

Meta-Learning regression +1

Minimax Optimal Estimation of Approximate Differential Privacy on Neighboring Databases

1 code implementation NeurIPS 2019 Xiyang Liu, Sewoong Oh

We pose it as a property estimation problem, and study the fundamental trade-offs involved in the accuracy in estimated privacy guarantees and the number of samples required.

Turbo Autoencoder: Deep learning based channel codes for point-to-point communication channels

1 code implementation NeurIPS 2019 Yihan Jiang, Hyeji Kim, Himanshu Asnani, Sreeram Kannan, Sewoong Oh, Pramod Viswanath

Designing codes that combat the noise in a communication medium has remained a significant area of research in information theory as well as wireless communications.

Towards Principled Objectives for Contrastive Disentanglement

no code implementations25 Sep 2019 Anwesa Choudhuri, Ashok Vardhan Makkuva, Ranvir Rana, Sewoong Oh, Girish Chowdhary, Alexander Schwing

In fact, contrastive disentanglement and unsupervised recovery are often combined in that we seek additional variations that exhibit salient factors/properties.

Disentanglement

Optimal transport mapping via input convex neural networks

2 code implementations ICML 2020 Ashok Vardhan Makkuva, Amirhossein Taghvaei, Sewoong Oh, Jason D. Lee

Building upon recent advances in the field of input convex neural networks, we propose a new framework where the gradient of one convex function represents the optimal transport mapping.
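
A minimal PyTorch sketch of an input-convex network whose gradient serves as the transport map, in the spirit of the framework described above (the architecture and the minimax training loop are simplified, not the authors' exact setup):

```python
import torch
import torch.nn as nn
import torch.nn.functional as F

class ICNN(nn.Module):
    """Input-convex scalar potential: convex in x because the z-path weights are
    clamped non-negative and softplus is convex and non-decreasing."""
    def __init__(self, dim, hidden=64, depth=3):
        super().__init__()
        self.Wx = nn.ModuleList([nn.Linear(dim, hidden) for _ in range(depth)])
        self.Wz = nn.ModuleList([nn.Linear(hidden, hidden, bias=False) for _ in range(depth - 1)])
        self.out = nn.Linear(hidden, 1)

    def forward(self, x):
        z = F.softplus(self.Wx[0](x))
        for wx, wz in zip(self.Wx[1:], self.Wz):
            z = F.softplus(wx(x) + F.linear(z, wz.weight.clamp(min=0)))
        return F.linear(z, self.out.weight.clamp(min=0), self.out.bias)

# The learned transport map is the gradient of the convex potential f:
f = ICNN(dim=2)
x = torch.randn(5, 2, requires_grad=True)
T = torch.autograd.grad(f(x).sum(), x, create_graph=True)[0]   # T(x) = grad_x f(x)
```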

Efficient Algorithms for Smooth Minimax Optimization

2 code implementations NeurIPS 2019 Kiran Koshy Thekumparampil, Prateek Jain, Praneeth Netrapalli, Sewoong Oh

This paper studies first order methods for solving smooth minimax optimization problems $\min_x \max_y g(x, y)$ where $g(\cdot,\cdot)$ is smooth and $g(x,\cdot)$ is concave for each $x$.
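
To make the problem class concrete, here is a toy instance solved by plain gradient descent-ascent, the simplest baseline; this sketch is illustrative and is not the paper's accelerated algorithm:

```python
# Toy instance of min_x max_y g(x, y) with g(x, y) = 0.5*x**2 + x*y - 0.5*y**2:
# smooth in both arguments and concave in y. Plain gradient descent-ascent.
x, y, eta = 1.0, 1.0, 0.1
for _ in range(200):
    gx = x + y        # d g / d x
    gy = x - y        # d g / d y
    x -= eta * gx     # descend in x
    y += eta * gy     # ascend in y
print(x, y)           # converges toward the saddle point (0, 0)
```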

InfoGAN-CR and ModelCentrality: Self-supervised Model Training and Selection for Disentangling GANs

1 code implementation14 Jun 2019 Zinan Lin, Kiran Koshy Thekumparampil, Giulia Fanti, Sewoong Oh

Disentangled generative models map a latent code vector to a target space, while enforcing that a subset of the learned latent codes are interpretable and associated with distinct properties of the target distribution.

Disentanglement Model Selection

Robust conditional GANs under missing or uncertain labels

no code implementations9 Jun 2019 Kiran Koshy Thekumparampil, Sewoong Oh, Ashish Khetan

Matching the performance of conditional Generative Adversarial Networks with little supervision is an important task, especially in venturing into new domains.

Learning in Gated Neural Networks

no code implementations6 Jun 2019 Ashok Vardhan Makkuva, Sewoong Oh, Sreeram Kannan, Pramod Viswanath

Gating is a key feature in modern neural networks including LSTMs, GRUs and sparsely-gated deep neural networks.

Minimax Rates of Estimating Approximate Differential Privacy

2 code implementations24 May 2019 Xiyang Liu, Sewoong Oh

We pose it as a property estimation problem, and study the fundamental trade-offs involved in the accuracy in estimated privacy guarantees and the number of samples required.

Information Theory

DeepTurbo: Deep Turbo Decoder

1 code implementation6 Mar 2019 Yihan Jiang, Hyeji Kim, Himanshu Asnani, Sreeram Kannan, Sewoong Oh, Pramod Viswanath

We focus on Turbo codes and propose DeepTurbo, a novel deep learning based architecture for Turbo decoding.

Number of Connected Components in a Graph: Estimation via Counting Patterns

no code implementations1 Dec 2018 Ashish Khetan, Harshay Shah, Sewoong Oh

This representation is crucial in introducing a novel estimator for the number of connected components for general graphs, under the knowledge of the spectral gap of the original graph.

LEARN Codes: Inventing Low-latency Codes via Recurrent Neural Networks

1 code implementation30 Nov 2018 Yihan Jiang, Hyeji Kim, Himanshu Asnani, Sreeram Kannan, Sewoong Oh, Pramod Viswanath

Designing channel codes under low-latency constraints is one of the most demanding requirements in 5G standards.

Robustness of Conditional GANs to Noisy Labels

2 code implementations NeurIPS 2018 Kiran Koshy Thekumparampil, Ashish Khetan, Zinan Lin, Sewoong Oh

When the distribution of the noise is known, we introduce a novel architecture which we call Robust Conditional GAN (RCGAN).

Rate Distortion For Model Compression: From Theory To Practice

no code implementations9 Oct 2018 Weihao Gao, Yu-Han Liu, Chong Wang, Sewoong Oh

Theoretically, we prove that the proposed scheme is optimal for compressing one-hidden-layer ReLU neural networks.

Data Compression Model Compression +1

Learning One-hidden-layer Neural Networks under General Input Distributions

no code implementations9 Oct 2018 Weihao Gao, Ashok Vardhan Makkuva, Sewoong Oh, Pramod Viswanath

Significant advances have been made recently on training neural networks, where the main challenge is in solving an optimization problem with abundant critical points.

Deepcode: Feedback Codes via Deep Learning

1 code implementation NeurIPS 2018 Hyeji Kim, Yihan Jiang, Sreeram Kannan, Sewoong Oh, Pramod Viswanath

The design of codes for communicating reliably over a statistically well defined channel is an important endeavor involving deep mathematical research and wide-ranging practical applications.

Communication Algorithms via Deep Learning

3 code implementations ICLR 2018 Hyeji Kim, Yihan Jiang, Ranvir Rana, Sreeram Kannan, Sewoong Oh, Pramod Viswanath

We show that creatively designed and trained RNN architectures can decode well known sequential codes such as the convolutional and turbo codes with close to optimal performance on the additive white Gaussian noise (AWGN) channel, which itself is achieved by breakthrough algorithms of our times (Viterbi and BCJR decoders, representing dynamic programming and forward-backward algorithms).

Attention-based Graph Neural Network for Semi-supervised Learning

1 code implementation ICLR 2018 Kiran K. Thekumparampil, Chong Wang, Sewoong Oh, Li-Jia Li

Recently popularized graph neural networks achieve the state-of-the-art accuracy on a number of standard benchmark datasets for graph-based semi-supervised learning, improving significantly over existing approaches.

Graph Regression

Breaking the gridlock in Mixture-of-Experts: Consistent and Efficient Algorithms

no code implementations21 Feb 2018 Ashok Vardhan Makkuva, Sewoong Oh, Sreeram Kannan, Pramod Viswanath

Once the experts are known, the recovery of gating parameters still requires an EM algorithm; however, we show that the EM algorithm for this simplified problem, unlike the joint EM algorithm, converges to the true parameters.

Ensemble Learning

PacGAN: The power of two samples in generative adversarial networks

7 code implementations NeurIPS 2018 Zinan Lin, Ashish Khetan, Giulia Fanti, Sewoong Oh

Generative adversarial networks (GANs) are innovative techniques for learning generative models of complex data distributions from samples.

Two-sample testing
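
PacGAN's core change is small enough to sketch: the discriminator scores m samples jointly ("packing"), drawn all-real or all-generated, which provably penalizes mode collapse. A minimal PyTorch sketch with illustrative shapes:

```python
import torch
import torch.nn as nn

class PackedDiscriminator(nn.Module):
    """Discriminator that judges a pack of m i.i.d. samples at once."""
    def __init__(self, dim, m=2, hidden=128):
        super().__init__()
        self.net = nn.Sequential(nn.Linear(dim * m, hidden), nn.ReLU(), nn.Linear(hidden, 1))

    def forward(self, x):              # x: (batch, m, dim), m samples per pack
        return self.net(x.flatten(1))  # concatenate the pack, then score it

D = PackedDiscriminator(dim=10, m=2)
real_pack = torch.randn(32, 2, 10)     # packs are all-real or all-fake, never mixed
scores = D(real_pack)
```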

Optimal Sample Complexity of M-wise Data for Top-K Ranking

no code implementations NeurIPS 2017 Minje Jang, Sunghyun Kim, Changho Suh, Sewoong Oh

As our main result, we characterize the minimax optimal sample size for top-$K$ ranking.

Matrix Norm Estimation from a Few Entries

1 code implementation NeurIPS 2017 Ashish Khetan, Sewoong Oh

This paper focuses on the technical challenges in accurately estimating the Schatten norms from a sampling of a matrix.

Collaborative Filtering Matrix Completion

Estimating Mutual Information for Discrete-Continuous Mixtures

1 code implementation NeurIPS 2017 Weihao Gao, Sreeram Kannan, Sewoong Oh, Pramod Viswanath

We provide numerical experiments suggesting that the proposed estimator outperforms two natural heuristics: adding small continuous noise to all samples and applying standard estimators tailored for purely continuous variables, or quantizing the samples and applying standard estimators tailored for purely discrete variables.

Clustering Mutual Information Estimation

Learning from Comparisons and Choices

no code implementations24 Apr 2017 Sahand Negahban, Sewoong Oh, Kiran K. Thekumparampil, Jiaming Xu

This also allows one to compute similarities among users and items to be used for categorization and search.

Marketing Recommendation Systems

Spectrum Estimation from a Few Entries

no code implementations18 Mar 2017 Ashish Khetan, Sewoong Oh

We propose first estimating the Schatten $k$-norms of a matrix, and then applying Chebyshev approximation to the spectral sum function or applying moment matching in Wasserstein distance to recover the singular values.

Collaborative Filtering Matrix Completion
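
For reference, the quantities being estimated (standard definitions, with notation mine; the sampling-based estimator itself is the paper's contribution):

```latex
% Schatten k-norm of M with singular values sigma_1 >= ... >= sigma_r:
\|M\|_k \;=\; \Big( \sum_{i=1}^{r} \sigma_i^{\,k} \Big)^{1/k},
\qquad
% and a spectral sum function, approximated from the moments \|M\|_k^k:
\mathrm{SS}_f(M) \;=\; \sum_{i=1}^{r} f(\sigma_i).
```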

Iterative Bayesian Learning for Crowdsourced Regression

no code implementations28 Feb 2017 Jungseul Ok, Sewoong Oh, Yunhun Jang, Jinwoo Shin, Yung Yi

Crowdsourcing platforms have emerged as popular venues for purchasing human intelligence at low cost for large volumes of tasks.

regression

Breaking the Bandwidth Barrier: Geometrical Adaptive Entropy Estimation

no code implementations NeurIPS 2016 Weihao Gao, Sewoong Oh, Pramod Viswanath

In this paper, we combine both of these approaches to design new estimators of entropy and mutual information that outperform state-of-the-art methods.

Computational and Statistical Tradeoffs in Learning to Rank

no code implementations NeurIPS 2016 Ashish Khetan, Sewoong Oh

For massive and heterogeneous modern datasets, it is of fundamental interest to provide guarantees on the accuracy of estimation when computational resources are limited.

Learning-To-Rank

Demystifying Fixed k-Nearest Neighbor Information Estimators

1 code implementation11 Apr 2016 Weihao Gao, Sewoong Oh, Pramod Viswanath

In this paper we demonstrate that the estimator is consistent and also identify an upper bound on the rate of convergence of the bias as a function of the number of samples.
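
A minimal sketch of the classical fixed-k nearest-neighbor (Kozachenko-Leonenko) entropy estimator that this line of work analyzes:

```python
import numpy as np
from scipy.special import digamma, gammaln
from scipy.spatial import cKDTree

def kl_entropy(x, k=3):
    """Kozachenko-Leonenko k-NN differential entropy estimate (in nats)."""
    n, d = x.shape
    r = cKDTree(x).query(x, k=k + 1)[0][:, -1]             # distance to k-th neighbor (self excluded)
    log_vd = (d / 2) * np.log(np.pi) - gammaln(d / 2 + 1)  # log volume of the unit d-ball
    return digamma(n) - digamma(k) + log_vd + d * np.mean(np.log(r))

# Sanity check: a standard 2-D Gaussian has entropy log(2*pi*e) ~ 2.838 nats.
x = np.random.default_rng(0).standard_normal((5000, 2))
print(kl_entropy(x))
```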

Top-$K$ Ranking from Pairwise Comparisons: When Spectral Ranking is Optimal

no code implementations14 Mar 2016 Minje Jang, Sunghyun Kim, Changho Suh, Sewoong Oh

First, in a general comparison model where item pairs to compare are given a priori, we attain an upper and lower bound on the sample size for reliable recovery of the top-$K$ ranked items.

Achieving Budget-optimality with Adaptive Schemes in Crowdsourcing

no code implementations NeurIPS 2016 Ashish Khetan, Sewoong Oh

Under this generalized Dawid-Skene model, we characterize the fundamental trade-off between budget and accuracy.

Conditional Dependence via Shannon Capacity: Axioms, Estimators and Applications

no code implementations10 Feb 2016 Weihao Gao, Sreeram Kannan, Sewoong Oh, Pramod Viswanath

We conduct an axiomatic study of the problem of estimating the strength of a known causal relationship between a pair of variables.

Data-driven Rank Breaking for Efficient Rank Aggregation

no code implementations21 Jan 2016 Ashish Khetan, Sewoong Oh

Rank aggregation systems collect ordinal preferences from individuals to produce a global ranking that represents the social preference.

Secure Multi-party Differential Privacy

no code implementations NeurIPS 2015 Peter Kairouz, Sewoong Oh, Pramod Viswanath

In this setting, each party is interested in computing a function on its private bit and all the other parties' bits.

Collaboratively Learning Preferences from Ordinal Data

no code implementations NeurIPS 2015 Sewoong Oh, Kiran K. Thekumparampil, Jiaming Xu

In order to predict the preferences, we want to learn the underlying model from noisy observations of the low-rank matrix, collected as revealed preferences in various forms of ordinal data.

Collaborative Ranking Management +1

Spy vs. Spy: Rumor Source Obfuscation

no code implementations29 Dec 2014 Giulia Fanti, Peter Kairouz, Sewoong Oh, Pramod Viswanath

Whether for fear of judgment or personal endangerment, it is crucial to keep anonymous the identity of the user who initially posted a sensitive message.

Learning Mixed Multinomial Logit Model from Ordinal Data

no code implementations NeurIPS 2014 Sewoong Oh, Devavrat Shah

In the case of single MNL models (no mixture), computationally and statistically tractable learning from pair-wise comparisons is feasible.

Management Tensor Decomposition

Minimax-optimal Inference from Partial Rankings

no code implementations NeurIPS 2014 Bruce Hajek, Sewoong Oh, Jiaming Xu

For a given assignment of items to users, we first derive an oracle lower bound of the estimation error that holds even for the more general Thurstone models.

Provable Tensor Factorization with Missing Data

1 code implementation NeurIPS 2014 Prateek Jain, Sewoong Oh

We show that under certain standard assumptions, our method can recover a three-mode $n\times n\times n$ dimensional rank-$r$ tensor exactly from $O(n^{3/2} r^5 \log^4 n)$ randomly sampled entries.

Learning Mixtures of Discrete Product Distributions using Spectral Decompositions

no code implementations12 Nov 2013 Prateek Jain, Sewoong Oh

The main challenge in learning mixtures of discrete product distributions is that these low-rank tensors cannot be obtained directly from the sample moments.

Matrix Completion Recommendation Systems

The Composition Theorem for Differential Privacy

no code implementations4 Nov 2013 Peter Kairouz, Sewoong Oh, Pramod Viswanath

Sequential querying of differentially private mechanisms degrades the overall privacy level.

Data Structures and Algorithms Cryptography and Security Information Theory
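
For context, the two standard composition bounds that the paper's exact (optimal) characterization tightens:

```latex
% Basic composition: k mechanisms, each (eps, delta)-DP, together satisfy
(k\varepsilon,\; k\delta)\text{-DP}.
% Advanced composition (Dwork-Rothblum-Vadhan): for any delta' > 0, they satisfy
\Big( \varepsilon\sqrt{2k\ln(1/\delta')} + k\varepsilon(e^{\varepsilon}-1),\;\; k\delta + \delta' \Big)\text{-DP}.
```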

Iterative ranking from pair-wise comparisons

no code implementations NeurIPS 2012 Sahand Negahban, Sewoong Oh, Devavrat Shah

In most settings, in addition to obtaining a ranking, finding ‘scores’ for each object (e.g., a player’s rating) is of interest for understanding the intensity of the preferences.

Rank Centrality: Ranking from Pair-wise Comparisons

no code implementations8 Sep 2012 Sahand Negahban, Sewoong Oh, Devavrat Shah

To study the efficacy of the algorithm, we consider the popular Bradley-Terry-Luce (BTL) model (equivalent to the Multinomial Logit (MNL) for pair-wise comparisons) in which each object has an associated score which determines the probabilistic outcomes of pair-wise comparisons between objects.
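
A minimal sketch of the Rank Centrality idea: build a random walk that drifts toward items that win their comparisons, and read scores off its stationary distribution (a toy implementation with my own conventions):

```python
import numpy as np

def rank_centrality(A, d_max=None):
    """Scores from pairwise comparisons. A[i, j] = fraction of comparisons
    between i and j that j won (A[i, j] + A[j, i] = 1 where compared, else 0)."""
    n = A.shape[0]
    d = d_max or max(1, int((A > 0).sum(axis=1).max()))  # normalize by max comparison degree
    P = A / d                                            # walk i -> j w.p. prop. to j's wins over i
    P[np.arange(n), np.arange(n)] = 1 - P.sum(axis=1)    # lazy self-loops keep rows stochastic
    pi = np.ones(n) / n
    for _ in range(1000):                                # power iteration to stationarity
        pi = pi @ P
    return pi / pi.sum()                                 # stationary distribution = BTL scores

# Toy example: item 2 beats item 1 beats item 0 in most comparisons.
A = np.array([[0.0, 0.7, 0.8],
              [0.3, 0.0, 0.6],
              [0.2, 0.4, 0.0]])
print(rank_centrality(A))                                # item 2 gets the highest score
```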

Iterative Learning for Reliable Crowdsourcing Systems

no code implementations NeurIPS 2011 David R. Karger, Sewoong Oh, Devavrat Shah

Crowdsourcing systems, in which tasks are electronically distributed to numerous "information piece-workers", have emerged as an effective paradigm for human-powered solving of large-scale problems in domains such as image classification, data entry, optical character recognition, recommendation, and proofreading.

Image Classification Optical Character Recognition +1
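
The iterative algorithm itself is compact enough to sketch as message passing between tasks and workers; below is a dense-matrix rendering of this style of update for binary tasks (my own toy implementation, not the paper's reference code):

```python
import numpy as np

def iterative_labels(A, iters=15, seed=0):
    """Estimate binary task labels from a task-by-worker answer matrix
    A[i, j] in {+1, -1, 0} (0 = task i not assigned to worker j)."""
    rng = np.random.default_rng(seed)
    mask = A != 0
    Y = rng.normal(1.0, 1.0, A.shape) * mask                  # worker-reliability messages
    for _ in range(iters):
        X = (A * Y).sum(1, keepdims=True) - A * Y             # task messages, excluding recipient
        Y = ((A * X).sum(0, keepdims=True) - A * X) * mask    # worker messages, likewise
        Y /= np.abs(Y).max() + 1e-12                          # rescale; final signs are unaffected
    return np.sign((A * Y).sum(1))

# Toy: 30 tasks, 10 workers of varying accuracy, all tasks assigned to all workers.
rng = np.random.default_rng(1)
truth = rng.choice([-1, 1], 30)
acc = rng.uniform(0.55, 0.95, 10)                             # per-worker accuracy
A = truth[:, None] * np.where(rng.random((30, 10)) < acc, 1, -1)
print((iterative_labels(A) == truth).mean())                  # fraction of labels recovered
```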

Budget-Optimal Task Allocation for Reliable Crowdsourcing Systems

no code implementations17 Oct 2011 David R. Karger, Sewoong Oh, Devavrat Shah

Further, we compare our approach with a more general class of algorithms which can dynamically assign tasks.

Image Classification Optical Character Recognition +1

Matrix Completion from Noisy Entries

1 code implementation NeurIPS 2009 Raghunandan H. Keshavan, Andrea Montanari, Sewoong Oh

Given a matrix M of low-rank, we consider the problem of reconstructing it from noisy observations of a small, random subset of its entries.

Collaborative Filtering Matrix Completion
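
The spectral core of the authors' OptSpace approach is compact enough to sketch: zero-fill the unobserved entries, rescale by the sampling rate, and truncate the SVD. The trimming step and the subsequent refinement over the factor manifold are omitted here:

```python
import numpy as np

def spectral_complete(M_obs, mask, r):
    """Rank-r spectral estimate from observed entries: zero-fill, rescale by the
    inverse sampling rate, then keep the top-r SVD terms."""
    p = mask.mean()                                  # empirical sampling rate
    U, s, Vt = np.linalg.svd(M_obs * mask / p, full_matrices=False)
    return (U[:, :r] * s[:r]) @ Vt[:r]

# Toy: estimate a rank-2 matrix from 50% of its entries (plus small noise).
rng = np.random.default_rng(0)
M = rng.standard_normal((100, 2)) @ rng.standard_normal((2, 100))
mask = rng.random(M.shape) < 0.5
Mh = spectral_complete(M + 0.01 * rng.standard_normal(M.shape), mask, r=2)
print(np.linalg.norm(Mh - M) / np.linalg.norm(M))    # relative error of the spectral estimate
```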

Matrix Completion from a Few Entries

1 code implementation20 Jan 2009 Raghunandan H. Keshavan, Andrea Montanari, Sewoong Oh

In the process of proving these statements, we obtain a generalization of a celebrated result by Friedman-Kahn-Szemeredi and Feige-Ofek on the spectrum of sparse random matrices.

Matrix Completion
