Search Results for author: Jakub Tarnawski

Found 12 papers, 4 papers with code

Efficiently Computing Similarities to Private Datasets

no code implementations13 Mar 2024 Arturs Backurs, Zinan Lin, Sepideh Mahabadi, Sandeep Silwal, Jakub Tarnawski

We abstract out this common subroutine and study the following fundamental algorithmic problem: Given a similarity function $f$ and a large high-dimensional private dataset $X \subset \mathbb{R}^d$, output a differentially private (DP) data structure which approximates $\sum_{x \in X} f(x, y)$ for any query $y$.

Density Estimation Dimensionality Reduction

Fairness in Submodular Maximization over a Matroid Constraint

no code implementations21 Dec 2023 Marwa El Halabi, Jakub Tarnawski, Ashkan Norouzi-Fard, Thuy-Duong Vuong

Submodular maximization over a matroid constraint is a fundamental problem with various applications in machine learning.

Attribute Decision Making +1

Fairness in Streaming Submodular Maximization over a Matroid Constraint

1 code implementation24 May 2023 Marwa El Halabi, Federico Fusco, Ashkan Norouzi-Fard, Jakab Tardos, Jakub Tarnawski

Streaming submodular maximization is a natural model for the task of selecting a representative subset from a large-scale dataset.

Clustering Fairness +1

Near-Optimal Correlation Clustering with Privacy

no code implementations2 Mar 2022 Vincent Cohen-Addad, Chenglin Fan, Silvio Lattanzi, Slobodan Mitrović, Ashkan Norouzi-Fard, Nikos Parotsidis, Jakub Tarnawski

Correlation clustering is a central problem in unsupervised learning, with applications spanning community detection, duplicate detection, automated labelling and many more.

Clustering Community Detection

Harmony: Overcoming the Hurdles of GPU Memory Capacity to Train Massive DNN Models on Commodity Servers

1 code implementation2 Feb 2022 Youjie Li, Amar Phanishayee, Derek Murray, Jakub Tarnawski, Nam Sung Kim

Deep neural networks (DNNs) have grown exponentially in size over the past decade, leaving only those who have massive datacenter-based resources with the ability to develop and train such models.

Correlation Clustering in Constant Many Parallel Rounds

no code implementations15 Jun 2021 Vincent Cohen-Addad, Silvio Lattanzi, Slobodan Mitrović, Ashkan Norouzi-Fard, Nikos Parotsidis, Jakub Tarnawski

Correlation clustering is a central topic in unsupervised learning, with many applications in ML and data mining.

Clustering

Fairness in Streaming Submodular Maximization: Algorithms and Hardness

1 code implementation NeurIPS 2020 Marwa El Halabi, Slobodan Mitrović, Ashkan Norouzi-Fard, Jakab Tardos, Jakub Tarnawski

Submodular maximization has become established as the method of choice for the task of selecting representative and diverse summaries of data.

BIG-bench Machine Learning Clustering +2

Efficient Algorithms for Device Placement of DNN Graph Operators

1 code implementation NeurIPS 2020 Jakub Tarnawski, Amar Phanishayee, Nikhil R. Devanur, Divya Mahajan, Fanny Nina Paravecino

However, for such settings (large models and multiple heterogeneous devices), we require automated algorithms and toolchains that can partition the ML workload across devices.

Beyond $1/2$-Approximation for Submodular Maximization on Massive Data Streams

no code implementations6 Aug 2018 Ashkan Norouzi-Fard, Jakub Tarnawski, Slobodan Mitrović, Amir Zandieh, Aida Mousavifar, Ola Svensson

It is the first low-memory, single-pass algorithm that improves the factor $0. 5$, under the natural assumption that elements arrive in a random order.

Clustering Recommendation Systems

Streaming Robust Submodular Maximization: A Partitioned Thresholding Approach

no code implementations NeurIPS 2017 Slobodan Mitrović, Ilija Bogunovic, Ashkan Norouzi-Fard, Jakub Tarnawski, Volkan Cevher

We study the classical problem of maximizing a monotone submodular function subject to a cardinality constraint k, with two additional twists: (i) elements arrive in a streaming fashion, and (ii) m items from the algorithm's memory are removed after the stream is finished.

Data Summarization

Active Learning and Proofreading for Delineation of Curvilinear Structures

no code implementations23 Dec 2016 Agata Mosinska, Jakub Tarnawski, Pascal Fua

In a proofreading context, we similarly find regions of the resulting reconstruction that should be verified in priority to obtain a nearly-perfect result.

Active Learning General Classification

Cannot find the paper you are looking for? You can Submit a new open access paper.