no code implementations • EMNLP 2021 • Vivek Madan, Ashish Khetan, Zohar Karnin
In this paper, we address the problem for the case when the downstream corpus is too small for additional pre-training.
no code implementations • 22 Jan 2025 • Simone Filice, Guy Horowitz, David Carmel, Zohar Karnin, Liane Lewin-Eytan, Yoelle Maarek
We introduce here DataMorgana, a tool for generating highly customizable and diverse synthetic Q&A benchmarks tailored to RAG applications.
no code implementations • 24 Sep 2024 • Shadi Iskander, Nachshon Cohen, Zohar Karnin, Ori Shapira, Sofia Tolmach
To that end, we propose two approaches for assessing the reliability of data for training LLMs to use external tools.
no code implementations • 7 Sep 2023 • Saeed Khaki, Akhouri Abhinav Aditya, Zohar Karnin, Lan Ma, Olivia Pan, Samarth Marudheri Chandrashekar
Our first step involves encoding a sample of production data as the target distribution, and the model training data as the reference distribution.
no code implementations • 23 May 2022 • Anastasia Razdaibiedina, Ashish Khetan, Zohar Karnin, Daniel Khashabi, Vishaal Kapoor, Vivek Madan
In this paper, we propose Representation Projection Invariance (REPINA), a novel regularization method to maintain the information content of representation and reduce representation collapse during fine-tuning by discouraging undesirable changes in the representations.
no code implementations • ACL 2022 • Xin Huang, Ashish Khetan, Rene Bidart, Zohar Karnin
Transformer-based language models such as BERT have achieved the state-of-the-art performance on various NLP tasks, but are computationally prohibitive.
no code implementations • 26 Nov 2021 • David Nigenda, Zohar Karnin, Muhammad Bilal Zafar, Raghu Ramesha, Alan Tan, Michele Donini, Krishnaram Kenthapadi
With the increasing adoption of machine learning (ML) models and systems in high-stakes settings across different industries, guaranteeing a model's performance after deployment has become crucial.
no code implementations • 23 Jul 2021 • Fred Qin, Vivek Madan, Ujjwal Ratan, Zohar Karnin, Vishaal Kapoor, Parminder Bhatia, Taha Kass-Hout
Clinical text provides essential information to estimate the severity of the sepsis in addition to structured clinical data.
no code implementations • 1 Jan 2021 • Vivek Madan, Ashish Khetan, Zohar Karnin
The need for such a method is clear as it is infeasible to collect a large pre-training corpus for every possible domain.
no code implementations • 1 Jan 2021 • Sheng-Min Shih, Pin-Ju Tien, Zohar Karnin
Our approach effectively selects the baseline as the closest realistic sample belong to the target class, which allows attribution methods to provide true one-vs-one explanations.
no code implementations • 15 Dec 2020 • Piali Das, Valerio Perrone, Nikita Ivkin, Tanya Bansal, Zohar Karnin, Huibin Shen, Iaroslav Shcherbatyi, Yotam Elor, Wilton Wu, Aida Zolic, Thibaut Lienart, Alex Tang, Amr Ahmed, Jean Baptiste Faddoul, Rodolphe Jenatton, Fela Winkelmolen, Philip Gautier, Leo Dirac, Andre Perunicic, Miroslav Miladinovic, Giovanni Zappella, Cédric Archambeau, Matthias Seeger, Bhaskar Dutt, Laurence Rouesnel
AutoML systems provide a black-box solution to machine learning problems by selecting the right way of processing features, choosing an algorithm and tuning the hyperparameters of the entire pipeline.
12 code implementations • 11 Dec 2020 • Xin Huang, Ashish Khetan, Milan Cvitkovic, Zohar Karnin
We propose TabTransformer, a novel deep tabular data modeling architecture for supervised and semi-supervised learning.
1 code implementation • 11 Nov 2020 • Sheng-Min Shih, Pin-Ju Tien, Zohar Karnin
Attribution methods have been shown as promising approaches for identifying key features that led to learned model predictions.
no code implementations • 27 Jul 2020 • Fela Winkelmolen, Nikita Ivkin, H. Furkan Bozkurt, Zohar Karnin
Zero-shot hyperparameter optimization (HPO) is a simple yet effective use of transfer learning for constructing a small list of hyperparameter (HP) configurations that complement each other.
no code implementations • NeurIPS 2020 • Julian Katz-Samuels, Lalit Jain, Zohar Karnin, Kevin Jamieson
This paper proposes near-optimal algorithms for the pure-exploration linear bandit problem in the fixed confidence and fixed budget settings.
no code implementations • 22 May 2020 • Ashish Khetan, Zohar Karnin
The methods that start with a pretrained network either prune channels uniformly across the layers or prune channels based on the basic statistics of the network parameters.
no code implementations • ACL 2020 • Ashish Khetan, Zohar Karnin
In this work we revisit the architecture choices of BERT in efforts to obtain a lighter model.
no code implementations • ICLR 2020 • Shashank Singh, Ashish Khetan, Zohar Karnin
In many learning situations, resources at inference time are significantly more constrained than resources at training time.
1 code implementation • 29 Jun 2019 • Nikita Ivkin, Edo Liberty, Kevin Lang, Zohar Karnin, Vladimir Braverman
Approximating quantiles and distributions over streaming data has been studied for roughly two decades now.
no code implementations • 22 Jun 2019 • Nick Ryder, Zohar Karnin, Edo Liberty
In many applications the data set to be projected is given to us in advance, yet the current RP techniques do not make use of information about the data.
no code implementations • 11 Jun 2019 • Zohar Karnin, Edo Liberty
We provide general techniques for bounding the class discrepancy of machine learning problems.
no code implementations • 20 May 2019 • Shashank Singh, Ashish Khetan, Zohar Karnin
In many learning situations, resources at inference time are significantly more constrained than resources at training time.
no code implementations • ICML 2017 • Satyen Kale, Zohar Karnin, Tengyuan Liang, Dávid Pál
Online sparse linear regression is an online problem where an algorithm repeatedly chooses a subset of coordinates to observe in an adversarially chosen feature vector, makes a real-valued prediction, receives the true label, and incurs the squared loss.
no code implementations • 5 Jul 2016 • Yahel David, Dotan Di Castro, Zohar Karnin
Our optimization problem is formulated as an MDP where the action space is of a combinatorial nature as we recommend in each round, multiple items.
no code implementations • 29 Jun 2016 • Mihajlo Grbovic, Guy Halawi, Zohar Karnin, Yoelle Maarek
We report how we have discovered that a set of 6 "latent" categories (one for human- and the others for machine-generated messages) can explain a significant portion of email traffic.
2 code implementations • 17 Mar 2016 • Zohar Karnin, Kevin Lang, Edo Liberty
One of our contributions is a novel representation and modification of the widely used merge-and-reduce construction.
Data Structures and Algorithms
no code implementations • NeurIPS 2015 • Masrour Zoghi, Zohar Karnin, Shimon Whiteson, Maarten de Rijke
A version of the dueling bandit problem is addressed in which a Condorcet winner may not exist.
no code implementations • 10 Jun 2014 • Oren Anava, Shahar Golan, Nadav Golbandi, Zohar Karnin, Ronny Lempel, Oleg Rokhlenko, Oren Somekh
It is well known that collaborative filtering (CF) based recommender systems provide better modeling of users and items associated with considerable rating history.
no code implementations • 14 May 2014 • Nir Ailon, Thorsten Joachims, Zohar Karnin
We present algorithms for reducing the Dueling Bandits problem to the conventional (stochastic) Multi-Armed Bandits problem.
no code implementations • 21 Dec 2013 • Elad Hazan, Zohar Karnin, Raghu Mehka
Numerous machine learning problems require an exploration basis - a mechanism to explore the action space.
no code implementations • NeurIPS 2013 • Dimitris Achlioptas, Zohar Karnin, Edo Liberty
We consider the problem of selecting non-zero entries of a matrix $A$ in order to produce a sparse sketch of it, $B$, that minimizes $\|A-B\|_2$.
no code implementations • NeurIPS 2013 • Eshcar Hillel, Zohar Karnin, Tomer Koren, Ronny Lempel, Oren Somekh
That is, distributing learning to $k$ players gives rise to a factor $\sqrt{k}$ parallel speed-up.
no code implementations • NeurIPS 2012 • Elad Hazan, Zohar Karnin
We present a simplex algorithm for linear programming in a linear classification formulation.