no code implementations • 28 Nov 2024 • Akhiad Bercovich, Tomer Ronen, Talor Abramovich, Nir Ailon, Nave Assaf, Mohammad Dabbah, Ido Galil, Amnon Geifman, Yonatan Geifman, Izhak Golan, Netanel Haber, Ehud Karpas, Roi Koren, Itay Levy, Pavlo Molchanov, Shahar Mor, Zach Moshe, Najeeb Nabwani, Omri Puny, Ran Rubin, Itamar Schen, Ido Shahaf, Oren Tropp, Omer Ullman Argov, Ran Zilberstein, Ran El-Yaniv
We demonstrate the real-world impact of our framework through Llama-3. 1-Nemotron-51B-Instruct (Nemotron-51B), a publicly available model derived from Llama-3. 1-70B-Instruct.
no code implementations • 10 Oct 2022 • Nir Ailon, Supratim Shit
Recently, neural tangent kernel (NTK) has been used to explain the dynamics of learning parameters of neural networks, at the large width limit.
no code implementations • 1 Jan 2021 • Omer Leibovitch, Nir Ailon
We are given a multi-class prediction problem, combined with a (possibly pre-trained) network architecture for solving it on a given instance distribution, and also a method for pruning the network to allow trading off prediction speed with accuracy.
no code implementations • 17 Jul 2020 • Nir Ailon, Omer Leibovich, Vineet Nair
Motivated by these facts, we propose to replace a dense linear layer in any neural network by an architecture based on the butterfly network.
no code implementations • 21 Nov 2016 • Elad Hoffer, Itay Hubara, Nir Ailon
Convolutional networks have marked their place over the last few years as the best performing model for various visual tasks.
1 code implementation • 4 Nov 2016 • Elad Hoffer, Nir Ailon
Deep networks are successfully used as classification models yielding state-of-the-art results when trained on a large number of labeled samples.
no code implementations • 2 Oct 2016 • Elad Hoffer, Itay Hubara, Nir Ailon
Convolutional networks have marked their place over the last few years as the best performing model for various visual tasks.
3 code implementations • 20 Dec 2014 • Elad Hoffer, Nir Ailon
Deep learning has proven itself as a successful set of models for learning useful semantic representations of data.
no code implementations • 14 May 2014 • Nir Ailon, Thorsten Joachims, Zohar Karnin
We present algorithms for reducing the Dueling Bandits problem to the conventional (stochastic) Multi-Armed Bandits problem.
no code implementations • 5 Dec 2013 • Nir Ailon, Kohei Hatano, Eiji Takimoto
Unfortunately, CombBand requires at each step an $n$-by-$n$ matrix permanent approximation to within improved accuracy as $T$ grows, resulting in a total running time that is super linear in $T$, making it impractical for large time horizons.
no code implementations • 30 Aug 2013 • Nir Ailon
Given a set $V$ of $n$ objects, an online ranking system outputs at each time step a full ranking of the set, observes a feedback of some form and suffers a loss.
no code implementations • NeurIPS 2011 • Nir Ailon
Given a set $V$ of $n$ elements we wish to linearly order them using pairwise preference labels which may be non-transitive (due to irrationality or arbitrary noise).
no code implementations • NeurIPS 2009 • Nir Ailon, Ragesh Jaiswal, Claire Monteleoni
We provide a clustering algorithm that approximately optimizes the k-means objective, in the one-pass streaming setting.