2 code implementations • 30 Mar 2023 • Nicholas Meisburger, Vihan Lakshman, Benito Geordie, Joshua Engels, David Torres Ramos, Pratik Pranav, Benjamin Coleman, Benjamin Meisburger, Shubh Gupta, Yashwanth Adunukota, Tharun Medini, Anshumali Shrivastava
Efficient large-scale neural network training and inference on commodity CPU hardware is of immense practical significance in democratizing deep learning (DL) capabilities.
Ranked #2 on Node Classification on Yelp-Fraud
no code implementations • 29 Jan 2022 • Minghao Yan, Nicholas Meisburger, Tharun Medini, Anshumali Shrivastava
We show that, with communication reduced by sparsity, we can train a model with close to a billion parameters on simple 4-16 core CPU nodes connected by a basic low-bandwidth interconnect.
no code implementations • 17 Mar 2021 • Gaurav Gupta, Tharun Medini, Anshumali Shrivastava, Alexander J Smola
Neural models have transformed the fundamental information retrieval problem of mapping a query to relevant items within a giant collection.
no code implementations • 1 Jan 2021 • Shabnam Daghaghi, Tharun Medini, Beidi Chen, Mengnan Zhao, Anshumali Shrivastava
Softmax classifiers with a very large number of classes naturally occur in many applications such as natural language processing and information retrieval.
no code implementations • 31 Dec 2020 • Shabnam Daghaghi, Tharun Medini, Nicholas Meisburger, Beidi Chen, Mengnan Zhao, Anshumali Shrivastava
Unfortunately, due to the dynamically updated parameters and data samples, there is no sampling scheme that is provably adaptive and samples the negative classes efficiently.
no code implementations • ICLR 2021 • Tharun Medini, Beidi Chen, Anshumali Shrivastava
The label vectors are random, sparse, and near-orthogonal by design, while the query vectors are learned and sparse.
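As a minimal sketch of how such label vectors might be constructed (the dimension, sparsity level, and dict-based sparse representation below are illustrative assumptions, not details from the paper): drawing a handful of random ±1 coordinates in a high-dimensional space makes any two label vectors overlap in few, usually zero, coordinates, so their inner product is tiny relative to their norms.

```python
import random

def random_sparse_label_vector(dim=10000, nnz=32, seed=None):
    """One label vector: nnz random coordinates set to +/-1, the rest zero.
    Stored as a sparse dict {coordinate: value}.  In high dimension, two
    such vectors rarely share coordinates, so they are near-orthogonal
    by construction."""
    rng = random.Random(seed)
    coords = rng.sample(range(dim), nnz)
    return {c: rng.choice((-1.0, 1.0)) for c in coords}

def inner(u, v):
    """Inner product of two sparse dict vectors."""
    return sum(val * v.get(c, 0.0) for c, val in u.items())
```

With dim=10000 and nnz=32, the expected coordinate overlap between two independent vectors is about 32*32/10000, i.e. roughly 0.1, so |inner(u, v)| is negligible next to inner(u, u) = 32.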
1 code implementation • NeurIPS 2019 • Tharun Medini, Qixuan Huang, Yiqiu Wang, Vijai Mohan, Anshumali Shrivastava
Our largest model has 6.4 billion parameters and trains in less than 35 hours on a single p3.16x machine.
1 code implementation • 10 Oct 2019 • Gaurav Gupta, Minghao Yan, Benjamin Coleman, Bryce Kille, R. A. Leo Elworth, Tharun Medini, Todd Treangen, Anshumali Shrivastava
Interestingly, it is a count-min-sketch-style arrangement of a membership-testing utility (a Bloom filter, in our case).
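A minimal sketch of the idea described above, assuming nothing beyond the one-sentence summary: R independent rows each hash set identifiers into B groups, each group backed by a Bloom filter; a query is attributed to a set only if every row's filter agrees, which suppresses false positives much like taking the minimum does in a count-min sketch. All sizes, hash choices, and class names here are illustrative, not the paper's.

```python
import hashlib

class BloomFilter:
    """Minimal Bloom filter: k hash functions over an m-bit array."""
    def __init__(self, m=1024, k=3):
        self.m, self.k = m, k
        self.bits = [False] * m

    def _hashes(self, item):
        # Derive k independent positions by salting a sha256 digest.
        for i in range(self.k):
            h = hashlib.sha256(f"{i}:{item}".encode()).hexdigest()
            yield int(h, 16) % self.m

    def add(self, item):
        for h in self._hashes(item):
            self.bits[h] = True

    def __contains__(self, item):
        return all(self.bits[h] for h in self._hashes(item))

class BloomSketch:
    """Count-min-sketch-style arrangement: R rows, each partitioning
    set identifiers into B groups, with one Bloom filter per group.
    An item is reported as a member of a set only if the set's group
    filter is positive in *every* row (an intersection, analogous to
    the min in a count-min sketch)."""
    def __init__(self, B=4, R=3):
        self.B, self.R = B, R
        self.tables = [[BloomFilter() for _ in range(B)] for _ in range(R)]

    def _group(self, set_id, row):
        h = hashlib.sha256(f"{row}:{set_id}".encode()).hexdigest()
        return int(h, 16) % self.B

    def insert(self, set_id, item):
        for r in range(self.R):
            self.tables[r][self._group(set_id, r)].add(item)

    def might_contain(self, set_id, item):
        return all(item in self.tables[r][self._group(set_id, r)]
                   for r in range(self.R))
```

The memory cost grows with B*R filters rather than one filter per set, which is the point of the count-min-style sharing.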
1 code implementation • 7 Oct 2019 • Gaurav Gupta, Benjamin Coleman, Tharun Medini, Vijai Mohan, Anshumali Shrivastava
A simple array of Bloom Filters can achieve that.
no code implementations • 10 Sep 2019 • Shabnam Daghaghi, Tharun Medini, Anshumali Shrivastava
Zero-Shot Learning (ZSL) is a classification task in which we do not have even a single labeled training example for a set of unseen classes.
3 code implementations • 7 Mar 2019 • Beidi Chen, Tharun Medini, James Farwell, Sameh Gobriel, Charlie Tai, Anshumali Shrivastava
On the same CPU hardware, SLIDE is over 10x faster than TensorFlow (TF).
no code implementations • 9 Oct 2018 • Qixuan Huang, Yiqiu Wang, Tharun Medini, Anshumali Shrivastava
With MACH, we can train on the ODP dataset, with 100,000 classes and 400,000 features, on a single Titan X GPU, reaching a classification accuracy of 19.28%, the best reported accuracy on this dataset.
no code implementations • 27 Sep 2018 • Tharun Medini, Anshumali Shrivastava
Imitation Learning is the task of mimicking the behavior of an expert player in a Reinforcement Learning (RL) environment to enhance the training of a fresh agent (called the novice) that begins from scratch.