Search Results for author: Sachin Goyal

Found 8 papers, 6 papers with code

Scaling Laws for Data Filtering -- Data Curation cannot be Compute Agnostic

2 code implementations • 10 Apr 2024 • Sachin Goyal, Pratyush Maini, Zachary C. Lipton, aditi raghunathan, J. Zico Kolter

Vision-language models (VLMs) are trained for thousands of GPU hours on carefully curated web datasets.

144

Paper
Code

Think before you speak: Training Language Models With Pause Tokens

no code implementations • 3 Oct 2023 • Sachin Goyal, Ziwei Ji, Ankit Singh Rawat, Aditya Krishna Menon, Sanjiv Kumar, Vaishnavh Nagarajan

Language models generate responses by producing a series of tokens in immediate succession: the $(K+1)^{th}$ token is an outcome of manipulating $K$ hidden vectors per layer, one vector per preceding token.

GSM8K Question Answering

Paper
Add Code

T-MARS: Improving Visual Representations by Circumventing Text Feature Learning

1 code implementation • 6 Jul 2023 • Pratyush Maini, Sachin Goyal, Zachary C. Lipton, J. Zico Kolter, aditi raghunathan

However, naively removing all such data could also be wasteful, as it throws away images that contain visual features (in addition to overlapping text).

Optical Character Recognition

Paper
Code

Finetune like you pretrain: Improved finetuning of zero-shot vision models

1 code implementation • CVPR 2023 • Sachin Goyal, Ananya Kumar, Sankalp Garg, Zico Kolter, aditi raghunathan

In total, these benchmarks establish contrastive finetuning as a simple, intuitive, and state-of-the-art approach for supervised finetuning of image-text models like CLIP.

Descriptive Few-Shot Learning +1

Paper
Code

Test-Time Adaptation via Conjugate Pseudo-labels

1 code implementation • 20 Jul 2022 • Sachin Goyal, MingJie Sun, aditi raghunathan, Zico Kolter

In this paper, we start by presenting a surprising phenomenon: if we attempt to meta-learn the best possible TTA loss over a wide class of functions, then we recover a function that is remarkably similar to (a temperature-scaled version of) the softmax-entropy employed by TENT.

Meta-Learning Test-time Adaptation

Paper
Code

MET: Masked Encoding for Tabular Data

1 code implementation • 17 Jun 2022 • Kushal Majmundar, Sachin Goyal, Praneeth Netrapalli, Prateek Jain

Typical contrastive learning based SSL methods require instance-wise data augmentations which are difficult to design for unstructured tabular data.

Contrastive Learning Representation Learning

Paper
Code

PAL : Pretext-based Active Learning

no code implementations • 29 Oct 2020 • Shubhang Bhatnagar, Sachin Goyal, Darshan Tank, Amit Sethi

To counter the paucity of data, we also deploy another head on the scoring network for regularization via multi-task learning and use an unusual self-balancing hybrid scoring function.

Active Learning Multi-Task Learning +2

Paper
Add Code

DROCC: Deep Robust One-Class Classification

1 code implementation • ICML 2020 • Sachin Goyal, aditi raghunathan, Moksh Jain, Harsha Vardhan Simhadri, Prateek Jain

Classical approaches for one-class problems such as one-class SVM and isolation forest require careful feature engineering when applied to structured domains like images.

Ranked #3 on Anomaly Detection on UEA time-series datasets

Classification Feature Engineering +3

1,534

Paper
Code

Cannot find the paper you are looking for? You can Submit a new open access paper.