Search Results for author: Sachin Goyal

Found 8 papers, 6 papers with code

Scaling Laws for Data Filtering -- Data Curation cannot be Compute Agnostic

2 code implementations10 Apr 2024 Sachin Goyal, Pratyush Maini, Zachary C. Lipton, aditi raghunathan, J. Zico Kolter

Vision-language models (VLMs) are trained for thousands of GPU hours on carefully curated web datasets.

Think before you speak: Training Language Models With Pause Tokens

no code implementations3 Oct 2023 Sachin Goyal, Ziwei Ji, Ankit Singh Rawat, Aditya Krishna Menon, Sanjiv Kumar, Vaishnavh Nagarajan

Language models generate responses by producing a series of tokens in immediate succession: the $(K+1)^{th}$ token is an outcome of manipulating $K$ hidden vectors per layer, one vector per preceding token.

GSM8K Question Answering

T-MARS: Improving Visual Representations by Circumventing Text Feature Learning

1 code implementation6 Jul 2023 Pratyush Maini, Sachin Goyal, Zachary C. Lipton, J. Zico Kolter, aditi raghunathan

However, naively removing all such data could also be wasteful, as it throws away images that contain visual features (in addition to overlapping text).

Optical Character Recognition

Finetune like you pretrain: Improved finetuning of zero-shot vision models

1 code implementation CVPR 2023 Sachin Goyal, Ananya Kumar, Sankalp Garg, Zico Kolter, aditi raghunathan

In total, these benchmarks establish contrastive finetuning as a simple, intuitive, and state-of-the-art approach for supervised finetuning of image-text models like CLIP.

Descriptive Few-Shot Learning +1

Test-Time Adaptation via Conjugate Pseudo-labels

1 code implementation20 Jul 2022 Sachin Goyal, MingJie Sun, aditi raghunathan, Zico Kolter

In this paper, we start by presenting a surprising phenomenon: if we attempt to meta-learn the best possible TTA loss over a wide class of functions, then we recover a function that is remarkably similar to (a temperature-scaled version of) the softmax-entropy employed by TENT.

Meta-Learning Test-time Adaptation

MET: Masked Encoding for Tabular Data

1 code implementation17 Jun 2022 Kushal Majmundar, Sachin Goyal, Praneeth Netrapalli, Prateek Jain

Typical contrastive learning based SSL methods require instance-wise data augmentations which are difficult to design for unstructured tabular data.

Contrastive Learning Representation Learning

PAL : Pretext-based Active Learning

no code implementations29 Oct 2020 Shubhang Bhatnagar, Sachin Goyal, Darshan Tank, Amit Sethi

To counter the paucity of data, we also deploy another head on the scoring network for regularization via multi-task learning and use an unusual self-balancing hybrid scoring function.

Active Learning Multi-Task Learning +2

DROCC: Deep Robust One-Class Classification

1 code implementation ICML 2020 Sachin Goyal, aditi raghunathan, Moksh Jain, Harsha Vardhan Simhadri, Prateek Jain

Classical approaches for one-class problems such as one-class SVM and isolation forest require careful feature engineering when applied to structured domains like images.

Classification Feature Engineering +3

Cannot find the paper you are looking for? You can Submit a new open access paper.