Search Results for author: Kayvon Fatahalian

Found 22 papers, 12 papers with code

Block and Detail: Scaffolding Sketch-to-Image Generation

no code implementations • 28 Feb 2024 • Vishnu Sarukkai, Lu Yuan, Mia Tang, Maneesh Agrawala, Kayvon Fatahalian

Our tool lets users sketch blocking strokes to coarsely represent the placement and form of objects and detail strokes to refine their shape and silhouettes.

Blocking Image Generation

Paper
Add Code

Learning Subject-Aware Cropping by Outpainting Professional Photos

no code implementations • 19 Dec 2023 • James Hong, Lu Yuan, Michaël Gharbi, Matthew Fisher, Kayvon Fatahalian

How to frame (or crop) a photo often depends on the image subject and its context; e. g., a human portrait.

Image Cropping

Paper
Add Code

Iterative Motion Editing with Natural Language

no code implementations • 15 Dec 2023 • Purvi Goel, Kuan-Chieh Wang, C. Karen Liu, Kayvon Fatahalian

Text-to-motion diffusion models can generate realistic animations from text prompts, but do not support fine-grained motion editing controls.

Paper
Add Code

Collage Diffusion

no code implementations • 1 Mar 2023 • Vishnu Sarukkai, Linden Li, Arden Ma, Christopher Ré, Kayvon Fatahalian

We seek to give users precise control over diffusion-based image generation by modeling complex scenes as sequences of layers, which define the desired spatial arrangement and visual attributes of objects in the scene.

Conditional Image Generation Image Harmonization

Paper
Add Code

Spotting Temporally Precise, Fine-Grained Events in Video

2 code implementations • 20 Jul 2022 • James Hong, Haotian Zhang, Michaël Gharbi, Matthew Fisher, Kayvon Fatahalian

We introduce the task of spotting temporally precise, fine-grained events in video (detecting the precise moment in time events occur).

Ranked #6 on Action Spotting on SoccerNet-v2

Action Detection Action Spotting +2

Paper
Code

Perfectly Balanced: Improving Transfer and Robustness of Supervised Contrastive Learning

1 code implementation • 15 Apr 2022 • Mayee F. Chen, Daniel Y. Fu, Avanika Narayan, Michael Zhang, Zhao Song, Kayvon Fatahalian, Christopher Ré

We first prove that adding a weighted class-conditional InfoNCE loss to SupCon controls the degree of spread.

Contrastive Learning Data Augmentation

Paper
Code

Shoring Up the Foundations: Fusing Model Embeddings and Weak Supervision

1 code implementation • 24 Mar 2022 • Mayee F. Chen, Daniel Y. Fu, Dyah Adila, Michael Zhang, Frederic Sala, Kayvon Fatahalian, Christopher Ré

Despite the black-box nature of foundation models, we prove results characterizing how our approach improves performance and show that lift scales with the smoothness of label distributions in embedding space.

Paper
Code

The Details Matter: Preventing Class Collapse in Supervised Contrastive Learning

no code implementations • 29 Sep 2021 • Daniel Yang Fu, Mayee F Chen, Michael Zhang, Kayvon Fatahalian, Christopher Ré

Supervised contrastive learning optimizes a loss that pushes together embeddings of points from the same class while pulling apart embeddings of points from different classes.

Contrastive Learning Transfer Learning

Paper
Add Code

Low-Shot Validation: Active Importance Sampling for Estimating Classifier Performance on Rare Categories

no code implementations • ICCV 2021 • Fait Poms, Vishnu Sarukkai, Ravi Teja Mullapudi, Nimit S. Sohoni, William R. Mark, Deva Ramanan, Kayvon Fatahalian

For machine learning models trained with limited labeled training data, validation stands to become the main bottleneck to reducing overall annotation costs.

Paper
Add Code

Video Pose Distillation for Few-Shot, Fine-Grained Sports Action Recognition

1 code implementation • ICCV 2021 • James Hong, Matthew Fisher, Michaël Gharbi, Kayvon Fatahalian

This leads to poor accuracy when downstream tasks, such as action recognition, depend on pose.

Action Understanding Fine-grained Action Recognition +2

Paper
Code

Mandoline: Model Evaluation under Distribution Shift

1 code implementation • 1 Jul 2021 • Mayee Chen, Karan Goel, Nimit S. Sohoni, Fait Poms, Kayvon Fatahalian, Christopher Ré

If an unlabeled sample from the target distribution is available, along with a labeled sample from a possibly different source distribution, standard approaches such as importance weighting can be applied to estimate performance on the target.

Density Ratio Estimation Epidemiology

Paper
Code

Large Batch Simulation for Deep Reinforcement Learning

1 code implementation • ICLR 2021 • Brennan Shacklett, Erik Wijmans, Aleksei Petrenko, Manolis Savva, Dhruv Batra, Vladlen Koltun, Kayvon Fatahalian

We accelerate deep reinforcement learning-based training in visually complex 3D environments by two orders of magnitude over prior work, realizing end-to-end training speeds of over 19, 000 frames of experience per second on a single GPU and up to 72, 000 frames per second on a single eight-GPU machine.

PointGoal Navigation reinforcement-learning +1

Paper
Code

Learning Rare Category Classifiers on a Tight Labeling Budget

no code implementations • ICCV 2021 • Ravi Teja Mullapudi, Fait Poms, William R. Mark, Deva Ramanan, Kayvon Fatahalian

In this paper, we consider the scenario where we start with as-little-as five labeled positives of a rare category and a large amount of unlabeled data of which 99. 9% of it is negatives.

Active Learning Representation Learning

Paper
Add Code

Iterative Text-based Editing of Talking-heads Using Neural Retargeting

no code implementations • 21 Nov 2020 • Xinwei Yao, Ohad Fried, Kayvon Fatahalian, Maneesh Agrawala

We present a text-based tool for editing talking-head video that enables an iterative editing workflow.

Paper
Add Code

Background Splitting: Finding Rare Classes in a Sea of Background

1 code implementation • CVPR 2021 • Ravi Teja Mullapudi, Fait Poms, William R. Mark, Deva Ramanan, Kayvon Fatahalian

We focus on the real-world problem of training accurate deep models for image classification of a small number of rare categories.

Image Classification

Paper
Code

Train and You'll Miss It: Interactive Model Iteration with Weak Supervision and Pre-Trained Embeddings

1 code implementation • 26 Jun 2020 • Mayee F. Chen, Daniel Y. Fu, Frederic Sala, Sen Wu, Ravi Teja Mullapudi, Fait Poms, Kayvon Fatahalian, Christopher Ré

Our goal is to enable machine learning systems to be trained interactively.

Transfer Learning

Paper
Code

Fast and Three-rious: Speeding Up Weak Supervision with Triplet Methods

1 code implementation • ICML 2020 • Daniel Y. Fu, Mayee F. Chen, Frederic Sala, Sarah M. Hooper, Kayvon Fatahalian, Christopher Ré

In this work, we show that, for a class of latent variable models highly applicable to weak supervision, we can find a closed-form solution to model parameters, obviating the need for iterative solutions like stochastic gradient descent (SGD).

309

Paper
Code

Multi-Resolution Weak Supervision for Sequential Data

no code implementations • NeurIPS 2019 • Frederic Sala, Paroma Varma, Jason Fries, Daniel Y. Fu, Shiori Sagawa, Saelig Khattar, Ashwini Ramamoorthy, Ke Xiao, Kayvon Fatahalian, James Priest, Christopher Ré

Multi-resolution sources exacerbate this challenge due to complex correlations and sample complexity that scales in the length of the sequence.

Paper
Add Code

Rekall: Specifying Video Events using Compositions of Spatiotemporal Labels

1 code implementation • 7 Oct 2019 • Daniel Y. Fu, Will Crichton, James Hong, Xinwei Yao, Haotian Zhang, Anh Truong, Avanika Narayan, Maneesh Agrawala, Christopher Ré, Kayvon Fatahalian

Many real-world video analysis applications require the ability to identify domain-specific events in video, such as interviews and commercials in TV news broadcasts, or action sequences in film.

Paper
Code

Online Model Distillation for Efficient Video Inference

1 code implementation • ICCV 2019 • Ravi Teja Mullapudi, Steven Chen, Keyi Zhang, Deva Ramanan, Kayvon Fatahalian

Rather than learn a specialized student model on offline data from the video stream, we train the student in an online fashion on the live video, intermittently running the teacher to provide a target for learning.

Segmentation Semantic Segmentation +2

Paper
Code

HydraNets: Specialized Dynamic Architectures for Efficient Inference

no code implementations • CVPR 2018 • Ravi Teja Mullapudi, William R. Mark, Noam Shazeer, Kayvon Fatahalian

On ImageNet, applying the HydraNet template improves accuracy up to 2. 5% when compared to an efficient baseline architecture with similar inference cost.

Classification Computational Efficiency +2

Paper
Add Code

Scanner: Efficient Video Analysis at Scale

1 code implementation • 18 May 2018 • Alex Poms, Will Crichton, Pat Hanrahan, Kayvon Fatahalian

The challenge is that scaling applications to operate on these datasets requires efficient systems for pixel data access and parallel processing across large numbers of machines.

613

Paper
Code

Cannot find the paper you are looking for? You can Submit a new open access paper.