no code implementations • 4 Dec 2024 • Mahtab Bigverdi, Zelun Luo, Cheng-Yu Hsieh, Ethan Shen, Dongping Chen, Linda G. Shapiro, Ranjay Krishna
For example, in a depth-related task, an MLM augmented with perception tokens can reason by generating a depth map as tokens, enabling it to solve the problem effectively.
1 code implementation • 3 Jun 2024 • Zane Durante, Robathan Harries, Edward Vendrow, Zelun Luo, Yuta Kyuragi, Kazuki Kozuka, Li Fei-Fei, Ehsan Adeli
Understanding Activities of Daily Living (ADLs) is a crucial step for different applications including assistive robots, smart homes, and healthcare.
Ranked #1 on Few Shot Action Recognition on MOMA-LRG (using extra training data)
Few Shot Action Recognition Fine-Grained Image Classification +2
no code implementations • 27 Jun 2023 • Zelun Luo, Yuliang Zou, Yijin Yang, Zane Durante, De-An Huang, Zhiding Yu, Chaowei Xiao, Li Fei-Fei, Animashree Anandkumar
In recent years, differential privacy has seen significant advancements in image classification; however, its application to video activity recognition remains under-explored.
1 code implementation • NeurIPS 2022 • Zelun Luo, Zane Durante, Linden Li, Wanze Xie, Ruochen Liu, Emily Jin, Zhuoyi Huang, Lun Yu Li, Jiajun Wu, Juan Carlos Niebles, Ehsan Adeli, Fei-Fei Li
Video-language models (VLMs), large models pre-trained on numerous but noisy video-text pairs from the internet, have revolutionized activity recognition through their remarkable generalization and open-vocabulary capabilities.
Ranked #2 on Few Shot Action Recognition on MOMA-LRG (using extra training data)
no code implementations • NeurIPS 2021 • Zelun Luo, Wanze Xie, Siddharth Kapoor, Yiyun Liang, Michael Cooper, Juan Carlos Niebles, Ehsan Adeli, Fei-Fei Li
This paper introduces Activity Parsing as the overarching task of temporal segmentation and classification of activities, sub-activities, atomic actions, along with an instance-level understanding of actors, objects, and their relationships in videos.
no code implementations • CVPR 2021 • Zelun Luo, Daniel J. Wu, Ehsan Adeli, Li Fei-Fei
We propose a novel method for privacy-preserving training of deep neural networks leveraging public, out-domain data.
no code implementations • 1 Dec 2018 • David Xue, Anin Sayana, Evan Darke, Kelly Shen, Jun-Ting Hsieh, Zelun Luo, Li-Jia Li, N. Lance Downing, Arnold Milstein, Li Fei-Fei
As the senior population rapidly increases, it is challenging yet crucial to provide effective long-term care for seniors who live at home or in senior care facilities.
1 code implementation • ECCV 2018 • Yuliang Zou, Zelun Luo, Jia-Bin Huang
We present an unsupervised learning framework for simultaneously training single-view depth prediction and optical flow estimation models using unlabeled video sequences.
no code implementations • NeurIPS 2017 • Zelun Luo, Yuliang Zou, Judy Hoffman, Li F. Fei-Fei
We propose a framework that learns a representation transferable across different domains and tasks in a data efficient manner.
no code implementations • NeurIPS 2017 • Zelun Luo, Yuliang Zou, Judy Hoffman, Li Fei-Fei
We propose a framework that learns a representation transferable across different domains and tasks in a label efficient manner.
1 code implementation • ECCV 2018 • Zelun Luo, Jun-Ting Hsieh, Lu Jiang, Juan Carlos Niebles, Li Fei-Fei
We propose a technique that tackles action detection in multimodal videos under a realistic and challenging condition in which only limited training data and partially observed modalities are available.
no code implementations • 1 Aug 2017 • Albert Haque, Michelle Guo, Alexandre Alahi, Serena Yeung, Zelun Luo, Alisha Rege, Jeffrey Jopling, Lance Downing, William Beninati, Amit Singh, Terry Platchek, Arnold Milstein, Li Fei-Fei
One in twenty-five patients admitted to a hospital will suffer from a hospital acquired infection.
no code implementations • CVPR 2017 • Zelun Luo, Boya Peng, De-An Huang, Alexandre Alahi, Li Fei-Fei
We present an unsupervised representation learning approach that compactly encodes the motion dependencies in videos.
2 code implementations • 23 Mar 2016 • Albert Haque, Boya Peng, Zelun Luo, Alexandre Alahi, Serena Yeung, Li Fei-Fei
We propose a viewpoint invariant model for 3D human pose estimation from a single depth image.
Ranked #4 on Pose Estimation on ITOP top-view