Search Results for author: Chenyou Fan

Found 23 papers, 9 papers with code

Heterogeneous Memory Enhanced Multimodal Attention Model for Video Question Answering

1 code implementation • CVPR 2019 • Chenyou Fan, Xiaofan Zhang, Shu Zhang, Wensheng Wang, Chi Zhang, Heng Huang

In this paper, we propose a novel end-to-end trainable Video Question Answering (VideoQA) framework with three major components: 1) a new heterogeneous memory which can effectively learn global context information from appearance and motion features; 2) a redesigned question memory which helps understand the complex semantics of question and highlights queried subjects; and 3) a new multimodal fusion layer which performs multi-step reasoning by attending to relevant visual and textual hints with self-updated attention.

Ranked #27 on Visual Question Answering (VQA) on MSRVTT-QA

Question Answering Video Question Answering +1

Paper
Code

Boosting Light-Weight Depth Estimation Via Knowledge Distillation

2 code implementations • 13 May 2021 • Junjie Hu, Chenyou Fan, Hualie Jiang, Xiyue Guo, Yuan Gao, Xiangyong Lu, Tin Lun Lam

However, this KD process can be challenging and insufficient due to the large model capacity gap between the teacher and the student.

Computational Efficiency Knowledge Distillation +1

Paper
Code

Learning Latent Sub-events in Activity Videos Using Temporal Attention Filters

1 code implementation • 26 May 2016 • AJ Piergiovanni, Chenyou Fan, Michael S. Ryoo

In this paper, we newly introduce the concept of temporal attention filters, and describe how they can be used for human activity recognition from videos.

Ranked #1 on Activity Recognition In Videos on DogCentric

Action Classification Action Recognition In Videos +2

Paper
Code

Progressive Self-Distillation for Ground-to-Aerial Perception Knowledge Transfer

1 code implementation • 29 Aug 2022 • Junjie Hu, Chenyou Fan, Mete Ozay, Hua Feng, Yuan Gao, Tin Lun Lam

In this paper, we introduce the ground-to-aerial perception knowledge transfer and propose a progressive semi-supervised learning framework that enables drone perception using only labeled data of ground viewpoint and unlabeled data of flying viewpoints.

Autonomous Driving Knowledge Distillation +1

Paper
Code

Lifelong-MonoDepth: Lifelong Learning for Multi-Domain Monocular Metric Depth Estimation

1 code implementation • 9 Mar 2023 • Junjie Hu, Chenyou Fan, Liguang Zhou, Qing Gao, Honghai Liu, Tin Lun Lam

With the rapid advancements in autonomous driving and robot navigation, there is a growing demand for lifelong learning models capable of estimating metric (absolute) depth.

Autonomous Driving Depth Prediction +2

Paper
Code

DeepDiary: Automatic Caption Generation for Lifelogging Image Streams

1 code implementation • 12 Aug 2016 • Chenyou Fan, David J. Crandall

Lifelogging cameras capture everyday life from a first-person perspective, but generate so much data that it is hard for users to browse and organize their image collections effectively.

Caption Generation Image Captioning +2

Paper
Code

Improved Sample Complexity for Stochastic Compositional Variance Reduced Gradient

1 code implementation • 1 Jun 2018 • Tianyi Lin, Chenyou Fan, Mengdi Wang, Michael. I. Jordan

Convex composition optimization is an emerging topic that covers a wide range of applications arising from stochastic optimal control, reinforcement learning and multi-stage stochastic programming.

reinforcement-learning Reinforcement Learning (RL)

Paper
Code

Where to Attack: A Dynamic Locator Model for Backdoor Attack in Text Classifications

1 code implementation • COLING 2022 • Heng-yang Lu, Chenyou Fan, Jun Yang, Cong Hu, Wei Fang, Xiao-Jun Wu

Based on the predicted P2P, four effective strategies are introduced to show the BDA performance.

Backdoor Attack

Paper
Code

Improved Oracle Complexity of Variance Reduced Methods for Nonsmooth Convex Stochastic Composition Optimization

no code implementations • 7 Feb 2018 • Tianyi Lin, Chenyou Fan, Mengdi Wang

We consider the nonsmooth convex composition optimization problem where the objective is a composition of two finite-sum functions and analyze stochastic compositional variance reduced gradient (SCVRG) methods for them.

Paper
Add Code

Joint Person Segmentation and Identification in Synchronized First- and Third-person Videos

no code implementations • ECCV 2018 • Mingze Xu, Chenyou Fan, Yuchen Wang, Michael S. Ryoo, David J. Crandall

In this paper, we wish to solve two specific problems: (1) given two or more synchronized third-person videos of a scene, produce a pixel-level segmentation of each visible person and identify corresponding people across different views (i. e., determine who in camera A corresponds with whom in camera B), and (2) given one or more synchronized third-person videos as well as a first-person video taken by a mobile or wearable camera, segment and identify the camera wearer in the third-person videos.

Segmentation

Paper
Add Code

Multi-Task Spatiotemporal Neural Networks for Structured Surface Reconstruction

1 code implementation • 11 Jan 2018 • Mingze Xu, Chenyou Fan, John D Paden, Geoffrey C. Fox, David J. Crandall

Deep learning methods have surpassed the performance of traditional techniques on a wide range of problems in computer vision, but nearly all of this work has studied consumer photos, where precisely correct output is often not critical.

Structured Prediction Surface Reconstruction

Paper
Code

Forecasting Hands and Objects in Future Frames

no code implementations • 20 May 2017 • Chenyou Fan, JangWon Lee, Michael S. Ryoo

The key idea is that (1) an intermediate representation of a convolutional object recognition model abstracts scene information in its frame and that (2) we can predict (i. e., regress) such representations corresponding to the future frames based on that of the current frame.

Object object-detection +2

Paper
Add Code

Identifying First-person Camera Wearers in Third-person Videos

no code implementations • CVPR 2017 • Chenyou Fan, Jang-Won Lee, Mingze Xu, Krishna Kumar Singh, Yong Jae Lee, David J. Crandall, Michael S. Ryoo

We consider scenarios in which we wish to perform joint scene understanding, object tracking, activity recognition, and other tasks in environments in which multiple people are wearing body-worn cameras while a third-person static camera also captures the scene.

Activity Recognition Object Tracking +1

Paper
Add Code

Federated Generative Adversarial Learning

no code implementations • 7 May 2020 • Chenyou Fan, Ping Liu

This work studies training generative adversarial networks under the federated learning setting.

Federated Learning Style Transfer

Paper
Add Code

Projection Robust Wasserstein Distance and Riemannian Optimization

no code implementations • NeurIPS 2020 • Tianyi Lin, Chenyou Fan, Nhat Ho, Marco Cuturi, Michael. I. Jordan

Projection robust Wasserstein (PRW) distance, or Wasserstein projection pursuit (WPP), is a robust variant of the Wasserstein distance.

Riemannian optimization

Paper
Add Code

Federated Few-Shot Learning with Adversarial Learning

no code implementations • 1 Apr 2021 • Chenyou Fan, Jianwei Huang

In this paper, we propose a federated few-shot learning (FedFSL) framework to learn a few-shot classification model that can classify unseen data classes with only a few labeled samples.

Federated Learning Few-Shot Learning

Paper
Add Code

Learn2Agree: Fitting with Multiple Annotators without Objective Ground Truth

no code implementations • 8 Sep 2021 • Chongyang Wang, Yuan Gao, Chenyou Fan, Junjie Hu, Tin Lun Lam, Nicholas D. Lane, Nadia Bianchi-Berthouze

For such issues, we propose a novel Learning to Agreement (Learn2Agree) framework to tackle the challenge of learning from multiple annotators without objective ground truth.

Paper
Add Code

Deep Depth Completion from Extremely Sparse Data: A Survey

no code implementations • 11 May 2022 • Junjie Hu, Chenyu Bao, Mete Ozay, Chenyou Fan, Qing Gao, Honghai Liu, Tin Lun Lam

Depth completion aims at predicting dense pixel-wise depth from an extremely sparse map captured from a depth sensor, e. g., LiDARs.

3D Reconstruction Autonomous Driving +2

Paper
Add Code

Dense Depth Distillation with Out-of-Distribution Simulated Images

no code implementations • 26 Aug 2022 • Junjie Hu, Chenyou Fan, Mete Ozay, Hualie Jiang, Tin Lun Lam

We study data-free knowledge distillation (KD) for monocular depth estimation (MDE), which learns a lightweight model for real-world depth perception tasks by compressing it from a trained teacher model while lacking training data in the target domain.

Data-free Knowledge Distillation Image Classification +1

Paper
Add Code

Federated Prompting and Chain-of-Thought Reasoning for Improving LLMs Answering

no code implementations • 27 Apr 2023 • Xiangyang Liu, Tianqi Pang, Chenyou Fan

Due to the unsatisfactory accuracy of LLMs' zero-shot prompting with standalone questions, we propose to improve the distributed synonymous questions using Self-Consistency (SC) and Chain-of-Thought (CoT) techniques.

Mathematical Reasoning

Paper
Add Code

Towards Applying Powerful Large AI Models in Classroom Teaching: Opportunities, Challenges and Prospects

no code implementations • 5 May 2023 • Kehui Tan, Tianqi Pang, Chenyou Fan, Song Yu

This perspective paper proposes a series of interactive scenarios that utilize Artificial Intelligence (AI) to enhance classroom teaching, such as dialogue auto-completion, knowledge and style transfer, and assessment of AI-generated content.

Style Transfer

Paper
Add Code

Carbon Price Forecasting with Quantile Regression and Feature Selection

no code implementations • 5 May 2023 • Tianqi Pang, Kehui Tan, Chenyou Fan

Carbon futures has recently emerged as a novel financial asset in the trading markets such as the European Union and China.

feature selection regression

Paper
Add Code

Sci-CoT: Leveraging Large Language Models for Enhanced Knowledge Distillation in Small Models for Scientific QA

no code implementations • 9 Aug 2023 • Yuhan Ma, Haiqi Jiang, Chenyou Fan

Large Language Models (LLMs) have shown outstanding performance across wide range of downstream tasks.

Knowledge Distillation Question Answering

Paper
Add Code

Cannot find the paper you are looking for? You can Submit a new open access paper.