Search Results for author: Joya Chen

Found 11 papers, 7 papers with code

Residual Objectness for Imbalance Reduction

no code implementations • 24 Aug 2019 • Joya Chen, Dong Liu, Bin Luo, Xuezheng Peng, Tong Xu, Enhong Chen

For a long time, object detectors have suffered from extreme imbalance between foregrounds and backgrounds.

Paper
Add Code

Is Heuristic Sampling Necessary in Training Deep Object Detectors?

13 code implementations • 11 Sep 2019 • Joya Chen, Dong Liu, Tong Xu, Shiwei Wu, Yifei Cheng, Enhong Chen

In this paper, we challenge the necessity of such hard/soft sampling methods for training accurate deep object detectors.

General Classification Instance Segmentation +2

9,244

Paper
Code

Long-term Joint Scheduling for Urban Traffic

1 code implementation • 27 Oct 2019 • Xianfeng Liang, Likang Wu, Joya Chen, Yang Liu, Runlong Yu, Min Hou, Han Wu, Yuyang Ye, Qi Liu, Enhong Chen

Recently, the traffic congestion in modern cities has become a growing worry for the residents.

Scheduling

Paper
Code

Foreground-Background Imbalance Problem in Deep Object Detectors: A Review

no code implementations • 16 Jun 2020 • Joya Chen, Qi Wu, Dong Liu, Tong Xu

Recent years have witnessed the remarkable developments made by deep learning techniques for object detection, a fundamentally challenging problem of computer vision.

Object object-detection +1

Paper
Add Code

DropIT: Dropping Intermediate Tensors for Memory-Efficient DNN Training

1 code implementation • 28 Feb 2022 • Joya Chen, Kai Xu, Yuhui Wang, Yifei Cheng, Angela Yao

A standard hardware bottleneck when training deep neural networks is GPU memory.

Instance Segmentation object-detection +2

Paper
Code

AssistQ: Affordance-centric Question-driven Task Completion for Egocentric Assistant

4 code implementations • 8 Mar 2022 • Benita Wong, Joya Chen, You Wu, Stan Weixian Lei, Dongxing Mao, Difei Gao, Mike Zheng Shou

In this paper, we define a new task called Affordance-centric Question-driven Task Completion, where the AI assistant should learn from instructional videos to provide step-by-step help in the user's view.

Visual Question Answering (VQA)

Paper
Code

Affordance Grounding from Demonstration Video to Target Image

1 code implementation • CVPR 2023 • Joya Chen, Difei Gao, Kevin Qinghong Lin, Mike Zheng Shou

Humans excel at learning from expert demonstrations and solving their own problems.

Ranked #1 on Video-to-image Affordance Grounding on EPIC-Hotspot

Video-to-image Affordance Grounding

Paper
Code

AssistGPT: A General Multi-modal Assistant that can Plan, Execute, Inspect, and Learn

no code implementations • 14 Jun 2023 • Difei Gao, Lei Ji, Luowei Zhou, Kevin Qinghong Lin, Joya Chen, Zihan Fan, Mike Zheng Shou

2) Flexible inputs and intermediate results.

Paper
Add Code

UniVTG: Towards Unified Video-Language Temporal Grounding

1 code implementation • ICCV 2023 • Kevin Qinghong Lin, Pengchuan Zhang, Joya Chen, Shraman Pramanick, Difei Gao, Alex Jinpeng Wang, Rui Yan, Mike Zheng Shou

Most methods in this direction develop taskspecific models that are trained with type-specific labels, such as moment retrieval (time interval) and highlight detection (worthiness curve), which limits their abilities to generalize to various VTG tasks and labels.

Ranked #3 on Highlight Detection on QVHighlights (using extra training data)