Search Results for author: Lijin Yang

Found 8 papers, 1 papers with code

EgoExoLearn: A Dataset for Bridging Asynchronous Ego- and Exo-centric View of Procedural Activities in Real World

1 code implementation • 24 Mar 2024 • Yifei HUANG, Guo Chen, Jilan Xu, Mingfang Zhang, Lijin Yang, Baoqi Pei, Hongjie Zhang, Lu Dong, Yali Wang, LiMin Wang, Yu Qiao

Along with the videos we record high-quality gaze data and provide detailed multimodal annotations, formulating a playground for modeling the human ability to bridge asynchronous procedural actions from different viewpoints.

Paper
Code

Weakly Supervised Temporal Sentence Grounding With Uncertainty-Guided Self-Training

no code implementations • CVPR 2023 • Yifei HUANG, Lijin Yang, Yoichi Sato

The task of weakly supervised temporal sentence grounding aims at finding the corresponding temporal moments of a language description in the video, given video-language correspondence only at video-level.

Data Augmentation Sentence +2

Paper
Add Code

DeCo: Decomposition and Reconstruction for Compositional Temporal Grounding via Coarse-To-Fine Contrastive Ranking

no code implementations • CVPR 2023 • Lijin Yang, Quan Kong, Hsuan-Kung Yang, Wadim Kehl, Yoichi Sato, Norimasa Kobori

Compositional temporal grounding is the task of localizing dense action by using known words combined in novel ways in the form of novel query sentences for the actual grounding.

Boundary Detection Sentence

Paper
Add Code

Compound Prototype Matching for Few-shot Action Recognition

no code implementations • 12 Jul 2022 • Yifei HUANG, Lijin Yang, Yoichi Sato

Each global prototype is encouraged to summarize a specific aspect from the entire video, for example, the start/evolution of the action.

Few-Shot action recognition Few Shot Action Recognition +1

Paper
Add Code

Interact Before Align: Leveraging Cross-Modal Knowledge for Domain Adaptive Action Recognition

no code implementations • CVPR 2022 • Lijin Yang, Yifei HUANG, Yusuke Sugano, Yoichi Sato

Different from previous works, we find that the cross-domain alignment can be more effectively done by using cross-modal interaction first.

Action Recognition Temporal Action Localization

Paper
Add Code

Stacked Temporal Attention: Improving First-person Action Recognition by Emphasizing Discriminative Clips

no code implementations • 2 Dec 2021 • Lijin Yang, Yifei HUANG, Yusuke Sugano, Yoichi Sato

Previous works explored to address this problem by applying temporal attention but failed to consider the global context of the full video, which is critical for determining the relatively significant parts.

Action Recognition Video Understanding

Paper
Add Code

Leveraging Human Selective Attention for Medical Image Analysis with Limited Training Data

no code implementations • 2 Dec 2021 • Yifei HUANG, Xiaoxiao Li, Lijin Yang, Lin Gu, Yingying Zhu, Hirofumi Seo, Qiuming Meng, Tatsuya Harada, Yoichi Sato

Then we design a novel Auxiliary Attention Block (AAB) to allow information from SAN to be utilized by the backbone encoder to focus on selective areas.

Tumor Segmentation

Paper
Add Code

EPIC-KITCHENS-100 Unsupervised Domain Adaptation Challenge for Action Recognition 2021: Team M3EM Technical Report

no code implementations • 18 Jun 2021 • Lijin Yang, Yifei HUANG, Yusuke Sugano, Yoichi Sato

In this report, we describe the technical details of our submission to the 2021 EPIC-KITCHENS-100 Unsupervised Domain Adaptation Challenge for Action Recognition.

Action Recognition Unsupervised Domain Adaptation

Paper
Add Code

Cannot find the paper you are looking for? You can Submit a new open access paper.