Search Results for author: Benjia Zhou

Found 11 papers, 6 papers with code

Factorized Learning Assisted with Large Language Model for Gloss-free Sign Language Translation

no code implementations19 Mar 2024 Zhigang Chen, Benjia Zhou, Jun Li, Jun Wan, Zhen Lei, Ning Jiang, Quan Lu, Guoqing Zhao

Although some approaches work towards gloss-free SLT through jointly training the visual encoder and translation network, these efforts still suffer from poor performance and inefficient use of the powerful Large Language Model (LLM).

Gloss-free Sign Language Translation Language Modelling +3

Multi-stage Factorized Spatio-Temporal Representation for RGB-D Action and Gesture Recognition

1 code implementation23 Aug 2023 Yujun Ma, Benjia Zhou, Ruili Wang, Pichao Wang

RGB-D action and gesture recognition remain an interesting topic in human-centered scene understanding, primarily due to the multiple granularities and large variation in human motion.

Gesture Recognition Scene Understanding

Gloss-free Sign Language Translation: Improving from Visual-Language Pretraining

1 code implementation ICCV 2023 Benjia Zhou, Zhigang Chen, Albert Clapés, Jun Wan, Yanyan Liang, Sergio Escalera, Zhen Lei, Du Zhang

Many previous methods employ an intermediate representation, i. e., gloss sequences, to facilitate SLT, thus transforming it into a two-stage task of sign language recognition (SLR) followed by sign language translation (SLT).

Gloss-free Sign Language Translation Self-Supervised Learning +3

A Unified Multimodal De- and Re-coupling Framework for RGB-D Motion Recognition

1 code implementation16 Nov 2022 Benjia Zhou, Pichao Wang, Jun Wan, Yanyan Liang, Fan Wang

Although improving motion recognition to some extent, these methods still face sub-optimal situations in the following aspects: (i) Data augmentation, i. e., the scale of the RGB-D datasets is still limited, and few efforts have been made to explore novel data augmentation strategies for videos; (ii) Optimization mechanism, i. e., the tightly space-time-entangled network structure brings more challenges to spatiotemporal information modeling; And (iii) cross-modal knowledge fusion, i. e., the high similarity between multimodal representations caused to insufficient late fusion.

Action Recognition Data Augmentation +2

Effective Vision Transformer Training: A Data-Centric Perspective

no code implementations29 Sep 2022 Benjia Zhou, Pichao Wang, Jun Wan, Yanyan Liang, Fan Wang

To achieve these two purposes, we propose a novel data-centric ViT training framework to dynamically measure the ``difficulty'' of training samples and generate ``effective'' samples for models at different training stages.

Regional Attention with Architecture-Rebuilt 3D Network for RGB-D Gesture Recognition

1 code implementation10 Feb 2021 Benjia Zhou, Yunan Li, Jun Wan

Meanwhile, a more adaptive architecture-searched network structure can also perform better than the block-fixed ones like Resnet since it increases the diversity of features in different stages of the network better.

Gesture Recognition Neural Architecture Search

DSAM: A Distance Shrinking with Angular Marginalizing Loss for High Performance Vehicle Re-identificatio

no code implementations12 Nov 2020 Jiangtao Kong, Yu Cheng, Benjia Zhou, Kai Li, Junliang Xing

To obtain a high-performance vehicle ReID model, we present a novel Distance Shrinking with Angular Marginalizing (DSAM) loss function to perform hybrid learning in both the Original Feature Space (OFS) and the Feature Angular Space (FAS) using the local verification and the global identification information.

Person Re-Identification Vehicle Re-Identification

Cross-ethnicity Face Anti-spoofing Recognition Challenge: A Review

no code implementations23 Apr 2020 Ajian Liu, Xuan Li, Jun Wan, Sergio Escalera, Hugo Jair Escalante, Meysam Madadi, Yi Jin, Zhuoyuan Wu, Xiaogang Yu, Zichang Tan, Qi Yuan, Ruikun Yang, Benjia Zhou, Guodong Guo, Stan Z. Li

Although ethnic bias has been verified to severely affect the performance of face recognition systems, it still remains an open research problem in face anti-spoofing.

Face Anti-Spoofing Face Recognition

Cannot find the paper you are looking for? You can Submit a new open access paper.