Search Results for author: Yuanchun Shi

Found 13 papers, 7 papers with code

CogACT: A Foundational Vision-Language-Action Model for Synergizing Cognition and Action in Robotic Manipulation

no code implementations29 Nov 2024 Qixiu Li, Yaobo Liang, Zeyu Wang, Lin Luo, Xi Chen, Mozheng Liao, Fangyun Wei, Yu Deng, Sicheng Xu, Yizhong Zhang, Xiaofan Wang, Bei Liu, Jianlong Fu, Jianmin Bao, Dong Chen, Yuanchun Shi, Jiaolong Yang, Baining Guo

Unlike previous works that directly repurpose VLM for action prediction by simple action quantization, we propose a omponentized VLA architecture that has a specialized action module conditioned on VLM output.

Quantization

Summit Vitals: Multi-Camera and Multi-Signal Biosensing at High Altitudes

1 code implementation28 Sep 2024 Ke Liu, Jiankai Tang, Zhang Jiang, Yuntao Wang, Xiaojing Liu, Dong Li, Yuanchun Shi

Video photoplethysmography (vPPG) is an emerging method for non-invasive and convenient measurement of physiological signals, utilizing two primary approaches: remote video PPG (rPPG) and contact video PPG (cPPG).

SpO2 estimation

PoseAugment: Generative Human Pose Data Augmentation with Physical Plausibility for IMU-based Motion Capture

1 code implementation21 Sep 2024 Zhuojun Li, Chun Yu, Chen Liang, Yuanchun Shi

However, effective data augmentation for IMU-based motion capture is challenging, since it has to capture the physical relations and constraints of the human body, while maintaining the data distribution and quality.

Data Augmentation Diversity

G-VOILA: Gaze-Facilitated Information Querying in Daily Scenarios

1 code implementation13 May 2024 Zeyu Wang, Yuanchun Shi, Yuntao Wang, Yuchen Yao, Kun Yan, YuHan Wang, Lei Ji, Xuhai Xu, Chun Yu

Modern information querying systems are progressively incorporating multimodal inputs like vision and audio.

Natural Language Queries

Time2Stop: Adaptive and Explainable Human-AI Loop for Smartphone Overuse Intervention

no code implementations3 Mar 2024 Adiba Orzikulova, Han Xiao, Zhipeng Li, Yukang Yan, Yuntao Wang, Yuanchun Shi, Marzyeh Ghassemi, Sung-Ju Lee, Anind K Dey, Xuhai "Orson" Xu

Participants preferred the adaptive interventions and rated the system highly on intervention time accuracy, effectiveness, and level of trust.

A Comprehensive Dataset and Automated Pipeline for Nailfold Capillary Analysis

1 code implementation10 Dec 2023 Linxi Zhao, Jiankai Tang, Dongyu Chen, Xiaohong Liu, Yong Zhou, Yuanchun Shi, Guangyu Wang, Yuntao Wang

In this study, we present a pioneering effort in constructing a comprehensive nailfold capillary dataset-321 images, 219 videos from 68 subjects, with clinic reports and expert annotations-that serves as a crucial resource for training deep-learning models.

MindShift: Leveraging Large Language Models for Mental-States-Based Problematic Smartphone Use Intervention

no code implementations28 Sep 2023 Ruolan Wu, Chun Yu, Xiaole Pan, Yujia Liu, Ningning Zhang, Yue Fu, YuHan Wang, Zhi Zheng, Li Chen, Qiaolei Jiang, Xuhai Xu, Yuanchun Shi

We first conducted a Wizard-of-Oz study (N=12) and an interview study (N=10) to summarize the mental states behind problematic smartphone use: boredom, stress, and inertia.

Persuasion Strategies

Modeling the Trade-off of Privacy Preservation and Activity Recognition on Low-Resolution Images

no code implementations18 Mar 2023 Yuntao Wang, Zirui Cheng, Xin Yi, Yan Kong, Xueyang Wang, Xuhai Xu, Yukang Yan, Chun Yu, Shwetak Patel, Yuanchun Shi

Modeling the trade-off of privacy preservation and machine recognition performance can guide future privacy-preserving computer vision systems using low-resolution image sensors.

Activity Recognition Image Super-Resolution +1

GazeReader: Detecting Unknown Word Using Webcam for English as a Second Language (ESL) Learners

no code implementations18 Mar 2023 Jiexin Ding, Bowen Zhao, Yuqi Huang, Yuntao Wang, Yuanchun Shi

Automatic unknown word detection techniques can enable new applications for assisting English as a Second Language (ESL) learners, thus improving their reading experiences.

named-entity-recognition Named Entity Recognition

MMPD: Multi-Domain Mobile Video Physiology Dataset

2 code implementations8 Feb 2023 Jiankai Tang, Kequan Chen, Yuntao Wang, Yuanchun Shi, Shwetak Patel, Daniel McDuff, Xin Liu

Second, most datasets are relatively small and therefore are limited in diversity, both in appearance (e. g., skin tone), behaviors (e. g., motion) and environment (e. g., lighting conditions).

Descriptive Diversity

MMTSA: Multimodal Temporal Segment Attention Network for Efficient Human Activity Recognition

1 code implementation14 Oct 2022 Ziqi Gao, Yuntao Wang, Jianguo Chen, Junliang Xing, Shwetak Patel, Xin Liu, Yuanchun Shi

The efficiency evaluation on an edge device showed that MMTSA achieved significantly better accuracy, lower computational load, and lower inference latency than SOTA methods.

Human Activity Recognition

Revisiting Discrete Soft Actor-Critic

1 code implementation21 Sep 2022 Haibin Zhou, Tong Wei, Zichuan Lin, Junyou Li, Junliang Xing, Yuanchun Shi, Li Shen, Chao Yu, Deheng Ye

We study the adaption of Soft Actor-Critic (SAC), which is considered as a state-of-the-art reinforcement learning (RL) algorithm, from continuous action space to discrete action space.

Atari Games Q-Learning +1

Cannot find the paper you are looking for? You can Submit a new open access paper.