1 code implementation • 17 Feb 2025 • Boxuan Zhu, Sicheng Yang, Zhuo Wang, HaiNing Liang, Junxiao Shen
Gesture recognition in resource-constrained scenarios faces significant challenges in achieving high accuracy and low latency.
no code implementations • 12 Jan 2025 • Wenqi Zhou, Kai Cao, Hao Zheng, Xinyi Zheng, Miao Liu, Per Ola Kristensson, Walterio Mayol-Cuevas, Fan Zhang, Weizhe Lin, Junxiao Shen
Leveraging the advanced text processing capabilities of large language models (LLMs), X-LeBench develops a life-logging simulation pipeline that produces realistic, coherent daily plans aligned with real-world video data.
1 code implementation • 12 Jan 2025 • Xinyi Zheng, Steve Zhang, Weizhe Lin, Aaron Zhang, Walterio W. Mayol-Cuevas, Junxiao Shen
The dataset enables seamless integration with multi-modal data, supporting a range of 3D applications, from architectural reconstruction to virtual tourism.
no code implementations • 19 Nov 2024 • Weizhe Lin, Junxiao Shen
The rapid evolution of artificial intelligence, especially through multi-modal large language models, has redefined user interactions, enabling responses that are contextually rich and human-like.
no code implementations • 1 Nov 2024 • Zihong He, Weizhe Lin, Hao Zheng, Fan Zhang, Matt W. Jones, Laurence Aitchison, Xuhai Xu, Miao Liu, Per Ola Kristensson, Junxiao Shen
With the rapid advancement of AI systems, their abilities to store, retrieve, and utilize information over the long term - referred to as long-term memory - have become increasingly significant.
no code implementations • 8 Oct 2024 • Junxiao Shen, Roger Boldu, Arpit Kalla, Michael Glueck, Hemant Bhaskar Surale Amy Karlson
Secondly, we highlight the superior performance of the Score Fusion framework, which offers a 28. 2% improvement in uncorrected Character Error Rate over a conventional word prediction framework, Naive Correction, leading to a 55. 2% improvement in text entry speed for RingGesture.
no code implementations • 8 Oct 2024 • Junxiao Shen, Khadija Khaldi, Enmin Zhou, Hemant Bhaskar Surale, Amy Karlson
However, these methods are susceptible to decoding inaccuracies for noisy trajectories.
no code implementations • 20 Jan 2024 • Junxiao Shen, Matthias De Lange, Xuhai "Orson" Xu, Enmin Zhou, Ran Tan, Naveen Suda, Maciej Lazarewicz, Per Ola Kristensson, Amy Karlson, Evan Strasnick
Unfortunately, in real-world applications involving gesture recognition, such as gesture recognition based on wrist-worn devices, the data distribution may change over time.
no code implementations • 20 Jan 2024 • Junxiao Shen, Xuhai Xu, Ran Tan, Amy Karlson, Evan Strasnick
We believe that this annotation framework has immense potential to improve the training of downstream gesture recognition models using unlabeled datasets.
no code implementations • 12 Oct 2023 • Junxiao Shen, John J. Dudley, Jingyao Zheng, Bill Byrne, Per Ola Kristensson
However, the task of prompting large language models to specialize in specific text prediction tasks can be challenging, particularly for designers without expertise in prompt engineering.
no code implementations • 10 Aug 2023 • Junxiao Shen, John Dudley, Per Ola Kristensson
To address these challenges, we propose a memory augmentation agent that involves leveraging natural language encoding for video data and storing them in a vector database.
no code implementations • 28 Mar 2023 • Xuhai Xu, Mengjie Yu, Tanya R. Jonker, Kashyap Todi, Feiyu Lu, Xun Qian, João Marcelo Evangelista Belo, Tianyi Wang, Michelle Li, Aran Mun, Te-Yen Wu, Junxiao Shen, Ting Zhang, Narine Kokhlikyan, Fulton Wang, Paul Sorenson, Sophie Kahyun Kim, Hrvoje Benko
The framework was based on a multi-disciplinary literature review of XAI and HCI research, a large-scale survey probing 500+ end-users' preferences for AR-based explanations, and three workshops with 12 experts collecting their insights about XAI design in AR.
no code implementations • ICLR 2022 • Hang Ren, Aivar Sootla, Taher Jafferjee, Junxiao Shen, Jun Wang, Haitham Bou-Ammar
We consider a context-dependent Reinforcement Learning (RL) setting, which is characterized by: a) an unknown finite number of not directly observable contexts; b) abrupt (discontinuous) context changes occurring during an episode; and c) Markovian context evolution.
no code implementations • 27 May 2021 • Junxiao Shen, John Dudley, Per Ola Kristensson
Insufficient training data results in over-fitting and data augmentation is one approach to address this challenge.