1 code implementation • 2 Dec 2024 • Yuxuan Zhou, Xingxing Li, Shengyu Li, Chunxi Xia, Xuanbin Wang, Shaoquan Feng
For high-level geo-spatial applications and intelligent robotics, accurate global pose information is of crucial importance.
no code implementations • 22 Sep 2024 • Yuxuan Zhou, Xien Liu, Chen Ning, Xiao Zhang, Ji Wu
Finally, these produced predicate variants are converted into textual language, resulting in a series of reliable and diverse test samples to evaluate whether LLMs fully master the given medical factual knowledge point.
1 code implementation • 24 Aug 2024 • Yuxuan Zhou, Margret Keuper, Mario Fritz
Sampling-based decoding strategies have been widely adopted for Large Language Models (LLMs) in numerous applications, which target a balance between diversity and quality via temperature tuning and tail truncation (e. g., top-k and top-p sampling).
1 code implementation • 27 Jun 2024 • Heng Li, Minghan Li, Zhi-Qi Cheng, Yifei Dong, Yuxuan Zhou, Jun-Yan He, Qi Dai, Teruko Mitamura, Alexander G. Hauptmann
Vision-and-Language Navigation (VLN) aims to develop embodied agents that navigate based on human instructions.
1 code implementation • 5 Jun 2024 • Yuxuan Zhou, Xien Liu, Chen Ning, Ji Wu
In this paper, we aim to explore the causes of this gap by employing a multifaceted examination schema to systematically probe the actual mastery of medical knowledge by current LLMs.
1 code implementation • 3 Jun 2024 • Yuxuan Zhou, Mario Fritz, Margret Keuper
Yet, as a smooth approximation to the Argmax function, a significant amount of probability mass is distributed to other, residual entries, leading to poor interpretability and noise.
1 code implementation • 20 Mar 2024 • Yuxuan Zhou, Xingxing Li, Shengyu Li, Xuanbin Wang, Shaoquan Feng, Yuxuan Tan
Visual simultaneous localization and mapping (VSLAM) has broad applications, with state-of-the-art methods leveraging deep neural networks for better robustness and applicability.
1 code implementation • CVPR 2024 • Yuxuan Zhou, Xudong Yan, Zhi-Qi Cheng, Yan Yan, Qi Dai, Xian-Sheng Hua
To remedy this we propose a two-fold strategy: (1) We introduce an innovative approach that encodes bone connectivity by harnessing the power of graph distances to describe the physical topology; we further incorporate action-specific topological representation via persistent homology analysis to depict systemic dynamics.
Ranked #6 on Skeleton Based Action Recognition on NTU RGB+D 120
no code implementations • 2023 2023 • Yaqin Ye, Yue Xiao, Yuxuan Zhou, Shengwen Li, Yuanfei Zang, Yixuan Zhang
Second, we designed a dynamic graph adjustment module to update the adjacency matrix used in each training step.
1 code implementation • CVPR 2024 • Wenqian Zhang, Molin Huang, Yuxuan Zhou, Juze Zhang, Jingyi Yu, Jingya Wang, Lan Xu
We further provide a strong baseline method, BOTH2Hands, for the novel task: generating vivid two-hand motions from both implicit body dynamics and explicit text prompts.
no code implementations • 22 Nov 2023 • Yuxuan Zhou, Liangcai Gao, Zhi Tang, Baole Wei
Scene Text Image Super-Resolution (STISR) aims to enhance the resolution and legibility of text within low-resolution (LR) images, consequently elevating recognition accuracy in Scene Text Recognition (STR).
2 code implementations • 20 Sep 2023 • Shengbin Yue, Wei Chen, Siyuan Wang, Bingxuan Li, Chenchen Shen, Shujun Liu, Yuxuan Zhou, Yao Xiao, Song Yun, Xuanjing Huang, Zhongyu Wei
We propose DISC-LawLLM, an intelligent legal system utilizing large language models (LLMs) to provide a wide range of legal services.
1 code implementation • 21 Aug 2023 • Katharina Prasse, Steffen Jung, Yuxuan Zhou, Margret Keuper
We propose a method specifically designed for hand action recognition which uses relative angular embeddings and local Spherical Harmonics to create novel hand representations.
1 code implementation • 2 Jun 2023 • Yuxuan Zhou, Ziyu Jin, Meiwei Li, Miao Li, Xien Liu, Xinxin You, Ji Wu
The NLI4CT task aims to entail hypotheses based on Clinical Trial Reports (CTRs) and retrieve the corresponding evidence supporting the justification.
1 code implementation • 19 May 2023 • Yuxuan Zhou, Zhi-Qi Cheng, Jun-Yan He, Bin Luo, Yifeng Geng, Xuansong Xie
As a remedy, we propose a threefold strategy: (1) We forge an innovative pathway that encodes bone connectivity by harnessing the power of graph distances.
1 code implementation • 17 Nov 2022 • Yuxuan Zhou, Zhi-Qi Cheng, Chao Li, Yanwen Fang, Yifeng Geng, Xuansong Xie, Margret Keuper
Skeleton-based action recognition aims to recognize human actions given human joint coordinates with skeletal interconnections.
Ranked #12 on Skeleton Based Action Recognition on NTU RGB+D 120
3 code implementations • ICCV 2023 • Wangmeng Xiang, Chao Li, Yuxuan Zhou, Biao Wang, Lei Zhang
More specifically, we employ a pre-trained large-scale language model as the knowledge engine to automatically generate text descriptions for body parts movements of actions, and propose a multi-modal training scheme by utilizing the text encoder to generate feature vectors for different body parts and supervise the skeleton encoder for action representation learning.
Ranked #7 on Skeleton Based Action Recognition on N-UCLA
1 code implementation • 15 Jun 2022 • Yuxuan Zhou, Wangmeng Xiang, Chao Li, Biao Wang, Xihan Wei, Lei Zhang, Margret Keuper, Xiansheng Hua
Unlike convolutional inductive biases, which are forced to focus exclusively on hard-coded local regions, our proposed SPs are learned by the model itself and take a variety of spatial relations into account.
Ranked #162 on Image Classification on ImageNet
1 code implementation • Findings (ACL) 2022 • Yuxuan Zhou, Xien Liu, Kaiyin Zhou, Ji Wu
The table-based fact verification task has recently gained widespread attention and yet remains to be a very challenging problem.
no code implementations • SEMEVAL 2021 • Yuxuan Zhou, Kaiyin Zhou, Xien Liu, Ji Wu, Xiaodan Zhu
This paper describes our system for verifying statements with tables at SemEval-2021 Task 9.
1 code implementation • NeurIPS 2020 • Robin Tibor Schirrmeister, Yuxuan Zhou, Tonio Ball, Dan Zhang
We refine previous investigations of this failure at anomaly detection for invertible generative networks and provide a clear explanation of it as a combination of model bias and domain prior: Convolutional networks learn similar low-level feature distributions when trained on any natural image dataset and these low-level features dominate the likelihood.