1 code implementation • 2 Jan 2025 • Yongle Huang, Haodong Chen, Zhenbang Xu, Zihan Jia, Haozhou Sun, Dian Shao
Experiments show that SeFAR achieves state-of-the-art performance on two FAR datasets, FineGym and FineDiving, across various data scopes.
no code implementations • 1 Jan 2025 • Chuanzhi Xu, Langyi Chen, Vincent Qu, Haodong Chen, Vera Chung
Neuromorphic cameras, also known as event cameras, are asynchronous brightness-change sensors that can capture extremely fast motion without suffering from motion blur, making them particularly promising for 3D reconstruction in extreme environments.
Ranked #1 on Single-View 3D Reconstruction on SynthEVox3D-Tiny
no code implementations • 3 Dec 2024 • Haodong Chen, Lan Wang, Harry Yang, Ser-Nam Lim
On the other hand, when presented with only a text prompt, OmniCreator becomes generative, producing high-quality video as a result of the learned semantic correspondence.
no code implementations • 19 Nov 2024 • Haodong Chen, Runnan Chen, Qiang Qu, Zhaoqing Wang, Tongliang Liu, Xiaoming Chen, Yuk Ying Chung
Recent advancements in 3D Gaussian Splatting (3DGS) have substantially improved novel view synthesis, enabling high-quality reconstruction and real-time rendering.
no code implementations • 14 Nov 2024 • Hui Ye, Haodong Chen, Xiaoming Chen, Vera Chung
Remote sensing (RS) involves the acquisition of data about objects or areas from a distance, primarily to monitor environmental changes, manage resources, and support planning and disaster response.
1 code implementation • 29 Aug 2024 • Kaijing Ma, Haojian Huang, Jin Chen, Haodong Chen, Pengliang Ji, Xianghao Zang, Han Fang, Chao Ban, Hao Sun, Mulin Chen, Xuelong Li
To the best of our knowledge, this marks the first successful attempt to apply DER to VTG.
no code implementations • 22 Jul 2024 • Zeke Zexi Hu, Haodong Chen, Yuk Ying Chung, Xiaoming Chen
This paper presents the Multi-scale Disparity Transformer (MDT), a novel Transformer tailored for light field image super-resolution (LFSR) that addresses the issues of computational redundancy and disparity entanglement caused by the indiscriminate processing of sub-aperture images inherent in conventional methods.
no code implementations • 2 Jul 2024 • Haodong Chen, Haojian Huang, Junhao Dong, Mingzhe Zheng, Dian Shao
Dynamic Facial Expression Recognition (DFER) is crucial for understanding human behavior.
Ranked #1 on Dynamic Facial Expression Recognition on MAFW
1 code implementation • 13 May 2024 • Haodong Chen, Yongle Huang, Haojian Huang, Xiangsheng Ge, Dian Shao
The increasing prominence of e-commerce has underscored the importance of Virtual Try-On (VTON).
1 code implementation • 15 Apr 2024 • Haojian Huang, Xiaozhen Qiao, Zhuo Chen, Haodong Chen, Bingyu Li, Zhe Sun, Mulin Chen, Xuelong Li
Zero-shot learning (ZSL) enables the recognition of novel classes by leveraging semantic knowledge transfer from known to unknown categories.
1 code implementation • 22 Oct 2023 • Yibo Yan, Haomin Wen, Siru Zhong, Wei Chen, Haodong Chen, Qingsong Wen, Roger Zimmermann, Yuxuan Liang
To answer the questions, we leverage the power of Large Language Models (LLMs) and introduce the first-ever LLM-enhanced framework that integrates the knowledge of textual modality into urban imagery profiling, named LLM-enhanced Urban Region Profiling with Contrastive Language-Image Pretraining (UrbanCLIP).
no code implementations • 1 Sep 2023 • Haodong Chen, Vera Chung, Li Tan, Xiaoming Chen
Our preliminary results demonstrate that the proposed method can produce visually distinguishable dense 3D reconstructions directly without requiring pipelines like those used by existing methods.
Ranked #2 on Single-View 3D Reconstruction on SynthEVox3D-Tiny
no code implementations • 15 Aug 2023 • Haodong Chen, Ming C. Leu, Md Moniruzzaman, Zhaozheng Yin, Solmaz Hajmohammadi
Repetitive counting (RepCount) is critical in various applications, such as fitness tracking and rehabilitation.
no code implementations • 20 Dec 2021 • Wenjin Tao, Haodong Chen, Md Moniruzzaman, Ming C. Leu, Zhaozheng Yin, Ruwen Qin
Secondly, an attention-based fusion mechanism is developed to learn the importance of sensors at different body locations and to generate an attentive feature representation.