no code implementations • 2 Apr 2025 • Encheng Su, Hu Cao, Alois Knoll
While various segmentation methods for polyps and skin lesions using fully supervised deep learning techniques have been developed, the pixel-level annotation of medical images by doctors is both time-consuming and costly.
no code implementations • 9 Mar 2025 • Rui Song, Chenwei Liang, Yan Xia, Walter Zimmer, Hu Cao, Holger Caesar, Andreas Festag, Alois Knoll
By aggregating and encoding both semantic and temporal deformation features, each Gaussian is equipped with cues for potential deformation compensation within 3D space, facilitating a more precise representation of dynamic scenes.
1 code implementation • 4 Feb 2025 • Xingcheng Zhou, Konstantinos Larintzakis, Hao Guo, Walter Zimmer, MingYu Liu, Hu Cao, Jiajie Zhang, Venkatnarayanan Lakshminarasimhan, Leah Strand, Alois C. Knoll
We present TUMTraffic-VideoQA, a novel dataset and benchmark designed for spatio-temporal video understanding in complex roadside traffic scenarios.
no code implementations • 16 Dec 2024 • Yan Xia, Zhendong Li, Yun-Jin Li, Letian Shi, Hu Cao, João F. Henriques, Daniel Cremers
To date, most place recognition methods focus on single-modality retrieval.
1 code implementation • 19 Jul 2024 • Dai Liu, Jindong Gu, Hu Cao, Carsten Trinitis, Martin Schulz
Dataset Distillation is used to create a concise, yet informative, synthetic dataset that can replace the original dataset for training purposes.
1 code implementation • 17 Jul 2024 • Hu Cao, Zehua Zhang, Yan Xia, Xinyi Li, Jiahao Xia, Guang Chen, Alois Knoll
The core concept is the design of the coarse-to-fine fusion module, denoted as the cross-modality adaptive feature refinement (CAFR) module.
Ranked #1 on
Object Detection
on PKU-DDD17-Car
no code implementations • CVPR 2024 • Rui Song, Chenwei Liang, Hu Cao, Zhiran Yan, Walter Zimmer, Markus Gross, Andreas Festag, Alois Knoll
Additionally, due to the lack of a collaborative perception dataset designed for semantic occupancy prediction, we augment a current collaborative perception dataset to include 3D collaborative semantic occupancy labels for a more robust evaluation.
1 code implementation • 4 Dec 2023 • Christoph Hümmer, Manuel Schwonberg, Liangwei Zhou, Hu Cao, Alois Knoll, Hanno Gottschalk
Moreover, we confirm this observation for object detection on a novel synthetic-to-real benchmark.
Ranked #1 on
Semantic Segmentation
on BDD100K val
1 code implementation • 2 Nov 2023 • Xinyi Li, Zijian Ma, Yinlong Liu, Walter Zimmer, Hu Cao, Feihu Zhang, Alois Knoll
This paper focuses on addressing the robust correspondence-based registration problem with gravity prior that often arises in practice.
1 code implementation • 22 Oct 2023 • Xingcheng Zhou, MingYu Liu, Ekim Yurtsever, Bare Luka Zagar, Walter Zimmer, Hu Cao, Alois C. Knoll
The applications of Vision-Language Models (VLMs) in the field of Autonomous Driving (AD) have attracted widespread attention due to their outstanding performance and the ability to leverage Large Language Models (LLMs).
no code implementations • 19 May 2023 • Xinyi Li, Hu Cao, Yinlong Liu, Xueli Liu, Feihu Zhang, Alois Knoll
Moreover, our method can be adapted to address the challenging problem of simultaneous pose and registration.
no code implementations • 7 Oct 2022 • Boyang Zhang, Suping Wu, Hu Cao, Kehua Ma, Pan Li, Lei Lin
Different from them, our STR aims to learn accurate and natural motion sequences in an unconstrained environment through temporal and spatial tendency and to fully excavate the spatio-temporal features of existing video data.
Ranked #61 on
3D Human Pose Estimation
on MPI-INF-3DHP
1 code implementation • COLING 2022 • Hu Cao, Jingye Li, Fangfang Su, Fei Li, Hao Fei, Shengqiong Wu, Bobo Li, Liang Zhao, Donghong Ji
Event extraction (EE) is an essential task of information extraction, which aims to extract structured event information from unstructured text.
7 code implementations • 12 May 2021 • Hu Cao, Yueyue Wang, Joy Chen, Dongsheng Jiang, Xiaopeng Zhang, Qi Tian, Manning Wang
In the past few years, convolutional neural networks (CNNs) have achieved milestones in medical image analysis.
Ranked #5 on
Medical Image Segmentation
on ACDC
no code implementations • 25 Jan 2021 • Hu Cao, Guang Chen, Zhijun Li, Jianjie Lin, Alois Knoll
Extensive experiments on two public grasping datasets, Cornell and Jacquard demonstrate the state-of-the-art performance of our method in balancing accuracy and inference speed.
Ranked #1 on
Robotic Grasping
on Jacquard dataset
1 code implementation • 28 Apr 2020 • Bin Li, Hu Cao, Zhongnan Qu, Yingbai Hu, Zhenke Wang, Zichen Liang
Based on the Event-Stream dataset, we develop a deep neural network for grasping detection which consider the angle learning problem as classification instead of regression.