Search Results for author: Zhiyong Li

Found 11 papers, 8 papers with code

EchoTrack: Auditory Referring Multi-Object Tracking for Autonomous Driving

1 code implementation • 28 Feb 2024 • Jiacheng Lin, Jiajun Chen, Kunyu Peng, Xuan He, Zhiyong Li, Rainer Stiefelhagen, Kailun Yang

This paper introduces the task of Auditory Referring Multi-Object Tracking (AR-MOT), which dynamically tracks specific objects in a video sequence based on audio expressions and poses a challenging problem in autonomous driving.

Autonomous Driving, Multi-Object Tracking, +1

LF Tracy: A Unified Single-Pipeline Approach for Salient Object Detection in Light Field Cameras

1 code implementation • 30 Jan 2024 • Fei Teng, Jiaming Zhang, Jiawei Liu, Kunyu Peng, Xina Cheng, Zhiyong Li, Kailun Yang

Previous approaches predominantly employ a custom two-stream design to discover the implicit angular feature within light field cameras, leading to significant information isolation between different LF representations.

Data Augmentation, object-detection, +2

S$^3$-MonoDETR: Supervised Shape&Scale-perceptive Deformable Transformer for Monocular 3D Object Detection

no code implementations • 2 Sep 2023 • Xuan He, Kailun Yang, Junwei Zheng, Jin Yuan, Luis M. Bergasa, Hui Zhang, Zhiyong Li

These methods typically use visual and depth representations to generate query points on objects, whose quality plays a decisive role in the detection accuracy.

Monocular 3D Object Detection, object-detection

Contrast-augmented Diffusion Model with Fine-grained Sequence Alignment for Markup-to-Image Generation

1 code implementation • 2 Aug 2023 • Guojin Zhong, Jin Yuan, Pan Wang, Kailun Yang, Weili Guan, Zhiyong Li

The emerging task of markup-to-image generation poses greater challenges than natural image generation, owing to its low tolerance for errors and the complex sequence and context correlations between markup and rendered image.

Denoising, Image Generation

VPUFormer: Visual Prompt Unified Transformer for Interactive Image Segmentation

1 code implementation • 11 Jun 2023 • Xu Zhang, Kailun Yang, Jiacheng Lin, Jin Yuan, Zhiyong Li, Shutao Li

Specifically, we design a Prompt-unified Encoder (PuE) that uses Gaussian mapping to generate a unified one-dimensional vector for click, box, and scribble prompts, which both captures users' intentions and provides a denser representation of user prompts.

Image Segmentation, Segmentation, +1

SSD-MonoDETR: Supervised Scale-aware Deformable Transformer for Monocular 3D Object Detection

1 code implementation • 12 May 2023 • Xuan He, Fan Yang, Kailun Yang, Jiacheng Lin, Haolong Fu, Meng Wang, Jin Yuan, Zhiyong Li

To tackle this problem, this paper proposes a novel "Supervised Scale-aware Deformable Attention" (SSDA) for monocular 3D object detection.

Monocular 3D Object Detection, Object, +1

Bi-Mapper: Holistic BEV Semantic Mapping for Autonomous Driving

1 code implementation • 7 May 2023 • Siyu Li, Kailun Yang, Hao Shi, Jiaming Zhang, Jiacheng Lin, Zhifeng Teng, Zhiyong Li

At the same time, an Across-Space Loss (ASL) is designed to mitigate the negative impact of geometric distortions.

Autonomous Driving

Energy-efficient Dense DNN Acceleration with Signed Bit-slice Architecture

no code implementations • 15 Mar 2022 • Dongseok Im, Gwangtae Park, Zhiyong Li, Junha Ryu, Hoi-jun Yoo

This paper proposes an energy-efficient signed bit-slice architecture that accelerates both high-precision and dense DNNs by exploiting the large number of zero values in signed bit-slices.
