Search Results for author: Zhiyong Li

Found 11 papers, 8 papers with code

EchoTrack: Auditory Referring Multi-Object Tracking for Autonomous Driving

1 code implementation • 28 Feb 2024 • Jiacheng Lin, Jiajun Chen, Kunyu Peng, Xuan He, Zhiyong Li, Rainer Stiefelhagen, Kailun Yang

This paper introduces the task of Auditory Referring Multi-Object Tracking (AR-MOT), which dynamically tracks specific objects in a video sequence based on audio expressions and appears as a challenging problem in autonomous driving.

Autonomous Driving Multi-Object Tracking +1

Paper
Code

LF Tracy: A Unified Single-Pipeline Approach for Salient Object Detection in Light Field Cameras

1 code implementation • 30 Jan 2024 • Fei Teng, Jiaming Zhang, Jiawei Liu, Kunyu Peng, Xina Cheng, Zhiyong Li, Kailun Yang

Previous approaches predominantly employ a custom two-stream design to discover the implicit angular feature within light field cameras, leading to significant information isolation between different LF representations.

Data Augmentation object-detection +2

Paper
Code

S$^3$-MonoDETR: Supervised Shape&Scale-perceptive Deformable Transformer for Monocular 3D Object Detection

no code implementations • 2 Sep 2023 • Xuan He, Kailun Yang, Junwei Zheng, Jin Yuan, Luis M. Bergasa, HUI ZHANG, Zhiyong Li

These methods typically use visual and depth representations to generate query points on objects, whose quality plays a decisive role in the detection accuracy.

Monocular 3D Object Detection object-detection

Paper
Add Code

EPCFormer: Expression Prompt Collaboration Transformer for Universal Referring Video Object Segmentation

1 code implementation • 8 Aug 2023 • Jiajun Chen, Jiacheng Lin, Zhiqiang Xiao, Haolong Fu, Ke Nai, Kailun Yang, Zhiyong Li

Next, we propose an Expression Alignment (EA) mechanism for audio and text expressions.

Ranked #10 on Referring Expression Segmentation on Refer-YouTube-VOS (2021 public validation)

Contrastive Learning Object +5

Paper
Code

Contrast-augmented Diffusion Model with Fine-grained Sequence Alignment for Markup-to-Image Generation

1 code implementation • 2 Aug 2023 • Guojin Zhong, Jin Yuan, Pan Wang, Kailun Yang, Weili Guan, Zhiyong Li

The recently rising markup-to-image generation poses greater challenges as compared to natural image generation, due to its low tolerance for errors as well as the complex sequence and context correlations between markup and rendered image.

Denoising Image Generation

Paper
Code

VPUFormer: Visual Prompt Unified Transformer for Interactive Image Segmentation

1 code implementation • 11 Jun 2023 • Xu Zhang, Kailun Yang, Jiacheng Lin, Jin Yuan, Zhiyong Li, Shutao Li

Specifically, we design a Prompt-unified Encoder (PuE) by using Gaussian mapping to generate a unified one-dimensional vector for click, box, and scribble prompts, which well captures users' intentions as well as provides a denser representation of user prompts.

Image Segmentation Segmentation +1

Paper
Code

SSD-MonoDETR: Supervised Scale-aware Deformable Transformer for Monocular 3D Object Detection

1 code implementation • 12 May 2023 • Xuan He, Fan Yang, Kailun Yang, Jiacheng Lin, Haolong Fu, Meng Wang, Jin Yuan, Zhiyong Li

To tackle this problem, this paper proposes a novel "Supervised Scale-aware Deformable Attention" (SSDA) for monocular 3D object detection.

Monocular 3D Object Detection Object +1

Paper
Code

AdaptiveClick: Clicks-aware Transformer with Adaptive Focal Loss for Interactive Image Segmentation

1 code implementation • 7 May 2023 • Jiacheng Lin, Jiajun Chen, Kailun Yang, Alina Roitberg, Siyu Li, Zhiyong Li, Shutao Li

Interactive Image Segmentation (IIS) has emerged as a promising technique for decreasing annotation time.

Image Segmentation Segmentation +1

Paper
Code

Bi-Mapper: Holistic BEV Semantic Mapping for Autonomous Driving

1 code implementation • 7 May 2023 • Siyu Li, Kailun Yang, Hao Shi, Jiaming Zhang, Jiacheng Lin, Zhifeng Teng, Zhiyong Li

At the same time, an Across-Space Loss (ASL) is designed to mitigate the negative impact of geometric distortions.

Autonomous Driving

Paper
Code

Energy-efficient Dense DNN Acceleration with Signed Bit-slice Architecture

no code implementations • 15 Mar 2022 • Dongseok Im, Gwangtae Park, Zhiyong Li, Junha Ryu, Hoi-jun Yoo

This paper proposes energy-efficient signed bit-slice architecture which accelerates both high-precision and dense DNNs by exploiting a large number of zero values of signed bit-slices.

Paper
Add Code

STURE: Spatial-Temporal Mutual Representation Learning for Robust Data Association in Online Multi-Object Tracking

no code implementations • 18 Jan 2022 • Haidong Wang, Zhiyong Li, Yaping Li, Ke Nai, Ming Wen

The feature difference between current candidate detections and historical tracklets makes the object association much harder.

Multi-Object Tracking Object +2

Paper
Add Code

Cannot find the paper you are looking for? You can Submit a new open access paper.