Search Results for author: Yueyi Zhang

Found 41 papers, 26 papers with code

Spatial Hierarchy Aware Residual Pyramid Network for Time-of-Flight Depth Denoising

1 code implementation ECCV 2020 Guanting Dong, Yueyi Zhang, Zhiwei Xiong

In this paper, we propose a Spatial Hierarchy Aware Residual Pyramid Network, called SHARP-Net, to remove the depth noise by fully exploiting the geometry information of the scene on different scales.

Denoising

Event-Enhanced Blurry Video Super-Resolution

1 code implementation17 Apr 2025 Dachun Kai, Yueyi Zhang, Jin Wang, Zeyu Xiao, Zhiwei Xiong, Xiaoyan Sun

In this paper, we tackle the task of blurry video super-resolution (BVSR), aiming to generate high-resolution (HR) videos from low-resolution (LR) and blurry inputs.

Deblurring Motion Estimation +2

Fine-Grained Evaluation of Large Vision-Language Models in Autonomous Driving

1 code implementation27 Mar 2025 Yue Li, Meng Tian, Zhenyu Lin, Jiangtong Zhu, Dechang Zhu, Haiqiang Liu, Zining Wang, Yueyi Zhang, Zhiwei Xiong, Xinhai Zhao

To further exploit the cognitive and reasoning interactions among the 5 domains for AD understanding, we start from a small-scale VLM and train the DS models on individual domain datasets (collected from 1. 4M DS QAs across public sources).

Attribute Autonomous Driving +3

Denoising Designs-inherited Search Framework for Image Denoising

no code implementations19 Feb 2025 Zheyu Zhang, Yueyi Zhang, Xiaoyan Sun

The parameters of our searched architecture are $1/3$ of Restormer's, and our method surpasses existing NAS-based denoising methods by $1. 50$ dB in the real-world dataset.

Image Denoising Neural Architecture Search

Spiking Point Transformer for Point Cloud Classification

1 code implementation19 Feb 2025 Peixi Wu, Bosong Chai, Hebei Li, Menghua Zheng, Yansong Peng, Zeyu Wang, Xuan Nie, Yueyi Zhang, Xiaoyan Sun

To this end, we present Spiking Point Transformer (SPT), the first transformer-based SNN framework for point cloud classification.

Classification Point Cloud Classification

S2D-LFE: Sparse-to-Dense Light Field Event Generation

1 code implementation CVPR 2025 Yutong Liu, Wenming Weng, Yueyi Zhang, Zhiwei Xiong

For the first time to our knowledge, S2D-LFE enables controllable novel view synthesis only from sparse-view light field event (LFE) data, and addresses three critical challenges for the LFE generation task: simplicity, controllability, and consistency.

Novel View Synthesis

Incomplete Multi-modal Brain Tumor Segmentation via Learnable Sorting State Space Model

no code implementations CVPR 2025 Zheyu Zhang, Yayuan Lu, Feipeng Ma, Yueyi Zhang, Huanjing Yue, Xiaoyan Sun

Brain tumor segmentation plays a crucial role in clinical diagnosis, yet the frequent unavailability of certain MRI modalities poses a significant challenge.

Brain Tumor Segmentation Mamba +2

Event-boosted Deformable 3D Gaussians for Dynamic Scene Reconstruction

no code implementations25 Nov 2024 Wenhao Xu, Wenming Weng, Yueyi Zhang, Ruikang Xu, Zhiwei Xiong

To address this, we introduce the first approach combining event cameras, which capture high-temporal-resolution, continuous motion data, with deformable 3D-GS for dynamic scene reconstruction.

3D Reconstruction Dynamic Reconstruction

Generalizable Non-Line-of-Sight Imaging with Learnable Physical Priors

no code implementations21 Sep 2024 Shida Sun, Yue Li, Yueyi Zhang, Zhiwei Xiong

Non-line-of-sight (NLOS) imaging, recovering the hidden volume from indirect reflections, has attracted increasing attention due to its potential applications.

Anatomical Consistency Distillation and Inconsistency Synthesis for Brain Tumor Segmentation with Missing Modalities

no code implementations25 Aug 2024 Zheyu Zhang, Xinzhao Liu, Zheng Chen, Yueyi Zhang, Huanjing Yue, Yunwei Ou, Xiaoyan Sun

Through validation on the BraTS2018 and BraTS2020 datasets, ACDIS substantiates its efficacy in the segmentation of brain tumors with missing MRI modalities.

Brain Tumor Segmentation Segmentation +2

EE-MLLM: A Data-Efficient and Compute-Efficient Multimodal Large Language Model

no code implementations21 Aug 2024 Feipeng Ma, Yizhou Zhou, Hebei Li, Zilong He, Siying Wu, Fengyun Rao, Yueyi Zhang, Xiaoyan Sun

While self-attention-based methods offer superior data efficiency due to their simple MLP architecture, they often suffer from lower computational efficiency due to concatenating visual and textual tokens as input for LLM.

Computational Efficiency Language Modeling +4

CEIA: CLIP-Based Event-Image Alignment for Open-World Event-Based Understanding

no code implementations9 Jul 2024 Wenhao Xu, Wenming Weng, Yueyi Zhang, Zhiwei Xiong

In response to this challenge, CEIA learns to align event and image data as an alternative instead of directly aligning event and text data.

Contrastive Learning Domain Adaptation +3

EvTexture: Event-driven Texture Enhancement for Video Super-Resolution

1 code implementation19 Jun 2024 Dachun Kai, Jiayao Lu, Yueyi Zhang, Xiaoyan Sun

Our method, called EvTexture, leverages high-frequency details of events to better recover texture regions in VSR.

Event-based vision Video Super-Resolution

Multi-Modal Generative Embedding Model

no code implementations29 May 2024 Feipeng Ma, Hongwei Xue, Guangting Wang, Yizhou Zhou, Fengyun Rao, Shilin Yan, Yueyi Zhang, Siying Wu, Mike Zheng Shou, Xiaoyan Sun

Existing models usually tackle these two types of problems by decoupling language modules into a text decoder for generation, and a text encoder for embedding.

Caption Generation Cross-Modal Retrieval +9

Scene Adaptive Sparse Transformer for Event-based Object Detection

1 code implementation CVPR 2024 Yansong Peng, Hebei Li, Yueyi Zhang, Xiaoyan Sun, Feng Wu

However, they display inadequate sparsity and adaptability when applied to event-based object detection, since these approaches cannot balance the fine granularity of token-level sparsification and the efficiency of window-based Transformers, leading to reduced performance and efficiency.

Object object-detection +1

Event-assisted Low-Light Video Object Segmentation

1 code implementation CVPR 2024 Hebei Li, Jin Wang, Jiahui Yuan, Yue Li, Wenming Weng, Yansong Peng, Yueyi Zhang, Zhiwei Xiong, Xiaoyan Sun

In the realm of video object segmentation (VOS), the challenge of operating under low-light conditions persists, resulting in notably degraded image quality and compromised accuracy when comparing query and memory frames for similarity computation.

Object Semantic Segmentation +2

Graph Relation Distillation for Efficient Biomedical Instance Segmentation

2 code implementations12 Jan 2024 Xiaoyu Liu, Yueyi Zhang, Zhiwei Xiong, Wei Huang, Bo Hu, Xiaoyan Sun, Feng Wu

IGD constructs a graph representing instance features and relations, transferring these two types of knowledge by enforcing instance graph consistency.

Instance Segmentation Knowledge Distillation +2

Cross-Dimension Affinity Distillation for 3D EM Neuron Segmentation

1 code implementation CVPR 2024 Xiaoyu Liu, Miaomiao Cai, Yinda Chen, Yueyi Zhang, Te Shi, Ruobing Zhang, Xuejin Chen, Zhiwei Xiong

Recent advancements utilize 3D CNNs to predict a 3D affinity map with improved accuracy but suffer from two challenges: high computational cost and limited input size especially for practical deployment for large-scale EM volumes.

Segmentation Transfer Learning

Domain Adaptive Synapse Detection with Weak Point Annotations

no code implementations31 Aug 2023 Qi Chen, Wei Huang, Yueyi Zhang, Zhiwei Xiong

In the second stage, we improve model generalizability on target data by regenerating square masks to get high-quality pseudo labels.

Segmentation

Deep Multi-Threshold Spiking-UNet for Image Processing

1 code implementation20 Jul 2023 Hebei Li, Yueyi Zhang, Zhiwei Xiong, Xiaoyan Sun

Furthermore, we adopt a flow-based training method to fine-tune the converted models, reducing time steps while preserving performance.

Denoising Image Segmentation +1

Toward Real-World Light Field Super-Resolution

1 code implementation30 May 2023 Zeyu Xiao, Ruisheng Gao, Yutong Liu, Yueyi Zhang, Zhiwei Xiong

Deep learning has opened up new possibilities for light field super-resolution (SR), but existing methods trained on synthetic datasets with simple degradations (e. g., bicubic downsampling) suffer from poor performance when applied to complex real-world scenarios.

Super-Resolution

Image Captioning with Multi-Context Synthetic Data

no code implementations29 May 2023 Feipeng Ma, Yizhou Zhou, Fengyun Rao, Yueyi Zhang, Xiaoyan Sun

This potential can be harnessed to create synthetic image-text pairs for training captioning models.

Image Captioning Language Modelling +2

A Soma Segmentation Benchmark in Full Adult Fly Brain

1 code implementation CVPR 2023 Xiaoyu Liu, Bo Hu, Mingxing Li, Wei Huang, Yueyi Zhang, Zhiwei Xiong

Finally, we provide quantitative and qualitative benchmark comparisons on the testset to validate the superiority of the proposed method, as well as preliminary statistics of the reconstructed somas in the full adult fly brain from the biological perspective.

Progressive Spatio-Temporal Alignment for Efficient Event-Based Motion Estimation

1 code implementation CVPR 2023 Xueyan Huang, Yueyi Zhang, Zhiwei Xiong

In addition, a dynamic batch size strategy is applied to adaptively adjust the batch size so that all events in the batch are consistent with the current motion model.

Event-based Motion Estimation Motion Estimation

Depth Estimation From Indoor Panoramas With Neural Scene Representation

1 code implementation CVPR 2023 Wenjie Chang, Yueyi Zhang, Zhiwei Xiong

Depth estimation from indoor panoramas is challenging due to the equirectangular distortions of panoramas and inaccurate matching.

Depth Estimation Position

NLOST: Non-Line-of-Sight Imaging With Transformer

no code implementations CVPR 2023 Yue Li, Jiayong Peng, Juntian Ye, Yueyi Zhang, Feihu Xu, Zhiwei Xiong

Specifically, after extracting the shallow features with the assistance of physics-based priors, we design two spatial-temporal self attention encoders to explore both local and global correlations within 3D NLOS data by splitting or downsampling the features into different scales, respectively.

Decoder

Unsupervised Video Deraining with An Event Camera

1 code implementation ICCV 2023 Jin Wang, Wenming Weng, Yueyi Zhang, Zhiwei Xiong

Current unsupervised video deraining methods are inefficient in modeling the intricate spatio-temporal properties of rain, which leads to unsatisfactory results.

Contrastive Learning Rain Removal +1

Learning Cross-Representation Affinity Consistency for Sparsely Supervised Biomedical Instance Segmentation

1 code implementation ICCV 2023 Xiaoyu Liu, Wei Huang, Zhiwei Xiong, Shenglong Zhou, Yueyi Zhang, Xuejin Chen, Zheng-Jun Zha, Feng Wu

Sparse instance-level supervision has recently been explored to address insufficient annotation in biomedical instance segmentation, which is easier to annotate crowded instances and better preserves instance completeness for 3D volumetric datasets compared to common semi-supervision. In this paper, we propose a sparsely supervised biomedical instance segmentation framework via cross-representation affinity consistency regularization.

Instance Segmentation Pseudo Label +1

Event-Based Blurry Frame Interpolation Under Blind Exposure

1 code implementation CVPR 2023 Wenming Weng, Yueyi Zhang, Zhiwei Xiong

Therefore, we first propose an exposure estimation strategy guided by event streams to estimate the lost exposure prior, transforming the blind exposure problem well-posed.

Degradation-agnostic Correspondence from Resolution-asymmetric Stereo

no code implementations CVPR 2022 Xihao Chen, Zhiwei Xiong, Zhen Cheng, Jiayong Peng, Yueyi Zhang, Zheng-Jun Zha

Interestingly, we find that, although a stereo matching network trained with the photometric loss is not optimal, its feature extractor can produce degradation-agnostic and matching-specific features.

Stereo Matching

Retinal Vessel Segmentation with Pixel-wise Adaptive Filters

1 code implementation3 Feb 2022 Mingxing Li, Shenglong Zhou, Chang Chen, Yueyi Zhang, Dong Liu, Zhiwei Xiong

Accurate retinal vessel segmentation is challenging because of the complex texture of retinal vessels and low imaging contrast.

Retinal Vessel Segmentation Segmentation

Exploiting Rigidity Constraints for LiDAR Scene Flow Estimation

no code implementations CVPR 2022 Guanting Dong, Yueyi Zhang, HanLin Li, Xiaoyan Sun, Zhiwei Xiong

Previous LiDAR scene flow estimation methods, especially recurrent neural networks, usually suffer from structure distortion in challenging cases, such as sparse reflection and motion occlusions.

Autonomous Driving Scene Flow Estimation

Advanced Deep Networks for 3D Mitochondria Instance Segmentation

1 code implementation16 Apr 2021 Mingxing Li, Chang Chen, Xiaoyu Liu, Wei Huang, Yueyi Zhang, Zhiwei Xiong

Mitochondria instance segmentation from electron microscopy (EM) images has seen notable progress since the introduction of deep learning methods.

3D Instance Segmentation Denoising +2

Event-Based Video Reconstruction Using Transformer

1 code implementation ICCV 2021 Wenming Weng, Yueyi Zhang, Zhiwei Xiong

Event cameras, which output events by detecting spatio-temporal brightness changes, bring a novel paradigm to image sensors with high dynamic range and low latency.

Event-based Object Segmentation Event-Based Video Reconstruction +1

Depth Acquisition from Density Modulated Binary Patterns

no code implementations CVPR 2013 Zhe Yang, Zhiwei Xiong, Yueyi Zhang, Jiao Wang, Feng Wu

First, we propose an algorithm to design the patterns to carry more phase information without compromising the depth reconstruction from a single captured image as with Kinect.

Cannot find the paper you are looking for? You can Submit a new open access paper.