Search Results for author: Zhaohu Xing

Found 16 papers, 12 papers with code

Infinity-MM: Scaling Multimodal Performance with Large-Scale and High-Quality Instruction Data

4 code implementations • 24 Oct 2024 • Shuhao Gu, Jialing Zhang, Siyuan Zhou, Kevin Yu, Zhaohu Xing, Liangdong Wang, Zhou Cao, Jintao Jia, Zhuoyi Zhang, YiXuan Wang, Zhenchong Hu, Bo-Wen Zhang, Jijie Li, Dong Liang, Yingli Zhao, Songjing Wang, Yulong Ao, Yiming Ju, Huanhuan Ma, Xiaotong Li, Haiwen Diao, Yufeng Cui, Xinlong Wang, Yaoqi Liu, Fangxiang Feng, Guang Liu

Despite the availability of several open-source multimodal datasets, limitations in the scale and quality of open-source instruction data hinder the performance of VLMs trained on these datasets, leading to a significant gap compared to models trained on closed-source data.

Image Generation, Question Generation, +2

Teaching Tailored to Talent: Adverse Weather Restoration via Prompt Pool and Depth-Anything Constraint

no code implementations • 24 Sep 2024 • Sixiang Chen, Tian Ye, Kai Zhang, Zhaohu Xing, Yunlong Lin, Lei Zhu

Recent advancements in adverse weather restoration have shown potential, yet the unpredictable and varied combinations of weather degradations in the real world pose significant challenges.

Computational Efficiency

Diff-VPS: Video Polyp Segmentation via a Multi-task Diffusion Network with Adversarial Temporal Reasoning

1 code implementation • 11 Sep 2024 • Yingling Lu, Yijun Yang, Zhaohu Xing, Qiong Wang, Lei Zhu

We incorporate multi-task supervision into diffusion models to promote the discrimination of diffusion models on pixel-by-pixel segmentation.

Segmentation, Video Polyp Segmentation

Timeline and Boundary Guided Diffusion Network for Video Shadow Detection

1 code implementation • 21 Aug 2024 • Haipeng Zhou, Honqiu Wang, Tian Ye, Zhaohu Xing, Jun Ma, Ping Li, Qiong Wang, Lei Zhu

Moreover, we are the first to introduce the Diffusion model for VSD in which we explore a Space-Time Encoded Embedding (STEE) to inject the temporal guidance for Diffusion to conduct shadow detection.

Shadow Detection, Video Shadow Detection

AGLLDiff: Guiding Diffusion Models Towards Unsupervised Training-free Real-world Low-light Image Enhancement

no code implementations • 20 Jul 2024 • Yunlong Lin, Tian Ye, Sixiang Chen, Zhenqi Fu, Yingying Wang, Wenhao Chai, Zhaohu Xing, Lei Zhu, Xinghao Ding

Existing low-light image enhancement (LIE) methods have achieved noteworthy success in solving synthetic distortions, yet they often fall short in practical applications.

Attribute, Low-Light Image Enhancement

Vivim: a Video Vision Mamba for Medical Video Segmentation

1 code implementation • 25 Jan 2024 • Yijun Yang, Zhaohu Xing, Chunwang Huang, Lei Zhu

To this end, this paper presents a Video Vision Mamba-based framework, dubbed Vivim, for medical video segmentation tasks.

Lesion Segmentation, Mamba, +5

SegMamba: Long-range Sequential Modeling Mamba For 3D Medical Image Segmentation

1 code implementation • 24 Jan 2024 • Zhaohu Xing, Tian Ye, Yijun Yang, Guang Liu, Lei Zhu

Our SegMamba, in contrast to Transformer-based methods, excels in whole volume feature modeling from a state space model standpoint, maintaining superior processing speed, even with volume features at a resolution of $64\times 64\times 64$.

Image Segmentation, Mamba, +2

Learning Diffusion Texture Priors for Image Restoration

no code implementations • CVPR 2024 • Tian Ye, Sixiang Chen, Wenhao Chai, Zhaohu Xing, Jing Qin, Ge Lin, Lei Zhu

When adopting diffusion models for image restoration, the crucial challenge lies in how to preserve high-level image fidelity in the random diffusion process and generate accurate background structures and realistic texture details.

Image Generation, Image Restoration

HybridMIM: A Hybrid Masked Image Modeling Framework for 3D Medical Image Segmentation

1 code implementation • 18 Mar 2023 • Zhaohu Xing, Lei Zhu, Lequan Yu, Zhiheng Xing, Liang Wan

Masked image modeling (MIM) with transformer backbones has recently been exploited as a powerful self-supervised pre-training technique.

Contrastive Learning, Image Segmentation, +3

Diff-UNet: A Diffusion Embedded Network for Volumetric Segmentation

1 code implementation • 18 Mar 2023 • Zhaohu Xing, Liang Wan, Huazhu Fu, Guang Yang, Lei Zhu

Our experimental results also indicate the universality and effectiveness of the proposed model.

Denoising, Segmentation

NestedFormer: Nested Modality-Aware Transformer for Brain Tumor Segmentation

1 code implementation • 31 Aug 2022 • Zhaohu Xing, Lequan Yu, Liang Wan, Tong Han, Lei Zhu

Multi-modal MR imaging is routinely used in clinical practice to diagnose and investigate brain tumors by providing rich complementary information.

Brain Tumor Segmentation, Decoder, +3
