Search Results for author: Yu Dai

Found 12 papers, 6 papers with code

ARMOR v0.1: Empowering Autoregressive Multimodal Understanding Model with Interleaved Multimodal Generation via Asymmetric Synergy

no code implementations9 Mar 2025 Jianwen Sun, Yukang Feng, Chuanhao Li, Fanrui Zhang, Zizhen Li, Jiaxin Ai, Sizhuo Zhou, Yu Dai, Shenglin Zhang, Kaipeng Zhang

Existing UniMs are designed to simultaneously learn both multimodal understanding and generation capabilities, demanding substantial computational resources, and often struggle to generate interleaved text-image.

Decoder Image Generation +1

Modulating CNN Features with Pre-Trained ViT Representations for Open-Vocabulary Object Detection

no code implementations28 Jan 2025 Xiangyu Gao, Yu Dai, Benliu Qiu, Hongliang Li

On OV-COCO, the proposed method achieves 44. 3 AP$_{50}^{\mathrm{novel}}$ with ViT-B/16 and 48. 5 AP$_{50}^{\mathrm{novel}}$ with ViT-L/14.

object-detection Open-vocabulary object detection +1

PSFHS Challenge Report: Pubic Symphysis and Fetal Head Segmentation from Intrapartum Ultrasound Images

no code implementations17 Sep 2024 Jieyun Bai, ZiHao Zhou, Zhanhong Ou, Gregor Koehler, Raphael Stock, Klaus Maier-Hein, Marawan Elbatel, Robert Martí, Xiaomeng Li, Yaoyang Qiu, Panjie Gou, Gongping Chen, Lei Zhao, Jianxun Zhang, Yu Dai, Fangyijie Wang, Guénolé Silvestre, Kathleen Curran, Hongkun Sun, Jing Xu, Pengzhou Cai, Lu Jiang, Libin Lan, Dong Ni, Mei Zhong, Gaowen Chen, Víctor M. Campello, Yaosheng Lu, Karim Lekadir

This challenge aimed to enhance the development of automatic segmentation algorithms at an international scale, providing the largest dataset to date with 5, 101 intrapartum ultrasound images collected from two ultrasound machines across three hospitals from two institutions.

Segmentation

VGA: Vision GUI Assistant -- Minimizing Hallucinations through Image-Centric Fine-Tuning

1 code implementation20 Jun 2024 Ziyang Meng, Yu Dai, Zezheng Gong, Shaoxiong Guo, Minglong Tang, Tongquan Wei

We first construct a Vision Question Answering (VQA) dataset of 63. 8k high-quality examples with our propose Referent Method, which ensures the model's responses are highly depend on visual content within the image.

Image Comprehension Question Answering +1

Rethinking the Unpretentious U-net for Medical Ultrasound Image Segmentation

2 code implementations15 Sep 2022 Gongping Chen, Lei LI, Jianxun Zhang, Yu Dai

However, variable tumor morphology, blurred boundary, and similar intensity distributions bring challenges for accurate segmentation of breast tumors.

Image Segmentation Segmentation +1

AAU-net: An Adaptive Attention U-net for Breast Lesions Segmentation in Ultrasound Images

2 code implementations26 Apr 2022 Gongping Chen, Yu Dai, Jianxun Zhang, Moi Hoon Yap

Different from existing attention mechanisms, the hybrid adaptive attention module can guide the network to adaptively select more robust representation in channel and space dimensions to cope with more complex breast lesions segmentation.

Lesion Segmentation Segmentation

Cannot find the paper you are looking for? You can Submit a new open access paper.