Search Results for author: Yutao Hu

Found 8 papers, 2 papers with code

Few-Shot Semantic Segmentation with Democratic Attention Networks

no code implementations • ECCV 2020 • Haochen Wang, Xu-Dong Zhang, Yutao Hu, Yandan Yang, Xian-Bin Cao, Xian-Tong Zhen

The crux of few-shot segmentation is to extract object information from the support image and then propagate it to guide the segmentation of query images.

Few-Shot Semantic Segmentation Graph Attention +2

Paper
Add Code

OmniMedVQA: A New Large-Scale Comprehensive Evaluation Benchmark for Medical LVLM

no code implementations • 14 Feb 2024 • Yutao Hu, Tianbin Li, Quanfeng Lu, Wenqi Shao, Junjun He, Yu Qiao, Ping Luo

A significant challenge arises from the scarcity of diverse medical images spanning various modalities and anatomical regions, which is essential in real-world medical applications.

Medical Visual Question Answering Question Answering +1

Paper
Add Code

Beyond One-to-One: Rethinking the Referring Image Segmentation

1 code implementation • ICCV 2023 • Yutao Hu, Qixiong Wang, Wenqi Shao, Enze Xie, Zhenguo Li, Jungong Han, Ping Luo

In this paper, we address this issue from two perspectives.

Image Segmentation Semantic Segmentation +1

Paper
Code

Tiny LVLM-eHub: Early Multimodal Experiments with Bard

1 code implementation • 7 Aug 2023 • Wenqi Shao, Yutao Hu, Peng Gao, Meng Lei, Kaipeng Zhang, Fanqing Meng, Peng Xu, Siyuan Huang, Hongsheng Li, Yu Qiao, Ping Luo

Secondly, it conducts an in-depth analysis of LVLMs' predictions using the ChatGPT Ensemble Evaluation (CEE), which leads to a robust and accurate evaluation and exhibits improved alignment with human evaluation compared to the word matching approach.

Hallucination Visual Reasoning

360

Paper
Code

Double Graphs Regularized Multi-view Subspace Clustering

no code implementations • 30 Sep 2022 • Longlong Chen, Yulong Wang, Youheng Liu, Yutao Hu, Libin Wang

In this paper, we propose a novel Double Graphs Regularized Multi-view Subspace Clustering (DGRMSC) method, which aims to harness both global and local structural information of multi-view data in a unified framework.

Clustering Multi-view Subspace Clustering

Paper
Add Code

Global Weighted Tensor Nuclear Norm for Tensor Robust Principal Component Analysis

no code implementations • 28 Sep 2022 • Libin Wang, Yulong Wang, Shiyuan Wang, Youheng Liu, Yutao Hu, Longlong Chen, Hong Chen

Tensor Robust Principal Component Analysis (TRPCA), which aims to recover a low-rank tensor corrupted by sparse noise, has attracted much attention in many real applications.

Paper
Add Code

Alignment Enhancement Network for Fine-grained Visual Categorization

no code implementations • 1 Mar 2021 • Yutao Hu

However, they are still inefficient to fully use the cross-layer information based on the simple aggregation strategy, while existing pairwise learning methods also fail to explore long-range interactions between different images.

Ranked #8 on Fine-Grained Image Classification on FGVC Aircraft

Fine-Grained Image Classification Fine-Grained Visual Categorization

Paper
Add Code

NAS-Count: Counting-by-Density with Neural Architecture Search

no code implementations • ECCV 2020 • Yutao Hu, Xiao-Long Jiang, Xuhui Liu, Baochang Zhang, Jungong Han, Xian-Bin Cao, David Doermann

Most of the recent advances in crowd counting have evolved from hand-designed density estimation networks, where multi-scale features are leveraged to address the scale variation problem, but at the expense of demanding design efforts.

Crowd Counting Density Estimation +1

Paper
Add Code

Cannot find the paper you are looking for? You can Submit a new open access paper.