Search Results for author: Yutao Hu

Found 8 papers, 2 papers with code

Few-Shot Semantic Segmentation with Democratic Attention Networks

no code implementations ECCV 2020 Haochen Wang, Xu-Dong Zhang, Yutao Hu, Yandan Yang, Xian-Bin Cao, Xian-Tong Zhen

The crux of few-shot segmentation is to extract object information from the support image and then propagate it to guide the segmentation of query images.

Few-Shot Semantic Segmentation Graph Attention +2

OmniMedVQA: A New Large-Scale Comprehensive Evaluation Benchmark for Medical LVLM

no code implementations14 Feb 2024 Yutao Hu, Tianbin Li, Quanfeng Lu, Wenqi Shao, Junjun He, Yu Qiao, Ping Luo

A significant challenge arises from the scarcity of diverse medical images spanning various modalities and anatomical regions, which is essential in real-world medical applications.

Medical Visual Question Answering Question Answering +1

Tiny LVLM-eHub: Early Multimodal Experiments with Bard

1 code implementation7 Aug 2023 Wenqi Shao, Yutao Hu, Peng Gao, Meng Lei, Kaipeng Zhang, Fanqing Meng, Peng Xu, Siyuan Huang, Hongsheng Li, Yu Qiao, Ping Luo

Secondly, it conducts an in-depth analysis of LVLMs' predictions using the ChatGPT Ensemble Evaluation (CEE), which leads to a robust and accurate evaluation and exhibits improved alignment with human evaluation compared to the word matching approach.

Hallucination Visual Reasoning

Double Graphs Regularized Multi-view Subspace Clustering

no code implementations30 Sep 2022 Longlong Chen, Yulong Wang, Youheng Liu, Yutao Hu, Libin Wang

In this paper, we propose a novel Double Graphs Regularized Multi-view Subspace Clustering (DGRMSC) method, which aims to harness both global and local structural information of multi-view data in a unified framework.

Clustering Multi-view Subspace Clustering

Global Weighted Tensor Nuclear Norm for Tensor Robust Principal Component Analysis

no code implementations28 Sep 2022 Libin Wang, Yulong Wang, Shiyuan Wang, Youheng Liu, Yutao Hu, Longlong Chen, Hong Chen

Tensor Robust Principal Component Analysis (TRPCA), which aims to recover a low-rank tensor corrupted by sparse noise, has attracted much attention in many real applications.

Alignment Enhancement Network for Fine-grained Visual Categorization

no code implementations1 Mar 2021 Yutao Hu

However, they are still inefficient to fully use the cross-layer information based on the simple aggregation strategy, while existing pairwise learning methods also fail to explore long-range interactions between different images.

Fine-Grained Image Classification Fine-Grained Visual Categorization

NAS-Count: Counting-by-Density with Neural Architecture Search

no code implementations ECCV 2020 Yutao Hu, Xiao-Long Jiang, Xuhui Liu, Baochang Zhang, Jungong Han, Xian-Bin Cao, David Doermann

Most of the recent advances in crowd counting have evolved from hand-designed density estimation networks, where multi-scale features are leveraged to address the scale variation problem, but at the expense of demanding design efforts.

Crowd Counting Density Estimation +1

Cannot find the paper you are looking for? You can Submit a new open access paper.