Search Results for author: Zhang Li

Found 11 papers, 3 papers with code

3MOS: Multi-sources, Multi-resolutions, and Multi-scenes dataset for Optical-SAR image matching

no code implementations • 1 Apr 2024 • Yibin Ye, Xichao Teng, Shuo Chen, Yijie Bian, Tao Tan, Zhang Li

Optical-SAR image matching is a fundamental task for image fusion and visual navigation.

Domain Adaptation Visual Navigation

Paper
Add Code

TextMonkey: An OCR-Free Large Multimodal Model for Understanding Document

1 code implementation • 7 Mar 2024 • Yuliang Liu, Biao Yang, Qiang Liu, Zhang Li, Zhiyin Ma, Shuo Zhang, Xiang Bai

We present TextMonkey, a large multimodal model (LMM) tailored for text-centric tasks.

document understanding Key Information Extraction +4

1,327

Paper
Code

Monkey: Image Resolution and Text Label Are Important Things for Large Multi-modal Models

1 code implementation • 11 Nov 2023 • Zhang Li, Biao Yang, Qiang Liu, Zhiyin Ma, Shuo Zhang, Jingxu Yang, Yabo Sun, Yuliang Liu, Xiang Bai

Additionally, experiments on 18 datasets further demonstrate that Monkey surpasses existing LMMs in many tasks like Image Captioning and various Visual Question Answering formats.

Image Captioning Question Answering +2

1,327

Paper
Code

On the Hidden Mystery of OCR in Large Multimodal Models

1 code implementation • 13 May 2023 • Yuliang Liu, Zhang Li, Biao Yang, Chunyuan Li, XuCheng Yin, Cheng-Lin Liu, Lianwen Jin, Xiang Bai

In this paper, we conducted a comprehensive evaluation of Large Multimodal Models, such as GPT4V and Gemini, in various text-related visual tasks including Text Recognition, Scene Text-Centric Visual Question Answering (VQA), Document-Oriented VQA, Key Information Extraction (KIE), and Handwritten Mathematical Expression Recognition (HMER).

Key Information Extraction Nutrition +4

272

Paper
Code

Oriented Object Detection in Optical Remote Sensing Images using Deep Learning: A Survey

no code implementations • 21 Feb 2023 • Kun Wang, Zi Wang, Zhang Li, Ang Su, Xichao Teng, Minhao Liu, Qifeng Yu

Given the rapid development of this field, this paper aims to provide a comprehensive survey of recent advances in oriented object detection.

Object object-detection +2

Paper
Add Code

LYSTO: The Lymphocyte Assessment Hackathon and Benchmark Dataset

no code implementations • 16 Jan 2023 • Yiping Jiao, Jeroen van der Laak, Shadi Albarqouni, Zhang Li, Tao Tan, Abhir Bhalerao, Jiabo Ma, Jiamei Sun, Johnathan Pocock, Josien P. W. Pluim, Navid Alemi Koohbanani, Raja Muhammad Saad Bashir, Shan E Ahmed Raza, Sibo Liu, Simon Graham, Suzanne Wetstein, Syed Ali Khurram, Thomas Watson, Nasir Rajpoot, Mitko Veta, Francesco Ciompi

Additionally, we present post-competition results where we show how the presented methods perform on an independent set of lung cancer slides, which was not part of the initial competition, as well as a comparison on lymphocyte assessment between presented methods and a panel of pathologists.

Paper
Add Code

Bridging the Domain Gap in Satellite Pose Estimation: a Self-Training Approach based on Geometrical Constraints

no code implementations • 23 Dec 2022 • Zi Wang, Minglin Chen, Yulan Guo, Zhang Li, Qifeng Yu

Recently, unsupervised domain adaptation in satellite pose estimation has gained increasing attention, aiming at alleviating the annotation cost for training deep models.

Pose Estimation Pseudo Label +1

Paper
Add Code

Deep Learning Methods for Lung Cancer Segmentation in Whole-slide Histopathology Images -- the ACDC@LungHP Challenge 2019

no code implementations • 21 Aug 2020 • Zhang Li, Jiehua Zhang, Tao Tan, Xichao Teng, Xiaoliang Sun, Yang Li, Lihong Liu, Yang Xiao, Byungjae Lee, Yilong Li, Qianni Zhang, Shujiao Sun, Yushan Zheng, Junyu Yan, Ni Li, Yiyu Hong, Junsu Ko, Hyun Jung, Yanling Liu, Yu-cheng Chen, Ching-Wei Wang, Vladimir Yurovskiy, Pavel Maevskikh, Vahid Khanagha, Yi Jiang, Xiangjun Feng, Zhihong Liu, Daiqiang Li, Peter J. Schüffler, Qifeng Yu, Hui Chen, Yuling Tang, Geert Litjens

All methods were based on deep learning and categorized into two groups: multi-model method and single model method.

Segmentation

Paper
Add Code

Minimal Solutions for Relative Pose with a Single Affine Correspondence

no code implementations • CVPR 2020 • Banglei Guan, Ji Zhao, Zhang Li, Fang Sun, Friedrich Fraundorfer

In this paper we present four cases of minimal solutions for two-view relative pose estimation by exploiting the affine transformation between feature points and we demonstrate efficient solvers for these cases.

Motion Estimation Outlier Detection +1

Paper
Add Code

Computer-aided diagnosis of lung carcinoma using deep learning - a pilot study

no code implementations • 14 Mar 2018 • Zhang Li, Zheyu Hu, Jiaolong Xu, Tao Tan, Hui Chen, Zhi Duan, Ping Liu, Jun Tang, Guoping Cai, Quchang Ouyang, Yuling Tang, Geert Litjens, Qiang Li

Aim: Early detection and correct diagnosis of lung cancer are the most important steps in improving patient outcome.

Lung Cancer Diagnosis

Paper
Add Code

Optimize transfer learning for lung diseases in bronchoscopy using a new concept: sequential fine-tuning

no code implementations • 10 Feb 2018 • Tao Tan, Zhang Li, Haixia Liu, Ping Liu, Wenfang Tang, Hui Li, Yue Sun, Yusheng Yan, Keyu Li, Tao Xu, Shanshan Wan, Ke Lou, Jun Xu, Huiming Ying, Quchang Ouyang, Yuling Tang, Zheyu Hu, Qiang Li

To help doctors to be more selective on biopsies and provide a second opinion on diagnosis, in this work, we propose a computer-aided diagnosis (CAD) system for lung diseases including cancers and tuberculosis (TB).

Transfer Learning

Paper
Add Code

Cannot find the paper you are looking for? You can Submit a new open access paper.