Search Results for author: Yulin Li

Found 9 papers, 3 papers with code

Collaborative Position Reasoning Network for Referring Image Segmentation

no code implementations • 22 Jan 2024 • JianJian Cao, Beiya Dai, Yulin Li, Xiameng Qin, Jingdong Wang

Holi integrates features of the two modalities through a cross-modal attention mechanism, which suppresses irrelevant redundancy under the guidance of positioning information from RoCo.

Image Segmentation Position +2


Frequency Domain Modality-invariant Feature Learning for Visible-infrared Person Re-Identification

no code implementations • 3 Jan 2024 • Yulin Li, Tianzhu Zhang, Yongdong Zhang

Visible-infrared person re-identification (VI-ReID) is challenging due to the significant cross-modality discrepancies between visible and infrared images.

Metric Learning Person Re-Identification


MataDoc: Margin and Text Aware Document Dewarping for Arbitrary Boundary

no code implementations • 24 Jul 2023 • Beiya Dai, Xing Li, Qunyi Xie, Yulin Li, Xiameng Qin, Chengquan Zhang, Kun Yao, Junyu Han

To produce a comprehensive evaluation of MataDoc, we propose a novel benchmark ArbDoc, mainly consisting of document images with arbitrary boundaries in four typical scenarios.

document understanding Optical Character Recognition (OCR)


Fast-StrucTexT: An Efficient Hourglass Transformer with Modality-guided Dynamic Token Merge for Document Understanding

no code implementations • 19 May 2023 • Mingliang Zhai, Yulin Li, Xiameng Qin, Chen Yi, Qunyi Xie, Chengquan Zhang, Kun Yao, Yuwei Wu, Yunde Jia

Transformers achieve promising performance in document understanding because of their high effectiveness, but they still suffer from computational complexity that is quadratic in the sequence length.

document understanding


StrucTexTv2: Masked Visual-Textual Prediction for Document Image Pre-training

1 code implementation • 1 Mar 2023 • Yuechen Yu, Yulin Li, Chengquan Zhang, Xiaoqiang Zhang, Zengyuan Guo, Xiameng Qin, Kun Yao, Junyu Han, Errui Ding, Jingdong Wang

Compared to the masked multi-modal modeling methods for document image understanding that rely on both the image and text modalities, StrucTexTv2 models image-only input and potentially deals with more application scenarios free from OCR pre-processing.

Ranked #1 on Table Recognition on WTW

Document Image Classification Language Modelling +3



Deep Learning Predicts Prevalent and Incident Parkinson's Disease From UK Biobank Fundus Imaging

1 code implementation • 13 Feb 2023 • Charlie Tran, Kai Shen, Kang Liu, Akshay Ashok, Adolfo Ramirez-Zamora, Jinghua Chen, Yulin Li, Ruogu Fang

Parkinson's disease is the world's fastest-growing neurological disorder.


StrucTexT: Structured Text Understanding with Multi-Modal Transformers

1 code implementation • 6 Aug 2021 • Yulin Li, Yuxi Qian, Yuchen Yu, Xiameng Qin, Chengquan Zhang, Yan Liu, Kun Yao, Junyu Han, Jingtuo Liu, Errui Ding

Due to the complexity of content and layout in VRDs, structured text understanding has been a challenging task.

Entity Linking Language Modelling +1



Diverse Part Discovery: Occluded Person Re-identification with Part-Aware Transformer

no code implementations • CVPR 2021 • Yulin Li, Jianfeng He, Tianzhu Zhang, Xiang Liu, Yongdong Zhang, Feng Wu

To address these issues, we propose a novel end-to-end Part-Aware Transformer (PAT) for occluded person Re-ID through diverse part discovery via a transformer encoder-decoder architecture, including a pixel-context-based transformer encoder and a part-prototype-based transformer decoder.

Person Re-Identification


Neural-iLQR: A Learning-Aided Shooting Method for Trajectory Optimization

no code implementations • 21 Nov 2020 • Zilong Cheng, Yulin Li, Kai Chen, Jun Ma, Tong Heng Lee

Iterative linear quadratic regulator (iLQR) has gained wide popularity in addressing trajectory optimization problems with nonlinear system models.

