Search Results for author: Zhi Yu

Found 16 papers, 5 papers with code

An End-to-End OCR Text Re-organization Sequence Learning for Rich-text Detail Image Comprehension

no code implementations • ECCV 2020 • Liangcheng Li, Feiyu Gao, Jiajun Bu, Yongpan Wang, Zhi Yu, Qi Zheng

Nowadays rich description on detail images help users know more about the commodities.

Image Comprehension Optical Character Recognition (OCR)

Paper
Add Code

LayoutLLM: Layout Instruction Tuning with Large Language Models for Document Understanding

3 code implementations • 8 Apr 2024 • Chuwei Luo, Yufan Shen, Zhaoqing Zhu, Qi Zheng, Zhi Yu, Cong Yao

The core of LayoutLLM is a layout instruction tuning strategy, which is specially designed to enhance the comprehension and utilization of document layouts.

document understanding

926

Paper
Code

Less is More: A Closer Look at Semantic-based Few-Shot Learning

no code implementations • 10 Jan 2024 • Chunpeng Zhou, Haishuai Wang, Xilu Yuan, Zhi Yu, Jiajun Bu

To address this, we propose a simple but effective framework for few-shot learning tasks, specifically designed to exploit the textual information and language model.

Few-Shot Learning Language Modelling

Paper
Add Code

LORE++: Logical Location Regression Network for Table Structure Recognition with Pre-training

no code implementations • 3 Jan 2024 • Rujiao Long, Hangdi Xing, Zhibo Yang, Qi Zheng, Zhi Yu, Cong Yao, Fei Huang

We model TSR as a logical location regression problem and propose a new TSR framework called LORE, standing for LOgical location REgression network, which for the first time regresses logical location as well as spatial location of table cells in a unified network.

regression

Paper
Add Code

Multi-View Fusion and Distillation for Subgrade Distresses Detection based on 3D-GPR

1 code implementation • 9 Aug 2023 • Chunpeng Zhou, Kangjie Ning, Haishuai Wang, Zhi Yu, Sheng Zhou, Jiajun Bu

To address these challenges, we introduce a novel methodology for the subgrade distress detection task by leveraging the multi-view information from 3D-GPR data.

GPR Knowledge Distillation +1

Paper
Code

Translate the Beauty in Songs: Jointly Learning to Align Melody and Translate Lyrics

no code implementations • 28 Mar 2023 • Chengxi Li, Kai Fan, Jiajun Bu, Boxing Chen, Zhongqiang Huang, Zhi Yu

Song translation requires both translation of lyrics and alignment of music notes so that the resulting verse can be sung to the accompanying melody, which is a challenging problem that has attracted some interests in different aspects of the translation process.

Translation

Paper
Add Code

LORE: Logical Location Regression Network for Table Structure Recognition

1 code implementation • 7 Mar 2023 • Hangdi Xing, Feiyu Gao, Rujiao Long, Jiajun Bu, Qi Zheng, Liangcheng Li, Cong Yao, Zhi Yu

Table structure recognition (TSR) aims at extracting tables in images into machine-understandable formats.

regression Table Recognition

926

Paper
Code

Dynamic Data-Free Knowledge Distillation by Easy-to-Hard Learning Strategy

1 code implementation • 29 Aug 2022 • Jingru Li, Sheng Zhou, Liangcheng Li, Haishuai Wang, Zhi Yu, Jiajun Bu

Besides, CuDFKD adapts the generation target dynamically according to the status of student model.

Data-free Knowledge Distillation

Paper
Code

A machine-learning-based tool for last closed-flux surface reconstruction on tokamaks

no code implementations • 12 Jul 2022 • Chenguang Wan, Zhi Yu, Alessandro Pau, Xiaojuan Liu, Jiangang Li

Tokamaks allow to confine fusion plasma with magnetic fields and one of the main challenges in the control of the magnetic configuration is the prediction/reconstruction of the Last Closed-Flux Surface (LCFS).

Surface Reconstruction

Paper
Add Code

SentiPrompt: Sentiment Knowledge Enhanced Prompt-Tuning for Aspect-Based Sentiment Analysis

no code implementations • 17 Sep 2021 • Chengxi Li, Feiyu Gao, Jiajun Bu, Lu Xu, Xiang Chen, Yu Gu, Zirui Shao, Qi Zheng, Ningyu Zhang, Yongpan Wang, Zhi Yu

We inject sentiment knowledge regarding aspects, opinions, and polarities into prompt and explicitly model term relations via constructing consistency and polarity judgment templates from the ground truth triplets.

Aspect-Based Sentiment Analysis Aspect-Based Sentiment Analysis (ABSA) +3

Paper
Add Code

Cross-modal Image Retrieval with Deep Mutual Information Maximization

no code implementations • 10 Mar 2021 • Chunbin Gu, Jiajun Bu, Xixi Zhou, Chengwei Yao, Dongfang Ma, Zhi Yu, Xifeng Yan

Prior work usually uses a three-stage strategy to tackle this task: 1) extract the features of the inputs; 2) fuse the feature of the source image and its modified text to obtain fusion feature; 3) learn a similarity metric between the desired image and the source image + modified text by using deep metric learning.

Cross-Modal Retrieval Image Retrieval +3

Paper
Add Code

Experiment data-driven modeling of tokamak discharge in EAST

no code implementations • 21 Jul 2020 • Chenguang Wan, Jiangang Li, Zhi Yu, Xiaojuan Liu

By using the data-driven methodology, we exploit the temporal sequence of control signals for a large set of EAST discharges to develop a deep learning model for modeling discharge diagnostic signals, such as electron density $n_{e}$, store energy $W_{mhd}$ and loop voltage $V_{loop}$.

Paper
Add Code

Adaptive-Step Graph Meta-Learner for Few-Shot Graph Classification

no code implementations • 18 Mar 2020 • Ning Ma, Jiajun Bu, Jieyu Yang, Zhen Zhang, Chengwei Yao, Zhi Yu, Sheng Zhou, Xifeng Yan

The shared sub-structures between training classes and test classes are essential in few-shot graph classification.

Few-Shot Learning General Classification +3

Paper
Add Code

Matching Text with Deep Mutual Information Estimation

no code implementations • 9 Mar 2020 • Xixi Zhou, Chengxi Li, Jiajun Bu, Chengwei Yao, Keyue Shi, Zhi Yu, Zhou Yu

Our approach, Text matching with Deep Info Max (TIM), is integrated with a procedure of unsupervised learning of representations by maximizing the mutual information between text matching neural network's input and output.

Answer Selection Mutual Information Estimation +3

Paper
Add Code

Hierarchical Graph Pooling with Structure Learning

3 code implementations • 14 Nov 2019 • Zhen Zhang, Jiajun Bu, Martin Ester, Jianfeng Zhang, Chengwei Yao, Zhi Yu, Can Wang

HGP-SL incorporates graph pooling and structure learning into a unified module to generate hierarchical representations of graphs.

Ranked #1 on Graph Classification on PROTEINS

Graph Classification Representation Learning

12,994

Paper
Code

Lightweight Real-time Makeup Try-on in Mobile Browsers with Tiny CNN Models for Facial Tracking

no code implementations • 5 Jun 2019 • TianXing Li, Zhi Yu, Edmund Phung, Brendan Duke, Irina Kezele, Parham Aarabi

Recent works on convolutional neural networks (CNNs) for facial alignment have demonstrated unprecedented accuracy on a variety of large, publicly available datasets.

Paper
Add Code

Cannot find the paper you are looking for? You can Submit a new open access paper.