Search Results for author: Zhi Yu

Found 16 papers, 5 papers with code

LayoutLLM: Layout Instruction Tuning with Large Language Models for Document Understanding

3 code implementations8 Apr 2024 Chuwei Luo, Yufan Shen, Zhaoqing Zhu, Qi Zheng, Zhi Yu, Cong Yao

The core of LayoutLLM is a layout instruction tuning strategy, which is specially designed to enhance the comprehension and utilization of document layouts.

document understanding

Less is More: A Closer Look at Semantic-based Few-Shot Learning

no code implementations10 Jan 2024 Chunpeng Zhou, Haishuai Wang, Xilu Yuan, Zhi Yu, Jiajun Bu

To address this, we propose a simple but effective framework for few-shot learning tasks, specifically designed to exploit the textual information and language model.

Few-Shot Learning Language Modelling

LORE++: Logical Location Regression Network for Table Structure Recognition with Pre-training

no code implementations3 Jan 2024 Rujiao Long, Hangdi Xing, Zhibo Yang, Qi Zheng, Zhi Yu, Cong Yao, Fei Huang

We model TSR as a logical location regression problem and propose a new TSR framework called LORE, standing for LOgical location REgression network, which for the first time regresses logical location as well as spatial location of table cells in a unified network.

regression

Multi-View Fusion and Distillation for Subgrade Distresses Detection based on 3D-GPR

1 code implementation9 Aug 2023 Chunpeng Zhou, Kangjie Ning, Haishuai Wang, Zhi Yu, Sheng Zhou, Jiajun Bu

To address these challenges, we introduce a novel methodology for the subgrade distress detection task by leveraging the multi-view information from 3D-GPR data.

GPR Knowledge Distillation +1

Translate the Beauty in Songs: Jointly Learning to Align Melody and Translate Lyrics

no code implementations28 Mar 2023 Chengxi Li, Kai Fan, Jiajun Bu, Boxing Chen, Zhongqiang Huang, Zhi Yu

Song translation requires both translation of lyrics and alignment of music notes so that the resulting verse can be sung to the accompanying melody, which is a challenging problem that has attracted some interests in different aspects of the translation process.

Translation

LORE: Logical Location Regression Network for Table Structure Recognition

1 code implementation7 Mar 2023 Hangdi Xing, Feiyu Gao, Rujiao Long, Jiajun Bu, Qi Zheng, Liangcheng Li, Cong Yao, Zhi Yu

Table structure recognition (TSR) aims at extracting tables in images into machine-understandable formats.

regression Table Recognition

A machine-learning-based tool for last closed-flux surface reconstruction on tokamaks

no code implementations12 Jul 2022 Chenguang Wan, Zhi Yu, Alessandro Pau, Xiaojuan Liu, Jiangang Li

Tokamaks allow to confine fusion plasma with magnetic fields and one of the main challenges in the control of the magnetic configuration is the prediction/reconstruction of the Last Closed-Flux Surface (LCFS).

Surface Reconstruction

SentiPrompt: Sentiment Knowledge Enhanced Prompt-Tuning for Aspect-Based Sentiment Analysis

no code implementations17 Sep 2021 Chengxi Li, Feiyu Gao, Jiajun Bu, Lu Xu, Xiang Chen, Yu Gu, Zirui Shao, Qi Zheng, Ningyu Zhang, Yongpan Wang, Zhi Yu

We inject sentiment knowledge regarding aspects, opinions, and polarities into prompt and explicitly model term relations via constructing consistency and polarity judgment templates from the ground truth triplets.

Aspect-Based Sentiment Analysis Aspect-Based Sentiment Analysis (ABSA) +3

Cross-modal Image Retrieval with Deep Mutual Information Maximization

no code implementations10 Mar 2021 Chunbin Gu, Jiajun Bu, Xixi Zhou, Chengwei Yao, Dongfang Ma, Zhi Yu, Xifeng Yan

Prior work usually uses a three-stage strategy to tackle this task: 1) extract the features of the inputs; 2) fuse the feature of the source image and its modified text to obtain fusion feature; 3) learn a similarity metric between the desired image and the source image + modified text by using deep metric learning.

Cross-Modal Retrieval Image Retrieval +3

Experiment data-driven modeling of tokamak discharge in EAST

no code implementations21 Jul 2020 Chenguang Wan, Jiangang Li, Zhi Yu, Xiaojuan Liu

By using the data-driven methodology, we exploit the temporal sequence of control signals for a large set of EAST discharges to develop a deep learning model for modeling discharge diagnostic signals, such as electron density $n_{e}$, store energy $W_{mhd}$ and loop voltage $V_{loop}$.

Matching Text with Deep Mutual Information Estimation

no code implementations9 Mar 2020 Xixi Zhou, Chengxi Li, Jiajun Bu, Chengwei Yao, Keyue Shi, Zhi Yu, Zhou Yu

Our approach, Text matching with Deep Info Max (TIM), is integrated with a procedure of unsupervised learning of representations by maximizing the mutual information between text matching neural network's input and output.

Answer Selection Mutual Information Estimation +3

Hierarchical Graph Pooling with Structure Learning

3 code implementations14 Nov 2019 Zhen Zhang, Jiajun Bu, Martin Ester, Jianfeng Zhang, Chengwei Yao, Zhi Yu, Can Wang

HGP-SL incorporates graph pooling and structure learning into a unified module to generate hierarchical representations of graphs.

Graph Classification Representation Learning

Lightweight Real-time Makeup Try-on in Mobile Browsers with Tiny CNN Models for Facial Tracking

no code implementations5 Jun 2019 TianXing Li, Zhi Yu, Edmund Phung, Brendan Duke, Irina Kezele, Parham Aarabi

Recent works on convolutional neural networks (CNNs) for facial alignment have demonstrated unprecedented accuracy on a variety of large, publicly available datasets.

Cannot find the paper you are looking for? You can Submit a new open access paper.