Search Results for author: Pengfei Hu

Found 31 papers, 11 papers with code

Skeleton and Font Generation Network for Zero-shot Chinese Character Generation

no code implementations14 Jan 2025 Mobai Xue, Jun Du, Zhenrong Zhang, Jiefeng Ma, Qikai Chang, Pengfei Hu, Jianshu Zhang, Yu Hu

We used generated misspelled characters as data augmentation in Chinese character error correction tasks, simulating the scenario where students learn handwritten Chinese characters with the help of misspelled characters.

Data Augmentation Font Generation

Joint Knowledge Editing for Information Enrichment and Probability Promotion

1 code implementation22 Dec 2024 Wenhang Shi, Yiren Chen, Shuqing Bian, Xinyi Zhang, Zhe Zhao, Pengfei Hu, Wei Lu, Xiaoyong Du

Knowledge stored in large language models requires timely updates to reflect the dynamic nature of real-world information.

counterfactual knowledge editing

RFL: Simplifying Chemical Structure Recognition with Ring-Free Language

1 code implementation10 Dec 2024 Qikai Chang, Mingjun Chen, Changpeng Pi, Pengfei Hu, Zhenrong Zhang, Jiefeng Ma, Jun Du, BaoCai Yin, Jinshui Hu

The primary objective of Optical Chemical Structure Recognition is to identify chemical structure images into corresponding markup sequences.

Decoder

MPLite: Multi-Aspect Pretraining for Mining Clinical Health Records

no code implementations17 Nov 2024 Eric Yang, Pengfei Hu, Xiaoxue Han, Yue Ning

The adoption of digital systems in healthcare has resulted in the accumulation of vast electronic health records (EHRs), offering valuable data for machine learning methods to predict patient health outcomes.

Prediction

LLM-Neo: Parameter Efficient Knowledge Distillation for Large Language Models

no code implementations11 Nov 2024 Runming Yang, Taiqiang Wu, Jiahao Wang, Pengfei Hu, Ngai Wong, Yujiu Yang

Inspired by this observation, we explore the strategy that combines LoRA and KD to enhance the efficiency of knowledge transfer.

Knowledge Distillation Language Modeling +3

DualMAR: Medical-Augmented Representation from Dual-Expertise Perspectives

no code implementations25 Oct 2024 Pengfei Hu, Chang Lu, Fei Wang, Yue Ning

Electronic Health Records (EHR) has revolutionized healthcare data management and prediction in the field of AI and machine learning.

Prediction

DAWN: Dynamic Frame Avatar with Non-autoregressive Diffusion Framework for Talking Head Video Generation

1 code implementation17 Oct 2024 Hanbo Cheng, Limin Lin, Chenyu Liu, Pengcheng Xia, Pengfei Hu, Jiefeng Ma, Jun Du, Jia Pan

To address these challenges, we present DAWN (Dynamic frame Avatar With Non-autoregressive diffusion), a framework that enables all-at-once generation of dynamic-length video sequences.

Talking Head Generation Video Generation

t-READi: Transformer-Powered Robust and Efficient Multimodal Inference for Autonomous Driving

no code implementations13 Oct 2024 Pengfei Hu, Yuhang Qian, Tianyue Zheng, Ang Li, Zhe Chen, Yue Gao, Xiuzhen Cheng, Jun Luo

Given the wide adoption of multimodal sensors (e. g., camera, lidar, radar) by autonomous vehicles (AVs), deep analytics to fuse their outputs for a robust perception become imperative.

Autonomous Driving Contrastive Learning

See then Tell: Enhancing Key Information Extraction with Vision Grounding

no code implementations29 Sep 2024 Shuhang Liu, Zhenrong Zhang, Pengfei Hu, Jiefeng Ma, Jun Du, Qing Wang, Jianshu Zhang, Chenyu Liu

Positioned at the outset of the answer text, the <see> token allows the model to first see--observing the regions of the image related to the input question--and then tell--providing articulated textual responses.

Image to text Key Information Extraction +4

SRFUND: A Multi-Granularity Hierarchical Structure Reconstruction Benchmark in Form Understanding

no code implementations13 Jun 2024 Jiefeng Ma, Yan Wang, Chenyu Liu, Jun Du, Yu Hu, Zhenrong Zhang, Pengfei Hu, Qing Wang, Jianshu Zhang

Accurately identifying and organizing textual content is crucial for the automation of document processing in the field of form understanding.

Relation Prediction

SEMv3: A Fast and Robust Approach to Table Separation Line Detection

1 code implementation20 May 2024 Chunxia Qin, Zhenrong Zhang, Pengfei Hu, Chenyu Liu, Jiefeng Ma, Jun Du

The `"split-and-merge" paradigm is a pivotal approach to parse table structure, where the table separation line detection is crucial.

Line Detection

A Poisson-Gamma Dynamic Factor Model with Time-Varying Transition Dynamics

no code implementations26 Feb 2024 Jiahao Wang, Sikun Yang, Heinz Koeppl, Xiuzhen Cheng, Pengfei Hu, Guoming Zhang

Probabilistic approaches for handling count-valued time sequences have attracted amounts of research attentions because their ability to infer explainable latent structures and to estimate uncertainties, and thus are especially suitable for dealing with \emph{noisy} and \emph{incomplete} count data.

Data Augmentation Time Series

Incentivizing Massive Unknown Workers for Budget-Limited Crowdsensing: From Off-Line and On-Line Perspectives

no code implementations21 Sep 2023 Feng Li, Yuqi Chai, Huan Yang, Pengfei Hu, Lingjie Duan

How to incentivize strategic workers using limited budget is a very fundamental problem for crowdsensing systems; nevertheless, since the sensing abilities of the workers may not always be known as prior knowledge due to the diversities of their sensor devices and behaviors, it is difficult to properly select and pay the unknown workers.

Speech2Lip: High-fidelity Speech to Lip Generation by Learning from a Short Video

1 code implementation ICCV 2023 Xiuzhe Wu, Pengfei Hu, Yang Wu, Xiaoyang Lyu, Yan-Pei Cao, Ying Shan, Wenming Yang, Zhongqian Sun, Xiaojuan Qi

Therefore, directly learning a mapping function from speech to the entire head image is prone to ambiguity, particularly when using a short video for training.

Image Generation

Count, Decode and Fetch: A New Approach to Handwritten Chinese Character Error Correction

no code implementations30 Jul 2023 Pengfei Hu, Jiefeng Ma, Zhenrong Zhang, Jun Du, Jianshu Zhang

This poses a challenge when dealing with an unseen misspelled character, as the decoder may generate an IDS sequence that matches a seen character instead.

Decoder Transfer Learning

HRDoc: Dataset and Baseline Method Toward Hierarchical Reconstruction of Document Structures

1 code implementation24 Mar 2023 Jiefeng Ma, Jun Du, Pengfei Hu, Zhenrong Zhang, Jianshu Zhang, Huihui Zhu, Cong Liu

Moreover, we proposed an encoder-decoder-based hierarchical document structure parsing system (DSPS) to tackle this problem.

Decoder

Multimodal Tree Decoder for Table of Contents Extraction in Document Images

1 code implementation6 Dec 2022 Pengfei Hu, Zhenrong Zhang, Jianshu Zhang, Jun Du, Jiajia Wu

Next, to parse the hierarchical relationship between the heading entities, a tree-structured decoder is designed.

Decoder document understanding +2

Federated Learning Hyper-Parameter Tuning from a System Perspective

1 code implementation24 Nov 2022 Huanle Zhang, Lei Fu, Mi Zhang, Pengfei Hu, Xiuzhen Cheng, Prasant Mohapatra, Xin Liu

In this paper, we propose FedTune, an automatic FL hyper-parameter tuning algorithm tailored to applications' diverse system requirements in FL training.

Federated Learning

Learning Audio-Visual embedding for Person Verification in the Wild

no code implementations9 Sep 2022 Peiwen Sun, Shanshan Zhang, Zishan Liu, Yougen Yuan, Taotao Zhang, Honggang Zhang, Pengfei Hu

It has already been observed that audio-visual embedding is more robust than uni-modality embedding for person verification.

Face Verification

High Speed Rotation Estimation with Dynamic Vision Sensors

no code implementations6 Sep 2022 Guangrong Zhao, Yiran Shen, Ning Chen, Pengfei Hu, Lei Liu, Hongkai Wen

By designing a series of signal processing algorithms bespoke for dynamic vision sensing on mobile devices, EV-Tach is able to extract the rotational speed accurately from the event stream produced by dynamic vision sensing on rotary targets.

Vocal Bursts Intensity Prediction

Defensive Patches for Robust Recognition in the Physical World

1 code implementation CVPR 2022 Jiakai Wang, Zixin Yin, Pengfei Hu, Aishan Liu, Renshuai Tao, Haotong Qin, Xianglong Liu, DaCheng Tao

For the generalization against diverse noises, we inject class-specific identifiable patterns into a confined local patch prior, so that defensive patches could preserve more recognizable features towards specific classes, leading models for better recognition under noises.

Membership Inference Attacks Against Recommender Systems

1 code implementation16 Sep 2021 Minxing Zhang, Zhaochun Ren, Zihan Wang, Pengjie Ren, Zhumin Chen, Pengfei Hu, Yang Zhang

In this paper, we make the first attempt on quantifying the privacy leakage of recommender systems through the lens of membership inference.

Recommendation Systems

Dual Synchronous Generator: Inertial Current Source based Grid-Forming Solution for VSC

no code implementations5 Jul 2021 Huanhai Xin, Kehao Zhuang, Pengfei Hu, Yunjie Gu, Ping Ju

Based on dual synchronous idea, a dual synchronous generator (DSG) control is applied in VSC to form inertial current source.

Shielding Collaborative Learning: Mitigating Poisoning Attacks through Client-Side Detection

no code implementations29 Oct 2019 Lingchen Zhao, Shengshan Hu, Qian Wang, Jianlin Jiang, Chao Shen, Xiangyang Luo, Pengfei Hu

Collaborative learning allows multiple clients to train a joint model without sharing their data with each other.

Cannot find the paper you are looking for? You can Submit a new open access paper.