Search Results for author: dianhai yu

Found 38 papers, 26 papers with code

A Framework for Cost-Effective and Self-Adaptive LLM Shaking and Recovery Mechanism

no code implementations 12 Mar 2024 Zhiyu Chen, Yu Li, Suochao Zhang, Jingbo Zhou, Jiwen Zhou, Chenfu Bao, dianhai yu

As Large Language Models (LLMs) achieve great success in real-world applications, an increasing number of users are seeking to develop and deploy their customized LLMs through cloud services.

Privacy Preserving

Spectral Heterogeneous Graph Convolutions via Positive Noncommutative Polynomials

2 code implementations 31 May 2023 Mingguo He, Zhewei Wei, Shikun Feng, Zhengjie Huang, Weibin Li, Yu Sun, dianhai yu

These spatial-based HGNNs neglect the utilization of spectral graph convolutions, which are the foundation of Graph Convolutional Networks (GCN) on homogeneous graphs.

Graph Learning Node Classification +1
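
As a rough illustration only (not the paper's construction, which builds positive noncommutative polynomial filters for heterogeneous graphs), the sketch below shows the basic spectral-polynomial view of graph convolution that the snippet refers to, assuming NumPy and a toy homogeneous graph:

```python
import numpy as np

def normalized_adjacency(adj: np.ndarray) -> np.ndarray:
    """Symmetrically normalized adjacency D^-1/2 (A + I) D^-1/2, as used by GCN."""
    a_hat = adj + np.eye(adj.shape[0])
    deg = a_hat.sum(axis=1)
    d_inv_sqrt = np.diag(1.0 / np.sqrt(deg))
    return d_inv_sqrt @ a_hat @ d_inv_sqrt

def polynomial_spectral_filter(adj, x, weights):
    """Apply a polynomial filter sum_k w_k * A_hat^k @ x to node features x.

    This is the generic spectral-polynomial view of graph convolution;
    in practice the weights would be learned.
    """
    a_hat = normalized_adjacency(adj)
    out = np.zeros_like(x)
    propagated = x.copy()
    for w in weights:            # w_0 * x + w_1 * A_hat @ x + w_2 * A_hat^2 @ x + ...
        out += w * propagated
        propagated = a_hat @ propagated
    return out

# Toy usage: a 4-node path graph, 2-dim features, order-2 filter.
adj = np.array([[0, 1, 0, 0],
                [1, 0, 1, 0],
                [0, 1, 0, 1],
                [0, 0, 1, 0]], dtype=float)
x = np.random.rand(4, 2)
y = polynomial_spectral_filter(adj, x, weights=[0.5, 0.3, 0.2])
```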

TA-MoE: Topology-Aware Large Scale Mixture-of-Expert Training

1 code implementation 20 Feb 2023 Chang Chen, Min Li, Zhihua Wu, dianhai yu, Chao Yang

In this paper, we propose TA-MoE, a topology-aware routing strategy for large-scale MoE training from a model-system co-design perspective, which can dynamically adjust the MoE dispatch pattern according to the network topology.
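
As a hedged sketch of the general idea, not TA-MoE's actual routing algorithm, the toy dispatcher below biases top-1 expert selection toward experts hosted on the sender's own node; the function name and the local_bonus knob are illustrative assumptions:

```python
import numpy as np

def topology_biased_dispatch(gate_logits, expert_node, worker_node, local_bonus=0.5):
    """Toy top-1 MoE dispatch that prefers experts on the sender's own node.

    gate_logits: (num_tokens, num_experts) raw gating scores.
    expert_node: (num_experts,) node id hosting each expert.
    worker_node: node id of the worker that owns these tokens.
    local_bonus: additive preference for intra-node experts (hypothetical knob).
    """
    bias = np.where(expert_node == worker_node, local_bonus, 0.0)
    return np.argmax(gate_logits + bias, axis=1)   # chosen expert per token

# Toy usage: 6 tokens, 4 experts spread over 2 nodes, tokens living on node 0.
logits = np.random.randn(6, 4)
assignment = topology_biased_dispatch(logits,
                                      expert_node=np.array([0, 0, 1, 1]),
                                      worker_node=0)
```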

PP-YOLOE-R: An Efficient Anchor-Free Rotated Object Detector

2 code implementations 4 Nov 2022 Xinxin Wang, Guanzhong Wang, Qingqing Dang, Yi Liu, Xiaoguang Hu, dianhai yu

With multi-scale training and testing, PP-YOLOE-R-l and PP-YOLOE-R-x further improve the detection precision to 80.02 and 80.73 mAP.

Object object-detection +3

PP-StructureV2: A Stronger Document Analysis System

1 code implementation 11 Oct 2022 Chenxia Li, Ruoyu Guo, Jun Zhou, Mengtao An, Yuning Du, Lingfeng Zhu, Yi Liu, Xiaoguang Hu, dianhai yu

For the Table Recognition model, we utilize PP-LCNet, CSP-PAN and SLAHead to optimize the backbone module, feature fusion module and decoding module, respectively, which improves the table structure accuracy by 6% with comparable inference speed.

Key Information Extraction Knowledge Distillation +3

ERNIE-mmLayout: Multi-grained MultiModal Transformer for Document Understanding

no code implementations 18 Sep 2022 Wenjin Wang, Zhengjie Huang, Bin Luo, Qianglong Chen, Qiming Peng, Yinxu Pan, Weichong Yin, Shikun Feng, Yu Sun, dianhai yu, Yin Zhang

First, a document graph is proposed to model complex relationships among multi-grained multimodal elements, in which salient visual regions are detected by a cluster-based method.

Common Sense Reasoning document understanding +1
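
A minimal toy sketch of a cluster-based document graph, assuming scikit-learn's KMeans and using box centers as stand-ins for the multimodal element features; the real ERNIE-mmLayout pipeline differs in how salient regions are detected and how edges are typed:

```python
import numpy as np
from sklearn.cluster import KMeans

def build_document_graph(box_centers, n_regions=3):
    """Toy document graph: cluster word/box centers into coarse regions,
    then connect fine-grained elements that fall into the same region.

    box_centers: (num_elements, 2) x/y centers of fine-grained elements.
    Returns (region id per element, list of undirected edges).
    """
    region_id = KMeans(n_clusters=n_regions, n_init=10).fit_predict(box_centers)
    edges = [(i, j)
             for i in range(len(region_id))
             for j in range(i + 1, len(region_id))
             if region_id[i] == region_id[j]]
    return region_id, edges

# Toy usage: 12 fine-grained elements on a page.
centers = np.random.rand(12, 2)
regions, edges = build_document_graph(centers)
```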

Large-scale Knowledge Distillation with Elastic Heterogeneous Computing Resources

1 code implementation 14 Jul 2022 Ji Liu, daxiang dong, Xi Wang, An Qin, Xingjian Li, Patrick Valduriez, Dejing Dou, dianhai yu

Although more layers and more parameters generally improve the accuracy of the models, such big models generally have high computational complexity and require large amounts of memory, which exceeds the capacity of small devices for inference and leads to long training times.

Knowledge Distillation

HelixFold: An Efficient Implementation of AlphaFold2 using PaddlePaddle

1 code implementation 12 Jul 2022 Guoxia Wang, Xiaomin Fang, Zhihua Wu, Yiqun Liu, Yang Xue, Yingfei Xiang, dianhai yu, Fan Wang, Yanjun Ma

Due to the complex model architecture and large memory consumption, it requires lots of computational resources and time to implement the training and inference of AlphaFold2 from scratch.

Protein Structure Prediction

PP-OCRv3: More Attempts for the Improvement of Ultra Lightweight OCR System

1 code implementation 7 Jun 2022 Chenxia Li, Weiwei Liu, Ruoyu Guo, Xiaoting Yin, Kaitao Jiang, Yongkun Du, Yuning Du, Lingfeng Zhu, Baohua Lai, Xiaoguang Hu, dianhai yu, Yanjun Ma

For the text recognizer, the base model is replaced from CRNN to SVTR, and we introduce the lightweight text recognition network SVTR LCNet, guided training of CTC by attention, the data augmentation strategy TextConAug, a better pre-trained model obtained by self-supervised TextRotNet, UDML, and UIM to accelerate the model and improve its performance.

Data Augmentation Optical Character Recognition +2

SE-MoE: A Scalable and Efficient Mixture-of-Experts Distributed Training and Inference System

1 code implementation 20 May 2022 Liang Shen, Zhihua Wu, Weibao Gong, Hongxiang Hao, Yangfan Bai, HuaChao Wu, Xinxuan Wu, Jiang Bian, Haoyi Xiong, dianhai yu, Yanjun Ma

With the increasing diversity of ML infrastructures nowadays, distributed training over heterogeneous computing systems is desired to facilitate the production of big models.

Distributed Computing

Nebula-I: A General Framework for Collaboratively Training Deep Learning Models on Low-Bandwidth Cloud Clusters

1 code implementation 19 May 2022 Yang Xiang, Zhihua Wu, Weibao Gong, Siyu Ding, Xianjie Mo, Yuang Liu, Shuohuan Wang, Peng Liu, Yongshuai Hou, Long Li, Bin Wang, Shaohuai Shi, Yaqian Han, Yue Yu, Ge Li, Yu Sun, Yanjun Ma, dianhai yu

We took natural language processing (NLP) as an example to show how Nebula-I works in different training phases that include: a) pre-training a multilingual language model using two remote clusters; and b) fine-tuning a machine translation model using knowledge distilled from pre-trained models, which run through the most popular paradigm of recent deep learning.

Cross-Lingual Natural Language Inference Distributed Computing +2

Simple and Effective Relation-based Embedding Propagation for Knowledge Representation Learning

1 code implementation 13 May 2022 Huijuan Wang, Siming Dai, Weiyue Su, Hui Zhong, Zeyang Fang, Zhengjie Huang, Shikun Feng, Zeyu Chen, Yu Sun, dianhai yu

Notably, it brings an average relative improvement of about 10% to triplet-based embedding methods on OGBL-WikiKG2 and takes only 5%-83% of the time to achieve results comparable to the state-of-the-art GC-OTE.

Knowledge Graphs Relation +1
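
As a hedged illustration of relation-aware propagation, not the paper's exact REP operator, the sketch below performs one propagation step in which each head entity aggregates messages from its tails through a translational relation model; the alpha mixing weight is an assumption:

```python
import numpy as np

def propagate_entity_embeddings(entity_emb, relation_emb, triples, alpha=0.5):
    """One toy propagation step over a knowledge graph.

    Each head entity receives a message from its tails through the relation,
    modelled translationally here as (tail - relation); the paper's propagation
    operator differs, this only illustrates relation-aware propagation.
    """
    messages = np.zeros_like(entity_emb)
    counts = np.zeros(entity_emb.shape[0])
    for h, r, t in triples:
        messages[h] += entity_emb[t] - relation_emb[r]
        counts[h] += 1
    counts = np.maximum(counts, 1)                      # avoid division by zero
    return (1 - alpha) * entity_emb + alpha * messages / counts[:, None]

# Toy usage: 4 entities, 2 relations, embedding dim 8.
ent = np.random.randn(4, 8)
rel = np.random.randn(2, 8)
triples = [(0, 0, 1), (0, 1, 2), (3, 0, 2)]
ent_next = propagate_entity_embeddings(ent, rel, triples)
```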

End-to-end Adaptive Distributed Training on PaddlePaddle

1 code implementation 6 Dec 2021 Yulong Ao, Zhihua Wu, dianhai yu, Weibao Gong, Zhiqing Kui, Minxu Zhang, Zilingfeng Ye, Liang Shen, Yanjun Ma, Tian Wu, Haifeng Wang, Wei Zeng, Chao Yang

The experiments demonstrate that our framework can satisfy various requirements from the diversity of applications and the heterogeneity of resources with highly competitive performance.

Language Modelling Recommendation Systems +1

Exploiting Cross-Modal Prediction and Relation Consistency for Semi-Supervised Image Captioning

no code implementations 22 Oct 2021 Yang Yang, Hongchen Wei, HengShu Zhu, dianhai yu, Hui Xiong, Jian Yang

In detail, considering that the heterogeneous gap between modalities makes it difficult to use the global embedding directly for supervision, CPRC instead transforms both the raw image and the corresponding generated sentence into the shared semantic space and measures the generated sentence from two aspects: 1) Prediction consistency.

Image Captioning Informativeness +2
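
A minimal sketch of the prediction-consistency idea, assuming PyTorch; the projection heads, feature dimensions, and the use of a KL divergence are illustrative assumptions rather than CPRC's actual implementation:

```python
import torch
import torch.nn.functional as F

# Hypothetical dimensions; these heads stand in for whatever encoders map
# images and generated sentences into the shared semantic space.
img_proj = torch.nn.Linear(2048, 256)      # image feature -> shared space
txt_proj = torch.nn.Linear(768, 256)       # sentence feature -> shared space
classifier = torch.nn.Linear(256, 10)      # shared space -> label distribution

def prediction_consistency_loss(img_feat, sent_feat):
    """KL divergence between predictions made from the image and from the
    generated sentence, both mapped into the shared semantic space."""
    p_img = F.log_softmax(classifier(img_proj(img_feat)), dim=-1)
    p_txt = F.softmax(classifier(txt_proj(sent_feat)), dim=-1)
    return F.kl_div(p_img, p_txt, reduction="batchmean")

# Toy usage with random features for a batch of 4 image-sentence pairs.
loss = prediction_consistency_loss(torch.randn(4, 2048), torch.randn(4, 768))
```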

PP-LCNet: A Lightweight CPU Convolutional Neural Network

8 code implementations 17 Sep 2021 Cheng Cui, Tingquan Gao, Shengyu Wei, Yuning Du, Ruoyu Guo, Shuilong Dong, Bin Lu, Ying Zhou, Xueying Lv, Qiwen Liu, Xiaoguang Hu, dianhai yu, Yanjun Ma

We propose a lightweight CPU network based on the MKLDNN acceleration strategy, named PP-LCNet, which improves the performance of lightweight models on multiple tasks.

Image Classification object-detection +2
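
As context rather than the actual PP-LCNet code, the block below is the generic depthwise-separable building block that lightweight CPU networks of this kind are assembled from, assuming PyTorch; the activation choice and shapes are assumptions, and the paper adds further tricks (e.g. larger kernels and SE modules) on top:

```python
import torch
import torch.nn as nn

class DepthwiseSeparableBlock(nn.Module):
    """Depthwise conv followed by a pointwise conv, the standard building
    block of lightweight CPU-oriented networks."""
    def __init__(self, in_ch, out_ch, stride=1):
        super().__init__()
        self.depthwise = nn.Conv2d(in_ch, in_ch, kernel_size=3, stride=stride,
                                   padding=1, groups=in_ch, bias=False)
        self.pointwise = nn.Conv2d(in_ch, out_ch, kernel_size=1, bias=False)
        self.bn = nn.BatchNorm2d(out_ch)
        self.act = nn.Hardswish()          # h-swish style activation, an assumption here

    def forward(self, x):
        return self.act(self.bn(self.pointwise(self.depthwise(x))))

# Toy usage: halve the spatial resolution while widening the channels.
block = DepthwiseSeparableBlock(32, 64, stride=2)
out = block(torch.randn(1, 32, 56, 56))    # -> (1, 64, 28, 28)
```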

PP-YOLOv2: A Practical Object Detector

1 code implementation 21 Apr 2021 Xin Huang, Xinxin Wang, Wenyu Lv, Xiaying Bai, Xiang Long, Kaipeng Deng, Qingqing Dang, Shumin Han, Qiwen Liu, Xiaoguang Hu, dianhai yu, Yanjun Ma, Osamu Yoshie

To meet these two concerns, we comprehensively evaluate a collection of existing refinements to improve the performance of PP-YOLO while keeping the inference time almost unchanged.

Object Real-Time Object Detection

Distilling Knowledge from Pre-trained Language Models via Text Smoothing

no code implementations 8 May 2020 Xing Wu, Yibing Liu, Xiangyang Zhou, dianhai yu

As an alternative, we propose a new method for BERT distillation, i.e., asking the teacher to generate smoothed word ids, rather than labels, for teaching the student model in knowledge distillation.

Knowledge Distillation Language Modelling
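
A minimal sketch of the text-smoothing idea, assuming PyTorch: the teacher's MLM logits are turned into a per-token distribution over the vocabulary ("smoothed word ids") and the student is trained against those soft targets; the temperature parameter and exact loss form are assumptions:

```python
import torch
import torch.nn.functional as F

def text_smoothing_targets(teacher_logits, temperature=1.0):
    """Turn teacher MLM logits into smoothed word ids: a probability
    distribution over the vocabulary for every token position, used in
    place of the single hard word id."""
    return F.softmax(teacher_logits / temperature, dim=-1)

def soft_target_loss(student_logits, smoothed_targets):
    """Cross-entropy of the student's predictions against the smoothed targets."""
    log_probs = F.log_softmax(student_logits, dim=-1)
    return -(smoothed_targets * log_probs).sum(dim=-1).mean()

# Toy usage: batch of 2 sentences, 5 tokens each, vocabulary of 100 words.
teacher_logits = torch.randn(2, 5, 100)    # stand-in for BERT MLM outputs
student_logits = torch.randn(2, 5, 100, requires_grad=True)
loss = soft_target_loss(student_logits, text_smoothing_targets(teacher_logits))
loss.backward()
```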

RLTM: An Efficient Neural IR Framework for Long Documents

no code implementations 22 Jun 2019 Chen Zheng, Yu Sun, Shengxian Wan, dianhai yu

This paper proposes a novel End-to-End neural ranking framework called Reinforced Long Text Matching (RLTM) which matches a query with long documents efficiently and effectively.

Information Retrieval Retrieval +2
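
As a very rough, non-RL sketch of coarse-to-fine matching over a long document, assuming NumPy: a cheap scorer selects candidate sentences and only the selection is scored more carefully; RLTM itself learns the selection with reinforcement learning, which is omitted here, and the scoring functions below are placeholders:

```python
import numpy as np

def cheap_sentence_scores(query_vec, sentence_vecs):
    """Fast, coarse relevance scores (dot products) used to pick candidate
    sentences from a long document."""
    return sentence_vecs @ query_vec

def match_long_document(query_vec, sentence_vecs, k=3):
    """Select the top-k sentences with the cheap scorer, then compute a finer
    relevance score (here simply averaged cosine similarity) on the selection."""
    coarse = cheap_sentence_scores(query_vec, sentence_vecs)
    selected = np.argsort(coarse)[-k:]
    fine = np.mean([np.dot(query_vec, sentence_vecs[i]) /
                    (np.linalg.norm(query_vec) * np.linalg.norm(sentence_vecs[i]))
                    for i in selected])
    return selected, fine

# Toy usage: a 20-sentence document with 16-dim sentence vectors.
q = np.random.rand(16)
doc = np.random.rand(20, 16)
picked, score = match_long_document(q, doc)
```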

A New Method of Region Embedding for Text Classification

1 code implementation ICLR 2018 chao qiao, Bo Huang, guocheng niu, daren li, daxiang dong, wei he, dianhai yu, Hua Wu

In this paper, we propose a new method of learning and utilizing task-specific distributed representations of n-grams, referred to as “region embeddings”.

General Classification text-classification +1
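
A hedged toy sketch of a word-context region embedding, assuming NumPy: each word in a fixed-size region is reweighted by the center word's local context unit and the results are max-pooled into one region vector; the indexing and pooling details are assumptions, not the paper's exact recipe:

```python
import numpy as np

def region_embedding(word_embs, context_units, center, radius=1):
    """Toy region embedding for the window of words around `center`.

    Each word in the window is weighted (element-wise) by the center word's
    local context unit for that relative offset, and the weighted vectors
    are max-pooled into a single fixed-size region vector.
    """
    projected = []
    for offset in range(-radius, radius + 1):
        pos = center + offset
        if 0 <= pos < word_embs.shape[0]:
            unit = context_units[center, offset + radius]   # (dim,) weights
            projected.append(unit * word_embs[pos])
    return np.max(np.stack(projected), axis=0)

# Toy usage: a 6-word sentence, 8-dim embeddings, window radius 1.
vocab_embs = np.random.rand(6, 8)
units = np.random.rand(6, 3, 8)            # per-word local context units
region_vec = region_embedding(vocab_embs, units, center=2)
```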
