Search Results for author: Yanjun Ma

Found 21 papers, 17 papers with code

PP-OCRv2: Bag of Tricks for Ultra Lightweight OCR System

3 code implementations • 7 Sep 2021 • Yuning Du, Chenxia Li, Ruoyu Guo, Cheng Cui, Weiwei Liu, Jun Zhou, Bin Lu, Yehua Yang, Qiwen Liu, Xiaoguang Hu, dianhai yu, Yanjun Ma

Optical Character Recognition (OCR) systems have been widely used in various of application scenarios.

Optical Character Recognition Optical Character Recognition (OCR)

38,418

Paper
Code

PP-LCNet: A Lightweight CPU Convolutional Neural Network

8 code implementations • 17 Sep 2021 • Cheng Cui, Tingquan Gao, Shengyu Wei, Yuning Du, Ruoyu Guo, Shuilong Dong, Bin Lu, Ying Zhou, Xueying Lv, Qiwen Liu, Xiaoguang Hu, dianhai yu, Yanjun Ma

We propose a lightweight CPU network based on the MKLDNN acceleration strategy, named PP-LCNet, which improves the performance of lightweight models on multiple tasks.

Image Classification object-detection +2

38,418

Paper
Code

PP-OCRv3: More Attempts for the Improvement of Ultra Lightweight OCR System

1 code implementation • 7 Jun 2022 • Chenxia Li, Weiwei Liu, Ruoyu Guo, Xiaoting Yin, Kaitao Jiang, Yongkun Du, Yuning Du, Lingfeng Zhu, Baohua Lai, Xiaoguang Hu, dianhai yu, Yanjun Ma

For text recognizer, the base model is replaced from CRNN to SVTR, and we introduce lightweight text recognition network SVTR LCNet, guided training of CTC by attention, data augmentation strategy TextConAug, better pre-trained model by self-supervised TextRotNet, UDML, and UIM to accelerate the model and improve the effect.

Data Augmentation Optical Character Recognition +2

38,418

Paper
Code

HeterPS: Distributed Deep Learning With Reinforcement Learning Based Scheduling in Heterogeneous Environments

1 code implementation • 20 Nov 2021 • Ji Liu, Zhihua Wu, dianhai yu, Yanjun Ma, Danlei Feng, Minxu Zhang, Xinxuan Wu, Xuefeng Yao, Dejing Dou

The training process generally exploits distributed computing resources to reduce training time.

Distributed Computing reinforcement-learning +2

21,607

Paper
Code

End-to-end Adaptive Distributed Training on PaddlePaddle

1 code implementation • 6 Dec 2021 • Yulong Ao, Zhihua Wu, dianhai yu, Weibao Gong, Zhiqing Kui, Minxu Zhang, Zilingfeng Ye, Liang Shen, Yanjun Ma, Tian Wu, Haifeng Wang, Wei Zeng, Chao Yang

The experiments demonstrate that our framework can satisfy various requirements from the diversity of applications and the heterogeneity of resources with highly competitive performance.

Language Modelling Recommendation Systems +1

21,607

Paper
Code

Nebula-I: A General Framework for Collaboratively Training Deep Learning Models on Low-Bandwidth Cloud Clusters

1 code implementation • 19 May 2022 • Yang Xiang, Zhihua Wu, Weibao Gong, Siyu Ding, Xianjie Mo, Yuang Liu, Shuohuan Wang, Peng Liu, Yongshuai Hou, Long Li, Bin Wang, Shaohuai Shi, Yaqian Han, Yue Yu, Ge Li, Yu Sun, Yanjun Ma, dianhai yu

We took natural language processing (NLP) as an example to show how Nebula-I works in different training phases that include: a) pre-training a multilingual language model using two remote clusters; and b) fine-tuning a machine translation model using knowledge distilled from pre-trained models, which run through the most popular paradigm of recent deep learning.

Cross-Lingual Natural Language Inference Distributed Computing +2

21,607

Paper
Code

PP-YOLOv2: A Practical Object Detector

1 code implementation • 21 Apr 2021 • Xin Huang, Xinxin Wang, Wenyu Lv, Xiaying Bai, Xiang Long, Kaipeng Deng, Qingqing Dang, Shumin Han, Qiwen Liu, Xiaoguang Hu, dianhai yu, Yanjun Ma, Osamu Yoshie

To meet these two concerns, we comprehensively evaluate a collection of existing refinements to improve the performance of PP-YOLO while almost keep the infer time unchanged.

Object Real-Time Object Detection

12,048

Paper
Code

PAFNet: An Efficient Anchor-Free Object Detector Guidance

1 code implementation • 28 Apr 2021 • Ying Xin, Guanzhong Wang, Mingyuan Mao, Yuan Feng, Qingqing Dang, Yanjun Ma, Errui Ding, Shumin Han

Therefore, a trade-off between effectiveness and efficiency is necessary in practical scenarios.

Ranked #1 on Object Detection on COCO test-dev (Hardware Burden metric)

Object object-detection +1

12,048

Paper
Code

PP-PicoDet: A Better Real-Time Object Detector on Mobile Devices

4 code implementations • 1 Nov 2021 • Guanghua Yu, Qinyao Chang, Wenyu Lv, Chang Xu, Cheng Cui, Wei Ji, Qingqing Dang, Kaipeng Deng, Guanzhong Wang, Yuning Du, Baohua Lai, Qiwen Liu, Xiaoguang Hu, dianhai yu, Yanjun Ma

We investigate the applicability of the anchor-free strategy on lightweight object detection models.

Ranked #1 on Object Detection on MSCOCO

Object object-detection +1

12,048

Paper
Code

ERNIE 3.0 Titan: Exploring Larger-scale Knowledge Enhanced Pre-training for Language Understanding and Generation

3 code implementations • 23 Dec 2021 • Shuohuan Wang, Yu Sun, Yang Xiang, Zhihua Wu, Siyu Ding, Weibao Gong, Shikun Feng, Junyuan Shang, Yanbin Zhao, Chao Pang, Jiaxiang Liu, Xuyi Chen, Yuxiang Lu, Weixin Liu, Xi Wang, Yangfan Bai, Qiuliang Chen, Li Zhao, Shiyong Li, Peng Sun, dianhai yu, Yanjun Ma, Hao Tian, Hua Wu, Tian Wu, Wei Zeng, Ge Li, Wen Gao, Haifeng Wang

A unified framework named ERNIE 3. 0 was recently proposed for pre-training large-scale knowledge enhanced models and trained a model with 10 billion parameters.

Language Modelling

11,411

Paper
Code

PaddleSpeech: An Easy-to-Use All-in-One Speech Toolkit

2 code implementations • NAACL (ACL) 2022 • HUI ZHANG, Tian Yuan, Junkun Chen, Xintong Li, Renjie Zheng, Yuxin Huang, Xiaojie Chen, Enlei Gong, Zeyu Chen, Xiaoguang Hu, dianhai yu, Yanjun Ma, Liang Huang

PaddleSpeech is an open-source all-in-one speech toolkit.

Automatic Speech Recognition (ASR) Environmental Sound Classification +9

10,131

Paper
Code

PP-LiteSeg: A Superior Real-Time Semantic Segmentation Model

3 code implementations • 6 Apr 2022 • Juncai Peng, Yi Liu, Shiyu Tang, Yuying Hao, Lutao Chu, Guowei Chen, Zewu Wu, Zeyu Chen, Zhiliang Yu, Yuning Du, Qingqing Dang, Baohua Lai, Qiwen Liu, Xiaoguang Hu, dianhai yu, Yanjun Ma

Real-world applications have high demands for semantic segmentation methods.

Ranked #4 on Real-Time Semantic Segmentation on Cityscapes val

Real-Time Semantic Segmentation Segmentation

8,248

Paper
Code

Beyond Self-Supervision: A Simple Yet Effective Network Distillation Alternative to Improve Backbones

2 code implementations • 10 Mar 2021 • Cheng Cui, Ruoyu Guo, Yuning Du, Dongliang He, Fu Li, Zewu Wu, Qiwen Liu, Shilei Wen, Jizhou Huang, Xiaoguang Hu, dianhai yu, Errui Ding, Yanjun Ma

Recently, research efforts have been concentrated on revealing how pre-trained model makes a difference in neural network performance.

Knowledge Distillation object-detection +3

5,253

Paper
Code

PP-ShiTu: A Practical Lightweight Image Recognition System

2 code implementations • 1 Nov 2021 • Shengyu Wei, Ruoyu Guo, Cheng Cui, Bin Lu, Shuilong Dong, Tingquan Gao, Yuning Du, Ying Zhou, Xueying Lyu, Qiwen Liu, Xiaoguang Hu, dianhai yu, Yanjun Ma

In recent years, image recognition applications have developed rapidly.

Face Recognition Knowledge Distillation +4

5,253

Paper
Code

HelixFold: An Efficient Implementation of AlphaFold2 using PaddlePaddle

1 code implementation • 12 Jul 2022 • Guoxia Wang, Xiaomin Fang, Zhihua Wu, Yiqun Liu, Yang Xue, Yingfei Xiang, dianhai yu, Fan Wang, Yanjun Ma

Due to the complex model architecture and large memory consumption, it requires lots of computational resources and time to implement the training and inference of AlphaFold2 from scratch.

Protein Structure Prediction

787

Paper
Code

SE-MoE: A Scalable and Efficient Mixture-of-Experts Distributed Training and Inference System

1 code implementation • 20 May 2022 • Liang Shen, Zhihua Wu, Weibao Gong, Hongxiang Hao, Yangfan Bai, HuaChao Wu, Xinxuan Wu, Jiang Bian, Haoyi Xiong, dianhai yu, Yanjun Ma

With the increasing diversity of ML infrastructures nowadays, distributed training over heterogeneous computing systems is desired to facilitate the production of big models.

Distributed Computing

421

Paper
Code

SDWPF: A Dataset for Spatial Dynamic Wind Power Forecasting Challenge at KDD Cup 2022

1 code implementation • 8 Aug 2022 • Jingbo Zhou, Xinjiang Lu, Yixiong Xiao, Jiantao Su, Junfu Lyu, Yanjun Ma, Dejing Dou

Thus, Wind Power Forecasting (WPF) has been widely recognized as one of the most critical issues in wind power integration and operation.

Paper
Code

Answer-focused and Position-aware Neural Question Generation

no code implementations • EMNLP 2018 • Xingwu Sun, Jing Liu, Yajuan Lyu, wei he, Yanjun Ma, Shi Wang

(2) The model copies the context words that are far from and irrelevant to the answer, instead of the words that are close and relevant to the answer.

Machine Reading Comprehension Position +3

Paper
Add Code

An Evaluation of Statistical Post-Editing Systems Applied to RBMT and SMT Systems

no code implementations • COLING 2012 • Hanna B{\'e}chara, Rapha{\"e}l Rubino, Yifan He, Yanjun Ma, Josef van Genabith

Machine Translation

Paper
Add Code

SaGE: 基于句法感知图卷积神经网络和ELECTRA的中文隐喻识别模型(SaGE: Syntax-aware GCN with ELECTRA for Chinese Metaphor Detection)

no code implementations • CCL 2021 • Shenglong Zhang, Ying Liu, Yanjun Ma

“隐喻是人类语言中经常出现的一种特殊现象, 隐喻识别对于自然语言处理各项任务来说具有十分基础和重要的意义。针对中文领域的隐喻识别任务, 我们提出了一种基于句法感知图卷积神经网络和ELECTRA的隐喻识别模型(Syntax-aware GCN withELECTRA SaGE)。该模型从语言学出发, 使用ELECTRA和Transformer编码器抽取句子的语义特征, 将句子按照依存关系组织成一张图并使用图卷积神经网络抽取其句法特征, 在此基础上对两类特征进行融合以进行隐喻识别。我们的模型在CCL2018中文隐喻识别评测数据集上以85. 22%的宏平均F1分数超越了此前的最佳成绩, 验证了融合语义信息和句法信息对于隐喻识别任务具有重要作用。”

Paper
Add Code

A Gentle Introduction to Deep Nets and Opportunities for the Future

no code implementations • ACL 2022 • Kenneth Church, Valia Kordoni, Gary Marcus, Ernest Davis, Yanjun Ma, Zeyu Chen

The first half of this tutorial will make deep nets more accessible to a broader audience, following “Deep Nets for Poets” and “A Gentle Introduction to Fine-Tuning.” We will also introduce GFT (general fine tuning), a little language for fine tuning deep nets with short (one line) programs that are as easy to code as regression in statistics packages such as R using glm (general linear models).

Paper
Add Code

Cannot find the paper you are looking for? You can Submit a new open access paper.