Search Results for author: Mengli Cheng

Found 7 papers, 2 papers with code

DiffPoint: Single and Multi-view Point Cloud Reconstruction with ViT Based Diffusion Model

no code implementations • 17 Feb 2024 • Yu Feng, Xing Shi, Mengli Cheng, Yun Xiong

As the task of 2D-to-3D reconstruction has gained significant attention in various real-world scenarios, it becomes crucial to be able to generate high-quality point clouds.

Point cloud reconstruction

MuLTI: Efficient Video-and-Language Understanding with Text-Guided MultiWay-Sampler and Multiple Choice Modeling

no code implementations • 10 Mar 2023 • Jiaqi Xu, Bo Liu, Yunkuo Chen, Mengli Cheng, Xing Shi

Specifically, we design a Text-Guided MultiWay-Sampler based on adapt-pooling residual mapping and self-attention modules to sample long sequences and fuse multi-modal features, which reduces the computational costs and addresses performance degradation caused by previous samplers.

Ranked #1 on TGIF-Transition on TGIF-QA (using extra training data)

Multi-Label Classification, Multiple-choice, +8

EasyRec: An easy-to-use, extendable and efficient framework for building industrial recommendation systems

1 code implementation • 26 Sep 2022 • Mengli Cheng, Yue Gao, Guoqiang Liu, Hongsheng Jin, Xiaowen Zhang

We present EasyRec, an easy-to-use, extendable and efficient recommendation framework for building industrial recommendation systems.

feature selection, Recommendation Systems

1,471 stars

EasyASR: A Distributed Machine Learning Platform for End-to-end Automatic Speech Recognition

no code implementations • 14 Sep 2020 • Chengyu Wang, Mengli Cheng, Xu Hu, Jun Huang

We present EasyASR, a distributed machine learning platform for training and serving large-scale Automatic Speech Recognition (ASR) models, as well as collecting and processing audio data at scale.

Automatic Speech Recognition, Automatic Speech Recognition (ASR), +2

One-shot Text Field Labeling using Attention and Belief Propagation for Structure Information Extraction

1 code implementation • 9 Sep 2020 • Mengli Cheng, Minghui Qiu, Xing Shi, Jun Huang, Wei Lin

Existing learning-based methods for the text labeling task usually require a large number of labeled examples to train a specific model for each type of document.

One-Shot Learning, Text Detection

Weakly Supervised Construction of ASR Systems with Massive Video Data

no code implementations • 4 Aug 2020 • Mengli Cheng, Chengyu Wang, Xu Hu, Jun Huang, Xiaobo Wang

Building Automatic Speech Recognition (ASR) systems from scratch is significantly challenging, mostly due to the time-consuming and financially-expensive process of annotating a large amount of audio data with transcripts.

Automatic Speech Recognition, Automatic Speech Recognition (ASR), +4

IncepText: A New Inception-Text Module with Deformable PSROI Pooling for Multi-Oriented Scene Text Detection

no code implementations • 3 May 2018 • Qiangpeng Yang, Mengli Cheng, Wenmeng Zhou, Yan Chen, Minghui Qiu, Wei Lin, Wei Chu

To solve this problem, we propose a novel end-to-end scene text detector IncepText from an instance-aware segmentation perspective.

Multi-Oriented Scene Text Detection, object-detection, +3

