Search Results for author: Mengli Cheng

Found 7 papers, 2 papers with code

DiffPoint: Single and Multi-view Point Cloud Reconstruction with ViT Based Diffusion Model

no code implementations • 17 Feb 2024 • Yu Feng, Xing Shi, Mengli Cheng, Yun Xiong

As the task of 2D-to-3D reconstruction has gained significant attention in various real-world scenarios, it becomes crucial to be able to generate high-quality point clouds.

Point cloud reconstruction

MuLTI: Efficient Video-and-Language Understanding with Text-Guided MultiWay-Sampler and Multiple Choice Modeling

no code implementations • 10 Mar 2023 • Jiaqi Xu, Bo Liu, Yunkuo Chen, Mengli Cheng, Xing Shi

Specifically, we design a Text-Guided MultiWay-Sampler based on adapt-pooling residual mapping and self-attention modules to sample long sequences and fuse multi-modal features, which reduces the computational costs and addresses performance degradation caused by previous samplers.

Ranked #1 on TGIF-Transition on TGIF-QA (using extra training data)

Multi-Label Classification Multiple-choice +8

EasyRec: An easy-to-use, extendable and efficient framework for building industrial recommendation systems

1 code implementation • 26 Sep 2022 • Mengli Cheng, Yue Gao, Guoqiang Liu, Hongsheng Jin, Xiaowen Zhang

We present EasyRec, an easy-to-use, extendable and efficient recommendation framework for building industrial recommendation systems.

feature selection Recommendation Systems

EasyASR: A Distributed Machine Learning Platform for End-to-end Automatic Speech Recognition

no code implementations • 14 Sep 2020 • Chengyu Wang, Mengli Cheng, Xu Hu, Jun Huang

We present EasyASR, a distributed machine learning platform for training and serving large-scale Automatic Speech Recognition (ASR) models, as well as collecting and processing audio data at scale.

Automatic Speech Recognition Automatic Speech Recognition (ASR) +2

One-shot Text Field Labeling using Attention and Belief Propagation for Structure Information Extraction

1 code implementation • 9 Sep 2020 • Mengli Cheng, Minghui Qiu, Xing Shi, Jun Huang, Wei Lin

Existing learning-based methods for the text labeling task usually require a large number of labeled examples to train a specific model for each type of document.

One-Shot Learning Text Detection

Weakly Supervised Construction of ASR Systems with Massive Video Data

no code implementations • 4 Aug 2020 • Mengli Cheng, Chengyu Wang, Xu Hu, Jun Huang, Xiaobo Wang

Building Automatic Speech Recognition (ASR) systems from scratch is significantly challenging, mostly due to the time-consuming and financially-expensive process of annotating a large amount of audio data with transcripts.

Automatic Speech Recognition Automatic Speech Recognition (ASR) +4
