Search Results for author: Ming Cheng

Found 33 papers, 9 papers with code

Toward Short-Term Glucose Prediction Solely Based on CGM Time Series

no code implementations • 18 Apr 2024 • Ming Cheng, Xingjian Diao, Ziyi Zhou, Yanjun Cui, Wenjun Liu, Shitong Cheng

The global diabetes epidemic highlights the importance of maintaining good glycemic control.

CrossGP: Cross-Day Glucose Prediction Excluding Physiological Information

no code implementations • 16 Apr 2024 • Ziyi Zhou, Ming Cheng, Yanjun Cui, Xingjian Diao, Zhaorui Ma

Because diabetes may develop into potentially serious complications, early glucose prediction for diabetic patients is necessary for timely medical treatment.

VeTraSS: Vehicle Trajectory Similarity Search Through Graph Modeling and Representation Learning

no code implementations • 11 Apr 2024 • Ming Cheng, BoWen Zhang, Ziyu Wang, Ziyi Zhou, Weiqi Feng, Yi Lyu, Xingjian Diao

Trajectory similarity search plays an essential role in autonomous driving, as it enables vehicles to analyze the information and characteristics of different trajectories to make informed decisions and navigate safely in dynamic environments.

Autonomous Driving Navigate +1

Density-guided Translator Boosts Synthetic-to-Real Unsupervised Domain Adaptive Segmentation of 3D Point Clouds

1 code implementation • 27 Mar 2024 • Zhimin Yuan, Wankang Zeng, Yanfei Su, Weiquan Liu, Ming Cheng, Yulan Guo, Cheng Wang

3D synthetic-to-real unsupervised domain adaptive segmentation is crucial to annotating new domains.

STARFlow: Spatial Temporal Feature Re-embedding with Attentive Learning for Real-world Scene Flow

1 code implementation • 11 Mar 2024 • Zhiyang Lu, Qinghan Chen, Ming Cheng

Scene flow prediction is a crucial underlying task in understanding dynamic scenes as it offers fundamental motion information.

Multi-Input Multi-Output Target-Speaker Voice Activity Detection For Unified, Flexible, and Robust Audio-Visual Speaker Diarization

no code implementations • 16 Jan 2024 • Ming Cheng, Ming Li

The proposed method can take audio-visual input and leverage the speaker's acoustic footprint or lip track to flexibly conduct audio-based, video-based, and audio-visual speaker diarization in a unified sequence-to-sequence framework.

Action Detection Activity Detection +7

SAIC: Integration of Speech Anonymization and Identity Classification

no code implementations • 23 Dec 2023 • Ming Cheng, Xingjian Diao, Shitong Cheng, Wenjun Liu

Speech anonymization and de-identification have garnered significant attention recently, especially in the healthcare area including telehealth consultations, patient voiceprint matching, and patient real-time monitoring.

Classification De-identification

FT2TF: First-Person Statement Text-To-Talking Face Generation

no code implementations • 9 Dec 2023 • Xingjian Diao, Ming Cheng, Wayner Barrios, SouYoung Jin

This achievement highlights our model's capability to bridge first-person statements and dynamic face generation, providing insightful guidance for future work.

Talking Face Generation

AV-MaskEnhancer: Enhancing Video Representations through Audio-Visual Masked Autoencoder

no code implementations • 15 Sep 2023 • Xingjian Diao, Ming Cheng, Shitong Cheng

Learning high-quality video representation has shown significant applications in computer vision and remains challenging.

Video Classification

Masked Cross-image Encoding for Few-shot Segmentation

no code implementations • 22 Aug 2023 • Wenbo Xu, Huaxi Huang, Ming Cheng, Litao Yu, Qiang Wu, Jian Zhang

Few-shot segmentation (FSS) is a dense prediction task that aims to infer the pixel-wise labels of unseen classes using only a limited number of annotated images.

Ranked #24 on Few-Shot Semantic Segmentation on COCO-20i (5-shot)

Few-Shot Semantic Segmentation

The DKU-MSXF Diarization System for the VoxCeleb Speaker Recognition Challenge 2023

no code implementations • 15 Aug 2023 • Ming Cheng, Weiqing Wang, Xiaoyi Qin, Yuke Lin, Ning Jiang, Guoqing Zhao, Ming Li

This paper describes the DKU-MSXF submission to track 4 of the VoxCeleb Speaker Recognition Challenge 2023 (VoxSRC-23).

Action Detection Activity Detection +2

VoxBlink: A Large Scale Speaker Verification Dataset on Camera

no code implementations • 14 Aug 2023 • Yuke Lin, Xiaoyi Qin, Guoqing Zhao, Ming Cheng, Ning Jiang, Haiyang Wu, Ming Li

In this paper, we introduce a large-scale and high-quality audio-visual speaker verification dataset, named VoxBlink.

Speaker Recognition Speaker Verification

Hybrid Transformer and CNN Attention Network for Stereo Image Super-resolution

no code implementations • 9 May 2023 • Ming Cheng, Haoyu Ma, Qiufang Ma, Xiaopeng Sun, Weiqi Li, Zhenyu Zhang, Xuhan Sheng, Shijie Zhao, Junlin Li, Li Zhang

Multi-stage strategies are frequently employed in image restoration tasks.

Data Augmentation Image Enhancement +2

OPDN: Omnidirectional Position-aware Deformable Network for Omnidirectional Image Super-Resolution

no code implementations • 26 Apr 2023 • Xiaopeng Sun, Weiqi Li, Zhenyu Zhang, Qiufang Ma, Xuhan Sheng, Ming Cheng, Haoyu Ma, Shijie Zhao, Jian Zhang, Junlin Li, Li Zhang

Model A aims to enhance the feature extraction ability of 360° image positional information, while Model B further focuses on the high-frequency information of 360° images.

Image Super-Resolution Position

Target-Speaker Voice Activity Detection via Sequence-to-Sequence Prediction

no code implementations • 28 Oct 2022 • Ming Cheng, Weiqing Wang, Yucong Zhang, Xiaoyi Qin, Ming Li

Target-speaker voice activity detection is currently a promising approach for speaker diarization in complex acoustic environments.

Action Detection Activity Detection +2

GMA3D: Local-Global Attention Learning to Estimate Occluded Motions of Scene Flow

1 code implementation • 7 Oct 2022 • Zhiyang Lu, Ming Cheng

Scene flow represents the motion information of each point in the 3D point clouds.

Autonomous Driving Motion Segmentation +3

The DKU-DukeECE Diarization System for the VoxCeleb Speaker Recognition Challenge 2022

no code implementations • 4 Oct 2022 • Weiqing Wang, Xiaoyi Qin, Ming Cheng, Yucong Zhang, Kangyue Wang, Ming Li

This paper describes the DKU-DukeECE submission to the 4th track of the VoxCeleb Speaker Recognition Challenge 2022 (VoxSRC-22).

Action Detection Activity Detection +2

H2-Stereo: High-Speed, High-Resolution Stereoscopic Video System

no code implementations • 4 Aug 2022 • Ming Cheng, Yiling Xu, Wang Shen, M. Salman Asif, Chao Ma, Jun Sun, Zhan Ma

We utilize a disparity network to transfer spatiotemporal information across views even in large disparity scenes, based on which, we propose disparity-guided flow-based warping for LSR-HFR view and complementary warping for HSR-LFR view.

Super-Resolution Vocal Bursts Intensity Prediction

Multi-Graph Fusion Networks for Urban Region Embedding

1 code implementation • 24 Jan 2022 • Shangbin Wu, Xu Yan, Xiaoliang Fan, Shirui Pan, Shichao Zhu, Chuanpan Zheng, Ming Cheng, Cheng Wang

Human mobility data contains rich and abundant information, which yields comprehensive region embeddings for cross-domain tasks.

Crime Prediction

213

DLA-Net: Learning Dual Local Attention Features for Semantic Segmentation of Large-Scale Building Facade Point Clouds

1 code implementation • 1 Jun 2021 • Yanfei Su, Weiquan Liu, Zhimin Yuan, Ming Cheng, Zhihong Zhang, Xuelun Shen, Cheng Wang

As there is a lack of 3D point clouds datasets related to the fine-grained building facade, we construct the first large-scale building facade point clouds benchmark dataset for semantic segmentation.

3D Semantic Segmentation Point Cloud Segmentation +1

Outage Constrained Robust Secure Beamforming in Cognitive Satellite-Aerial Networks

no code implementations • 13 May 2021 • Bai Zhao, Min Lin, Ming Cheng, Wei-Ping Zhu, Naofal Al-Dhahir

This paper proposes a robust beamforming scheme to enhance the physical layer security (PLS) of multicast transmission in a cognitive satellite and aerial network (CSAN) operating in the millimeter wave frequency band.

The evolution of network controllability in growing networks

no code implementations • 26 Jan 2021 • Rui Zhang, Xiaomeng Wang, Ming Cheng, Tao Jia

The study of network structural controllability focuses on the minimum number of driver nodes needed to control a whole network.

Physics and Society

DASGIL: Domain Adaptation for Semantic and Geometric-aware Image-based Localization

1 code implementation • 1 Oct 2020 • Hanjiang Hu, Zhijian Qiao, Ming Cheng, Zhe Liu, Hesheng Wang

Long-term visual localization under changing environments is a challenging problem in autonomous driving and mobile robotics due to season, illumination variance, etc.

Autonomous Driving Domain Adaptation +5

RWF-2000: An Open Large Scale Video Database for Violence Detection

1 code implementation • 14 Nov 2019 • Ming Cheng, Kunjing Cai, Ming Li

In recent years, surveillance cameras have been widely deployed in public places, and the general crime rate has been reduced significantly due to these ubiquitous devices.

Ranked #6 on Activity Recognition on RWF-2000

Action Classification Action Recognition

367

Bacteria Biotope Relation Extraction via Lexical Chains and Dependency Graphs

no code implementations • WS 2019 • Wuti Xiong, Fei Li, Ming Cheng, Hong Yu, Donghong Ji

In this article, we describe our approach for the Bacteria Biotopes relation extraction (BB-rel) subtask in the BioNLP Shared Task 2019.

graph construction Relation +2

A Dual Camera System for High Spatiotemporal Resolution Video Acquisition

no code implementations • 28 Sep 2019 • Ming Cheng, Zhan Ma, M. Salman Asif, Yiling Xu, Haojie Liu, Wenbo Bao, Jun Sun

This paper presents a dual camera system for high spatiotemporal resolution (HSTR) video acquisition, where one camera shoots a video with high spatial resolution and low frame rate (HSR-LFR) and another one captures a low spatial resolution and high frame rate (LSR-HFR) video.

Vocal Bursts Intensity Prediction

RF-Net: An End-to-End Image Matching Network based on Receptive Field

1 code implementation • CVPR 2019 • Xuelun Shen, Cheng Wang, Xin Li, Zenglei Yu, Jonathan Li, Chenglu Wen, Ming Cheng, Zijian He

This paper proposes a new end-to-end trainable matching network based on receptive field, RF-Net, to compute sparse correspondence between images.

Keypoint Detection

128

Learned Quality Enhancement via Multi-Frame Priors for HEVC Compliant Low-Delay Applications

no code implementations • 3 May 2019 • Ming Lu, Ming Cheng, Yiling Xu, ShiLiang Pu, Qiu Shen, Zhan Ma

Networked video applications, e.g., video conferencing, often suffer from poor visual quality due to unexpected network fluctuation and limited bandwidth.

Video Compression

LO-Net: Deep Real-time Lidar Odometry

no code implementations • CVPR 2019 • Qing Li, Shaoyang Chen, Cheng Wang, Xin Li, Chenglu Wen, Ming Cheng, Jonathan Li

We present a novel deep convolutional network pipeline, LO-Net, for real-time lidar odometry estimation.

feature selection Pose Estimation

Advancing the State of the Art in Open Domain Dialog Systems through the Alexa Prize

no code implementations • 27 Dec 2018 • Chandra Khatri, Behnam Hedayatnia, Anu Venkatesh, Jeff Nunn, Yi Pan, Qing Liu, Han Song, Anna Gottardi, Sanjeev Kwatra, Sanju Pancholi, Ming Cheng, Qinglang Chen, Lauren Stubel, Karthik Gopalakrishnan, Kate Bland, Raefer Gabriel, Arindam Mandal, Dilek Hakkani-Tur, Gene Hwang, Nate Michel, Eric King, Rohit Prasad

In the second iteration of the competition in 2018, university teams advanced the state of the art by using context in dialog models, leveraging knowledge graphs for language understanding, handling complex utterances, building statistical and hierarchical dialog managers, and leveraging model-driven signals from user responses.

Knowledge Graphs Management +4

On Evaluating and Comparing Open Domain Dialog Systems

no code implementations • 11 Jan 2018 • Anu Venkatesh, Chandra Khatri, Ashwin Ram, Fenfei Guo, Raefer Gabriel, Ashish Nagar, Rohit Prasad, Ming Cheng, Behnam Hedayatnia, Angeliki Metallinou, Rahul Goel, Shaohua Yang, Anirudh Raju

In this paper, we propose a comprehensive evaluation strategy with multiple metrics designed to reduce subjectivity by selecting metrics which correlate well with human judgement.

Goal-Oriented Dialogue Systems Open-Domain Dialog

Conversational AI: The Science Behind the Alexa Prize

no code implementations • 11 Jan 2018 • Ashwin Ram, Rohit Prasad, Chandra Khatri, Anu Venkatesh, Raefer Gabriel, Qing Liu, Jeff Nunn, Behnam Hedayatnia, Ming Cheng, Ashish Nagar, Eric King, Kate Bland, Amanda Wartick, Yi Pan, Han Song, Sk Jayadevan, Gene Hwang, Art Pettigrue

This paper outlines the advances created by the university teams as well as the Alexa Prize team to achieve the common goal of solving the problem of Conversational

Management Natural Language Understanding +3

Partial Procedural Geometric Model Fitting for Point Clouds

1 code implementation • 17 Oct 2016 • Zongliang Zhang, Jonathan Li, Yulan Guo, Yangbin Lin, Ming Cheng, Cheng Wang

However, most geometric model fitting methods are unable to fit an arbitrary geometric model (e.g., a surface with holes) to incomplete data, because the similarity metrics used in these methods are unable to measure the rigid partial similarity between arbitrary models.
