Search Results for author: Ming Cheng

Found 33 papers, 9 papers with code

Toward Short-Term Glucose Prediction Solely Based on CGM Time Series

no code implementations18 Apr 2024 Ming Cheng, Xingjian Diao, Ziyi Zhou, Yanjun Cui, Wenjun Liu, Shitong Cheng

The global diabetes epidemic highlights the importance of maintaining good glycemic control.

CrossGP: Cross-Day Glucose Prediction Excluding Physiological Information

no code implementations16 Apr 2024 Ziyi Zhou, Ming Cheng, Yanjun Cui, Xingjian Diao, Zhaorui Ma

Because diabetes may develop into potential serious complications, early glucose prediction for diabetic patients is necessary for timely medical treatment.

VeTraSS: Vehicle Trajectory Similarity Search Through Graph Modeling and Representation Learning

no code implementations11 Apr 2024 Ming Cheng, BoWen Zhang, Ziyu Wang, Ziyi Zhou, Weiqi Feng, Yi Lyu, Xingjian Diao

Trajectory similarity search plays an essential role in autonomous driving, as it enables vehicles to analyze the information and characteristics of different trajectories to make informed decisions and navigate safely in dynamic environments.

Autonomous Driving Navigate +1

STARFlow: Spatial Temporal Feature Re-embedding with Attentive Learning for Real-world Scene Flow

1 code implementation11 Mar 2024 Zhiyang Lu, Qinghan Chen, Ming Cheng

Scene flow prediction is a crucial underlying task in understanding dynamic scenes as it offers fundamental motion information.

Multi-Input Multi-Output Target-Speaker Voice Activity Detection For Unified, Flexible, and Robust Audio-Visual Speaker Diarization

no code implementations16 Jan 2024 Ming Cheng, Ming Li

The proposed method can take audio-visual input and leverage the speaker's acoustic footprint or lip track to flexibly conduct audio-based, video-based, and audio-visual speaker diarization in a unified sequence-to-sequence framework.

Action Detection Activity Detection +7

SAIC: Integration of Speech Anonymization and Identity Classification

no code implementations23 Dec 2023 Ming Cheng, Xingjian Diao, Shitong Cheng, Wenjun Liu

Speech anonymization and de-identification have garnered significant attention recently, especially in the healthcare area including telehealth consultations, patient voiceprint matching, and patient real-time monitoring.

Classification De-identification

FT2TF: First-Person Statement Text-To-Talking Face Generation

no code implementations9 Dec 2023 Xingjian Diao, Ming Cheng, Wayner Barrios, SouYoung Jin

This achievement highlights our model capability to bridge first-person statements and dynamic face generation, providing insightful guidance for future work.

Talking Face Generation

AV-MaskEnhancer: Enhancing Video Representations through Audio-Visual Masked Autoencoder

no code implementations15 Sep 2023 Xingjian Diao, Ming Cheng, Shitong Cheng

Learning high-quality video representation has shown significant applications in computer vision and remains challenging.

Video Classification

Masked Cross-image Encoding for Few-shot Segmentation

no code implementations22 Aug 2023 Wenbo Xu, Huaxi Huang, Ming Cheng, Litao Yu, Qiang Wu, Jian Zhang

Few-shot segmentation (FSS) is a dense prediction task that aims to infer the pixel-wise labels of unseen classes using only a limited number of annotated images.

Few-Shot Semantic Segmentation

VoxBlink: A Large Scale Speaker Verification Dataset on Camera

no code implementations14 Aug 2023 Yuke Lin, Xiaoyi Qin, Guoqing Zhao, Ming Cheng, Ning Jiang, Haiyang Wu, Ming Li

In this paper, we introduce a large-scale and high-quality audio-visual speaker verification dataset, named VoxBlink.

Speaker Recognition Speaker Verification

OPDN: Omnidirectional Position-aware Deformable Network for Omnidirectional Image Super-Resolution

no code implementations26 Apr 2023 Xiaopeng Sun, Weiqi Li, Zhenyu Zhang, Qiufang Ma, Xuhan Sheng, Ming Cheng, Haoyu Ma, Shijie Zhao, Jian Zhang, Junlin Li, Li Zhang

Model A aims to enhance the feature extraction ability of 360{\deg} image positional information, while Model B further focuses on the high-frequency information of 360{\deg} images.

Image Super-Resolution Position

Target-Speaker Voice Activity Detection via Sequence-to-Sequence Prediction

no code implementations28 Oct 2022 Ming Cheng, Weiqing Wang, Yucong Zhang, Xiaoyi Qin, Ming Li

Target-speaker voice activity detection is currently a promising approach for speaker diarization in complex acoustic environments.

Action Detection Activity Detection +2

The DKU-DukeECE Diarization System for the VoxCeleb Speaker Recognition Challenge 2022

no code implementations4 Oct 2022 Weiqing Wang, Xiaoyi Qin, Ming Cheng, Yucong Zhang, Kangyue Wang, Ming Li

This paper discribes the DKU-DukeECE submission to the 4th track of the VoxCeleb Speaker Recognition Challenge 2022 (VoxSRC-22).

Action Detection Activity Detection +2

H2-Stereo: High-Speed, High-Resolution Stereoscopic Video System

no code implementations4 Aug 2022 Ming Cheng, Yiling Xu, Wang Shen, M. Salman Asif, Chao Ma, Jun Sun, Zhan Ma

We utilize a disparity network to transfer spatiotemporal information across views even in large disparity scenes, based on which, we propose disparity-guided flow-based warping for LSR-HFR view and complementary warping for HSR-LFR view.

Super-Resolution Vocal Bursts Intensity Prediction

Multi-Graph Fusion Networks for Urban Region Embedding

1 code implementation24 Jan 2022 Shangbin Wu, Xu Yan, Xiaoliang Fan, Shirui Pan, Shichao Zhu, Chuanpan Zheng, Ming Cheng, Cheng Wang

Human mobility data contains rich but abundant information, which yields to the comprehensive region embeddings for cross domain tasks.

Crime Prediction

DLA-Net: Learning Dual Local Attention Features for Semantic Segmentation of Large-Scale Building Facade Point Clouds

1 code implementation1 Jun 2021 Yanfei Su, Weiquan Liu, Zhimin Yuan, Ming Cheng, Zhihong Zhang, Xuelun Shen, Cheng Wang

As there is a lack of 3D point clouds datasets related to the fine-grained building facade, we construct the first large-scale building facade point clouds benchmark dataset for semantic segmentation.

3D Semantic Segmentation Point Cloud Segmentation +1

Outage Constrained Robust Secure Beamforming in Cognitive Satellite-Aerial Networks

no code implementations13 May 2021 Bai Zhao, Min Lin, Ming Cheng, Wei-Ping Zhu, Naofal Al-Dhahir

This paper proposes a robust beamforming scheme to enhance the physical layer security (PLS) of multicast transmission in a cognitive satellite and aerial network (CSAN) operating in the millimeter wave frequency band.

The evolution of network controllability in growing networks

no code implementations26 Jan 2021 Rui Zhang, Xiaomeng Wang, Ming Cheng, Tao Jia

The study of network structural controllability focuses on the minimum number of driver nodes needed to control a whole network.

Physics and Society

DASGIL: Domain Adaptation for Semantic and Geometric-aware Image-based Localization

1 code implementation1 Oct 2020 Hanjiang Hu, Zhijian Qiao, Ming Cheng, Zhe Liu, Hesheng Wang

Long-Term visual localization under changing environments is a challenging problem in autonomous driving and mobile robotics due to season, illumination variance, etc.

Autonomous Driving Domain Adaptation +5

RWF-2000: An Open Large Scale Video Database for Violence Detection

1 code implementation14 Nov 2019 Ming Cheng, Kunjing Cai, Ming Li

In recent years, surveillance cameras are widely deployed in public places, and the general crime rate has been reduced significantly due to these ubiquitous devices.

Action Classification Action Recognition

Bacteria Biotope Relation Extraction via Lexical Chains and Dependency Graphs

no code implementations WS 2019 Wuti Xiong, Fei Li, Ming Cheng, Hong Yu, Donghong Ji

abstract In this article, we describe our approach for the Bacteria Biotopes relation extraction (BB-rel) subtask in the BioNLP Shared Task 2019.

graph construction Relation +2

A Dual Camera System for High Spatiotemporal Resolution Video Acquisition

no code implementations28 Sep 2019 Ming Cheng, Zhan Ma, M. Salman Asif, Yiling Xu, Haojie Liu, Wenbo Bao, Jun Sun

This paper presents a dual camera system for high spatiotemporal resolution (HSTR) video acquisition, where one camera shoots a video with high spatial resolution and low frame rate (HSR-LFR) and another one captures a low spatial resolution and high frame rate (LSR-HFR) video.

Vocal Bursts Intensity Prediction

RF-Net: An End-to-End Image Matching Network based on Receptive Field

1 code implementation CVPR 2019 Xuelun Shen, Cheng Wang, Xin Li, Zenglei Yu, Jonathan Li, Chenglu Wen, Ming Cheng, Zijian He

This paper proposes a new end-to-end trainable matching network based on receptive field, RF-Net, to compute sparse correspondence between images.

Keypoint Detection

Learned Quality Enhancement via Multi-Frame Priors for HEVC Compliant Low-Delay Applications

no code implementations3 May 2019 Ming Lu, Ming Cheng, Yiling Xu, ShiLiang Pu, Qiu Shen, Zhan Ma

Networked video applications, e. g., video conferencing, often suffer from poor visual quality due to unexpected network fluctuation and limited bandwidth.

Video Compression

Advancing the State of the Art in Open Domain Dialog Systems through the Alexa Prize

no code implementations27 Dec 2018 Chandra Khatri, Behnam Hedayatnia, Anu Venkatesh, Jeff Nunn, Yi Pan, Qing Liu, Han Song, Anna Gottardi, Sanjeev Kwatra, Sanju Pancholi, Ming Cheng, Qinglang Chen, Lauren Stubel, Karthik Gopalakrishnan, Kate Bland, Raefer Gabriel, Arindam Mandal, Dilek Hakkani-Tur, Gene Hwang, Nate Michel, Eric King, Rohit Prasad

In the second iteration of the competition in 2018, university teams advanced the state of the art by using context in dialog models, leveraging knowledge graphs for language understanding, handling complex utterances, building statistical and hierarchical dialog managers, and leveraging model-driven signals from user responses.

Knowledge Graphs Management +4

On Evaluating and Comparing Open Domain Dialog Systems

no code implementations11 Jan 2018 Anu Venkatesh, Chandra Khatri, Ashwin Ram, Fenfei Guo, Raefer Gabriel, Ashish Nagar, Rohit Prasad, Ming Cheng, Behnam Hedayatnia, Angeliki Metallinou, Rahul Goel, Shaohua Yang, Anirudh Raju

In this paper, we propose a comprehensive evaluation strategy with multiple metrics designed to reduce subjectivity by selecting metrics which correlate well with human judgement.

Goal-Oriented Dialogue Systems Open-Domain Dialog

Partial Procedural Geometric Model Fitting for Point Clouds

1 code implementation17 Oct 2016 Zongliang Zhang, Jonathan Li, Yulan Guo, Yangbin Lin, Ming Cheng, Cheng Wang

However, most geometric model fitting methods are unable to fit an arbitrary geometric model (e. g. a surface with holes) to incomplete data, due to that the similarity metrics used in these methods are unable to measure the rigid partial similarity between arbitrary models.

Cannot find the paper you are looking for? You can Submit a new open access paper.