no code implementations • 20 Aug 2024 • Changkun Liu, Shuai Chen, Yash Bhalgat, Siyan Hu, ZiRui Wang, Ming Cheng, Victor Adrian Prisacariu, Tristan Braud
We leverage 3D Gaussian Splatting (3DGS) as a scene representation and propose a novel test-time camera pose refinement framework, GSLoc.
no code implementations • 31 Jul 2024 • Zhiyang Lu, Qinghan Chen, Zhimin Yuan, Ming Cheng
Scene flow, which provides the 3D motion field of the first frame from two consecutive point clouds, is vital for dynamic scene perception.
no code implementations • 16 Jul 2024 • Yuke Lin, Ming Cheng, FuLin Zhang, Yingying Gao, Shilei Zhang, Ming Li
In this paper, we provide a large audio-visual speaker recognition dataset, VoxBlink2, which includes approximately 10M utterances with videos from 110K+ speakers in the wild.
1 code implementation • 19 May 2024 • Jianbo Dai, Jianqiao Lu, Yunlong Feng, Rongju Ruan, Ming Cheng, Haochen Tan, Zhijiang Guo
Our study analyzed two common benchmarks, HumanEval and MBPP, and found that these might not thoroughly evaluate LLMs' code generation capacities due to limitations in quality, difficulty, and granularity.
no code implementations • 19 Apr 2024 • Ziyi Zhou, Ming Cheng, Xingjian Diao, Yanjun Cui, Xiangling Li
This leaves a gap in expanding the scope of digital biomarkers for overall glycemic control in diabetes management.
no code implementations • 18 Apr 2024 • Ming Cheng, Xingjian Diao, Ziyi Zhou, Yanjun Cui, Wenjun Liu, Shitong Cheng
The global diabetes epidemic highlights the importance of maintaining good glycemic control.
no code implementations • 16 Apr 2024 • Ziyi Zhou, Ming Cheng, Yanjun Cui, Xingjian Diao, Zhaorui Ma
Because diabetes may lead to potentially serious complications, early glucose prediction for diabetic patients is necessary for timely medical treatment.
no code implementations • 11 Apr 2024 • Ming Cheng, BoWen Zhang, Ziyu Wang, Ziyi Zhou, Weiqi Feng, Yi Lyu, Xingjian Diao
Trajectory similarity search plays an essential role in autonomous driving, as it enables vehicles to analyze the information and characteristics of different trajectories to make informed decisions and navigate safely in dynamic environments.
1 code implementation • CVPR 2024 • Zhimin Yuan, Wankang Zeng, Yanfei Su, Weiquan Liu, Ming Cheng, Yulan Guo, Cheng Wang
3D synthetic-to-real unsupervised domain adaptive segmentation is crucial to annotating new domains.
1 code implementation • 11 Mar 2024 • Zhiyang Lu, Qinghan Chen, Ming Cheng
Scene flow prediction is a crucial underlying task in understanding dynamic scenes as it offers fundamental motion information.
no code implementations • 16 Jan 2024 • Ming Cheng, Ming Li
The proposed method can take audio-visual input and leverage the speaker's acoustic footprint or lip track to flexibly conduct audio-based, video-based, and audio-visual speaker diarization in a unified sequence-to-sequence framework.
1 code implementation • CVPR 2024 • Wen Li, Yuyang Yang, Shangshu Yu, Guosheng Hu, Chenglu Wen, Ming Cheng, Cheng Wang
We recognize that APR's lack of robust feature learning and of an iterative denoising process leads to suboptimal results.
no code implementations • 23 Dec 2023 • Ming Cheng, Xingjian Diao, Shitong Cheng, Wenjun Liu
Speech anonymization and de-identification have garnered significant attention recently, especially in the healthcare area including telehealth consultations, patient voiceprint matching, and patient real-time monitoring.
no code implementations • 9 Dec 2023 • Xingjian Diao, Ming Cheng, Wayner Barrios, SouYoung Jin
This achievement highlights our model's capability to bridge first-person statements and dynamic face generation, providing insightful guidance for future work.
no code implementations • 15 Sep 2023 • Xingjian Diao, Ming Cheng, Shitong Cheng
Learning high-quality video representation has shown significant applications in computer vision and remains challenging.
no code implementations • 22 Aug 2023 • Wenbo Xu, Huaxi Huang, Ming Cheng, Litao Yu, Qiang Wu, Jian Zhang
Few-shot segmentation (FSS) is a dense prediction task that aims to infer the pixel-wise labels of unseen classes using only a limited number of annotated images.
Ranked #27 on Few-Shot Semantic Segmentation on COCO-20i (5-shot)
no code implementations • 15 Aug 2023 • Ming Cheng, Weiqing Wang, Xiaoyi Qin, Yuke Lin, Ning Jiang, Guoqing Zhao, Ming Li
This paper describes the DKU-MSXF submission to track 4 of the VoxCeleb Speaker Recognition Challenge 2023 (VoxSRC-23).
no code implementations • 14 Aug 2023 • Yuke Lin, Xiaoyi Qin, Guoqing Zhao, Ming Cheng, Ning Jiang, Haiyang Wu, Ming Li
In this paper, we introduce a large-scale and high-quality audio-visual speaker verification dataset, named VoxBlink.
no code implementations • 9 May 2023 • Ming Cheng, Haoyu Ma, Qiufang Ma, Xiaopeng Sun, Weiqi Li, Zhenyu Zhang, Xuhan Sheng, Shijie Zhao, Junlin Li, Li Zhang
Multi-stage strategies are frequently employed in image restoration tasks.
no code implementations • 26 Apr 2023 • Xiaopeng Sun, Weiqi Li, Zhenyu Zhang, Qiufang Ma, Xuhan Sheng, Ming Cheng, Haoyu Ma, Shijie Zhao, Jian Zhang, Junlin Li, Li Zhang
Model A aims to enhance the feature extraction ability of 360° image positional information, while Model B further focuses on the high-frequency information of 360° images.
no code implementations • 28 Oct 2022 • Ming Cheng, Weiqing Wang, Yucong Zhang, Xiaoyi Qin, Ming Li
Target-speaker voice activity detection is currently a promising approach for speaker diarization in complex acoustic environments.
1 code implementation • 7 Oct 2022 • Zhiyang Lu, Ming Cheng
Scene flow represents the motion information of each point in the 3D point clouds.
no code implementations • 4 Oct 2022 • Weiqing Wang, Xiaoyi Qin, Ming Cheng, Yucong Zhang, Kangyue Wang, Ming Li
This paper describes the DKU-DukeECE submission to the 4th track of the VoxCeleb Speaker Recognition Challenge 2022 (VoxSRC-22).
no code implementations • 4 Aug 2022 • Ming Cheng, Yiling Xu, Wang Shen, M. Salman Asif, Chao Ma, Jun Sun, Zhan Ma
We utilize a disparity network to transfer spatiotemporal information across views even in large disparity scenes, based on which, we propose disparity-guided flow-based warping for LSR-HFR view and complementary warping for HSR-LFR view.
1 code implementation • 24 Jan 2022 • Shangbin Wu, Xu Yan, Xiaoliang Fan, Shirui Pan, Shichao Zhu, Chuanpan Zheng, Ming Cheng, Cheng Wang
Human mobility data contains rich and abundant information, which yields comprehensive region embeddings for cross-domain tasks.
1 code implementation • 1 Jun 2021 • Yanfei Su, Weiquan Liu, Zhimin Yuan, Ming Cheng, Zhihong Zhang, Xuelun Shen, Cheng Wang
As there is a lack of 3D point cloud datasets related to fine-grained building facades, we construct the first large-scale building facade point cloud benchmark dataset for semantic segmentation.
no code implementations • 13 May 2021 • Bai Zhao, Min Lin, Ming Cheng, Wei-Ping Zhu, Naofal Al-Dhahir
This paper proposes a robust beamforming scheme to enhance the physical layer security (PLS) of multicast transmission in a cognitive satellite and aerial network (CSAN) operating in the millimeter wave frequency band.
no code implementations • 26 Jan 2021 • Rui Zhang, Xiaomeng Wang, Ming Cheng, Tao Jia
The study of network structural controllability focuses on the minimum number of driver nodes needed to control a whole network.
Physics and Society
1 code implementation • 1 Oct 2020 • Hanjiang Hu, Zhijian Qiao, Ming Cheng, Zhe Liu, Hesheng Wang
Long-term visual localization under changing environments is a challenging problem in autonomous driving and mobile robotics due to season, illumination variance, etc.
1 code implementation • 14 Nov 2019 • Ming Cheng, Kunjing Cai, Ming Li
In recent years, surveillance cameras have been widely deployed in public places, and the general crime rate has been reduced significantly due to these ubiquitous devices.
Ranked #6 on Activity Recognition on RWF-2000
no code implementations • WS 2019 • Wuti Xiong, Fei Li, Ming Cheng, Hong Yu, Donghong Ji
In this article, we describe our approach for the Bacteria Biotopes relation extraction (BB-rel) subtask in the BioNLP Shared Task 2019.
no code implementations • 28 Sep 2019 • Ming Cheng, Zhan Ma, M. Salman Asif, Yiling Xu, Haojie Liu, Wenbo Bao, Jun Sun
This paper presents a dual camera system for high spatiotemporal resolution (HSTR) video acquisition, where one camera shoots a video with high spatial resolution and low frame rate (HSR-LFR) and another one captures a low spatial resolution and high frame rate (LSR-HFR) video.
1 code implementation • CVPR 2019 • Xuelun Shen, Cheng Wang, Xin Li, Zenglei Yu, Jonathan Li, Chenglu Wen, Ming Cheng, Zijian He
This paper proposes a new end-to-end trainable matching network based on receptive field, RF-Net, to compute sparse correspondence between images.
no code implementations • 3 May 2019 • Ming Lu, Ming Cheng, Yiling Xu, ShiLiang Pu, Qiu Shen, Zhan Ma
Networked video applications, e.g., video conferencing, often suffer from poor visual quality due to unexpected network fluctuation and limited bandwidth.
no code implementations • CVPR 2019 • Qing Li, Shaoyang Chen, Cheng Wang, Xin Li, Chenglu Wen, Ming Cheng, Jonathan Li
We present a novel deep convolutional network pipeline, LO-Net, for real-time lidar odometry estimation.
no code implementations • 27 Dec 2018 • Chandra Khatri, Behnam Hedayatnia, Anu Venkatesh, Jeff Nunn, Yi Pan, Qing Liu, Han Song, Anna Gottardi, Sanjeev Kwatra, Sanju Pancholi, Ming Cheng, Qinglang Chen, Lauren Stubel, Karthik Gopalakrishnan, Kate Bland, Raefer Gabriel, Arindam Mandal, Dilek Hakkani-Tur, Gene Hwang, Nate Michel, Eric King, Rohit Prasad
In the second iteration of the competition in 2018, university teams advanced the state of the art by using context in dialog models, leveraging knowledge graphs for language understanding, handling complex utterances, building statistical and hierarchical dialog managers, and leveraging model-driven signals from user responses.
no code implementations • 11 Jan 2018 • Anu Venkatesh, Chandra Khatri, Ashwin Ram, Fenfei Guo, Raefer Gabriel, Ashish Nagar, Rohit Prasad, Ming Cheng, Behnam Hedayatnia, Angeliki Metallinou, Rahul Goel, Shaohua Yang, Anirudh Raju
In this paper, we propose a comprehensive evaluation strategy with multiple metrics designed to reduce subjectivity by selecting metrics which correlate well with human judgement.
no code implementations • 11 Jan 2018 • Ashwin Ram, Rohit Prasad, Chandra Khatri, Anu Venkatesh, Raefer Gabriel, Qing Liu, Jeff Nunn, Behnam Hedayatnia, Ming Cheng, Ashish Nagar, Eric King, Kate Bland, Amanda Wartick, Yi Pan, Han Song, Sk Jayadevan, Gene Hwang, Art Pettigrue
This paper outlines the advances created by the university teams as well as the Alexa Prize team to achieve the common goal of solving the problem of Conversational
1 code implementation • 17 Oct 2016 • Zongliang Zhang, Jonathan Li, Yulan Guo, Yangbin Lin, Ming Cheng, Cheng Wang
However, most geometric model fitting methods are unable to fit an arbitrary geometric model (e.g., a surface with holes) to incomplete data, because the similarity metrics used in these methods are unable to measure the rigid partial similarity between arbitrary models.