Search Results for author: Ming Li

Found 253 papers, 70 papers with code

Multi-Task Dense Retrieval via Model Uncertainty Fusion for Open-Domain Question Answering

1 code implementation • Findings (EMNLP) 2021 • Minghan Li, Ming Li, Kun Xiong, Jimmy Lin

Our method reaches state-of-the-art performance on 5 benchmark QA datasets, with up to 10% improvement in top-100 accuracy compared to a joint-training multi-task DPR on SQuAD.

Open-Domain Question Answering Retrieval

Paper
Code

Unsupervised Chunking as Syntactic Structure Induction with a Knowledge-Transfer Approach

1 code implementation • Findings (EMNLP) 2021 • Anup Anand Deshmukh, Qianqiu Zhang, Ming Li, Jimmy Lin, Lili Mou

In this paper, we address unsupervised chunking as a new task of syntactic structure induction, which is helpful for understanding the linguistic structures of human languages as well as processing low-resource languages.

Chunking Transfer Learning

Paper
Code

ConsistencyDet: A Robust Object Detector with a Denoising Paradigm of Consistency Model

1 code implementation • 11 Apr 2024 • Lifan Jiang, Zhihui Wang, Changmiao Wang, Ming Li, Jiaxu Leng, Xindong Wu

In the present study, we introduce a novel framework designed to articulate object detection as a denoising diffusion process, which operates on the perturbed bounding boxes of annotated entities.

Attribute Denoising +2

Paper
Code

ControlNet++: Improving Conditional Controls with Efficient Consistency Feedback

1 code implementation • 11 Apr 2024 • Ming Li, Taojiannan Yang, Huafeng Kuang, Jie Wu, Zhaoning Wang, Xuefeng Xiao, Chen Chen

To this end, we propose ControlNet++, a novel approach that improves controllable generation by explicitly optimizing pixel-level cycle consistency between generated images and conditional controls.

SSIM

137

Paper
Code

MealRec$^+$: A Meal Recommendation Dataset with Meal-Course Affiliation for Personalization and Healthiness

1 code implementation • 8 Apr 2024 • Ming Li, Lin Li, Xiaohui Tao, Jimmy Xiangji Huang

Due to constraints related to user health privacy and meal scenario characteristics, the collection of data that includes both meal-course affiliation and two levels of interactions is impeded.

Paper
Code

Scalable Model Editing via Customized Expert Networks

1 code implementation • 3 Apr 2024 • Zihan Yao, Yu He, Tianyu Qi, Ming Li

Our experiments on two different sizes of open-source large language models, the Llama2 7B and 13B, achieve state-of-the-art results compared to existing mainstream Model Editing methods.

Model Editing

Paper
Code

CSST Strong Lensing Preparation: a Framework for Detecting Strong Lenses in the Multi-color Imaging Survey by the China Survey Space Telescope (CSST)

no code implementations • 2 Apr 2024 • Xu Li, Ruiqi Sun, Jiameng Lv, Peng Jia, Nan Li, Chengliang Wei, Zou Hu, Xinzhong Er, Yun Chen, Zhang Ban, Yuedong Fang, Qi Guo, Dezi Liu, Guoliang Li, Lin Lin, Ming Li, Ran Li, Xiaobo Li, Yu Luo, Xianmin Meng, Jundan Nie, Zhaoxiang Qi, Yisheng Qiu, Li Shao, Hao Tian, Lei Wang, Wei Wang, Jingtian Xian, Youhua Xu, Tianmeng Zhang, Xin Zhang, Zhimin Zhou

To overcome these challenges, we have developed a framework based on a hierarchical visual Transformer with a sliding window technique to search for strong lensing systems within entire images.

Paper
Add Code

Voice EHR: Introducing Multimodal Audio Data for Health

no code implementations • 2 Apr 2024 • James Anibal, Hannah Huth, Ming Li, Lindsey Hazen, Yen Minh Lam, Nguyen Thi Thu Hang, Michael Kleinman, Shelley Ost, Christopher Jackson, Laura Sprabery, Cheran Elangovan, Balaji Krishnaiah, Lee Akst, Ioan Lina, Iqbal Elyazar, Lenny Ekwati, Stefan Jansen, Richard Nduwayezu, Charisse Garcia, Jeffrey Plum, Jacqueline Brenner, Miranda Song, Emily Ricotta, David Clifton, C. Louise Thwaites, Yael Bensoussan, Bradford Wood

This report introduces a consortium of partners for global work, presents the application used for data collection, and showcases the potential of informative voice EHR to advance the scalability and diversity of audio AI.

Decision Making

Paper
Add Code

A Comparative Study of Artificial Potential Fields and Safety Filters

no code implementations • 23 Mar 2024 • Ming Li, Zhiyong Sun

In this paper, we have demonstrated that the controllers designed by a classical motion planning tool, namely artificial potential fields (APFs), can be derived from a recently prevalent approach: control barrier function quadratic program (CBF-QP) safety filters.

Motion Planning

Paper
Add Code

KunquDB: An Attempt for Speaker Verification in the Chinese Opera Scenario

no code implementations • 20 Mar 2024 • Huali Zhou, Yuke Lin, Dong Liu, Ming Li

This work aims to promote Chinese opera research in both musical and speech domains, with a primary focus on overcoming the data limitations.

Domain Adaptation Speaker Verification

Paper
Add Code

A Tunable Universal Formula for Safety-Critical Control

no code implementations • 10 Mar 2024 • Ming Li, Zhiyong Sun, Patrick J. W. Koelewijn, Siep Weiland

Finally, we demonstrate the efficacy of our method through a collision avoidance example, investigating the essential properties including safety, robustness, and smoothness under various tunable scaling terms.

Collision Avoidance

Paper
Add Code

Unifying Controller Design for Stabilizing Nonlinear Systems with Norm-Bounded Control Inputs

no code implementations • 5 Mar 2024 • Ming Li, Zhiyong Sun, Siep Weiland

This paper revisits a classical challenge in the design of stabilizing controllers for nonlinear systems with a norm-bounded input constraint.

Paper
Add Code

Pyramid Feature Attention Network for Monocular Depth Prediction

no code implementations • 3 Mar 2024 • Yifang Xu, Chenglei Peng, Ming Li, Yang Li, Sidan Du

Deep convolutional neural networks (DCNNs) have achieved great success in monocular depth estimation (MDE).

Depth Prediction Monocular Depth Estimation

Paper
Add Code

Location-guided Head Pose Estimation for Fisheye Image

no code implementations • 28 Feb 2024 • Bing Li, Dong Zhang, Cheng Huang, Yun Xian, Ming Li, Dah-Jye Lee

Camera with a fisheye or ultra-wide lens covers a wide field of view that cannot be modeled by the perspective projection.

Head Pose Estimation Multi-Task Learning

Paper
Add Code

A Survey on Knowledge Distillation of Large Language Models

1 code implementation • 20 Feb 2024 • Xiaohan Xu, Ming Li, Chongyang Tao, Tao Shen, Reynold Cheng, Jinyang Li, Can Xu, DaCheng Tao, Tianyi Zhou

In the era of Large Language Models (LLMs), Knowledge Distillation (KD) emerges as a pivotal methodology for transferring advanced capabilities from leading proprietary LLMs, such as GPT-4, to their open-source counterparts like LLaMA and Mistral.

Data Augmentation Knowledge Distillation +1

231

Paper
Code

Can LLMs Speak For Diverse People? Tuning LLMs via Debate to Generate Controllable Controversial Statements

1 code implementation • 16 Feb 2024 • Ming Li, Jiuhai Chen, Lichang Chen, Tianyi Zhou

To examine DEBATunE, we curate the largest dataset of debate topics so far, which covers 710 controversial topics and corresponding arguments for each topic.

Paper
Code

Selective Reflection-Tuning: Student-Selected Data Recycling for LLM Instruction-Tuning

2 code implementations • 15 Feb 2024 • Ming Li, Lichang Chen, Jiuhai Chen, Shwai He, Jiuxiang Gu, Tianyi Zhou

Instruction tuning is critical to large language models (LLMs) for achieving better instruction following and task adaptation capabilities but its success heavily relies on the training data quality.

Data Augmentation Instruction Following

Paper
Code

Deep Rib Fracture Instance Segmentation and Classification from CT on the RibFrac Challenge

no code implementations • 14 Feb 2024 • Jiancheng Yang, Rui Shi, Liang Jin, Xiaoyang Huang, Kaiming Kuang, Donglai Wei, Shixuan Gu, Jianying Liu, PengFei Liu, Zhizhong Chai, Yongjie Xiao, Hao Chen, Liming Xu, Bang Du, Xiangyi Yan, Hao Tang, Adam Alessio, Gregory Holste, Jiapeng Zhang, Xiaoming Wang, Jianye He, Lixuan Che, Hanspeter Pfister, Ming Li, Bingbing Ni

The resulting FracNet+ demonstrates competitive performance in rib fracture detection, which lays a foundation for further research and development in AI-assisted rib fracture detection and diagnosis.

Instance Segmentation Semantic Segmentation

Paper
Add Code

Zero-shot Explainable Mental Health Analysis on Social Media by Incorporating Mental Scales

no code implementations • 9 Feb 2024 • Wenyu Li, Yinuo Zhu, Xin Lin, Ming Li, Ziyue Jiang, Ziqian Zeng

Traditional discriminative approaches in mental health analysis are known for their strong capacity but lack interpretability and demand large-scale annotated data.

Paper
Add Code

Superfiltering: Weak-to-Strong Data Filtering for Fast Instruction-Tuning

1 code implementation • 1 Feb 2024 • Ming Li, Yong Zhang, Shwai He, Zhitao Li, Hongyu Zhao, Jianzong Wang, Ning Cheng, Tianyi Zhou

Data filtering for instruction tuning has proved important in improving both the efficiency and performance of the tuning process.

Language Modelling

Paper
Code

Leveraging Biases in Large Language Models: "bias-kNN'' for Effective Few-Shot Learning

no code implementations • 18 Jan 2024 • Yong Zhang, Hanzhang Li, Zhitao Li, Ning Cheng, Ming Li, Jing Xiao, Jianzong Wang

Large Language Models (LLMs) have shown significant promise in various applications, including zero-shot and few-shot learning.

Few-Shot Learning In-Context Learning +2

Paper
Add Code

Multi-Input Multi-Output Target-Speaker Voice Activity Detection For Unified, Flexible, and Robust Audio-Visual Speaker Diarization

no code implementations • 16 Jan 2024 • Ming Cheng, Ming Li

The proposed method can take audio-visual input and leverage the speaker's acoustic footprint or lip track to flexibly conduct audio-based, video-based, and audio-visual speaker diarization in a unified sequence-to-sequence framework.

Action Detection Activity Detection +7

Paper
Add Code

End-to-End Learning for SLP-Based ISAC Systems

no code implementations • 11 Jan 2024 • Yixian Zheng, Rang Liu, Ming Li, Qian Liu

Integrated sensing and communication (ISAC) is an encouraging wireless technology which can simultaneously perform both radar and communication functionalities by sharing the same transmit waveform, spectral resource, and hardware platform.

Paper
Add Code

A Practical Beamforming Design for Active RIS-assisted MU-MISO Systems

no code implementations • 8 Jan 2024 • Yun Yang, Zhiping Lu, Ming Li, Rang Liu, Qian Liu

Motivated by this fact, in this paper we first investigate the amplification principle of typical active RIS and propose a more accurate amplification model based on amplifier hardware characteristics.

Paper
Add Code

Self-supervised Reflective Learning through Self-distillation and Online Clustering for Speaker Representation Learning

no code implementations • 3 Jan 2024 • Danwei Cai, Zexin Cai, Ming Li

Specifically, a teacher model continually refines pseudo labels through online clustering, providing dynamic supervision signals to train the student model.

Clustering Knowledge Distillation +3

Paper
Add Code

IL-NeRF: Incremental Learning for Neural Radiance Fields with Camera Pose Alignment

no code implementations • 10 Dec 2023 • Letian Zhang, Ming Li, Chen Chen, Jie Xu

This poses a paradox as the necessary camera pose must be estimated from the entire dataset, even though the data arrives sequentially and future chunks are inaccessible.

Incremental Learning Knowledge Distillation

Paper
Add Code

Safe Stabilization with Model Uncertainties: A Universal Formula with Gaussian Process Learning

no code implementations • 5 Dec 2023 • Ming Li, Zhiyong Sun

In our previous research, we developed an analytical control strategy, namely the universal formula, that incorporates CLF and CBF conditions for safe stabilization.

Gaussian Processes

Paper
Add Code

ColonNeRF: High-Fidelity Neural Reconstruction of Long Colonoscopy

no code implementations • 4 Dec 2023 • Yufei Shi, Beijia Lu, Jia-Wei Liu, Ming Li, Mike Zheng Shou

Specifically, to reconstruct the entire colon in a piecewise manner, our ColonNeRF introduces a region division and integration module, effectively reducing shape dissimilarity and ensuring geometric consistency in each segment.

Neural Rendering Novel View Synthesis

Paper
Add Code

LucidDreaming: Controllable Object-Centric 3D Generation

no code implementations • 30 Nov 2023 • Zhaoning Wang, Ming Li, Chen Chen

Nonetheless, achieving precise control over 3D generation continues to be an arduous task, as using text to control often leads to missing objects and imprecise locations.

3D Generation Benchmarking +4

Paper
Add Code

Where to Begin? From Random to Foundation Model Instructed Initialization in Federated Learning for Medical Image Segmentation

no code implementations • 27 Nov 2023 • Ming Li, Guang Yang

In medical image analysis, Federated Learning (FL) stands out as a key technology that enables privacy-preserved, decentralized data processing, crucial for handling sensitive medical data.

Federated Learning Image Segmentation +2

Paper
Add Code

Potential Societal Biases of ChatGPT in Higher Education: A Scoping Review

no code implementations • 24 Nov 2023 • Ming Li, Ariunaa Enkhtur, Beverley Anne Yamamoto, Fei Cheng

In this scoping review, we clarify the ways in which biases related to GAI in higher education settings have been discussed in recent academic publications and identify what type of potential biases are commonly reported in this body of literature.

Paper
Add Code

Ethical implications of ChatGPT in higher education: A scoping review

no code implementations • 24 Nov 2023 • Ming Li, Ariunaa Enkhtur, Fei Cheng, Beverley Anne Yamamoto

This scoping review explores the ethical challenges of using ChatGPT in education, focusing particularly on issues related to higher education.

Misinformation

Paper
Add Code

A Safer Vision-based Autonomous Planning System for Quadrotor UAVs with Dynamic Obstacle Trajectory Prediction and Its Application with LLMs

no code implementations • 21 Nov 2023 • Jiageng Zhong, Ming Li, Yinliang Chen, Zihang Wei, Fan Yang, Haoran Shen

For intelligent quadcopter UAVs, a robust and reliable autonomous planning system is crucial.

object-detection Object Detection +3

Paper
Add Code

Joint Sensing and Communication Optimization in Target-Mounted STARS-Assisted Vehicular Networks: A MADRL Approach

no code implementations • 17 Nov 2023 • Haocheng Zhang, Rang Liu, Ming Li, Wei Wang, Qian Liu

Extensive experimental results confirm the effectiveness of our proposed MADRL framework in improving both sensing and communication performance through the utilization of target-mounted STARS.

Decision Making

Paper
Add Code

Instant3D: Instant Text-to-3D Generation

no code implementations • 14 Nov 2023 • Ming Li, Pan Zhou, Jia-Wei Liu, Jussi Keppo, Min Lin, Shuicheng Yan, Xiangyu Xu

Once trained, Instant3D is able to create a 3D object for an unseen text prompt in less than one second with a single run of a feedforward network.

3D Generation Negation +1

Paper
Add Code

PRCA: Fitting Black-Box Large Language Models for Retrieval Question Answering via Pluggable Reward-Driven Contextual Adapter

no code implementations • 23 Oct 2023 • Haoyan Yang, Zhitao Li, Yong Zhang, Jianzong Wang, Ning Cheng, Ming Li, Jing Xiao

The Retrieval Question Answering (ReQA) task employs the retrieval-augmented framework, composed of a retriever and generator.

Question Answering Retrieval

Paper
Add Code

Data-Free Distillation Improves Efficiency and Privacy in Federated Thorax Disease Analysis

no code implementations • 22 Oct 2023 • Ming Li, Guang Yang

Thorax disease analysis in large-scale, multi-centre, and multi-scanner settings is often limited by strict privacy policies.

Federated Learning Privacy Preserving

Paper
Add Code

Mitigating the missing-fragmentation problem in de novo peptide sequencing with a two-stage graph-based deep learning model

1 code implementation • Nature Machine Intelligence 2023 • Zeping Mao, Ruixue Zhang, Lei Xin, Ming Li

Here we reveal that in the process of peptide prediction, missing fragmentation results in the generation of incorrect amino acids within those regions and causes error accumulation thereafter.

de novo peptide sequencing

Paper
Code

Reflection-Tuning: Data Recycling Improves LLM Instruction-Tuning

2 code implementations • 18 Oct 2023 • Ming Li, Lichang Chen, Jiuhai Chen, Shwai He, Heng Huang, Jiuxiang Gu, Tianyi Zhou

Recent advancements in Large Language Models (LLMs) have expanded the horizons of natural language understanding and generation.

Natural Language Understanding

Paper
Code

End-to-end Online Speaker Diarization with Target Speaker Tracking

no code implementations • 12 Oct 2023 • Weiqing Wang, Ming Li

During the inference process, we employ a front-end model to extract the frame-level speaker embeddings for each coming block of a signal.

Action Detection Activity Detection +3

Paper
Add Code

ChatRadio-Valuer: A Chat Large Language Model for Generalizable Radiology Report Generation Based on Multi-institution and Multi-system Data

no code implementations • 8 Oct 2023 • Tianyang Zhong, Wei Zhao, Yutong Zhang, Yi Pan, Peixin Dong, Zuowei Jiang, Xiaoyan Kui, Youlan Shang, Li Yang, Yaonai Wei, Longtao Yang, Hao Chen, Huan Zhao, Yuxiao Liu, Ning Zhu, Yiwei Li, Yisong Wang, Jiaqi Yao, Jiaqi Wang, Ying Zeng, Lei He, Chao Zheng, Zhixue Zhang, Ming Li, Zhengliang Liu, Haixing Dai, Zihao Wu, Lu Zhang, Shu Zhang, Xiaoyan Cai, Xintao Hu, Shijie Zhao, Xi Jiang, Xin Zhang, Xiang Li, Dajiang Zhu, Lei Guo, Dinggang Shen, Junwei Han, Tianming Liu, Jun Liu, Tuo Zhang

Radiology report generation, as a key step in medical image analysis, is critical to the quantitative analysis of clinically informed decision-making levels.

Decision Making Language Modelling +1

Paper
Add Code

Multi-objective Progressive Clustering for Semi-supervised Domain Adaptation in Speaker Verification

no code implementations • 7 Oct 2023 • Ze Li, Yuke Lin, Ning Jiang, Xiaoyi Qin, Guoqing Zhao, Haiying Wu, Ming Li

Utilizing the pseudo-labeling algorithm with large-scale unlabeled data becomes crucial for semi-supervised domain adaptation in speaker verification tasks.

Clustering Denoising +3

Paper
Add Code

BiSinger: Bilingual Singing Voice Synthesis

1 code implementation • 25 Sep 2023 • Huali Zhou, Yueqian Lin, Yao Shi, Peng Sun, Ming Li

We fuse monolingual singing datasets with open-source singing voice conversion techniques to generate bilingual singing voices while also exploring the potential use of bilingual speech data.

Singing Voice Synthesis Voice Conversion

Paper
Code

Haha-Pod: An Attempt for Laughter-based Non-Verbal Speaker Verification

1 code implementation • 25 Sep 2023 • Yuke Lin, Xiaoyi Qin, Ning Jiang, Guoqing Zhao, Ming Li

It is widely acknowledged that discriminative representation for speaker verification can be extracted from verbal speech.

Speaker Verification

Paper
Code

Semi-supervised News Discourse Profiling with Contrastive Learning

no code implementations • 20 Sep 2023 • Ming Li, Ruihong Huang

News Discourse Profiling seeks to scrutinize the event-related role of each sentence in a news article and has been proven useful across various downstream applications.

Contrastive Learning Sentence

Paper
Add Code

Cramer-Rao Bound Optimization for Active RIS-Empowered ISAC Systems

no code implementations • 17 Sep 2023 • Qi Zhu, Ming Li, Rang Liu, Qian Liu

Integrated sensing and communication (ISAC), which simultaneously performs sensing and communication functions within a shared frequency band and hardware platform, has emerged as a promising technology for future wireless systems.

Paper
Add Code

SdCT-GAN: Reconstructing CT from Biplanar X-Rays with Self-driven Generative Adversarial Networks

1 code implementation • 10 Sep 2023 • Shuangqin Cheng, Qingliang Chen, Qiyi Zhang, Ming Li, Yamuhanmode Alike, Kaile Su, Pengcheng Wen

Computed Tomography (CT) is a medical imaging modality that can generate more informative 3D images than 2D X-rays.

Computed Tomography (CT) Generative Adversarial Network

Paper
Code

RST-style Discourse Parsing Guided by Document-level Content Structures

no code implementations • 8 Sep 2023 • Ming Li, Ruihong Huang

Rhetorical Structure Theory based Discourse Parsing (RST-DP) explores how clauses, sentences, and large text spans compose a whole discourse and presents the rhetorical structure as a hierarchical tree.

Discourse Parsing Sentence

Paper
Add Code

DiffusionEngine: Diffusion Model is Scalable Data Engine for Object Detection

no code implementations • 7 Sep 2023 • Manlin Zhang, Jie Wu, Yuxi Ren, Ming Li, Jie Qin, Xuefeng Xiao, Wei Liu, Rui Wang, Min Zheng, Andy J. Ma

This paper reveals that the recently developed Diffusion Model is a scalable data engine for object detection.

Data Augmentation object-detection +1

Paper
Add Code

Leveraging ASR Pretrained Conformers for Speaker Verification through Transfer Learning and Knowledge Distillation

no code implementations • 6 Sep 2023 • Danwei Cai, Ming Li

This paper explores the use of ASR-pretrained Conformers for speaker verification, leveraging their strengths in modeling speech signals.

Knowledge Distillation Speaker Verification +1

Paper
Add Code

DLIP: Distilling Language-Image Pre-training

no code implementations • 24 Aug 2023 • Huafeng Kuang, Jie Wu, Xiawu Zheng, Ming Li, Xuefeng Xiao, Rui Wang, Min Zheng, Rongrong Ji

Furthermore, DLIP succeeds in retaining more than 95% of the performance with 22. 4% parameters and 24. 8% FLOPs compared to the teacher model and accelerates inference speed by 2. 7x.

Image Captioning Knowledge Distillation +5

Paper
Add Code

From Quantity to Quality: Boosting LLM Performance with Self-Guided Data Selection for Instruction Tuning

2 code implementations • 23 Aug 2023 • Ming Li, Yong Zhang, Zhitao Li, Jiuhai Chen, Lichang Chen, Ning Cheng, Jianzong Wang, Tianyi Zhou, Jing Xiao

In the realm of Large Language Models (LLMs), the balance between instruction data quality and quantity is a focal point.

Instruction Following

185

Paper
Code

The DKU-DUKEECE System for the Manipulation Region Location Task of ADD 2023

no code implementations • 20 Aug 2023 • Zexin Cai, Weiqing Wang, Yikang Wang, Ming Li

This paper introduces our system designed for Track 2, which focuses on locating manipulated regions, in the second Audio Deepfake Detection Challenge (ADD 2023).

Boundary Detection DeepFake Detection +2

Paper
Add Code

The DKU-MSXF Speaker Verification System for the VoxCeleb Speaker Recognition Challenge 2023

no code implementations • 17 Aug 2023 • Ze Li, Yuke Lin, Xiaoyi Qin, Ning Jiang, Guoqing Zhao, Ming Li

For Track 1, we utilize a network structure based on ResNet for training.

Domain Adaptation Semi-supervised Domain Adaptation +2

Paper
Add Code

The DKU-MSXF Diarization System for the VoxCeleb Speaker Recognition Challenge 2023

no code implementations • 15 Aug 2023 • Ming Cheng, Weiqing Wang, Xiaoyi Qin, Yuke Lin, Ning Jiang, Guoqing Zhao, Ming Li

This paper describes the DKU-MSXF submission to track 4 of the VoxCeleb Speaker Recognition Challenge 2023 (VoxSRC-23).

Action Detection Activity Detection +2

Paper
Add Code

Approximating Human-Like Few-shot Learning with GPT-based Compression

no code implementations • 14 Aug 2023 • Cynthia Huang, Yuqing Xie, Zhiying Jiang, Jimmy Lin, Ming Li

Leveraging the approximated information distance, our method allows the direct application of GPT models in quantitative text similarity measurements.

Data Compression Few-Shot Learning +6

Paper
Add Code

VoxBlink: A Large Scale Speaker Verification Dataset on Camera

no code implementations • 14 Aug 2023 • Yuke Lin, Xiaoyi Qin, Guoqing Zhao, Ming Cheng, Ning Jiang, Haiyang Wu, Ming Li

In this paper, we introduce a large-scale and high-quality audio-visual speaker verification dataset, named VoxBlink.

Speaker Recognition Speaker Verification

Paper
Add Code

A Novel Joint Angle-Range-Velocity Estimation Method for MIMO-OFDM ISAC Systems

no code implementations • 7 Aug 2023 • Zichao Xiao, Rang Liu, Ming Li, Qian Liu

Therefore, the proposed joint estimation algorithm can achieve larger processing gains and higher resolution by fully exploiting echo signals and jointly estimating the angle-range-velocity information.

Paper
Add Code

Masked and Swapped Sequence Modeling for Next Novel Basket Recommendation in Grocery Shopping

1 code implementation • 2 Aug 2023 • Ming Li, Mozhdeh Ariannezhad, Andrew Yates, Maarten de Rijke

In next basket recommendation (NBR), it is useful to distinguish between repeat items, i. e., items that a user has consumed before, and explore items, i. e., items that a user has not consumed before.

Next-basket recommendation

Paper
Code

AlignDet: Aligning Pre-training and Fine-tuning in Object Detection

1 code implementation • ICCV 2023 • Ming Li, Jie Wu, Xionghui Wang, Chen Chen, Jie Qin, Xuefeng Xiao, Rui Wang, Min Zheng, Xin Pan

To this end, we propose AlignDet, a unified pre-training framework that can be adapted to various existing detectors to alleviate the discrepancies.

object-detection Object Detection

129

Paper
Code

Robust Graph Structure Learning with the Alignment of Features and Adjacency Matrix

no code implementations • 5 Jul 2023 • Shaogao Lv, Gang Wen, Shiyu Liu, Linsen Wei, Ming Li

Overall, our research highlights the importance of integrating feature and graph information alignment in GSL, as inspired by our derived theoretical result, and showcases the superiority of our approach in handling noisy graph structures through comprehensive experiments on real-world datasets.

Graph structure learning

Paper
Add Code

Thompson Sampling under Bernoulli Rewards with Local Differential Privacy

no code implementations • 3 Jul 2023 • Bo Jiang, Tianchi Zhao, Ming Li

This paper investigates the problem of regret minimization for multi-armed bandit (MAB) problems with local differential privacy (LDP) guarantee.

Thompson Sampling

Paper
Add Code

First Place Solution to the CVPR'2023 AQTC Challenge: A Function-Interaction Centric Approach with Spatiotemporal Visual-Language Alignment

1 code implementation • 23 Jun 2023 • Tom Tongjia Chen, Hongshan Yu, Zhengeng Yang, Ming Li, Zechuan Li, Jingwen Wang, Wei Miao, Wei Sun, Chen Chen

Affordance-Centric Question-driven Task Completion (AQTC) has been proposed to acquire knowledge from videos to furnish users with comprehensive and systematic instructions.

Human-Object Interaction Detection

Paper
Code

Permutation Equivariant Graph Framelets for Heterophilous Graph Learning

1 code implementation • 7 Jun 2023 • Jianfei Li, Ruigang Zheng, Han Feng, Ming Li, Xiaosheng Zhuang

The nature of heterophilous graphs is significantly different from that of homophilous graphs, which causes difficulties in early graph neural network models and suggests aggregations beyond the 1-hop neighborhood.

Graph Learning

Paper
Code

A Feature Reuse Framework with Texture-adaptive Aggregation for Reference-based Super-Resolution

1 code implementation • 2 Jun 2023 • Xiaoyong Mei, Yi Yang, Ming Li, Changqin Huang, Kai Zhang, Pietro Lió

In this study, we propose a feature reuse framework that guides the step-by-step texture reconstruction process through different stages, reducing the negative impacts of perceptual and adversarial loss.

Image Super-Resolution Reference-based Super-Resolution

Paper
Code

Low-Range-Sidelobe Waveform Design for MIMO-OFDM ISAC Systems

no code implementations • 30 May 2023 • Peishi Li, Zichao Xiao, Ming Li, Rang Liu, Qian Liu

Integrated sensing and communication (ISAC) is a promising technology in future wireless systems owing to its efficient hardware and spectrum utilization.

Paper
Add Code

AUC Optimization from Multiple Unlabeled Datasets

no code implementations • 25 May 2023 • Zheng Xie, Yu Liu, Ming Li

In this paper, we study the problem of building an AUC (area under ROC curve) optimization model from multiple unlabeled datasets, which maximizes the pairwise ranking ability of the classifier.

Weakly-supervised Learning

Paper
Add Code

Weakly Supervised AUC Optimization: A Unified Partial AUC Approach

no code implementations • 23 May 2023 • Zheng Xie, Yu Liu, Hao-Yuan He, Ming Li, Zhi-Hua Zhou

Since acquiring perfect supervision is usually difficult, real-world machine learning tasks often confront inaccurate, incomplete, or inexact supervision, collectively referred to as weak supervision.

Paper
Add Code

Stability and Generalization of lp-Regularized Stochastic Learning for GCN

no code implementations • 20 May 2023 • Shiyu Liu, Linsen Wei, Shaogao Lv, Ming Li

For a single-layer GCN, we establish an explicit theoretical understanding of GCN with the $\ell_p$-regularized stochastic learning by analyzing the stability of our SGD proximal algorithm.

Graph Learning

Paper
Add Code

Think Outside the Code: Brainstorming Boosts Large Language Models in Code Generation

no code implementations • 18 May 2023 • Xin-Ye Li, Jiang-Tian Xue, Zheng Xie, Ming Li

We demonstrate that Brainstorm significantly enhances the ability of LLMs to solve competition-level programming problems, resulting in a more than 50% increase in the pass@$k$ metrics for ChatGPT on the CodeContests benchmark, achieving state-of-the-art performance.

Code Generation

Paper
Add Code

A Unified Framework for Exploratory Learning-Aided Community Detection Under Topological Uncertainty

no code implementations • 10 Apr 2023 • Yu Hou, Cong Tran, Ming Li, Won-Yong Shin

In social networks, the discovery of community structures has received considerable attention as a fundamental problem in various network analysis tasks.

Community Detection Computational Efficiency

Paper
Add Code

FAN: Fatigue-Aware Network for Click-Through Rate Prediction in E-commerce Recommendation

1 code implementation • 10 Apr 2023 • Ming Li, Naiyin Liu, Xiaofeng Pan, Yang Huang, Ningning Li, Yingmin Su, Chengjun Mao, Bo Cao

Then the frequency spectrum is modulated by category information of the target item to model the bias that both the upper bound of fatigue and users' patience is different for different categories.

Click-Through Rate Prediction Time Series

Paper
Code

Moving Obstacle Collision Avoidance via Chance-Constrained MPC with CBF

no code implementations • 4 Apr 2023 • Ming Li, Zhiyong Sun, Zirui Liao, Siep Weiland

Model predictive control (MPC) with control barrier functions (CBF) is a promising solution to address the moving obstacle collision avoidance (MOCA) problem.

Collision Avoidance Model Predictive Control

Paper
Add Code

FreeSeg: Unified, Universal and Open-Vocabulary Image Segmentation

no code implementations • CVPR 2023 • Jie Qin, Jie Wu, Pengxiang Yan, Ming Li, Ren Yuxi, Xuefeng Xiao, Yitong Wang, Rui Wang, Shilei Wen, Xin Pan, Xingang Wang

Recently, open-vocabulary learning has emerged to accomplish segmentation for arbitrary categories of text-based descriptions, which popularizes the segmentation system to more general-purpose application scenarios.

Ranked #6 on Open Vocabulary Panoptic Segmentation on ADE20K

Image Segmentation Instance Segmentation +3

Paper
Add Code

XVoxel-Based Parametric Design Optimization of Feature Models

no code implementations • 17 Mar 2023 • Ming Li, Chengfeng Lin, Wei Chen, Yusheng Liu, Shuming Gao, Qiang Zou

As such, it can establish a direct mapping between design models and analysis models, which in turn enables automatic updates on simulation results for design modifications, and vice versa -- effectively a closed loop between CAD and CAE.

Computational Efficiency

Paper
Add Code

AERK: Aligned Entropic Reproducing Kernels through Continuous-time Quantum Walks

no code implementations • 4 Mar 2023 • Lixin Cui, Ming Li, Yue Wang, Lu Bai, Edwin R. Hancock

For pairwise graphs, the proposed AERK kernel is defined by computing a reproducing kernel based similarity between the quantum Shannon entropies of their each pair of aligned vertices.

Graph Classification

Paper
Add Code

RIS-Aided Integrated Sensing and Communication: Joint Beamforming and Reflection Design

no code implementations • 22 Feb 2023 • Honghao Luo, Rang Liu, Ming Li, Qian Liu

Integrated sensing and communication (ISAC) has been envisioned as a promising technique to alleviate the spectrum congestion problem.

Paper
Add Code

Joint Transceiver Beamforming and Reflecting Design for Active RIS-Aided ISAC Systems

no code implementations • 21 Feb 2023 • Qi Zhu, Ming Li, Rang Liu, Qian Liu

Integrated sensing and communication (ISAC) is recognized as a promising technology with great potential in saving hardware and spectrum resources, since it simultaneously realizes radar detection and user communication functions in the fully-shared platform.

Paper
Add Code

Language-Specific Representation of Emotion-Concept Knowledge Causally Supports Emotion Inference

1 code implementation • 19 Feb 2023 • Ming Li, Yusheng Su, Hsiu-Yuan Huang, Jiali Cheng, Xin Hu, Xinmiao Zhang, Huadong Wang, Yujia Qin, Xiaozhi Wang, Kristen A. Lindquist, Zhiyuan Liu, Dan Zhang

Humans no doubt use language to communicate about their emotional experiences, but does language in turn help humans understand emotions, or is language just a vehicle of communication?

Attribute Language Modelling

Paper
Code

Self-supervised Geometric Features Discovery via Interpretable Attentio for Vehicle Re-Identification and Beyond (Complete Version)

1 code implementation • 5 Feb 2023 • Ming Li, Xinming Huang, Ziming Zhang

To learn distinguishable patterns, most of recent works in vehicle re-identification (ReID) struggled to redevelop official benchmarks to provide various supervisions, which requires prohibitive human labors.

Representation Learning Self-Supervised Learning +1

Paper
Code

SNR/CRB-Constrained Joint Beamforming and Reflection Designs for RIS-ISAC Systems

no code implementations • 26 Jan 2023 • Rang Liu, Ming Li, Qian Liu, A. Lee Swindlehurst

Two optimization problems are formulated for maximizing the achievable sum-rate of the multi-user communications under an SNR constraint for target detection or a CRB constraint for parameter estimation, the transmit power budget, and the unit-modulus constraint of the RIS reflection coefficients.

Paper
Add Code

STPrivacy: Spatio-Temporal Privacy-Preserving Action Recognition

no code implementations • ICCV 2023 • Ming Li, Xiangyu Xu, Hehe Fan, Pan Zhou, Jun Liu, Jia-Wei Liu, Jiahe Li, Jussi Keppo, Mike Zheng Shou, Shuicheng Yan

For the first time, we introduce vision Transformers into PPAR by treating a video as a tubelet sequence, and accordingly design two complementary mechanisms, i. e., sparsification and anonymization, to remove privacy from a spatio-temporal perspective.

Action Recognition Facial Expression Recognition (FER) +2

Paper
Add Code

Deep Learning-Based UAV Aerial Triangulation without Image Control Points

no code implementations • 7 Jan 2023 • Jiageng Zhong, Ming Li, Jiangying Qin, Hanqi Zhang

The emerging drone aerial survey has the advantages of low cost, high efficiency, and flexible use.

Image Registration POS

Paper
Add Code

A Theory of Human-Like Few-Shot Learning

no code implementations • 3 Jan 2023 • Zhiying Jiang, Rui Wang, Dongbo Bu, Ming Li

We aim to bridge the gap between our common-sense few-sample human learning and large-data machine learning.

Common Sense Reasoning Few-Shot Learning

Paper
Add Code

Class Balanced Adaptive Pseudo Labeling for Federated Semi-Supervised Learning

no code implementations • CVPR 2023 • Ming Li, Qingli Li, Yan Wang

The second key element is that we design class balanced adaptive thresholds via considering the empirical distribution of all training data in local clients, to encourage a balanced training process.

Paper
Add Code

Combining Photogrammetric Computer Vision and Semantic Segmentation for Fine-grained Understanding of Coral Reef Growth under Climate Change

1 code implementation • 8 Dec 2022 • Jiageng Zhong, Ming Li, Hanqi Zhang, Jiangying Qin

Corals are the primary habitat-building life-form on reefs that support a quarter of the species in the ocean.

Semantic Segmentation

Paper
Code

An operational framework to automatically evaluate the quality of weather observations from third-party stations

no code implementations • 5 Dec 2022 • Quanxi Shao, Ming Li, Joel Janek Dabrowski, Shuvo Bakar, Ashfaqur Rahman, Andrea Powell, Brent Henderson

With increasing number of crowdsourced private automatic weather stations (called TPAWS) established to fill the gap of official network and obtain local weather information for various purposes, the data quality is a major concern in promoting their usage.

Paper
Add Code

Joint Secure Transmit Beamforming Designs for Integrated Sensing and Communication Systems

no code implementations • 1 Dec 2022 • Jinjin Chu, Rang Liu, Ming Li, Yang Liu, Qian Liu

Integrated sensing and communication (ISAC), which allows individual radar and communication systems to share the same spectrum bands, is an emerging and promising technique for alleviating spectrum congestion problems.

Paper
Add Code

HAQJSK: Hierarchical-Aligned Quantum Jensen-Shannon Kernels for Graph Classification

no code implementations • 5 Nov 2022 • Lu Bai, Lixin Cui, Yue Wang, Ming Li, Edwin R. Hancock

In this work, we propose a family of novel quantum kernels, namely the Hierarchical Aligned Quantum Jensen-Shannon Kernels (HAQJSK), for un-attributed graphs.

Graph Classification

Paper
Add Code

Waveform Boundary Detection for Partially Spoofed Audio

no code implementations • 1 Nov 2022 • Zexin Cai, Weiqing Wang, Ming Li

The present paper proposes a waveform boundary detection system for audio spoofing attacks containing partially manipulated segments.

Boundary Detection

Paper
Add Code

Target-Speaker Voice Activity Detection via Sequence-to-Sequence Prediction

no code implementations • 28 Oct 2022 • Ming Cheng, Weiqing Wang, Yucong Zhang, Xiaoyi Qin, Ming Li

Target-speaker voice activity detection is currently a promising approach for speaker diarization in complex acoustic environments.

Action Detection Activity Detection +2

Paper
Add Code

Laugh Betrays You? Learning Robust Speaker Representation From Speech Containing Non-Verbal Fragments

no code implementations • 28 Oct 2022 • Yuke Lin, Xiaoyi Qin, Huahua Cui, Zhenyi Zhu, Ming Li

We collect a set of clips with laughter components by conducting a laughter detection script on VoxCeleb and part of the CN-Celeb dataset.

Speaker Verification

Paper
Add Code

End-to-End Learning for Symbol-Level Precoding and Detection with Adaptive Modulation

no code implementations • 25 Oct 2022 • Rang Liu, Zhu Bo, Ming Li, Qian Liu

To overcome the performance bottleneck of these approaches, in this letter we propose an end-to-end learning based approach to jointly optimize the modulation orders, the transmit precoding and the receive detection for an SLP communication system.

Paper
Add Code

Less is More: A Lightweight and Robust Neural Architecture for Discourse Parsing

no code implementations • 18 Oct 2022 • Ming Li, Ruihong Huang

Complex feature extractors are widely employed for text representation building.

Discourse Parsing

Paper
Add Code

RibSeg v2: A Large-scale Benchmark for Rib Labeling and Anatomical Centerline Extraction

1 code implementation • 18 Oct 2022 • Liang Jin, Shixuan Gu, Donglai Wei, Jason Ken Adhinarta, Kaiming Kuang, Yongjie Jessica Zhang, Hanspeter Pfister, Bingbing Ni, Jiancheng Yang, Ming Li

Based on the RibSeg v2, we develop a pipeline including deep learning-based methods for rib labeling, and a skeletonization-based method for centerline extraction.

Computational Efficiency Segmentation

Paper
Code

Deepfake Detection System for the ADD Challenge Track 3.2 Based on Score Fusion

no code implementations • 13 Oct 2022 • Yuxiang Zhang, Jingze Lu, Xingming Wang, Zhuo Li, Runqiu Xiao, Wenchao Wang, Ming Li, Pengyuan Zhang

The overfitting of the model to the training set leads to extreme values of the scores and low correlation of the score distributions, which makes score fusion difficult.

Data Augmentation DeepFake Detection +1

Paper
Add Code

Joint Beamforming Designs for Active Reconfigurable Intelligent Surface: A Sub-Connected Array Architecture

no code implementations • 5 Oct 2022 • Qi Zhu, Ming Li, Rang Liu, Yang Liu, Qian Liu

Affected by the "double fading" effect, however, conventional passive RIS cannot bring considerable performance improvement when users are not close enough to RIS.

Paper
Add Code

The DKU-DukeECE Diarization System for the VoxCeleb Speaker Recognition Challenge 2022

no code implementations • 4 Oct 2022 • Weiqing Wang, Xiaoyi Qin, Ming Cheng, Yucong Zhang, Kangyue Wang, Ming Li

This paper discribes the DKU-DukeECE submission to the 4th track of the VoxCeleb Speaker Recognition Challenge 2022 (VoxSRC-22).

Action Detection Activity Detection +2

Paper
Add Code

Data-driven and machine-learning based prediction of wave propagation behavior in dam-break flood

no code implementations • 19 Sep 2022 • Changli Li, Zheng Han, Yange Li, Ming Li, Weidong Wang

To show the performance of the RC-ESN model, we also provide a sensitivity analysis of the prediction accuracy concerning the key parameters including training set size, reservoir size, and spectral radius.

Paper
Add Code

The 2022 Far-field Speaker Verification Challenge: Exploring domain mismatch and semi-supervised learning under the far-field scenario

1 code implementation • 12 Sep 2022 • Xiaoyi Qin, Ming Li, Hui Bu, Shrikanth Narayanan, Haizhou Li

In addition, a supplementary set for the FFSVC2020 dataset is released this year.

Speaker Verification

Paper
Code

Optimization for Reflection and Transmission Dual-Functional Active RIS-Assisted Systems

no code implementations • 5 Sep 2022 • Yanan Ma, Ming Li, Yang Liu, Qingqing Wu, Qian Liu

Reconfigurable intelligent surface (RIS) has been deemed as one of potential components of future wireless communication systems because it can adaptively manipulate the wireless propagation environment with low-cost passive devices.

Paper
Add Code

Multi-frequency PolSAR Image Fusion Classification Based on Semantic Interactive Information and Topological Structure

no code implementations • 5 Sep 2022 • Yice Cao, Yan Wu, Ming Li, Mingjie Zheng, Peng Zhang, Jili Wang

Finally, an adaptive weighting fusion (AWF) strategy is proposed to merge inference from different bands, so as to make the MF joint classification decisions of SIC and TPC.

Classification Image Classification +1

Paper
Add Code

Joint Beamforming Design for Intelligent Omni Surface Assisted Wireless Communication Systems

no code implementations • 1 Sep 2022 • Wenhao Cai, Ming Li, Yang Liu, Qingqing Wu, Qian Liu

Intelligent reflecting surface (IRS) has been widely considered as one of the key enabling techniques for future wireless communication networks owing to its ability of dynamically controlling the phase shift of reflected electromagnetic (EM) waves to construct a favorable propagation environment.

Paper
Add Code

Non-Cooperative Resource Management for Intelligent Reflecting Surface Aided Networks

no code implementations • 1 Sep 2022 • Wenhao Cai, Ming Li, Qian Liu

Intelligent reflecting surface (IRS) has emerged as a promising and revolutionizing technology for future wireless networks.

Management

Paper
Add Code

Multi-Granularity Distillation Scheme Towards Lightweight Semi-Supervised Semantic Segmentation

1 code implementation • 22 Aug 2022 • Jie Qin, Jie Wu, Ming Li, Xuefeng Xiao, Min Zheng, Xingang Wang

Consequently, we offer the first attempt to provide lightweight SSSS models via a novel multi-granularity distillation (MGD) scheme, where multi-granularity is captured from three aspects: i) complementary teacher structure; ii) labeled-unlabeled data cooperative distillation; iii) hierarchical and multi-levels loss setting.

Knowledge Distillation Semi-Supervised Semantic Segmentation

Paper
Code

Reduced Implication-bias Logic Loss for Neuro-Symbolic Learning

no code implementations • 14 Aug 2022 • Haoyuan He, WangZhou Dai, Ming Li

Integrating logical reasoning and machine learning by approximating logical inference with differentiable operators is a widely used technique in Neuro-Symbolic systems.

Logical Reasoning

Paper
Add Code

User Association and Hybrid Beamforming Designs for Cooperative mmWave MIMO Systems

no code implementations • 10 Aug 2022 • Pengfei Ni, Rang Liu, Ming Li, Qian Liu

In an effort to further exploit multiple-antenna diversities, we also consider the dynamic subarray architecture and propose a novel antenna design algorithm for the analog beamforming design.

Paper
Add Code

Partially Distributed Beamforming Design for RIS-Aided Cell-Free Networks

no code implementations • 10 Aug 2022 • Pengfei Ni, Ming Li, Rang Liu, Qian Liu

Cell-free networks are regarded as a promising technology to meet higher rate requirements for beyond fifth-generation (5G) communications.

Paper
Add Code

Joint Beamforming Design for RIS-Assisted Integrated Sensing and Communication Systems

no code implementations • 3 Aug 2022 • Honghao Luo, Rang Liu, Ming Li, Yang Liu, Qian Liu

Integrated sensing and communication (ISAC) has been envisioned as a promising technology to tackle the spectrum congestion problem for future networks.

Paper
Add Code

The DKU-OPPO System for the 2022 Spoofing-Aware Speaker Verification Challenge

no code implementations • 15 Jul 2022 • Xingming Wang, Xiaoyi Qin, Yikang Wang, Yunfei Xu, Ming Li

For CM systems, we propose two methods on top of the challenge baseline to further improve the performance, namely Embedding Random Sampling Augmentation (ERSA) and One-Class Confusion Loss(OCCL).

Speaker Verification

Paper
Add Code

Cross-Age Speaker Verification: Learning Age-Invariant Speaker Embeddings

1 code implementation • 13 Jul 2022 • Xiaoyi Qin, Na Li, Chao Weng, Dan Su, Ming Li

In this paper, we mine cross-age test sets based on the VoxCeleb dataset and propose our age-invariant speaker representation(AISR) learning method.

Age Estimation Speaker Verification

Paper
Code

Online Target Speaker Voice Activity Detection for Speaker Diarization

no code implementations • 13 Jul 2022 • Weiqing Wang, Qingjian Lin, Ming Li

We iteratively extract the results for each block and update the target speaker embedding until reaching the end of the signal.

Action Detection Activity Detection +3

Paper
Add Code

Discriminator-Guided Model-Based Offline Imitation Learning

no code implementations • 1 Jul 2022 • Wenjia Zhang, Haoran Xu, Haoyi Niu, Peng Cheng, Ming Li, Heming Zhang, Guyue Zhou, Xianyuan Zhan

In this paper, we propose the Discriminator-guided Model-based offline Imitation Learning (DMIL) framework, which introduces a discriminator to simultaneously distinguish the dynamics correctness and suboptimality of model rollout data against real expert demonstrations.

Imitation Learning

Paper
Add Code

When to Trust Your Simulator: Dynamics-Aware Hybrid Offline-and-Online Reinforcement Learning

1 code implementation • 27 Jun 2022 • Haoyi Niu, Shubham Sharma, Yiwen Qiu, Ming Li, Guyue Zhou, Jianming Hu, Xianyuan Zhan

This brings up a new question: is it possible to combine learning from limited real data in offline RL and unrestricted exploration through imperfect simulators in online RL to address the drawbacks of both approaches?

Offline RL reinforcement-learning +1

Paper
Code

Few-Shot Non-Parametric Learning with Deep Latent Variable Model

no code implementations • 23 Jun 2022 • Zhiying Jiang, Yiqin Dai, Ji Xin, Ming Li, Jimmy Lin

Most real-world problems that machine learning algorithms are expected to solve face the situation with 1) unknown data distribution; 2) little domain-specific knowledge; and 3) datasets with limited annotation.

Classification Image Classification

Paper
Add Code

Parallel Pre-trained Transformers (PPT) for Synthetic Data-based Instance Segmentation

no code implementations • 22 Jun 2022 • Ming Li, Jie Wu, Jinhang Cai, Jie Qin, Yuxi Ren, Xuefeng Xiao, Min Zheng, Rui Wang, Xin Pan

Recently, Synthetic data-based Instance Segmentation has become an exceedingly favorable optimization paradigm since it leverages simulation rendering and physics to generate high-quality image-annotation pairs.

Instance Segmentation Segmentation +1

Paper
Add Code

Identifying Source Speakers for Voice Conversion based Spoofing Attacks on Speaker Verification Systems

no code implementations • 18 Jun 2022 • Danwei Cai, Zexin Cai, Ming Li

An automatic speaker verification system aims to verify the speaker identity of a speech signal.

Speaker Identification Speaker Verification +1

Paper
Add Code

Integrated Sensing and Communication with Reconfigurable Intelligent Surfaces: Opportunities, Applications, and Future Directions

no code implementations • 17 Jun 2022 • Rang Liu, Ming Li, Honghao Luo, Qian Liu, A. Lee Swindlehurst

Integrated sensing and communication (ISAC) is emerging as a key enabler to address the growing spectrum congestion problem and satisfy increasing demands for ubiquitous sensing and communication.

Paper
Add Code

Simultaneous Speech Extraction for Multiple Target Speakers under the Meeting Scenarios

no code implementations • 17 Jun 2022 • Bang Zeng, Hongbing Suo, Yulong Wan, Ming Li

The common target speech separation directly estimate the target source, ignoring the interrelationship between different speakers at each frame.

Action Detection Activity Detection +3

Paper
Add Code

Collaborative Knowledge Graph Fusion by Exploiting the Open Corpus

no code implementations • 15 Jun 2022 • Yue Wang, Yao Wan, Lu Bai, Lixin Cui, Zhuo Xu, Ming Li, Philip S. Yu, Edwin R Hancock

To alleviate the challenges of building Knowledge Graphs (KG) from scratch, a more general task is to enrich a KG using triples from an open corpus, where the obtained triples contain noisy entities and relations.

Event Extraction Knowledge Graphs

Paper
Add Code

Embedding Graphs on Grassmann Manifold

1 code implementation • 30 May 2022 • Bingxin Zhou, Xuebin Zheng, Yu Guang Wang, Ming Li, Junbin Gao

Learning efficient graph representation is the key to favorably addressing downstream tasks on graphs, such as node or graph property prediction.

Graph Embedding Graph Property Prediction +2

Paper
Code

MealRec: A Meal Recommendation Dataset

1 code implementation • 24 May 2022 • Ming Li, Lin Li, Qing Xie, Jingling Yuan, Xiaohui Tao

A publicly available dataset specialising in meal recommendation research for the research community is in urgent demand.

Recommendation Systems

Paper
Code

IRS-assisted Multi-cell Multi-band Systems: Practical Reflection Model and Joint Beamforming Design

no code implementations • 13 Apr 2022 • Wenhao Cai, Rang Liu, Ming Li, Yang Liu, Qingqing Wu, Qian Liu

Intelligent reflecting surface (IRS) has been regarded as a promising and revolutionary technology for future wireless communication systems owing to its capability of tailoring signal propagation environment in an energy/spectrum/hardware-efficient manner.

Paper
Add Code

Joint Beamforming and Reflection Design for RIS-assisted ISAC Systems

no code implementations • 1 Mar 2022 • Rang Liu, Ming Li, A. Lee Swindlehurst

In this paper, we investigate the potential of employing reconfigurable intelligent surface (RIS) in integrated sensing and communication (ISAC) systems.

Paper
Add Code

Fully-integrated multipurpose microwave frequency identification system on a single chip

no code implementations • 17 Feb 2022 • Yuhan Yao, Yuhe Zhao, Yanxian Wei, Feng Zhou, Daigao Chen, Yuguang Zhang, Xi Xiao, Ming Li, Jianji Dong, Shaohua Yu, Xinliang Zhang

We demonstrate a fully-integrated multipurpose microwave frequency identification system on silicon-on-insulator platform.

Paper
Add Code

Graph Neural Networks for Graphs with Heterophily: A Survey

no code implementations • 14 Feb 2022 • Xin Zheng, Yi Wang, Yixin Liu, Ming Li, Miao Zhang, Di Jin, Philip S. Yu, Shirui Pan

In the end, we point out the potential directions to advance and stimulate more future research and applications on heterophilic graph learning with GNNs.

Graph Learning

Paper
Add Code

Explainable COVID-19 Infections Identification and Delineation Using Calibrated Pseudo Labels

1 code implementation • 11 Feb 2022 • Ming Li, Yingying Fang, Zeyu Tang, Chibudom Onuorah, Jun Xia, Javier Del Ser, Simon Walsh, Guang Yang

We demonstrate the effectiveness of our model with the combination of limited labelled data and sufficient unlabelled data or weakly-labelled data.

Computed Tomography (CT) Decision Making +1

Paper
Code

Cross-Channel Attention-Based Target Speaker Voice Activity Detection: Experimental Results for M2MeT Challenge

no code implementations • 6 Feb 2022 • Weiqing Wang, Xiaoyi Qin, Ming Li

The multi-channel TS-VAD further reduces the DER by 28% and achieves a DER of 2. 26%.

Action Detection Activity Detection +2

Paper
Add Code

Invertible Voice Conversion

no code implementations • 26 Jan 2022 • Zexin Cai, Ming Li

In this paper, we propose an invertible deep learning framework called INVVC for voice conversion.

Voice Conversion

Paper
Add Code

pvCNN: Privacy-Preserving and Verifiable Convolutional Neural Network Testing

1 code implementation • 23 Jan 2022 • Jiasi Weng, Jian Weng, Gui Tang, Anjia Yang, Ming Li, Jia-Nan Liu

First, a CNN model to be tested is strategically partitioned into a private part kept locally by the model developer, and a public part outsourced to an outside server.

Privacy Preserving

Paper
Code

CaFT: Clustering and Filter on Tokens of Transformer for Weakly Supervised Object Localization

no code implementations • 3 Jan 2022 • Ming Li

Therefore, we propose Clustering and Filter of Tokens (CaFT) with Vision Transformer (ViT) backbone to solve this problem in another way.

Clustering Object +1

Paper
Add Code

MetaCVR: Conversion Rate Prediction via Meta Learning in Small-Scale Recommendation Scenarios

no code implementations • 27 Dec 2021 • Xiaofeng Pan, Ming Li, Jing Zhang, Keren Yu, Luping Wang, Hong Wen, Chengjun Mao, Bo Cao

At last, we develop an Ensemble Prediction Network (EPN) which incorporates the output of FRN and DMN to make the final CVR prediction.

Meta-Learning

Paper
Add Code

Joint Transmit Waveform and Passive Beamforming Design for RIS-Aided DFRC Systems

no code implementations • 16 Dec 2021 • Rang Liu, Ming Li, Yang Liu, Qingqing Wu, Qian Liu

Reconfigurable intelligent surface (RIS) is a promising technology for 6G networks owing to its superior ability to enhance the capacity and coverage of wireless communications by smartly creating a favorable propagation environment.

Paper
Add Code

Low-Latency Online Speaker Diarization with Graph-Based Label Generation

no code implementations • 27 Nov 2021 • Yucong Zhang, Qinjian Lin, Weiqing Wang, Lin Yang, Xuyang Wang, Junjie Wang, Ming Li

To ensure the low latency in the online setting, we introduce a variant of AHC, namely chkpt-AHC, to cluster the speakers.

Clustering speaker-diarization +1

Paper
Add Code

Towards Graph Self-Supervised Learning with Contrastive Adjusted Zooming

no code implementations • 20 Nov 2021 • Yizhen Zheng, Ming Jin, Shirui Pan, Yuan-Fang Li, Hao Peng, Ming Li, Zhao Li

To overcome the aforementioned problems, we introduce a novel self-supervised graph representation learning algorithm via Graph Contrastive Adjusted Zooming, namely G-Zoom, to learn node representations by leveraging the proposed adjusted zooming scheme.

Contrastive Learning Graph Representation Learning +1

Paper
Add Code

SIG-VC: A Speaker Information Guided Zero-shot Voice Conversion System for Both Human Beings and Machines

1 code implementation • 6 Nov 2021 • Haozhe Zhang, Zexin Cai, Xiaoyi Qin, Ming Li

Moreover, speaker information control is added to our system to maintain the voice cloning performance.

Disentanglement Speaker Verification +2

Paper
Code

PRECAD: Privacy-Preserving and Robust Federated Learning via Crypto-Aided Differential Privacy

no code implementations • 22 Oct 2021 • Xiaolan Gu, Ming Li, Li Xiong

In this paper, we develop a framework called PRECAD, which simultaneously achieves differential privacy (DP) and enhances robustness against model poisoning attacks with the help of cryptography.

Federated Learning Model Poisoning +1

Paper
Add Code

The Sigma-Max System Induced from Randomness and Fuzziness

no code implementations • 12 Oct 2021 • Wei Mei, Ming Li, Yuanzeng Cheng, Limin Liu

This paper managed to induce probability theory (sigma system) and possibility theory (max system) respectively from randomness and fuzziness, through which the premature theory of possibility is expected to be well founded.

Paper
Add Code

Automatic annotation of visual deep neural networks

no code implementations • 8 Oct 2021 • Ming Li, Chenhao Guo

Computer vision is widely used in the fields of driverless, face recognition and 3D reconstruction as a technology to help or replace human eye perception images or multidimensional data through computers.

3D Reconstruction Face Recognition

Paper
Add Code

A Time-Varying Endogenous Random Coefficient Model with an Application to Production Functions

no code implementations • 3 Oct 2021 • Ming Li

This paper proposes a random coefficient panel model where the regressors are correlated with the time-varying random coefficients in each period, a critical feature in many economic applications.

Paper
Add Code

A Next Basket Recommendation Reality Check

1 code implementation • 29 Sep 2021 • Ming Li, Sami Jullien, Mozhdeh Ariannezhad, Maarten de Rijke

We propose a set of metrics that measure the repeat/explore ratio and performance of NBR models.

Next-basket recommendation

Paper
Code

Two Birds, One Stone: Achieving both Differential Privacy and Certified Robustness for Pre-trained Classifiers via Input Perturbation

no code implementations • 29 Sep 2021 • Pengfei Tang, Wenjie Wang, Xiaolan Gu, Jian Lou, Li Xiong, Ming Li

To solve this challenge, a reconstruction network is built before the public pre-trained classifiers to offer certified robustness and defend against adversarial examples through input perturbation.

Image Classification

Paper
Add Code

The DKU-DukeECE System for the Self-Supervision Speaker Verification Task of the 2021 VoxCeleb Speaker Recognition Challenge

no code implementations • 7 Sep 2021 • Danwei Cai, Ming Li

Taking advantage of DNN's ability to learn from data with label noise, we propose to cluster the speaker embedding obtained from the previous speaker network and use the subsequent class assignments as pseudo labels to train a new DNN.

Pseudo Label Representation Learning +2

Paper
Add Code

The DKU-DukeECE-Lenovo System for the Diarization Task of the 2021 VoxCeleb Speaker Recognition Challenge

no code implementations • 5 Sep 2021 • Weiqing Wang, Danwei Cai, Qingjian Lin, Lin Yang, Junjie Wang, Jin Wang, Ming Li

This report describes the submission of the DKU-DukeECE-Lenovo team to the VoxCeleb Speaker Recognition Challenge (VoxSRC) 2021 track 4.

Action Detection Activity Detection +4

Paper
Add Code

Dual-Functional Radar-Communication Waveform Design: A Symbol-Level Precoding Approach

no code implementations • 11 Aug 2021 • Rang Liu, Ming Li, Qian Liu, A. Lee Swindlehurst

In this paper, we consider multi-input multi-output (MIMO) DFRC systems and focus on transmit beamforming designs to provide both radar sensing and multi-user communications.

Paper
Add Code

Reflection and Relay Dual-Functional RIS Assisted MU-MISO Systems

no code implementations • 24 Jul 2021 • Yanan Ma, Rang Liu, Yang Liu, Ming Li, Qian Liu

Reconfigurable intelligent surfaces (RISs) have been deemed as one of potential components of future wireless communication systems because they can adaptively manipulate the wireless propagation environment with low-cost passive devices.

Paper
Add Code

A Data-Driven Method for Recognizing Automated Negotiation Strategies

no code implementations • 3 Jul 2021 • Ming Li, Pradeep K. Murukannaiah, Catholijn M. Jonker

Our approach includes a data generation method for an agent to generate domain-independent sequences by negotiating with a variety of opponents across domains, a feature engineering method for representing negotiation data as time series with time-step features and overall features, and a hybrid (recurrent neural network-based) deep learning method for recognizing an opponent's strategy from the time series of bids.

Feature Engineering Time Series +1

Paper
Add Code

BS-RIS-User Association and Beamforming Designs for RIS-aided Cellular Networks

no code implementations • 27 Jun 2021 • Sifan Liu, Pengfei Ni, Rang Liu, Yang Liu, Ming Li, Qian Liu

During the dynamical access process, an iterative algorithm is proposed to alternatively obtain the active and passive beamforming.

Paper
Add Code

A Feature Fusion-Net Using Deep Spatial Context Encoder and Nonstationary Joint Statistical Model for High Resolution SAR Image Classification

no code implementations • 11 May 2021 • Wenkai Liang, Yan Wu, Ming Li, Peng Zhang, Yice Cao, Xin Hu

To address this problem, a novel end-to-end supervised classification method is proposed for HR SAR images by considering both spatial context and statistical features.

Image Classification

Paper
Add Code

Building Bilingual and Code-Switched Voice Conversion with Limited Training Data Using Embedding Consistency Loss

1 code implementation • 22 Apr 2021 • Yaogen Yang, Haozhe Zhang, Xiaoyi Qin, Shanshan Liang, Huahua Cui, Mingyang Xu, Ming Li

We achieve cross-lingual VC between Mandarin speech with multiple speakers and English speech with multiple speakers by applying bilingual bottleneck features.

Voice Cloning Voice Conversion

Paper
Code

End-to-End Mandarin Tone Classification with Short Term Context Information

no code implementations • 12 Apr 2021 • Jiyang Tang, Ming Li

In this paper, we propose an end-to-end Mandarin tone classification method from continuous speech utterances utilizing both the spectrogram and the short-term context information as the input.

General Classification

Paper
Add Code

The DKU System Description for The Interspeech 2021 Auto-KWS Challenge

no code implementations • 11 Apr 2021 • Yechen Wang, Yan Jia, Murong Ma, Zexin Cai, Ming Li

This paper introduces the system submitted by the DKU-SMIIP team for the Auto-KWS 2021 Challenge.

Dynamic Time Warping Keyword Spotting +3

Paper
Add Code

Binary Neural Network for Speaker Verification

no code implementations • 6 Apr 2021 • Tinglong Zhu, Xiaoyi Qin, Ming Li

Although deep neural networks are successful for many tasks in the speech domain, the high computational and memory costs of deep neural networks make it difficult to directly deploy highperformance Neural Network systems on low-resource embedded devices.

Binarization Quantization +1

Paper
Add Code

Computationally instrument-resolution-independent de novo peptide sequencing for high-resolution devices

2 code implementations • Nature Machine Intelligence 2021 • Rui Qiao, Ngoc Hieu Tran, Lei Xin, Xin Chen, Ming Li, Baozhen Shan, Ali Ghodsi

De novo peptide sequencing is the key technology for finding novel peptides from mass spectra.

de novo peptide sequencing

Paper
Code

Grassmann Graph Embedding

no code implementations • ICLR Workshop GTRL 2021 • Bingxin Zhou, Xuebin Zheng, Yu Guang Wang, Ming Li, Junbin Gao

Geometric deep learning that employs the geometric and topological features of data has attracted increasing attention in deep neural networks.

Dimensionality Reduction Graph Embedding

Paper
Add Code

How Framelets Enhance Graph Neural Networks

1 code implementation • 13 Feb 2021 • Xuebin Zheng, Bingxin Zhou, Junbin Gao, Yu Guang Wang, Pietro Lio, Ming Li, Guido Montufar

The graph neural networks with the proposed framelet convolution and pooling achieve state-of-the-art performance in many node and graph prediction tasks.

Denoising

Paper
Code

First Saturation Correction in High Energy Proton-Nucleus Collisions: I. Time evolution of classical Yang-Mills fields beyond leading order

no code implementations • 2 Feb 2021 • Ming Li, Vladimir V. Skokov

This paper is the first in a series of papers towards analytically completing the first saturation correction to physical observables in high energy proton-nucleus collisions.

High Energy Physics - Phenomenology Nuclear Theory

Paper
Add Code

Intelligent reflecting surface assisted multi-cell multi-band wireless networks

no code implementations • 5 Jan 2021 • Wenhao Cai, Rang Liu, Yang Liu, Ming Li, Qian Liu

Therefore, the practical phase shift model, which can describe the difference of IRS phase shift responses for the signals with different frequencies, should be utilized in the IRS optimization for wideband and multi-band systems.

Paper
Add Code

Channel Estimation for Practical IRS-Assisted OFDM Systems

no code implementations • 25 Dec 2020 • Wanning Yang, Hongyu Li, Ming Li, Yang Liu, Qian Liu

Different from the prior works which assume that IRS has an ideal reflection model, we perform channel estimation by considering amplitude-phase shift-frequency relationship for the response of practical IRS.

Paper
Add Code

A Unified Deep Speaker Embedding Framework for Mixed-Bandwidth Speech Data

no code implementations • 1 Dec 2020 • Weicheng Cai, Ming Li

This paper proposes a unified deep speaker embedding framework for modeling speech data with different sampling rates.

Bandwidth Extension Image Classification

Paper
Add Code

Intelligent Reflecting Surface Aided MISO Uplink Communication Network: Feasibility and Power Minimization for Perfect and Imperfect CSI

no code implementations • 22 Nov 2020 • Yang Liu, Jun Zhao, Ming Li, Qingqing Wu

In this paper, we consider the weighted sum-power minimization under quality-of-service (QoS) constraints in the multi-user multi-input-single-output (MISO) uplink wireless network assisted by intelligent reflecting surface (IRS).

Paper
Add Code

Training Wake Word Detection with Synthesized Speech Data on Confusion Words

no code implementations • 3 Nov 2020 • Yan Jia, Zexin Cai, Murong Ma, Zeqing Zhao, Xuyang Wang, Junjie Wang, Ming Li

Confusing-words are commonly encountered in real-life keyword spotting applications, which causes severe degradation of performance due to complex spoken terms and various kinds of words that sound similar to the predefined keywords.

Data Augmentation Keyword Spotting +1

Paper
Add Code

Self-supervised Geometric Features Discovery via Interpretable Attention for Vehicle Re-Identification and Beyond

1 code implementation • ICCV 2021 • Ming Li, Xinming Huang, Ziming Zhang

Representation Learning Self-Supervised Learning +2

Paper
Code

RNN Training along Locally Optimal Trajectories via Frank-Wolfe Algorithm

1 code implementation • 12 Oct 2020 • Yun Yue, Ming Li, Venkatesh Saligrama, Ziming Zhang

We propose to utilize the Frank-Wolfe (FW) algorithm in this context.

Paper
Code

PointIso: Point Cloud Based Deep Learning Model for Detecting Arbitrary-Precision Peptide Features in LC-MS Map through Attention Based Segmentation

1 code implementation • 15 Sep 2020 • Fatema Tuz Zohora, M Ziaur Rahman, Ngoc Hieu Tran, Lei Xin, Baozhen Shan, Ming Li

A promising technique of discovering disease biomarkers is to measure the relative protein abundance in multiple biofluid samples through liquid chromatography with tandem mass spectrometry (LC-MS/MS) based quantitative proteomics.

Paper
Code

Temporal optical neurons for serial deep learning

no code implementations • 4 Sep 2020 • Zhixing Lin, Shuqian Sun, Jose Azana, Wei Li, Ninghua Zhu, Ming Li

This concept represents a novel one-dimensional realization of artificial neural networks, enabling an efficient application of optical deep learning methods to the analysis and processing of serial data signals, while offering a new overall perspective for the temporal signal processing.

Paper
Add Code

LodoNet: A Deep Neural Network with 2D Keypoint Matchingfor 3D LiDAR Odometry Estimation

no code implementations • 1 Sep 2020 • Ce Zheng, Yecheng Lyu, Ming Li, Ziming Zhang

Deep learning based LiDAR odometry (LO) estimation attracts increasing research interests in the field of autonomous driving and robotics.

Autonomous Driving

Paper
Add Code

Don't Change Me! User-Controllable Selective Paraphrase Generation

no code implementations • EACL 2021 • Mohan Zhang, Luchen Tan, Zhengkai Tu, Zihang Fu, Kun Xiong, Ming Li, Jimmy Lin

The contribution of this work is a novel data generation technique using distant supervision that allows us to start with a pretrained sequence-to-sequence model and fine-tune a paraphrase generator that exhibits this behavior, allowing user-controllable paraphrase generation.

Paraphrase Generation

Paper
Add Code

How Powerful are Shallow Neural Networks with Bandlimited Random Weights?

no code implementations • 19 Aug 2020 • Ming Li, Sho Sonoda, Feilong Cao, Yu Guang Wang, Jiye Liang

Despite the well-known fact that a neural network is a universal approximator, in this study, we mathematically show that when hidden parameters are distributed in a bounded domain, the network may not achieve zero approximation error.

Learning Theory

Paper
Add Code

Synergy between Machine/Deep Learning and Software Engineering: How Far Are We?

no code implementations • 12 Aug 2020 • Simin Wang, LiGuo Huang, Jidong Ge, Tengfei Zhang, Haitao Feng, Ming Li, He Zhang, Vincent Ng

To improve the applicability and generalizability of research results, we analyzed what ingredients in a study would facilitate an understanding of why a ML/DL technique was selected for a specific SE problem.

Paper
Add Code

Mask Detection and Breath Monitoring from Speech: on Data Augmentation, Feature Representation and Modeling

no code implementations • 12 Aug 2020 • Haiwei Wu, Lin Zhang, Lin Yang, Xuyang Wang, Jun-Jie Wang, Dong Zhang, Ming Li

This paper introduces our approaches for the Mask and Breathing Sub-Challenge in the Interspeech COMPARE Challenge 2020.

Data Augmentation

Paper
Add Code

Intelligent Reflecting Surface based Passive Information Transmission: A Symbol-Level Precoding Approach

no code implementations • 29 Jul 2020 • Rang Liu, Ming Li, Qian Liu, A. Lee Swindlehurst, Qingqing Wu

Intelligent reflecting surfaces (IRS) have been proposed as a revolutionary technology owing to its capability of adaptively reconfiguring the propagation environment in a cost-effective and hardware-efficient fashion.

Paper
Add Code

Intelligent reflecting surface enhanced wideband MIMO-OFDM communications: From practical model to reflection optimization

no code implementations • 26 Jul 2020 • Hongyu Li, Wenhao Cai, Yang Liu, Ming Li, Qian Liu, Qingqing Wu

Simulation results demonstrate that the proposed algorithm can offer significant average sum-rate enhancement compared to that achieved using the ideal IRS reflection model, which confirms the importance of the use of the practical model for the design of wideband systems.

Paper
Add Code

MathNet: Haar-Like Wavelet Multiresolution-Analysis for Graph Representation and Learning

no code implementations • 22 Jul 2020 • Xuebin Zheng, Bingxin Zhou, Ming Li, Yu Guang Wang, Junbin Gao

In this paper, we propose a framework for graph neural networks with multiresolution Haar-like wavelets, or MathNet, with interrelated convolution and pooling strategies.

Graph Classification

Paper
Add Code

Prediction of the onset of cardiovascular diseases from electronic health records using multi-task gated recurrent units

no code implementations • 16 Jul 2020 • Fernando Andreotti, Frank S. Heldt, Basel Abu-Jamous, Ming Li, Avelino Javer, Oliver Carr, Stojan Jovanovic, Nadezda Lipunova, Benjamin Irving, Rabia T. Khan, Robert Dürichen

The proposed approach is compared to a standard clinical risk predictor (QRISK) and machine learning alternatives using 5-year data from a NHS Foundation Trust.

BIG-bench Machine Learning

Paper
Add Code

Demo: iJam with Channel Randomization

no code implementations • 7 Jul 2020 • Jordan L. Melcher, Yao Zheng, Dylan Anthony, Matthew Troglia, Yanjun Pan, Ming Li, Thomas Yang, Alvin Yang, Samson Aggelopoulos

Their secrecy rate (bit generation rate) depends heavily on the randomness of the channel, which may reduce significantly in a stable environment.

Paper
Add Code

Intelligent Reflecting Surface Aided MISO Uplink Communication Network: Feasibility and Power Minimization for Perfect and Imperfect CSI

no code implementations • 3 Jul 2020 • Yang Liu, Jun Zhao, Ming Li, Qingqing Wu

Paper
Add Code

MSA-MIL: A deep residual multiple instance learning model based on multi-scale annotation for classification and visualization of glomerular spikes

no code implementations • 2 Jul 2020 • Yilin Chen, Ming Li, Yongfei Wu, Xueyu Liu, Fang Hao, Daoxiang Zhou, Xiaoshuang Zhou, Chen Wang

Therefore, the proposed model can provide a good foundation for assisting the clinical doctors to diagnose the glomerular membranous nephropathy.

Classification General Classification +1

Paper
Add Code

Path Integral Based Convolution and Pooling for Graph Neural Networks

1 code implementation • NeurIPS 2020 • Zheng Ma, Junyu Xuan, Yu Guang Wang, Ming Li, Pietro Lio

Borrowing ideas from physics, we propose a path integral based graph neural networks (PAN) for classification and regression tasks on graphs.

Graph Classification Graph Regression +1

Paper
Code

TreeRNN: Topology-Preserving Deep GraphEmbedding and Learning

1 code implementation • 21 Jun 2020 • Yecheng Lyu, Ming Li, Xinming Huang, Ulkuhan Guler, Patrick Schaumont, Ziming Zhang

General graphs are difficult for learning due to their irregular structures.

Graph Classification Graph Embedding

Paper
Code

Learning to Utilize Correlated Auxiliary Noise: A Possible Quantum Advantage

no code implementations • 8 Jun 2020 • Aida Ahmadzadegan, Petar Simidzija, Ming Li, Achim Kempf

In effect, the network learns to use the correlated auxiliary noise as an approximate key to decipher its noisy input data.

Image Classification

Paper
Add Code

Practical Modeling and Beamforming for Intelligent Reflecting Surface Aided Wideband Systems

no code implementations • 2 Jun 2020 • Wenhao Cai, Hongyu Li, Ming Li, Qian Liu

In this letter, we aim to investigate the phase-amplitude-frequency relationship of the reflected signals and propose a practical model of reflection coefficient for an IRS-aided wideband system.

Paper
Add Code

Acoustic Word Embedding System for Code-Switching Query-by-example Spoken Term Detection

no code implementations • 24 May 2020 • Murong Ma, Haiwei Wu, Xuyang Wang, Lin Yang, Jun-Jie Wang, Ming Li

In this paper, we propose a deep convolutional neural network-based acoustic word embedding system on code-switching query by example spoken term detection.

Word Embeddings

Paper
Add Code

Cross-lingual Multispeaker Text-to-Speech under Limited-Data Scenario

no code implementations • 21 May 2020 • Zexin Cai, Yaogen Yang, Ming Li

In addition, we investigate the model's performance on the cross-lingual synthesis, with and without a bilingual dataset during training.

Attribute Speech Synthesis

Paper
Add Code

From Speaker Verification to Multispeaker Speech Synthesis, Deep Transfer with Feedback Constraint

1 code implementation • 10 May 2020 • Zexin Cai, Chuxiong Zhang, Ming Li

The constraint is taken by an added loss related to the speaker identity, which is centralized to improve the speaker similarity between the synthesized speech and its natural reference audio.

Speaker Verification Speech Synthesis +1

Paper
Code

Domain Aware Training for Far-field Small-footprint Keyword Spotting

no code implementations • 7 May 2020 • Haiwei Wu, Yan Jia, Yuanfei Nie, Ming Li

In this paper, we focus on the task of small-footprint keyword spotting under the far-field scenario.

Multi-Task Learning

Paper
Add Code

MV-RAN: Multiview recurrent aggregation network for echocardiographic sequences segmentation and full cardiac cycle analysis

no code implementations • 1 May 2020 • Ming Li, Chengjia Wang, Heye Zhang, Guang Yang

In addition, for a better interpretation of pathophysiological processes, clinical decision-making and prognosis, such cardiac anatomy segmentation and quantitative analysis of various clinical indices should ideally be performed for the data covering the full cardiac cycle.

Anatomy Decision Making +1

Paper
Add Code

Segatron: Segment-Aware Transformer for Language Modeling and Understanding

1 code implementation • 30 Apr 2020 • He Bai, Peng Shi, Jimmy Lin, Yuqing Xie, Luchen Tan, Kun Xiong, Wen Gao, Ming Li

To verify this, we propose a segment-aware Transformer (Segatron), by replacing the original token position encoding with a combined position encoding of paragraph, sentence, and token.

Ranked #20 on Language Modelling on WikiText-103

Language Modelling Masked Language Modeling +3

Paper
Code

Resource-Optimized Fermionic Local-Hamiltonian Simulation on Quantum Computer for Quantum Chemistry

no code implementations • 8 Apr 2020 • Qingfeng Wang, Ming Li, Christopher Monroe, Yunseong Nam

This framework, based on perturbation theory, is capable of improving the energy estimate at each cycle of the VQE progression, by about a factor of three closer to the known ground-state energy compared to the standard VQE approach in the test-bed, classically-accessible system of the water molecule.

Quantum Physics Emerging Technologies

Paper
Add Code

Semantics of the Unwritten: The Effect of End of Paragraph and Sequence Tokens on Text Generation with GPT2

1 code implementation • ACL 2021 • He Bai, Peng Shi, Jimmy Lin, Luchen Tan, Kun Xiong, Wen Gao, Jie Liu, Ming Li

Experimental results show that the Chinese GPT2 can generate better essay endings with \eop.

Language Modelling Story Generation

Paper
Code

DIHARD II is Still Hard: Experimental Results and Discussions from the DKU-LENOVO Team

no code implementations • 23 Feb 2020 • Qingjian Lin, Weicheng Cai, Lin Yang, Jun-Jie Wang, Jun Zhang, Ming Li

Our diarization system includes multiple modules, namely voice activity detection (VAD), segmentation, speaker embedding extraction, similarity scoring, clustering, resegmentation and overlap detection.

Action Detection Activity Detection +1

Paper
Add Code

Wireless Federated Learning with Local Differential Privacy

no code implementations • 12 Feb 2020 • Mohamed Seif, Ravi Tandon, Ming Li

In this paper, we study the problem of federated learning (FL) over a wireless channel, modeled by a Gaussian multiple access channel (MAC), subject to local differential privacy (LDP) constraints.

Cryptography and Security Information Theory Information Theory

Paper
Add Code

Diversity-Achieving Slow-DropBlock Network for Person Re-Identification

no code implementations • 9 Feb 2020 • Xiaofu Wu, Ben Xie, Shiliang Zhao, Suofei Zhang, Yong Xiao, Ming Li

In particular, we show that the feature diversity can be well achieved with the use of multiple dropping branches by setting individual dropping ratio for each branch.

Person Re-Identification

Paper
Add Code

Learning Diverse Features with Part-Level Resolution for Person Re-Identification

1 code implementation • 21 Jan 2020 • Ben Xie, Xiaofu Wu, Suofei Zhang, Shiliang Zhao, Ming Li

Learning diverse features is key to the success of person re-identification.

Ranked #3 on Person Re-Identification on Market-1501-C

Person Re-Identification

Paper
Code

GhostImage: Remote Perception Attacks against Camera-based Image Classification Systems

1 code implementation • 21 Jan 2020 • Yanmao Man, Ming Li, Ryan Gerdes

In vision-based object classification systems imaging sensors perceive the environment and machine learning is then used to detect and classify objects for decision-making purposes; e. g., to maneuver an automated vehicle around an obstacle or to raise an alarm to indicate the presence of an intruder in surveillance settings.

Autonomous Driving BIG-bench Machine Learning +4

Paper
Code

Deep Time-Stream Framework for Click-Through Rate Prediction by Tracking Interest Evolution

no code implementations • 8 Jan 2020 • Shu-Ting Shi, Wenhao Zheng, Jun Tang, Qing-Guo Chen, Yao Hu, Jianke Zhu, Ming Li

Click-through rate (CTR) prediction is an essential task in industrial applications such as video recommendation.

Click-Through Rate Prediction

Paper
Add Code

Logical Differencing in Dyadic Network Formation Models with Nontransferable Utilities

no code implementations • 3 Jan 2020 • Wayne Yuan Gao, Ming Li, Sheng Xu

This paper considers a semiparametric model of dyadic network formation under nontransferable utilities (NTU).

Paper
Add Code

RWF-2000: An Open Large Scale Video Database for Violence Detection

1 code implementation • 14 Nov 2019 • Ming Cheng, Kunjing Cai, Ming Li

In recent years, surveillance cameras are widely deployed in public places, and the general crime rate has been reduced significantly due to these ubiquitous devices.

Ranked #6 on Activity Recognition on RWF-2000

Action Classification Action Recognition

370

Paper
Code

Variational Quantum Algorithms for Dimensionality Reduction and Classification

no code implementations • 27 Oct 2019 • Jin-Min Liang, Shu-Qian Shen, Ming Li, Lei LI

In this work, we present a quantum neighborhood preserving embedding and a quantum local discriminant embedding for dimensionality reduction and classification.

Classification Dimensionality Reduction +1

Paper
Add Code

Parameter-Transferred Wasserstein Generative Adversarial Network (PT-WGAN) for Low-Dose PET Image Denoising

1 code implementation • 13 Oct 2019 • Yu Gong, Hongming Shan, Yueyang Teng, Ning Tu, Ming Li, Guodong Liang, Ge Wang, Shan-Shan Wang

The contributions of this paper are twofold: i) a PT-WGAN framework is designed to denoise low-dose PET images without compromising structural details, and ii) a task-specific initialization based on transfer learning is developed to train PT-WGAN using trainable parameters transferred from a pretrained model, which significantly improves the training efficiency of PT-WGAN.

Generative Adversarial Network Image Denoising +1

Paper
Code

Cannot find the paper you are looking for? You can Submit a new open access paper.