Search Results for author: Ming Li

Found 332 papers, 89 papers with code

Unsupervised Chunking as Syntactic Structure Induction with a Knowledge-Transfer Approach

1 code implementation Findings (EMNLP) 2021 Anup Anand Deshmukh, Qianqiu Zhang, Ming Li, Jimmy Lin, Lili Mou

In this paper, we address unsupervised chunking as a new task of syntactic structure induction, which is helpful for understanding the linguistic structures of human languages as well as processing low-resource languages.

Chunking Transfer Learning

Multi-Task Dense Retrieval via Model Uncertainty Fusion for Open-Domain Question Answering

1 code implementation Findings (EMNLP) 2021 Minghan Li, Ming Li, Kun Xiong, Jimmy Lin

Our method reaches state-of-the-art performance on 5 benchmark QA datasets, with up to 10% improvement in top-100 accuracy compared to a joint-training multi-task DPR on SQuAD.

Open-Domain Question Answering Retrieval

Disentangling the Complex Multiplexed DIA Spectra in De Novo Peptide Sequencing

1 code implementation24 Nov 2024 Zheng Ma, Zeping Mao, Ruixue Zhang, Jiazhen Chen, Lei Xin, Paul Shan, Ali Ghodsi, Ming Li

This paper also provides criteria about when DIA data could be used for de novo peptide sequencing and when not to by providing a comparison between DDA and DIA, in both de novo and database search mode.

de novo peptide sequencing

Sequence-to-Sequence Neural Diarization with Automatic Speaker Detection and Representation

no code implementations21 Nov 2024 Ming Cheng, Yuke Lin, Ming Li

1) Speaker Detection: The proposed approach can utilize incompletely given speaker embeddings to discover the unknown speaker and predict the target voice activities in the audio signal.

Action Detection Activity Detection +2

MARM: Unlocking the Future of Recommendation Systems through Memory Augmentation and Scalable Complexity

no code implementations14 Nov 2024 Xiao Lv, Jiangxia Cao, Shijie Guan, Xiaoyou Zhou, Zhiguang Qi, Yaqiang Zang, Ming Li, Ben Wang, Kun Gai, Guorui Zhou

Considering the above differences with LLM, we can draw a conclusion that: for a RecSys model, compared to model parameters, the computational complexity FLOPs is a more expensive factor that requires careful control.

Language Modelling Recommendation Systems

Script-centric behavior understanding for assisted autism spectrum disorder diagnosis

no code implementations14 Nov 2024 Wenxing Liu, Yueran Pan, Ming Li

Our pipeline converts video content into scripts that describe the behavior of characters, leveraging the generalizability of large language models to detect ASD in a zero-shot or few-shot manner.

Layer-Wise Feature Metric of Semantic-Pixel Matching for Few-Shot Learning

1 code implementation10 Nov 2024 Hao Tang, Junhao Lu, Guoheng Huang, Ming Li, Xuhang Chen, Guo Zhong, Zhengguang Tan, Zinuo Li

In Few-Shot Learning (FSL), traditional metric-based approaches often rely on global metrics to compute similarity.

Few-Shot Learning

Urban Flood Mapping Using Satellite Synthetic Aperture Radar Data: A Review of Characteristics, Approaches and Datasets

no code implementations6 Nov 2024 Jie Zhao, Ming Li, Yu Li, Patrick Matgen, Marco Chini

Besides, we evaluated the Technology Readiness Levels (TRLs) of urban flood mapping techniques to identify challenges and future research areas.

Multi-Reward as Condition for Instruction-based Image Editing

no code implementations6 Nov 2024 Xin Gu, Ming Li, Libo Zhang, Fan Chen, Longyin Wen, Tiejian Luo, Sijie Zhu

1) we first design a quantitative metric system based on best-in-class LVLM (Large Vision Language Model), i. e., GPT-4o in our case, to evaluate the generation quality from 3 perspectives, namely, instruction following, detail preserving, and generation quality.

Descriptive Instruction Following

EEE-Bench: A Comprehensive Multimodal Electrical And Electronics Engineering Benchmark

no code implementations3 Nov 2024 Ming Li, Jike Zhong, Tianle Chen, Yuxiang Lai, Konstantinos Psounis

In summary, we believe EEE-Bench not only reveals some noteworthy limitations of LMMs but also provides a valuable resource for advancing research on their application in practical engineering tasks, driving future improvements in their capability to handle complex, real-world scenarios.

Inference in a Stationary/Nonstationary Autoregressive Time-Varying-Parameter Model

no code implementations1 Nov 2024 Donald W. K. Andrews, Ming Li

This paper considers nonparametric estimation and inference in first-order autoregressive (AR(1)) models with deterministically time-varying parameters.

Unity

A Survey on Bundle Recommendation: Methods, Applications, and Challenges

1 code implementation1 Nov 2024 Meng Sun, Lin Li, Ming Li, Xiaohui Tao, Dong Zhang, Peipei Wang, Jimmy Xiangji Huang

In recent years, bundle recommendation systems have gained significant attention in both academia and industry due to their ability to enhance user experience and increase sales by recommending a set of items as a bundle rather than individual items.

Recommendation Systems Representation Learning +1

Estimation and Inference in Dyadic Network Formation Models with Nontransferable Utilities

no code implementations31 Oct 2024 Ming Li, Zhentao Shi, Yapeng Zheng

This paper studies estimation and inference in a dyadic network formation model with observed covariates, unobserved heterogeneity, and nontransferable utilities.

How Do Flow Matching Models Memorize and Generalize in Sample Data Subspaces?

no code implementations31 Oct 2024 Weiguo Gao, Ming Li

In this work, we provide theoretical insights into this challenge by leveraging Flow Matching models, which transform a simple prior into a complex target distribution via a learned velocity field.

Dimensionality Reduction

What Happened in LLMs Layers when Trained for Fast vs. Slow Thinking: A Gradient Perspective

1 code implementation31 Oct 2024 Ming Li, Yanhong Li, Tianyi Zhou

We are specifically interested in how fast vs. slow thinking affects the layer-wise gradients, given the recent popularity of training LLMs on reasoning paths such as chain-of-thoughts (CoT) and process rewards.

CRB Optimization using a Parametric Scattering Model for Extended Targets in ISAC Systems

no code implementations31 Oct 2024 Rang Liu, A. Lee Swindlehurst, Ming Li

This paper presents a novel parametric scattering model (PSM) for sensing extended targets in integrated sensing and communication (ISAC) systems.

Large Language Model Benchmarks in Medical Tasks

no code implementations28 Oct 2024 Lawrence K. Q. Yan, Ming Li, Yichao Zhang, Caitlyn Heqi Yin, Cheng Fei, Benji Peng, Ziqian Bi, Pohsun Feng, Keyu Chen, Junyu Liu, Qian Niu

With the increasing application of large language models (LLMs) in the medical domain, evaluating these models' performance using benchmark datasets has become crucial.

Image Captioning Language Modelling +5

Deep Learning, Machine Learning -- Digital Signal and Image Processing: From Theory to Application

no code implementations27 Oct 2024 Weiche Hsieh, Ziqian Bi, Junyu Liu, Benji Peng, Sen Zhang, Xuanhe Pan, Jiawei Xu, Jinlang Wang, Keyu Chen, Caitlyn Heqi Yin, Pohsun Feng, Yizhu Wen, Tianyang Wang, Ming Li, Jintao Ren, Qian Niu, Silin Chen, Ming Liu

Digital Signal Processing (DSP) and Digital Image Processing (DIP) with Machine Learning (ML) and Deep Learning (DL) are popular research areas in Computer Vision and related fields.

Image Enhancement

Label Filling via Mixed Supervision for Medical Image Segmentation from Noisy Annotations

no code implementations21 Oct 2024 Ming Li, Wei Shen, Qingli Li, Yan Wang

The fundamental idea of label filling is to supervise the segmentation model by a subset of pixels with trustworthy labels, meanwhile filling labels of other pixels by mixed supervision.

Image Segmentation Lesion Segmentation +2

Joint Space-Time Adaptive Processing and Beamforming Design for Cell-Free ISAC Systems

no code implementations18 Oct 2024 Rang Liu, Ming Li, Qian Liu

In this paper, we explore cooperative sensing and communication within cell-free integrated sensing and communication (ISAC) systems.

BenTo: Benchmark Task Reduction with In-Context Transferability

1 code implementation17 Oct 2024 Hongyu Zhao, Ming Li, Lichao Sun, Tianyi Zhou

Evaluating large language models (LLMs) is costly: it requires the generation and examination of LLM outputs on a large-scale benchmark of various tasks.

In-Context Learning MMLU

DOA Estimation-Oriented Joint Array Partitioning and Beamforming Designs for ISAC Systems

no code implementations16 Oct 2024 Rang Liu, Ming Li, Qian Liu, A. Lee Swindlehurst

Integrated sensing and communication has been identified as an enabling technology for forthcoming wireless networks.

A Hierarchical DRL Approach for Resource Optimization in Multi-RIS Multi-Operator Networks

no code implementations16 Oct 2024 Haocheng Zhang, Wei Wang, Hao Zhou, Zhiping Lu, Ming Li

As reconfigurable intelligent surfaces (RIS) emerge as a pivotal technology in the upcoming sixth-generation (6G) networks, their deployment within practical multiple operator (OP) networks presents significant challenges, including the coordination of RIS configurations among OPs, interference management, and privacy maintenance.

Deep Reinforcement Learning Management

RealEra: Semantic-level Concept Erasure via Neighbor-Concept Mining

no code implementations11 Oct 2024 Yufan Liu, Jinyang An, Wanqian Zhang, Ming Li, Dayan Wu, Jingzi Gu, Zheng Lin, Weiping Wang

The remarkable development of text-to-image generation models has raised notable security concerns, such as the infringement of portrait rights and the generation of inappropriate content.

Specificity Text-to-Image Generation

Deeper Insights into Deep Graph Convolutional Networks: Stability and Generalization

no code implementations11 Oct 2024 Guangrui Yang, Ming Li, Han Feng, Xiaosheng Zhuang

Graph convolutional networks (GCNs) have emerged as powerful models for graph learning tasks, exhibiting promising performance in various domains.

Graph Learning

Adversarial Attacks and Robust Defenses in Speaker Embedding based Zero-Shot Text-to-Speech System

no code implementations5 Oct 2024 Ze Li, Yao Shi, Yunfei Xu, Ming Li

Speaker embedding based zero-shot Text-to-Speech (TTS) systems enable high-quality speech synthesis for unseen speakers using minimal data.

Adversarial Purification Speech Synthesis +1

Optimal Control in Both Steady State and Transient Process with Unknown Disturbances

no code implementations4 Oct 2024 Ming Li, Zhaojian Wang, Feng Liu, Ming Cao, Bo Yang

The scheme of online optimization as a feedback controller is widely used to steer the states of a physical system to the optimal solution of a predefined optimization problem.

Deep Learning and Machine Learning: Advancing Big Data Analytics and Management with Design Patterns

no code implementations4 Oct 2024 Keyu Chen, Ziqian Bi, Tianyang Wang, Yizhu Wen, Pohsun Feng, Qian Niu, Junyu Liu, Benji Peng, Sen Zhang, Ming Li, Xuanhe Pan, Jiawei Xu, Jinlang Wang, Caitlyn Heqi Yin, Ming Liu

This book, Design Patterns in Machine Learning and Deep Learning: Advancing Big Data Analytics Management, presents a comprehensive study of essential design patterns tailored for large-scale machine learning and deep learning applications.

Management

Deep Learning and Machine Learning, Advancing Big Data Analytics and Management: Object-Oriented Programming

no code implementations30 Sep 2024 Tianyang Wang, Ziqian Bi, Keyu Chen, Jiawei Xu, Qian Niu, Junyu Liu, Benji Peng, Ming Li, Sen Zhang, Xuanhe Pan, Jinlang Wang, Pohsun Feng, Caitlyn Heqi Yin, Yizhu Wen, Ming Liu

Object-Oriented Programming (OOP) has become a crucial paradigm for managing the growing complexity of modern software systems, particularly in fields like machine learning, deep learning, large language models (LLM), and data analytics.

Management

Semi-LLIE: Semi-supervised Contrastive Learning with Mamba-based Low-light Image Enhancement

1 code implementation25 Sep 2024 Guanlin Li, Ke Zhang, Ting Wang, Ming Li, Bin Zhao, Xuelong Li

Despite the impressive advancements made in recent low-light image enhancement techniques, the scarcity of paired data has emerged as a significant obstacle to further advancements.

Contrastive Learning Low-Light Image Enhancement +1

Deep Learning and Machine Learning, Advancing Big Data Analytics and Management: Handy Appetizer

no code implementations25 Sep 2024 Benji Peng, Xuanhe Pan, Yizhu Wen, Ziqian Bi, Keyu Chen, Ming Li, Ming Liu, Qian Niu, Junyu Liu, Jinlang Wang, Sen Zhang, Jiawei Xu, Pohsun Feng

This book explores the role of Artificial Intelligence (AI), Machine Learning (ML), and Deep Learning (DL) in driving the progress of big data analytics and management.

Autonomous Driving Deep Learning +2

Vision-Language Model Fine-Tuning via Simple Parameter-Efficient Modification

1 code implementation25 Sep 2024 Ming Li, Jike Zhong, Chenxin Li, Liuzhuozheng Li, Nie Lin, Masashi Sugiyama

Recent advances in fine-tuning Vision-Language Models (VLMs) have witnessed the success of prompt tuning and adapter tuning, while the classic model fine-tuning on inherent parameters seems to be overlooked.

Language Modelling

Robust and Flexible Omnidirectional Depth Estimation with Multiple 360° Cameras

no code implementations23 Sep 2024 Ming Li, Xueqian Jin, Xuejiao Hu, Jinghao Cao, Sidan Du, Yang Li

We implement two algorithms, in which the two-stage algorithm obtains initial depth maps by pairwise stereo matching of multiple cameras and fuses the multiple depth maps to achieve the final depth estimation; the one-stage algorithm adopts spherical sweeping based on hypothetical depths to construct a uniform spherical matching cost of the multi-camera images and obtain the depth.

Depth Estimation Stereo Matching

Active Reconfigurable Intelligent Surface Empowered Synthetic Aperture Radar Imaging

no code implementations18 Sep 2024 Yifan Sun, Rang Liu, Zhiping Lu, Honghao Luo, Ming Li, Qian Liu

In this paper, we first present a range-Doppler (RD) imaging algorithm to obtain imaging results for the proposed ARIS-empowered SAR system.

From Challenges and Pitfalls to Recommendations and Opportunities: Implementing Federated Learning in Healthcare

no code implementations15 Sep 2024 Ming Li, Pengcheng Xu, Junjie Hu, Zeyu Tang, Guang Yang

Federated learning holds great potential for enabling large-scale healthcare research and collaboration across multiple centres while ensuring data privacy and security are not compromised.

Federated Learning

From Text to Multimodality: Exploring the Evolution and Impact of Large Language Models in Medical Practice

no code implementations14 Sep 2024 Qian Niu, Keyu Chen, Ming Li, Pohsun Feng, Ziqian Bi, Lawrence KQ Yan, Yichao Zhang, Caitlyn Heqi Yin, Cheng Fei, Junyu Liu, Benji Peng, Tianyang Wang, Yunze Wang, Silin Chen

This comprehensive review explores the progression of LLMs to Multimodal Large Language Models (MLLMs) and their growing influence in medical practice.

TSELM: Target Speaker Extraction using Discrete Tokens and Language Models

1 code implementation12 Sep 2024 Beilong Tang, Bang Zeng, Ming Li

We propose TSELM, a novel target speaker extraction network that leverages discrete tokens and language models.

Audio Generation Target Speaker Extraction

Findings of the 2024 Mandarin Stuttering Event Detection and Automatic Speech Recognition Challenge

no code implementations9 Sep 2024 Hongfei Xue, Rong Gong, Mingchen Shao, Xin Xu, Lezhi Wang, Lei Xie, Hui Bu, Jiaming Zhou, Yong Qin, Jun Du, Ming Li, BinBin Zhang, Bin Jia

The StutteringSpeech Challenge focuses on advancing speech technologies for people who stutter, specifically targeting Stuttering Event Detection (SED) and Automatic Speech Recognition (ASR) in Mandarin.

Automatic Speech Recognition Automatic Speech Recognition (ASR) +2

Multimodal Laryngoscopic Video Analysis for Assisted Diagnosis of Vocal Fold Paralysis

no code implementations5 Sep 2024 Yucong Zhang, Xin Zou, Jinshan Yang, Wenjun Chen, Juan Liu, Faya Liang, Ming Li

The system integrates video-based glottis detection with an audio keyword spotting method to analyze both video and audio data, identifying patient vocalizations and refining video highlights to ensure optimal inspection of vocal fold movements.

Keyword Spotting Segmentation

A Dual-Path Framework with Frequency-and-Time Excited Network for Anomalous Sound Detection

no code implementations5 Sep 2024 Yucong Zhang, Juan Liu, Yao Tian, Haifeng Liu, Ming Li

In contrast to human speech, machine-generated sounds of the same type often exhibit consistent frequency characteristics and discernible temporal periodicity.

Anomaly Detection Task 2

Dynamic Hybrid Beamforming Designs for ELAA Near-Field Communications

no code implementations5 Sep 2024 Mengzhen Liu, Ming Li, Rang Liu, Qian Liu

Extremely large-scale antenna array (ELAA) is a key candidate technology for the sixth generation (6G) mobile networks.

Large Language Models and Cognitive Science: A Comprehensive Review of Similarities, Differences, and Challenges

no code implementations4 Sep 2024 Qian Niu, Junyu Liu, Ziqian Bi, Pohsun Feng, Benji Peng, Keyu Chen, Ming Li, Lawrence KQ Yan, Yichao Zhang, Caitlyn Heqi Yin, Cheng Fei, Tianyang Wang, Yunze Wang, Silin Chen

This comprehensive review explores the intersection of Large Language Models (LLMs) and cognitive science, examining similarities and differences between LLMs and human cognitive processes.

USEF-TSE: Universal Speaker Embedding Free Target Speaker Extraction

no code implementations4 Sep 2024 Bang Zeng, Ming Li

Traditionally, this process has relied on extracting a speaker embedding from a reference speech, necessitating a speaker recognition model.

Speaker Recognition Speech Separation +1

One-Index Vector Quantization Based Adversarial Attack on Image Classification

no code implementations2 Sep 2024 Haiju Fan, Xiaona Qin, Shuang Chen, Hubert P. H. Shum, Ming Li

In this paper, we propose a novel one-index attack method in the VQ domain to generate adversarial images by a differential evolution algorithm, successfully resulting in image misclassification in victim models.

Adversarial Attack Image Classification +1

A Joint Learning Model with Variational Interaction for Multilingual Program Translation

1 code implementation25 Aug 2024 Yali Du, Hui Sun, Ming Li

However, parallel data is difficult to collect for some language pairs, and the distribution of program semantics across languages can shift, posing challenges for pairwise program translation.

Disentanglement Translation +1

Top Pass: Improve Code Generation by Pass@k-Maximized Code Ranking

no code implementations11 Aug 2024 Zhi-Cun Lyu, Xin-Ye Li, Zheng Xie, Ming Li

In this paper, we propose Top Pass, a code ranking approach that identifies potential correct solutions from a large number of candidates.

Code Generation

VoxBlink2: A 100K+ Speaker Recognition Corpus and the Open-Set Speaker-Identification Benchmark

no code implementations16 Jul 2024 Yuke Lin, Ming Cheng, FuLin Zhang, Yingying Gao, Shilei Zhang, Ming Li

In this paper, we provide a large audio-visual speaker recognition dataset, VoxBlink2, which includes approximately 10M utterances with videos from 110K+ speakers in the wild.

Diversity Speaker Identification +2

Bridging Smoothness and Approximation: Theoretical Insights into Over-Smoothing in Graph Neural Networks

no code implementations1 Jul 2024 Guangrui Yang, Jianfei Li, Ming Li, Han Feng, Ding-Xuan Zhou

In our numerical experiments, we analyze several widely applied GCNs and observe the phenomenon of energy decay.

MIMO-OFDM ISAC Waveform Design for Range-Doppler Sidelobe Suppression

no code implementations25 Jun 2024 Peishi Li, Ming Li, Rang Liu, Qian Liu, A. Lee Swindlehurst

In addition, the proposed waveform design achieves target detection and estimation performance close to that achievable by waveforms designed only for radar, which demonstrates the superiority of the proposed SLP-based ISAC approach.

Understanding is Compression

1 code implementation24 Jun 2024 Ziguang Li, Chao Huang, Xuliang Wang, Haibo Hu, Cole Wyeth, Dongbo Bu, Quan Yu, Wen Gao, Xingwu Liu, Ming Li

The better a large model understands the data, the better LMCompress compresses.

Data Compression

Feature-prompting GBMSeg: One-Shot Reference Guided Training-Free Prompt Engineering for Glomerular Basement Membrane Segmentation

1 code implementation24 Jun 2024 Xueyu Liu, Guangze Shi, Rui Wang, Yexin Lai, Jianan Zhang, Lele Sun, Quan Yang, Yongfei Wu, Ming Li, Weixia Han, Wen Zheng

Experimental results on our collected 2538 TEM images confirm that GBMSeg achieves superior segmentation performance with a Dice similarity coefficient (DSC) of 87. 27% using only one labeled reference image in a training-free manner, outperforming recently proposed one-shot or few-shot methods.

Prompt Engineering Segmentation

RuleR: Improving LLM Controllability by Rule-based Data Recycling

3 code implementations22 Jun 2024 Ming Li, Han Chen, Chenguang Wang, Dang Nguyen, Dianqi Li, Tianyi Zhou

Despite the remarkable advancement of Large language models (LLMs), they still lack delicate controllability under sophisticated constraints, which is critical to enhancing their response quality and the user experience.

Data Augmentation Instruction Following

Ranking LLMs by compression

no code implementations20 Jun 2024 Peijia Guo, Ziguang Li, Haibo Hu, Chao Huang, Ming Li, Rui Zhang

We conceptualize the process of understanding as information compression, and propose a method for ranking large language models (LLMs) based on lossless data compression.

coreference-resolution Data Compression +5

PFID: Privacy First Inference Delegation Framework for LLMs

no code implementations18 Jun 2024 Haoyan Yang, Zhitao Li, Yong Zhang, Jianzong Wang, Ning Cheng, Ming Li, Jing Xiao

Our framework was designed to be communication efficient, computation can be delegated to the local client so that the server's computation burden can be lightened.

Machine Translation

The Database and Benchmark for the Source Speaker Tracing Challenge 2024

no code implementations7 Jun 2024 Ze Li, Yuke Lin, Tian Yao, Hongbin Suo, Pengyuan Zhang, Yanzhen Ren, Zexin Cai, Hiromitsu Nishizaki, Ming Li

We expect SSTC to be a platform for advancing the development of the SSV task and provide further insights into the performance and limitations of current SV systems against VC attacks.

Multi-Task Learning Speaker Verification +1

Multipath Exploitation for Fluctuating Target Detection in RIS-Assisted ISAC Systems

no code implementations2 Jun 2024 Shoushuo Zhang, Zichao Xiao, Rang Liu, Ming Li, Wei Wang, Qian Liu

Integrated sensing and communication (ISAC) systems are typically deployed in multipath environments, which is usually deemed as a challenging issue for wireless communications.

Diversity

DINO-SD: Champion Solution for ICRA 2024 RoboDepth Challenge

no code implementations27 May 2024 Yifan Mao, Ming Li, Jian Liu, Jiayang Liu, Zihan Qin, Chunxi Chu, Jialei Xu, Wenbo Zhao, Junjun Jiang, Xianming Liu

However, given that most of the data in the autonomous driving dataset is collected in daytime scenarios, this leads to poor depth model performance in the face of out-of-distribution(OoD) data.

3D Reconstruction Autonomous Driving +1

Scaling Laws for Discriminative Classification in Large Language Models

no code implementations24 May 2024 Dean Wyatte, Fatemeh Tahmasbi, Ming Li, Thomas Markovich

To address this issue we present a system that allows us to use an LLM to augment our customer support advocates by re-framing the language modeling task as a discriminative classification task.

Hallucination Language Modelling

HC-GAE: The Hierarchical Cluster-based Graph Auto-Encoder for Graph Representation Learning

no code implementations23 May 2024 Zhuo Xu, Lu Bai, Lixin Cui, Ming Li, Yue Wang, Edwin R. Hancock

To this end, during the encoding process, we commence by utilizing the hard node assignment to decompose a sample graph into a family of separated subgraphs.

Decoder Graph Classification +2

Mosaic-IT: Free Compositional Data Augmentation Improves Instruction Tuning

3 code implementations22 May 2024 Ming Li, Pei Chen, Chenguang Wang, Hongyu Zhao, Yijun Liang, Yupeng Hou, Fuxiao Liu, Tianyi Zhou

Finetuning large language models with a variety of instruction-response pairs has enhanced their capability to understand and follow instructions.

Data Augmentation Diversity +1

How Universal Polynomial Bases Enhance Spectral Graph Neural Networks: Heterophily, Over-smoothing, and Over-squashing

1 code implementation21 May 2024 Keke Huang, Yu Guang Wang, Ming Li, and Pietro Liò

Our extensive experiments, conducted on a diverse range of real-world and synthetic datasets with varying degrees of heterophily, support the superiority of UniFilter.

Graph Neural Network

ENADPool: The Edge-Node Attention-based Differentiable Pooling for Graph Neural Networks

no code implementations16 May 2024 Zhehan Zhao, Lu Bai, Lixin Cui, Ming Li, Yue Wang, Lixiang Xu, Edwin R. Hancock

In this paper, we propose a new hierarchical pooling operation, namely the Edge-Node Attention-based Differentiable Pooling (ENADPool), for GNNs to learn effective graph representations.

Graph Classification

Frame Interpolation with Consecutive Brownian Bridge Diffusion

1 code implementation9 May 2024 Zonglin Lyu, Ming Li, Jianbo Jiao, Chen Chen

To address this problem, we propose our unique solution: Frame Interpolation with Consecutive Brownian Bridge Diffusion.

Conditional Image Generation Video Frame Interpolation

Towards Less Biased Data-driven Scoring with Deep Learning-Based End-to-end Database Search in Tandem Mass Spectrometry

no code implementations8 May 2024 Yonghan Yu, Ming Li

Traditional database search methods, though widely used, rely on heuristic scoring functions and statistical estimations have to be introduced for a higher identification rate.

Contrastive Learning Decoder

Semi-Supervised Disease Classification based on Limited Medical Image Data

no code implementations7 May 2024 Yan Zhang, Chun Li, Zhaoxia Liu, Ming Li

By addressing the limitations imposed by limited labeled data and harnessing the untapped potential of unlabeled medical images, our novel generative model presents a promising direction for enhancing semi-supervised disease classification in the field of medical image analysis.

Medical Image Analysis text-classification +1

Are We Really Achieving Better Beyond-Accuracy Performance in Next Basket Recommendation?

no code implementations2 May 2024 Ming Li, Yuanna Liu, Sami Jullien, Mozhdeh Ariannezhad, Mohammad Aliannejadi, Andrew Yates, Maarten de Rijke

So far, most NBR studies have focused on optimizing the accuracy of the recommendation, whereas optimizing for beyond-accuracy metrics, e. g., item fairness and diversity remains largely unexplored.

Diversity Fairness +3

ConsistencyDet: A Robust Object Detector with a Denoising Paradigm of Consistency Model

1 code implementation11 Apr 2024 Lifan Jiang, Zhihui Wang, Changmiao Wang, Ming Li, Jiaxu Leng, Xindong Wu

In the present study, we introduce a novel framework designed to articulate object detection as a denoising diffusion process, which operates on the perturbed bounding boxes of annotated entities.

Attribute Denoising +2

ControlNet++: Improving Conditional Controls with Efficient Consistency Feedback

1 code implementation11 Apr 2024 Ming Li, Taojiannan Yang, Huafeng Kuang, Jie Wu, Zhaoning Wang, Xuefeng Xiao, Chen Chen

Specifically, for an input conditional control, we use a pre-trained discriminative reward model to extract the corresponding condition of the generated images, and then optimize the consistency loss between the input conditional control and extracted condition.

SSIM

MealRec$^+$: A Meal Recommendation Dataset with Meal-Course Affiliation for Personalization and Healthiness

1 code implementation8 Apr 2024 Ming Li, Lin Li, Xiaohui Tao, Jimmy Xiangji Huang

Due to constraints related to user health privacy and meal scenario characteristics, the collection of data that includes both meal-course affiliation and two levels of interactions is impeded.

Scalable Model Editing via Customized Expert Networks

1 code implementation3 Apr 2024 Zihan Yao, Yu He, Tianyu Qi, Ming Li

Addressing the issues of hallucinations and outdated knowledge in large language models is critical for their reliable application.

Hallucination Model Editing

SSHPool: The Separated Subgraph-based Hierarchical Pooling

no code implementations24 Mar 2024 Zhuo Xu, Lixin Cui, Ming Li, Yue Wang, Ziyu Lyu, Hangyuan Du, Lu Bai, Philip S. Yu, Edwin R. Hancock

We commence by assigning the nodes of a sample graph into different clusters, resulting in a family of separated subgraphs.

Graph Classification

A Comparative Study of Artificial Potential Fields and Safety Filters

no code implementations23 Mar 2024 Ming Li, Zhiyong Sun

In this paper, we have demonstrated that the controllers designed by a classical motion planning tool, namely artificial potential fields (APFs), can be derived from a recently prevalent approach: control barrier function quadratic program (CBF-QP) safety filters.

Motion Planning

KunquDB: An Attempt for Speaker Verification in the Chinese Opera Scenario

1 code implementation20 Mar 2024 Huali Zhou, Yuke Lin, Dong Liu, Ming Li

This work aims to promote Chinese opera research in both musical and speech domains, with a primary focus on overcoming the data limitations.

Domain Adaptation Speaker Verification

A Tunable Universal Formula for Safety-Critical Control

no code implementations10 Mar 2024 Ming Li, Zhiyong Sun, Patrick J. W. Koelewijn, Siep Weiland

Finally, we demonstrate the efficacy of our method through a collision avoidance example, investigating the essential properties including safety, robustness, and smoothness under various tunable scaling terms.

Collision Avoidance

Unifying Controller Design for Stabilizing Nonlinear Systems with Norm-Bounded Control Inputs

no code implementations5 Mar 2024 Ming Li, Zhiyong Sun, Siep Weiland

This paper revisits a classical challenge in the design of stabilizing controllers for nonlinear systems with a norm-bounded input constraint.

Pyramid Feature Attention Network for Monocular Depth Prediction

no code implementations3 Mar 2024 Yifang Xu, Chenglei Peng, Ming Li, Yang Li, Sidan Du

Deep convolutional neural networks (DCNNs) have achieved great success in monocular depth estimation (MDE).

Depth Prediction Monocular Depth Estimation

Location-guided Head Pose Estimation for Fisheye Image

no code implementations28 Feb 2024 Bing Li, Dong Zhang, Cheng Huang, Yun Xian, Ming Li, Dah-Jye Lee

Camera with a fisheye or ultra-wide lens covers a wide field of view that cannot be modeled by the perspective projection.

Head Pose Estimation Multi-Task Learning

A Survey on Knowledge Distillation of Large Language Models

1 code implementation20 Feb 2024 Xiaohan Xu, Ming Li, Chongyang Tao, Tao Shen, Reynold Cheng, Jinyang Li, Can Xu, DaCheng Tao, Tianyi Zhou

In the era of Large Language Models (LLMs), Knowledge Distillation (KD) emerges as a pivotal methodology for transferring advanced capabilities from leading proprietary LLMs, such as GPT-4, to their open-source counterparts like LLaMA and Mistral.

Data Augmentation Knowledge Distillation +2

Can LLMs Speak For Diverse People? Tuning LLMs via Debate to Generate Controllable Controversial Statements

1 code implementation16 Feb 2024 Ming Li, Jiuhai Chen, Lichang Chen, Tianyi Zhou

To examine DEBATUNE, we curate the largest dataset of debate topics so far, which covers 710 controversial topics and corresponding arguments for each topic.

Selective Reflection-Tuning: Student-Selected Data Recycling for LLM Instruction-Tuning

2 code implementations15 Feb 2024 Ming Li, Lichang Chen, Jiuhai Chen, Shwai He, Jiuxiang Gu, Tianyi Zhou

This paper introduces Selective Reflection-Tuning, a novel paradigm that synergizes a teacher LLM's reflection and introspection for improving existing data quality with the data selection capability of the student LLM, to automatically refine existing instruction-tuning data.

Data Augmentation Instruction Following

Zero-shot Explainable Mental Health Analysis on Social Media by Incorporating Mental Scales

no code implementations9 Feb 2024 Wenyu Li, Yinuo Zhu, Xin Lin, Ming Li, Ziyue Jiang, Ziqian Zeng

Traditional discriminative approaches in mental health analysis are known for their strong capacity but lack interpretability and demand large-scale annotated data.

Superfiltering: Weak-to-Strong Data Filtering for Fast Instruction-Tuning

1 code implementation1 Feb 2024 Ming Li, Yong Zhang, Shwai He, Zhitao Li, Hongyu Zhao, Jianzong Wang, Ning Cheng, Tianyi Zhou

Data filtering for instruction tuning has proved important in improving both the efficiency and performance of the tuning process.

Language Modelling

Leveraging Biases in Large Language Models: "bias-kNN'' for Effective Few-Shot Learning

no code implementations18 Jan 2024 Yong Zhang, Hanzhang Li, Zhitao Li, Ning Cheng, Ming Li, Jing Xiao, Jianzong Wang

Large Language Models (LLMs) have shown significant promise in various applications, including zero-shot and few-shot learning.

Few-Shot Learning In-Context Learning +2

Multi-Input Multi-Output Target-Speaker Voice Activity Detection For Unified, Flexible, and Robust Audio-Visual Speaker Diarization

no code implementations16 Jan 2024 Ming Cheng, Ming Li

The proposed method can take audio-visual input and leverage the speaker's acoustic footprint or lip track to flexibly conduct audio-based, video-based, and audio-visual speaker diarization in a unified sequence-to-sequence framework.

Action Detection Activity Detection +7

End-to-End Learning for SLP-Based ISAC Systems

no code implementations11 Jan 2024 Yixian Zheng, Rang Liu, Ming Li, Qian Liu

Integrated sensing and communication (ISAC) is an encouraging wireless technology which can simultaneously perform both radar and communication functionalities by sharing the same transmit waveform, spectral resource, and hardware platform.

A Practical Beamforming Design for Active RIS-assisted MU-MISO Systems

no code implementations8 Jan 2024 Yun Yang, Zhiping Lu, Ming Li, Rang Liu, Qian Liu

Motivated by this fact, in this paper we first investigate the amplification principle of typical active RIS and propose a more accurate amplification model based on amplifier hardware characteristics.

Self-supervised Reflective Learning through Self-distillation and Online Clustering for Speaker Representation Learning

no code implementations3 Jan 2024 Danwei Cai, Zexin Cai, Ming Li

Specifically, a teacher model continually refines pseudo labels through online clustering, providing dynamic supervision signals to train the student model.

Clustering Knowledge Distillation +3

IL-NeRF: Incremental Learning for Neural Radiance Fields with Camera Pose Alignment

no code implementations10 Dec 2023 Letian Zhang, Ming Li, Chen Chen, Jie Xu

This poses a paradox as the necessary camera pose must be estimated from the entire dataset, even though the data arrives sequentially and future chunks are inaccessible.

Incremental Learning Knowledge Distillation

Safe Stabilization with Model Uncertainties: A Universal Formula with Gaussian Process Learning

no code implementations5 Dec 2023 Ming Li, Zhiyong Sun

In our previous research, we developed an analytical control strategy, namely the universal formula, that incorporates CLF and CBF conditions for safe stabilization.

Gaussian Processes

ColonNeRF: High-Fidelity Neural Reconstruction of Long Colonoscopy

no code implementations4 Dec 2023 Yufei Shi, Beijia Lu, Jia-Wei Liu, Ming Li, Mike Zheng Shou

Specifically, to reconstruct the entire colon in a piecewise manner, our ColonNeRF introduces a region division and integration module, effectively reducing shape dissimilarity and ensuring geometric consistency in each segment.

Neural Rendering Novel View Synthesis

LucidDreaming: Controllable Object-Centric 3D Generation

no code implementations30 Nov 2023 Zhaoning Wang, Ming Li, Chen Chen

Specifically, our research demonstrates that Large Language Models (LLMs) possess 3D spatial awareness and can effectively translate textual 3D information into precise 3D bounding boxes.

3D Generation Benchmarking +4

Where to Begin? From Random to Foundation Model Instructed Initialization in Federated Learning for Medical Image Segmentation

no code implementations27 Nov 2023 Ming Li, Guang Yang

In medical image analysis, Federated Learning (FL) stands out as a key technology that enables privacy-preserved, decentralized data processing, crucial for handling sensitive medical data.

Federated Learning Image Segmentation +3