1 code implementation • Findings (EMNLP) 2021 • Anup Anand Deshmukh, Qianqiu Zhang, Ming Li, Jimmy Lin, Lili Mou
In this paper, we address unsupervised chunking as a new task of syntactic structure induction, which is helpful for understanding the linguistic structures of human languages as well as processing low-resource languages.
1 code implementation • Findings (EMNLP) 2021 • Minghan Li, Ming Li, Kun Xiong, Jimmy Lin
Our method reaches state-of-the-art performance on 5 benchmark QA datasets, with up to 10% improvement in top-100 accuracy compared to a joint-training multi-task DPR on SQuAD.
1 code implementation • 1 Dec 2024 • Weiche Hsieh, Ziqian Bi, Chuanqi Jiang, Junyu Liu, Benji Peng, Sen Zhang, Xuanhe Pan, Jiawei Xu, Jinlang Wang, Keyu Chen, Caitlyn Heqi Yin, Pohsun Feng, Yizhu Wen, Xinyuan Song, Tianyang Wang, Junjie Yang, Ming Li, Bowen Jing, Jintao Ren, Junhao Song, Han Xu, Hong-Ming Tseng, Yichao Zhang, Lawrence K. Q. Yan, Qian Niu, Silin Chen, Yunze Wang, Chia Xin Liang, Ming Liu
Explainable Artificial Intelligence (XAI) addresses the growing need for transparency and interpretability in AI systems, enabling trust and accountability in decision-making processes.
1 code implementation • 24 Nov 2024 • Zheng Ma, Zeping Mao, Ruixue Zhang, Jiazhen Chen, Lei Xin, Paul Shan, Ali Ghodsi, Ming Li
This paper also provides criteria about when DIA data could be used for de novo peptide sequencing and when not to by providing a comparison between DDA and DIA, in both de novo and database search mode.
no code implementations • 21 Nov 2024 • Ming Cheng, Yuke Lin, Ming Li
1) Speaker Detection: The proposed approach can utilize incompletely given speaker embeddings to discover the unknown speaker and predict the target voice activities in the audio signal.
no code implementations • 14 Nov 2024 • Xiao Lv, Jiangxia Cao, Shijie Guan, Xiaoyou Zhou, Zhiguang Qi, Yaqiang Zang, Ming Li, Ben Wang, Kun Gai, Guorui Zhou
Considering the above differences with LLM, we can draw a conclusion that: for a RecSys model, compared to model parameters, the computational complexity FLOPs is a more expensive factor that requires careful control.
no code implementations • 14 Nov 2024 • Wenxing Liu, Yueran Pan, Ming Li
Our pipeline converts video content into scripts that describe the behavior of characters, leveraging the generalizability of large language models to detect ASD in a zero-shot or few-shot manner.
1 code implementation • 10 Nov 2024 • Hao Tang, Junhao Lu, Guoheng Huang, Ming Li, Xuhang Chen, Guo Zhong, Zhengguang Tan, Zinuo Li
In Few-Shot Learning (FSL), traditional metric-based approaches often rely on global metrics to compute similarity.
no code implementations • 9 Nov 2024 • Keqin Li, Lipeng Liu, Jiajing Chen, Dezhi Yu, Xiaofan Zhou, Ming Li, Congyu Wang, Zhao Li
In this paper, how to efficiently find the optimal path in complex warehouse layout and make real-time decision is a key problem.
no code implementations • 6 Nov 2024 • Charles Zhang, Benji Peng, Xintian Sun, Qian Niu, Junyu Liu, Keyu Chen, Ming Li, Pohsun Feng, Ziqian Bi, Ming Liu, Yichao Zhang, Cheng Fei, Caitlyn Heqi Yin, Lawrence KQ Yan, Tianyang Wang
Word embeddings and language models have transformed natural language processing (NLP) by facilitating the representation of linguistic elements in continuous vector spaces.
no code implementations • 6 Nov 2024 • Jie Zhao, Ming Li, Yu Li, Patrick Matgen, Marco Chini
Besides, we evaluated the Technology Readiness Levels (TRLs) of urban flood mapping techniques to identify challenges and future research areas.
no code implementations • 6 Nov 2024 • Xin Gu, Ming Li, Libo Zhang, Fan Chen, Longyin Wen, Tiejian Luo, Sijie Zhu
1) we first design a quantitative metric system based on best-in-class LVLM (Large Vision Language Model), i. e., GPT-4o in our case, to evaluate the generation quality from 3 perspectives, namely, instruction following, detail preserving, and generation quality.
no code implementations • 3 Nov 2024 • Ming Li, Jike Zhong, Tianle Chen, Yuxiang Lai, Konstantinos Psounis
In summary, we believe EEE-Bench not only reveals some noteworthy limitations of LMMs but also provides a valuable resource for advancing research on their application in practical engineering tasks, driving future improvements in their capability to handle complex, real-world scenarios.
no code implementations • 1 Nov 2024 • Donald W. K. Andrews, Ming Li
This paper considers nonparametric estimation and inference in first-order autoregressive (AR(1)) models with deterministically time-varying parameters.
1 code implementation • 1 Nov 2024 • Meng Sun, Lin Li, Ming Li, Xiaohui Tao, Dong Zhang, Peipei Wang, Jimmy Xiangji Huang
In recent years, bundle recommendation systems have gained significant attention in both academia and industry due to their ability to enhance user experience and increase sales by recommending a set of items as a bundle rather than individual items.
no code implementations • 31 Oct 2024 • Ming Li, Zhentao Shi, Yapeng Zheng
This paper studies estimation and inference in a dyadic network formation model with observed covariates, unobserved heterogeneity, and nontransferable utilities.
no code implementations • 31 Oct 2024 • Weiguo Gao, Ming Li
In this work, we provide theoretical insights into this challenge by leveraging Flow Matching models, which transform a simple prior into a complex target distribution via a learned velocity field.
1 code implementation • 31 Oct 2024 • Ming Li, Yanhong Li, Tianyi Zhou
We are specifically interested in how fast vs. slow thinking affects the layer-wise gradients, given the recent popularity of training LLMs on reasoning paths such as chain-of-thoughts (CoT) and process rewards.
no code implementations • 31 Oct 2024 • Rang Liu, A. Lee Swindlehurst, Ming Li
This paper presents a novel parametric scattering model (PSM) for sensing extended targets in integrated sensing and communication (ISAC) systems.
no code implementations • 30 Oct 2024 • Keyu Chen, Cheng Fei, Ziqian Bi, Junyu Liu, Benji Peng, Sen Zhang, Xuanhe Pan, Jiawei Xu, Jinlang Wang, Caitlyn Heqi Yin, Yichao Zhang, Pohsun Feng, Yizhu Wen, Tianyang Wang, Ming Li, Jintao Ren, Qian Niu, Silin Chen, Weiche Hsieh, Lawrence K. Q. Yan, Chia Xin Liang, Han Xu, Hong-Ming Tseng, Xinyuan Song, Ming Liu
With a focus on natural language processing (NLP) and the role of large language models (LLMs), we explore the intersection of machine learning, deep learning, and artificial intelligence.
no code implementations • 28 Oct 2024 • Lawrence K. Q. Yan, Ming Li, Yichao Zhang, Caitlyn Heqi Yin, Cheng Fei, Benji Peng, Ziqian Bi, Pohsun Feng, Keyu Chen, Junyu Liu, Qian Niu
With the increasing application of large language models (LLMs) in the medical domain, evaluating these models' performance using benchmark datasets has become crucial.
no code implementations • 27 Oct 2024 • Weiche Hsieh, Ziqian Bi, Junyu Liu, Benji Peng, Sen Zhang, Xuanhe Pan, Jiawei Xu, Jinlang Wang, Keyu Chen, Caitlyn Heqi Yin, Pohsun Feng, Yizhu Wen, Tianyang Wang, Ming Li, Jintao Ren, Qian Niu, Silin Chen, Ming Liu
Digital Signal Processing (DSP) and Digital Image Processing (DIP) with Machine Learning (ML) and Deep Learning (DL) are popular research areas in Computer Vision and related fields.
no code implementations • 22 Oct 2024 • Silin Chen, Ziqian Bi, Junyu Liu, Benji Peng, Sen Zhang, Xuanhe Pan, Jiawei Xu, Jinlang Wang, Keyu Chen, Caitlyn Heqi Yin, Pohsun Feng, Yizhu Wen, Tianyang Wang, Ming Li, Jintao Ren, Qian Niu, Ming Liu
This book provides a comprehensive introduction to the foundational concepts of machine learning (ML) and deep learning (DL).
no code implementations • 21 Oct 2024 • Jintao Ren, Ziqian Bi, Qian Niu, Junyu Liu, Benji Peng, Sen Zhang, Xuanhe Pan, Jinlang Wang, Keyu Chen, Caitlyn Heqi Yin, Pohsun Feng, Yizhu Wen, Tianyang Wang, Silin Chen, Ming Li, Jiawei Xu, Ming Liu
This book offers an in-depth exploration of object detection and semantic segmentation, combining theoretical foundations with practical applications.
no code implementations • 21 Oct 2024 • Ming Li, Wei Shen, Qingli Li, Yan Wang
The fundamental idea of label filling is to supervise the segmentation model by a subset of pixels with trustworthy labels, meanwhile filling labels of other pixels by mixed supervision.
no code implementations • 18 Oct 2024 • Rang Liu, Ming Li, Qian Liu
In this paper, we explore cooperative sensing and communication within cell-free integrated sensing and communication (ISAC) systems.
1 code implementation • 17 Oct 2024 • Hongyu Zhao, Ming Li, Lichao Sun, Tianyi Zhou
Evaluating large language models (LLMs) is costly: it requires the generation and examination of LLM outputs on a large-scale benchmark of various tasks.
no code implementations • 16 Oct 2024 • Rang Liu, Ming Li, Qian Liu, A. Lee Swindlehurst
Integrated sensing and communication has been identified as an enabling technology for forthcoming wireless networks.
no code implementations • 16 Oct 2024 • Haocheng Zhang, Wei Wang, Hao Zhou, Zhiping Lu, Ming Li
As reconfigurable intelligent surfaces (RIS) emerge as a pivotal technology in the upcoming sixth-generation (6G) networks, their deployment within practical multiple operator (OP) networks presents significant challenges, including the coordination of RIS configurations among OPs, interference management, and privacy maintenance.
no code implementations • 12 Oct 2024 • Pohsun Feng, Ziqian Bi, Yizhu Wen, Benji Peng, Junyu Liu, Caitlyn Heqi Yin, Tianyang Wang, Keyu Chen, Sen Zhang, Ming Li, Jiawei Xu, Ming Liu, Xuanhe Pan, Jinlang Wang, Qian Niu
This manuscript presents a comprehensive guide to Automated Machine Learning (AutoML), covering fundamental principles, practical implementations, and future trends.
no code implementations • 11 Oct 2024 • Yufan Liu, Jinyang An, Wanqian Zhang, Ming Li, Dayan Wu, Jingzi Gu, Zheng Lin, Weiping Wang
The remarkable development of text-to-image generation models has raised notable security concerns, such as the infringement of portrait rights and the generation of inappropriate content.
no code implementations • 11 Oct 2024 • Guangrui Yang, Ming Li, Han Feng, Xiaosheng Zhuang
Graph convolutional networks (GCNs) have emerged as powerful models for graph learning tasks, exhibiting promising performance in various domains.
no code implementations • 5 Oct 2024 • Ze Li, Yao Shi, Yunfei Xu, Ming Li
Speaker embedding based zero-shot Text-to-Speech (TTS) systems enable high-quality speech synthesis for unseen speakers using minimal data.
no code implementations • 4 Oct 2024 • Ming Li, Zhaojian Wang, Feng Liu, Ming Cao, Bo Yang
The scheme of online optimization as a feedback controller is widely used to steer the states of a physical system to the optimal solution of a predefined optimization problem.
no code implementations • 4 Oct 2024 • Keyu Chen, Ziqian Bi, Tianyang Wang, Yizhu Wen, Pohsun Feng, Qian Niu, Junyu Liu, Benji Peng, Sen Zhang, Ming Li, Xuanhe Pan, Jiawei Xu, Jinlang Wang, Caitlyn Heqi Yin, Ming Liu
This book, Design Patterns in Machine Learning and Deep Learning: Advancing Big Data Analytics Management, presents a comprehensive study of essential design patterns tailored for large-scale machine learning and deep learning applications.
no code implementations • 2 Oct 2024 • Pohsun Feng, Ziqian Bi, Yizhu Wen, Xuanhe Pan, Benji Peng, Ming Liu, Jiawei Xu, Keyu Chen, Junyu Liu, Caitlyn Heqi Yin, Sen Zhang, Jinlang Wang, Qian Niu, Ming Li, Tianyang Wang
This book serves as an introduction to deep learning and machine learning, focusing on their applications in big data analytics.
no code implementations • 30 Sep 2024 • Tianyang Wang, Ziqian Bi, Keyu Chen, Jiawei Xu, Qian Niu, Junyu Liu, Benji Peng, Ming Li, Sen Zhang, Xuanhe Pan, Jinlang Wang, Pohsun Feng, Caitlyn Heqi Yin, Yizhu Wen, Ming Liu
Object-Oriented Programming (OOP) has become a crucial paradigm for managing the growing complexity of modern software systems, particularly in fields like machine learning, deep learning, large language models (LLM), and data analytics.
1 code implementation • 25 Sep 2024 • Guanlin Li, Ke Zhang, Ting Wang, Ming Li, Bin Zhao, Xuelong Li
Despite the impressive advancements made in recent low-light image enhancement techniques, the scarcity of paired data has emerged as a significant obstacle to further advancements.
no code implementations • 25 Sep 2024 • Benji Peng, Xuanhe Pan, Yizhu Wen, Ziqian Bi, Keyu Chen, Ming Li, Ming Liu, Qian Niu, Junyu Liu, Jinlang Wang, Sen Zhang, Jiawei Xu, Pohsun Feng
This book explores the role of Artificial Intelligence (AI), Machine Learning (ML), and Deep Learning (DL) in driving the progress of big data analytics and management.
1 code implementation • 25 Sep 2024 • Ming Li, Jike Zhong, Chenxin Li, Liuzhuozheng Li, Nie Lin, Masashi Sugiyama
Recent advances in fine-tuning Vision-Language Models (VLMs) have witnessed the success of prompt tuning and adapter tuning, while the classic model fine-tuning on inherent parameters seems to be overlooked.
no code implementations • 23 Sep 2024 • Ming Li, Xueqian Jin, Xuejiao Hu, Jinghao Cao, Sidan Du, Yang Li
We implement two algorithms, in which the two-stage algorithm obtains initial depth maps by pairwise stereo matching of multiple cameras and fuses the multiple depth maps to achieve the final depth estimation; the one-stage algorithm adopts spherical sweeping based on hypothetical depths to construct a uniform spherical matching cost of the multi-camera images and obtain the depth.
no code implementations • 18 Sep 2024 • Yifan Sun, Rang Liu, Zhiping Lu, Honghao Luo, Ming Li, Qian Liu
In this paper, we first present a range-Doppler (RD) imaging algorithm to obtain imaging results for the proposed ARIS-empowered SAR system.
no code implementations • 15 Sep 2024 • Ming Li, Pengcheng Xu, Junjie Hu, Zeyu Tang, Guang Yang
Federated learning holds great potential for enabling large-scale healthcare research and collaboration across multiple centres while ensuring data privacy and security are not compromised.
no code implementations • 14 Sep 2024 • Qian Niu, Keyu Chen, Ming Li, Pohsun Feng, Ziqian Bi, Lawrence KQ Yan, Yichao Zhang, Caitlyn Heqi Yin, Cheng Fei, Junyu Liu, Benji Peng, Tianyang Wang, Yunze Wang, Silin Chen
This comprehensive review explores the progression of LLMs to Multimodal Large Language Models (MLLMs) and their growing influence in medical practice.
1 code implementation • 12 Sep 2024 • Beilong Tang, Bang Zeng, Ming Li
We propose TSELM, a novel target speaker extraction network that leverages discrete tokens and language models.
no code implementations • 12 Sep 2024 • Ming Li, Xiong Yang, Chaofan Wu, Jiaheng Li, Pinzhi Wang, Xuejiao Hu, Sidan Du, Yang Li
Omnidirectional Depth Estimation has broad application prospects in fields such as robotic navigation and autonomous driving.
no code implementations • 9 Sep 2024 • Hongfei Xue, Rong Gong, Mingchen Shao, Xin Xu, Lezhi Wang, Lei Xie, Hui Bu, Jiaming Zhou, Yong Qin, Jun Du, Ming Li, BinBin Zhang, Bin Jia
The StutteringSpeech Challenge focuses on advancing speech technologies for people who stutter, specifically targeting Stuttering Event Detection (SED) and Automatic Speech Recognition (ASR) in Mandarin.
Automatic Speech Recognition Automatic Speech Recognition (ASR) +2
no code implementations • 5 Sep 2024 • Yucong Zhang, Xin Zou, Jinshan Yang, Wenjun Chen, Juan Liu, Faya Liang, Ming Li
The system integrates video-based glottis detection with an audio keyword spotting method to analyze both video and audio data, identifying patient vocalizations and refining video highlights to ensure optimal inspection of vocal fold movements.
no code implementations • 5 Sep 2024 • Yucong Zhang, Juan Liu, Yao Tian, Haifeng Liu, Ming Li
In contrast to human speech, machine-generated sounds of the same type often exhibit consistent frequency characteristics and discernible temporal periodicity.
no code implementations • 5 Sep 2024 • Mengzhen Liu, Ming Li, Rang Liu, Qian Liu
Extremely large-scale antenna array (ELAA) is a key candidate technology for the sixth generation (6G) mobile networks.
no code implementations • 4 Sep 2024 • Qian Niu, Junyu Liu, Ziqian Bi, Pohsun Feng, Benji Peng, Keyu Chen, Ming Li, Lawrence KQ Yan, Yichao Zhang, Caitlyn Heqi Yin, Cheng Fei, Tianyang Wang, Yunze Wang, Silin Chen
This comprehensive review explores the intersection of Large Language Models (LLMs) and cognitive science, examining similarities and differences between LLMs and human cognitive processes.
no code implementations • 4 Sep 2024 • Bang Zeng, Ming Li
Traditionally, this process has relied on extracting a speaker embedding from a reference speech, necessitating a speaker recognition model.
no code implementations • 2 Sep 2024 • Haiju Fan, Xiaona Qin, Shuang Chen, Hubert P. H. Shum, Ming Li
In this paper, we propose a novel one-index attack method in the VQ domain to generate adversarial images by a differential evolution algorithm, successfully resulting in image misclassification in victim models.
1 code implementation • 25 Aug 2024 • Yali Du, Hui Sun, Ming Li
However, parallel data is difficult to collect for some language pairs, and the distribution of program semantics across languages can shift, posing challenges for pairwise program translation.
no code implementations • 11 Aug 2024 • Zhi-Cun Lyu, Xin-Ye Li, Zheng Xie, Ming Li
In this paper, we propose Top Pass, a code ranking approach that identifies potential correct solutions from a large number of candidates.
no code implementations • 16 Jul 2024 • Yuke Lin, Ming Cheng, FuLin Zhang, Yingying Gao, Shilei Zhang, Ming Li
In this paper, we provide a large audio-visual speaker recognition dataset, VoxBlink2, which includes approximately 10M utterances with videos from 110K+ speakers in the wild.
no code implementations • 1 Jul 2024 • Guangrui Yang, Jianfei Li, Ming Li, Han Feng, Ding-Xuan Zhou
In our numerical experiments, we analyze several widely applied GCNs and observe the phenomenon of energy decay.
no code implementations • 25 Jun 2024 • Peishi Li, Ming Li, Rang Liu, Qian Liu, A. Lee Swindlehurst
In addition, the proposed waveform design achieves target detection and estimation performance close to that achievable by waveforms designed only for radar, which demonstrates the superiority of the proposed SLP-based ISAC approach.
1 code implementation • 24 Jun 2024 • Ziguang Li, Chao Huang, Xuliang Wang, Haibo Hu, Cole Wyeth, Dongbo Bu, Quan Yu, Wen Gao, Xingwu Liu, Ming Li
The better a large model understands the data, the better LMCompress compresses.
1 code implementation • 24 Jun 2024 • Xueyu Liu, Guangze Shi, Rui Wang, Yexin Lai, Jianan Zhang, Lele Sun, Quan Yang, Yongfei Wu, Ming Li, Weixia Han, Wen Zheng
Experimental results on our collected 2538 TEM images confirm that GBMSeg achieves superior segmentation performance with a Dice similarity coefficient (DSC) of 87. 27% using only one labeled reference image in a training-free manner, outperforming recently proposed one-shot or few-shot methods.
3 code implementations • 22 Jun 2024 • Ming Li, Han Chen, Chenguang Wang, Dang Nguyen, Dianqi Li, Tianyi Zhou
Despite the remarkable advancement of Large language models (LLMs), they still lack delicate controllability under sophisticated constraints, which is critical to enhancing their response quality and the user experience.
no code implementations • 20 Jun 2024 • Peijia Guo, Ziguang Li, Haibo Hu, Chao Huang, Ming Li, Rui Zhang
We conceptualize the process of understanding as information compression, and propose a method for ranking large language models (LLMs) based on lossless data compression.
no code implementations • 18 Jun 2024 • Haoyan Yang, Zhitao Li, Yong Zhang, Jianzong Wang, Ning Cheng, Ming Li, Jing Xiao
Our framework was designed to be communication efficient, computation can be delegated to the local client so that the server's computation burden can be lightened.
no code implementations • 11 Jun 2024 • Rong Gong, Hongfei Xue, Lezhi Wang, Xin Xu, Qisheng Li, Lei Xie, Hui Bu, Shaomei Wu, Jiaming Zhou, Yong Qin, BinBin Zhang, Jun Du, Jia Bin, Ming Li
The rapid advancements in speech technologies over the past two decades have led to human-level performance in tasks like automatic speech recognition (ASR) for fluent speech.
Automatic Speech Recognition Automatic Speech Recognition (ASR) +2
no code implementations • 7 Jun 2024 • Ze Li, Yuke Lin, Tian Yao, Hongbin Suo, Pengyuan Zhang, Yanzhen Ren, Zexin Cai, Hiromitsu Nishizaki, Ming Li
We expect SSTC to be a platform for advancing the development of the SSV task and provide further insights into the performance and limitations of current SV systems against VC attacks.
no code implementations • 2 Jun 2024 • Shoushuo Zhang, Zichao Xiao, Rang Liu, Ming Li, Wei Wang, Qian Liu
Integrated sensing and communication (ISAC) systems are typically deployed in multipath environments, which is usually deemed as a challenging issue for wireless communications.
no code implementations • 27 May 2024 • Yifan Mao, Ming Li, Jian Liu, Jiayang Liu, Zihan Qin, Chunxi Chu, Jialei Xu, Wenbo Zhao, Junjun Jiang, Xianming Liu
However, given that most of the data in the autonomous driving dataset is collected in daytime scenarios, this leads to poor depth model performance in the face of out-of-distribution(OoD) data.
no code implementations • 24 May 2024 • Dean Wyatte, Fatemeh Tahmasbi, Ming Li, Thomas Markovich
To address this issue we present a system that allows us to use an LLM to augment our customer support advocates by re-framing the language modeling task as a discriminative classification task.
no code implementations • 23 May 2024 • Zhuo Xu, Lu Bai, Lixin Cui, Ming Li, Yue Wang, Edwin R. Hancock
To this end, during the encoding process, we commence by utilizing the hard node assignment to decompose a sample graph into a family of separated subgraphs.
3 code implementations • 22 May 2024 • Ming Li, Pei Chen, Chenguang Wang, Hongyu Zhao, Yijun Liang, Yupeng Hou, Fuxiao Liu, Tianyi Zhou
Finetuning large language models with a variety of instruction-response pairs has enhanced their capability to understand and follow instructions.
1 code implementation • 21 May 2024 • Keke Huang, Yu Guang Wang, Ming Li, and Pietro Liò
Our extensive experiments, conducted on a diverse range of real-world and synthetic datasets with varying degrees of heterophily, support the superiority of UniFilter.
no code implementations • 16 May 2024 • Zhehan Zhao, Lu Bai, Lixin Cui, Ming Li, Yue Wang, Lixiang Xu, Edwin R. Hancock
In this paper, we propose a new hierarchical pooling operation, namely the Edge-Node Attention-based Differentiable Pooling (ENADPool), for GNNs to learn effective graph representations.
no code implementations • 14 May 2024 • Lingdong Kong, Shaoyuan Xie, Hanjiang Hu, Yaru Niu, Wei Tsang Ooi, Benoit R. Cottereau, Lai Xing Ng, Yuexin Ma, Wenwei Zhang, Liang Pan, Kai Chen, Ziwei Liu, Weichao Qiu, Wei zhang, Xu Cao, Hao Lu, Ying-Cong Chen, Caixin Kang, Xinning Zhou, Chengyang Ying, Wentao Shang, Xingxing Wei, Yinpeng Dong, Bo Yang, Shengyin Jiang, Zeliang Ma, Dengyi Ji, Haiwen Li, Xingliang Huang, Yu Tian, Genghua Kou, Fan Jia, Yingfei Liu, Tiancai Wang, Ying Li, Xiaoshuai Hao, Yifan Yang, HUI ZHANG, Mengchuan Wei, Yi Zhou, Haimei Zhao, Jing Zhang, Jinke Li, Xiao He, Xiaoqiang Cheng, Bingyang Zhang, Lirong Zhao, Dianlei Ding, Fangsheng Liu, Yixiang Yan, Hongming Wang, Nanfei Ye, Lun Luo, Yubo Tian, Yiwei Zuo, Zhe Cao, Yi Ren, Yunfan Li, Wenjie Liu, Xun Wu, Yifan Mao, Ming Li, Jian Liu, Jiayang Liu, Zihan Qin, Cunxi Chu, Jialei Xu, Wenbo Zhao, Junjun Jiang, Xianming Liu, Ziyan Wang, Chiwei Li, Shilong Li, Chendong Yuan, Songyue Yang, Wentao Liu, Peng Chen, Bin Zhou, YuBo Wang, Chi Zhang, Jianhang Sun, Hai Chen, Xiao Yang, Lizhong Wang, Dongyi Fu, Yongchun Lin, Huitong Yang, Haoang Li, Yadan Luo, Xianjing Cheng, Yong Xu
In the realm of autonomous driving, robust perception under out-of-distribution conditions is paramount for the safe deployment of vehicles.
1 code implementation • 9 May 2024 • Zonglin Lyu, Ming Li, Jianbo Jiao, Chen Chen
To address this problem, we propose our unique solution: Frame Interpolation with Consecutive Brownian Bridge Diffusion.
no code implementations • 8 May 2024 • Yonghan Yu, Ming Li
Traditional database search methods, though widely used, rely on heuristic scoring functions and statistical estimations have to be introduced for a higher identification rate.
no code implementations • 7 May 2024 • Yan Zhang, Chun Li, Zhaoxia Liu, Ming Li
By addressing the limitations imposed by limited labeled data and harnessing the untapped potential of unlabeled medical images, our novel generative model presents a promising direction for enhancing semi-supervised disease classification in the field of medical image analysis.
no code implementations • 2 May 2024 • Ming Li, Yuanna Liu, Sami Jullien, Mozhdeh Ariannezhad, Mohammad Aliannejadi, Andrew Yates, Maarten de Rijke
So far, most NBR studies have focused on optimizing the accuracy of the recommendation, whereas optimizing for beyond-accuracy metrics, e. g., item fairness and diversity remains largely unexplored.
1 code implementation • 11 Apr 2024 • Lifan Jiang, Zhihui Wang, Changmiao Wang, Ming Li, Jiaxu Leng, Xindong Wu
In the present study, we introduce a novel framework designed to articulate object detection as a denoising diffusion process, which operates on the perturbed bounding boxes of annotated entities.
1 code implementation • 11 Apr 2024 • Ming Li, Taojiannan Yang, Huafeng Kuang, Jie Wu, Zhaoning Wang, Xuefeng Xiao, Chen Chen
Specifically, for an input conditional control, we use a pre-trained discriminative reward model to extract the corresponding condition of the generated images, and then optimize the consistency loss between the input conditional control and extracted condition.
1 code implementation • 8 Apr 2024 • Ming Li, Lin Li, Xiaohui Tao, Jimmy Xiangji Huang
Due to constraints related to user health privacy and meal scenario characteristics, the collection of data that includes both meal-course affiliation and two levels of interactions is impeded.
1 code implementation • 3 Apr 2024 • Zihan Yao, Yu He, Tianyu Qi, Ming Li
Addressing the issues of hallucinations and outdated knowledge in large language models is critical for their reliable application.
no code implementations • 2 Apr 2024 • Xu Li, Ruiqi Sun, Jiameng Lv, Peng Jia, Nan Li, Chengliang Wei, Zou Hu, Xinzhong Er, Yun Chen, Zhang Ban, Yuedong Fang, Qi Guo, Dezi Liu, Guoliang Li, Lin Lin, Ming Li, Ran Li, Xiaobo Li, Yu Luo, Xianmin Meng, Jundan Nie, Zhaoxiang Qi, Yisheng Qiu, Li Shao, Hao Tian, Lei Wang, Wei Wang, Jingtian Xian, Youhua Xu, Tianmeng Zhang, Xin Zhang, Zhimin Zhou
To overcome these challenges, we have developed a framework based on a hierarchical visual Transformer with a sliding window technique to search for strong lensing systems within entire images.
no code implementations • 2 Apr 2024 • James Anibal, Hannah Huth, Ming Li, Lindsey Hazen, Veronica Daoud, Dominique Ebedes, Yen Minh Lam, Hang Nguyen, Phuc Hong, Michael Kleinman, Shelley Ost, Christopher Jackson, Laura Sprabery, Cheran Elangovan, Balaji Krishnaiah, Lee Akst, Ioan Lina, Iqbal Elyazar, Lenny Ekwati, Stefan Jansen, Richard Nduwayezu, Charisse Garcia, Jeffrey Plum, Jacqueline Brenner, Miranda Song, Emily Ricotta, David Clifton, C. Louise Thwaites, Yael Bensoussan, Bradford Wood
Artificial intelligence (AI) models trained on audio data may have the potential to rapidly perform clinical tasks, enhancing medical decision-making and potentially improving outcomes through early detection.
1 code implementation • IEEE Transactions on Affective Computing 2024 • Chengyan Yu, Dong Zhang, Wei Zou, Ming Li
In this study, we propose a joint training method for training an FER model using multiple FER datasets.
Ranked #8 on Facial Expression Recognition (FER) on RAF-DB (using extra training data)
Facial Expression Recognition Facial Expression Recognition (FER) +1
no code implementations • 24 Mar 2024 • Feifei Qian, Lixin Cui, Ming Li, Yue Wang, Hangyuan Du, Lixiang Xu, Lu Bai, Philip S. Yu, Edwin R. Hancock
In this paper, we propose a new model to learn Adaptive Kernel-based Representations (AKBR) for graph classification.
no code implementations • 24 Mar 2024 • Zhuo Xu, Lixin Cui, Ming Li, Yue Wang, Ziyu Lyu, Hangyuan Du, Lu Bai, Philip S. Yu, Edwin R. Hancock
We commence by assigning the nodes of a sample graph into different clusters, resulting in a family of separated subgraphs.
no code implementations • 23 Mar 2024 • Ming Li, Zhiyong Sun
In this paper, we have demonstrated that the controllers designed by a classical motion planning tool, namely artificial potential fields (APFs), can be derived from a recently prevalent approach: control barrier function quadratic program (CBF-QP) safety filters.
1 code implementation • 20 Mar 2024 • Huali Zhou, Yuke Lin, Dong Liu, Ming Li
This work aims to promote Chinese opera research in both musical and speech domains, with a primary focus on overcoming the data limitations.
no code implementations • 10 Mar 2024 • Ming Li, Zhiyong Sun, Patrick J. W. Koelewijn, Siep Weiland
Finally, we demonstrate the efficacy of our method through a collision avoidance example, investigating the essential properties including safety, robustness, and smoothness under various tunable scaling terms.
no code implementations • 5 Mar 2024 • Ming Li, Zhiyong Sun, Siep Weiland
This paper revisits a classical challenge in the design of stabilizing controllers for nonlinear systems with a norm-bounded input constraint.
no code implementations • 3 Mar 2024 • Yifang Xu, Chenglei Peng, Ming Li, Yang Li, Sidan Du
Deep convolutional neural networks (DCNNs) have achieved great success in monocular depth estimation (MDE).
no code implementations • 28 Feb 2024 • Bing Li, Dong Zhang, Cheng Huang, Yun Xian, Ming Li, Dah-Jye Lee
Camera with a fisheye or ultra-wide lens covers a wide field of view that cannot be modeled by the perspective projection.
1 code implementation • 20 Feb 2024 • Xiaohan Xu, Ming Li, Chongyang Tao, Tao Shen, Reynold Cheng, Jinyang Li, Can Xu, DaCheng Tao, Tianyi Zhou
In the era of Large Language Models (LLMs), Knowledge Distillation (KD) emerges as a pivotal methodology for transferring advanced capabilities from leading proprietary LLMs, such as GPT-4, to their open-source counterparts like LLaMA and Mistral.
1 code implementation • 16 Feb 2024 • Ming Li, Jiuhai Chen, Lichang Chen, Tianyi Zhou
To examine DEBATUNE, we curate the largest dataset of debate topics so far, which covers 710 controversial topics and corresponding arguments for each topic.
2 code implementations • 15 Feb 2024 • Ming Li, Lichang Chen, Jiuhai Chen, Shwai He, Jiuxiang Gu, Tianyi Zhou
This paper introduces Selective Reflection-Tuning, a novel paradigm that synergizes a teacher LLM's reflection and introspection for improving existing data quality with the data selection capability of the student LLM, to automatically refine existing instruction-tuning data.
no code implementations • 14 Feb 2024 • Jiancheng Yang, Rui Shi, Liang Jin, Xiaoyang Huang, Kaiming Kuang, Donglai Wei, Shixuan Gu, Jianying Liu, PengFei Liu, Zhizhong Chai, Yongjie Xiao, Hao Chen, Liming Xu, Bang Du, Xiangyi Yan, Hao Tang, Adam Alessio, Gregory Holste, Jiapeng Zhang, Xiaoming Wang, Jianye He, Lixuan Che, Hanspeter Pfister, Ming Li, Bingbing Ni
The resulting FracNet+ demonstrates competitive performance in rib fracture detection, which lays a foundation for further research and development in AI-assisted rib fracture detection and diagnosis.
no code implementations • 9 Feb 2024 • Wenyu Li, Yinuo Zhu, Xin Lin, Ming Li, Ziyue Jiang, Ziqian Zeng
Traditional discriminative approaches in mental health analysis are known for their strong capacity but lack interpretability and demand large-scale annotated data.
1 code implementation • 1 Feb 2024 • Ming Li, Yong Zhang, Shwai He, Zhitao Li, Hongyu Zhao, Jianzong Wang, Ning Cheng, Tianyi Zhou
Data filtering for instruction tuning has proved important in improving both the efficiency and performance of the tuning process.
no code implementations • 18 Jan 2024 • Yong Zhang, Hanzhang Li, Zhitao Li, Ning Cheng, Ming Li, Jing Xiao, Jianzong Wang
Large Language Models (LLMs) have shown significant promise in various applications, including zero-shot and few-shot learning.
no code implementations • 16 Jan 2024 • Ming Cheng, Ming Li
The proposed method can take audio-visual input and leverage the speaker's acoustic footprint or lip track to flexibly conduct audio-based, video-based, and audio-visual speaker diarization in a unified sequence-to-sequence framework.
no code implementations • 11 Jan 2024 • Yixian Zheng, Rang Liu, Ming Li, Qian Liu
Integrated sensing and communication (ISAC) is an encouraging wireless technology which can simultaneously perform both radar and communication functionalities by sharing the same transmit waveform, spectral resource, and hardware platform.
no code implementations • 8 Jan 2024 • Yun Yang, Zhiping Lu, Ming Li, Rang Liu, Qian Liu
Motivated by this fact, in this paper we first investigate the amplification principle of typical active RIS and propose a more accurate amplification model based on amplifier hardware characteristics.
no code implementations • 3 Jan 2024 • Danwei Cai, Zexin Cai, Ming Li
Specifically, a teacher model continually refines pseudo labels through online clustering, providing dynamic supervision signals to train the student model.
no code implementations • 10 Dec 2023 • Letian Zhang, Ming Li, Chen Chen, Jie Xu
This poses a paradox as the necessary camera pose must be estimated from the entire dataset, even though the data arrives sequentially and future chunks are inaccessible.
no code implementations • 5 Dec 2023 • Ming Li, Zhiyong Sun
In our previous research, we developed an analytical control strategy, namely the universal formula, that incorporates CLF and CBF conditions for safe stabilization.
no code implementations • 4 Dec 2023 • Yufei Shi, Beijia Lu, Jia-Wei Liu, Ming Li, Mike Zheng Shou
Specifically, to reconstruct the entire colon in a piecewise manner, our ColonNeRF introduces a region division and integration module, effectively reducing shape dissimilarity and ensuring geometric consistency in each segment.
no code implementations • 30 Nov 2023 • Zhaoning Wang, Ming Li, Chen Chen
Specifically, our research demonstrates that Large Language Models (LLMs) possess 3D spatial awareness and can effectively translate textual 3D information into precise 3D bounding boxes.
no code implementations • 27 Nov 2023 • Ming Li, Guang Yang
In medical image analysis, Federated Learning (FL) stands out as a key technology that enables privacy-preserved, decentralized data processing, crucial for handling sensitive medical data.