no code implementations • 10 Aug 2024 • Jinpeng Li, Yu Pu, Qi Sun, Wei-Qiang Zhang
It is worth researching how to utilize low-cost data to improve the performance of Whisper on under-represented languages.
no code implementations • 1 Jul 2024 • Changde Du, Kaicheng Fu, Bincheng Wen, Yi Sun, Jie Peng, Wei Wei, Ying Gao, Shengpei Wang, Chuncheng Zhang, Jinpeng Li, Shuang Qiu, Le Chang, Huiguang He
The conceptualization and categorization of natural objects in the human mind have long intrigued cognitive scientists and neuroscientists, offering crucial insights into human perception and cognition.
1 code implementation • 17 Jun 2024 • Yifan Yang, Zheshu Song, Jianheng Zhuo, Mingyu Cui, Jinpeng Li, Bo Yang, Yexing Du, Ziyang Ma, Xunying Liu, Ziyuan Wang, Ke Li, Shuai Fan, Kai Yu, Wei-Qiang Zhang, Guoguo Chen, Xie Chen
Notably, ASR models trained on GigaSpeech 2 can reduce the word error rate for Thai, Indonesian, and Vietnamese on our challenging and realistic YouTube test set by 25% to 40% compared to the Whisper large-v3 model, with merely 10% of the model parameters.
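For readers unfamiliar with the metric in the entry above, the following is a minimal sketch of how word error rate (WER) is conventionally computed, and of how a "25% to 40%" figure would be derived if it is read as a relative reduction (the excerpt does not say whether it is relative or absolute). The function names and the whitespace tokenization are assumptions for illustration only; this is not the GigaSpeech 2 evaluation code, and languages such as Thai need a proper word segmenter rather than `str.split`.

```python
def word_error_rate(reference: str, hypothesis: str) -> float:
    """WER = word-level edit distance / number of reference words."""
    ref = reference.split()   # assumption: whitespace-delimited words
    hyp = hypothesis.split()
    if not ref:
        raise ValueError("reference must contain at least one word")

    # prev[j] holds the edit distance between the reference prefix
    # processed so far and the first j hypothesis words.
    prev = list(range(len(hyp) + 1))
    for i, ref_word in enumerate(ref, start=1):
        curr = [i] + [0] * len(hyp)
        for j, hyp_word in enumerate(hyp, start=1):
            substitution = prev[j - 1] + (0 if ref_word == hyp_word else 1)
            deletion = prev[j] + 1       # reference word missing in hypothesis
            insertion = curr[j - 1] + 1  # extra word in hypothesis
            curr[j] = min(substitution, deletion, insertion)
        prev = curr
    return prev[-1] / len(ref)


def relative_reduction(baseline_wer: float, new_wer: float) -> float:
    """Fraction by which new_wer improves on baseline_wer."""
    return (baseline_wer - new_wer) / baseline_wer
```

Under this reading, a baseline WER of 0.20 that drops to 0.15 corresponds to a 25% relative reduction; the numbers here are made up for the arithmetic and are not results from the paper.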
no code implementations • 16 Jun 2024 • Zhenyu Zhang, Bingguang Hao, Jinpeng Li, Zekai Zhang, Dongyan Zhao
Most large language models (LLMs) are sensitive to prompts, and another synonymous expression or a typo may lead to unexpected results for the model.
no code implementations • 5 Jun 2024 • Xiaoxi Sun, Jinpeng Li, Yan Zhong, Dongyan Zhao, Rui Yan
The advent of large language models (LLMs) has facilitated the development of natural language text generation.
no code implementations • 29 May 2024 • Jiaze Wang, Hao Chen, Hongcan Xu, Jinpeng Li, Bowen Wang, Kun Shao, Furui Liu, Huaxi Chen, Guangyong Chen, Pheng-Ann Heng
Weather forecasting plays a critical role in various sectors, driving decision-making and risk management.
no code implementations • 18 Apr 2024 • Pengfei Wu, Jiahao Liu, Zhuocheng Gong, Qifan Wang, Jinpeng Li, Jingang Wang, Xunliang Cai, Dongyan Zhao
In this paper, we propose a novel parallel decoding approach, namely hidden transfer, which decodes multiple successive tokens simultaneously in a single forward pass.
no code implementations • 18 Mar 2024 • Jinpeng Li, Zekai Zhang, Quan Tu, Xin Cheng, Dongyan Zhao, Rui Yan
Furthermore, although many prompt-based methods have been proposed to accomplish specific tasks, their performance in complex real-world scenarios involving a wide variety of dialog styles needs further enhancement.
1 code implementation • 13 Mar 2024 • Jiayu Du, Jinpeng Li, Guoguo Chen, Wei-Qiang Zhang
In this paper we introduce the SpeechColab Leaderboard, a general-purpose, open-source platform designed for ASR evaluation.
no code implementations • 7 Feb 2024 • Gangming Zhao, Chaoqi Chen, Wenhao He, Chengwei Pan, Chaowei Fang, Jinpeng Li, Xilin Chen, Yizhou Yu
Moreover, as adjusting to the most recent target domain can interfere with the features learned from previous target domains, we develop a conservative sparse attention mechanism.
no code implementations • 22 Jan 2024 • Hao Chen, Jiaze Wang, Ziyu Guo, Jinpeng Li, Donghao Zhou, Bian Wu, Chenyong Guan, Guangyong Chen, Pheng-Ann Heng
Sign language recognition (SLR) plays a vital role in facilitating communication for the hearing-impaired community.
no code implementations • 11 Jan 2024 • Xinyuan Wang, Chengwei Pan, Hongming Dai, Gangming Zhao, Jinpeng Li, Xiao Zhang, Yizhou Yu
In this study, we leverage Fourier domain learning as a substitute for multi-scale convolutional kernels in 3D hierarchical segmentation models, which can reduce computational expenses while preserving global receptive fields within the network.
1 code implementation • 23 Aug 2023 • Donghao Zhou, Jialin Li, Jinpeng Li, Jiancheng Huang, Qiang Nie, Yong Liu, Bin-Bin Gao, Qiong Wang, Pheng-Ann Heng, Guangyong Chen
Unfortunately, the resultant noisy bounding boxes could cause corrupt supervision signals and thus diminish detection performance.
1 code implementation • 20 Aug 2023 • Quan Tu, Chuanqi Chen, Jinpeng Li, Yanran Li, Shuo Shang, Dongyan Zhao, Ran Wang, Rui Yan
In our modern, fast-paced, and interconnected world, the importance of mental well-being has grown into a matter of great urgency.
no code implementations • 29 Jun 2023 • Ang Lv, Jinpeng Li, Yuhan Chen, Xing Gao, Ji Zhang, Rui Yan
In open-domain dialogue generation tasks, contexts and responses in most datasets are one-to-one mapped, violating an important many-to-many characteristic: a context leads to various responses, and a response answers multiple contexts.
1 code implementation • 23 Jun 2023 • Shizhan Gong, Yuan Zhong, Wenao Ma, Jinpeng Li, Zhao Wang, Jingyang Zhang, Pheng-Ann Heng, Qi Dou
Notably, the original SAM architecture is designed for 2D natural images and therefore cannot effectively extract 3D spatial information from volumetric medical data.
1 code implementation • 30 May 2023 • Yuxuan Wang, Zilong Zheng, Xueliang Zhao, Jinpeng Li, Yueqian Wang, Dongyan Zhao
Video-grounded dialogue understanding is a challenging problem that requires machines to perceive, parse, and reason over situated semantics extracted from weakly aligned video and dialogues.
no code implementations • 14 Mar 2023 • Xuchu Chen, Yu Pu, Jinpeng Li, Wei-Qiang Zhang
We present our submission to the ICASSP-SPGC-2023 ADReSS-M Challenge Task, which aims to investigate which acoustic features can be generalized and transferred across languages for Alzheimer's Disease (AD) prediction.
no code implementations • 12 Mar 2023 • Yi Wang, Jiaze Wang, Jinpeng Li, Zixu Zhao, Guangyong Chen, Anfeng Liu, Pheng-Ann Heng
With Point-MAE as our baseline, our model surpasses previous methods by a significant margin, achieving 86.3% accuracy on ScanObjectNN and 94.1% accuracy on ModelNet40.
no code implementations • 1 Nov 2022 • Enwei Zhu, Yiyang Liu, Ming Jin, Jinpeng Li
However, existing nested NER models heavily rely on training data annotated with nested entities, while labeling such data is costly.
2 code implementations • 13 Oct 2022 • Changde Du, Kaicheng Fu, Jinpeng Li, Huiguang He
Finally, we construct three trimodal matching datasets, and the extensive experiments lead to some interesting conclusions and cognitive insights: 1) decoding novel visual categories from human brain activity is practically possible with good accuracy; 2) decoding models using the combination of visual and linguistic features perform much better than those using either of them alone; 3) visual perception may be accompanied by linguistic influences to represent the semantics of visual stimuli.
1 code implementation • 9 Oct 2022 • Enwei Zhu, Yiyang Liu, Jinpeng Li
However, this typically results in significant ineffectiveness for long-span entities, a coupling between the representations of overlapping spans, and ultimately a performance degradation.
1 code implementation • 23 Aug 2022 • Lingfeng Li, Huaiwei Cong, Gangming Zhao, Junran Peng, Zheng Zhang, Jinpeng Li
However, due to tissue overlap, it is difficult for X-ray images to provide fine-grained features for early diagnosis.
no code implementations • 23 Aug 2022 • Penghua Zhai, Enwei Zhu, Baolian Qi, Xin Wei, Jinpeng Li
In the past five years, several works have been tailored for unsupervised representations of CT lesions via two-dimensional (2D) and three-dimensional (3D) self-supervised learning (SSL) algorithms.
1 code implementation • 23 Aug 2022 • Jinkai Lv, Yuyong Hu, Quanshui Fu, Zhiwang Zhang, Yuqiang Hu, Lin Lv, Guoqing Yang, Jinpeng Li, Yi Zhao
However, those methods face the following challenges when dealing with the edges of medical images: (1) Previous convolution-based methods do not focus on the boundary relationship between foreground and background around the segmentation edge, which degrades segmentation performance when the edge varies in complex ways.
1 code implementation • 23 Aug 2022 • Xin Wei, Huaiwei Cong, Zheng Zhang, Junran Peng, Guoping Chen, Jinpeng Li
Long-term vertebral fractures severely affect the quality of life of patients, causing kyphosis, lumbar deformity, and even paralysis.
1 code implementation • 22 Aug 2022 • Chengwei Pan, Baolian Qi, Gangming Zhao, Jiaheng Liu, Chaowei Fang, Dingwen Zhang, Jinpeng Li
In CTN, a transformer module is constructed in parallel to a U-Net to learn long-distance dependencies between different anatomical regions; and these dependencies are communicated to the U-Net at multiple stages to endow it with global awareness.
no code implementations • 8 Jul 2022 • Jinpeng Li, Haibo Jin, Shengcai Liao, Ling Shao, Pheng-Ann Heng
This paper presents a Refinement Pyramid Transformer (RePFormer) for robust facial landmark detection.
1 code implementation • 1 Jul 2022 • Chengwei Pan, Gangming Zhao, Junjie Fang, Baolian Qi, Jiaheng Liu, Chaowei Fang, Dingwen Zhang, Jinpeng Li, Yizhou Yu
Although deep learning algorithms have been intensively developed for computer-aided tuberculosis diagnosis (CTD), they mainly depend on carefully annotated datasets, leading to much time and resource consumption.
no code implementations • 29 Jun 2022 • Jing Zhao, Haoyu Wang, Jinpeng Li, Shuzhou Chai, Guan-Bo Wang, Guoguo Chen, Wei-Qiang Zhang
For the Constrained training condition, we construct our basic ASR system based on the standard hybrid architecture.
1 code implementation • ACL 2022 • Enwei Zhu, Jinpeng Li
Neural named entity recognition (NER) models may easily encounter the over-confidence issue, which degrades the performance and calibration.
Ranked #2 on Chinese Named Entity Recognition on Weibo NER
no code implementations • 18 Apr 2022 • Yan Li, Hao Chen, Jake Zhao, Haolan Zhang, Jinpeng Li
Specifically, numerous domain adaptation (DA) algorithms have been exploited in the past five years to enhance the generalization of emotion recognition models across subjects.
1 code implementation • 8 Mar 2022 • Enwei Zhu, Qilin Sheng, Huanwan Yang, Jinpeng Li
The resulting annotated corpus includes 1,200 full medical records (or 18,039 broken-down documents), and achieves inter-annotator agreements (IAAs) of 94.53%, 73.73% and 91.98% F1 scores for the three tasks.
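As an illustration of how an F1-based inter-annotator agreement like the one reported above can be computed, here is a minimal sketch that treats each annotator's output as a set of annotations and scores their overlap. The function name, the tuple layout `(start, end, label)`, and the exact-match criterion are assumptions for the example; the excerpt does not describe the authors' actual matching rules.

```python
def agreement_f1(annotator_a: set, annotator_b: set) -> float:
    """F1 between two annotation sets, counting only exact matches.

    Annotator A is treated as the reference and B as the prediction;
    the resulting F1 is the same if the roles are swapped.
    """
    if not annotator_a and not annotator_b:
        return 1.0  # both annotators marked nothing: perfect agreement
    overlap = len(annotator_a & annotator_b)
    precision = overlap / len(annotator_b) if annotator_b else 0.0
    recall = overlap / len(annotator_a) if annotator_a else 0.0
    if precision + recall == 0:
        return 0.0
    return 2 * precision * recall / (precision + recall)


# Hypothetical annotations: (start offset, end offset, entity label).
a = {(0, 5, "DISEASE"), (10, 14, "DRUG"), (20, 25, "SYMPTOM")}
b = {(0, 5, "DISEASE"), (10, 14, "DRUG"), (30, 33, "DRUG")}
score = agreement_f1(a, b)
```

In this made-up case the annotators share two of three annotations each, so precision and recall are both 2/3 and the agreement F1 is 2/3.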
2 code implementations • 10 Jan 2022 • Irtiza Hasan, Shengcai Liao, Jinpeng Li, Saad Ullah Akram, Ling Shao
As for the data, we show that the autonomous driving benchmarks are monotonous in nature, that is, they are neither diverse in scenarios nor dense in pedestrians.
1 code implementation • NeurIPS 2021 • Jinpeng Li, Yingce Xia, Rui Yan, Hongda Sun, Dongyan Zhao, Tie-Yan Liu
Considering there is no parallel data between the contexts and the responses of target style S1, existing works mainly use back translation to generate stylized synthetic data for training, where the data about context, target style S1 and an intermediate style S0 is used.
4 code implementations • 1 Sep 2021 • Yichao Yan, Jinpeng Li, Jie Qin, Shengcai Liao, Xiaokang Yang
Third, by investigating the advantages of both anchor-based and anchor-free models, we further augment AlignPS with an ROI-Align head, which significantly improves the robustness of re-id features while still keeping our model highly efficient.
Ranked #4 on Person Search on PRW
no code implementations • 17 Aug 2021 • Penghua Zhai, Huaiwei Cong, Gangming Zhao, Chaowei Fang, Jinpeng Li, Ting Cai, Huiguang He
To avoid the subjectivity associated with these methods, we propose MVCNet, a novel unsupervised three-dimensional (3D) representation learning method working in a transformation-free manner.
2 code implementations • 16 Aug 2021 • Bo Dong, Wenhai Wang, Deng-Ping Fan, Jinpeng Li, Huazhu Fu, Ling Shao
Unlike existing CNN-based methods, we adopt a transformer encoder, which learns more powerful and robust representations.
Ranked #10 on Medical Image Segmentation on CVC-ColonDB
1 code implementation • 16 Jul 2021 • Hao Chen, Ming Jin, Zhunan Li, Cunhang Fan, Jinpeng Li, Huiguang He
Although several studies have adopted domain adaptation (DA) approaches to tackle this problem, most of them treat multiple EEG data from different subjects and sessions together as a single source domain for transfer, which either fails to satisfy the assumption of domain adaptation that the source has a certain marginal distribution, or increases the difficulty of adaptation.
1 code implementation • 14 Jul 2021 • Baolian Qi, Gangming Zhao, Xin Wei, Changde Du, Chengwei Pan, Yizhou Yu, Jinpeng Li
To model the relationship, we propose the Graph Regularized Embedding Network (GREN), which leverages the intra-image and inter-image information to locate diseases on chest X-ray images.
no code implementations • 10 Jul 2021 • Jinpeng Li, Yichao Yan, Shengcai Liao, Xiaokang Yang, Ling Shao
Transformers have demonstrated great potential in computer vision tasks.
3 code implementations • 19 Jun 2021 • Yichao Yan, Jinpeng Li, Shengcai Liao, Jie Qin, Bingbing Ni, Xiaokang Yang, Ling Shao
This paper inventively considers weakly supervised person search with only bounding box annotations.
no code implementations • 27 May 2021 • Haibo Jin, Jinpeng Li, Shengcai Liao, Ling Shao
To this end, we first propose a baseline model equipped with one transformer decoder as the detection head.
Ranked #5 on Face Alignment on COFW
1 code implementation • CVPR 2021 • Yichao Yan, Jinpeng Li, Jie Qin, Song Bai, Shengcai Liao, Li Liu, Fan Zhu, Ling Shao
Person search aims to simultaneously localize and identify a query person from realistic, uncropped images, which can be regarded as the unified task of pedestrian detection and person re-identification (re-id).
Ranked #10 on Person Search on CUHK-SYSU
no code implementations • 3 Feb 2021 • Jinpeng Li, Yaling Tao, Ting Cai
We explore a liver cancer prediction model using machine learning algorithms based on epidemiological data from over 55 thousand people, collected from 2014 to the present.
1 code implementation • 2020 IEEE International Conference on Bioinformatics and Biomedicine (BIBM) 2021 • Jinpeng Li, Hao Chen, Ting Cai
However, most of them are iterative methods, which need considerable training time and are unfeasible in practice.
no code implementations • 22 Jan 2021 • Gangming Zhao, Baolian Qi, Jinpeng Li
Locating lesions is important in the computer-aided diagnosis of X-ray images.
1 code implementation • CVPR 2021 • Irtiza Hasan, Shengcai Liao, Jinpeng Li, Saad Ullah Akram, Ling Shao
Furthermore, we illustrate that diverse and dense datasets, collected by crawling the web, serve as an efficient source of pre-training for pedestrian detection.
Ranked #3 on Pedestrian Detection on CityPersons (using extra training data)
no code implementations • 15 Apr 2019 • Shuai Chen, Jinpeng Li, Chuanqi Yao, Wenbo Hou, Shuo Qin, Wenyao Jin, Xu Tang
Working with multi-scale features, the designed dual scale residual unit makes dual scale detectors no longer run independently.
no code implementations • 25 Apr 2017 • Changde Du, Changying Du, Jinpeng Li, Wei-Long Zheng, Bao-liang Lu, Huiguang He
In this paper, we first build a multi-view deep generative model to simulate the generative process of multi-modality emotional data.