1 code implementation • 30 May 2023 • Yuxuan Wang, Zilong Zheng, Xueliang Zhao, Jinpeng Li, Yueqian Wang, Dongyan Zhao
Video-grounded dialogue understanding is a challenging problem that requires machine to perceive, parse and reason over situated semantics extracted from weakly aligned video and dialogues.
no code implementations • 14 Mar 2023 • Xuchu Chen, Yu Pu, Jinpeng Li, Wei-Qiang Zhang
We present our submission to the ICASSP-SPGC-2023 ADReSS-M Challenge Task, which aims to investigate which acoustic features can be generalized and transferred across languages for Alzheimer's Disease (AD) prediction.
no code implementations • 12 Mar 2023 • Yi Wang, Jiaze Wang, Jinpeng Li, Zixu Zhao, Guangyong Chen, Anfeng Liu, Pheng-Ann Heng
With Point-MAE as our baseline, our model surpasses previous methods by a significant margin, achieving 86. 3% accuracy on ScanObjectNN and 94. 1% accuracy on ModelNet40.
no code implementations • 1 Nov 2022 • Enwei Zhu, Yiyang Liu, Ming Jin, Jinpeng Li
However, existing nested NER models heavily rely on training data annotated with nested entities, while labeling such data is costly.
1 code implementation • 13 Oct 2022 • Changde Du, Kaicheng Fu, Jinpeng Li, Huiguang He
Finally, we construct three trimodal matching datasets, and the extensive experiments lead to some interesting conclusions and cognitive insights: 1) decoding novel visual categories from human brain activity is practically possible with good accuracy; 2) decoding models using the combination of visual and linguistic features perform much better than those using either of them alone; 3) visual perception may be accompanied by linguistic influences to represent the semantics of visual stimuli.
1 code implementation • 9 Oct 2022 • Enwei Zhu, Yiyang Liu, Jinpeng Li
However, this typically results in significant ineffectiveness for long-span entities, a coupling between the representations of overlapping spans, and ultimately a performance degradation.
no code implementations • 23 Aug 2022 • Penghua Zhai, Enwei Zhu, Baolian Qi, Xin Wei, Jinpeng Li
In the past five years, several works have tailored for unsupervised representations of CT lesions via two-dimensional (2D) and three-dimensional (3D) self-supervised learning (SSL) algorithms.
1 code implementation • 23 Aug 2022 • Lingfeng li, Huaiwei Cong, Gangming Zhao, Junran Peng, Zheng Zhang, Jinpeng Li
However, due to the tissue overlap, X-ray images are difficult to provide fine-grained features for early diagnosis.
1 code implementation • 23 Aug 2022 • Jinkai Lv, Yuyong Hu, Quanshui Fu, Zhiwang Zhang, Yuqiang Hu, Lin Lv, Guoqing Yang, Jinpeng Li, Yi Zhao
However, those methods have the following challenges when dealing with the edges of the medical images: (1) Previous convolutional-based methods do not focus on the boundary relationship between foreground and background around the segmentation edge, which leads to the degradation of segmentation performance when the edge changes complexly.
1 code implementation • 23 Aug 2022 • Xin Wei, Huaiwei Cong, Zheng Zhang, Junran Peng, Guoping Chen, Jinpeng Li
Long-term vertebral fractures severely affect the life quality of patients, causing kyphotic, lumbar deformity and even paralysis.
1 code implementation • 22 Aug 2022 • Chengwei Pan, Baolian Qi, Gangming Zhao, Jiaheng Liu, Chaowei Fang, Dingwen Zhang, Jinpeng Li
In CTN, a transformer module is constructed in parallel to a U-Net to learn long-distance dependencies between different anatomical regions; and these dependencies are communicated to the U-Net at multiple stages to endow it with global awareness.
no code implementations • 8 Jul 2022 • Jinpeng Li, Haibo Jin, Shengcai Liao, Ling Shao, Pheng-Ann Heng
This paper presents a Refinement Pyramid Transformer (RePFormer) for robust facial landmark detection.
1 code implementation • 1 Jul 2022 • Chengwei Pan, Gangming Zhao, Junjie Fang, Baolian Qi, Jiaheng Liu, Chaowei Fang, Dingwen Zhang, Jinpeng Li, Yizhou Yu
Although deep learning algorithms have been intensively developed for computer-aided tuberculosis diagnosis (CTD), they mainly depend on carefully annotated datasets, leading to much time and resource consumption.
no code implementations • 29 Jun 2022 • Jing Zhao, Haoyu Wang, Jinpeng Li, Shuzhou Chai, Guan-Bo Wang, Guoguo Chen, Wei-Qiang Zhang
For the Constrained training condition, we construct our basic ASR system based on the standard hybrid architecture.
Automatic Speech Recognition
Automatic Speech Recognition (ASR)
+3
1 code implementation • ACL 2022 • Enwei Zhu, Jinpeng Li
Neural named entity recognition (NER) models may easily encounter the over-confidence issue, which degrades the performance and calibration.
Ranked #2 on
Chinese Named Entity Recognition
on Weibo NER
Chinese Named Entity Recognition
named-entity-recognition
+3
no code implementations • 18 Apr 2022 • Yan Li, Hao Chen, Jake Zhao, Haolan Zhang, Jinpeng Li
Specifically, numerous domain adaptation (DA) algorithms have been exploited in the past five years to enhance the generalization of emotion recognition models across subjects.
1 code implementation • 8 Mar 2022 • Enwei Zhu, Qilin Sheng, Huanwan Yang, Jinpeng Li
The resulted annotated corpus includes 1, 200 full medical records (or 18, 039 broken-down documents), and achieves inter-annotator agreements (IAAs) of 94. 53%, 73. 73% and 91. 98% F 1 scores for the three tasks.
2 code implementations • 10 Jan 2022 • Irtiza Hasan, Shengcai Liao, Jinpeng Li, Saad Ullah Akram, Ling Shao
As for the data, we show that the autonomous driving benchmarks are monotonous in nature, that is, they are not diverse in scenarios and dense in pedestrians.
1 code implementation • NeurIPS 2021 • Jinpeng Li, Yingce Xia, Rui Yan, Hongda Sun, Dongyan Zhao, Tie-Yan Liu
Considering there is no parallel data between the contexts and the responses of target style S1, existing works mainly use back translation to generate stylized synthetic data for training, where the data about context, target style S1 and an intermediate style S0 is used.
3 code implementations • 1 Sep 2021 • Yichao Yan, Jinpeng Li, Jie Qin, Shengcai Liao, Xiaokang Yang
Third, by investigating the advantages of both anchor-based and anchor-free models, we further augment AlignPS with an ROI-Align head, which significantly improves the robustness of re-id features while still keeping our model highly efficient.
Ranked #4 on
Person Search
on PRW
no code implementations • 17 Aug 2021 • Penghua Zhai, Huaiwei Cong, Gangming Zhao, Chaowei Fang, Jinpeng Li, Ting Cai, Huiguang He
To avoid the subjectivity associated with these methods, we propose the MVCNet, a novel unsupervised three dimensional (3D) representation learning method working in a transformation-free manner.
2 code implementations • 16 Aug 2021 • Bo Dong, Wenhai Wang, Deng-Ping Fan, Jinpeng Li, Huazhu Fu, Ling Shao
Unlike existing CNN-based methods, we adopt a transformer encoder, which learns more powerful and robust representations.
Ranked #6 on
Medical Image Segmentation
on CVC-ColonDB
1 code implementation • 16 Jul 2021 • Hao Chen, Ming Jin, Zhunan Li, Cunhang Fan, Jinpeng Li, Huiguang He
Although several studies have adopted domain adaptation (DA) approaches to tackle this problem, most of them treat multiple EEG data from different subjects and sessions together as a single source domain for transfer, which either fails to satisfy the assumption of domain adaptation that the source has a certain marginal distribution, or increases the difficulty of adaptation.
1 code implementation • 14 Jul 2021 • Baolian Qi, Gangming Zhao, Xin Wei, Changde Du, Chengwei Pan, Yizhou Yu, Jinpeng Li
To model the relationship, we propose the Graph Regularized Embedding Network (GREN), which leverages the intra-image and inter-image information to locate diseases on chest X-ray images.
no code implementations • 10 Jul 2021 • Jinpeng Li, Yichao Yan, Shengcai Liao, Xiaokang Yang, Ling Shao
Transformers have demonstrated great potential in computer vision tasks.
2 code implementations • 19 Jun 2021 • Yichao Yan, Jinpeng Li, Shengcai Liao, Jie Qin, Bingbing Ni, Xiaokang Yang, Ling Shao
This paper inventively considers weakly supervised person search with only bounding box annotations.
no code implementations • 27 May 2021 • Haibo Jin, Jinpeng Li, Shengcai Liao, Ling Shao
To this end, we first propose a baseline model equipped with one transformer decoder as detection head.
1 code implementation • CVPR 2021 • Yichao Yan, Jinpeng Li, Jie Qin, Song Bai, Shengcai Liao, Li Liu, Fan Zhu, Ling Shao
Person search aims to simultaneously localize and identify a query person from realistic, uncropped images, which can be regarded as the unified task of pedestrian detection and person re-identification (re-id).
Ranked #10 on
Person Search
on CUHK-SYSU
no code implementations • 3 Feb 2021 • Jinpeng Li, Yaling Tao, Ting Cai
We exploit liver cancer prediction model using machine learning algorithms based on epidemiological data of over 55 thousand peoples from 2014 to the present.
1 code implementation • 2020 IEEE International Conference on Bioinformatics and Biomedicine (BIBM) 2021 • Jinpeng Li, Hao Chen, Ting Cai
However, most of them are iterative methods, which need considerable training time and are unfeasible in practice.
no code implementations • 22 Jan 2021 • Gangming Zhao, Baolian Qi, Jinpeng Li
Locating lesions is important in the computer-aided diagnosis of X-ray images.
1 code implementation • CVPR 2021 • Irtiza Hasan, Shengcai Liao, Jinpeng Li, Saad Ullah Akram, Ling Shao
Furthermore, we illustrate that diverse and dense datasets, collected by crawling the web, serve to be an efficient source of pre-training for pedestrian detection.
Ranked #2 on
Pedestrian Detection
on CityPersons
(using extra training data)
no code implementations • 15 Apr 2019 • Shuai Chen, Jinpeng Li, Chuanqi Yao, Wenbo Hou, Shuo Qin, Wenyao Jin, Xu Tang
Working with multi-scale features, the designed dual scale residual unit makes dual scale detectors no longer run independently.
no code implementations • 25 Apr 2017 • Changde Du, Changying Du, Jinpeng Li, Wei-Long Zheng, Bao-liang Lu, Huiguang He
In this paper, we first build a multi-view deep generative model to simulate the generative process of multi-modality emotional data.