no code implementations • WASSA (ACL) 2022 • Bin Li, Yixuan Weng, Qiya Song, Bin Sun, Shutao Li
This paper describes the contribution of the LingJing team’s method to the Workshop on Computational Approaches to Subjectivity, Sentiment & Social Media Analysis (WASSA) 2022 shared task on Emotion Classification.
1 code implementation • SemEval (NAACL) 2022 • Fei Xia, Bin Li, Yixuan Weng, Shizhu He, Bin Sun, Shutao Li, Kang Liu, Jun Zhao
For the classification sub-task, we adopt the DeBERTa-v3 pre-trained model for fine-tuning datasets of different languages.
2 code implementations • SemEval (NAACL) 2022 • Bin Li, Yixuan Weng, Fei Xia, Shizhu He, Bin Sun, Shutao Li
This paper introduces the approach of Team LingJing’s experiments on SemEval-2022 Task 1 Comparing Dictionaries and Word Embeddings (CODWOE).
1 code implementation • BioNLP (ACL) 2022 • Bin Li, Yixuan Weng, Fei Xia, Bin Sun, Shutao Li
Given an input video, the MedVidCL task aims to correctly classify it into one of three following categories: Medical Instructional, Medical Non-instructional, and Non-medical.
no code implementations • 21 Dec 2024 • Hang Yin, Zhifeng Lin, Xin Liu, Bin Sun, Kan Li
Direction reasoning is essential for intelligent systems to understand the real world.
no code implementations • 18 Jun 2024 • Ziyu Ma, Chenhui Gou, Hengcan Shi, Bin Sun, Shutao Li, Hamid Rezatofighi, Jianfei Cai
Specifically, DrVideo first transforms a long video into a coarse text-based long document to initially retrieve key frames and then updates the documents with the augmented key frame information.
no code implementations • 12 Jun 2024 • Yiwei Li, Fei Mi, Yitong Li, Yasheng Wang, Bin Sun, Shaoxiong Feng, Kan Li
In DDS, both sequence-level and token-level adaptive search can be achieved to adjust the decoding process in a unified framework.
1 code implementation • 4 Feb 2024 • Ziyu Ma, Shutao Li, Bin Sun, Jianfei Cai, Zuxiang Long, Fuyan Ma
Therefore, we propose GeReA, a generate-reason framework that prompts a MLLM like InstructBLIP with question relevant vision and language information to generate knowledge-relevant descriptions and reasons those descriptions for knowledge-based VQA.
1 code implementation • 19 Jan 2024 • Yiwei Li, Peiwen Yuan, Shaoxiong Feng, Boyuan Pan, Xinglin Wang, Bin Sun, HeDa Wang, Kan Li
Self-consistency (SC) has been a widely used decoding strategy for chain-of-thought reasoning.
1 code implementation • 20 Dec 2023 • Yiwei Li, Peiwen Yuan, Shaoxiong Feng, Boyuan Pan, Bin Sun, Xinglin Wang, HeDa Wang, Kan Li
In this work, we illustrate the merit of negative data and propose a model specialization framework to distill LLMs with negative samples besides positive ones.
1 code implementation • 17 Oct 2023 • Hang Yin, Pinren Lu, Ziang Li, Bin Sun, Kan Li
The need for high-quality data has been a key issue hindering the research of dialogue tasks.
1 code implementation • 9 May 2023 • Yixuan Weng, Bin Li, Fei Xia, Minjun Zhu, Bin Sun, Shizhu He, Kang Liu, Jun Zhao
The medical conversational question answering (CQA) system aims at providing a series of professional medical services to improve the efficiency of medical care.
no code implementations • 5 May 2023 • Fuyan Ma, Bin Sun, Shutao Li
Previous methods for dynamic facial expression recognition (DFER) in the wild are mainly based on Convolutional Neural Networks (CNNs), whose local operations ignore the long-range dependencies in videos.
Dynamic Facial Expression Recognition
Facial Expression Recognition
no code implementations • 21 Mar 2023 • Yiwei Li, Shaoxiong Feng, Bin Sun, Kan Li
Collaborative learning, also known as online knowledge distillation, is an effective way to conduct one-stage group distillation in the absence of a well-trained large teacher model.
2 code implementations • 2 Mar 2023 • Xu Ma, Yuqian Zhou, Huan Wang, Can Qin, Bin Sun, Chang Liu, Yun Fu
Context clusters (CoCs) view an image as a set of unorganized points and extract features via simplified clustering algorithm.
1 code implementation • 19 Dec 2022 • Yixuan Weng, Minjun Zhu, Fei Xia, Bin Li, Shizhu He, Shengping Liu, Bin Sun, Kang Liu, Jun Zhao
By performing a backward verification of the answers that LLM deduced for itself, we can obtain interpretable answer validation scores to select the candidate answer with the highest score.
no code implementations • 2 Dec 2022 • Bin Sun, Yitong Li, Fei Mi, Weichao Wang, Yiwei Li, Kan Li
Specifically, HLV constrains the global semantics of responses through discrete latent variables and enriches responses with continuous latent variables.
no code implementations • 1 Dec 2022 • Bin Sun, Shaoxiong Feng, Yiwei Li, Weichao Wang, Fei Mi, Yitong Li, Kan Li
Complex dialogue mappings (CDM), including one-to-many and many-to-one mappings, tend to make dialogue models generate incoherent or dull responses, and modeling these mappings remains a huge challenge for neural dialogue systems.
1 code implementation • 11 Oct 2022 • Bin Li, Yixuan Weng, Bin Sun, Shutao Li
We introduce a new task, named video corpus visual answer localization (VCVAL), which aims to locate the visual answer in a large collection of untrimmed instructional videos using a natural language question.
no code implementations • 26 Jul 2022 • Bin Sun
Multiple object tracking (MOT) is the task containing detection and association.
no code implementations • 5 Jul 2022 • Bin Li, Yixuan Weng, Ziyu Ma, Bin Sun, Shutao Li
To fully leverage the visual information for both scene understanding and dialogue generation, we propose the scene-aware prompt for the MDUG task.
no code implementations • 20 Jun 2022 • Zixuan Wang, Bin Sun
Infrared and visible images, as multi-modal image pairs, show significant differences in the expression of the same scene.
1 code implementation • CVPR 2022 • Xu Ma, Yuqian Zhou, Xingqian Xu, Bin Sun, Valerii Filev, Nikita Orlov, Yun Fu, Humphrey Shi
Image rasterization is a mature technique in computer graphics, while image vectorization, the reverse path of rasterization, remains a major challenge.
no code implementations • 23 May 2022 • Yiwei Li, Bin Sun, Shaoxiong Feng, Kan Li
However, the discarded samples may obtain high scores in other perspectives and can provide regularization effects on the model learning, which causes the performance improvement to be sensitive to the filtering ratio.
no code implementations • 10 May 2022 • Fuyan Ma, Bin Sun, Shutao Li
Previous methods for dynamic facial expression in the wild are mainly based on Convolutional Neural Networks (CNNs), whose local operations ignore the long-range dependencies in videos.
Ranked #10 on
Dynamic Facial Expression Recognition
on FERV39k
Dynamic Facial Expression Recognition
Facial Expression Recognition
+1
no code implementations • NAACL 2022 • Yiwei Li, Shaoxiong Feng, Bin Sun, Kan Li
Generative dialogue models suffer badly from the generic response problem, limiting their applications to a few toy scenarios.
1 code implementation • 20 Apr 2022 • Fei Xia, Bin Li, Yixuan Weng, Shizhu He, Kang Liu, Bin Sun, Shutao Li, Jun Zhao
The medical conversational system can relieve the burden of doctors and improve the efficiency of healthcare, especially during the pandemic.
1 code implementation • 16 Mar 2022 • Bin Sun, Yulun Zhang, Songyao Jiang, Yun Fu
In this paper, we propose a novel Hybrid Pixel-Unshuffled Network (HPUN) by introducing an efficient and effective downsampling module into the SR task.
no code implementations • 13 Mar 2022 • Bin Li, Yixuan Weng, Bin Sun, Shutao Li
However, due to the weak correlations and huge gaps of the semantic features between the textual question and visual answer, existing methods adopting visual span predictor perform poorly in the TAGV task.
no code implementations • 29 Nov 2021 • Bin Li, Fei Xia, Yixuan Weng, Xiusheng Huang, Bin Sun
In this paper, we propose a Simple framework for Contrastive Learning of Acronym Disambiguation (SimCLAD) method to better understand the acronym meanings.
no code implementations • 29 Nov 2021 • Bin Li, Fei Xia, Yixuan Weng, Xiusheng Huang, Bin Sun, Shutao Li
In this paper, we propose a Prompt-based Sequence Generation (PSG) method for the acronym extraction task.
no code implementations • 16 Oct 2021 • Ziyu Ma, Fuyan Ma, Bin Sun, Shutao Li
For the MuSe-Stress sub-challenge, we highlight our solutions in three aspects: 1) the audio-visual features and the bio-signal features are used for emotional state recognition.
2 code implementations • 12 Oct 2021 • Songyao Jiang, Bin Sun, Lichen Wang, Yue Bai, Kunpeng Li, Yun Fu
Current Sign Language Recognition (SLR) methods usually extract features via deep neural networks and suffer overfitting due to limited and noisy data.
no code implementations • 7 Sep 2021 • Bin Sun, Shaofan Wang, Dehui Kong, Jinghua Li, BaoCai Yin
GGLS presents a landmark selection scheme using attention-induced neighbors of the graphical structure of samples and performs distribution adaptation and knowledge adaptation over Grassmann manifold.
no code implementations • 3 Aug 2021 • Bin Li, Encheng Chen, Hongru Liu, Yixuan Weng, Bin Sun, Shutao Li, Yongping Bai, Meiling Hu
Medical Dialogue Generation (MDG) is intended to build a medical dialogue system for intelligent consultation, which can communicate with patients in real-time, thereby improving the efficiency of clinical diagnosis with broad application prospects.
no code implementations • ACL 2021 • Bin Sun, Shaoxiong Feng, Yiwei Li, Jiamou Liu, Kan Li
Conditional Variational AutoEncoder (CVAE) effectively increases the diversity and informativeness of responses in open-ended dialogue generation tasks through enriching the context vector with sampled latent variables.
no code implementations • 28 May 2021 • Bin Sun, Shaoxiong Feng, Yiwei Li, Jiamou Liu, Kan Li
In this work, we proposed a conversation model named "THINK" (Teamwork generation Hover around Impressive Noticeable Keywords) to make the decoder more complicated and avoid generating duplicated and self-contradicting responses.
no code implementations • 25 May 2021 • Bin Sun, Dehui Kong, Shaofan Wang, Jinghua Li, BaoCai Yin, Xiaonan Luo
In the sampling stage, we utilize a generative adversarial networks (GAN) trained by action features and word vectors of seen classes to synthesize the action features of unseen classes, which can balance the training sample data of seen classes and unseen classes.
no code implementations • 24 May 2021 • Bin Sun, Shaofan Wang, Dehui Kong, LiChun Wang, BaoCai Yin
To tackle all these problems, we propose a real-time 3D action recognition framework by integrating the locally aggregated kinematic-guided skeletonlet (LAKS) with a supervised hashing-by-analysis (SHA) model.
no code implementations • 31 Mar 2021 • Fuyan Ma, Bin Sun, Shutao Li
Facial Expression Recognition (FER) in the wild is extremely challenging due to occlusions, variant head poses, face deformation and motion blur under unconstrained conditions.
Facial Expression Recognition
Facial Expression Recognition (FER)
3 code implementations • 16 Mar 2021 • Songyao Jiang, Bin Sun, Lichen Wang, Yue Bai, Kunpeng Li, Yun Fu
Sign language is commonly used by deaf or speech impaired people to communicate but requires significant effort to master.
Ranked #2 on
Sign Language Recognition
on AUTSL
(using extra training data)
no code implementations • EMNLP 2020 • Shaoxiong Feng, Xuancheng Ren, Hongshen Chen, Bin Sun, Kan Li, Xu sun
Human dialogues are scenario-based and appropriate responses generally relate to the latent context knowledge entailed by the specific scenario.
no code implementations • 8 Aug 2020 • Renwei Dian, Shutao Li, Bin Sun, Anjing Guo
Hyperspectral image (HSI) with high spectral resolution often suffers from low spatial resolution owing to the limitations of imaging sensors.
no code implementations • 23 Dec 2019 • Yuhua Chen, Dan Ruan, Jiayu Xiao, Lixia Wang, Bin Sun, Rola Saouaf, Wensha Yang, Debiao Li, Zhaoyang Fan
The model takes in multi-slice MR images and generates the output of segmentation results.
no code implementations • 19 Nov 2019 • Ying Huang, Bin Sun, Haipeng Kan, Jiankai Zhuang, Zengchang Qin
Human pose estimation has made significant advancement in recent years.
no code implementations • 25 Oct 2019 • Bin Sun, Ming Shao, Siyu Xia, Yun Fu
To accelerate the model, we propose an efficient network structure to accelerate the evolutionary learning process through a factorization strategy.
no code implementations • 25 Oct 2019 • Bin Sun, Jun Li, Ming Shao, Yun Fu
To reduce the computation and memory costs, we propose a novel lightweight deep learning module by low-rank pointwise residual (LPR) convolution, called LPRNet.
no code implementations • 4 Aug 2019 • Nansen Petrosyan, Bin Sun
We apply these results to obtain hyperbolic and acylindrically hyperbolic quotients with special properties.
Group Theory 20F67, 20F10, 20E06
no code implementations • 29 Jul 2019 • Bin Sun
This is the second paper in a series of three papers aiming to study cohomology of group theoretic Dehn fillings.
Group Theory
1 code implementation • 20 Apr 2019 • Lichen Wang, Bin Sun, Joseph Robinson, Taotao Jing, Yun Fu
To make up this, we introduce a new, large-scale EV-Action dataset in this work, which consists of RGB, depth, electromyography (EMG), and two skeleton modalities.
Ranked #4 on
Multimodal Activity Recognition
on EV-Action
no code implementations • 12 Apr 2019 • Bin Sun, Chen Chen, Yingying Zhu, Jianmin Jiang
The task of cross-view image geo-localization aims to determine the geo-location (GPS coordinates) of a query ground-view image by matching it with the GPS-tagged aerial (satellite) images in a reference dataset.