no code implementations • NLP4ConvAI (ACL) 2022 • Tong Zhang, Yong liu, Boyang Li, Peixiang Zhong, Chen Zhang, Hao Wang, Chunyan Miao
Conversational Recommendation Systems recommend items through language based interactions with users. In order to generate naturalistic conversations and effectively utilize knowledge graphs (KGs) containing background information, we propose a novel Bag-of-Entities loss, which encourages the generated utterances to mention concepts related to the item being recommended, such as the genre or director of a movie.
1 code implementation • 23 May 2023 • Shitian He, Huanxin Zou, Yingqian Wang, Boyang Li, Xu Cao, Ning Jing
In this paper, we make the first attempt to achieve RS object detection with single point supervision, and propose a PSOD framework tailored with RS images.
1 code implementation • 11 May 2023 • Wenliang Dai, Junnan Li, Dongxu Li, Anthony Meng Huat Tiong, Junqi Zhao, Weisheng Wang, Boyang Li, Pascale Fung, Steven Hoi
In this paper, we conduct a systematic and comprehensive study on vision-language instruction tuning based on the pre-trained BLIP-2 models.
no code implementations • 20 Apr 2023 • Qin Chao, Eunsoo Kim, Boyang Li
Investments in movie production are associated with a high level of risk as movie revenues have long-tailed and bimodal distributions.
1 code implementation • 10 Apr 2023 • Boyang Li, Yingqian Wang, Longguang Wang, Fei Zhang, Ting Liu, Zaiping Lin, Wei An, Yulan Guo
The core idea of this work is to recover the per-pixel mask of each target from the given single point label by using clustering approaches, which looks simple but is indeed challenging since targets are always insalient and accompanied with background clutters.
no code implementations • 2 Feb 2023 • Tong Zhang, Yong liu, Boyang Li, Zhiwei Zeng, Pengwei Wang, Yuan You, Chunyan Miao, Lizhen Cui
HAHT maintains a long-term memory of history conversations and utilizes history information to understand current conversation context and generate well-informed and context-relevant responses.
no code implementations • CVPR 2023 • Jiaxian Guo, Junnan Li, Dongxu Li, Anthony Meng Huat Tiong, Boyang Li, DaCheng Tao, Steven Hoi
To address this issue, we propose Img2Prompt, a plug-and-play module that provides the prompts that can bridge the aforementioned modality and task disconnections, so that LLMs can perform zero-shot VQA tasks without end-to-end training.
1 code implementation • 21 Dec 2022 • Jiaxian Guo, Junnan Li, Dongxu Li, Anthony Meng Huat Tiong, Boyang Li, DaCheng Tao, Steven C. H. Hoi
To address this issue, we propose \emph{Img2Prompt}, a plug-and-play module that provides the prompts that can bridge the aforementioned modality and task disconnections, so that LLMs can perform zero-shot VQA tasks without end-to-end training.
no code implementations • 20 Dec 2022 • Bosheng Ding, Chengwei Qin, Linlin Liu, Lidong Bing, Shafiq Joty, Boyang Li
In this paper, we evaluate the performance of GPT-3 as a data annotator by comparing it with traditional data annotation methods and analyzing its output on a range of tasks.
no code implementations • 23 Nov 2022 • Haoxin Li, YuAn Liu, Hanwang Zhang, Boyang Li
In video action recognition, shortcut static features can interfere with the learning of motion features, resulting in poor out-of-distribution (OOD) generalization.
no code implementations • 20 Oct 2022 • Jiayun Luo, Boyang Li, Cyril Leung
In addition, we discuss five key subareas of computer vision and how they related to these CEA problems, as well as nine vision-based CEA datasets.
1 code implementation • 17 Oct 2022 • Anthony Meng Huat Tiong, Junnan Li, Boyang Li, Silvio Savarese, Steven C. H. Hoi
Visual question answering (VQA) is a hallmark of vision and language reasoning and a challenging task under the zero-shot setting.
Ranked #5 on
Visual Question Answering (VQA)
on VQA v2 val
1 code implementation • 6 Oct 2022 • Xu Guo, Boyang Li, Han Yu
Prompt tuning, or the conditioning of a frozen pretrained language model (PLM) with soft prompts learned from data, has demonstrated impressive performance on a wide range of NLP tasks.
1 code implementation • 28 Sep 2022 • Tianhao Wu, Boyang Li, Yihang Luo, Yingqian Wang, Chao Xiao, Ting Liu, Jungang Yang, Wei An, Yulan Guo
Due to the extremely large image coverage area (e. g., thousands square kilometers), candidate targets in these images are much smaller, dimer, more changeable than those targets observed by aerial-based and land-based imaging devices.
no code implementations • 29 Jun 2022 • Yinan Zhang, Boyang Li, Yong liu, You Yuan, Chunyan Miao
Multi-shot CRS is designed to make recommendations multiple times until the user either accepts the recommendation or leaves at the end of their patience.
1 code implementation • 1 Jun 2022 • Jun Chen, Ming Hu, Boyang Li, Mohamed Elhoseiny
After finetuning the pretrained LoMaR on 384$\times$384 images, it can reach 85. 4% top-1 accuracy, surpassing MAE by 0. 6%.
no code implementations • 5 May 2022 • Boyang Li, Qing Lu, Weiwen Jiang, Taeho Jung, Yiyu Shi
In many recent novel blockchain consensuses, the deep learning training procedure becomes the task for miners to prove their workload, thus the computation power of miners will not purely be spent on the hash puzzle.
no code implementations • 11 Mar 2022 • Yidan Sun, Qin Chao, Yangfeng Ji, Boyang Li
Despite recent advances of AI, story understanding remains an open and under-investigated problem.
no code implementations • 19 Oct 2021 • Anthony Meng Huat Tiong, Junnan Li, Guosheng Lin, Boyang Li, Caiming Xiong, Steven C. H. Hoi
ICCL interpolates two images from a class-agnostic sampler and a class-aware sampler, and trains the model such that the representation of the interpolative image can be used to retrieve the centroids for both source classes.
Ranked #20 on
Long-tail Learning
on CIFAR-10-LT (ρ=10)
no code implementations • 3 Aug 2021 • Chang Liu, Han Yu, Boyang Li, Zhiqi Shen, Zhanning Gao, Peiran Ren, Xuansong Xie, Lizhen Cui, Chunyan Miao
Noisy labels are commonly found in real-world data, which cause performance degradation of deep neural networks.
no code implementations • 9 Jun 2021 • Yinan Zhang, Boyang Li, Yong liu, Hao Wang, Chunyan Miao
In this work, we propose a new initialization scheme for user and item embeddings called Laplacian Eigenmaps with Popularity-based Regularization for Isolated Data (LEPORID).
1 code implementation • 1 Jun 2021 • Boyang Li, Chao Xiao, Longguang Wang, Yingqian Wang, Zaiping Lin, Miao Li, Wei An, Yulan Guo
With the repeated interaction in DNIM, infrared small targets in deep layers can be maintained.
1 code implementation • 31 May 2021 • Ting Liu, Jungang Yang, Boyang Li, Chao Xiao, Yang Sun, Yingqian Wang, Wei An
Considering that different singular values have different importance and should be treated discriminatively, in this paper, we propose a non-convex tensor low-rank approximation (NTLA) method for infrared small target detection.
no code implementations • Proceedings of the Sixteenth European Conference on Computer Systems 2021 • Yidi Wu, Kaihao Ma, Zhenkun Cai, Tatiana Jin, Boyang Li, Chenguang Zheng, James Cheng, Fan Yu
Graph neural networks (GNNs) have achieved breakthrough performance in graph analytics such as node classification, link prediction and graph clustering.
1 code implementation • NAACL 2021 • Xu Guo, Boyang Li, Han Yu, Chunyan Miao
The existence of multiple datasets for sarcasm detection prompts us to apply transfer learning to exploit their commonality.
1 code implementation • CVPR 2021 • Chang Liu, Han Yu, Boyang Li, Zhiqi Shen, Zhanning Gao, Peiran Ren, Xuansong Xie, Lizhen Cui, Chunyan Miao
The existence of noisy labels in real-world data negatively impacts the performance of deep learning models.
1 code implementation • CVPR 2022 • Jun Chen, Han Guo, Kai Yi, Boyang Li, Mohamed Elhoseiny
To the best of our knowledge, this is the first work that improves data efficiency of image captioning by utilizing LM pretrained on unimodal data.
1 code implementation • 4 Feb 2021 • YuanYuan Chen, Boyang Li, Han Yu, Pengcheng Wu, Chunyan Miao
the weights of training data, HYDRA assesses the contribution of training data toward test data points throughout the training trajectory.
no code implementations • 3 Dec 2020 • Xu Guo, Han Yu, Boyang Li, Hao Wang, Pengwei Xing, Siwei Feng, Zaiqing Nie, Chunyan Miao
In this paper, we propose the FedHumor approach for the recognition of humorous content in a personalized manner through Federated Learning (FL).
1 code implementation • 15 Nov 2020 • Jianan Wang, Boyang Li, Xiangyu Fan, Jing Lin, Yanwei Fu
The task of video and text sequence alignment is a prerequisite step toward joint understanding of movie videos and screenplays.
no code implementations • 29 Jul 2020 • Yixiao Lan, Yu-An Liu, Boyang Li
Empirical evaluation shows that SML can detect cheating nodes at small cost to the predictive performance.
3 code implementations • ICCV 2021 • Sherif Abdelkarim, Aniket Agarwal, Panos Achlioptas, Jun Chen, Jiaji Huang, Boyang Li, Kenneth Church, Mohamed Elhoseiny
We use these benchmarks to study the performance of several state-of-the-art long-tail models on the LTVRR setup.
no code implementations • 30 Dec 2019 • Xin Zhou, Dejing Dou, Boyang Li
Search space is a key consideration for neural architecture search.
1 code implementation • 7 Dec 2019 • Adam Noack, Isaac Ahern, Dejing Dou, Boyang Li
We demonstrate that training the networks to have interpretable gradients improves their robustness to adversarial perturbations.
no code implementations • ICLR 2020 • Isaac Ahern, Adam Noack, Luis Guzman-Nateras, Dejing Dou, Boyang Li, Jun Huan
The problem of explaining deep learning models, and model predictions generally, has attracted intensive interest recently.
1 code implementation • 31 May 2019 • Yuan Gong, Boyang Li, Christian Poellabauer, Yiyu Shi
In recent years, many efforts have demonstrated that modern machine learning algorithms are vulnerable to adversarial attacks, where small, but carefully crafted, perturbations on the input can make them fail.
no code implementations • 15 Apr 2019 • Boyang Li, Changhao Chenli, Xiaowei Xu, Yiyu Shi, Taeho Jung
In this paper, we propose DLBC to exploit the computation power of miners for deep learning training as proof of useful work instead of calculating hash values.
1 code implementation • 21 Dec 2018 • Guoyun Tu, Yanwei Fu, Boyang Li, Jiarui Gao, Yu-Gang Jiang, xiangyang xue
However, the sparsity of emotional expressions in the videos poses an obstacle to visual emotion analysis.
no code implementations • 6 Apr 2018 • Hannah Kim, Denys Katerenchuk, Daniel Billet, Jun Huan, Haesun Park, Boyang Li
Understanding narrative content has become an increasingly popular topic.
1 code implementation • 28 Feb 2018 • Huijuan Xu, Boyang Li, Vasili Ramanishka, Leonid Sigal, Kate Saenko
In order to explicitly model temporal relationships between visual events and their captions in a single video, we also propose a two-level hierarchical captioning module that keeps track of context.
1 code implementation • CVPR 2018 • Pelin Dogan, Boyang Li, Leonid Sigal, Markus Gross
The alignment of heterogeneous sequential data (video to text) is an important and challenging problem.
no code implementations • LREC 2018 • Boyang Li, Beth Cardier, Tong Wang, Florian Metze
Stories are a vital form of communication in human culture; they are employed daily to persuade, to elicit sympathy, or to convey a message.
Cultural Vocal Bursts Intensity Prediction
Vocal Bursts Intensity Prediction
no code implementations • 8 Jul 2017 • Tong Wang, Ping Chen, Boyang Li
An important and difficult challenge in building computational models for narratives is the automatic evaluation of narrative quality.
no code implementations • 16 Nov 2015 • Baohan Xu, Yanwei Fu, Yu-Gang Jiang, Boyang Li, Leonid Sigal
Emotion is a key element in user-generated videos.