1 code implementation • 30 Sep 2024 • Haoyu Zhang, Wenbin Wang, Tianshu Yu
The field of Multimodal Sentiment Analysis (MSA) has recently witnessed an emerging direction seeking to tackle the issue of data incompleteness.
1 code implementation • 28 Aug 2024 • Wenbin Wang, Liang Ding, Minyan Zeng, Xiabin Zhou, Li Shen, Yong Luo, DaCheng Tao
Building upon this insight, we propose Divide, Conquer and Combine (DC$^2$), a novel training-free framework for enhancing MLLM perception of HR images.
no code implementations • 27 Jul 2024 • Yansheng Li, Tingzhu Wang, Kang Wu, LinLin Wang, Xin Guo, Wenbin Wang
Therefore, we propose a novel Sample-Level Bias Prediction (SBP) method for fine-grained SGG (SBG).
3 code implementations • 13 Jun 2024 • Yansheng Li, LinLin Wang, Tingzhu Wang, Xue Yang, Junwei Luo, Qi Wang, Youming Deng, Wenbin Wang, Xian Sun, Haifeng Li, Bo Dang, Yongjun Zhang, Yi Yu, Junchi Yan
This paper constructs a large-scale dataset for SGG in large-size VHR SAI with image sizes ranging from 512 x 768 to 27, 860 x 31, 096 pixels, named STAR (Scene graph generaTion in lArge-size satellite imageRy), encompassing over 210K objects and over 400K triplets.
1 code implementation • 28 Apr 2024 • Wenbin Wang, Yang song, Sanjay Jha
To prevent catastrophic forgetting and reduce storage implications for few-shot speaker adaptation, we designed two adapters and a unique adaptation procedure.
1 code implementation • 8 Feb 2024 • Wenjie Xu, Wenbin Wang, Yuning Jiang, Bratislav Svetozarevic, Colin N. Jones
We study the problem of preferential Bayesian optimization (BO), where we aim to optimize a black-box function with only preference feedback over a pair of candidate solutions.
no code implementations • 12 Jan 2024 • Wenbin Wang, Liang Ding, Li Shen, Yong Luo, Han Hu, DaCheng Tao
Sentiment analysis is rapidly advancing by utilizing various data modalities (e. g., text, image).
no code implementations • 9 Nov 2023 • Kui Jiang, Xuemei Jia, Wenxin Huang, Wenbin Wang, Zheng Wang, Junjun Jiang
Thus, we propose to refine background textures with the predicted degradation prior in an association learning manner.
no code implementations • 24 Aug 2023 • Wenbin Wang, Yang song, Sanjay Jha
However, most current approaches suffer from the degradation of naturalness and speaker similarity when synthesizing speech for unseen speakers (i. e., speakers not in the training dataset) due to the poor generalizability of the model in out-of-distribution data.
no code implementations • 11 Jan 2023 • Haoyang Zhang, Danping He, Xiping Wang, Wenbin Wang, Yunhao Cheng, Ke Guan
As an emerging approach, deep learning plays an increasingly influential role in channel modeling.
1 code implementation • CVPR 2023 • Yuanyuan Liu, Wenbin Wang, Yibing Zhan, Shaoze Feng, Kejun Liu, Zhe Chen
Self-supervised facial representation has recently attracted increasing attention due to its ability to perform face understanding without relying on large-scale annotated datasets heavily.
no code implementations • 19 Sep 2022 • Wenbin Wang, Yang song, Sanjay Jha
We propose an end-to-end lecture video generation system that can generate realistic and complete lecture videos directly from annotated slides, instructor's reference voice and instructor's reference portrait video.
no code implementations • 1 Aug 2022 • Yuanyuan Liu, Wei Dai, Chuanxu Feng, Wenbin Wang, Guanghao Yin, Jiabei Zeng, Shiguang Shan
To the best of our knowledge, MAFW is the first in-the-wild multi-modal database annotated with compound emotion annotations and emotion-related captions.
Ranked #12 on Dynamic Facial Expression Recognition on MAFW
Dynamic Facial Expression Recognition Facial Expression Recognition +1
1 code implementation • Information Sciences 2022 • Yuanyuan Liu, Chuanxu Feng, Xiaohui Yuan, Lin Zhou, Wenbin Wang, Jie Qin, and Zhongwen Luo
In this paper, we divide a video into several short clips for processing and propose a clip-aware emotion-rich feature learning network (CEFLNet) for robust video-based FER.
Ranked #20 on Dynamic Facial Expression Recognition on DFEW
Dynamic Facial Expression Recognition Facial Expression Recognition +1
no code implementations • 17 Sep 2021 • Yuanyuan Liu, Wenbin Wang, Chuanxu Feng, Haoyu Zhang, Zhe Chen, Yibing Zhan
To this end, we propose to decompose each video into a series of expression snippets, each of which contains a small number of facial movements, and attempt to augment the Transformer's ability for modeling intra-snippet and inter-snippet visual relations, respectively, obtaining the Expression snippet Transformer (EST).
Ranked #18 on Dynamic Facial Expression Recognition on DFEW
Dynamic Facial Expression Recognition Facial Expression Recognition +1
no code implementations • ICCV 2021 • Wenbin Wang, Ruiping Wang, Xilin Chen
To this end, we let the scene graph borrow the ability from the image caption so that it can be a specialist on the basis of remaining all-around, resulting in the so-called Topic Scene Graph.
no code implementations • 13 Nov 2020 • Yu Yun, Xin Li, Arashdeep Singh Thind, Yuewei Yin, Hao liu, Qiang Li, Wenbin Wang, Alpha T. N Diaye, Corbyn Mellinger, Xuanyuan Jiang, Rohan Mishra, Xiaoshan Xu
The coupling between ferroelectric and magnetic orders in multiferroic materials and the nature of magnetoelectric (ME) effects are enduring experimental challenges.
Materials Science Other Condensed Matter
1 code implementation • ECCV 2020 • Wenbin Wang, Ruiping Wang, Shiguang Shan, Xilin Chen
Scene graph aims to faithfully reveal humans' perception of image content.
no code implementations • CVPR 2019 • Wenbin Wang, Ruiping Wang, Shiguang Shan, Xilin Chen
Therefore, inspired by the successful application of context to object-oriented tasks, we especially construct context for relationships where all of them are gathered so that the recognition could benefit from their association.