no code implementations • 23 Sep 2024 • Rui Cao, Chuanxin Song, Biqi Yang, Jiangliu Wang, Pheng-Ann Heng, Yun-hui Liu
Unseen Object Instance Segmentation (UOIS) is crucial for autonomous robots operating in unstructured environments.
no code implementations • 10 Sep 2024 • Peng Wang, Xin Wen, Ruochen Cao, Chengxin Gao, Yanrong Hao, Rui Cao
We then employ a specialized weighted edge aggregation (WEA) module, which uses cross convolution with a channel-wise, element-wise convolutional kernel, to integrate dynamic functional connectivity and isolate task-relevant connections.
no code implementations • 31 Aug 2024 • Tianrui Wang, Jin Li, Ziyang Ma, Rui Cao, Xie Chen, Longbiao Wang, Meng Ge, Xiaobao Wang, Yuguang Wang, Jianwu Dang, Nyima Tashi
In this way, we can progressively extract pitch variation, speaker, and content representations from the input speech.
1 code implementation • 9 Aug 2024 • Rui Cao, Qiao Wang
This research examines the use of Large Language Models (LLMs) in predicting time series, with a specific focus on the LLMTIME model.
1 code implementation • 11 Jul 2024 • Rui Cao, Jiangliu Wang, Yun-hui Liu
Inspired by the recent success of Mamba, a state space model with linear scalability in sequence length, this paper presents SR-Mamba, a novel attention-free model specifically tailored to meet the challenges of surgical phase recognition.
1 code implementation • 29 Jun 2024 • Rui Cao, Shijie Xue, Jindong Li, Qi Wang, Yi Chang
We introduce normalizing flows to unsupervised graph-level anomaly detection due to their successful application and superior quality in learning the underlying distribution of samples.
1 code implementation • 19 Feb 2024 • Rui Cao, Roy Ka-Wei Lee, Jing Jiang
We then use the few available annotated samples to train a module composer, which assigns weights to the LoRA modules based on their relevance.
1 code implementation • 4 Feb 2024 • Rui Cao, Jing Jiang
Previous solutions to knowledge-based visual question answering (K-VQA) retrieve knowledge from external knowledge bases and use supervised learning to train the K-VQA model.
no code implementations • 30 Jan 2024 • Ming Shan Hee, Shivam Sharma, Rui Cao, Palash Nandi, Tanmoy Chakraborty, Roy Ka-Wei Lee
In the evolving landscape of online communication, moderating hate speech (HS) presents an intricate challenge, compounded by the multimodal nature of digital content.
1 code implementation • 18 Dec 2023 • Rui Cao, Tianrui Wang, Meng Ge, Longbiao Wang, Jianwu Dang
By bridging the speech enhancement and the Information Bottleneck principle in this letter, we rethink a universal plug-and-play strategy and propose a Refining Underlying Information framework called RUI to rise to the challenges both in theory and practice.
1 code implementation • 11 Dec 2023 • Ming Shan Hee, Aditi Kumaresan, Nguyen Khoi Hoang, Nirmalendu Prakash, Rui Cao, Roy Ka-Wei Lee
The rise of social media platforms has brought about a new digital culture called memes.
2 code implementations • 16 Aug 2023 • Rui Cao, Ming Shan Hee, Adriel Kuek, Wen-Haw Chong, Roy Ka-Wei Lee, Jing Jiang
Specifically, we prompt a frozen PVLM by asking hateful content-related questions and use the answers as image captions (which we call Pro-Cap), so that the captions contain information critical for hateful content detection.
Ranked #10 on Meme Classification on Hateful Memes
1 code implementation • 27 May 2023 • Rui Cao, Jing Jiang
We propose a modularized zero-shot network that explicitly decomposes questions into sub reasoning steps and is highly interpretable.
no code implementations • 8 Feb 2023 • Rui Cao, Roy Ka-Wei Lee, Wen-Haw Chong, Jing Jiang
Specifically, we construct simple prompts and provide a few in-context examples to exploit the implicit knowledge in the pre-trained RoBERTa language model for hateful meme classification.
Ranked #3 on Hateful Meme Classification on HarMeme
no code implementations • 18 Aug 2022 • Qiaohua Zhou, Rui Cao
The results show that the proposed approach can significantly outperform the baseline method that mixes built-up and non-built-up regions, with accuracy increases of 25% and 30% for level-1 and level-2 classification, respectively.
no code implementations • 14 Apr 2022 • Kai Chen, Rui Cao, Stephen James, Yichuan Li, Yun-hui Liu, Pieter Abbeel, Qi Dou
To continuously improve the quality of pseudo labels, we iterate the above steps by taking the trained student model as a new teacher and re-label real data using the refined teacher model.
no code implementations • 5 Mar 2022 • Yidan Feng, Biqi Yang, Xianzhi Li, Chi-Wing Fu, Rui Cao, Kai Chen, Qi Dou, Mingqiang Wei, Yun-hui Liu, Pheng-Ann Heng
Industrial bin picking is a challenging task that requires accurate and robust segmentation of individual object instances.
1 code implementation • Findings (ACL) 2022 • Rui Cao, Yihao Wang, Yuxin Liang, Ling Gao, Jie Zheng, Jie Ren, Zheng Wang
We define a maximum traceable distance metric, through which we learn to what extent the text contrastive learning benefits from the historical information of negative samples.
no code implementations • 20 Oct 2021 • Yihao Wang, Ling Gao, Jie Ren, Rui Cao, Hai Wang, Jie Zheng, Quanli Gao
In detail, we train a DNN model (termed the pre-model) to predict which object detection model to use for the incoming task and which edge server to offload it to, based on physical characteristics of the image task (e.g., brightness, saturation).
no code implementations • 15 Aug 2021 • Shuhui Gong, Xiaopeng Mo, Rui Cao, Yu Liu, Wei Tu, Ruibin Bai
Parking demand forecasting and behaviour analysis have received increasing attention in recent years because of their critical role in mitigating traffic congestion and understanding travel behaviours.
no code implementations • 9 Aug 2021 • Rui Cao, Ziqing Fan, Roy Ka-Wei Lee, Wen-Haw Chong, Jing Jiang
Our experimental results show that DisMultiHate is able to outperform state-of-the-art unimodal and multimodal baselines in the hateful meme classification task.
Ranked #4 on Hateful Meme Classification on HarMeme
no code implementations • 3 May 2021 • Seyed Amir Hossein Aqajari, Rui Cao, Amir Hosein Afandizadeh Zargari, Amir M. Rahmani
In this paper, we present an end-to-end and accurate pipeline for RR estimation using Cycle Generative Adversarial Networks (CycleGAN) to reconstruct respiratory signals from raw PPG signals.
1 code implementation • 12 Apr 2021 • Yuxin Liang, Rui Cao, Jie Zheng, Jie Ren, Ling Gao
We train the weights on word similarity tasks and show that the processed embeddings are more isotropic.
no code implementations • 14 Mar 2021 • Md Rabiul Awal, Rui Cao, Roy Ka-Wei Lee, Sandra Mitrovic
Automated hate speech detection in social media is a challenging task that has recently gained significant traction in the data mining and Natural Language Processing community.
no code implementations • 14 Mar 2021 • Rui Cao, Roy Ka-Wei Lee, Tuan-Anh Hoang
Online hate speech is an important issue that breaks the cohesiveness of online social communities and even raises public safety concerns in our societies.
no code implementations • COLING 2020 • Rui Cao, Roy Ka-Wei Lee
We also conduct case studies to empirically examine the HateGAN-generated hate speech and show that the generated tweets are diverse, coherent, and relevant to hate speech detection.
no code implementations • 9 Nov 2020 • Qing Li, Jiasong Zhu, Jun Liu, Rui Cao, Qingquan Li, Sen Jia, Guoping Qiu
Despite the rapid progress in this topic, a comprehensive review is still lacking; such a review is needed to summarize the current progress and provide future directions.
no code implementations • 21 Jul 2020 • Md Rabiul Awal, Rui Cao, Sandra Mitrovic, Roy Ka-Wei Lee
The COVID-19 pandemic has developed to be more than a bio-crisis as global news has reported a sharp rise in xenophobia and discrimination in both online and offline communities.
no code implementations • 27 Jun 2020 • Jun Liu, Qing Li, Rui Cao, Wenming Tang, Guoping Qiu
To the best of our knowledge, this work is the first extremely lightweight neural network trained on monocular video sequences for real-time unsupervised monocular depth estimation, which opens up the possibility of implementing deep learning-based real-time unsupervised monocular depth prediction on low-cost embedded devices.
Ranked #5 on Semantic Segmentation on SpectralWaste
no code implementations • 24 Jun 2020 • Md Rabiul Awal, Rui Cao, Roy Ka-Wei Lee, Sandra Mitrović
In this study, we proposed an analytical framework to study the annotation consistency in online hate and abusive content datasets.
no code implementations • MIDL 2019 • Xin Li, Rui Cao, Dongxiao Zhu
Medical imaging contains the essential information for rendering diagnostic and treatment decisions.
no code implementations • 15 Feb 2019 • Rui Cao, Qian Zhang, Jiasong Zhu, Qing Li, Qingquan Li, Bozhi Liu, Guoping Qiu
With the rapid growth of remotely sensed imagery data, there is a high demand for effective and efficient image retrieval tools to manage and exploit such data.
no code implementations • 22 Jan 2019 • Rui Cao, Pavel Naumov
A coalition is blameable for an outcome if the coalition had a strategy to prevent it.
no code implementations • 4 Jan 2019 • Qing Li, Jiasong Zhu, Rui Cao, Ke Sun, Jonathan M. Garibaldi, Qingquan Li, Bozhi Liu, Guoping Qiu
6DOF camera relocalization is an important component of autonomous driving and navigation.