Search Results for author: Rui Cao

Found 36 papers, 11 papers with code

Adapting Segment Anything Model for Unseen Object Instance Segmentation

no code implementations23 Sep 2024 Rui Cao, Chuanxin Song, Biqi Yang, Jiangliu Wang, Pheng-Ann Heng, Yun-hui Liu

Unseen Object Instance Segmentation (UOIS) is crucial for autonomous robots operating in unstructured environments.

Decoder Segmentation +2

MCDGLN: Masked Connection-based Dynamic Graph Learning Network for Autism Spectrum Disorder

no code implementations10 Sep 2024 Peng Wang, Xin Wen, Ruochen Cao, Chengxin Gao, Yanrong Hao, Rui Cao

We then employ a specialized weighted edge aggregation (WEA) module, which uses the cross convolution with channel-wise element-wise convolutional kernel, to integrate dynamic functional connectivity and to isolating task-relevant connections.

Functional Connectivity Graph Learning

An Evaluation of Standard Statistical Models and LLMs on Time Series Forecasting

1 code implementation9 Aug 2024 Rui Cao, Qiao Wang

This research examines the use of Large Language Models (LLMs) in predicting time series, with a specific focus on the LLMTIME model.

Sentiment Analysis Text Generation +3

SR-Mamba: Effective Surgical Phase Recognition with State Space Model

1 code implementation11 Jul 2024 Rui Cao, Jiangliu Wang, Yun-hui Liu

Inspired by the recent success of Mamba, a state space model with linear scalability in sequence length, this paper presents SR-Mamba, a novel attention-free model specifically tailored to meet the challenges of surgical phase recognition.

Decoder Mamba +1

FANFOLD: Graph Normalizing Flows-driven Asymmetric Network for Unsupervised Graph-Level Anomaly Detection

1 code implementation29 Jun 2024 Rui Cao, Shijie Xue, Jindong Li, Qi Wang, Yi Chang

We introduce normalizing flows to unsupervised graph-level anomaly detection due to their successful application and superior quality in learning the underlying distribution of samples.

Knowledge Distillation Unsupervised Anomaly Detection

Modularized Networks for Few-shot Hateful Meme Detection

1 code implementation19 Feb 2024 Rui Cao, Roy Ka-Wei Lee, Jing Jiang

We then use the few available annotated samples to train a module composer, which assigns weights to the LoRA modules based on their relevance.

Few-Shot Learning In-Context Learning

Knowledge Generation for Zero-shot Knowledge-based VQA

1 code implementation4 Feb 2024 Rui Cao, Jing Jiang

Previous solutions to knowledge-based visual question answering~(K-VQA) retrieve knowledge from external knowledge bases and use supervised learning to train the K-VQA model.

Question Answering Visual Question Answering

Recent Advances in Hate Speech Moderation: Multimodality and the Role of Large Models

no code implementations30 Jan 2024 Ming Shan Hee, Shivam Sharma, Rui Cao, Palash Nandi, Tanmoy Chakraborty, Roy Ka-Wei Lee

In the evolving landscape of online communication, moderating hate speech (HS) presents an intricate challenge, compounded by the multimodal nature of digital content.

Survey

A Refining Underlying Information Framework for Monaural Speech Enhancement

1 code implementation18 Dec 2023 Rui Cao, Tianrui Wang, Meng Ge, Longbiao Wang, Jianwu Dang

By bridging the speech enhancement and the Information Bottleneck principle in this letter, we rethink a universal plug-and-play strategy and propose a Refining Underlying Information framework called RUI to rise to the challenges both in theory and practice.

Speech Enhancement

MATK: The Meme Analytical Tool Kit

1 code implementation11 Dec 2023 Ming Shan Hee, Aditi Kumaresan, Nguyen Khoi Hoang, Nirmalendu Prakash, Rui Cao, Roy Ka-Wei Lee

The rise of social media platforms has brought about a new digital culture called memes.

Meme Classification

Pro-Cap: Leveraging a Frozen Vision-Language Model for Hateful Meme Detection

2 code implementations16 Aug 2023 Rui Cao, Ming Shan Hee, Adriel Kuek, Wen-Haw Chong, Roy Ka-Wei Lee, Jing Jiang

Specifically, we prompt a frozen PVLM by asking hateful content-related questions and use the answers as image captions (which we call Pro-Cap), so that the captions contain information critical for hateful content detection.

Image Captioning Language Modelling +3

Modularized Zero-shot VQA with Pre-trained Models

1 code implementation27 May 2023 Rui Cao, Jing Jiang

We propose a modularized zero-shot network that explicitly decomposes questions into sub reasoning steps and is highly interpretable.

object-detection Object Detection +3

Prompting for Multimodal Hateful Meme Classification

no code implementations8 Feb 2023 Rui Cao, Roy Ka-Wei Lee, Wen-Haw Chong, Jing Jiang

Specifically, we construct simple prompts and provide a few in-context examples to exploit the implicit knowledge in the pre-trained RoBERTa language model for hateful meme classification.

Classification Hateful Meme Classification +1

A Coarse-to-Fine Approach for Urban Land Use Mapping Based on Multisource Geospatial Data

no code implementations18 Aug 2022 Qiaohua Zhou, Rui Cao

The results show that the proposed approach can significantly outperform baseline method that mixes built-up and non-built-up regions, with accuracy increase of 25% and 30% for level-1 and level-2 classification, respectively.

Management

Sim-to-Real 6D Object Pose Estimation via Iterative Self-training for Robotic Bin Picking

no code implementations14 Apr 2022 Kai Chen, Rui Cao, Stephen James, Yichuan Li, Yun-hui Liu, Pieter Abbeel, Qi Dou

To continuously improve the quality of pseudo labels, we iterate the above steps by taking the trained student model as a new teacher and re-label real data using the refined teacher model.

6D Pose Estimation using RGB Robotic Grasping

Exploring the Impact of Negative Samples of Contrastive Learning: A Case Study of Sentence Embedding

1 code implementation Findings (ACL) 2022 Rui Cao, Yihao Wang, Yuxin Liang, Ling Gao, Jie Zheng, Jie Ren, Zheng Wang

We define a maximum traceable distance metric, through which we learn to what extent the text contrastive learning benefits from the historical information of negative samples.

Contrastive Learning Sentence +4

ESOD:Edge-based Task Scheduling for Object Detection

no code implementations20 Oct 2021 Yihao Wang, Ling Gao, Jie Ren, Rui Cao, Hai Wang, Jie Zheng, Quanli Gao

In detail, we train a DNN model (termed as pre-model) to predict which object detection model to use for the coming task and offloads to which edge servers by physical characteristics of the image task (e. g., brightness, saturation).

Object object-detection +2

Spatio-temporal Parking Behaviour Forecasting and Analysis Before and During COVID-19

no code implementations15 Aug 2021 Shuhui Gong, Xiaopeng Mo, Rui Cao, Yu Liu, Wei Tu, Ruibin Bai

Parking demand forecasting and behaviour analysis have received increasing attention in recent years because of their critical role in mitigating traffic congestion and understanding travel behaviours.

Epidemiology graph construction +1

Disentangling Hate in Online Memes

no code implementations9 Aug 2021 Rui Cao, Ziqing Fan, Roy Ka-Wei Lee, Wen-Haw Chong, Jing Jiang

Our experiment results show that DisMultiHate is able to outperform state-of-the-art unimodal and multimodal baselines in the hateful meme classification task.

Classification Hateful Meme Classification

An End-to-End and Accurate PPG-based Respiratory Rate Estimation Approach Using Cycle Generative Adversarial Networks

no code implementations3 May 2021 Seyed Amir Hossein Aqajari, Rui Cao, Amir Hosein Afandizadeh Zargari, Amir M. Rahmani

In this paper, we present an end-to-end and accurate pipeline for RR estimation using Cycle Generative Adversarial Networks (CycleGAN) to reconstruct respiratory signals from raw PPG signals.

Photoplethysmography (PPG) Respiratory Rate Estimation

Learning to Remove: Towards Isotropic Pre-trained BERT Embedding

1 code implementation12 Apr 2021 Yuxin Liang, Rui Cao, Jie Zheng, Jie Ren, Ling Gao

We train the weights on word similarity tasks and show that processed embedding is more isotropic.

Semantic Textual Similarity Word Similarity

AngryBERT: Joint Learning Target and Emotion for Hate Speech Detection

no code implementations14 Mar 2021 Md Rabiul Awal, Rui Cao, Roy Ka-Wei Lee, Sandra Mitrovic

Automated hate speech detection in social media is a challenging task that has recently gained significant traction in the data mining and Natural Language Processing community.

Hate Speech Detection Sentiment Analysis +1

DeepHate: Hate Speech Detection via Multi-Faceted Text Representations

no code implementations14 Mar 2021 Rui Cao, Roy Ka-Wei Lee, Tuan-Anh Hoang

Online hate speech is an important issue that breaks the cohesiveness of online social communities and even raises public safety concerns in our societies.

Hate Speech Detection Word Embeddings

HateGAN: Adversarial Generative-Based Data Augmentation for Hate Speech Detection

no code implementations COLING 2020 Rui Cao, Roy Ka-Wei Lee

We also conduct case studies to empirically examine the HateGAN generated hate speeches and show that the generated tweets are diverse, coherent, and relevant to hate speech detection.

Data Augmentation Hate Speech Detection

Deep Learning based Monocular Depth Prediction: Datasets, Methods and Applications

no code implementations9 Nov 2020 Qing Li, Jiasong Zhu, Jun Liu, Rui Cao, Qingquan Li, Sen Jia, Guoping Qiu

Despite the rapid progress in this topic, there are lacking of a comprehensive review, which is needed to summarize the current progress and provide the future directions.

Depth Prediction Indoor Localization +2

On Analyzing Antisocial Behaviors Amid COVID-19 Pandemic

no code implementations21 Jul 2020 Md Rabiul Awal, Rui Cao, Sandra Mitrovic, Roy Ka-Wei Lee

The COVID-19 pandemic has developed to be more than a bio-crisis as global news has reported a sharp rise in xenophobia and discrimination in both online and offline communities.

MiniNet: An extremely lightweight convolutional neural network for real-time unsupervised monocular depth estimation

no code implementations27 Jun 2020 Jun Liu, Qing Li, Rui Cao, Wenming Tang, Guoping Qiu

To the best of our knowledge, this work is the first extremely lightweight neural network trained on monocular video sequences for real-time unsupervised monocular depth estimation, which opens up the possibility of implementing deep learning-based real-time unsupervised monocular depth prediction on low-cost embedded devices.

Depth Prediction Monocular Depth Estimation +2

On Analyzing Annotation Consistency in Online Abusive Behavior Datasets

no code implementations24 Jun 2020 Md Rabiul Awal, Rui Cao, Roy Ka-Wei Lee, Sandra Mitrović

In this study, we proposed an analytical framework to study the annotation consistency in online hate and abusive content datasets.

Vispi: Automatic Visual Perception and Interpretation of Chest X-rays

no code implementations MIDL 2019 Xin Li, Rui Cao, Dongxiao Zhu

Medical imaging contains the essential information for rendering diagnostic and treatment decisions.

Image Captioning

Enhancing Remote Sensing Image Retrieval with Triplet Deep Metric Learning Network

no code implementations15 Feb 2019 Rui Cao, Qian Zhang, Jiasong Zhu, Qing Li, Qingquan Li, Bozhi Liu, Guoping Qiu

With the rapid growing of remotely sensed imagery data, there is a high demand for effective and efficient image retrieval tools to manage and exploit such data.

Image Retrieval Metric Learning +2

The Limits of Morality in Strategic Games

no code implementations22 Jan 2019 Rui Cao, Pavel Naumov

A coalition is blameable for an outcome if the coalition had a strategy to prevent it.

Cannot find the paper you are looking for? You can Submit a new open access paper.