Search Results for author: Guoshun Nan

Found 22 papers, 10 papers with code

Exploring What Why and How: A Multifaceted Benchmark for Causation Understanding of Video Anomaly

1 code implementation10 Dec 2024 Hang Du, Guoshun Nan, Jiawen Qian, Wangchenhui Wu, Wendi Deng, Hanqing Mu, Zhenyan Chen, Pengxuan Mao, Xiaofeng Tao, Jun Liu

Specifically, each instance of our ECVA involves three sets of human annotations to indicate "what", "why" and "how" of an anomaly, including 1) anomaly type, start and end times, and event descriptions, 2) natural language explanations for the cause of an anomaly, and 3) free text reflecting the effect of the abnormality.

ContextBLIP: Doubly Contextual Alignment for Contrastive Image Retrieval from Linguistically Complex Descriptions

no code implementations29 May 2024 Honglin Lin, Siyu Li, Guoshun Nan, Chaoyue Tang, Xueting Wang, Jingxin Xu, Rong Yankai, Zhili Zhou, Yutong Gao, Qimei Cui, Xiaofeng Tao

The main challenges lie in aligning key contextual cues in two modalities, where these subtle cues are concealed in tiny areas of multiple contrastive images and within the complex linguistics of textual descriptions.

Image Retrieval Retrieval

Can We Improve Channel Reciprocity via Loop-back Compensation for RIS-assisted Physical Layer Key Generation

no code implementations31 Jan 2024 Ningya Xu, Guoshun Nan, Xiaofeng Tao, Na Li, Pengxuan Mao, Tianyuan Yang

The results demonstrate a significant improvement in both the rate of key generation assisted by the RIS and the consistency of the generated keys, showing great potential for the practical deployment of our LoCKey in future wireless systems.

Uncovering What Why and How: A Comprehensive Benchmark for Causation Understanding of Video Anomaly

1 code implementation CVPR 2024 Hang Du, Sicheng Zhang, Binzhu Xie, Guoshun Nan, Jiayang Zhang, Junrui Xu, Hangyu Liu, Sicong Leng, Jiangming Liu, Hehe Fan, Dajiu Huang, Jing Feng, Linli Chen, Can Zhang, Xuhuan Li, Hao Zhang, Jianhang Chen, Qimei Cui, Xiaofeng Tao

In addition we also introduce MMEval a novel evaluation metric designed to better align with human preferences for CUVA facilitating the measurement of existing LLMs in comprehending the underlying cause and corresponding effect of video anomalies.

Anomaly Detection

Refining Latent Homophilic Structures over Heterophilic Graphs for Robust Graph Convolution Networks

no code implementations27 Dec 2023 Chenyang Qiu, Guoshun Nan, Tianyu Xiong, Wendi Deng, Di Wang, Zhiyang Teng, Lijuan Sun, Qimei Cui, Xiaofeng Tao

This finding motivates us to present a novel method that aims to harden GCNs by automatically learning Latent Homophilic Structures over heterophilic graphs.

Contrastive Learning Node Classification

DocMSU: A Comprehensive Benchmark for Document-level Multimodal Sarcasm Understanding

1 code implementation26 Dec 2023 Hang Du, Guoshun Nan, Sicheng Zhang, Binzhu Xie, Junrui Xu, Hehe Fan, Qimei Cui, Xiaofeng Tao, Xudong Jiang

Multimodal Sarcasm Understanding (MSU) has a wide range of applications in the news field such as public opinion analysis and forgery detection.

Object Detection Sarcasm Detection +1

Passive Eavesdropping Can Significantly Slow Down RIS-Assisted Secret Key Generation

no code implementations6 Sep 2023 Ningya Xu, Guoshun Nan, Xiaofeng Tao

Reconfigurable Intelligent Surface (RIS) assisted physical layer key generation has shown great potential to secure wireless communications by smartly controlling signals such as phase and amplitude.

Disentangled Dynamic Intrusion Detection

1 code implementation2 Jul 2023 Chenyang Qiu, Guoshun Nan, Hongrui Xia, Zheng Weng, Xueting Wang, Meng Shen, Xiaofeng Tao, Jun Liu

Specifically, the proposed DIDS first disentangles traffic features by a non-parameterized optimization, automatically differentiating tens and hundreds of complex features of various attacks.

Disentanglement Few-Shot Learning +1

Physical-layer Adversarial Robustness for Deep Learning-based Semantic Communications

no code implementations12 May 2023 Guoshun Nan, Zhichun Li, Jinli Zhai, Qimei Cui, Gong Chen, Xin Du, Xuefei Zhang, Xiaofeng Tao, Zhu Han, Tony Q. S. Quek

We argue that central to the success of ESC is the robust interpretation of conveyed semantics at the receiver side, especially for security-critical applications such as automatic driving and smart healthcare.

Adversarial Robustness Deep Learning +1

Boosting Physical Layer Black-Box Attacks with Semantic Adversaries in Semantic Communications

no code implementations29 Mar 2023 Zeju Li, Xinghan Liu, Guoshun Nan, Jinfei Zhou, Xinchen Lyu, Qimei Cui, Xiaofeng Tao

To this end, we present SemBLK, a novel method that can learn to generate destructive physical layer semantic attacks for an ESC system under the black-box setting, where the adversaries are imperceptible to humans.

Semantic Communication

You Can Ground Earlier than See: An Effective and Efficient Pipeline for Temporal Sentence Grounding in Compressed Videos

no code implementations CVPR 2023 Xiang Fang, Daizong Liu, Pan Zhou, Guoshun Nan

To handle the raw video bit-stream input, we propose a novel Three-branch Compressed-domain Spatial-temporal Fusion (TCSF) framework, which extracts and aggregates three kinds of low-level visual features (I-frame, motion vector and residual features) for effective and efficient grounding.

Sentence Temporal Sentence Grounding

Fusion with Hierarchical Graphs for Mulitmodal Emotion Recognition

no code implementations15 Sep 2021 Shuyun Tang, Zhaojie Luo, Guoshun Nan, Yuichiro Yoshikawa, Ishiguro Hiroshi

Automatic emotion recognition (AER) based on enriched multimodal inputs, including text, speech, and visual clues, is crucial in the development of emotionally intelligent machines.

Emotion Classification Emotion Recognition +1

Speaker-Oriented Latent Structures for Dialogue-Based Relation Extraction

1 code implementation11 Sep 2021 Guoshun Nan, Guoqing Luo, Sicong Leng, Yao Xiao, Wei Lu

Dialogue-based relation extraction (DiaRE) aims to detect the structural information from unstructured utterances in dialogues.

Dialog Relation Extraction Relation

Interventional Video Grounding with Dual Contrastive Learning

1 code implementation CVPR 2021 Guoshun Nan, Rui Qiao, Yao Xiao, Jun Liu, Sicong Leng, Hao Zhang, Wei Lu

2) Meanwhile, we introduce a dual contrastive learning approach (DCL) to better align the text and video by maximizing the mutual information (MI) between query and video clips, and the MI between start/end frames of a target moment and the others within a video to learn more informative visual representations.

Causal Inference Contrastive Learning +2

Video Corpus Moment Retrieval with Contrastive Learning

1 code implementation13 May 2021 Hao Zhang, Aixin Sun, Wei Jing, Guoshun Nan, Liangli Zhen, Joey Tianyi Zhou, Rick Siow Mong Goh

We adopt the first approach and introduce two contrastive learning objectives to refine video encoder and text encoder to learn video and text representations separately but with better alignment for VCMR.

Contrastive Learning Moment Retrieval +2

Integrating Subgraph-aware Relation and DirectionReasoning for Question Answering

no code implementations1 Apr 2021 Xu Wang, Shuai Zhao, Bo Cheng, Jiale Han, Yingting Li, Hao Yang, Ivan Sekulic, Guoshun Nan

Question Answering (QA) models over Knowledge Bases (KBs) are capable of providing more precise answers by utilizing relation information among entities.

Question Answering Relation

Counterfactual Thinking for Long-tailed Information Extraction

no code implementations1 Jan 2021 Guoshun Nan, Jiaqi Zeng, Rui Qiao, Wei Lu

However, in practice, the long-tailed and imbalanced data may lead to severe bias issues for deep learning models, due to very few training instances available for the tail classes.

Causal Inference counterfactual +9

Reasoning with Latent Structure Refinement for Document-Level Relation Extraction

2 code implementations ACL 2020 Guoshun Nan, Zhijiang Guo, Ivan Sekulić, Wei Lu

Document-level relation extraction requires integrating information within and across multiple sentences of a document and capturing complex interactions between inter-sentence entities.

Document-level Relation Extraction Relation +2

Cannot find the paper you are looking for? You can Submit a new open access paper.