Search Results for author: Yangyang Guo

Found 28 papers, 20 papers with code

Diffusion Facial Forgery Detection

1 code implementation29 Jan 2024 Harry Cheng, Yangyang Guo, Tianyi Wang, Liqiang Nie, Mohan Kankanhalli

In particular, this dataset leverages 30, 000 carefully collected textual and visual prompts, ensuring the synthesis of images with both high fidelity and semantic consistency.

Image Generation

UNK-VQA: A Dataset and a Probe into the Abstention Ability of Multi-modal Large Models

1 code implementation17 Oct 2023 Yangyang Guo, Fangkai Jiao, Zhiqi Shen, Liqiang Nie, Mohan Kankanhalli

Teaching Visual Question Answering (VQA) models to refrain from answering unanswerable questions is necessary for building a trustworthy AI system.

Attribute Question Answering +1

PELA: Learning Parameter-Efficient Models with Low-Rank Approximation

1 code implementation16 Oct 2023 Yangyang Guo, Guangzhi Wang, Mohan Kankanhalli

This allows for direct and efficient utilization of the low-rank model for downstream fine-tuning tasks.

Sample Less, Learn More: Efficient Action Recognition via Frame Feature Restoration

1 code implementation27 Jul 2023 Harry Cheng, Yangyang Guo, Liqiang Nie, Zhiyong Cheng, Mohan Kankanhalli

Training an effective video action recognition model poses significant computational challenges, particularly under limited resource budgets.

Action Recognition Temporal Action Localization

Towards Generalizable Deepfake Detection by Primary Region Regularization

no code implementations24 Jul 2023 Harry Cheng, Yangyang Guo, Tianyi Wang, Liqiang Nie, Mohan Kankanhalli

The existing deepfake detection methods have reached a bottleneck in generalizing to unseen forgeries and manipulation approaches.

DeepFake Detection Face Swapping

Mining Conditional Part Semantics with Occluded Extrapolation for Human-Object Interaction Detection

no code implementations19 Jul 2023 Guangzhi Wang, Yangyang Guo, Mohan Kankanhalli

Human-Object Interaction Detection is a crucial aspect of human-centric scene understanding, with important applications in various domains.

Human-Object Interaction Detection Object +1

Learning to Agree on Vision Attention for Visual Commonsense Reasoning

no code implementations4 Feb 2023 Zhenyang Li, Yangyang Guo, Kejie Wang, Fan Liu, Liqiang Nie, Mohan Kankanhalli

Visual Commonsense Reasoning (VCR) remains a significant yet challenging research problem in the realm of visual reasoning.

Visual Commonsense Reasoning

Distance Matters in Human-Object Interaction Detection

1 code implementation5 Jul 2022 Guangzhi Wang, Yangyang Guo, Yongkang Wong, Mohan Kankanhalli

2) Insufficient number of distant interactions in benchmark datasets results in under-fitting on these instances.

Human-Object Interaction Detection Object +1

A Unified End-to-End Retriever-Reader Framework for Knowledge-based VQA

1 code implementation30 Jun 2022 Yangyang Guo, Liqiang Nie, Yongkang Wong, Yibing Liu, Zhiyong Cheng, Mohan Kankanhalli

On the other hand, pertaining to the implicit knowledge, the multi-modal implicit knowledge for knowledge-based VQA still remains largely unexplored.

Question Answering Retrieval +1

Voice-Face Homogeneity Tells Deepfake

no code implementations4 Mar 2022 Harry Cheng, Yangyang Guo, Tianyi Wang, Qi Li, Xiaojun Chang, Liqiang Nie

To this end, a voice-face matching method is devised to measure the matching degree of these two.

Joint Answering and Explanation for Visual Commonsense Reasoning

1 code implementation25 Feb 2022 Zhenyang Li, Yangyang Guo, Kejie Wang, Yinwei Wei, Liqiang Nie, Mohan Kankanhalli

Given that our framework is model-agnostic, we apply it to the existing popular baselines and validate its effectiveness on the benchmark dataset.

Knowledge Distillation Question Answering +2

On Modality Bias Recognition and Reduction

1 code implementation25 Feb 2022 Yangyang Guo, Liqiang Nie, Harry Cheng, Zhiyong Cheng, Mohan Kankanhalli, Alberto del Bimbo

From the results on four datasets regarding the above three tasks, our method yields remarkable performance improvements compared with the baselines, demonstrating its superiority on reducing the modality bias problem.

Action Recognition Multi-modal Classification +3

When Product Search Meets Collaborative Filtering: A Hierarchical Heterogeneous Graph Neural Network Approach

no code implementations17 Aug 2021 Xiangkun Yin, Yangyang Guo, Liqiang Nie, Zhiyong Cheng

In addition, we empirically prove that collaborative filtering and semantic matching are complementary to each other in product search performance enhancement.

Collaborative Filtering Representation Learning +1

Review Polarity-wise Recommender

1 code implementation8 Jun 2021 Han Liu, Yangyang Guo, Jianhua Yin, Zan Gao, Liqiang Nie

To be specific, in this model, positive and negative reviews are separately gathered and utilized to model the user-preferred and user-rejected aspects, respectively.

Recommendation Systems

AdaVQA: Overcoming Language Priors with Adapted Margin Cosine Loss

1 code implementation5 May 2021 Yangyang Guo, Liqiang Nie, Zhiyong Cheng, Feng Ji, Ji Zhang, Alberto del Bimbo

Experimental results demonstrate that our adapted margin cosine loss can greatly enhance the baseline models with an absolute performance gain of 15\% on average, strongly verifying the potential of tackling the language prior problem in VQA from the angle of the answer feature space learning.

Question Answering Visual Question Answering

OneStop QAMaker: Extract Question-Answer Pairs from Text in a One-Stop Approach

no code implementations24 Feb 2021 Shaobo Cui, Xintong Bao, Xinxing Zu, Yangyang Guo, Zhongzhou Zhao, Ji Zhang, Haiqing Chen

This pipeline approach, however, is undesired in mining the most appropriate QA pairs from documents since it ignores the connection between question generation and answer extraction, which may lead to incompatible QA pair generation, i. e., the selected answer span is inappropriate for question generation.

Machine Reading Comprehension Question Answering +2

Feature-level Attentive ICF for Recommendation

1 code implementation22 Feb 2021 Zhiyong Cheng, Fan Liu, Shenghan Mei, Yangyang Guo, Lei Zhu, Liqiang Nie

To demonstrate the effectiveness of our method, we design a light attention neural network to integrate both item-level and feature-level attention for neural ICF models.

Collaborative Filtering Recommendation Systems

Answer Questions with Right Image Regions: A Visual Attention Regularization Approach

1 code implementation3 Feb 2021 Yibing Liu, Yangyang Guo, Jianhua Yin, Xuemeng Song, Weifeng Liu, Liqiang Nie

However, recent studies have pointed out that the highlighted image regions from the visual attention are often irrelevant to the given question and answer, leading to model confusion for correct visual reasoning.

Question Answering Visual Grounding +2

Loss re-scaling VQA: Revisiting the LanguagePrior Problem from a Class-imbalance View

1 code implementation30 Oct 2020 Yangyang Guo, Liqiang Nie, Zhiyong Cheng, Qi Tian, Min Zhang

Concretely, we design a novel interpretation scheme whereby the loss of mis-predicted frequent and sparse answers of the same question type is distinctly exhibited during the late training phase.

Face Recognition Image Classification +2

Enhancing Factorization Machines with Generalized Metric Learning

1 code implementation20 Jun 2020 Yangyang Guo, Zhiyong Cheng, Jiazheng Jing, Yanpeng Lin, Liqiang Nie, Meng Wang

Traditional FMs adopt the inner product to model the second-order interactions between different attributes, which are represented via feature vectors.

Attribute Metric Learning +1

Quantifying and Alleviating the Language Prior Problem in Visual Question Answering

1 code implementation13 May 2019 Yangyang Guo, Zhiyong Cheng, Liqiang Nie, Yibing Liu, Yinglong Wang, Mohan Kankanhalli

Benefiting from the advancement of computer vision, natural language processing and information retrieval techniques, visual question answering (VQA), which aims to answer questions about an image or a video, has received lots of attentions over the past few years.

Information Retrieval Question Answering +2

Cannot find the paper you are looking for? You can Submit a new open access paper.