Search Results for author: Ruizhi Qiao

Found 18 papers, 8 papers with code

Unified and Dynamic Graph for Temporal Character Grouping in Long Videos

no code implementations • 27 Aug 2023 • Xiujun Shu, Wei Wen, Liangsheng Xu, Mingbao Lin, Ruizhi Qiao, Taian Guo, Hanjun Li, Bei Gan, Xiao Wang, Xing Sun

In this paper, we present a unified and dynamic graph (UniDG) framework for temporal character grouping.

Clustering Graph Clustering

Paper
Add Code

D3G: Exploring Gaussian Prior for Temporal Sentence Grounding with Glance Annotation

1 code implementation • ICCV 2023 • Hanjun Li, Xiujun Shu, Sunan He, Ruizhi Qiao, Wei Wen, Taian Guo, Bei Gan, Xing Sun

Under this setup, we propose a Dynamic Gaussian prior based Grounding framework with Glance annotation (D3G), which consists of a Semantic Alignment Group Contrastive Learning module (SA-GCL) and a Dynamic Gaussian prior Adjustment module (DGA).

Ranked #10 on Temporal Sentence Grounding on Charades-STA

Contrastive Learning Sentence +1

Paper
Code

Coarse-to-Fine: Learning Compact Discriminative Representation for Single-Stage Image Retrieval

1 code implementation • ICCV 2023 • Yunquan Zhu, Xinkai Gao, Bo Ke, Ruizhi Qiao, Xing Sun

Image retrieval targets to find images from a database that are visually similar to the query image.

Image Retrieval Retrieval

Paper
Code

Collaborative Noisy Label Cleaner: Learning Scene-aware Trailers for Multi-modal Highlight Detection in Movies

1 code implementation • CVPR 2023 • Bei Gan, Xiujun Shu, Ruizhi Qiao, Haoqian Wu, Keyu Chen, Hanjun Li, Bo Ren

Based on existing efforts, this work has two observations: (1) For different annotators, labeling highlight has uncertainty, which leads to inaccurate and time-consuming annotations.

Highlight Detection Learning with noisy labels +1

Paper
Code

NewsNet: A Novel Dataset for Hierarchical Temporal Segmentation

no code implementations • CVPR 2023 • Haoqian Wu, Keyu Chen, Haozhe Liu, Mingchen Zhuge, Bing Li, Ruizhi Qiao, Xiujun Shu, Bei Gan, Liangsheng Xu, Bo Ren, Mengmeng Xu, Wentian Zhang, Raghavendra Ramachandra, Chia-Wen Lin, Bernard Ghanem

Temporal video segmentation is the get-to-go automatic video analysis, which decomposes a long-form video into smaller components for the following-up understanding tasks.

Video Segmentation Video Semantic Segmentation

Paper
Add Code

VLMAE: Vision-Language Masked Autoencoder

no code implementations • 19 Aug 2022 • Sunan He, Taian Guo, Tao Dai, Ruizhi Qiao, Chen Wu, Xiujun Shu, Bo Ren

Image and language modeling is of crucial importance for vision-language pre-training (VLP), which aims to learn multi-modal representations from large-scale paired image-text data.

Language Modelling Question Answering +4

Paper
Add Code

See Finer, See More: Implicit Modality Alignment for Text-based Person Retrieval

1 code implementation • 18 Aug 2022 • Xiujun Shu, Wei Wen, Haoqian Wu, Keyu Chen, Yiran Song, Ruizhi Qiao, Bo Ren, Xiao Wang

To explore the fine-grained alignment, we further propose two implicit semantic alignment paradigms: multi-level alignment (MLA) and bidirectional mask modeling (BMM).

Person Retrieval Retrieval +3

Paper
Code

Exploiting Feature Diversity for Make-up Temporal Video Grounding

no code implementations • 12 Aug 2022 • Xiujun Shu, Wei Wen, Taian Guo, Sunan He, Chen Wu, Ruizhi Qiao

This technical report presents the 3rd winning solution for MTVG, a new task introduced in the 4-th Person in Context (PIC) Challenge at ACM MM 2022.

Video Grounding

Paper
Add Code

Open-Vocabulary Multi-Label Classification via Multi-Modal Knowledge Transfer

1 code implementation • 5 Jul 2022 • Sunan He, Taian Guo, Tao Dai, Ruizhi Qiao, Bo Ren, Shu-Tao Xia

Specifically, our method exploits multi-modal knowledge of image-text pairs based on a vision and language pre-training (VLP) model.

Ranked #1 on Multi-label zero-shot learning on Open Images V4

Image-text matching Knowledge Distillation +7

111

Paper
Code

Scene Consistency Representation Learning for Video Scene Segmentation

1 code implementation • CVPR 2022 • Haoqian Wu, Keyu Chen, Yanan Luo, Ruizhi Qiao, Bo Ren, Haozhe Liu, Weicheng Xie, Linlin Shen

Additionally, we suggest a more fair and reasonable benchmark to evaluate the performance of Video Scene Segmentation methods.

Data Augmentation Inductive Bias +3

Paper
Code

HybridCR: Weakly-Supervised 3D Point Cloud Semantic Segmentation via Hybrid Contrastive Regularization

1 code implementation • CVPR 2022 • Mengtian Li, Yuan Xie, Yunhang Shen, Bo Ke, Ruizhi Qiao, Bo Ren, Shaohui Lin, Lizhuang Ma

To address the huge labeling cost in large-scale point cloud semantic segmentation, we propose a novel hybrid contrastive regularization (HybridCR) framework in weakly-supervised setting, which obtains competitive performance compared to its fully-supervised counterpart.

Semantic Segmentation Semantic Similarity +1

Paper
Code

Head and Body: Unified Detector and Graph Network for Person Search in Media

no code implementations • 27 Nov 2021 • Xiujun Shu, Yusheng Tao, Ruizhi Qiao, Bo Ke, Wei Wen, Bo Ren

It is by far the largest dataset for person search in media.

Person Search

Paper
Add Code

Novelty Detection via Contrastive Learning with Negative Data Augmentation

no code implementations • 18 Jun 2021 • Chengwei Chen, Yuan Xie, Shaohui Lin, Ruizhi Qiao, Jian Zhou, Xin Tan, Yi Zhang, Lizhuang Ma

Moreover, our model is more stable for training in a non-adversarial manner, compared to other adversarial based novelty detection methods.

Clustering Contrastive Learning +5

Paper
Add Code

Contrastive Learning for Compact Single Image Dehazing

7 code implementations • CVPR 2021 • Haiyan Wu, Yanyun Qu, Shaohui Lin, Jian Zhou, Ruizhi Qiao, Zhizhong Zhang, Yuan Xie, Lizhuang Ma

In this paper, we propose a novel contrastive regularization (CR) built upon contrastive learning to exploit both the information of hazy images and clear images as negative and positive samples, respectively.

Ranked #5 on Image Dehazing on RS-Haze

Contrastive Learning Image Dehazing +1

333

Paper
Code

Visually Aligned Word Embeddings for Improving Zero-shot Learning

no code implementations • 18 Jul 2017 • Ruizhi Qiao, Lingqiao Liu, Chunhua Shen, Anton Van Den Hengel

To overcome this visual-semantic discrepancy, this work proposes an objective function to re-align the distributed word embeddings with visual information by learning a neural network to map it into a new representation called visually aligned word embedding (VAWE).

Paper
Add Code

Structured Learning of Tree Potentials in CRF for Image Segmentation

no code implementations • 26 Mar 2017 • Fayao Liu, Guosheng Lin, Ruizhi Qiao, Chunhua Shen

In this fashion, we easily achieve nonlinear learning of potential functions on both unary and pairwise terms in CRFs.

Image Segmentation Semantic Segmentation

Paper
Add Code

Less is more: zero-shot learning from online textual documents with noise suppression

no code implementations • CVPR 2016 • Ruizhi Qiao, Lingqiao Liu, Chunhua Shen, Anton Van Den Hengel

Classifying a visual concept merely from its associated online textual source, such as a Wikipedia article, is an attractive research topic in zero-shot learning because it alleviates the burden of manually collecting semantic attributes.

Zero-Shot Learning

Paper
Add Code

Learning discriminative trajectorylet detector sets for accurate skeleton-based action recognition

no code implementations • 20 Apr 2015 • Ruizhi Qiao, Lingqiao Liu, Chunhua Shen, Anton von den Hengel

The introduction of low-cost RGB-D sensors has promoted the research in skeleton-based human action recognition.

Action Recognition Skeleton Based Action Recognition +1

Paper
Add Code

Cannot find the paper you are looking for? You can Submit a new open access paper.