Search Results for author: Xiangteng He

Found 14 papers, 8 papers with code

Towards Video Anomaly Retrieval from Video Anomaly Detection: New Benchmarks and Model

1 code implementation24 Jul 2023 Peng Wu, Jing Liu, Xiangteng He, Yuxin Peng, Peng Wang, Yanning Zhang

In this context, we propose a novel task called Video Anomaly Retrieval (VAR), which aims to pragmatically retrieve relevant anomalous videos by cross-modalities, e. g., language descriptions and synchronous audios.

Anomaly Detection Retrieval +2

PosterLayout: A New Benchmark and Approach for Content-aware Visual-Textual Presentation Layout

1 code implementation CVPR 2023 HsiaoYuan Hsu, Xiangteng He, Yuxin Peng, Hao Kong, Qing Zhang

Content-aware visual-textual presentation layout aims at arranging spatial space on the given canvas for pre-defined elements, including text, logo, and underlay, which is a key to automatic template-free creative graphic design.

Generative Adversarial Network

SIM-Trans: Structure Information Modeling Transformer for Fine-grained Visual Categorization

1 code implementation31 Aug 2022 Hongbo Sun, Xiangteng He, Yuxin Peng

To address the above limitations, we propose the Structure Information Modeling Transformer (SIM-Trans) to incorporate object structure information into transformer for enhancing discriminative representation learning to contain both the appearance information and structure information.

Contrastive Learning Fine-Grained Image Classification +2

Video Similarity and Alignment Learning on Partial Video Copy Detection

no code implementations4 Aug 2021 Zhen Han, Xiangteng He, Mingqian Tang, Yiliang Lv

To address the above issues, we propose the Video Similarity and Alignment Learning (VSAL) approach, which jointly models spatial similarity, temporal similarity and partial alignment.

Copy Detection Partial Video Copy Detection +1

HANet: Hierarchical Alignment Networks for Video-Text Retrieval

1 code implementation26 Jul 2021 Peng Wu, Xiangteng He, Mingqian Tang, Yiliang Lv, Jing Liu

Based on these, we naturally construct hierarchical representations in the individual-local-global manner, where the individual level focuses on the alignment between frame and word, local level focuses on the alignment between video clip and textual context, and global level focuses on the alignment between the whole video and text.

Retrieval Text Matching +3

Self-supervised Video Retrieval Transformer Network

no code implementations16 Apr 2021 Xiangteng He, Yulin Pan, Mingqian Tang, Yiliang Lv

In addition, most retrieval systems are based on frame-level features for video similarity searching, making it expensive both storage wise and search wise.

Retrieval Self-supervised Video Retrieval +2

A New Benchmark and Approach for Fine-grained Cross-media Retrieval

1 code implementation10 Jul 2019 Xiangteng He, Yuxin Peng, Liu Xie

To the best of our knowledge, it is the first benchmark with 4 media types for fine-grained cross-media retrieval.

Representation Learning Retrieval

Fast Fine-grained Image Classification via Weakly Supervised Discriminative Localization

no code implementations30 Sep 2017 Xiangteng He, Yuxin Peng, Junjie Zhao

Therefore, we propose a weakly supervised discriminative localization approach (WSDL) for fast fine-grained image classification to address the two limitations at the same time, and its main advantages are: (1) n-pathway end-to-end discriminative localization network is designed to improve classification speed, which simultaneously localizes multiple different discriminative regions for one image to boost classification accuracy, and shares full-image convolutional features generated by region proposal network to accelerate the process of generating region proposals as well as reduce the computation of convolutional operation.

Classification Fine-Grained Image Classification +2

Fine-grained Discriminative Localization via Saliency-guided Faster R-CNN

no code implementations25 Sep 2017 Xiangteng He, Yuxin Peng, Junjie Zhao

Existing methods generally adopt a two-stage learning framework: The first stage is to localize the discriminative regions of objects, and the second is to encode the discriminative features for training classifiers.

Classification Fine-Grained Image Classification +1

Fine-grained Visual-textual Representation Learning

1 code implementation31 Aug 2017 Xiangteng He, Yuxin Peng

As is known to all, when we describe the object of an image via textual descriptions, we mainly focus on the pivotal characteristics, and rarely pay attention to common characteristics as well as the background areas.

Fine-Grained Visual Categorization Representation Learning

Fine-Grained Image Classification via Combining Vision and Language

no code implementations CVPR 2017 Xiangteng He, Yuxin Peng

Most existing fine-grained image classification methods generally learn part detection models to obtain the semantic parts for better classification accuracy.

Attribute Classification +2

Fine-graind Image Classification via Combining Vision and Language

no code implementations10 Apr 2017 Xiangteng He, Yuxin Peng

Most existing fine-grained image classification methods generally learn part detection models to obtain the semantic parts for better classification accuracy.

Attribute Classification +2

Object-Part Attention Model for Fine-grained Image Classification

1 code implementation6 Apr 2017 Yuxin Peng, Xiangteng He, Junjie Zhao

Both are jointly employed to exploit the subtle and local differences for distinguishing the subcategories.

Classification Fine-Grained Image Classification +2

Cannot find the paper you are looking for? You can Submit a new open access paper.