Search Results for author: Xiangteng He

Found 14 papers, 8 papers with code

Towards Video Anomaly Retrieval from Video Anomaly Detection: New Benchmarks and Model

1 code implementation • 24 Jul 2023 • Peng Wu, Jing Liu, Xiangteng He, Yuxin Peng, Peng Wang, Yanning Zhang

In this context, we propose a novel task called Video Anomaly Retrieval (VAR), which aims to pragmatically retrieve relevant anomalous videos by cross-modalities, e. g., language descriptions and synchronous audios.

Anomaly Detection Retrieval +2

Paper
Code

PosterLayout: A New Benchmark and Approach for Content-aware Visual-Textual Presentation Layout

1 code implementation • CVPR 2023 • HsiaoYuan Hsu, Xiangteng He, Yuxin Peng, Hao Kong, Qing Zhang

Content-aware visual-textual presentation layout aims at arranging spatial space on the given canvas for pre-defined elements, including text, logo, and underlay, which is a key to automatic template-free creative graphic design.

Generative Adversarial Network

Paper
Code

Scanning Only Once: An End-to-end Framework for Fast Temporal Grounding in Long Videos

1 code implementation • ICCV 2023 • Yulin Pan, Xiangteng He, Biao Gong, Yiliang Lv, Yujun Shen, Yuxin Peng, Deli Zhao

Video temporal grounding aims to pinpoint a video segment that matches the query description.

Paper
Code

SIM-Trans: Structure Information Modeling Transformer for Fine-grained Visual Categorization

1 code implementation • 31 Aug 2022 • Hongbo Sun, Xiangteng He, Yuxin Peng

To address the above limitations, we propose the Structure Information Modeling Transformer (SIM-Trans) to incorporate object structure information into transformer for enhancing discriminative representation learning to contain both the appearance information and structure information.

Ranked #8 on Fine-Grained Image Classification on CUB-200-2011

Contrastive Learning Fine-Grained Image Classification +2

Paper
Code

Video Similarity and Alignment Learning on Partial Video Copy Detection

no code implementations • 4 Aug 2021 • Zhen Han, Xiangteng He, Mingqian Tang, Yiliang Lv

To address the above issues, we propose the Video Similarity and Alignment Learning (VSAL) approach, which jointly models spatial similarity, temporal similarity and partial alignment.

Copy Detection Partial Video Copy Detection +1

Paper
Add Code

HANet: Hierarchical Alignment Networks for Video-Text Retrieval

1 code implementation • 26 Jul 2021 • Peng Wu, Xiangteng He, Mingqian Tang, Yiliang Lv, Jing Liu

Based on these, we naturally construct hierarchical representations in the individual-local-global manner, where the individual level focuses on the alignment between frame and word, local level focuses on the alignment between video clip and textual context, and global level focuses on the alignment between the whole video and text.

Retrieval Text Matching +3

Paper
Code

Self-supervised Video Retrieval Transformer Network

no code implementations • 16 Apr 2021 • Xiangteng He, Yulin Pan, Mingqian Tang, Yiliang Lv

In addition, most retrieval systems are based on frame-level features for video similarity searching, making it expensive both storage wise and search wise.

Retrieval Self-supervised Video Retrieval +2

Paper
Add Code

A New Benchmark and Approach for Fine-grained Cross-media Retrieval

1 code implementation • 10 Jul 2019 • Xiangteng He, Yuxin Peng, Liu Xie

To the best of our knowledge, it is the first benchmark with 4 media types for fine-grained cross-media retrieval.

Representation Learning Retrieval

Paper
Code

Fast Fine-grained Image Classification via Weakly Supervised Discriminative Localization

no code implementations • 30 Sep 2017 • Xiangteng He, Yuxin Peng, Junjie Zhao

Therefore, we propose a weakly supervised discriminative localization approach (WSDL) for fast fine-grained image classification to address the two limitations at the same time, and its main advantages are: (1) n-pathway end-to-end discriminative localization network is designed to improve classification speed, which simultaneously localizes multiple different discriminative regions for one image to boost classification accuracy, and shares full-image convolutional features generated by region proposal network to accelerate the process of generating region proposals as well as reduce the computation of convolutional operation.

Classification Fine-Grained Image Classification +2

Paper
Add Code

Fine-grained Discriminative Localization via Saliency-guided Faster R-CNN

no code implementations • 25 Sep 2017 • Xiangteng He, Yuxin Peng, Junjie Zhao

Existing methods generally adopt a two-stage learning framework: The first stage is to localize the discriminative regions of objects, and the second is to encode the discriminative features for training classifiers.

Classification Fine-Grained Image Classification +1

Paper
Add Code

Fine-grained Visual-textual Representation Learning

1 code implementation • 31 Aug 2017 • Xiangteng He, Yuxin Peng

As is known to all, when we describe the object of an image via textual descriptions, we mainly focus on the pivotal characteristics, and rarely pay attention to common characteristics as well as the background areas.

Fine-Grained Visual Categorization Representation Learning

Paper
Code

Fine-Grained Image Classification via Combining Vision and Language

no code implementations • CVPR 2017 • Xiangteng He, Yuxin Peng

Most existing fine-grained image classification methods generally learn part detection models to obtain the semantic parts for better classification accuracy.

Attribute Classification +2

Paper
Add Code

Fine-graind Image Classification via Combining Vision and Language

no code implementations • 10 Apr 2017 • Xiangteng He, Yuxin Peng

Most existing fine-grained image classification methods generally learn part detection models to obtain the semantic parts for better classification accuracy.

Attribute Classification +2

Paper
Add Code

Object-Part Attention Model for Fine-grained Image Classification

1 code implementation • 6 Apr 2017 • Yuxin Peng, Xiangteng He, Junjie Zhao

Both are jointly employed to exploit the subtle and local differences for distinguishing the subcategories.

Classification Fine-Grained Image Classification +2

Paper
Code

Cannot find the paper you are looking for? You can Submit a new open access paper.