1 code implementation • 24 Jul 2023 • Peng Wu, Jing Liu, Xiangteng He, Yuxin Peng, Peng Wang, Yanning Zhang
In this context, we propose a novel task called Video Anomaly Retrieval (VAR), which aims to pragmatically retrieve relevant anomalous videos by cross-modalities, e. g., language descriptions and synchronous audios.
1 code implementation • CVPR 2023 • HsiaoYuan Hsu, Xiangteng He, Yuxin Peng, Hao Kong, Qing Zhang
Content-aware visual-textual presentation layout aims at arranging spatial space on the given canvas for pre-defined elements, including text, logo, and underlay, which is a key to automatic template-free creative graphic design.
1 code implementation • ICCV 2023 • Yulin Pan, Xiangteng He, Biao Gong, Yiliang Lv, Yujun Shen, Yuxin Peng, Deli Zhao
Video temporal grounding aims to pinpoint a video segment that matches the query description.
1 code implementation • 31 Aug 2022 • Hongbo Sun, Xiangteng He, Yuxin Peng
To address the above limitations, we propose the Structure Information Modeling Transformer (SIM-Trans) to incorporate object structure information into transformer for enhancing discriminative representation learning to contain both the appearance information and structure information.
Ranked #8 on Fine-Grained Image Classification on CUB-200-2011
no code implementations • 4 Aug 2021 • Zhen Han, Xiangteng He, Mingqian Tang, Yiliang Lv
To address the above issues, we propose the Video Similarity and Alignment Learning (VSAL) approach, which jointly models spatial similarity, temporal similarity and partial alignment.
1 code implementation • 26 Jul 2021 • Peng Wu, Xiangteng He, Mingqian Tang, Yiliang Lv, Jing Liu
Based on these, we naturally construct hierarchical representations in the individual-local-global manner, where the individual level focuses on the alignment between frame and word, local level focuses on the alignment between video clip and textual context, and global level focuses on the alignment between the whole video and text.
no code implementations • 16 Apr 2021 • Xiangteng He, Yulin Pan, Mingqian Tang, Yiliang Lv
In addition, most retrieval systems are based on frame-level features for video similarity searching, making it expensive both storage wise and search wise.
1 code implementation • 10 Jul 2019 • Xiangteng He, Yuxin Peng, Liu Xie
To the best of our knowledge, it is the first benchmark with 4 media types for fine-grained cross-media retrieval.
no code implementations • 30 Sep 2017 • Xiangteng He, Yuxin Peng, Junjie Zhao
Therefore, we propose a weakly supervised discriminative localization approach (WSDL) for fast fine-grained image classification to address the two limitations at the same time, and its main advantages are: (1) n-pathway end-to-end discriminative localization network is designed to improve classification speed, which simultaneously localizes multiple different discriminative regions for one image to boost classification accuracy, and shares full-image convolutional features generated by region proposal network to accelerate the process of generating region proposals as well as reduce the computation of convolutional operation.
no code implementations • 25 Sep 2017 • Xiangteng He, Yuxin Peng, Junjie Zhao
Existing methods generally adopt a two-stage learning framework: The first stage is to localize the discriminative regions of objects, and the second is to encode the discriminative features for training classifiers.
1 code implementation • 31 Aug 2017 • Xiangteng He, Yuxin Peng
As is known to all, when we describe the object of an image via textual descriptions, we mainly focus on the pivotal characteristics, and rarely pay attention to common characteristics as well as the background areas.
no code implementations • CVPR 2017 • Xiangteng He, Yuxin Peng
Most existing fine-grained image classification methods generally learn part detection models to obtain the semantic parts for better classification accuracy.
no code implementations • 10 Apr 2017 • Xiangteng He, Yuxin Peng
Most existing fine-grained image classification methods generally learn part detection models to obtain the semantic parts for better classification accuracy.
1 code implementation • 6 Apr 2017 • Yuxin Peng, Xiangteng He, Junjie Zhao
Both are jointly employed to exploit the subtle and local differences for distinguishing the subcategories.