no code implementations • 15 Oct 2024 • Jiayi Lin, Jiabo Huang, Jian Hu, Shaogang Gong
To solve this, we propose InvSeg, a test-time prompt inversion method that tackles open-vocabulary semantic segmentation by inverting image-specific visual context into text prompt embedding space, leveraging structure information derived from the diffusion model's reconstruction process to enrich text prompts so as to associate each class with a structure-consistent mask.
1 code implementation • 27 Aug 2024 • Jian Hu, Jiayi Lin, Junchi Yan, Shaogang Gong
In this paper, we utilize hallucinations to mine task-related information from images and verify its accuracy for enhancing precision of the generated prompts.
Camouflaged Object Segmentation with a Single Task-generic Prompt Medical Image Segmentation +1
no code implementations • 9 Jul 2024 • Yu Cao, Shaogang Gong
In the field of Few-Shot Image Generation (FSIG) using Deep Generative Models (DGMs), accurately estimating the distribution of target domain with minimal samples poses a significant challenge.
1 code implementation • 6 Jul 2024 • Zixu Cheng, Yujiang Pu, Shaogang Gong, Parisa Kordjamshidi, Yu Kong
Temporal grounding, also known as video moment retrieval, aims at locating video segments corresponding to a given query sentence.
no code implementations • 25 Jun 2024 • Weitong Cai, Jiabo Huang, Shaogang Gong, Hailin Jin, Yang Liu
This intrinsic modality imbalance leaves a considerable portion of visual information remaining unaligned with text.
no code implementations • 3 Jun 2024 • Weitong Cai, Jiabo Huang, Shaogang Gong
Experiments show EVA's effectiveness in exploring temporal segment annotations in a source domain to help learn video moment retrieval without temporal labels in a target domain.
1 code implementation • 29 May 2024 • Zengqun Zhao, Yu Cao, Shaogang Gong, Ioannis Patras
Current facial expression recognition (FER) models are often designed in a supervised learning manner and thus are constrained by the lack of large-scale facial expression images with high-quality annotations.
no code implementations • 24 Jan 2024 • Dezhao Luo, Shaogang Gong, Jiabo Huang, Hailin Jin, Yang Liu
We address two problems in video editing for optimising unseen domain VMR: (1) generation of high-quality simulation videos of different moments with subtle distinctions, (2) selection of simulation videos that complement existing source training videos without introducing harmful noise or unnecessary repetitions.
no code implementations • 14 Dec 2023 • Shitong Sun, Fanghua Ye, Shaogang Gong
Composed image retrieval attempts to retrieve an image of interest from gallery images through a composed query of a reference image and its corresponding modified text.
1 code implementation • 12 Dec 2023 • Jian Hu, Jiayi Lin, Weitong Cai, Shaogang Gong
In this work, we aim to eliminate the need for manual prompt.
Camouflaged Object Segmentation with a Single Task-generic Prompt object-detection +2
1 code implementation • 24 Nov 2023 • Shitong Sun, Jindong Gu, Shaogang Gong
In this paper, we perform the first robustness study and establish three new diversified benchmarks for systematic analysis of text-image composed retrieval against natural corruptions in both vision and text and further probe textural understanding.
no code implementations • 24 Oct 2023 • Qilei Li, Shaogang Gong
While deep learning has significantly improved ReID model accuracy under the independent and identical distribution (IID) assumption, it has also become clear that such models degrade notably when applied to an unseen novel domain due to unpredictable/unknown domain shift.
no code implementations • 16 Mar 2023 • Jiayi Lin, Shaogang Gong
Specifically, GridCLIP performs Grid-level Alignment to adapt the CLIP image-level representations to grid-level representations by aligning to CLIP category representations to learn the annotated (especially frequent) categories.
1 code implementation • Asian Conference on Computer Vision 2023 • Ke Han, Shaogang Gong, Yan Huang, Liang Wang, Tieniu Tan
However, existing Re-ID methods usually generate 3D body shapes without considering identity modeling, which severely weakens the discriminability of 3D human shapes.
no code implementations • CVPR 2023 • Dezhao Luo, Jiabo Huang, Shaogang Gong, Hailin Jin, Yang Liu
The correlation between the vision and text is essential for video moment retrieval (VMR), however, existing methods heavily rely on separate pre-training feature extractors for visual and textual understanding.
no code implementations • CVPR 2023 • Ke Han, Shaogang Gong, Yan Huang, Liang Wang, Tieniu Tan
Specifically, to formulate meaningful clothing variations in the feature space, our method first estimates a clothing-change normal distribution with intra-ID cross-clothing variances.
no code implementations • 29 Aug 2022 • Shitong Sun, Chenyang Si, Guile Wu, Shaogang Gong
To resolve this problem, federated learning has been introduced to transfer knowledge across multiple sources (clients) with non-shared data while optimising a globally generalised central model (server).
no code implementations • 13 Jul 2022 • Qingze Yin, GuanAn Wang, Guodong Ding, Qilei Li, Shaogang Gong, Zhenmin Tang
To strike a balance between the model accuracy and efficiency, we propose a novel Sub-space Consistency Regularization (SCR) algorithm that can speed up the ReID procedure by $0. 25$ times than real-value features under the same dimensions whilst maintaining a competitive accuracy, especially under short codes.
no code implementations • 26 Jun 2022 • Jiabo Huang, Hailin Jin, Shaogang Gong, Yang Liu
Such uncertainties in temporal labelling are currently ignored in model training, resulting in learning mis-matched video-text correlation with poor generalisation in test.
1 code implementation • 2 Jun 2022 • Jian Hu, Haowen Zhong, Junchi Yan, Shaogang Gong, Guile Wu, Fei Yang
However, due to the significant imbalance between the amount of annotated data in the source and target domains, usually only the target distribution is aligned to the source domain, leading to adapting unnecessary source specific knowledge to the target domain, i. e., biased domain adaptation.
no code implementations • 23 May 2022 • Qilei Li, Jiabo Huang, Jian Hu, Shaogang Gong
In this work, we propose a Feature-Distribution Perturbation and Calibration (PECA) method to derive generic feature representations for person ReID, which is not only discriminative across cameras but also agnostic and deployable to arbitrary unseen target domains.
no code implementations • 28 Mar 2022 • Pan Li, Yanwei Fu, Shaogang Gong
The MFL computes meta-knowledge on functional regularisation generalisable to different learning tasks by which functional training on limited labelled data promotes more discriminative functions to be learned.
no code implementations • CVPR 2022 • Pan Li, Shaogang Gong, Chengjie Wang, Yanwei Fu
The calibrated distance in this target-aware non-linear subspace is complementary to that in the pre-trained representation.
no code implementations • 22 Oct 2021 • Qilei Li, Jiabo Huang, Shaogang Gong
In this work, we explore jointly both local alignments and global correlations with further consideration of their mutual promotion/reinforcement so to better assemble complementary discriminative Re-ID information within all the relevant frames in video tracklets.
no code implementations • 21 Oct 2021 • Shitong Sun, Guile Wu, Shaogang Gong
This helps to preserve model personalisation knowledge on each local client domain and learn instance-specific information.
no code implementations • ICCV 2021 • Jiabo Huang, Yang Liu, Shaogang Gong, Hailin Jin
Video activity localisation has recently attained increasing attention due to its practical values in automatically localising the most salient visual segments corresponding to their language descriptions (sentences) from untrimmed and unstructured videos.
no code implementations • 3 Mar 2021 • Jiabo Huang, Shaogang Gong
Whilst contrastive learning has recently brought notable benefits to deep clustering of unlabelled images by learning sample-specific discriminative visual features, its potential for explicitly inferring class decision boundaries is less well understood.
no code implementations • 16 Jan 2021 • Minxian Li, Xiatian Zhu, Shaogang Gong
Extensive comparative experiments demonstrate that the proposed STL model surpasses significantly the state-of-the-art unsupervised learning and one-shot learning re-id methods on three large tracklet person re-id benchmarks.
no code implementations • ICCV 2021 • Guile Wu, Shaogang Gong
Our base model consists of a domain-invariant feature extractor and an ensemble of domain-specific classifiers.
Domain Generalization Multi-Source Unsupervised Domain Adaptation +1
no code implementations • 1 Jan 2021 • Xu Lan, Steven McDonagh, Shaogang Gong, Jiali Wang, Zhenguo Li, Sarah Parisot
Semi-Supervised Few-shot Learning (SS-FSL) investigates the benefit of incorporating unlabelled data in few-shot settings.
no code implementations • ICCV 2021 • Guile Wu, Shaogang Gong, Pan Li
With the reformulated baseline, we present two new approaches to CIL by learning class-independent knowledge and multi-perspective knowledge, respectively.
no code implementations • ICCV 2021 • Pan Li, Da Li, Wei Li, Shaogang Gong, Yanwei Fu, Timothy M. Hospedales
The topical domain generalization (DG) problem asks trained models to perform well on an unseen target domain with different data statistics from the source training domains.
1 code implementation • ECCV 2020 • Guan'an Wang, Shaogang Gong, Jian Cheng, Zeng-Guang Hou
In this work, we introduce a new solution for fast ReID by formulating a novel Coarse-to-Fine (CtF) hashing code search strategy, which complementarily uses short and long codes, achieving both faster speed and better accuracy.
1 code implementation • 11 Aug 2020 • Alexander Hudson, Shaogang Gong
Structure determination is key to understanding protein function at a molecular level.
no code implementations • 8 Jun 2020 • Jiabo Huang, Shaogang Gong
In this work, we address this problem by transfer clustering that aims to learn a discriminative latent space of the unlabelled target data in a novel domain by knowledge transfer from labelled related domains.
1 code implementation • 7 Jun 2020 • Guile Wu, Shaogang Gong
Meanwhile, we employ the temporal mean model of each peer as the peer mean teacher to collaboratively transfer knowledge among peers, which helps each peer to learn richer knowledge and facilitates to optimise a more stable model with better generalisation.
no code implementations • 7 Jun 2020 • Guile Wu, Shaogang Gong
Specifically, each local client receives global model updates from the server and trains a local model using its local data independent from all the other clients.
no code implementations • 12 Feb 2020 • Xiangping Zhu, Xiatian Zhu, Minxian Li, Pietro Morerio, Vittorio Murino, Shaogang Gong
Existing person re-identification (re-id) methods mostly exploit a large set of cross-camera identity labelled training data.
no code implementations • 30 Dec 2019 • Zhiyi Cheng, Xiatian Zhu, Shaogang Gong
Extensive evaluations demonstrate the performance superiority of our method over state-of-the-art SR and UDA models on both genuine and artificial LR facial imagery data.
no code implementations • 25 Sep 2019 • Wei Li, Shaogang Gong, Xiatian Zhu
We address this limitation by additionally exploiting feature self-calibration operations, resulting in a heterogeneous search space.
no code implementations • 27 Aug 2019 • Xiangping Zhu, Xiatian Zhu, Minxian Li, Vittorio Murino, Shaogang Gong
Existing person re-identification (re-id) methods rely mostly on a large set of inter-camera identity labelled training data, requiring a tedious data collection and annotation process therefore leading to poor scalability in practical re-id applications.
no code implementations • 16 Aug 2019 • Xun Xu, Shaogang Gong, Timothy Hospedales
To that end, we relax the common assumption that each individual crowd video instance is only associated with a single crowd attribute.
no code implementations • 22 Jul 2019 • Xu Lan, Xiatian Zhu, Shaogang Gong
Most state-of-the-art person re-identification (re-id) methods depend on supervised model learning with a large set of cross-view identity labelled training data.
1 code implementation • 25 Apr 2019 • Jiabo Huang, Qi Dong, Shaogang Gong, Xiatian Zhu
Deep convolutional neural networks (CNNs) have demonstrated remarkable success in computer vision by supervisedly learning strong visual feature representations.
1 code implementation • CVPR 2019 • Hong-Xing Yu, Wei-Shi Zheng, An-Cong Wu, Xiaowei Guo, Shaogang Gong, Jian-Huang Lai
To overcome this problem, we propose a deep model for the soft multilabel learning for unsupervised RE-ID.
Ranked #83 on Person Re-Identification on DukeMTMC-reID
1 code implementation • 1 Mar 2019 • Minxian Li, Xiatian Zhu, Shaogang Gong
We formulate an Unsupervised Tracklet Association Learning (UTAL) framework.
Ranked #1 on Person Re-Identification on DukeTracklet
no code implementations • 28 Nov 2018 • Peng Lu, Hangyu Lin, Yanwei Fu, Shaogang Gong, Yu-Gang Jiang, xiangyang xue
Additionally, to study the tasks of sketch-based hairstyle retrieval, this paper contributes a new instance-level photo-sketch dataset - Hairstyle Photo-Sketch dataset, which is composed of 3600 sketches and photos, and 2400 sketch-photo pairs.
no code implementations • 21 Nov 2018 • Zhiyi Cheng, Xiatian Zhu, Shaogang Gong
Whilst recent face-recognition (FR) techniques have made significant progress on recognising constrained high-resolution web images, the same cannot be said on natively unconstrained low-resolution images at large scales.
no code implementations • 20 Nov 2018 • Qi Dong, Xiatian Zhu, Shaogang Gong
The objective learning formulation is essential for the success of convolutional neural networks.
no code implementations • 19 Nov 2018 • Xu Lan, Xiatian Zhu, Shaogang Gong
Whilst being able to create stronger target networks compared to the vanilla non-teacher based learning strategy, this scheme needs to train additionally a large teacher model with expensive computational cost.
no code implementations • 25 Sep 2018 • Aytaç Kanacı, Xiatian Zhu, Shaogang Gong
Existing vehicle re-identification (re-id) evaluation benchmarks consider strongly artificial test scenarios by assuming the availability of high quality images and fine-grained appearance at an almost constant image scale, reminiscent to images required for Automatic Number Plate Recognition, e. g. VeRi-776.
no code implementations • ECCV 2018 • Minxian Li, Xiatian Zhu, Shaogang Gong
Mostexistingpersonre-identification(re-id)methods relyon supervised model learning on per-camera-pair manually labelled pairwise training data.
Ranked #2 on Person Re-Identification on DukeTracklet
1 code implementation • ECCV 2018 • Yanbei Chen, Xiatian Zhu, Shaogang Gong
We consider the semi-supervised multi-class classification problem of learning from sparse labelled and abundant unlabelled training data.
1 code implementation • 22 Aug 2018 • Yanbei Chen, Xiatian Zhu, Shaogang Gong
In this work, to address the video person re-id task, we formulate a novel Deep Association Learning (DAL) scheme, the first end-to-end deep learning method using none of the identity labels in model initialisation and training.
Ranked #6 on Person Re-Identification on PRID2011
Unsupervised Person Re-Identification Video-Based Person Re-Identification
no code implementations • ECCV 2018 • Xu Lan, Xiatian Zhu, Shaogang Gong
In contrast to previous studies, we show that sufficiently reliable person instance cropping is achievable by slightly improved state-of-the-art deep learning object detectors (e. g. Faster-RCNN), and the under-studied multi-scale matching problem in person search is a more severe barrier.
2 code implementations • 5 Jul 2018 • Hang Su, Xiatian Zhu, Shaogang Gong
In this work, we introduce a more realistic and challenging logo detection setting, called Open Logo Detection.
no code implementations • 25 Jun 2018 • Hanxiao Wang, Xiatian Zhu, Shaogang Gong, Tao Xiang
Most existing person re-identification (re-id) methods are unsuitable for real-world deployment due to two reasons: Unscalability to large population size, and Inadaptability over time.
3 code implementations • NeurIPS 2018 • Xu Lan, Xiatian Zhu, Shaogang Gong
Knowledge distillation is effective to train small and generalisable network models for meeting the low-memory and fast running requirements.
1 code implementation • 28 Apr 2018 • Qi Dong, Shaogang Gong, Xiatian Zhu
In particular, existing deep learning methods consider mostly either class balanced data or moderately imbalanced data in model training, and ignore the challenge of learning from significantly imbalanced training data.
1 code implementation • 25 Apr 2018 • Zhiyi Cheng, Xiatian Zhu, Shaogang Gong
To facilitate more studies on developing FR models that are effective and robust for low-resolution surveillance facial images, we introduce a new Surveillance Face Recognition Challenge, which we call the QMUL-SurvFace benchmark.
2 code implementations • 30 Mar 2018 • Hang Su, Shaogang Gong, Xiatian Zhu
Existing logo detection methods usually consider a small number of logo classes and limited images per class with a strong assumption of requiring tedious object bounding box annotations, therefore not scalable to real-world dynamic applications.
no code implementations • CVPR 2018 • Jingya Wang, Xiatian Zhu, Shaogang Gong, Wei Li
Most existing person re-identification (re-id) methods require supervised model learning from a separate large set of pairwise labelled training data for every single camera pair.
Ranked #23 on Unsupervised Domain Adaptation on Market to Duke
1 code implementation • CVPR 2018 • Wei Li, Xiatian Zhu, Shaogang Gong
Existing person re-identification (re-id) methods either assume the availability of well-aligned person bounding box images as model input or rely on constrained attention selection mechanisms to calibrate misaligned images.
Ranked #13 on Person Re-Identification on CUHK03 detected
1 code implementation • ICCV 2017 • Qi Dong, Shaogang Gong, Xiatian Zhu
Recognising detailed facial or clothing attributes in images of people is a challenging task for computer vision, especially when the training data are both in very large scale and extremely imbalanced among different attribute classes.
no code implementations • 13 Oct 2017 • Yanwei Fu, Tao Xiang, Yu-Gang Jiang, xiangyang xue, Leonid Sigal, Shaogang Gong
With the recent renaissance of deep convolution neural networks, encouraging breakthroughs have been achieved on the supervised recognition tasks, where each class has sufficient training data and fully annotated training data.
no code implementations • ICCV 2017 • Ancong Wu, Wei-Shi Zheng, Hong-Xing Yu, Shaogang Gong, Jian-Huang Lai
To that end, matching RGB images with infrared images is required, which are heterogeneous with very different visual characteristics.
Ranked #4 on Cross-Modal Person Re-Identification on SYSU-MM01 (mAP (All-search & Single-shot) metric)
Cross-Modality Person Re-identification Cross-Modal Person Re-Identification
no code implementations • ICCV 2017 • Jingya Wang, Xiatian Zhu, Shaogang Gong, Wei Li
Recognising semantic pedestrian attributes in surveillance images is a challenging task for computer vision, particularly when the imaging quality is poor with complex background clutter and uncontrolled viewing conditions, and the number of labelled training data is small.
no code implementations • 10 Jul 2017 • Xu Lan, Hanxiao Wang, Shaogang Gong, Xiatian Zhu
Existing person re-identification (re-id) methods assume the provision of accurately cropped person bounding boxes with minimum background noise, mostly by manually cropping.
no code implementations • 29 Jun 2017 • Li Zhang, Flood Sung, Feng Liu, Tao Xiang, Shaogang Gong, Yongxin Yang, Timothy M. Hospedales
Generating natural language descriptions of images is an important capability for a robot or other visual-intelligence driven AI agent that may need to communicate with human users about what it is seeing.
no code implementations • 30 May 2017 • Jingya Wang, Xiatian Zhu, Shaogang Gong
As a result, our model is able to discover more accurate semantic correlation between textual tags and visual features, and finally providing favourable visual semantics interpretation even with highly sparse and incomplete tags.
no code implementations • 12 May 2017 • Wei Li, Xiatian Zhu, Shaogang Gong
Existing person re-identification (re-id) methods rely mostly on either localised or global feature representation alone.
Ranked #102 on Person Re-Identification on Market-1501
4 code implementations • CVPR 2017 • Elyor Kodirov, Tao Xiang, Shaogang Gong
We show that with this additional reconstruction constraint, the learned projection function from the seen classes is able to generalise better to the new unseen classes.
no code implementations • 29 Dec 2016 • Hang Su, Xiatian Zhu, Shaogang Gong
Logo detection in unconstrained images is challenging, particularly when only very sparse labelled training images are accessible due to high labelling costs.
no code implementations • 5 Dec 2016 • Hanxiao Wang, Shaogang Gong, Tao Xiang
Existing person re-identification models are poor for scaling up to large data required in real-world applications due to: (1) Complexity: They employ complex models for optimal performance resulting in high computational cost for training at a large scale; (2) Inadaptability: Once trained, they are unsuitable for incremental update to incorporate any new data available.
no code implementations • 5 Dec 2016 • Hanxiao Wang, Shaogang Gong, Xiatian Zhu, Tao Xiang
Current person re-identification (re-id) methods assume that (1) pre-labelled training data are available for every camera pair, (2) the gallery size for re-identification is moderate.
no code implementations • 26 Nov 2016 • Xun Xu, Timothy M. Hospedales, Shaogang Gong
In this work, we improve the ability of ZSL to generalise across this domain shift in both model- and data-centric ways by formulating a visual-semantic mapping with better generalisation properties and a dynamic data re-weighting method to prioritise auxiliary data that are relevant to the target classes.
Ranked #7 on Zero-Shot Action Recognition on Olympics
no code implementations • 25 Nov 2016 • Xiaolong Ma, Xiatian Zhu, Shaogang Gong, Xudong Xie, Jianming Hu, Kin-Man Lam, Yisheng Zhong
Crucially, this model does not require pairwise labelled training data (i. e. unsupervised) therefore readily scalable to large scale camera networks of arbitrary camera pairs without the need for exhaustive data annotation for every camera pair.
4 code implementations • CVPR 2017 • Li Zhang, Tao Xiang, Shaogang Gong
In this paper we argue that the key to make deep ZSL models succeed is to choose the right embedding space.
Ranked #16 on Zero-Shot Action Recognition on Kinetics
no code implementations • 17 Oct 2016 • Arne Schumann, Shaogang Gong, Tobias Schuchert
Person re-identification (re-id) is the task of matching multiple occurrences of the same person from different cameras, poses, lighting conditions, and a multitude of other factors which alter the visual appearance.
Ranked #5 on Person Re-Identification on CUHK-SYSU
no code implementations • 12 Oct 2016 • Qi Dong, Shaogang Gong, Xiatian Zhu
Recognising detailed clothing characteristics (fine-grained attributes) in unconstrained images of people in-the-wild is a challenging task for computer vision, especially when there is only limited training data from the wild whilst most data available for model learning are captured in well-controlled environments using fashion models (well lit, no background clutter, frontal view, high-resolution).
no code implementations • CVPR 2016 • Peixi Peng, Tao Xiang, Yao-Wei Wang, Massimiliano Pontil, Shaogang Gong, Tiejun Huang, Yonghong Tian
Most existing person re-identification (Re-ID) approaches follow a supervised learning framework, in which a large number of labelled matching pairs are required for training.
no code implementations • CVPR 2016 • Li Zhang, Tao Xiang, Shaogang Gong
Most existing person re-identification (re-id) methods focus on learning the optimal distance metrics across camera views.
Ranked #118 on Person Re-Identification on Market-1501
no code implementations • 23 Jan 2016 • Taiqing Wang, Shaogang Gong, Xiatian Zhu, Shengjin Wang
Current person re-identification (ReID) methods typically rely on single-frame imagery features, whilst ignoring space-time information from image sequences often available in the practical surveillance scenarios.
no code implementations • ICCV 2015 • Wei-Shi Zheng, Xiang Li, Tao Xiang, Shengcai Liao, Jian-Huang Lai, Shaogang Gong
We address a new partial person re-identification (re-id) problem, where only a partial observation of a person is available for matching across different non-overlapping camera views.
no code implementations • ICCV 2015 • Xiang Li, Wei-Shi Zheng, Xiaojuan Wang, Tao Xiang, Shaogang Gong
In real world person re-identification (re-id), images of people captured at very different resolutions from different locations need be matched.
no code implementations • ICCV 2015 • Elyor Kodirov, Tao Xiang, Zhen-Yong Fu, Shaogang Gong
Zero-shot learning (ZSL) can be considered as a special case of transfer learning where the source and target domains have different tasks/label spaces and the target domain is unlabelled, providing little guidance for the knowledge transfer.
no code implementations • 13 Nov 2015 • Xun Xu, Timothy Hospedales, Shaogang Gong
This is a more challenging problem than existing ZSL of still images and/or attributes, because the mapping between video spacetime features of actions and the semantic space is more complex and harder to learn for the purpose of generalising over any cross-category domain shift.
no code implementations • 9 Oct 2015 • Yi Li, Yi-Zhe Song, Timothy Hospedales, Shaogang Gong
We present a generative model which can automatically summarize the stroke composition of free-hand sketches of a given category.
no code implementations • 27 Jul 2015 • Xun Xu, Timothy Hospedales, Shaogang Gong
The growing rate of public space CCTV installations has generated a need for automated methods for exploiting video surveillance data including scene understanding, query, behaviour annotation and summarization.
no code implementations • CVPR 2015 • Zhenyong Fu, Tao Xiang, Elyor Kodirov, Shaogang Gong
The semantic manifold structure is used to redefine the distance metric in the semantic embedding space for more effective ZSL.
no code implementations • 26 Mar 2015 • Yanwei Fu, Yongxin Yang, Timothy M. Hospedales, Tao Xiang, Shaogang Gong
Recently, zero-shot learning (ZSL) has received increasing interest.
no code implementations • 26 Mar 2015 • Yanwei Fu, Yongxin Yang, Tim Hospedales, Tao Xiang, Shaogang Gong
Zero-shot learning has received increasing interest as a means to alleviate the often prohibitive expense of annotating training data for large scale recognition problems.
no code implementations • 5 Feb 2015 • Xun Xu, Timothy Hospedales, Shaogang Gong
In this framework a mapping is constructed between visual features and a human interpretable semantic description of each category, allowing categories to be recognised in the absence of any training data.
Ranked #34 on Zero-Shot Action Recognition on UCF101
no code implementations • 25 Jan 2015 • Yanwei Fu, Timothy M. Hospedales, Tao Xiang, Jiechao Xiong, Shaogang Gong, Yizhou Wang, Yuan YAO
In this paper, we propose a more principled way to identify annotation outliers by formulating the subjective visual property prediction task as a unified robust learning to rank problem, tackling both the outlier detection and learning to rank jointly.
no code implementations • 19 Jan 2015 • Yanwei Fu, Timothy M. Hospedales, Tao Xiang, Shaogang Gong
A projection from a low-level feature space to the semantic representation space is learned from the auxiliary dataset and is applied without adaptation to the target dataset.
no code implementations • 13 Jan 2015 • Xiatian Zhu, Chen Change Loy, Shaogang Gong
Many visual surveillance tasks, e. g. video summarisation, is conventionally accomplished through analysing imagerybased features.
no code implementations • 16 Jun 2014 • Zhen-Yong Fu, Tao Xiang, Shaogang Gong
Specifically, in contrast to previous work which ignores the semantic relationships between seen classes and focus merely on those between seen and unseen classes, in this paper a novel approach based on a semantic graph is proposed to represent the relationships between all the seen and unseen class in a semantic word space.
no code implementations • CVPR 2014 • Xiatian Zhu, Chen Change Loy, Shaogang Gong
Spectral clustering requires robust and meaningful affinity graphs as input in order to form clusters with desired structures that can well support human intuition.
no code implementations • CVPR 2013 • Ke Chen, Shaogang Gong, Tao Xiang, Chen Change Loy
A number of computer vision problems such as human age estimation, crowd density estimation and body/face pose (view angle) estimation can be formulated as a regression problem by learning a mapping function between a high dimensional vector-formed feature input and a scalarvalued output.