1 code implementation • ICML 2020 • Jianshu Zhang, Jun Du, Yongxin Yang, Yi-Zhe Song, Si Wei, Li-Rong Dai
Recent encoder-decoder approaches typically employ string decoders to convert images into serialized strings for image-to-markup.
no code implementations • 16 Nov 2022 • Linus Ericsson, Nanqing Dong, Yongxin Yang, Ales Leonardis, Steven McDonagh
In this work, we propose a simple pretext task that provides an effective pre-training for the RPN, towards efficiently improving downstream object detection performance.
no code implementations • 17 Oct 2022 • Qishi Dong, Awais Muhammad, Fengwei Zhou, Chuanlong Xie, Tianyang Hu, Yongxin Yang, Sung-Ho Bae, Zhenguo Li
We evaluate our paradigm on a diverse model zoo consisting of 35 models for various OoD tasks and demonstrate: (i) model ranking is better correlated with fine-tuning ranking than previous methods and up to 9859x faster than brute-force fine-tuning; (ii) OoD generalization after model ensemble with feature selection outperforms the state-of-the-art methods and the accuracy on most challenging task DomainNet is improved from 46. 5\% to 50. 6\%.
1 code implementation • 4 Oct 2022 • Yongshuo Zong, Yongxin Yang, Timothy Hospedales
In this work, we introduce MEDFAIR, a framework to benchmark the fairness of machine learning models for medical imaging.
1 code implementation • 20 Sep 2022 • Ling Luo, Yulia Gryaditskaya, Yongxin Yang, Tao Xiang, Yi-Zhe Song
In this paper, we offer a different perspective towards answering these questions -- we study the use of 3D sketches as an input modality and advocate a VR-scenario where retrieval is conducted.
1 code implementation • 20 Sep 2022 • Ling Luo, Yulia Gryaditskaya, Yongxin Yang, Tao Xiang, Yi-Zhe Song
We then, for the first time, study the scenario of fine-grained 3D VR sketch to 3D shape retrieval, as a novel VR sketching application and a proving ground to drive out generic insights to inform future research.
no code implementations • 26 Jun 2022 • Xiongjie Chen, Yunpeng Li, Yongxin Yang
Out-of-distribution (OOD) detection has recently received much attention from the machine learning community due to its importance in deploying machine learning models in real-world applications.
no code implementations • CVPR 2022 • Sarah Parisot, Pedro M. Esperanca, Steven McDonagh, Tamas J. Madarasz, Yongxin Yang, Zhenguo Li
In this work, we introduce a novel strategy for long-tail recognition that addresses the tail classes' few-shot problem via training-free knowledge transfer.
1 code implementation • 6 Nov 2021 • Zhongying Deng, Kaiyang Zhou, Yongxin Yang, Tao Xiang
Importantly, the attention module is supervised by a consistency loss, which is imposed on the distributions of channel attention weights between source and target domains.
no code implementations • 29 Sep 2021 • Boyan Gao, Henry Gouk, Yongxin Yang, Timothy Hospedales
We take a different approach, and explore the impact of the ERM loss function on out-of-domain generalisation.
no code implementations • 29 Sep 2021 • Nanqing Dong, Matteo Maggioni, Yongxin Yang, Eduardo Pérez-Pellitero, Ales Leonardis, Steven McDonagh
In the era of deep learning, supervised residual learning (ResL) has led to many breakthroughs in low-level vision such as image restoration and enhancement tasks.
no code implementations • ICLR 2022 • Ayan Das, Yongxin Yang, Timothy Hospedales, Tao Xiang, Yi-Zhe Song
Learning meaningful representations for chirographic drawing data such as sketches, handwriting, and flowcharts is a gateway for understanding and emulating human creative expression.
2 code implementations • 5 Jul 2021 • Kaiyang Zhou, Yongxin Yang, Yu Qiao, Tao Xiang
In this work, we address domain generalization with MixStyle, a plug-and-play, parameter-free module that is simply inserted to shallow CNN layers and requires no modification to training objectives.
1 code implementation • NeurIPS 2021 • Ondrej Bohdal, Yongxin Yang, Timothy Hospedales
Gradient-based meta-learning and hyperparameter optimization have seen significant progress recently, enabling practical end-to-end training of neural networks together with many hyperparameters.
no code implementations • 18 Jun 2021 • Nanqing Dong, Matteo Maggioni, Yongxin Yang, Eduardo Pérez-Pellitero, Ales Leonardis, Steven McDonagh
We propose a new label-efficient learning paradigm based on residuals, residual contrastive learning (RCL), and derive an unsupervised visual representation learning framework, suitable for low-level vision tasks with noisy inputs.
1 code implementation • 17 Jun 2021 • Ondrej Bohdal, Yongxin Yang, Timothy Hospedales
The problem is especially noticeable when using modern neural networks, for which there is a significant difference between the confidence of the model and the probability of correct prediction.
no code implementations • 18 May 2021 • Conghui Hu, Yongxin Yang, Yunpeng Li, Timothy M. Hospedales, Yi-Zhe Song
The practical value of existing supervised sketch-based image retrieval (SBIR) algorithms is largely limited by the requirement for intensive data collection and labeling.
3 code implementations • ICLR 2021 • Kaiyang Zhou, Yongxin Yang, Yu Qiao, Tao Xiang
Our method, termed MixStyle, is motivated by the observation that visual domain is closely related to image style (e. g., photo vs.~sketch images).
Ranked #43 on
Domain Generalization
on PACS
no code implementations • 1 Apr 2021 • Jiansong Li, Xiao Dong, Guangli Li, Peng Zhao, Xueying Wang, Xiaobing Chen, Xianzhi Yu, Yongxin Yang, Zihan Jiang, Wei Cao, Lei Liu, Xiaobing Feng
The training of deep neural networks (DNNs) is usually memory-hungry due to the limited device memory capacity of DNN accelerators.
1 code implementation • CVPR 2021 • Ayan Das, Yongxin Yang, Timothy Hospedales, Tao Xiang, Yi-Zhe Song
Analysis of human sketches in deep learning has advanced immensely through the use of waypoint-sequences rather than raster-graphic representations.
no code implementations • CVPR 2021 • Aneeshan Sain, Ayan Kumar Bhunia, Yongxin Yang, Tao Xiang, Yi-Zhe Song
With this meta-learning framework, our model can not only disentangle the cross-modal shared semantic content for SBIR, but can adapt the disentanglement to any unseen user style as well, making the SBIR model truly style-agnostic.
1 code implementation • CVPR 2021 • Ayan Kumar Bhunia, Pinaki Nath Chowdhury, Aneeshan Sain, Yongxin Yang, Tao Xiang, Yi-Zhe Song
A fundamental challenge faced by existing Fine-Grained Sketch-Based Image Retrieval (FG-SBIR) models is the data scarcity -- model performances are largely bottlenecked by the lack of sketch-photo pairs.
1 code implementation • CVPR 2021 • Ayan Kumar Bhunia, Pinaki Nath Chowdhury, Yongxin Yang, Timothy M. Hospedales, Tao Xiang, Yi-Zhe Song
This data is uniquely characterised by its existence in dual modalities of rasterized images and vector coordinate sequences.
1 code implementation • CVPR 2021 • Sen He, Wentong Liao, Michael Ying Yang, Yongxin Yang, Yi-Zhe Song, Bodo Rosenhahn, Tao Xiang
We argue that these are caused by the lack of context-aware object and stuff feature encoding in their generators, and location-sensitive appearance representation in their discriminators.
Ranked #1 on
Layout-to-Image Generation
on COCO-Stuff 128x128
no code implementations • 10 Dec 2020 • Yuting Qiang, Yongxin Yang, Xueting Zhang, Yanwen Guo, Timothy M. Hospedales
We present a novel Tensor Composition Net (TCN) to predict visual relationships in images.
1 code implementation • 29 Jul 2020 • Aneeshan Sain, Ayan Kumar Bhunia, Yongxin Yang, Tao Xiang, Yi-Zhe Song
In this paper, we study a further trait of sketches that has been overlooked to date, that is, they are hierarchical in terms of the levels of detail -- a person typically sketches up to various extents of detail to depict an object.
1 code implementation • ECCV 2020 • Kaiyang Zhou, Yongxin Yang, Timothy Hospedales, Tao Xiang
This explicitly increases the diversity of available training domains and leads to a more generalizable model.
Ranked #51 on
Domain Generalization
on PACS
1 code implementation • ECCV 2020 • Ayan Das, Yongxin Yang, Timothy Hospedales, Tao Xiang, Yi-Zhe Song
The study of neural generative models of human sketches is a fascinating contemporary modeling problem due to the links between sketch image generation and the human drawing process.
2 code implementations • 15 Jun 2020 • Ondrej Bohdal, Yongxin Yang, Timothy Hospedales
In particular, we study the problem of label distillation - creating synthetic labels for a small set of real images, and show it to be more effective than the prior image-based approach to dataset distillation.
1 code implementation • ICLR 2022 • Xiongjie Chen, Yongxin Yang, Yunpeng Li
While theoretically appealing, the application of the Wasserstein distance to large-scale machine learning problems has been hampered by its prohibitive computational cost.
no code implementations • 3 Apr 2020 • Da Li, Yongxin Yang, Yi-Zhe Song, Timothy Hospedales
In DG this means encountering a sequence of domains and at each step training to maximise performance on the next domain.
Ranked #62 on
Domain Generalization
on PACS
1 code implementation • 16 Mar 2020 • Kaiyang Zhou, Yongxin Yang, Yu Qiao, Tao Xiang
Each such classifier is an expert to its own domain and a non-expert to others.
no code implementations • 12 Mar 2020 • Kaiyang Zhou, Yongxin Yang, Timothy Hospedales, Tao Xiang
This is achieved by having a learning objective formulated to ensure that the generated data can be correctly classified by the label classifier while fooling the domain classifier.
Ranked #49 on
Domain Generalization
on PACS
1 code implementation • NeurIPS 2020 • Wei Zhou, Yiying Li, Yongxin Yang, Huaimin Wang, Timothy M. Hospedales
Off-Policy Actor-Critic (Off-PAC) methods have proven successful in a variety of continuous control tasks.
1 code implementation • ECCV 2020 • Yonggang Li, Guosheng Hu, Yongtao Wang, Timothy Hospedales, Neil M. Robertson, Yongxin Yang
In this paper, we propose Differentiable Automatic Data Augmentation (DADA) which dramatically reduces the cost.
Ranked #13 on
Data Augmentation
on ImageNet
1 code implementation • 24 Feb 2020 • Ayan Kumar Bhunia, Yongxin Yang, Timothy M. Hospedales, Tao Xiang, Yi-Zhe Song
Fine-grained sketch-based image retrieval (FG-SBIR) addresses the problem of retrieving a particular photo instance given a user's query sketch.
Cross-Modal Retrieval
On-the-Fly Sketch Based Image Retrieval
+1
no code implementations • 12 Nov 2019 • Yu Zheng, Bowei Chen, Timothy M. Hospedales, Yongxin Yang
Compared with the benchmarked models, our model has the lowest tracking error, across a range of portfolio sizes.
no code implementations • 17 Oct 2019 • Boyan Gao, Yongxin Yang, Henry Gouk, Timothy M. Hospedales
We address the problem of simultaneously learning a k-means clustering and deep feature representation from unlabelled data, which is of interest due to the potential of deep k-means to outperform traditional two-step feature extraction and shallow-clustering strategies.
5 code implementations • 15 Oct 2019 • Kaiyang Zhou, Yongxin Yang, Andrea Cavallaro, Tao Xiang
An effective person re-identification (re-ID) model should learn feature representations that are both discriminative, for distinguishing similar-looking people, and generalisable, for deployment across datasets without any adaptation.
Unsupervised Domain Adaptation
Unsupervised Person Re-Identification
no code implementations • 25 Sep 2019 • Tianyuan Yu, Yongxin Yang, Da Li, Timothy Hospedales, Tao Xiang
Stochastic neural networks (SNNs) are currently topical, with several paradigms being actively investigated including dropout, Bayesian neural networks, variational information bottleneck (VIB) and noise regularized learning.
no code implementations • ICCV 2019 • Umar Riaz Muhammad, Yongxin Yang, Timothy M. Hospedales, Tao Xiang, Yi-Zhe Song
In the former one asks whether a machine can `understand' enough about the meaning of input data to produce a meaningful but more compact abstraction.
9 code implementations • ICCV 2019 • Kaiyang Zhou, Yongxin Yang, Andrea Cavallaro, Tao Xiang
As an instance-level recognition problem, person re-identification (ReID) relies on discriminative features, which not only capture different spatial scales but also encapsulate an arbitrary combination of multiple scales.
Ranked #2 on
Person Re-Identification
on MSMT17-C
no code implementations • 29 Apr 2019 • Yu Zheng, Yongxin Yang, Bo-Wei Chen
This is one of the very first studies which discuss a methodological framework that incorporates prior financial domain knowledge into neural network architecture design and model training.
2 code implementations • 31 Jan 2019 • Yiying Li, Yongxin Yang, Wei Zhou, Timothy M. Hospedales
The well known domain shift issue causes model performance to degrade when deployed to a new target domain with different statistics to training.
Ranked #97 on
Domain Generalization
on PACS
2 code implementations • ICCV 2019 • Da Li, Jianshu Zhang, Yongxin Yang, Cong Liu, Yi-Zhe Song, Timothy M. Hospedales
In this paper, we build on this strong baseline by designing an episodic training procedure that trains a single deep network in a way that exposes it to the domain shift that characterises a novel domain at runtime.
Ranked #62 on
Domain Generalization
on PACS
no code implementations • 6 Dec 2018 • Xiaobin Chang, Yongxin Yang, Tao Xiang, Timothy M. Hospedales
In this paper, a unified approach is presented to transfer learning that addresses several source and target domain label-space and annotation assumptions with a single model.
Ranked #19 on
Unsupervised Domain Adaptation
on Market to Duke
1 code implementation • 17 Nov 2018 • Xueting Zhang, Yu-ting Qiang, Flood Sung, Yongxin Yang, Timothy M. Hospedales
We thus propose a new deep comparison network comprised of embedding and relation modules that learn multiple non-linear distance metrics based on different levels of features simultaneously.
no code implementations • 6 Sep 2018 • Yu Zheng, Timothy M. Hospedales, Yongxin Yang
We introduce the first index tracking method that explicitly optimises both diversity and sparsity in a single joint framework.
no code implementations • ECCV 2018 • Guosheng Hu, Li Liu, Yang Yuan, Zehao Yu, Yang Hua, Zhihong Zhang, Fumin Shen, Ling Shao, Timothy Hospedales, Neil Robertson, Yongxin Yang
To advance subtle expression recognition, we contribute a Large-scale Subtle Emotions and Mental States in the Wild database (LSEMSW).
5 code implementations • 19 Jun 2018 • Yongxin Yang, Irene Garcia Morillo, Timothy M. Hospedales
In this work, we present Deep Neural Decision Trees (DNDT) -- tree models realised by neural networks.
no code implementations • CVPR 2018 • Umar Riaz Muhammad, Yongxin Yang, Yi-Zhe Song, Tao Xiang, Timothy M. Hospedales
Human free-hand sketches have been studied in various contexts including sketch recognition, synthesis and fine-grained sketch-based image retrieval (FG-SBIR).
12 code implementations • CVPR 2018 • Flood Sung, Yongxin Yang, Li Zhang, Tao Xiang, Philip H. S. Torr, Timothy M. Hospedales
Once trained, a RN is able to classify images of new classes by computing relation scores between query images and the few examples of each new class without further updating the network.
5 code implementations • 10 Oct 2017 • Da Li, Yongxin Yang, Yi-Zhe Song, Timothy M. Hospedales
We propose a novel {meta-learning} method for domain generalization.
Ranked #100 on
Domain Generalization
on PACS
5 code implementations • ICCV 2017 • Da Li, Yongxin Yang, Yi-Zhe Song, Timothy M. Hospedales
In this paper, we make two main contributions: Firstly, we build upon the favorable domain shift-robust properties of deep learning methods, and develop a low-rank parameterized CNN model for end-to-end DG learning.
Ranked #103 on
Domain Generalization
on PACS
no code implementations • ICCV 2017 • Guosheng Hu, Yang Hua, Yang Yuan, Zhihong Zhang, Zheng Lu, Sankha S. Mukherjee, Timothy M. Hospedales, Neil M. Robertson, Yongxin Yang
To solve this problem, we establish a theoretical equivalence between tensor optimisation and a two-stream gated neural network.
no code implementations • 8 Aug 2017 • Zhiyuan Shi, Yongxin Yang, Timothy M. Hospedales, Tao Xiang
We propose to model complex visual scenes using a non-parametric Bayesian model learned from weakly labelled images abundant on media sharing sites such as Flickr.
no code implementations • 29 Jun 2017 • Li Zhang, Flood Sung, Feng Liu, Tao Xiang, Shaogang Gong, Yongxin Yang, Timothy M. Hospedales
Generating natural language descriptions of images is an important capability for a robot or other visual-intelligence driven AI agent that may need to communicate with human users about what it is seeing.
no code implementations • 29 Jun 2017 • Flood Sung, Li Zhang, Tao Xiang, Timothy Hospedales, Yongxin Yang
We propose a novel and flexible approach to meta-learning for learning-to-learn from only a few examples.
no code implementations • 28 Nov 2016 • Yongxin Yang, Timothy M. Hospedales
In this chapter, we propose a single framework that unifies multi-domain learning (MDL) and the related but better studied area of multi-task learning (MTL).
1 code implementation • 14 Sep 2016 • Yongxin Yang, Yu Zheng, Timothy M. Hospedales
We propose a neural network approach to price EU call options that significantly outperforms some existing pricing models and comes with guarantees that its predictions are economically reasonable.
1 code implementation • 13 Jun 2016 • Yongxin Yang, Timothy M. Hospedales
We propose a framework for training multiple neural networks simultaneously.
no code implementations • CVPR 2016 • Yongxin Yang, Timothy M. Hospedales
This allows a recognition model to be pre-calibrated for a new domain in advance (e. g., future time or view angle) without waiting for data collection and re-training.
2 code implementations • 20 May 2016 • Yongxin Yang, Timothy Hospedales
Our approach applies to both homogeneous and heterogeneous MTL.
no code implementations • 21 Mar 2016 • Guosheng Hu, Xiaojiang Peng, Yongxin Yang, Timothy Hospedales, Jakob Verbeek
To train such networks, very large training sets are needed with millions of labeled images.
no code implementations • 28 Jul 2015 • Yongxin Yang, Timothy Hospedales
Most visual recognition methods implicitly assume the data distribution remains unchanged from training to testing.
no code implementations • 9 Apr 2015 • Guosheng Hu, Yongxin Yang, Dong Yi, Josef Kittler, William Christmas, Stan Z. Li, Timothy Hospedales
In this work, we conduct an extensive evaluation of CNN-based face recognition systems (CNN-FRS) on a common ground to make our work easily reproducible.
no code implementations • 31 Mar 2015 • Zhiyuan Shi, Yongxin Yang, Timothy M. Hospedales, Tao Xiang
When humans describe images they tend to use combinations of nouns and adjectives, corresponding to objects and their associated attributes respectively.
no code implementations • 26 Mar 2015 • Yanwei Fu, Yongxin Yang, Timothy M. Hospedales, Tao Xiang, Shaogang Gong
Recently, zero-shot learning (ZSL) has received increasing interest.
no code implementations • 26 Mar 2015 • Yanwei Fu, Yongxin Yang, Tim Hospedales, Tao Xiang, Shaogang Gong
Zero-shot learning has received increasing interest as a means to alleviate the often prohibitive expense of annotating training data for large scale recognition problems.
2 code implementations • 30 Jan 2015 • Qian Yu, Yongxin Yang, Yi-Zhe Song, Tao Xiang, Timothy Hospedales
We propose a multi-scale multi-channel deep neural network framework that, for the first time, yields sketch recognition performance surpassing that of humans.
no code implementations • 23 Dec 2014 • Yongxin Yang, Timothy M. Hospedales
In this paper, we provide a new neural-network based perspective on multi-task learning (MTL) and multi-domain learning (MDL).