no code implementations • 11 Sep 2023 • Cong Wu, Xiao-Jun Wu, Josef Kittler, Tianyang Xu, Sara Atito, Muhammad Awais, ZhenHua Feng
Contrastive learning has achieved great success in skeleton-based action recognition.
1 code implementation • 4 Sep 2023 • Zhangyong Tang, Tianyang Xu, XueFeng Zhu, Xiao-Jun Wu, Josef Kittler
In this context, we seek to uncover the potential of harnessing generative techniques to address the critical challenge, information fusion, in multi-modal tracking.
no code implementations • 22 Aug 2023 • Jiantao Wu, Shentong Mo, Muhammad Awais, Sara Atito, ZhenHua Feng, Josef Kittler
Self-supervised pretraining (SSP) has emerged as a popular technique in machine learning, enabling the extraction of meaningful feature representations without labelled data.
no code implementations • 10 May 2023 • Chunyang Cheng, Tianyang Xu, Xiao-Jun Wu, Hui Li, Xi Li, Josef Kittler
We argue that there is a scope to improve the fusion performance with the help of the FusionBooster, a model specifically designed for the fusion task.
1 code implementation • 11 Apr 2023 • Hui Li, Tianyang Xu, Xiao-Jun Wu, Jiwen Lu, Josef Kittler
In particular we adopt a learnable representation approach to the fusion task, in which the construction of the fusion network architecture is guided by the optimisation algorithm producing the learnable model.
no code implementations • 23 Nov 2022 • Sara Atito, Muhammad Awais, Wenwu Wang, Mark D Plumbley, Josef Kittler
Vision transformers, which were originally developed for natural language processing, have recently generated significant interest in the computer vision and audio communities due to their flexibility in learning long-range relationships.
1 code implementation • 21 Aug 2022 • Xue-Feng Zhu, Tianyang Xu, Zhangyong Tang, Zucheng Wu, Haodong Liu, Xiao Yang, Xiao-Jun Wu, Josef Kittler
To demonstrate the benefits of training on a larger RGB-D data set in general, and RGBD1K in particular, we develop a transformer-based RGB-D tracker, named SPT, as a baseline for future visual object tracking studies using the new dataset.
no code implementations • 22 Jun 2022 • Daqi Liu, Miroslaw Bober, Josef Kittler
As a structured prediction task, scene graph generation, given an input image, aims to explicitly model objects and their relationships by constructing a visually-grounded scene graph.
no code implementations • 16 Jun 2022 • Rui Wang, Xiao-Jun Wu, Ziheng Chen, Tianyang Xu, Josef Kittler
Image set-based visual classification methods have achieved remarkable performance, via characterising the image set in terms of a non-singular covariance matrix on a symmetric positive definite (SPD) manifold.
1 code implementation • 30 May 2022 • Sara Atito, Muhammad Awais, Josef Kittler
This has motivated the research in self-supervised transformer pretraining, which does not need to decode the semantic information conveyed by labels to link it to the image properties, but rather focuses directly on extracting a concise representation of the image data that reflects the notion of similarity, and is invariant to nuisance factors.
no code implementations • 14 May 2022 • Daqi Liu, Miroslaw Bober, Josef Kittler
Scene graph generation is a structured prediction task aiming to explicitly model objects and their relationships via constructing a visually-grounded scene graph for an input image.
no code implementations • 27 Jan 2022 • Daqi Liu, Miroslaw Bober, Josef Kittler
As a structured prediction task, scene graph generation aims to build a visually-grounded scene graph to explicitly model objects and their relationships in an input image.
1 code implementation • 25 Jan 2022 • Ziheng Chen, Tianyang Xu, Xiao-Jun Wu, Rui Wang, Zhiwu Huang, Josef Kittler
The Symmetric Positive Definite (SPD) matrices have received wide attention for data representation in many scientific areas.
1 code implementation • 21 Jan 2022 • Zhangyong Tang, Tianyang Xu, Hui Li, Xiao-Jun Wu, XueFeng Zhu, Josef Kittler
The effectiveness of the proposed decision-level fusion strategy owes to a number of innovative contributions, including a dynamic weighting of the RGB and TIR contributions and a linear template update operation.
no code implementations • 10 Dec 2021 • Daqi Liu, Miroslaw Bober, Josef Kittler
Scene graph generation aims to interpret an input image by explicitly modelling the potential objects and their relationships, which is predominantly solved by the message passing neural network models in previous methods.
no code implementations • 30 Nov 2021 • Sara Atito, Muhammad Awais, Ammarah Farooq, ZhenHua Feng, Josef Kittler
In this aspect the proposed SSL frame-work MC-SSL0. 0 is a step towards Multi-Concept Self-Supervised Learning (MC-SSL) that goes beyond modelling single dominant label in an image to effectively utilise the information from all the concepts present in it.
no code implementations • 25 Nov 2021 • Ammarah Farooq, Muhammad Awais, Sara Ahmed, Josef Kittler
Hence, most of the learning is independent of the image patches $(N)$ in the higher layers, and the class embedding is learned solely based on the Super tokens $(N/M^2)$ where $M^2$ is the window size.
no code implementations • 17 Nov 2021 • Norman Poh, Thirimachos Bourlai, Josef Kittler, Lorene Allano, Fernando Alonso-Fernandez, Onkar Ambekar, John Baker, Bernadette Dorizzi, Omolara Fatukasi, Julian Fierrez, Harald Ganster, Javier Ortega-Garcia, Donald Maurer, Albert Ali Salah, Tobias Scheidat, Claus Vielhauer
The cost-sensitive evaluation, on the other hand, investigates how well a fusion algorithm can perform given restricted computation and in the presence of software and hardware failures, resulting in errors such as failure-to-acquire and failure-to-match.
no code implementations • 17 Nov 2021 • Javier Ortega-Garcia, Julian Fierrez, Fernando Alonso-Fernandez, Javier Galbally, Manuel R Freire, Joaquin Gonzalez-Rodriguez, Carmen Garcia-Mateo, Jose-Luis Alba-Castro, Elisardo Gonzalez-Agulla, Enrique Otero-Muras, Sonia Garcia-Salicetti, Lorene Allano, Bao Ly-Van, Bernadette Dorizzi, Josef Kittler, Thirimachos Bourlai, Norman Poh, Farzin Deravi, Ming NR Ng, Michael Fairhurst, Jean Hennebert, Andreas Humm, Massimo Tistarelli, Linda Brodo, Jonas Richiardi, Andrezj Drygajlo, Harald Ganster, Federico M Sukno, Sri-Kaushik Pavani, Alejandro Frangi, Lale Akarun, Arman Savran
It is comprised of more than 600 individuals acquired simultaneously in three scenarios: 1) over the Internet, 2) in an office environment with desktop PC, and 3) in indoor/outdoor environments with mobile portable hardware.
no code implementations • 29 Sep 2021 • Michael Danner, Muhammad Awais Tanvir Rana, Thomas Weber, Tobias Gerlach, Patrik Huber, Matthias Rätsch, Josef Kittler
Our experiments prove that human aesthetic judgements are usually biased.
no code implementations • 21 Sep 2021 • Maria Perez-Ortiz, Omar Rivasplata, Benjamin Guedj, Matthew Gleeson, Jingyu Zhang, John Shawe-Taylor, Miroslaw Bober, Josef Kittler
We experiment on 6 datasets with different strategies and amounts of data to learn data-dependent PAC-Bayes priors, and we compare them in terms of their effect on test performance of the learnt predictors and tightness of their risk certificate.
no code implementations • 17 Aug 2021 • Donglin Zhang, Xiao-Jun Wu, He-Feng Yin, Josef Kittler
To this end, we develop a novel Multiple hash cOdes jOint learNing method (MOON) for cross-media retrieval.
no code implementations • 29 Jul 2021 • Yu Fu, Tianyang Xu, XiaoJun Wu, Josef Kittler
In this paper, we propose a Patch Pyramid Transformer(PPT) to effectively address the above issues. Specifically, we first design a Patch Transformer to transform the image into a sequence of patches, where transformer encoding is performed for each patch to extract local representations.
2 code implementations • 8 Apr 2021 • Sara Atito, Muhammad Awais, Josef Kittler
We also observed that SiT is good for few shot learning and also showed that it is learning useful representation by simply training a linear classifier on top of the learned features from SiT.
1 code implementation • 7 Mar 2021 • Hui Li, Xiao-Jun Wu, Josef Kittler
The most difficult part of the design is to choose an appropriate strategy to generate the fused image for a specific task in hand.
no code implementations • 5 Mar 2021 • Syed Safwan Khalid, Muhammad Awais, Chi-Ho Chan, ZhenHua Feng, Ammarah Farooq, Ali Akbari, Josef Kittler
One key ingredient of DCNN-based FR is the appropriate design of a loss function that ensures discrimination between various identities.
1 code implementation • 3 Mar 2021 • Qingbei Guo, Xiao-Jun Wu, Josef Kittler, Zhiquan Feng
To address this computational complexity issue, we introduce a novel \emph{architecture parameterisation} based on scaled sigmoid function, and propose a general \emph{Differentiable Neural Architecture Learning} (DNAL) method to optimize the neural architecture without the need to evaluate candidate neural networks.
no code implementations • 21 Feb 2021 • Yu Fu, Xiao-Jun Wu, Josef Kittler
In this paper, we apply the image decomposition network to the image fusion task.
no code implementations • 19 Jan 2021 • Ammarah Farooq, Muhammad Awais, Josef Kittler, Syed Safwan Khalid
Our framework is novel in its ability to implicitly learn aligned semantics between modalities during the feature learning stage.
Ranked #9 on
Text based Person Retrieval
on CUHK-PEDES
Cross-Modal Person Re-Identification
Cross-Modal Person Re-Identification
+3
no code implementations • 17 Jan 2021 • Shuangping Jin, ZhenHua Feng, Wankou Yang, Josef Kittler
Different from the standard BN layer that uses all the training data to calculate a single set of parameters, SepBN considers that the samples of a training dataset may belong to different sub-domains.
no code implementations • 20 Oct 2020 • Ali Akbari, Muhammad Awais, Zhen-Hua Feng, Ammarah Farooq, Josef Kittler
Compared with existing loss functions, the lower gradient of the proposed loss function leads to the convergence of SGD to a better optimum point, and consequently a better generalisation.
1 code implementation • 29 Sep 2020 • Qingbei Guo, Xiao-Jun Wu, Josef Kittler, Zhiquan Feng
To tackle this issue, we propose a novel method of designing self-grouping convolutional neural networks, called SG-CNN, in which the filters of each convolutional layer group themselves based on the similarity of their importance vectors.
no code implementations • 7 Sep 2020 • Sara Atito Ali Ahmed, Cemre Zor, Berrin Yanikoglu, Muhammad Awais, Josef Kittler
Deep neural networks have enhanced the performance of decision making systems in many applications including image understanding, and further gains can be achieved by constructing ensembles.
1 code implementation • 10 Jul 2020 • He-Feng Yin, Xiao-Jun Wu, Zhen-Hua Feng, Josef Kittler
Moreover, ANCR introduces an affine constraint to better represent the data from affine subspaces.
no code implementations • 27 May 2020 • Tianyang Xu, Zhen-Hua Feng, Xiao-Jun Wu, Josef Kittler
To this end, we propose a failure-aware system, realised by a Quality Prediction Network (QPN), based on convolutional and LSTM modules in the decision stage, enabling online reporting of potential tracking failures.
no code implementations • 20 Feb 2020 • Ammarah Farooq, Muhammad Awais, Fei Yan, Josef Kittler, Ali Akbari, Syed Safwan Khalid
However, in real-world surveillance scenarios, frequently no visual information will be available about the queried person.
no code implementations • 24 Dec 2019 • Fei Feng, Xiao-Jun Wu, Tianyang Xu, Josef Kittler, Xue-Feng Zhu
In the response map obtained for the previous frame by the CF algorithm, we adaptively find the image blocks that are similar to the target and use them as negative samples.
1 code implementation • 6 Dec 2019 • He-Feng Yin, Xiao-Jun Wu, Josef Kittler
First, a low-rank representation is introduced to handle the possible contamination of the training as well as test data.
no code implementations • 5 Dec 2019 • Tianyang Xu, Zhen-Hua Feng, Xiao-Jun Wu, Josef Kittler
Recent visual object tracking methods have witnessed a continuous improvement in the state-of-the-art with the development of efficient discriminative correlation filters (DCF) and robust deep neural network features.
no code implementations • 23 Nov 2019 • He-Feng Yin, Xiao-Jun Wu, Josef Kittler, Zhen-Hua Feng
To counteract this problem, we propose an approach that learns Representation with Block-Diagonal Structure (RBDS) for robust image recognition.
no code implementations • IEEE International Conference on Computer Vision (ICCV), 2019 2019 • Cong Wu, Xiao-Jun Wu, Josef Kittler
In order to capture the rich spatiotemporal information and utilize features more effectively, we introduce a spatial residual layer and a dense connection block enhanced spatial temporal graph convolutional network.
Ranked #41 on
Skeleton Based Action Recognition
on NTU RGB+D
no code implementations • 19 Sep 2019 • Cong Hu, Zhen-Hua Feng, Xiao-Jun Wu, Josef Kittler
To be more specific, the encoder-decoder structured generator is used to learn a pose disentangled face representation, and the encoder-decoder structured discriminator is tasked to perform real/fake classification, face reconstruction, determining identity and estimating face pose.
2 code implementations • 16 Sep 2019 • Kai-Xuan Chen, Xiao-Jun Wu, Jie-Yi Ren, Rui Wang, Josef Kittler
We consider a family of structural descriptors for visual data, namely covariance descriptors (CovDs) that lie on a non-linear symmetric positive definite (SPD) manifold, a special type of Riemannian manifolds.
no code implementations • 6 Aug 2019 • Rui Wang, Xiao-Jun Wu, Josef Kittler
Specifically, the covariance matrix, linear subspace, and Gaussian distribution are applied for set representation simultaneously.
1 code implementation • ICCV 2019 • Tianyang Xu, Zhen-Hua Feng, Xiao-Jun Wu, Josef Kittler
We propose a new Group Feature Selection method for Discriminative Correlation Filters (GFS-DCF) based visual object tracking.
Ranked #1 on
Visual Object Tracking
on VOT2017
no code implementations • 22 May 2019 • Shervin Rahimzadeh Arashloo, Josef Kittler
The non-linear structure learning method is then extended to a sparse setting where different tasks compete in an output composition mechanism, leading to a sparse non-linear structure among multiple problems.
no code implementations • 14 May 2019 • Zhe Chen, Xiao-Jun Wu, Josef Kittler
Only learning one projection matrix from original samples to the corresponding binary labels is too strict and will consequentlly lose some intrinsic geometric structures of data.
no code implementations • 19 Mar 2019 • Zhe Chen, Xiao-Jun Wu, Josef Kittler
To solve above problems, we propose a low-rank discriminative least squares regression model (LRDLSR) for multi-class image classification.
1 code implementation • 19 Mar 2019 • Zhe Chen, Xiao-Jun Wu, Josef Kittler
On one hand, the Fisher criterion improves the intra-class compactness of the relaxed labels during relaxation learning.
no code implementations • 19 Mar 2019 • Zhe Chen, Xiao-Jun Wu, Josef Kittler
In this paper, we propose a non-negative representation based discriminative dictionary learning algorithm (NRDL) for multicategory face classification.
no code implementations • 13 Mar 2019 • Daqi Liu, Miroslaw Bober, Josef Kittler
Since it helps to enhance the accuracy and the consistency of the resulting interpretation, visual context reasoning is often incorporated with visual perception in current deep end-to-end visual semantic information pursuit methods.
no code implementations • 6 Feb 2019 • Shervin Rahimzadeh Arashloo, Josef Kittler
This work addresses these shortcomings by studying the effect of regularising the solution of the null-space kernel Fisher methodology in the context of its regression-based formulation (OC-KSR).
no code implementations • 6 Dec 2018 • Jun Yu, Xiao-Jun Wu, Josef Kittler
With the advantage of low storage cost and high retrieval efficiency, hashing techniques have recently been an emerging topic in cross-modal similarity search.
no code implementations • 13 Nov 2018 • Lei Jiang, Xiao-Jun Wu, Josef Kittler
Our method solves the problem of face reconstruction of a single image of a traditional method in a large pose, works on arbitrary Pose and Expressions, greatly improves the accuracy of reconstruction.
2 code implementations • 6 Nov 2018 • Hui Li, Xiao-Jun Wu, Josef Kittler
We develop a novel image fusion framework based on MDLatLRR, which is used to decompose source images into detail parts(salient features) and base parts.
no code implementations • 16 Aug 2018 • Yu Yanga, Xiao-Jun Wu, Josef Kittler
In this paper we show that landmark weighting is instrumental to improve the accuracy of shape reconstruction and propose a novel 3D Morphable Model Fitting method.
no code implementations • 13 Aug 2018 • Jun Yu, Xiao-Jun Wu, Josef Kittler
Many hashing methods based on a single view have been extensively studied for information retrieval.
1 code implementation • 30 Jul 2018 • Tianyang Xu, Zhen-Hua Feng, Xiao-Jun Wu, Josef Kittler
The key innovations of the proposed method include adaptive spatial feature selection and temporal consistent constraints, with which the new tracker enables joint spatial-temporal filter learning in a lower dimensional discriminative manifold.
no code implementations • 3 Jul 2018 • Shervin Rahimzadeh Arashloo, Josef Kittler
The paper introduces a new efficient nonlinear one-class classifier formulated as the Rayleigh quotient criterion optimisation.
no code implementations • 2 Jul 2018 • Shervin Rahimzadeh Arashloo, Josef Kittler
In addition, it is demonstrated that the same set of deep convolutional features used for the recognition purposes is effective for face presentation attack detection in the class-specific one-class anomaly detection paradigm.
no code implementations • 28 Jun 2018 • Rui Wang, Xiao-Jun Wu, Kai-Xuan Chen, Josef Kittler
The core of the method is a new discriminant function for metric learning and dimensionality reduction.
no code implementations • 19 Jun 2018 • Jun Yu, Xiao-Jun Wu, Josef Kittler
Recently, hashing techniques have gained importance in large-scale retrieval tasks because of their retrieval speed.
no code implementations • 16 Jun 2018 • Kai-Xuan Chen, Xiao-Jun Wu, Rui Wang, Josef Kittler
We propose a novel framework for representing image sets by approximating infinite-dimensional CovDs in the paradigm of the Nystr\"om method based on a Riemannian kernel.
no code implementations • 30 May 2018 • Rui Wang, Xiao-Jun Wu, Kai-Xuan Chen, Josef Kittler
In image set classification, a considerable advance has been made by modeling the original image sets by second order statistics or linear subspace, which typically lie on the Riemannian manifold.
no code implementations • 27 May 2018 • Rui Wang, Xiao-Jun Wu, Josef Kittler
The proposed RieMNet and DRieMNet are evaluated on three tasks: video-based face recognition, set-based object categorization, and set-based cell identification.
no code implementations • 26 May 2018 • YunKun Li, Xiao-Jun Wu, Josef Kittler
In our network, the role of L1-(2D)2PCA is to learn the filters of multiple convolution layers.
3 code implementations • 19 Apr 2018 • Hui Li, Xiao-Jun Wu, Josef Kittler
Then the base parts are fused by weighted-averaging.
1 code implementation • ECCV 2018 • Baris Gecer, Binod Bhattarai, Josef Kittler, Tae-Kyun Kim
We propose a novel end-to-end semi-supervised adversarial framework to generate photorealistic face images of new identities with wide ranges of expressions, poses, and illuminations conditioned by a 3D morphable model.
Ranked #16 on
Face Verification
on IJB-A
no code implementations • 14 Mar 2018 • Zhen-Hua Feng, Patrik Huber, Josef Kittler, Peter JB Hancock, Xiao-Jun Wu, Qijun Zhao, Paul Koppen, Matthias Rätsch
To this end, we organise a competition that provides a new benchmark dataset that contains 2000 2D facial images of 135 subjects as well as their 3D ground truth face scans.
6 code implementations • CVPR 2018 • Zhen-Hua Feng, Josef Kittler, Muhammad Awais, Patrik Huber, Xiao-Jun Wu
We present a new loss function, namely Wing loss, for robust facial landmark localisation with Convolutional Neural Networks (CNNs).
Ranked #1 on
Face Alignment
on 300W
(NME_inter-pupil (%, Common) metric)
no code implementations • 3 Oct 2017 • Fei Yan, Krystian Mikolajczyk, Josef Kittler
We propose a joint vision and language model based on CCA and CNN architectures to match across the two modalities as well as to enrich visual examples for which there are no language descriptions.
1 code implementation • 23 Aug 2017 • Anil Bas, Patrik Huber, William A. P. Smith, Muhammad Awais, Josef Kittler
In this paper, we show how a 3D Morphable Model (i. e. a statistical model of the 3D shape of a class of objects such as faces) can be used to spatially transform input data as a module (a 3DMM-STN) within a convolutional neural network.
no code implementations • 8 Aug 2017 • Manuel Günther, Peiyun Hu, Christian Herrmann, Chi Ho Chan, Min Jiang, Shufan Yang, Akshay Raj Dhamija, Deva Ramanan, Jürgen Beyerer, Josef Kittler, Mohamad Al Jazaery, Mohammad Iqbal Nouyed, Guodong Guo, Cezary Stankiewicz, Terrance E. Boult
Face detection and recognition benchmarks have shifted toward more difficult environments.
no code implementations • 5 May 2017 • Zhen-Hua Feng, Josef Kittler, Muhammad Awais, Patrik Huber, Xiao-Jun Wu
The framework has four stages: face detection, bounding box aggregation, pose estimation and landmark localisation.
no code implementations • 30 Dec 2016 • Zhen-Hua Feng, Josef Kittler, William Christmas, Xiao-Jun Wu
To deal with this challenge, we present a Unified Tensor-based Active Appearance Model (UT-AAM) for jointly modelling the geometry and texture information of 2D faces.
no code implementations • CVPR 2017 • Zhen-Hua Feng, Josef Kittler, William Christmas, Patrik Huber, Xiao-Jun Wu
We present a new Cascaded Shape Regression (CSR) architecture, namely Dynamic Attention-Controlled CSR (DAC-CSR), for robust facial landmark detection on unconstrained faces.
Ranked #17 on
Face Alignment
on AFLW-19
no code implementations • 1 Nov 2016 • Xiaoning Song, Zhen-Hua Feng, Guosheng Hu, Josef Kittler, William Christmas, Xiao-Jun Wu
The paper presents a dictionary integration algorithm using 3D morphable face models (3DMM) for pose-invariant collaborative-representation-based face classification.
no code implementations • 22 May 2016 • Patrik Huber, Philipp Kopp, Matthias Rätsch, William Christmas, Josef Kittler
We present a fully automatic approach to real-time 3D face reconstruction from monocular in-the-wild videos.
no code implementations • 15 Apr 2016 • Josef Kittler, Cemre Zor
In this paper, we postulate the properties that a divergence measure should satisfy and propose a novel divergence measure, referred to as Delta divergence.
1 code implementation • 1 Feb 2016 • Patrik Huber, Guosheng Hu, Rafael Tena, Pouria Mortazavian, Willem P. Koppen, William Christmas, Matthias Rätsch, Josef Kittler
In this paper, we present the Surrey Face Model, a multi-resolution 3D Morphable Model that we make available to the public for non-commercial purposes.
no code implementations • 9 Apr 2015 • Guosheng Hu, Yongxin Yang, Dong Yi, Josef Kittler, William Christmas, Stan Z. Li, Timothy Hospedales
In this work, we conduct an extensive evaluation of CNN-based face recognition systems (CNN-FRS) on a common ground to make our work easily reproducible.
1 code implementation • 8 Mar 2015 • Patrik Huber, Zhen-Hua Feng, William Christmas, Josef Kittler, Matthias Rätsch
Our approach is unique in that we are the first to use local features to fit a Morphable Model.
no code implementations • 21 Nov 2014 • Mitko Veta, Paul J. van Diest, Stefan M. Willems, Haibo Wang, Anant Madabhushi, Angel Cruz-Roa, Fabio Gonzalez, Anders B. L. Larsen, Jacob S. Vestergaard, Anders B. Dahl, Dan C. Cireşan, Jürgen Schmidhuber, Alessandro Giusti, Luca M. Gambardella, F. Boray Tek, Thomas Walter, Ching-Wei Wang, Satoshi Kondo, Bogdan J. Matuszewski, Frederic Precioso, Violet Snell, Josef Kittler, Teofilo E. de Campos, Adnan M. Khan, Nasir M. Rajpoot, Evdokia Arkoumani, Miangela M. Lacle, Max A. Viergever, Josien P. W. Pluim
The proliferative activity of breast tumors, which is routinely estimated by counting of mitotic figures in hematoxylin and eosin stained histology sections, is considered to be one of the most important prognostic markers.