no code implementations • 25 Dec 2023 • Jiayin Sun, Qiulei Dong
Open-set image recognition (OSR) aims to both classify known-class samples and identify unknown-class samples in the testing set, which supports robust classifiers in many realistic applications, such as autonomous driving, medical diagnosis, security monitoring, etc.
no code implementations • 25 Sep 2023 • Jiayin Sun, Hong Wang, Qiulei Dong
Image recognition is a classic and common task in the computer vision field, which has been widely applied in the past decade.
no code implementations • 14 Jul 2023 • Jiayin Sun, Hong Wang, Qiulei Dong
Open-set image recognition is a challenging topic in computer vision.
no code implementations • 25 Nov 2022 • Jiayin Sun, Hong Wang, Qiulei Dong
To address this problem, motivated by the temporal attention mechanism in brains, we propose a spatial-temporal attention network for learning fine-grained feature representations, called STAN, where the features learnt by implementing a sequence of spatial self-attention operations corresponding to multiple moments are aggregated progressively.
no code implementations • 13 Jul 2022 • Jiayin Sun, Qiulei Dong
Specifically, at each iteration, a dual-space consistent sampling approach is presented in the explored reliability sampling module for selecting some relatively more reliable ones from the test samples according to their pseudo labels assigned by a baseline method, which could be an arbitrary inductive OSR method.
no code implementations • 22 Oct 2019 • Qiulei Dong, Jiayin Sun, Zhanyi Hu
In this work, we investigate this problem by formulating face images as points in a shape-appearance parameter space, and our results demonstrate that: (i) The encoding and decoding of the neuron responses (representations) to face images in CNNs could be achieved under a linear model in the parameter space, in agreement with the recent discovery in primate IT face neurons, but different from the aforementioned perspective on CNNs' face representation with complex image feature encoding; (ii) The linear model for face encoding and decoding in the parameter space could achieve close or even better performances on face recognition and verification than state-of-the-art CNNs, which might provide new lights on the design strategies for face recognition systems; (iii) The neuron responses to face images in CNNs could not be adequately modelled by the axis model, a model recently proposed on face modelling in primate IT cortex.