no code implementations • 22 Feb 2025 • Sotirios Stamnas, Victor Sanchez
Traditional deepfake detectors have dealt with the detection problem as a binary classification task.
no code implementations • 3 Jan 2025 • Haoyi Wang, Victor Sanchez, Chang-Tsun Li, Nathan Clarke
Generalized age feature extraction is crucial for age-related facial analysis tasks, such as age estimation and age-invariant face recognition (AIFR).
1 code implementation • 21 Dec 2024 • Tongfei Bian, Yiming Ma, Mathieu Chollet, Victor Sanchez, Tanaya Guha
For efficient human-agent interaction, an agent should proactively recognize its target user and prepare for upcoming interactions.
no code implementations • 8 Aug 2024 • Bhushan Atote, Victor Sanchez
We also introduce a faithfulness score to evaluate the explainability of the results based on the discovered prototypes.
Explainable artificial intelligence
Explainable Artificial Intelligence (XAI)
no code implementations • 6 Jun 2024 • Yixuan Yang, Junru Lu, Zixiang Zhao, Zhen Luo, James J. Q. Yu, Victor Sanchez, Feng Zheng
In this paper, we introduce LLplace, a novel 3D indoor scene layout designer based on a lightweight, fine-tuned open-source LLM, Llama3.
no code implementations • 21 May 2024 • Shuai Shao, Yu Guan, Victor Sanchez
Human Activity Recognition (HAR) has become increasingly popular with ubiquitous computing, driven by the popularity of wearable sensors in fields like healthcare and sports.
1 code implementation • 1 May 2024 • Olly Styles, Sam Miller, Patricio Cerda-Mardini, Tanaya Guha, Victor Sanchez, Bertie Vidgen
We evaluate five existing ReAct agents on WorkBench, finding that they successfully complete as few as 3% of tasks (Llama2-70B) and just 43% for the best-performing agent (GPT-4).
1 code implementation • 14 Mar 2024 • Yiming Ma, Victor Sanchez, Tanaya Guha
Within our backbone-agnostic EBC framework, we then introduce CLIP-EBC to fully leverage CLIP's recognition capabilities for this task.
Ranked #1 on Crowd Counting on NWPU-Crowd (Val)
no code implementations • 8 Jan 2024 • Roberto Leyva, Victor Sanchez, Gregory Epiphaniou, Carsten Maple
In this paper, we propose a fusion-based strategy to detect face image synthesis while providing resiliency to several attacks.
no code implementations • 8 Jan 2024 • Roberto Leyva, Victor Sanchez, Gregory Epiphaniou, Carsten Maple
Face image synthesis detection is gaining considerable attention because of the potential negative impact that this type of synthetic data can have on society.
2 code implementations • 18 Dec 2023 • Haoyi Wang, Victor Sanchez, Chang-Tsun Li
Cross-age facial images are typically challenging and expensive to collect, making noise-free age-oriented datasets relatively small compared to widely-used large-scale facial datasets.
1 code implementation • 13 Apr 2023 • Yiming Ma, Victor Sanchez, Soodeh Nikan, Devesh Upadhyay, Bhushan Atote, Tanaya Guha
Driver Monitoring Systems (DMSs) are crucial for safe hand-over actions in Level-2+ self-driving vehicles.
no code implementations • 17 Oct 2022 • Yiming Ma, Victor Sanchez, Soodeh Nikan, Devesh Upadhyay, Bhushan Atote, Tanaya Guha
Driver distractions are known to be the dominant cause of road accidents.
no code implementations • 27 Jul 2022 • Yuqi Ouyang, Guodong Shen, Victor Sanchez
Based on the information shifts between adjacent frames, an incremental learner updates the parameters of the multilayer perceptron after observing each frame, thus allowing anomalous events to be detected along the video stream.
1 code implementation • 16 Jul 2022 • Amir Shirian, Krishna Somandepalli, Victor Sanchez, Tanaya Guha
In contrast, we employ heterogeneous graphs to explicitly capture the spatial and temporal relationships between the modalities and represent detailed information about the underlying signal.
no code implementations • 26 Jun 2022 • Guodong Shen, Yuqi Ouyang, Victor Sanchez
Video anomaly detection is a challenging task because most anomalies are scarce and non-deterministic.
1 code implementation • 28 Feb 2022 • Yiming Ma, Victor Sanchez, Tanaya Guha
Then, to account for perspective distortion, the highest-level feature map is fed to extra components to extract multiscale features, which are the input to the decoder to generate crowd densities.
Ranked #11 on Crowd Counting on ShanghaiTech B
no code implementations • 24 Jan 2022 • Lee Prangnell, Victor Sanchez
Designed for RGB 4:4:4 video data, Spectral-PQ exploits HVS spectral sensitivity-related color masking in addition to spatial masking and temporal masking; the proposed method operates at the Coding Block (CB) level and the Prediction Unit (PU) level in the HEVC standard.
no code implementations • 19 Dec 2021 • Haoyi Wang, Victor Sanchez, Chang-Tsun Li
Since the proposed RMHHA mechanism ranks the discovered patches based on their importance, the length of the learning path of each patch in the FusionNet is proportional to the amount of information it carries (the longer, the more important).
1 code implementation • 24 Aug 2021 • Oluwafunmilola Kesa, Olly Styles, Victor Sanchez
We propose to jointly train a tracking and trajectory forecasting model and use the predicted trajectory forecasts for short-term motion estimates in lieu of linear motion prediction methods such as the Kalman filter.
1 code implementation • 10 Aug 2021 • Olly Styles, Tanaya Guha, Victor Sanchez
We introduce the problem of multi-camera trajectory forecasting (MCTF), which involves predicting the trajectory of a moving object across a network of cameras.
no code implementations • 1 Jul 2021 • Xufeng Lin, Chang-Tsun Li, Victor Sanchez, Carsten Maple
Driven by recent advances in object detection with deep neural networks, the tracking-by-detection paradigm has gained increasing prevalence in the research community of multi-object tracking (MOT).
no code implementations • 13 Jun 2021 • Ching-Chun Chang, Xu Wang, Sisheng Chen, Isao Echizen, Victor Sanchez, Chang-Tsun Li
Given that reversibility is governed independently by the coding module, we narrow our focus to the incorporation of neural networks into the analytics module, which predicts pixel intensities and plays a pivotal role in determining capacity and imperceptibility.
1 code implementation • 2 Dec 2020 • Yuqi Ouyang, Victor Sanchez
Video anomaly detection is a challenging task not only because it involves solving many sub-tasks such as motion representation, object localization and action recognition, but also because it is commonly treated as an unsupervised learning problem that involves detecting outliers.
no code implementations • 1 Jul 2020 • Haoyi Wang, Victor Sanchez, Chang-Tsun Li
In this paper, we propose a method for the age-oriented face synthesis task that achieves a high synthesis accuracy with strong identity permanence capabilities.
1 code implementation • 6 Jun 2020 • Sachin Singh, Victor Sanchez, Tanaya Guha
The ranking is expected to correspond with human perception of overall appeal of the images.
no code implementations • 16 May 2020 • Lee Prangnell, Victor Sanchez
In the evaluations, we compare the proposed PCC technique with a set of reference methods including Versatile Video Coding (VVC) and High Efficiency Video Coding (HEVC) in addition to two other recently proposed algorithms.
1 code implementation • 1 May 2020 • Olly Styles, Tanaya Guha, Victor Sanchez, Alex Kot
To facilitate research in this new area, we release the Warwick-NTU Multi-camera Forecasting Database (WNMF), a unique dataset of multi-camera pedestrian trajectories from a network of 15 synchronized cameras.
1 code implementation • 26 Sep 2019 • Olly Styles, Tanaya Guha, Victor Sanchez
In contrast to existing works on object trajectory forecasting, which primarily consider the problem from a bird's-eye perspective, we formulate the problem from an object-level perspective and call for the prediction of full object bounding boxes, rather than trajectories alone.
Ranked #1 on Multiple Object Forecasting on Citywalks
1 code implementation • 9 May 2019 • Olly Styles, Arun Ross, Victor Sanchez
In this work, we present a deep learning approach for pedestrian trajectory forecasting using a single vehicle-mounted camera.
no code implementations • 27 Jul 2018 • Haoyi Wang, Xingjie Wei, Victor Sanchez, Chang-Tsun Li
Convolutional Neural Networks (CNNs) have been applied to age-related research as the core framework.
no code implementations • 5 Oct 2016 • Alasdair Thomason, Nathan Griffiths, Victor Sanchez
The Predictive Context Tree (PCT) is constructed as a hierarchical classifier, capable of predicting both the future locations that a user will visit and the contexts that a user will be immersed within.
no code implementations • 14 Jun 2016 • Alasdair Thomason, Nathan Griffiths, Victor Sanchez
Summarising user contexts into a single data structure gives easy access to information that would otherwise remain latent, providing the basis for better understanding and predicting the actions and behaviours of individuals and groups.