Search Results for author: Sergio Escalera

Found 85 papers, 31 papers with code

Surveillance Face Presentation Attack Detection Challenge

no code implementations15 Apr 2023 Hao Fang, Ajian Liu, Jun Wan, Sergio Escalera, Hugo Jair Escalante, Zhen Lei

Based on this dataset and protocol-$3$ for evaluating the robustness of the algorithm under quality changes, we organized a face presentation attack detection challenge in surveillance scenarios.

Face Anti-Spoofing Face Presentation Attack Detection +1

Wild Face Anti-Spoofing Challenge 2023: Benchmark and Results

1 code implementation12 Apr 2023 Dong Wang, Jia Guo, Qiqi Shao, Haochi He, Zhian Chen, Chuanbao Xiao, Ajian Liu, Sergio Escalera, Hugo Jair Escalante, Zhen Lei, Jun Wan, Jiankang Deng

Leveraging the WFAS dataset and Protocol 1 (Known-Type), we host the Wild Face Anti-Spoofing Challenge at the CVPR2023 workshop.

Face Anti-Spoofing Face Recognition

Surveillance Face Anti-spoofing

no code implementations3 Jan 2023 Hao Fang, Ajian Liu, Jun Wan, Sergio Escalera, Chenxu Zhao, Xu Zhang, Stan Z. Li, Zhen Lei

In order to promote relevant research and fill this gap in the community, we collect a large-scale Surveillance High-Fidelity Mask (SuHiFiMask) dataset captured under 40 surveillance scenes, which has 101 subjects from different age groups with 232 3D attacks (high-fidelity masks), 200 2D attacks (posters, portraits, and screens), and 2 adversarial attacks.

Contrastive Learning Face Anti-Spoofing +2

Neural Cloth Simulation

1 code implementation13 Dec 2022 Hugo Bertiche, Meysam Madadi, Sergio Escalera

Here, we propose the first methodology able to learn realistic cloth dynamics unsupervisedly, and henceforth, a general formulation for neural cloth simulation.

BeLFusion: Latent Diffusion for Behavior-Driven Human Motion Prediction

1 code implementation25 Nov 2022 German Barquero, Sergio Escalera, Cristina Palmero

To address these issues, we present BeLFusion, a model that, for the first time, leverages latent diffusion models in HMP to sample from a latent space where behavior is disentangled from pose and motion.

 Ranked #1 on Human Pose Forecasting on AMASS (ADE metric)

Human Pose Forecasting motion prediction +1

SoccerNet 2022 Challenges Results

7 code implementations5 Oct 2022 Silvio Giancola, Anthony Cioppa, Adrien Deliège, Floriane Magera, Vladimir Somers, Le Kang, Xin Zhou, Olivier Barnich, Christophe De Vleeschouwer, Alexandre Alahi, Bernard Ghanem, Marc Van Droogenbroeck, Abdulrahman Darwish, Adrien Maglo, Albert Clapés, Andreas Luyts, Andrei Boiarov, Artur Xarles, Astrid Orcesi, Avijit Shah, Baoyu Fan, Bharath Comandur, Chen Chen, Chen Zhang, Chen Zhao, Chengzhi Lin, Cheuk-Yiu Chan, Chun Chuen Hui, Dengjie Li, Fan Yang, Fan Liang, Fang Da, Feng Yan, Fufu Yu, Guanshuo Wang, H. Anthony Chan, He Zhu, Hongwei Kan, Jiaming Chu, Jianming Hu, Jianyang Gu, Jin Chen, João V. B. Soares, Jonas Theiner, Jorge De Corte, José Henrique Brito, Jun Zhang, Junjie Li, Junwei Liang, Leqi Shen, Lin Ma, Lingchi Chen, Miguel Santos Marques, Mike Azatov, Nikita Kasatkin, Ning Wang, Qiong Jia, Quoc Cuong Pham, Ralph Ewerth, Ran Song, RenGang Li, Rikke Gade, Ruben Debien, Runze Zhang, Sangrok Lee, Sergio Escalera, Shan Jiang, Shigeyuki Odashima, Shimin Chen, Shoichi Masui, Shouhong Ding, Sin-wai Chan, Siyu Chen, Tallal El-Shabrawy, Tao He, Thomas B. Moeslund, Wan-Chi Siu, Wei zhang, Wei Li, Xiangwei Wang, Xiao Tan, Xiaochuan Li, Xiaolin Wei, Xiaoqing Ye, Xing Liu, Xinying Wang, Yandong Guo, YaQian Zhao, Yi Yu, YingYing Li, Yue He, Yujie Zhong, Zhenhua Guo, Zhiheng Li

The SoccerNet 2022 challenges were the second annual video understanding challenges organized by the SoccerNet team.

Action Spotting Camera Calibration +3

NeurIPS'22 Cross-Domain MetaDL competition: Design and baseline results

no code implementations31 Aug 2022 Dustin Carrión-Ojeda, Hong Chen, Adrian El Baz, Sergio Escalera, Chaoyu Guan, Isabelle Guyon, Ihsan Ullah, Xin Wang, Wenwu Zhu

We present the design and baseline results for a new challenge in the ChaLearn meta-learning series, accepted at NeurIPS'22, focusing on "cross-domain" meta-learning.

Few-Shot Image Classification Few-Shot Learning +1

A Non-Anatomical Graph Structure for isolated hand gesture separation in continuous gesture sequences

no code implementations15 Jul 2022 Razieh Rastgoo, Kourosh Kiani, Sergio Escalera

Recently, one model has been presented to deal with the challenge of the boundary detection of isolated gestures in a continuous gesture video [17].

Boundary Detection Hand Gesture Recognition +1

Relevance-based Margin for Contrastively-trained Video Retrieval Models

1 code implementation27 Apr 2022 Alex Falcon, Swathikiran Sudhakaran, Giuseppe Serra, Sergio Escalera, Oswald Lanz

We show that even if we carefully tuned the fixed margin, our technique (which does not have the margin as a hyper-parameter) would still achieve better performance.

Multi-Instance Retrieval Natural Language Queries +2

Word separation in continuous sign language using isolated signs and post-processing

no code implementations2 Apr 2022 Razieh Rastgoo, Kourosh Kiani, Sergio Escalera

Results of the continuous sign videos confirm the efficiency of the proposed model to deal with isolated sign boundaries detection.

Sign Language Recognition

Towards Self-Supervised Gaze Estimation

1 code implementation21 Mar 2022 Arya Farkhondeh, Cristina Palmero, Simone Scardapane, Sergio Escalera

Recent joint embedding-based self-supervised methods have surpassed standard supervised approaches on various image recognition tasks such as image classification.

Gaze Estimation Image Classification +1

Gate-Shift-Fuse for Video Action Recognition

1 code implementation16 Mar 2022 Swathikiran Sudhakaran, Sergio Escalera, Oswald Lanz

3D kernel factorization approaches have been proposed to reduce the complexity of 3D CNNs.

Ranked #12 on Action Recognition on EPIC-KITCHENS-100 (using extra training data)

Action Recognition Temporal Action Localization +1

Video Transformers: A Survey

no code implementations16 Jan 2022 Javier Selva, Anders S. Johansen, Sergio Escalera, Kamal Nasrollahi, Thomas B. Moeslund, Albert Clapés

Transformer models have shown great success handling long-range interactions, making them a promising tool for modeling video.

Action Classification Self-Supervised Learning

All You Need In Sign Language Production

no code implementations5 Jan 2022 Razieh Rastgoo, Kourosh Kiani, Sergio Escalera, Vassilis Athitsos, Mohammad Sabokrou

To make an easy and mutual communication between the hearing-impaired and the hearing communities, building a robust system capable of translating the spoken language into sign language and vice versa is fundamental.

Cultural Vocal Bursts Intensity Prediction Sign Language Production +2

Codabench: Flexible, Easy-to-Use and Reproducible Benchmarking Platform

2 code implementations12 Oct 2021 Zhen Xu, Sergio Escalera, Isabelle Guyon, Adrien Pavão, Magali Richard, Wei-Wei Tu, Quanming Yao, Huan Zhao

A public instance of Codabench (https://www. codabench. org/) is open to everyone, free of charge, and allows benchmark organizers to compare fairly submissions, under the same setting (software, hardware, data, algorithms), with custom protocols and data formats.

Benchmarking

Neural Implicit Surfaces for Efficient and Accurate Collisions in Physically Based Simulations

1 code implementation3 Oct 2021 Hugo Bertiche, Meysam Madadi, Sergio Escalera

Nonetheless, their memory footprint also grows rapidly and their parallelization in a GPU is problematic due to their branching nature.

ZS-SLR: Zero-Shot Sign Language Recognition from RGB-D Videos

no code implementations23 Aug 2021 Razieh Rastgoo, Kourosh Kiani, Sergio Escalera

To benefit from the vision Transformer capabilities, we use two vision Transformer models, for human detection and visual features representation.

Human Detection Sign Language Recognition

ChaLearn Looking at People: Inpainting and Denoising challenges

no code implementations24 Jun 2021 Sergio Escalera, Marti Soler, Stephane Ayache, Umut Guclu, Jun Wan, Meysam Madadi, Xavier Baro, Hugo Jair Escalante, Isabelle Guyon

Dealing with incomplete information is a well studied problem in the context of machine learning and computational intelligence.

Denoising Pose Estimation

Deep unsupervised 3D human body reconstruction from a sparse set of landmarks

no code implementations23 Jun 2021 Meysam Madadi, Hugo Bertiche, Sergio Escalera

In this paper we propose the first deep unsupervised approach in human body reconstruction to estimate body surface from a sparse set of landmarks, so called DeepMurf.

Denoising

Deep learning with self-supervision and uncertainty regularization to count fish in underwater images

1 code implementation30 Apr 2021 Penny Tarling, Mauricio Cantor, Albert Clapés, Sergio Escalera

From experiments on both contrasting datasets, we demonstrate our network outperforms the few other deep learning models implemented for solving this task.

Crowd Counting Decision Making

Sign Language Production: A Review

1 code implementation29 Mar 2021 Razieh Rastgoo, Kourosh Kiani, Sergio Escalera, Mohammad Sabokrou

Sign Language is the dominant yet non-primary form of communication language used in the deaf and hearing-impaired community.

Sign Language Production Sign Language Recognition

Learning to Recognize Actions on Objects in Egocentric Video with Attention Dictionaries

no code implementations16 Feb 2021 Swathikiran Sudhakaran, Sergio Escalera, Oswald Lanz

We present EgoACO, a deep neural architecture for video action recognition that learns to pool action-context-object descriptors from frame level features by leveraging the verb-noun structure of action labels in egocentric video datasets.

Action Recognition Temporal Action Localization

Context-Aware Personality Inference in Dyadic Scenarios: Introducing the UDIVA Dataset

no code implementations28 Dec 2020 Cristina Palmero, Javier Selva, Sorina Smeureanu, Julio C. S. Jacques Junior, Albert Clapés, Alexa Moseguí, Zejian Zhang, David Gallardo, Georgina Guilera, David Leiva, Sergio Escalera

This paper introduces UDIVA, a new non-acted dataset of face-to-face dyadic interactions, where interlocutors perform competitive and collaborative tasks with different behavior elicitation and cognitive workload.

PBNS: Physically Based Neural Simulator for Unsupervised Garment Pose Space Deformation

1 code implementation21 Dec 2020 Hugo Bertiche, Meysam Madadi, Sergio Escalera

While deep-based approaches in the domain are becoming a trend, these are data-hungry models.

Person Perception Biases Exposed: Revisiting the First Impressions Dataset

no code implementations30 Nov 2020 Julio C. S. Jacques Junior, Agata Lapedriza, Cristina Palmero, Xavier Baró, Sergio Escalera

This work revisits the ChaLearn First Impressions database, annotated for personality perception using pairwise comparisons via crowdsourcing.

FairFace Challenge at ECCV 2020: Analyzing Bias in Face Recognition

no code implementations16 Sep 2020 Tomáš Sixta, Julio C. S. Jacques Junior, Pau Buch-Cardona, Neil M. Robertson, Eduard Vazquez, Sergio Escalera

This work summarizes the 2020 ChaLearn Looking at People Fair Face Recognition and Analysis Challenge and provides a description of the top-winning solutions and analysis of the results.

Face Recognition Face Verification

FBK-HUPBA Submission to the EPIC-Kitchens Action Recognition 2020 Challenge

no code implementations24 Jun 2020 Swathikiran Sudhakaran, Sergio Escalera, Oswald Lanz

In this report we describe the technical details of our submission to the EPIC-Kitchens Action Recognition 2020 Challenge.

Action Recognition

Computing the Testing Error without a Testing Set

1 code implementation1 May 2020 Ciprian Corneanu, Meysam Madadi, Sergio Escalera, Aleix Martinez

Here, we derive an algorithm to estimate the performance gap between training and testing that does not require any testing dataset.

Object Recognition Semantic Segmentation

Cross-ethnicity Face Anti-spoofing Recognition Challenge: A Review

no code implementations23 Apr 2020 Ajian Liu, Xuan Li, Jun Wan, Sergio Escalera, Hugo Jair Escalante, Meysam Madadi, Yi Jin, Zhuoyuan Wu, Xiaogang Yu, Zichang Tan, Qi Yuan, Ruikun Yang, Benjia Zhou, Guodong Guo, Stan Z. Li

Although ethnic bias has been verified to severely affect the performance of face recognition systems, it still remains an open research problem in face anti-spoofing.

Face Anti-Spoofing Face Recognition

CASIA-SURF CeFA: A Benchmark for Multi-modal Cross-ethnicity Face Anti-spoofing

no code implementations11 Mar 2020 Ajian Li, Zichang Tan, Xuan Li, Jun Wan, Sergio Escalera, Guodong Guo, Stan Z. Li

Ethnic bias has proven to negatively affect the performance of face recognition systems, and it remains an open research problem in face anti-spoofing.

Face Anti-Spoofing Face Recognition

Multi-level Context Gating of Embedded Collective Knowledge for Medical Image Segmentation

1 code implementation10 Mar 2020 Maryam Asadi-Aghbolaghi, Reza Azad, Mahmood Fathy, Sergio Escalera

These blocks adaptively recalibrate the channel-wise feature responses by utilizing a self-gating mechanism of the global information embedding of the feature maps.

Anatomy Image Segmentation +2

Static and Dynamic Fusion for Multi-modal Cross-ethnicity Face Anti-spoofing

no code implementations5 Dec 2019 Ajian Liu, Zichang Tan, Xuan Li, Jun Wan, Sergio Escalera, Guodong Guo, Stan Z. Li

Regardless of the usage of deep learning and handcrafted methods, the dynamic information from videos and the effect of cross-ethnicity are rarely considered in face anti-spoofing.

Face Anti-Spoofing

CLOTH3D: Clothed 3D Humans

1 code implementation ECCV 2020 Hugo Bertiche, Meysam Madadi, Sergio Escalera

We provide the dataset with a generative model for cloth generation.

Gate-Shift Networks for Video Action Recognition

2 code implementations CVPR 2020 Swathikiran Sudhakaran, Sergio Escalera, Oswald Lanz

Deep 3D CNNs for video action recognition are designed to learn powerful representations in the joint spatio-temporal feature space.

Ranked #21 on Action Recognition on Something-Something V1 (using extra training data)

Action Recognition

On the Effect of Observed Subject Biases in Apparent Personality Analysis from Audio-visual Signals

no code implementations12 Sep 2019 Ricardo Darío Pérez Principi, Cristina Palmero, Julio C. S. Jacques Junior, Sergio Escalera

Furthermore, given the interpretability nature of our network design, we provide an incremental analysis on the impact of each possible source of bias on final network predictions.

CASIA-SURF: A Large-scale Multi-modal Benchmark for Face Anti-spoofing

no code implementations28 Aug 2019 Shifeng Zhang, Ajian Liu, Jun Wan, Yanyan Liang, Guogong Guo, Sergio Escalera, Hugo Jair Escalante, Stan Z. Li

To facilitate face anti-spoofing research, we introduce a large-scale multi-modal dataset, namely CASIA-SURF, which is the largest publicly available dataset for face anti-spoofing in terms of both subjects and modalities.

Face Anti-Spoofing Face Recognition

ChaLearn Looking at People: IsoGD and ConGD Large-scale RGB-D Gesture Recognition

no code implementations29 Jul 2019 Jun Wan, Chi Lin, Longyin Wen, Yunan Li, Qiguang Miao, Sergio Escalera, Gholamreza Anbarjafari, Isabelle Guyon, Guodong Guo, Stan Z. Li

The ChaLearn large-scale gesture recognition challenge has been run twice in two workshops in conjunction with the International Conference on Pattern Recognition (ICPR) 2016 and International Conference on Computer Vision (ICCV) 2017, attracting more than $200$ teams round the world.

Gesture Recognition

FBK-HUPBA Submission to the EPIC-Kitchens 2019 Action Recognition Challenge

no code implementations21 Jun 2019 Swathikiran Sudhakaran, Sergio Escalera, Oswald Lanz

In this report we describe the technical details of our submission to the EPIC-Kitchens 2019 action recognition challenge.

Action Recognition

Hierarchical Feature Aggregation Networks for Video Action Recognition

no code implementations29 May 2019 Swathikiran Sudhakaran, Sergio Escalera, Oswald Lanz

Most action recognition methods base on a) a late aggregation of frame level CNN features using average pooling, max pooling, or RNN, among others, or b) spatio-temporal aggregation via 3D convolutions.

Ranked #49 on Action Recognition on HMDB-51 (using extra training data)

Action Recognition Temporal Action Localization

SMPLR: Deep SMPL reverse for 3D human pose and shape recovery

no code implementations27 Dec 2018 Meysam Madadi, Hugo Bertiche, Sergio Escalera

By implementing SMPLR as an encoder-decoder we avoid the need of complex constraints on pose and shape.

Denoising

A Dataset and Benchmark for Large-scale Multi-modal Face Anti-spoofing

2 code implementations CVPR 2019 Shifeng Zhang, Xiaobo Wang, Ajian Liu, Chenxu Zhao, Jun Wan, Sergio Escalera, Hailin Shi, Zezheng Wang, Stan Z. Li

To facilitate face anti-spoofing research, we introduce a large-scale multi-modal dataset, namely CASIA-SURF, which is the largest publicly available dataset for face anti-spoofing in terms of both subjects and visual modalities.

Face Anti-Spoofing Face Recognition

From 2D to 3D Geodesic-based Garment Matching

1 code implementation21 Sep 2018 Meysam Madadi, Egils Avots, Sergio Escalera, Jordi Gonzalez, Xavier Baro, Gholamreza Anbarjafari

A new approach for 2D to 3D garment retexturing is proposed based on Gaussian mixture models and thin plate splines (TPS).

Beyond One-hot Encoding: lower dimensional target embedding

no code implementations28 Jun 2018 Pau Rodríguez, Miguel A. Bautista, Jordi Gonzàlez, Sergio Escalera

Following this observation, we embed the targets into a low-dimensional space, drastically improving convergence speed while preserving accuracy.

Deep Structure Inference Network for Facial Action Unit Recognition

no code implementations ECCV 2018 Ciprian A. Corneanu, Meysam Madadi, Sergio Escalera

In recent years, most efforts in automatic AU recognition have been dedicated to learning combinations of local features and to exploiting correlations between Action Units.

Facial Action Unit Detection

Folded Recurrent Neural Networks for Future Video Prediction

1 code implementation ECCV 2018 Marc Oliu, Javier Selva, Sergio Escalera

Future video prediction is an ill-posed Computer Vision problem that recently received much attention.

 Ranked #1 on Video Prediction on KTH (Cond metric)

Specificity Video Prediction

RGB-D-based Human Motion Recognition with Deep Learning: A Survey

no code implementations31 Oct 2017 Pichao Wang, Wanqing Li, Philip Ogunbona, Jun Wan, Sergio Escalera

Specifically, deep learning methods based on the CNN and RNN architectures have been adopted for motion recognition using RGB-D data.

Automatic Recognition of Facial Displays of Unfelt Emotions

no code implementations13 Jul 2017 Kaustubh Kulkarni, Ciprian Adrian Corneanu, Ikechukwu Ofodile, Sergio Escalera, Xavier Baro, Sylwia Hyniewska, Juri Allik, Gholamreza Anbarjafari

Performance of the proposed model shows that on average it is easier to distinguish among genuine facial expressions of emotion than among unfelt facial expressions of emotion and that certain emotion pairs such as contempt and disgust are more difficult to distinguish than the rest.

Video Emotion Recognition

End-to-end Global to Local CNN Learning for Hand Pose Recovery in Depth Data

no code implementations26 May 2017 Meysam Madadi, Sergio Escalera, Xavier Baro, Jordi Gonzalez

Despite recent advances in 3D pose estimation of human hands, especially thanks to the advent of CNNs and depth cameras, this task is still far from being solved.

3D Pose Estimation Data Augmentation

WordFence: Text Detection in Natural Images with Border Awareness

no code implementations15 May 2017 Andrei Polzounov, Artsiom Ablavatski, Sergio Escalera, Shijian Lu, Jianfei Cai

In recent years, text recognition has achieved remarkable success in recognizing scanned document text.

Semantic Segmentation

ChaLearn Looking at People: A Review of Events and Resources

no code implementations10 Jan 2017 Sergio Escalera, Xavier Baró, Hugo Jair Escalante, Isabelle Guyon

This paper reviews associated events, and introduces the ChaLearn LAP platform where public resources (including code, data and preprints of papers) related to the organized events are available.

Gesture Recognition

Error-Correcting Factorization

no code implementations27 Feb 2015 Miguel Angel Bautista, Oriol Pujol, Fernando de la Torre, Sergio Escalera

To address these limitations this paper proposes an Error-Correcting Factorization (ECF) method, our contribution is three fold: (I) We propose a novel representation of the error-correction capability, called the design matrix, that enables us to build an ECOC on the basis of allocating correction to pairs of classes.

Multi-class Classification

Non-Verbal Communication Analysis in Victim-Offender Mediations

no code implementations25 Nov 2014 Víctor Ponce-López, Sergio Escalera, Marc Pérez, Oriol Janés, Xavier Baró

In this paper we present a non-invasive ambient intelligence framework for the semi-automatic analysis of non-verbal communication applied to the restorative justice field.

Binary Classification

A Gesture Recognition System for Detecting Behavioral Patterns of ADHD

no code implementations16 Oct 2014 Miguel Ángel Bautista, Antonio Hernández-Vela, Sergio Escalera, Laura Igual, Oriol Pujol, Josep Moya, Verónica Violant, María Teresa Anguera

We propose an extension of DTW using one-class classifiers in order to be able to encode the variability of a gesture category, and thus, perform an alignment between a gesture sample and a gesture class.

Dynamic Time Warping Gesture Recognition

Cannot find the paper you are looking for? You can Submit a new open access paper.