no code implementations • COLING 2022 • Ionut-Catalin Sandu, Daniel Voinea, Alin-Ionut Popa
We created a dataset with 11, 926 images depicting food product labels entitled TREAT dataset, with fully detailed annotations.
no code implementations • 8 Apr 2024 • George Leotescu, Daniel Voinea, Alin-Ionut Popa
The transformer is a powerful data modelling framework responsible for remarkable performance on a wide range of tasks.
no code implementations • 10 Jan 2024 • Mateusz Krubiński, Stefan Matcovici, Diana Grigore, Daniel Voinea, Alin-Ionut Popa
Watermark text spotting in document images can offer access to an often unexplored source of information, providing crucial evidence about a record's scope, audience and sometimes even authenticity.
1 code implementation • 3 Aug 2023 • Mihai Fieraru, Mihai Zanfir, Elisabeta Oneata, Alin-Ionut Popa, Vlad Olaru, Cristian Sminchisescu
Understanding 3d human interactions is fundamental for fine-grained scene analysis and behavioural modeling.
no code implementations • 16 May 2022 • Ionut-Catalin Sandu, Daniel Voinea, Alin-Ionut Popa
We present CONSENT, a simple yet effective CONtext SENsitive Transformer framework for context-dependent object classification within a fully-trainable end-to-end deep learning pipeline.
no code implementations • 18 Dec 2020 • Mihai Fieraru, Mihai Zanfir, Elisabeta Oneata, Alin-Ionut Popa, Vlad Olaru, Cristian Sminchisescu
Monocular estimation of three dimensional human self-contact is fundamental for detailed scene analysis including body language understanding and behaviour modeling.
no code implementations • 23 Sep 2019 • Mihai Zanfir, Elisabeta Oneata, Alin-Ionut Popa, Andrei Zanfir, Cristian Sminchisescu
Generating good quality and geometrically plausible synthetic images of humans with the ability to control appearance, pose and shape parameters, has become increasingly important for a variety of tasks ranging from photo editing, fashion virtual try-on, to special effects and image compression.
no code implementations • NeurIPS 2018 • Andrei Zanfir, Elisabeta Marinoiu, Mihai Zanfir, Alin-Ionut Popa, Cristian Sminchisescu
The final stage of 3d pose and shape prediction is based on a learned attention process where information from different human body parts is optimally integrated.
no code implementations • CVPR 2018 • Mihai Zanfir, Alin-Ionut Popa, Andrei Zanfir, Cristian Sminchisescu
We propose an automatic person-to-person appearance transfer model based on explicit parametric 3d human representations and learned, constrained deep translation network architectures for photographic image synthesis.
no code implementations • CVPR 2017 • Alin-Ionut Popa, Mihai Zanfir, Cristian Sminchisescu
We propose a deep multitask architecture for \emph{fully automatic 2d and 3d human sensing} (DMHS), including \emph{recognition and reconstruction}, in \emph{monocular images}.
Ranked #21 on 3D Human Pose Estimation on HumanEva-I
no code implementations • 27 Jan 2015 • Alin-Ionut Popa, Cristian Sminchisescu
The figure-ground segmentation of humans in images captured in natural environments is an outstanding open problem due to the presence of complex backgrounds, articulation, varying body proportions, partial views and viewpoint changes.