Search Results for author: Edgar Simo-Serra

Found 30 papers, 9 papers with code

Fracking Deep Convolutional Image Descriptors

no code implementations19 Dec 2014 Edgar Simo-Serra, Eduard Trulls, Luis Ferraz, Iasonas Kokkinos, Francesc Moreno-Noguer

In this paper we propose a novel framework for learning local image descriptors in a discriminative manner.

Neuroaesthetics in Fashion: Modeling the Perception of Fashionability

no code implementations Conference 2015 Edgar Simo-Serra, Sanja Fidler, Francesc Moreno-Noguer, Raquel Urtasun

Importantly, our model is able to give rich feedback back to the user, conveying which garments or even scenery she/he should change in order to improve fashionability.

Discriminative Learning of Deep Convolutional Feature Point Descriptors

1 code implementation ICCV 2015 Edgar Simo-Serra, Eduard Trulls, Luis Ferraz, Iasonas Kokkinos, Pascal Fua, Francesc Moreno-Noguer

Deep learning has revolutionalized image-level tasks such as classification, but patch-level tasks, such as correspondence, still rely on hand-crafted features, e. g. SIFT.

Satellite Image Classification

Understanding Human-Centric Images: From Geometry to Fashion

no code implementations14 Dec 2015 Edgar Simo-Serra

Understanding humans from photographs has always been a fundamental goal of computer vision.

3D Human Pose Estimation Semantic Segmentation

Mastering Sketching: Adversarial Augmentation for Structured Prediction

no code implementations27 Mar 2017 Edgar Simo-Serra, Satoshi Iizuka, Hiroshi Ishikawa

Our approach augments a simplification network with a discriminator network, training both networks jointly so that the discriminator network discerns whether a line drawing is a real training data or the output of the simplification network, which in turn tries to fool it.

Structured Prediction

Joint Gap Detection and Inpainting of Line Drawings

no code implementations CVPR 2017 Kazuma Sasaki, Satoshi Iizuka, Edgar Simo-Serra, Hiroshi Ishikawa

We evaluate our method qualitatively on a diverse set of challenging line drawings and also provide quantitative results with a user study, where it significantly outperforms the state of the art.

Virtual Thin Slice: 3D Conditional GAN-based Super-resolution for CT Slice Interval

no code implementations30 Aug 2019 Akira Kudo, Yoshiro Kitamura, Yuanzhong Li, Satoshi Iizuka, Edgar Simo-Serra

In this paper, we present a novel architecture based on conditional Generative Adversarial Networks (cGANs) with the goal of generating high resolution images of main body parts including head, chest, abdomen and legs.

Anatomy SSIM +1

Understanding the Effects of Pre-Training for Object Detectors via Eigenspectrum

no code implementations9 Sep 2019 Yosuke Shinya, Edgar Simo-Serra, Taiji Suzuki

Furthermore, we propose a method for automatically determining the widths (the numbers of channels) of object detectors based on the eigenspectrum.

Image Classification Object +2

Two-stage Discriminative Re-ranking for Large-scale Landmark Retrieval

2 code implementations25 Mar 2020 Shuhei Yokoo, Kohei Ozaki, Edgar Simo-Serra, Satoshi Iizuka

Due to the variance of the images, which include extreme viewpoint changes such as having to retrieve images of the exterior of a landmark from images of the interior, this is very challenging for approaches based exclusively on visual similarity.

Image Retrieval Landmark Recognition +3

TopNet: Topology Preserving Metric Learning for Vessel Tree Reconstruction and Labelling

no code implementations18 Sep 2020 Deepak Keshwani, Yoshiro Kitamura, Satoshi Ihara, Satoshi Iizuka, Edgar Simo-Serra

To the best of our knowledge, this is the first deep learning based approach which learns multi-label tree structure connectivity from images.

Metric Learning Segmentation +1

DeepRemaster: Temporal Source-Reference Attention Networks for Comprehensive Video Enhancement

no code implementations18 Sep 2020 Satoshi Iizuka, Edgar Simo-Serra

The remastering of vintage film comprises of a diversity of sub-tasks including super-resolution, noise removal, and contrast enhancement which aim to restore the deteriorated film medium to its original state.

Colorization Super-Resolution +1

User-Guided Line Art Flat Filling With Split Filling Mechanism

no code implementations CVPR 2021 Lvmin Zhang, Chengze Li, Edgar Simo-Serra, Yi Ji, Tien-Tsin Wong, Chunping Liu

We present a deep learning framework for user-guided line art flat filling that can compute the "influence areas" of the user color scribbles, i. e., the areas where the user scribbles should propagate and influence.

Constrained Graphic Layout Generation via Latent Optimization

1 code implementation2 Aug 2021 Kotaro Kikuchi, Edgar Simo-Serra, Mayu Otani, Kota Yamaguchi

We optimize using the latent space of an off-the-shelf layout generation model, allowing our approach to be complementary to and used with existing layout generation models.

Generative Colorization of Structured Mobile Web Pages

1 code implementation22 Dec 2022 Kotaro Kikuchi, Naoto Inoue, Mayu Otani, Edgar Simo-Serra, Kota Yamaguchi

The web page colorization problem is then formalized as a task of estimating plausible color styles for a given web page content with a given hierarchical structure of the elements.

Colorization Efficient Exploration +1

LayoutDM: Discrete Diffusion Model for Controllable Layout Generation

1 code implementation CVPR 2023 Naoto Inoue, Kotaro Kikuchi, Edgar Simo-Serra, Mayu Otani, Kota Yamaguchi

Controllable layout generation aims at synthesizing plausible arrangement of element bounding boxes with optional constraints, such as type or position of a specific element.

Position

Towards Flexible Multi-modal Document Models

1 code implementation CVPR 2023 Naoto Inoue, Kotaro Kikuchi, Edgar Simo-Serra, Mayu Otani, Kota Yamaguchi

Creative workflows for generating graphical documents involve complex inter-related tasks, such as aligning elements, choosing appropriate fonts, or employing aesthetically harmonious colors.

Multi-Task Learning Position

Controllable Multi-domain Semantic Artwork Synthesis

no code implementations19 Aug 2023 Yuantian Huang, Satoshi Iizuka, Edgar Simo-Serra, Kazuhiro Fukui

To address this problem, we propose a dataset, which we call ArtSem, that contains 40, 000 images of artwork from 4 different domains with their corresponding semantic label maps.

Generative Adversarial Network

Diffusion-based Holistic Texture Rectification and Synthesis

no code implementations26 Sep 2023 Guoqing Hao, Satoshi Iizuka, Kensho Hara, Edgar Simo-Serra, Hirokatsu Kataoka, Kazuhiro Fukui

We present a novel framework for rectifying occlusions and distortions in degraded texture samples from natural images.

Texture Synthesis

Image Synthesis-based Late Stage Cancer Augmentation and Semi-Supervised Segmentation for MRI Rectal Cancer Staging

no code implementations8 Dec 2023 Saeko Sasuga, Akira Kudo, Yoshiro Kitamura, Satoshi Iizuka, Edgar Simo-Serra, Atsushi Hamabe, Masayuki Ishii, Ichiro Takemasa

To tackle this, we propose two kinds of approaches of image synthesis-based late stage cancer augmentation and semi-supervised learning which is designed for T-stage prediction.

Data Augmentation Image Generation +1

Return-Aligned Decision Transformer

no code implementations6 Feb 2024 Tsunehiko Tanaka, Kenshi Abe, Kaito Ariu, Tetsuro Morimura, Edgar Simo-Serra

Traditional approaches in offline reinforcement learning aim to learn the optimal policy that maximizes the cumulative reward, also known as return.

Cannot find the paper you are looking for? You can Submit a new open access paper.