Search Results for author: Erkut Erdem

Found 39 papers, 18 papers with code

Sequential Compositional Generalization in Multimodal Models

no code implementations18 Apr 2024 Semih Yagcioglu, Osman Batur İnce, Aykut Erdem, Erkut Erdem, Desmond Elliott, Deniz Yuret

The rise of large-scale multimodal models has paved the way for groundbreaking advances in generative modeling and reasoning, unlocking transformative applications in a variety of complex tasks.

ViLMA: A Zero-Shot Benchmark for Linguistic and Temporal Grounding in Video-Language Models

no code implementations13 Nov 2023 Ilker Kesen, Andrea Pedrotti, Mustafa Dogan, Michele Cafagna, Emre Can Acikgoz, Letitia Parcalabescu, Iacer Calixto, Anette Frank, Albert Gatt, Aykut Erdem, Erkut Erdem

With the ever-increasing popularity of pretrained Video-Language Models (VidLMs), there is a pressing need to develop robust evaluation methodologies that delve deeper into their visio-linguistic capabilities.

counterfactual, Language Modelling

Hyperspectral Image Denoising via Self-Modulating Convolutional Neural Networks

1 code implementation15 Sep 2023 Orhan Torun, Seniha Esen Yuksel, Erkut Erdem, Nevrez Imamoglu, Aykut Erdem

At the core of the model lies a novel block, which we call spectral self-modulating residual block (SSMRB), that allows the network to transform the features in an adaptive manner based on the adjacent spectral data, enhancing the network's ability to handle complex noise.

Hyperspectral Image Denoising, Image Denoising
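
As a rough illustration of the spectral self-modulation idea described in the abstract above, the sketch below rescales a residual branch with gates predicted from features of adjacent spectral bands. The class name, layer sizes, and gating choice are illustrative assumptions, not the authors' exact SSMRB.

```python
import torch
import torch.nn as nn

class SelfModulatingResBlock(nn.Module):
    """Residual block whose features are modulated by adjacent-band features."""
    def __init__(self, channels: int):
        super().__init__()
        self.body = nn.Sequential(
            nn.Conv2d(channels, channels, 3, padding=1),
            nn.ReLU(inplace=True),
            nn.Conv2d(channels, channels, 3, padding=1),
        )
        # Gate predicted from features of the neighbouring spectral bands.
        self.gate = nn.Sequential(
            nn.Conv2d(channels, channels, 1),
            nn.Sigmoid(),
        )

    def forward(self, x: torch.Tensor, neighbours: torch.Tensor) -> torch.Tensor:
        # x, neighbours: (B, C, H, W); `neighbours` summarises adjacent bands.
        return x + self.body(x) * self.gate(neighbours)

# Toy usage
feats = torch.randn(2, 32, 64, 64)
neigh = torch.randn(2, 32, 64, 64)
out = SelfModulatingResBlock(32)(feats, neigh)  # -> (2, 32, 64, 64)
```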

Spherical Vision Transformer for 360-degree Video Saliency Prediction

1 code implementation24 Aug 2023 Mert Cokelek, Nevrez Imamoglu, Cagri Ozcinar, Erkut Erdem, Aykut Erdem

The growing interest in omnidirectional videos (ODVs), which capture the full field of view (FOV), has made 360-degree saliency prediction an increasingly important problem in computer vision.

Saliency Prediction, Video Saliency Prediction +1

CLIP-Guided StyleGAN Inversion for Text-Driven Real Image Editing

no code implementations17 Jul 2023 Ahmet Canberk Baykal, Abdul Basit Anees, Duygu Ceylan, Erkut Erdem, Aykut Erdem, Deniz Yuret

Existing approaches for editing images using language either resort to instance-level latent code optimization or map predefined text prompts to some editing directions in the latent space.

Attribute

HyperE2VID: Improving Event-Based Video Reconstruction via Hypernetworks

1 code implementation10 May 2023 Burak Ercan, Onur Eker, Canberk Saglam, Aykut Erdem, Erkut Erdem

Event-based cameras are becoming increasingly popular for their ability to capture high-speed motion with low latency and high dynamic range.

Event-Based Video Reconstruction, Video Reconstruction
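
The title points to hypernetworks; as a hedged sketch of that general idea (not the paper's architecture), the snippet below lets a small network predict per-sample convolution weights from a context vector, e.g. one summarising recent event statistics. All names and sizes are hypothetical.

```python
import torch
import torch.nn as nn
import torch.nn.functional as F

class HyperConv(nn.Module):
    """Convolution whose kernels are generated by a hypernetwork from a context."""
    def __init__(self, in_ch: int = 16, out_ch: int = 16, k: int = 3, ctx_dim: int = 32):
        super().__init__()
        self.in_ch, self.out_ch, self.k = in_ch, out_ch, k
        self.weight_gen = nn.Linear(ctx_dim, out_ch * in_ch * k * k)

    def forward(self, x: torch.Tensor, ctx: torch.Tensor) -> torch.Tensor:
        # x: (B, in_ch, H, W), ctx: (B, ctx_dim); one weight set per sample.
        w = self.weight_gen(ctx).view(-1, self.out_ch, self.in_ch, self.k, self.k)
        return torch.stack([
            F.conv2d(xi.unsqueeze(0), wi, padding=self.k // 2).squeeze(0)
            for xi, wi in zip(x, w)
        ])

x = torch.randn(2, 16, 32, 32)
ctx = torch.randn(2, 32)
print(HyperConv()(x, ctx).shape)  # torch.Size([2, 16, 32, 32])
```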

EVREAL: Towards a Comprehensive Benchmark and Analysis Suite for Event-based Video Reconstruction

1 code implementation30 Apr 2023 Burak Ercan, Onur Eker, Aykut Erdem, Erkut Erdem

Event cameras are a new type of vision sensor that incorporates asynchronous and independent pixels, offering advantages over traditional frame-based cameras such as high dynamic range and minimal motion blur.

Event-Based Video Reconstruction, Video Reconstruction

VidStyleODE: Disentangled Video Editing via StyleGAN and NeuralODEs

no code implementations ICCV 2023 Moayed Haji Ali, Andrew Bond, Tolga Birdal, Duygu Ceylan, Levent Karacan, Erkut Erdem, Aykut Erdem

However, the applicability of such advancements to the video domain has been hindered by the difficulty of representing and controlling videos in the latent space of GANs.

Image Animation, Video Editing +1

Inst-Inpaint: Instructing to Remove Objects with Diffusion Models

1 code implementation6 Apr 2023 Ahmet Burak Yildirim, Vedat Baday, Erkut Erdem, Aykut Erdem, Aysegul Dundar

From the application point of view, a user needs to generate masks for the objects they would like to remove, which can be time-consuming and error-prone.

Image Inpainting

ST360IQ: No-Reference Omnidirectional Image Quality Assessment with Spherical Vision Transformers

1 code implementation13 Mar 2023 Nafiseh Jabbari Tofighi, Mohamed Hedi Elfkir, Nevrez Imamoglu, Cagri Ozcinar, Erkut Erdem, Aykut Erdem

As their popularity has increased dramatically in recent years, evaluating the quality of 360-degree images has become a problem of interest, since it provides insights for capturing, transmitting, and consuming this new medium.

Image Quality Assessment

Detecting Euphemisms with Literal Descriptions and Visual Imagery

1 code implementation8 Nov 2022 İlker Kesen, Aykut Erdem, Erkut Erdem, Iacer Calixto

In the second stage, we integrate visual supervision into our system using visual imagery: two sets of images generated by a text-to-image model, taking the terms and their descriptions as input.

Disentangling Content and Motion for Text-Based Neural Video Manipulation

1 code implementation5 Nov 2022 Levent Karacan, Tolga Kerimoğlu, İsmail İnan, Tolga Birdal, Erkut Erdem, Aykut Erdem

Giving machines the ability to imagine possible new objects or scenes from linguistic descriptions and produce their realistic renderings is arguably one of the most challenging problems in computer vision.

Perception-Distortion Trade-off in the SR Space Spanned by Flow Models

no code implementations18 Sep 2022 Cansu Korkmaz, A. Murat Tekalp, Zafer Dogan, Erkut Erdem, Aykut Erdem

We achieve this by benefiting from a diverse set of feasible photo-realistic solutions in the SR space spanned by flow models.

Super-Resolution
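
To make the trade-off concrete, here is a hedged sketch under the assumption that a flow-based super-resolver can draw multiple plausible HR samples; the `flow_sr_sample` placeholder below stands in for such a model. Averaging samples favours distortion metrics (e.g. PSNR), a single sample favours perceptual quality, and blending the two traverses the trade-off.

```python
import torch

def flow_sr_sample(lr: torch.Tensor, n: int) -> torch.Tensor:
    # Placeholder: returns n plausible 4x HR candidates for an LR input (3, h, w).
    return torch.randn(n, 3, lr.shape[-2] * 4, lr.shape[-1] * 4)

def blend_for_tradeoff(lr: torch.Tensor, n: int = 8, alpha: float = 0.5) -> torch.Tensor:
    samples = flow_sr_sample(lr, n)
    mean_sr = samples.mean(dim=0)   # distortion-oriented estimate
    perceptual_sr = samples[0]      # one photo-realistic sample
    # alpha = 0 favours distortion, alpha = 1 favours perceptual quality.
    return (1 - alpha) * mean_sr + alpha * perceptual_sr

out = blend_for_tradeoff(torch.randn(3, 32, 32), alpha=0.3)  # (3, 128, 128)
```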

Beyond the Imitation Game: Quantifying and extrapolating the capabilities of language models

3 code implementations9 Jun 2022 Aarohi Srivastava, Abhinav Rastogi, Abhishek Rao, Abu Awal Md Shoeb, Abubakar Abid, Adam Fisch, Adam R. Brown, Adam Santoro, Aditya Gupta, Adrià Garriga-Alonso, Agnieszka Kluska, Aitor Lewkowycz, Akshat Agarwal, Alethea Power, Alex Ray, Alex Warstadt, Alexander W. Kocurek, Ali Safaya, Ali Tazarv, Alice Xiang, Alicia Parrish, Allen Nie, Aman Hussain, Amanda Askell, Amanda Dsouza, Ambrose Slone, Ameet Rahane, Anantharaman S. Iyer, Anders Andreassen, Andrea Madotto, Andrea Santilli, Andreas Stuhlmüller, Andrew Dai, Andrew La, Andrew Lampinen, Andy Zou, Angela Jiang, Angelica Chen, Anh Vuong, Animesh Gupta, Anna Gottardi, Antonio Norelli, Anu Venkatesh, Arash Gholamidavoodi, Arfa Tabassum, Arul Menezes, Arun Kirubarajan, Asher Mullokandov, Ashish Sabharwal, Austin Herrick, Avia Efrat, Aykut Erdem, Ayla Karakaş, B. Ryan Roberts, Bao Sheng Loe, Barret Zoph, Bartłomiej Bojanowski, Batuhan Özyurt, Behnam Hedayatnia, Behnam Neyshabur, Benjamin Inden, Benno Stein, Berk Ekmekci, Bill Yuchen Lin, Blake Howald, Bryan Orinion, Cameron Diao, Cameron Dour, Catherine Stinson, Cedrick Argueta, César Ferri Ramírez, Chandan Singh, Charles Rathkopf, Chenlin Meng, Chitta Baral, Chiyu Wu, Chris Callison-Burch, Chris Waites, Christian Voigt, Christopher D. Manning, Christopher Potts, Cindy Ramirez, Clara E. Rivera, Clemencia Siro, Colin Raffel, Courtney Ashcraft, Cristina Garbacea, Damien Sileo, Dan Garrette, Dan Hendrycks, Dan Kilman, Dan Roth, Daniel Freeman, Daniel Khashabi, Daniel Levy, Daniel Moseguí González, Danielle Perszyk, Danny Hernandez, Danqi Chen, Daphne Ippolito, Dar Gilboa, David Dohan, David Drakard, David Jurgens, Debajyoti Datta, Deep Ganguli, Denis Emelin, Denis Kleyko, Deniz Yuret, Derek Chen, Derek Tam, Dieuwke Hupkes, Diganta Misra, Dilyar Buzan, Dimitri Coelho Mollo, Diyi Yang, Dong-Ho Lee, Dylan Schrader, Ekaterina Shutova, Ekin Dogus Cubuk, Elad Segal, Eleanor Hagerman, Elizabeth Barnes, Elizabeth Donoway, Ellie Pavlick, Emanuele Rodola, Emma Lam, Eric Chu, Eric Tang, Erkut Erdem, Ernie Chang, Ethan A. Chi, Ethan Dyer, Ethan Jerzak, Ethan Kim, Eunice Engefu Manyasi, Evgenii Zheltonozhskii, Fanyue Xia, Fatemeh Siar, Fernando Martínez-Plumed, Francesca Happé, Francois Chollet, Frieda Rong, Gaurav Mishra, Genta Indra Winata, Gerard de Melo, Germán Kruszewski, Giambattista Parascandolo, Giorgio Mariani, Gloria Wang, Gonzalo Jaimovitch-López, Gregor Betz, Guy Gur-Ari, Hana Galijasevic, Hannah Kim, Hannah Rashkin, Hannaneh Hajishirzi, Harsh Mehta, Hayden Bogar, Henry Shevlin, Hinrich Schütze, Hiromu Yakura, Hongming Zhang, Hugh Mee Wong, Ian Ng, Isaac Noble, Jaap Jumelet, Jack Geissinger, Jackson Kernion, Jacob Hilton, Jaehoon Lee, Jaime Fernández Fisac, James B. Simon, James Koppel, James Zheng, James Zou, Jan Kocoń, Jana Thompson, Janelle Wingfield, Jared Kaplan, Jarema Radom, Jascha Sohl-Dickstein, Jason Phang, Jason Wei, Jason Yosinski, Jekaterina Novikova, Jelle Bosscher, Jennifer Marsh, Jeremy Kim, Jeroen Taal, Jesse Engel, Jesujoba Alabi, Jiacheng Xu, Jiaming Song, Jillian Tang, Joan Waweru, John Burden, John Miller, John U. Balis, Jonathan Batchelder, Jonathan Berant, Jörg Frohberg, Jos Rozen, Jose Hernandez-Orallo, Joseph Boudeman, Joseph Guerr, Joseph Jones, Joshua B. Tenenbaum, Joshua S. Rule, Joyce Chua, Kamil Kanclerz, Karen Livescu, Karl Krauth, Karthik Gopalakrishnan, Katerina Ignatyeva, Katja Markert, Kaustubh D. 
Dhole, Kevin Gimpel, Kevin Omondi, Kory Mathewson, Kristen Chiafullo, Ksenia Shkaruta, Kumar Shridhar, Kyle McDonell, Kyle Richardson, Laria Reynolds, Leo Gao, Li Zhang, Liam Dugan, Lianhui Qin, Lidia Contreras-Ochando, Louis-Philippe Morency, Luca Moschella, Lucas Lam, Lucy Noble, Ludwig Schmidt, Luheng He, Luis Oliveros Colón, Luke Metz, Lütfi Kerem Şenel, Maarten Bosma, Maarten Sap, Maartje ter Hoeve, Maheen Farooqi, Manaal Faruqui, Mantas Mazeika, Marco Baturan, Marco Marelli, Marco Maru, Maria Jose Ramírez Quintana, Marie Tolkiehn, Mario Giulianelli, Martha Lewis, Martin Potthast, Matthew L. Leavitt, Matthias Hagen, Mátyás Schubert, Medina Orduna Baitemirova, Melody Arnaud, Melvin McElrath, Michael A. Yee, Michael Cohen, Michael Gu, Michael Ivanitskiy, Michael Starritt, Michael Strube, Michał Swędrowski, Michele Bevilacqua, Michihiro Yasunaga, Mihir Kale, Mike Cain, Mimee Xu, Mirac Suzgun, Mitch Walker, Mo Tiwari, Mohit Bansal, Moin Aminnaseri, Mor Geva, Mozhdeh Gheini, Mukund Varma T, Nanyun Peng, Nathan A. Chi, Nayeon Lee, Neta Gur-Ari Krakover, Nicholas Cameron, Nicholas Roberts, Nick Doiron, Nicole Martinez, Nikita Nangia, Niklas Deckers, Niklas Muennighoff, Nitish Shirish Keskar, Niveditha S. Iyer, Noah Constant, Noah Fiedel, Nuan Wen, Oliver Zhang, Omar Agha, Omar Elbaghdadi, Omer Levy, Owain Evans, Pablo Antonio Moreno Casares, Parth Doshi, Pascale Fung, Paul Pu Liang, Paul Vicol, Pegah Alipoormolabashi, Peiyuan Liao, Percy Liang, Peter Chang, Peter Eckersley, Phu Mon Htut, Pinyu Hwang, Piotr Miłkowski, Piyush Patil, Pouya Pezeshkpour, Priti Oli, Qiaozhu Mei, Qing Lyu, Qinlang Chen, Rabin Banjade, Rachel Etta Rudolph, Raefer Gabriel, Rahel Habacker, Ramon Risco, Raphaël Millière, Rhythm Garg, Richard Barnes, Rif A. Saurous, Riku Arakawa, Robbe Raymaekers, Robert Frank, Rohan Sikand, Roman Novak, Roman Sitelew, Ronan LeBras, Rosanne Liu, Rowan Jacobs, Rui Zhang, Ruslan Salakhutdinov, Ryan Chi, Ryan Lee, Ryan Stovall, Ryan Teehan, Rylan Yang, Sahib Singh, Saif M. Mohammad, Sajant Anand, Sam Dillavou, Sam Shleifer, Sam Wiseman, Samuel Gruetter, Samuel R. Bowman, Samuel S. Schoenholz, Sanghyun Han, Sanjeev Kwatra, Sarah A. Rous, Sarik Ghazarian, Sayan Ghosh, Sean Casey, Sebastian Bischoff, Sebastian Gehrmann, Sebastian Schuster, Sepideh Sadeghi, Shadi Hamdan, Sharon Zhou, Shashank Srivastava, Sherry Shi, Shikhar Singh, Shima Asaadi, Shixiang Shane Gu, Shubh Pachchigar, Shubham Toshniwal, Shyam Upadhyay, Shyamolima, Debnath, Siamak Shakeri, Simon Thormeyer, Simone Melzi, Siva Reddy, Sneha Priscilla Makini, Soo-Hwan Lee, Spencer Torene, Sriharsha Hatwar, Stanislas Dehaene, Stefan Divic, Stefano Ermon, Stella Biderman, Stephanie Lin, Stephen Prasad, Steven T. Piantadosi, Stuart M. 
Shieber, Summer Misherghi, Svetlana Kiritchenko, Swaroop Mishra, Tal Linzen, Tal Schuster, Tao Li, Tao Yu, Tariq Ali, Tatsu Hashimoto, Te-Lin Wu, Théo Desbordes, Theodore Rothschild, Thomas Phan, Tianle Wang, Tiberius Nkinyili, Timo Schick, Timofei Kornev, Titus Tunduny, Tobias Gerstenberg, Trenton Chang, Trishala Neeraj, Tushar Khot, Tyler Shultz, Uri Shaham, Vedant Misra, Vera Demberg, Victoria Nyamai, Vikas Raunak, Vinay Ramasesh, Vinay Uday Prabhu, Vishakh Padmakumar, Vivek Srikumar, William Fedus, William Saunders, William Zhang, Wout Vossen, Xiang Ren, Xiaoyu Tong, Xinran Zhao, Xinyi Wu, Xudong Shen, Yadollah Yaghoobzadeh, Yair Lakretz, Yangqiu Song, Yasaman Bahri, Yejin Choi, Yichi Yang, Yiding Hao, Yifu Chen, Yonatan Belinkov, Yu Hou, Yufang Hou, Yuntao Bai, Zachary Seid, Zhuoye Zhao, Zijian Wang, Zijie J. Wang, ZiRui Wang, Ziyi Wu

BIG-bench focuses on tasks that are believed to be beyond the capabilities of current language models.

Common Sense Reasoning, Math +1

SLAMP: Stochastic Latent Appearance and Motion Prediction

1 code implementation ICCV 2021 Adil Kaan Akan, Erkut Erdem, Aykut Erdem, Fatma Güney

Motion is an important cue for video prediction and is often utilized by separating video content into static and dynamic components.

 Ranked #1 on Video Prediction on Cityscapes 128x128 (PSNR metric)

Autonomous Driving, motion prediction +2

A Gated Fusion Network for Dynamic Saliency Prediction

no code implementations15 Feb 2021 Aysun Kocak, Erkut Erdem, Aykut Erdem

Predicting saliency in videos is a challenging problem due to the complex modeling of interactions between spatial and temporal information, especially when the ever-changing, dynamic nature of videos is considered.

Saliency Prediction
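
A minimal sketch of gated fusion in this spirit (layer sizes and gating form are assumptions, not the paper's exact network): a learned sigmoid gate mixes spatial and temporal feature maps per location.

```python
import torch
import torch.nn as nn

class GatedFusion(nn.Module):
    """Per-location gate deciding how much to trust spatial vs temporal features."""
    def __init__(self, channels: int):
        super().__init__()
        self.gate = nn.Sequential(
            nn.Conv2d(2 * channels, channels, 3, padding=1),
            nn.Sigmoid(),
        )

    def forward(self, spatial: torch.Tensor, temporal: torch.Tensor) -> torch.Tensor:
        g = self.gate(torch.cat([spatial, temporal], dim=1))
        return g * spatial + (1 - g) * temporal

fused = GatedFusion(64)(torch.randn(1, 64, 56, 56), torch.randn(1, 64, 56, 56))
```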

Object and Relation Centric Representations for Push Effect Prediction

no code implementations3 Feb 2021 Ahmet E. Tekden, Aykut Erdem, Erkut Erdem, Tamim Asfour, Emre Ugur

Our approach enables the robot to predict and adapt the effect of a pushing action as it observes the scene.

Object Relation

MSVD-Turkish: A Comprehensive Multimodal Dataset for Integrated Vision and Language Research in Turkish

no code implementations13 Dec 2020 Begum Citamak, Ozan Caglayan, Menekse Kuyu, Erkut Erdem, Aykut Erdem, Pranava Madhyastha, Lucia Specia

We hope that the MSVD-Turkish dataset and the results reported in this work will lead to better video captioning and multimodal machine translation models for Turkish and other morphology rich and agglutinative languages.

Multimodal Machine Translation, Sentence +3

Burst Photography for Learning to Enhance Extremely Dark Images

1 code implementation17 Jun 2020 Ahmet Serdar Karadeniz, Erkut Erdem, Aykut Erdem

Capturing images under extremely low-light conditions poses significant challenges for the standard camera pipeline.

Low-Light Image Enhancement

Modulating Bottom-Up and Top-Down Visual Processing via Language-Conditional Filters

1 code implementation28 Mar 2020 İlker Kesen, Ozan Arkan Can, Erkut Erdem, Aykut Erdem, Deniz Yuret

Our experiments reveal that using language to control the filters for bottom-up visual processing in addition to top-down attention leads to better results on both tasks and achieves competitive performance.

Colorization, Image Colorization +3
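
As a rough sketch of language-conditional filtering for bottom-up visual processing (dimensions and module names are illustrative assumptions), a sentence embedding is mapped to depthwise convolution kernels that are applied to early visual features:

```python
import torch
import torch.nn as nn
import torch.nn.functional as F

class LanguageConditionalConv(nn.Module):
    """Depthwise convolution whose kernels are generated from a language embedding."""
    def __init__(self, vis_ch: int = 64, lang_dim: int = 256, k: int = 3):
        super().__init__()
        self.vis_ch, self.k = vis_ch, k
        self.kernel_gen = nn.Linear(lang_dim, vis_ch * k * k)

    def forward(self, visual: torch.Tensor, lang: torch.Tensor) -> torch.Tensor:
        # visual: (B, C, H, W), lang: (B, lang_dim); one kernel set per sample.
        kernels = self.kernel_gen(lang).view(-1, self.vis_ch, 1, self.k, self.k)
        out = [F.conv2d(v.unsqueeze(0), kw, padding=self.k // 2, groups=self.vis_ch)
               for v, kw in zip(visual, kernels)]
        return torch.cat(out, dim=0)

y = LanguageConditionalConv()(torch.randn(2, 64, 28, 28), torch.randn(2, 256))
```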

Burst Denoising of Dark Images

1 code implementation17 Mar 2020 Ahmet Serdar Karadeniz, Erkut Erdem, Aykut Erdem

Capturing images under extremely low-light conditions poses significant challenges for the standard camera pipeline.

Denoising, Image Enhancement

Belief Regulated Dual Propagation Nets for Learning Action Effects on Articulated Multi-Part Objects

1 code implementation9 Sep 2019 Ahmet E. Tekden, Aykut Erdem, Erkut Erdem, Mert Imre, M. Yunus Seker, Emre Ugur

In this paper, we introduce Belief Regulated Dual Propagation Networks (BRDPN), a general purpose learnable physics engine, which enables a robot to predict the effects of its actions in scenes containing groups of articulated multi-part objects.

Robotics

RecipeQA: A Challenge Dataset for Multimodal Comprehension of Cooking Recipes

no code implementations EMNLP 2018 Semih Yagcioglu, Aykut Erdem, Erkut Erdem, Nazli Ikizler-Cinbis

With over 36K automatically generated question-answer pairs, we design a set of comprehension and reasoning tasks that require joint understanding of images and text, capturing the temporal flow of events and making sense of procedural knowledge.

Reading Comprehension

Manipulating Attributes of Natural Scenes via Hallucination

no code implementations22 Aug 2018 Levent Karacan, Zeynep Akata, Aykut Erdem, Erkut Erdem

In this study, we explore building a two-stage framework for enabling users to directly manipulate high-level attributes of a natural scene.

Hallucination, Style Transfer +1

Language Guided Fashion Image Manipulation with Feature-wise Transformations

no code implementations12 Aug 2018 Mehmet Günel, Erkut Erdem, Aykut Erdem

Developing techniques for editing an outfit image through natural language sentences and generating new outfits accordingly has promising applications in art, fashion, and design.

Image Manipulation
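
The "feature-wise transformations" in the title suggest FiLM-style conditioning; below is a minimal, hypothetical sketch (not the authors' exact model) in which a text embedding predicts per-channel scale and shift for the visual features.

```python
import torch
import torch.nn as nn

class FiLM(nn.Module):
    """Feature-wise affine modulation conditioned on a text embedding."""
    def __init__(self, text_dim: int, channels: int):
        super().__init__()
        self.to_gamma_beta = nn.Linear(text_dim, 2 * channels)

    def forward(self, feats: torch.Tensor, text_emb: torch.Tensor) -> torch.Tensor:
        # feats: (B, C, H, W), text_emb: (B, text_dim)
        gamma, beta = self.to_gamma_beta(text_emb).chunk(2, dim=-1)
        return gamma[:, :, None, None] * feats + beta[:, :, None, None]

out = FiLM(text_dim=300, channels=128)(torch.randn(4, 128, 16, 16), torch.randn(4, 300))
```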

Image Synthesis in Multi-Contrast MRI with Conditional Generative Adversarial Networks

2 code implementations5 Feb 2018 Salman Ul Hassan Dar, Mahmut Yurt, Levent Karacan, Aykut Erdem, Erkut Erdem, Tolga Çukur

The proposed approach preserves high-frequency details via an adversarial loss; and it offers enhanced synthesis performance via a pixel-wise loss for registered multi-contrast images and a cycle-consistency loss for unregistered images.

Anatomy, Image Generation
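
A hedged sketch of the loss combination the abstract describes, with placeholder generator and discriminator callables (`G_ab`, `G_ba`, `D_b` and the loss weights are assumptions): an adversarial term is always applied, a pixel-wise L1 term is used for registered pairs, and a cycle-consistency term for unregistered pairs.

```python
import torch
import torch.nn.functional as F

def synthesis_loss(G_ab, G_ba, D_b, x_a, x_b, registered=True,
                   lam_pix=100.0, lam_cyc=10.0):
    """Generator-side loss for synthesising contrast B from contrast A."""
    fake_b = G_ab(x_a)
    # Adversarial term: try to fool the target-contrast discriminator.
    logits = D_b(fake_b)
    adv = F.binary_cross_entropy_with_logits(logits, torch.ones_like(logits))
    if registered:
        # Pixel-wise loss is only meaningful when the A/B pair is registered.
        return adv + lam_pix * F.l1_loss(fake_b, x_b)
    # For unregistered data, enforce cycle consistency back to contrast A instead.
    return adv + lam_cyc * F.l1_loss(G_ba(fake_b), x_a)
```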

Re-evaluating Automatic Metrics for Image Captioning

no code implementations EACL 2017 Mert Kilickaya, Aykut Erdem, Nazli Ikizler-Cinbis, Erkut Erdem

The task of generating natural language descriptions from images has received a lot of attention in recent years.

Image Captioning

Spatio-Temporal Saliency Networks for Dynamic Saliency Prediction

no code implementations16 Jul 2016 Cagdas Bak, Aysun Kocak, Erkut Erdem, Aykut Erdem

We also carry out some experiments on a number of still images from the MIT300 dataset by exploiting the optical flow maps predicted from these images.

Optical Flow Estimation, Saliency Prediction

Automatic Description Generation from Images: A Survey of Models, Datasets, and Evaluation Measures

no code implementations15 Jan 2016 Raffaella Bernardi, Ruket Cakici, Desmond Elliott, Aykut Erdem, Erkut Erdem, Nazli Ikizler-Cinbis, Frank Keller, Adrian Muscat, Barbara Plank

Automatic description generation from natural images is a challenging problem that has recently received a large amount of interest from the computer vision and natural language processing communities.

Retrieval

Image Matting With KL-Divergence Based Sparse Sampling

no code implementations ICCV 2015 Levent Karacan, Aykut Erdem, Erkut Erdem

Previous sampling-based image matting methods typically rely on certain heuristics in collecting representative samples from known regions, and thus their performance deteriorates if the underlying assumptions are not satisfied.

Image Matting

Visual saliency estimation by integrating features using multiple kernel learning

no code implementations22 Jul 2013 Yasin Kavak, Erkut Erdem, Aykut Erdem

In the last few decades, significant achievements have been attained in predicting where humans look in images through different computational models.

Object Saliency Prediction
