Search Results for author: Ilker Yildirim

Found 15 papers, 3 papers with code

L3GO: Language Agents with Chain-of-3D-Thoughts for Generating Unconventional Objects

no code implementations14 Feb 2024 Yutaro Yamada, Khyathi Chandu, YuChen Lin, Jack Hessel, Ilker Yildirim, Yejin Choi

In this paper, we propose a language agent with chain-of-3D-thoughts (L3GO), an inference-time approach that can reason about part-based 3D mesh generation of unconventional objects that current data-driven diffusion models struggle with.

Image Generation Text to 3D

How does the primate brain combine generative and discriminative computations in vision?

no code implementations11 Jan 2024 Benjamin Peters, James J. DiCarlo, Todd Gureckis, Ralf Haefner, Leyla Isik, Joshua Tenenbaum, Talia Konkle, Thomas Naselaris, Kimberly Stachenfeld, Zenna Tavares, Doris Tsao, Ilker Yildirim, Nikolaus Kriegeskorte

The alternative conception is that of vision as an inference process in Helmholtz's sense, where the sensory evidence is evaluated in the context of a generative model of the causal processes giving rise to it.

Evaluating Spatial Understanding of Large Language Models

1 code implementation23 Oct 2023 Yutaro Yamada, Yihan Bao, Andrew K. Lampinen, Jungo Kasai, Ilker Yildirim

Large language models (LLMs) show remarkable capabilities across a variety of tasks.

From seeing to remembering: Images with harder-to-reconstruct representations leave stronger memory traces

1 code implementation21 Feb 2023 Qi Lin, Zifan Li, John Lafferty, Ilker Yildirim

Much of what we remember is not due to intentional selection, but simply a by-product of perceiving.

Retrieval

3D Shape Perception Integrates Intuitive Physics and Analysis-by-Synthesis

1 code implementation9 Jan 2023 Ilker Yildirim, Max H. Siegel, Amir A. Soltani, Shraman Ray Chaudhari, Joshua B. Tenenbaum

Many surface cues support three-dimensional shape perception, but people can sometimes still see shape when these features are missing -- in extreme cases, even when an object is completely occluded, as when covered with a draped cloth.

When are Lemons Purple? The Concept Association Bias of Vision-Language Models

no code implementations22 Dec 2022 Yutaro Yamada, Yingtian Tang, Yoyo Zhang, Ilker Yildirim

Large-scale vision-language models such as CLIP have shown impressive performance on zero-shot image classification and image-to-text retrieval.

Attribute Image Classification +7

Exploiting 3D Shape Bias towards Robust Vision

no code implementations NeurIPS Workshop SVRHM 2021 Yutaro Yamada, Yuval Kluger, Sahand Negahban, Ilker Yildirim

To tackle the problem from a new perspective, we encourage closer collaboration between the robustness and 3D vision communities.

3D Reconstruction

Geon3D: Exploiting 3D Shape Bias towards Building Robust Machine Vision

no code implementations29 Sep 2021 Yutaro Yamada, Yuval Kluger, Sahand Negahban, Ilker Yildirim

To tackle the problem from a new perspective, we encourage closer collaboration between the robustness and 3D vision communities.

3D Reconstruction

Explaining intuitive difficulty judgments by modeling physical effort and risk

no code implementations11 May 2019 Ilker Yildirim, Basil Saeed, Grace Bennett-Pierre, Tobias Gerstenberg, Joshua Tenenbaum, Hyowon Gweon

The ability to estimate task difficulty is critical for many real-world decisions such as setting appropriate goals for ourselves or appreciating others' accomplishments.

Modeling human intuitions about liquid flow with particle-based simulation

no code implementations5 Sep 2018 Christopher J. Bates, Ilker Yildirim, Joshua B. Tenenbaum, Peter Battaglia

Humans can easily describe, imagine, and, crucially, predict a wide variety of behaviors of liquids--splashing, squirting, gushing, sloshing, soaking, dripping, draining, trickling, pooling, and pouring--despite tremendous variability in their material and dynamical properties.

Scene Understanding

Self-Supervised Intrinsic Image Decomposition

no code implementations NeurIPS 2017 Michael Janner, Jiajun Wu, Tejas D. Kulkarni, Ilker Yildirim, Joshua B. Tenenbaum

Intrinsic decomposition from a single image is a highly challenging task, due to its inherent ambiguity and the scarcity of training data.

Intrinsic Image Decomposition Transfer Learning

Physical problem solving: Joint planning with symbolic, geometric, and dynamic constraints

no code implementations25 Jul 2017 Ilker Yildirim, Tobias Gerstenberg, Basil Saeed, Marc Toussaint, Josh Tenenbaum

In Experiment~2, we asked participants online to judge whether they think the person in the lab used one or two hands.

Galileo: Perceiving Physical Object Properties by Integrating a Physics Engine with Deep Learning

no code implementations NeurIPS 2015 Jiajun Wu, Ilker Yildirim, Joseph J. Lim, Bill Freeman, Josh Tenenbaum

Humans demonstrate remarkable abilities to predict physical events in dynamic scenes, and to infer the physical properties of objects from static images.

Friction Scene Understanding

A Concept Learning Approach to Multisensory Object Perception

no code implementations23 Sep 2014 Ifeoma Nwogu, Goker Erdogan, Ilker Yildirim, Robert Jacobs

This paper presents a computational model of concept learning using Bayesian inference for a grammatically structured hypothesis space, and test the model on multisensory (visual and haptics) recognition of 3D objects.

Bayesian Inference Object

Cannot find the paper you are looking for? You can Submit a new open access paper.