1 code implementation • 15 Sep 2023 • Amir Rahimi, Vanessa D'Amario, Moyuru Yamada, Kentaro Takemoto, Tomotake Sasaki, Xavier Boix
We demonstrate that this result is independent of the similarity between the training and testing data and applies to well-known families of neural network architectures for VQA (i. e. monolithic architectures and neural module networks).
1 code implementation • 15 Jun 2023 • Ian Mason, Anirban Sarkar, Tomotake Sasaki, Xavier Boix
In this work we develop a compositional image classification task where, given a few elemental corruptions, models are asked to generalize to compositions of these corruptions.
1 code implementation • 17 May 2023 • Kentaro Takemoto, Moyuru Yamada, Tomotake Sasaki, Hisanao Akima
Human-Object Interaction (HOI) detection is a task to localize humans and objects in an image and predict the interactions in human-object pairs.
Human-Object Interaction Detection Systematic Generalization
no code implementations • 17 Mar 2023 • Anirban Sarkar, Matthew Groth, Ian Mason, Tomotake Sasaki, Xavier Boix
Deep Neural Networks (DNNs) often fail in out-of-distribution scenarios.
1 code implementation • 30 Sep 2022 • Yoshihiro Okawa, Tomotake Sasaki, Hitoshi Yanami, Toru Namerikawa
We define the safety during learning as satisfaction of the constraint conditions explicitly defined in terms of the state and propose a safe exploration method that uses partial prior knowledge of a controlled object and disturbance.
1 code implementation • 27 Jan 2022 • Moyuru Yamada, Vanessa D'Amario, Kentaro Takemoto, Xavier Boix, Tomotake Sasaki
We reveal that Neural Module Networks (NMNs), i. e., question-specific compositions of modules that tackle a sub-task, achieve better or similar systematic generalization performance than the conventional Transformers, even though NMNs' modules are CNN-based.
1 code implementation • 25 Jan 2022 • Kimberly Villalobos, Vilim Štih, Amineh Ahmadinejad, Shobhita Sundaram, Jamell Dozier, Andrew Francl, Frederico Azevedo, Tomotake Sasaki, Xavier Boix
Only recurrent networks trained with small images learn solutions that generalize well to almost any curve.
1 code implementation • 8 Dec 2021 • Shobhita Sundaram, Darius Sinha, Matthew Groth, Tomotake Sasaki, Xavier Boix
Symmetry is omnipresent in nature and perceived by the visual system of many species, as it facilitates detecting ecologically important classes of objects in our environment.
1 code implementation • 30 Oct 2021 • Akira Sakai, Taro Sunagawa, Spandan Madan, Kanata Suzuki, Takashi Katoh, Hiromichi Kobashi, Hanspeter Pfister, Pawan Sinha, Xavier Boix, Tomotake Sasaki
While humans have a remarkable capability of recognizing objects in out-of-distribution (OoD) orientations and illuminations, Deep Neural Networks (DNNs) severely suffer in this case, even when large amounts of training examples are available.
no code implementations • 28 Sep 2021 • Avi Cooper, Xavier Boix, Daniel Harari, Spandan Madan, Hanspeter Pfister, Tomotake Sasaki, Pawan Sinha
The capability of Deep Neural Networks (DNNs) to recognize objects in orientations outside the distribution of the training data is not well understood.
no code implementations • 13 Jul 2021 • Vanessa D'Amario, Sanjana Srivastava, Tomotake Sasaki, Xavier Boix
Datasets often contain input dimensions that are unnecessary to predict the output label, e. g. background in object recognition, which lead to more trainable parameters.
1 code implementation • 30 Jun 2021 • Spandan Madan, Tomotake Sasaki, Hanspeter Pfister, Tzu-Mao Li, Xavier Boix
This result provides evidence supporting theories attributing adversarial examples to the proximity of data to ground-truth class boundaries, and calls into question other theories which do not account for this more stringent definition of adversarial attacks.
1 code implementation • NeurIPS 2021 • Vanessa D'Amario, Tomotake Sasaki, Xavier Boix
Neural Module Networks (NMNs) aim at Visual Question Answering (VQA) via composition of modules that tackle a sub-task.
1 code implementation • 5 Mar 2021 • Mei Minami, Yuka Masumoto, Yoshihiro Okawa, Tomotake Sasaki, Yutaka Hori
To overcome this limitation, we propose a model-free two-step design approach that improves the transient learning performance of RL in an optimal regulator redesign problem for unknown nonlinear systems.
no code implementations • 5 Mar 2021 • Yoshihiro Okawa, Tomotake Sasaki, Hidenao Iwane
In reinforcement learning (RL) algorithms, exploratory control inputs are used during learning to acquire knowledge for decision making and control, while the true dynamics of a controlled object is unknown.
no code implementations • 1 Jan 2021 • Spandan Madan, Timothy Henry, Jamell Arthur Dozier, Helen Ho, Nishchal Bhandari, Tomotake Sasaki, Fredo Durand, Hanspeter Pfister, Xavier Boix
We find that learning category and viewpoint in separate networks compared to a shared one leads to an increase in selectivity and invariance, as separate networks are not forced to preserve information about both category and viewpoint.
no code implementations • 1 Jan 2021 • Vanessa D'Amario, Sanjana Srivastava, Tomotake Sasaki, Xavier Boix
In this paper, we investigate the impact of unnecessary input dimensions on one of the central issues of machine learning: the number of training examples needed to achieve high generalization performance, which we refer to as the network's data efficiency.
2 code implementations • 15 Jul 2020 • Spandan Madan, Timothy Henry, Jamell Dozier, Helen Ho, Nishchal Bhandari, Tomotake Sasaki, Frédo Durand, Hanspeter Pfister, Xavier Boix
In this paper, we investigate when and how such OOD generalization may be possible by evaluating CNNs trained to classify both object category and 3D viewpoint on OOD combinations, and identifying the neural mechanisms that facilitate such OOD generalization.
no code implementations • ICML 2020 • Keizo Kato, Jing Zhou, Tomotake Sasaki, Akira Nakagawa
We show our method has the following properties: (i) the Jacobian matrix between the input space and a Euclidean latent space forms a constantlyscaled orthonormal system and enables isometric data embedding; (ii) the relation of PDFs in both spaces can become tractable one such as proportional relation.
no code implementations • 25 Sep 2019 • Kimberly M Villalobos, Vilim Stih, Amineh Ahmadinejad, Jamell Dozier, Andrew Francl, Frederico Azevedo, Tomotake Sasaki, Xavier Boix
At the heart of image segmentation lies the problem of determining whether a pixel is inside or outside a region, which we denote as the "insideness" problem.