1 code implementation • 16 Oct 2024 • Luis Barroso-Luque, Muhammed Shuaibi, Xiang Fu, Brandon M. Wood, Misko Dzamba, Meng Gao, Ammar Rizvi, C. Lawrence Zitnick, Zachary W. Ulissi
The ability to discover new materials with desirable properties is critical for numerous applications from helping mitigate climate change to advances in next generation computing hardware.
3 code implementations • 6 Feb 2024 • Nate Gruver, Anuroop Sriram, Andrea Madotto, Andrew Gordon Wilson, C. Lawrence Zitnick, Zachary Ulissi
We propose fine-tuning large language models for generation of stable materials.
1 code implementation • 25 Oct 2023 • Nima Shoghi, Adeesh Kolluru, John R. Kitchin, Zachary W. Ulissi, C. Lawrence Zitnick, Brandon M. Wood
Similar success in atomic property prediction has been limited due to the challenges of training effective models across multiple chemical domains.
2 code implementations • 7 Feb 2023 • Saro Passaro, C. Lawrence Zitnick
Graph neural networks that model 3D data, such as point clouds or atoms, are typically desired to be $SO(3)$ equivariant, i. e., equivariant to 3D rotations.
1 code implementation • 29 Nov 2022 • Janice Lan, Aini Palizhati, Muhammed Shuaibi, Brandon M. Wood, Brook Wander, Abhishek Das, Matt Uyttendaele, C. Lawrence Zitnick, Zachary W. Ulissi
Computational catalysis is playing an increasingly significant role in the design of catalysts across a wide range of applications.
2 code implementations • 29 Jun 2022 • C. Lawrence Zitnick, Abhishek Das, Adeesh Kolluru, Janice Lan, Muhammed Shuaibi, Anuroop Sriram, Zachary Ulissi, Brandon Wood
We propose the Spherical Channel Network (SCN) to model atomic energies and forces.
1 code implementation • 17 Jun 2022 • Richard Tran, Janice Lan, Muhammed Shuaibi, Brandon M. Wood, Siddharth Goyal, Abhishek Das, Javier Heras-Domingo, Adeesh Kolluru, Ammar Rizvi, Nima Shoghi, Anuroop Sriram, Felix Therrien, Jehad Abed, Oleksandr Voznyy, Edward H. Sargent, Zachary Ulissi, C. Lawrence Zitnick
The development of machine learning models for electrocatalysts requires a broad set of training data to enable their use across a wide variety of materials.
no code implementations • 6 Apr 2022 • Johannes Gasteiger, Muhammed Shuaibi, Anuroop Sriram, Stephan Günnemann, Zachary Ulissi, C. Lawrence Zitnick, Abhishek Das
This work investigates this question by first developing the GemNet-OC model based on the large Open Catalyst 2020 (OC20) dataset.
Ranked #1 on Initial Structure to Relaxed Energy (IS2RE) on OC20
1 code implementation • ICLR 2022 • Anuroop Sriram, Abhishek Das, Brandon M. Wood, Siddharth Goyal, C. Lawrence Zitnick
Recent progress in Graph Neural Networks (GNNs) for modeling atomic simulations has the potential to revolutionize catalyst discovery, which is a key step in making progress towards the energy breakthroughs needed to combat climate change.
Ranked #2 on Initial Structure to Relaxed Energy (IS2RE) on OC20
1 code implementation • NeurIPS 2021 • Drew A. Hudson, C. Lawrence Zitnick
We introduce the GANformer2 model, an iterative object-oriented transformer, explored for the task of generative modeling.
1 code implementation • 17 Jun 2021 • Muhammed Shuaibi, Adeesh Kolluru, Abhishek Das, Aditya Grover, Anuroop Sriram, Zachary Ulissi, C. Lawrence Zitnick
We introduce a novel approach to modeling angular information between sets of neighboring atoms in a graph neural network.
Ranked #3 on Initial Structure to Relaxed Energy (IS2RE) on OC20
Graph Neural Network Initial Structure to Relaxed Energy (IS2RE)
no code implementations • 2 Mar 2021 • Weihua Hu, Muhammed Shuaibi, Abhishek Das, Siddharth Goyal, Anuroop Sriram, Jure Leskovec, Devi Parikh, C. Lawrence Zitnick
By not imposing explicit physical constraints, we can flexibly design expressive models while maintaining their computational efficiency.
2 code implementations • 1 Mar 2021 • Drew A. Hudson, C. Lawrence Zitnick
We introduce the GANformer, a novel and efficient type of transformer, and explore it for the task of visual generative modeling.
Ranked #1 on Image Generation on FFHQ (FID-10k-training-steps metric)
1 code implementation • ICLR 2021 • Songwei Ge, Vedanuj Goswami, C. Lawrence Zitnick, Devi Parikh
Sketching or doodling is a popular creative activity that people engage in.
5 code implementations • 20 Oct 2020 • Lowik Chanussot, Abhishek Das, Siddharth Goyal, Thibaut Lavril, Muhammed Shuaibi, Morgane Riviere, Kevin Tran, Javier Heras-Domingo, Caleb Ho, Weihua Hu, Aini Palizhati, Anuroop Sriram, Brandon Wood, Junwoong Yoon, Devi Parikh, C. Lawrence Zitnick, Zachary Ulissi
Catalyst discovery and optimization is key to solving many societal and energy challenges including solar fuels synthesis, long-term energy storage, and renewable fertilizer production.
no code implementations • 14 Oct 2020 • C. Lawrence Zitnick, Lowik Chanussot, Abhishek Das, Siddharth Goyal, Javier Heras-Domingo, Caleb Ho, Weihua Hu, Thibaut Lavril, Aini Palizhati, Morgane Riviere, Muhammed Shuaibi, Anuroop Sriram, Kevin Tran, Brandon Wood, Junwoong Yoon, Devi Parikh, Zachary Ulissi
As we increase our reliance on renewable energy sources such as wind and solar, which produce intermittent power, storage is needed to transfer power from times of peak generation to peak demand.
1 code implementation • Proceedings of the National Academy of Sciences 2020 • Alexander Rives, Joshua Meier, Tom Sercu, Siddharth Goyal, Zeming Lin, Demi Guo, Myle Ott, C. Lawrence Zitnick, Jerry Ma, Rob Fergus
In the field of artificial intelligence, a combination of scale in data and model capacity enabled by unsupervised learning has led to major advances in representation learning and statistical generation.
no code implementations • 15 May 2020 • Devi Parikh, C. Lawrence Zitnick
As a first step towards studying the ability of human crowds and machines to effectively co-create, we explore several human-only collaborative co-creation scenarios.
3 code implementations • 14 Apr 2020 • Anuroop Sriram, Jure Zbontar, Tullie Murrell, Aaron Defazio, C. Lawrence Zitnick, Nafissa Yakubova, Florian Knoll, Patricia Johnson
The slow acquisition speed of magnetic resonance imaging (MRI) has led to the development of two complementary methods: acquiring multiple views of the anatomy simultaneously (parallel imaging) and acquiring fewer samples than necessary for traditional signal processing methods (compressed sensing).
Ranked #1 on MRI Reconstruction on fastMRI Knee 4x
1 code implementation • 6 Jan 2020 • Florian Knoll, Tullie Murrell, Anuroop Sriram, Nafissa Yakubova, Jure Zbontar, Michael Rabbat, Aaron Defazio, Matthew J. Muckley, Daniel K. Sodickson, C. Lawrence Zitnick, Michael P. Recht
Conclusion: The challenge led to new developments in machine learning for image reconstruction, provided insight into the current state of the art in the field, and highlighted remaining hurdles for clinical adoption.
1 code implementation • CVPR 2020 • Anuroop Sriram, Jure Zbontar, Tullie Murrell, C. Lawrence Zitnick, Aaron Defazio, Daniel K. Sodickson
In this paper, we present a novel method to integrate traditional parallel imaging methods into deep neural networks that is able to generate high quality reconstructions even for high acceleration factors.
1 code implementation • 22 Jul 2019 • Arthur Szlam, Jonathan Gray, Kavya Srinet, Yacine Jernite, Armand Joulin, Gabriel Synnaeve, Douwe Kiela, Haonan Yu, Zhuoyuan Chen, Siddharth Goyal, Demi Guo, Danielle Rothermel, C. Lawrence Zitnick, Jason Weston
In this document we describe a rationale for a research program aimed at building an open "assistant" in the game Minecraft, in order to make progress on the problems of natural language understanding and learning from dialogue.
3 code implementations • 19 Jul 2019 • Jonathan Gray, Kavya Srinet, Yacine Jernite, Haonan Yu, Zhuoyuan Chen, Demi Guo, Siddharth Goyal, C. Lawrence Zitnick, Arthur Szlam
This paper describes an implementation of a bot assistant in Minecraft, and the tools and platform allowing players to interact with the bot and to record those interactions.
1 code implementation • 12 Feb 2019 • Yuandong Tian, Jerry Ma, Qucheng Gong, Shubho Sengupta, Zhuoyuan Chen, James Pinkerton, C. Lawrence Zitnick
The AlphaGo, AlphaGo Zero, and AlphaZero series of algorithms are remarkable demonstrations of deep reinforcement learning's capabilities, achieving superhuman performance in the complex game of Go with progressively increasing autonomy.
12 code implementations • 21 Nov 2018 • Jure Zbontar, Florian Knoll, Anuroop Sriram, Tullie Murrell, Zhengnan Huang, Matthew J. Muckley, Aaron Defazio, Ruben Stern, Patricia Johnson, Mary Bruno, Marc Parente, Krzysztof J. Geras, Joe Katsnelson, Hersh Chandarana, Zizhao Zhang, Michal Drozdzal, Adriana Romero, Michael Rabbat, Pascal Vincent, Nafissa Yakubova, James Pinkerton, Duo Wang, Erich Owens, C. Lawrence Zitnick, Michael P. Recht, Daniel K. Sodickson, Yvonne W. Lui
Accelerating Magnetic Resonance Imaging (MRI) by taking fewer measurements has the potential to reduce medical costs, minimize stress to patients and make MRI possible in applications where it is currently prohibitively slow or expensive.
2 code implementations • NeurIPS 2017 • Yuandong Tian, Qucheng Gong, Wenling Shang, Yuxin Wu, C. Lawrence Zitnick
In addition, our platform is flexible in terms of environment-agent communication topologies, choices of RL methods, changes in game parameters, and can host existing C/C++-based game environments like Arcade Learning Environment.
5 code implementations • ICCV 2017 • Justin Johnson, Bharath Hariharan, Laurens van der Maaten, Judy Hoffman, Li Fei-Fei, C. Lawrence Zitnick, Ross Girshick
Existing methods for visual reasoning attempt to directly map inputs to outputs using black-box architectures without explicitly modeling the underlying reasoning processes.
Ranked #5 on Visual Question Answering (VQA) on CLEVR-Humans
5 code implementations • CVPR 2017 • Justin Johnson, Bharath Hariharan, Laurens van der Maaten, Li Fei-Fei, C. Lawrence Zitnick, Ross Girshick
When building artificial intelligence systems that can reason and answer questions about visual data, we need diagnostic tests to analyze our progress and discover shortcomings.
no code implementations • 31 Aug 2016 • C. Lawrence Zitnick, Aishwarya Agrawal, Stanislaw Antol, Margaret Mitchell, Dhruv Batra, Devi Parikh
As machines have become more intelligent, there has been a renewed interest in methods for measuring their intelligence.
no code implementations • 17 Jun 2016 • Abhishek Das, Harsh Agrawal, C. Lawrence Zitnick, Devi Parikh, Dhruv Batra
We conduct large-scale studies on `human attention' in Visual Question Answering (VQA) to understand where humans choose to look to answer questions about images.
no code implementations • EMNLP 2016 • Abhishek Das, Harsh Agrawal, C. Lawrence Zitnick, Devi Parikh, Dhruv Batra
We conduct large-scale studies on `human attention' in Visual Question Answering (VQA) to understand where humans choose to look to answer questions about images.
1 code implementation • NAACL 2016 • Ting-Hao, Huang, Francis Ferraro, Nasrin Mostafazadeh, Ishan Misra, Aishwarya Agrawal, Jacob Devlin, Ross Girshick, Xiaodong He, Pushmeet Kohli, Dhruv Batra, C. Lawrence Zitnick, Devi Parikh, Lucy Vanderwende, Michel Galley, Margaret Mitchell
We introduce the first dataset for sequential vision-to-language, and explore how this data may be used for the task of visual storytelling.
no code implementations • 28 Mar 2016 • Ishan Misra, C. Lawrence Zitnick, Martial Hebert
With this simple task and no semantic labels, we learn a powerful visual representation using a Convolutional Neural Network (CNN).
Ranked #48 on Self-Supervised Action Recognition on HMDB51
no code implementations • CVPR 2016 • Ishan Misra, C. Lawrence Zitnick, Margaret Mitchell, Ross Girshick
When human annotators are given a choice about what to label in an image, they apply their own subjective judgments on what to ignore and what to mention.
no code implementations • CVPR 2016 • Arjun Chandrasekaran, Ashwin K. Vijayakumar, Stanislaw Antol, Mohit Bansal, Dhruv Batra, C. Lawrence Zitnick, Devi Parikh
We collect two datasets of abstract scenes that facilitate the study of humor at both the scene-level and the object-level.
no code implementations • CVPR 2016 • Sean Bell, C. Lawrence Zitnick, Kavita Bala, Ross Girshick
In this paper we present the Inside-Outside Net (ION), an object detector that exploits information both inside and outside the region of interest.
Ranked #230 on Object Detection on COCO test-dev
no code implementations • ICCV 2015 • Ramakrishna Vedantam, Xiao Lin, Tanmay Batra, C. Lawrence Zitnick, Devi Parikh
We show that the commonsense knowledge we learn is complementary to what can be learnt from sources of text.
no code implementations • NeurIPS 2015 • Fereshteh Sadeghi, C. Lawrence Zitnick, Ali Farhadi
In this paper, we study the problem of answering visual analogy questions.
no code implementations • CVPR 2015 • Xinlei Chen, C. Lawrence Zitnick
Results are better than or comparable to state-of-the-art results on the image and sentence retrieval tasks for methods using similar visual features.
1 code implementation • 17 May 2015 • Jacob Devlin, Saurabh Gupta, Ross Girshick, Margaret Mitchell, C. Lawrence Zitnick
We explore a variety of nearest neighbor baseline approaches for image captioning.
21 code implementations • ICCV 2015 • Aishwarya Agrawal, Jiasen Lu, Stanislaw Antol, Margaret Mitchell, C. Lawrence Zitnick, Dhruv Batra, Devi Parikh
Given an image and a natural language question about the image, the task is to provide an accurate natural language answer.
18 code implementations • 1 Apr 2015 • Xinlei Chen, Hao Fang, Tsung-Yi Lin, Ramakrishna Vedantam, Saurabh Gupta, Piotr Dollar, C. Lawrence Zitnick
In this paper we describe the Microsoft COCO Caption dataset and evaluation server.
24 code implementations • CVPR 2015 • Ramakrishna Vedantam, C. Lawrence Zitnick, Devi Parikh
We propose a novel paradigm for evaluating image descriptions that uses human consensus.
no code implementations • 20 Nov 2014 • Xinlei Chen, C. Lawrence Zitnick
Results are better than or comparable to state-of-the-art results on the image and sentence retrieval tasks for methods using similar visual features.
1 code implementation • CVPR 2015 • Hao Fang, Saurabh Gupta, Forrest Iandola, Rupesh Srivastava, Li Deng, Piotr Dollár, Jianfeng Gao, Xiaodong He, Margaret Mitchell, John C. Platt, C. Lawrence Zitnick, Geoffrey Zweig
The language model learns from a set of over 400, 000 image descriptions to capture the statistics of word usage.
Ranked #1 on Image Captioning on COCO Captions test
no code implementations • 12 Nov 2014 • Ramakrishna Vedantam, C. Lawrence Zitnick, Devi Parikh
We describe our two new datasets with images described by humans.
no code implementations • 20 Jun 2014 • Piotr Dollár, C. Lawrence Zitnick
We formulate the problem of predicting local edge masks in a structured learning framework applied to random decision forests.
37 code implementations • 1 May 2014 • Tsung-Yi Lin, Michael Maire, Serge Belongie, Lubomir Bourdev, Ross Girshick, James Hays, Pietro Perona, Deva Ramanan, C. Lawrence Zitnick, Piotr Dollár
We present a new dataset with the goal of advancing the state-of-the-art in object recognition by placing the question of object recognition in the context of the broader question of scene understanding.