1 code implementation • 27 Feb 2023 • Shaohan Huang, Li Dong, Wenhui Wang, Yaru Hao, Saksham Singhal, Shuming Ma, Tengchao Lv, Lei Cui, Owais Khan Mohammed, Barun Patra, Qiang Liu, Kriti Aggarwal, Zewen Chi, Johan Bjorck, Vishrav Chaudhary, Subhojit Som, Xia Song, Furu Wei
A big convergence of language, multimodal perception, action, and world modeling is a key step toward artificial general intelligence.
Ranked #2 on Image Captioning on Flickr30k Captions test (CIDEr metric)
no code implementations • CVPR 2023 • Wenhui Wang, Hangbo Bao, Li Dong, Johan Bjorck, Zhiliang Peng, Qiang Liu, Kriti Aggarwal, Owais Khan Mohammed, Saksham Singhal, Subhojit Som, Furu Wei
A big convergence of language, vision, and multimodal pretraining is emerging.
1 code implementation • 22 Aug 2022 • Wenhui Wang, Hangbo Bao, Li Dong, Johan Bjorck, Zhiliang Peng, Qiang Liu, Kriti Aggarwal, Owais Khan Mohammed, Saksham Singhal, Subhojit Som, Furu Wei
A big convergence of language, vision, and multimodal pretraining is emerging.
Ranked #1 on Zero-Shot Cross-Modal Retrieval on Flickr30k
no code implementations • ICLR 2022 • Johan Bjorck, Carla P. Gomes, Kilian Q. Weinberger
In this paper, we investigate causes for this perceived instability.
no code implementations • NeurIPS 2021 • Johan Bjorck, Carla P. Gomes, Kilian Q. Weinberger
In this paper, we investigate how RL agents are affected by replacing the small MLPs with larger modern networks with skip connections and normalization, focusing specifically on actor-critic algorithms.
no code implementations • 26 Feb 2021 • Johan Bjorck, Xiangyu Chen, Christopher De Sa, Carla P. Gomes, Kilian Q. Weinberger
Low-precision training has become a popular approach to reduce compute requirements, memory footprint, and energy consumption in supervised learning.
no code implementations • 1 Jan 2021 • Johan Bjorck, Carla P. Gomes
Neural networks are known to be data-hungry, and collecting large labeled datasets is often a crucial step in deep learning deployment.
no code implementations • 27 Dec 2020 • Johan Bjorck, Kilian Weinberger, Carla Gomes
We also show how the growth of network weights is heavily influenced by the dataset and its generalization properties.
no code implementations • 25 Sep 2019 • Johan Bjorck, Carla Gomes, Kilian Weinberger
Non-negative matrix factorization (NMF) is a highly celebrated algorithm for matrix decomposition that guarantees strictly non-negative factors.
no code implementations • 25 Feb 2019 • Johan Bjorck, Brendan H. Rappazzo, Di Chen, Richard Bernstein, Peter H. Wrege, Carla P. Gomes
In this work, we consider applying machine learning to the analysis and compression of audio signals in the context of monitoring elephants in sub-Saharan Africa.
no code implementations • NeurIPS 2018 • Johan Bjorck, Carla Gomes, Bart Selman, Kilian Q. Weinberger
Batch normalization (BN) is a technique to normalize activations in intermediate layers of deep neural networks.
no code implementations • 18 Nov 2017 • Johan Bjorck, Yiwei Bai, Xiaojian Wu, Yexiang Xue, Mark C. Whitmore, Carla Gomes
Cascades represent rapid changes in networks.
1 code implementation • 3 Oct 2016 • Yexiang Xue, Junwen Bai, Ronan Le Bras, Brendan Rappazzo, Richard Bernstein, Johan Bjorck, Liane Longpre, Santosh K. Suram, Robert B. van Dover, John Gregoire, Carla P. Gomes
A key problem in materials discovery, the phase map identification problem, involves the determination of the crystal phase diagram from the materials' composition and structural characterization data.