9 Oct 2021 • Sarah Di, Robin Yu, Amol Kapoor
Any general artificial intelligence system must be able to interpret, operate on, and produce data in a multi-modal latent space that can represent audio, imagery, text, and more.