This is a collection of (mostly) pen-and-paper exercises in machine learning.
In our model, medical text annotation is introduced to compensate for the quality deficiency in image data.
The ability to separate signal from noise, and reason with clean abstractions, is critical to intelligence.
Human mobility data contains rich but abundant information, which yields to the comprehensive region embeddings for cross domain tasks.
We introduce Ivy, a templated Deep Learning (DL) framework which abstracts existing DL frameworks.
Attention-based models trained on protein sequences have demonstrated incredible success at classification and generation tasks relevant for artificial intelligence-driven protein design.
Based on this formulation, we implement the classical renderer by a scattering-based method and propose a two-stage neural renderer to fix the erroneous areas from the classical renderer.
We test language models on our forecasting task and find that performance is far below a human expert baseline.
3D object detection in autonomous driving aims to reason "what" and "where" the objects of interest present in a 3D world.
We present the Pathways Autoregressive Text-to-Image (Parti) model, which generates high-fidelity photorealistic images and supports content-rich synthesis involving complex compositions and world knowledge.
Ranked #1 on
Text-to-Image Generation
on COCO