1 code implementation • 1 Feb 2024 • Toni J. B. Liu, Nicolas Boullé, Raphaël Sarfati, Christopher J. Earls
Pretrained large language models (LLMs) are surprisingly effective at performing zero-shot tasks, including time-series forecasting.
1 code implementation • NeurIPS 2023 • Albert Tseng, Tao Yu, Toni J. B. Liu, Christopher De Sa
These networks rely heavily on the dot product attention operator, which computes the similarity between two points by taking their inner product.
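The inner-product similarity described above can be sketched as a minimal scaled dot-product attention in numpy; the function name, shapes, and scaling convention here are illustrative assumptions, not the paper's implementation:

```python
import numpy as np

def dot_product_attention(Q, K, V):
    # Similarity between each query and key is their inner product,
    # scaled by sqrt of the key dimension (a common stabilizing choice).
    scores = Q @ K.T / np.sqrt(K.shape[-1])
    # Softmax converts similarities into attention weights per query.
    weights = np.exp(scores - scores.max(axis=-1, keepdims=True))
    weights /= weights.sum(axis=-1, keepdims=True)
    # Each output row is a weighted combination of the value vectors.
    return weights @ V

rng = np.random.default_rng(0)
Q = rng.standard_normal((2, 4))  # 2 queries, dimension 4
K = rng.standard_normal((3, 4))  # 3 keys
V = rng.standard_normal((3, 4))  # 3 values
out = dot_product_attention(Q, K, V)
print(out.shape)  # (2, 4): one output vector per query
```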
1 code implementation • 24 May 2023 • Tao Yu, Toni J. B. Liu, Albert Tseng, Christopher De Sa
Specifically, we model partial orders as subset relations between shadows formed by a light source and opaque objects in hyperbolic space.