Search Results for author: Benjamin Graham

Found 24 papers, 18 papers with code

Sparse arrays of signatures for online character recognition

no code implementations 1 Aug 2013 Benjamin Graham

We show that the path signature, used as a set of features for consumption by a convolutional neural network (CNN), improves the accuracy of online character recognition, that is, the task of reading characters represented as a collection of paths.

Spatially-sparse convolutional neural networks

3 code implementations 22 Sep 2014 Benjamin Graham

Convolutional neural networks (CNNs) perform well on problems such as handwriting recognition and image classification.

Ranked #144 on Image Classification on CIFAR-100 (using extra training data)

Handwriting Recognition Image Classification

Fractional Max-Pooling

5 code implementations 18 Dec 2014 Benjamin Graham

However, if you simply alternate convolutional layers with max-pooling layers, performance is limited due to the rapid reduction in spatial size, and the disjoint nature of the pooling regions.

Image Classification

Confusing Deep Convolution Networks by Relabelling

no code implementations 23 Oct 2015 Leigh Robinson, Benjamin Graham

Deep convolutional neural networks have become the gold standard for image recognition tasks, demonstrating many current state-of-the-art results and even achieving near-human level performance on some tasks.

Low-Precision Batch-Normalized Activations

no code implementations 27 Feb 2017 Benjamin Graham

Artificial neural networks can be trained with relatively low-precision floating-point and fixed-point arithmetic, using between one and 16 bits.

Quantization

Submanifold Sparse Convolutional Networks

6 code implementations 5 Jun 2017 Benjamin Graham, Laurens van der Maaten

Convolutional networks are the de facto standard for analysing spatio-temporal data such as images, videos, 3D shapes, etc.

3D Part Segmentation

The iisignature library: efficient calculation of iterated-integral signatures and log signatures

2 code implementations 22 Feb 2018 Jeremy Reizenstein, Benjamin Graham

Iterated-integral signatures and log signatures are vectors calculated from a path that characterise its shape.

Data Structures and Algorithms Mathematical Software Rings and Algebras

Unsupervised learning with sparse space-and-time autoencoders

1 code implementation 26 Nov 2018 Benjamin Graham

We use spatially-sparse two-, three- and four-dimensional convolutional autoencoder networks to model sparse structures in 2D space, 3D space, and 3+1=4 dimensional space-time.

Handwriting Recognition Segmentation

C3DPO: Canonical 3D Pose Networks for Non-Rigid Structure From Motion

2 code implementations ICCV 2019 David Novotny, Nikhila Ravi, Benjamin Graham, Natalia Neverova, Andrea Vedaldi

We propose C3DPO, a method for extracting 3D models of deformable objects from 2D keypoint annotations in unconstrained images.

Training with Quantization Noise for Extreme Model Compression

4 code implementations ICLR 2021 Angela Fan, Pierre Stock, Benjamin Graham, Edouard Grave, Remi Gribonval, Herve Jegou, Armand Joulin

A standard solution is to train networks with Quantization Aware Training, where the weights are quantized during training and the gradients approximated with the Straight-Through Estimator.

Image Generation Model Compression

RidgeSfM: Structure from Motion via Robust Pairwise Matching Under Depth Uncertainty

1 code implementation 20 Nov 2020 Benjamin Graham, David Novotny

Using a set of high-quality sparse keypoint matches, we optimize over the per-frame linear combinations of depth planes and camera poses to form a geometrically consistent cloud of keypoints.

Exploring Data-Efficient 3D Scene Understanding with Contrastive Scene Contexts

2 code implementations CVPR 2021 Ji Hou, Benjamin Graham, Matthias Nießner, Saining Xie

The rapid progress in 3D scene understanding has come with growing demand for data; however, collecting and annotating 3D scenes (e.g. point clouds) are notoriously hard.

3D Semantic Segmentation Instance Segmentation +2

Pri3D: Can 3D Priors Help 2D Representation Learning?

1 code implementation ICCV 2021 Ji Hou, Saining Xie, Benjamin Graham, Angela Dai, Matthias Nießner

Inspired by these advances in geometric understanding, we aim to imbue image-based perception with representations learned under geometric constraints.

Contrastive Learning Instance Segmentation +5

DensePose 3D: Lifting Canonical Surface Maps of Articulated Objects to the Third Dimension

no code implementations ICCV 2021 Roman Shapovalov, David Novotny, Benjamin Graham, Patrick Labatut, Andrea Vedaldi

The method learns, in an end-to-end fashion, a soft partition of a given category-specific 3D template mesh into rigid parts together with a monocular reconstruction network that predicts the part motions such that they reproject correctly onto 2D DensePose-like surface annotations of the object.

3D Reconstruction Monocular Reconstruction +1

KeyTr: Keypoint Transporter for 3D Reconstruction of Deformable Objects in Videos

no code implementations CVPR 2022 David Novotny, Ignacio Rocco, Samarth Sinha, Alexandre Carlier, Gael Kerchenbaum, Roman Shapovalov, Nikita Smetanin, Natalia Neverova, Benjamin Graham, Andrea Vedaldi

Compared to weaker deformation models, this significantly reduces the reconstruction ambiguity and, for dynamic objects, allows Keypoint Transporter to obtain reconstructions of quality superior or at least comparable to prior approaches while being much faster and relying on a pre-trained monocular depth estimator network.

3D Reconstruction Depth Estimation +2

Self-Supervised Correspondence Estimation via Multiview Registration

1 code implementation 6 Dec 2022 Mohamed El Banani, Ignacio Rocco, David Novotny, Andrea Vedaldi, Natalia Neverova, Justin Johnson, Benjamin Graham

To address this, we propose a self-supervised approach for correspondence estimation that learns from multiview consistency in short RGB-D video sequences.

Real-time volumetric rendering of dynamic humans

1 code implementation 21 Mar 2023 Ignacio Rocco, Iurii Makarov, Filippos Kokkinos, David Novotny, Benjamin Graham, Natalia Neverova, Andrea Vedaldi

We present a method for fast 3D reconstruction and real-time rendering of dynamic humans from monocular videos with accompanying parametric body fits.

3D Reconstruction

DynamicStereo: Consistent Dynamic Depth from Stereo Videos

1 code implementation CVPR 2023 Nikita Karaev, Ignacio Rocco, Benjamin Graham, Natalia Neverova, Andrea Vedaldi, Christian Rupprecht

The network learns to pool information from neighboring frames to improve the temporal consistency of its predictions.

CoTracker: It is Better to Track Together

1 code implementation 14 Jul 2023 Nikita Karaev, Ignacio Rocco, Benjamin Graham, Natalia Neverova, Andrea Vedaldi, Christian Rupprecht

We introduce CoTracker, a transformer-based model that tracks dense points in a frame jointly across a video sequence.

Motion Prediction Object Tracking +2
