Search Results for author: Benjamin Graham

Found 24 papers, 18 papers with code

Sparse arrays of signatures for online character recognition

no code implementations • 1 Aug 2013 • Benjamin Graham

We show that the path signature, used as a set of features for consumption by a convolutional neural network (CNN), improves the accuracy of online character recognition---that is the task of reading characters represented as a collection of paths.

Paper
Add Code

Spatially-sparse convolutional neural networks

3 code implementations • 22 Sep 2014 • Benjamin Graham

Convolutional neural networks (CNNs) perform well on problems such as handwriting recognition and image classification.

Ranked #144 on Image Classification on CIFAR-100 (using extra training data)

Handwriting Recognition Image Classification

1,990

Paper
Code

Fractional Max-Pooling

5 code implementations • 18 Dec 2014 • Benjamin Graham

However, if you simply alternate convolutional layers with max-pooling layers, performance is limited due to the rapid reduction in spatial size, and the disjoint nature of the pooling regions.

Ranked #17 on Image Classification on MNIST

Image Classification

1,990

Paper
Code

Confusing Deep Convolution Networks by Relabelling

no code implementations • 23 Oct 2015 • Leigh Robinson, Benjamin Graham

Deep convolutional neural networks have become the gold standard for image recognition tasks, demonstrating many current state-of-the-art results and even achieving near-human level performance on some tasks.

Paper
Add Code

Low-Precision Batch-Normalized Activations

no code implementations • 27 Feb 2017 • Benjamin Graham

Artificial neural networks can be trained with relatively low-precision floating-point and fixed-point arithmetic, using between one and 16 bits.

Quantization

Paper
Add Code

Submanifold Sparse Convolutional Networks

6 code implementations • 5 Jun 2017 • Benjamin Graham, Laurens van der Maaten

Convolutional network are the de-facto standard for analysing spatio-temporal data such as images, videos, 3D shapes, etc.

Ranked #34 on 3D Part Segmentation on ShapeNet-Part

3D Part Segmentation

1,990

Paper
Code

Large-Scale 3D Shape Reconstruction and Segmentation from ShapeNet Core55

1 code implementation • 17 Oct 2017 • Li Yi, Lin Shao, Manolis Savva, Haibin Huang, Yang Zhou, Qirui Wang, Benjamin Graham, Martin Engelcke, Roman Klokov, Victor Lempitsky, Yuan Gan, Pengyu Wang, Kun Liu, Fenggen Yu, Panpan Shui, Bingyang Hu, Yan Zhang, Yangyan Li, Rui Bu, Mingchao Sun, Wei Wu, Minki Jeong, Jaehoon Choi, Changick Kim, Angom Geetchandra, Narasimha Murthy, Bhargava Ramu, Bharadwaj Manda, M. Ramanathan, Gautam Kumar, P Preetham, Siddharth Srivastava, Swati Bhugra, Brejesh lall, Christian Haene, Shubham Tulsiani, Jitendra Malik, Jared Lafer, Ramsey Jones, Siyuan Li, Jie Lu, Shi Jin, Jingyi Yu, Qi-Xing Huang, Evangelos Kalogerakis, Silvio Savarese, Pat Hanrahan, Thomas Funkhouser, Hao Su, Leonidas Guibas

We introduce a large-scale 3D shape understanding benchmark using data and annotation from ShapeNet 3D object database.

3D Part Segmentation 3D Reconstruction +1

1,990

Paper
Code

3D Semantic Segmentation with Submanifold Sparse Convolutional Networks

5 code implementations • CVPR 2018 • Benjamin Graham, Martin Engelcke, Laurens van der Maaten

Submanifold sparse convolutional networks

Ranked #5 on 3D Semantic Segmentation on SensatUrban

3D Semantic Segmentation

1,990

Paper
Code

The iisignature library: efficient calculation of iterated-integral signatures and log signatures

2 code implementations • 22 Feb 2018 • Jeremy Reizenstein, Benjamin Graham

Iterated-integral signatures and log signatures are vectors calculated from a path that characterise its shape.

Data Structures and Algorithms Mathematical Software Rings and Algebras

Paper
Code

Unsupervised learning with sparse space-and-time autoencoders

1 code implementation • 26 Nov 2018 • Benjamin Graham

We use spatially-sparse two, three and four dimensional convolutional autoencoder networks to model sparse structures in 2D space, 3D space, and 3+1=4 dimensional space-time.

Handwriting Recognition Segmentation

1,990

Paper
Code

Equi-normalization of Neural Networks

1 code implementation • ICLR 2019 • Pierre Stock, Benjamin Graham, Rémi Gribonval, Hervé Jégou

Modern neural networks are over-parametrized.

114

Paper
Code

And the Bit Goes Down: Revisiting the Quantization of Neural Networks

3 code implementations • ICLR 2020 • Pierre Stock, Armand Joulin, Rémi Gribonval, Benjamin Graham, Hervé Jégou

In this paper, we address the problem of reducing the memory footprint of convolutional network architectures.

Quantization

634

Paper
Code

C3DPO: Canonical 3D Pose Networks for Non-Rigid Structure From Motion

2 code implementations • ICCV 2019 • David Novotny, Nikhila Ravi, Benjamin Graham, Natalia Neverova, Andrea Vedaldi

We propose C3DPO, a method for extracting 3D models of deformable objects from 2D keypoint annotations in unconstrained images.

314

Paper
Code

Training with Quantization Noise for Extreme Model Compression

4 code implementations • ICLR 2021 • Angela Fan, Pierre Stock, Benjamin Graham, Edouard Grave, Remi Gribonval, Herve Jegou, Armand Joulin

A standard solution is to train networks with Quantization Aware Training, where the weights are quantized during training and the gradients approximated with the Straight-Through Estimator.

Image Generation Model Compression

29,201

Paper
Code

3D Multi-bodies: Fitting Sets of Plausible 3D Human Models to Ambiguous Image Data

no code implementations • NeurIPS 2020 • Benjamin Biggs, Sébastien Ehrhadt, Hanbyul Joo, Benjamin Graham, Andrea Vedaldi, David Novotny

We consider the problem of obtaining dense 3D reconstructions of humans from single and partially occluded views.

3D Reconstruction

Paper
Add Code

RidgeSfM: Structure from Motion via Robust Pairwise Matching Under Depth Uncertainty

1 code implementation • 20 Nov 2020 • Benjamin Graham, David Novotny

Using a set of high-quality sparse keypoint matches, we optimize over the per-frame linear combinations of depth planes and camera poses to form a geometrically consistent cloud of keypoints.

112

Paper
Code

Exploring Data-Efficient 3D Scene Understanding with Contrastive Scene Contexts

2 code implementations • CVPR 2021 • Ji Hou, Benjamin Graham, Matthias Nießner, Saining Xie

The rapid progress in 3D scene understanding has come with growing demand for data; however, collecting and annotating 3D scenes (e. g. point clouds) are notoriously hard.

Ranked #8 on 3D Semantic Segmentation on ScanNet200

3D Semantic Segmentation Instance Segmentation +2

1,108

Paper
Code

Pri3D: Can 3D Priors Help 2D Representation Learning?

1 code implementation • ICCV 2021 • Ji Hou, Saining Xie, Benjamin Graham, Angela Dai, Matthias Nießner

Inspired by these advances in geometric understanding, we aim to imbue image-based perception with representations learned under geometric constraints.

Contrastive Learning Instance Segmentation +5

147

Paper
Code

DensePose 3D: Lifting Canonical Surface Maps of Articulated Objects to the Third Dimension

no code implementations • ICCV 2021 • Roman Shapovalov, David Novotny, Benjamin Graham, Patrick Labatut, Andrea Vedaldi

The method learns, in an end-to-end fashion, a soft partition of a given category-specific 3D template mesh into rigid parts together with a monocular reconstruction network that predicts the part motions such that they reproject correctly onto 2D DensePose-like surface annotations of the object.

3D Reconstruction Monocular Reconstruction +1

Paper
Add Code

KeyTr: Keypoint Transporter for 3D Reconstruction of Deformable Objects in Videos

no code implementations • CVPR 2022 • David Novotny, Ignacio Rocco, Samarth Sinha, Alexandre Carlier, Gael Kerchenbaum, Roman Shapovalov, Nikita Smetanin, Natalia Neverova, Benjamin Graham, Andrea Vedaldi

Compared to weaker deformation models, this significantly reduces the reconstruction ambiguity and, for dynamic objects, allows Keypoint Transporter to obtain reconstructions of the quality superior or at least comparable to prior approaches while being much faster and reliant on a pre-trained monocular depth estimator network.

3D Reconstruction Depth Estimation +2

Paper
Add Code

Self-Supervised Correspondence Estimation via Multiview Registration

1 code implementation • 6 Dec 2022 • Mohamed El Banani, Ignacio Rocco, David Novotny, Andrea Vedaldi, Natalia Neverova, Justin Johnson, Benjamin Graham

To address this, we propose a self-supervised approach for correspondence estimation that learns from multiview consistency in short RGB-D video sequences.

Paper
Code

Real-time volumetric rendering of dynamic humans

1 code implementation • 21 Mar 2023 • Ignacio Rocco, Iurii Makarov, Filippos Kokkinos, David Novotny, Benjamin Graham, Natalia Neverova, Andrea Vedaldi

We present a method for fast 3D reconstruction and real-time rendering of dynamic humans from monocular videos with accompanying parametric body fits.

3D Reconstruction

2,396

Paper
Code