Search Results for author: Haozhi Qi

Found 15 papers, 10 papers with code

Twisting Lids Off with Two Hands

no code implementations • 4 Mar 2024 • Toru Lin, Zhao-Heng Yin, Haozhi Qi, Pieter Abbeel, Jitendra Malik

Manipulating objects with two multi-fingered hands has been a long-standing challenge in robotics, attributed to the contact-rich nature of many manipulation tasks and the complexity inherent in coordinating a high-dimensional bimanual system.

reinforcement-learning

Paper
Add Code

Neural feels with neural fields: Visuo-tactile perception for in-hand manipulation

no code implementations • 20 Dec 2023 • Sudharshan Suresh, Haozhi Qi, Tingfan Wu, Taosha Fan, Luis Pineda, Mike Lambeta, Jitendra Malik, Mrinal Kalakrishnan, Roberto Calandra, Michael Kaess, Joseph Ortiz, Mustafa Mukadam

Our neural representation driven by multimodal sensing can serve as a perception backbone towards advancing robot dexterity.

Benchmarking

Paper
Add Code

General In-Hand Object Rotation with Vision and Touch

no code implementations • 18 Sep 2023 • Haozhi Qi, Brent Yi, Sudharshan Suresh, Mike Lambeta, Yi Ma, Roberto Calandra, Jitendra Malik

We introduce RotateIt, a system that enables fingertip-based object rotation along multiple axes by leveraging multimodal sensory inputs.

Object

Paper
Add Code

In-Hand Object Rotation via Rapid Motor Adaptation

1 code implementation • 10 Oct 2022 • Haozhi Qi, Ashish Kumar, Roberto Calandra, Yi Ma, Jitendra Malik

Generalized in-hand manipulation has long been an unsolved challenge of robotics.

Object Reinforcement Learning (RL)

Paper
Code

Coupling Vision and Proprioception for Navigation of Legged Robots

no code implementations • CVPR 2022 • Zipeng Fu, Ashish Kumar, Ananye Agarwal, Haozhi Qi, Jitendra Malik, Deepak Pathak

A safety advisor module adds sensed unexpected obstacles to the occupancy map and environment-determined speed limits to the velocity command generator.

Paper
Add Code

ReduNet: A White-box Deep Network from the Principle of Maximizing Rate Reduction

2 code implementations • 21 May 2021 • Kwan Ho Ryan Chan, Yaodong Yu, Chong You, Haozhi Qi, John Wright, Yi Ma

This work attempts to provide a plausible theoretical framework that aims to interpret modern deep (convolutional) networks from the principles of data compression and discriminative representation.

Data Compression

529

Paper
Code

Deep Networks from the Principle of Rate Reduction

3 code implementations • 27 Oct 2020 • Kwan Ho Ryan Chan, Yaodong Yu, Chong You, Haozhi Qi, John Wright, Yi Ma

The layered architectures, linear and nonlinear operators, and even parameters of the network are all explicitly constructed layer-by-layer in a forward propagation fashion by emulating the gradient scheme.

529

Paper
Code

Learning Long-term Visual Dynamics with Region Proposal Interaction Networks

1 code implementation • ICLR 2021 • Haozhi Qi, Xiaolong Wang, Deepak Pathak, Yi Ma, Jitendra Malik

Learning long-term dynamics models is the key to understanding physical common sense.

Ranked #1 on Visual Reasoning on PHYRE-1B-Within

Common Sense Reasoning Object +2

110

Paper
Code

From A Glance to "Gotcha": Interactive Facial Image Retrieval with Progressive Relevance Feedback

no code implementations • 30 Jul 2020 • Xinru Yang, Haozhi Qi, Mingyang Li, Alexander Hauptmann

Facial image retrieval plays a significant role in forensic investigations where an untrained witness tries to identify a suspect from a massive pool of images.

Face Image Retrieval Retrieval

Paper
Add Code

Deep Isometric Learning for Visual Recognition

1 code implementation • ICML 2020 • Haozhi Qi, Chong You, Xiaolong Wang, Yi Ma, Jitendra Malik

Initialization, normalization, and skip connections are believed to be three indispensable techniques for training very deep convolutional neural networks and obtaining state-of-the-art performance.

142

Paper
Code

NeurVPS: Neural Vanishing Point Scanning via Conic Convolution

1 code implementation • NeurIPS 2019 • Yichao Zhou, Haozhi Qi, Jingwei Huang, Yi Ma

We present a simple yet effective end-to-end trainable deep network with geometry-inspired convolutional operators for detecting vanishing points in images.

Camera Calibration

171

Paper
Code

Learning to Reconstruct 3D Manhattan Wireframes from a Single Image

2 code implementations • ICCV 2019 • Yichao Zhou, Haozhi Qi, Yuexiang Zhai, Qi Sun, Zhili Chen, Li-Yi Wei, Yi Ma

In this paper, we propose a method to obtain a compact and accurate 3D wireframe representation from a single image by effectively exploiting global structural regularities.

171

Paper
Code

End-to-End Wireframe Parsing

1 code implementation • ICCV 2019 • Yichao Zhou, Haozhi Qi, Yi Ma

We conduct extensive experiments and show that our method significantly outperforms the previous state-of-the-art wireframe and line extraction algorithms.

Ranked #5 on Line Segment Detection on wireframe dataset

Line Segment Detection Wireframe Parsing

476

Paper
Code

Deformable Convolutional Networks

38 code implementations • ICCV 2017 • Jifeng Dai, Haozhi Qi, Yuwen Xiong, Yi Li, Guodong Zhang, Han Hu, Yichen Wei

Convolutional neural networks (CNNs) are inherently limited to model geometric transformations due to the fixed geometric structures in its building modules.

Ranked #3 on Vessel Detection on Vessel detection Dateset