Search Results for author: Xingyu Liu

Found 35 papers, 17 papers with code

KeyPose: Multi-View 3D Labeling and Keypoint Estimation for Transparent Objects

1 code implementation • CVPR 2020 • Xingyu Liu, Rico Jonschkowski, Anelia Angelova, Kurt Konolige

We address two problems: first, we establish an easy method for capturing and labeling 3D keypoints on desktop objects with an RGB camera; and second, we develop a deep neural network, called $KeyPose$, that learns to accurately predict object poses using 3D keypoints, from stereo input, and works even for transparent objects.

3D Pose Estimation Keypoint Estimation +1

32,755


Ego4D: Around the World in 3,000 Hours of Egocentric Video

6 code implementations • CVPR 2022 • Kristen Grauman, Andrew Westbury, Eugene Byrne, Zachary Chavis, Antonino Furnari, Rohit Girdhar, Jackson Hamburger, Hao Jiang, Miao Liu, Xingyu Liu, Miguel Martin, Tushar Nagarajan, Ilija Radosavovic, Santhosh Kumar Ramakrishnan, Fiona Ryan, Jayant Sharma, Michael Wray, Mengmeng Xu, Eric Zhongcong Xu, Chen Zhao, Siddhant Bansal, Dhruv Batra, Vincent Cartillier, Sean Crane, Tien Do, Morrie Doulaty, Akshay Erapalli, Christoph Feichtenhofer, Adriano Fragomeni, Qichen Fu, Abrham Gebreselasie, Cristina Gonzalez, James Hillis, Xuhua Huang, Yifei Huang, Wenqi Jia, Weslie Khoo, Jachym Kolar, Satwik Kottur, Anurag Kumar, Federico Landini, Chao Li, Yanghao Li, Zhenqiang Li, Karttikeya Mangalam, Raghava Modhugu, Jonathan Munro, Tullie Murrell, Takumi Nishiyasu, Will Price, Paola Ruiz Puentes, Merey Ramazanova, Leda Sari, Kiran Somasundaram, Audrey Southerland, Yusuke Sugano, Ruijie Tao, Minh Vo, Yuchen Wang, Xindi Wu, Takuma Yagi, Ziwei Zhao, Yunyi Zhu, Pablo Arbelaez, David Crandall, Dima Damen, Giovanni Maria Farinella, Christian Fuegen, Bernard Ghanem, Vamsi Krishna Ithapu, C. V. Jawahar, Hanbyul Joo, Kris Kitani, Haizhou Li, Richard Newcombe, Aude Oliva, Hyun Soo Park, James M. Rehg, Yoichi Sato, Jianbo Shi, Mike Zheng Shou, Antonio Torralba, Lorenzo Torresani, Mingfei Yan, Jitendra Malik

We introduce Ego4D, a massive-scale egocentric video dataset and benchmark suite.

De-identification Ethics

4,978


FlowNet3D: Learning Scene Flow in 3D Point Clouds

10 code implementations • CVPR 2019 • Xingyu Liu, Charles R. Qi, Leonidas J. Guibas

In this work, we propose a novel deep neural network named $FlowNet3D$ that learns scene flow from point clouds in an end-to-end fashion.

Motion Segmentation

646


EIE: Efficient Inference Engine on Compressed Deep Neural Network

4 code implementations • 4 Feb 2016 • Song Han, Xingyu Liu, Huizi Mao, Jing Pu, Ardavan Pedram, Mark A. Horowitz, William J. Dally

EIE has a processing power of 102 GOPS/s working directly on a compressed network, corresponding to 3 TOPS/s on an uncompressed network, and processes FC layers of AlexNet at 1.88x10^4 frames/sec with a power dissipation of only 600 mW.

644


Efficient Sparse-Winograd Convolutional Neural Networks

1 code implementation • ICLR 2018 • Xingyu Liu, Jeff Pool, Song Han, William J. Dally

First, we move the ReLU operation into the Winograd domain to increase the sparsity of the transformed activations.

Network Pruning

187


MitoEM Dataset: Large-scale 3D Mitochondria Instance Segmentation from EM Images

1 code implementation • Medical Image Computing and Computer Assisted Intervention 2020 • Donglai Wei, Zudi Lin, Daniel Franco-Barranco, Nils Wendt, Xingyu Liu, Wenjie Yin, Xin Huang, Aarush Gupta, Won-Dong Jang, Xueying Wang, Ignacio Arganda-Carreras, Jeff Lichtman, Hanspeter Pfister

On MitoEM, we find existing instance segmentation methods often fail to correctly segment mitochondria with complex shapes or close contacts with other instances.

Ranked #2 on 3D Instance Segmentation on MitoEM (AP75-R-Test metric)

3D Instance Segmentation Segmentation +1

162


Learning Video Representations from Correspondence Proposals

2 code implementations • CVPR 2019 • Xingyu Liu, Joon-Young Lee, Hailin Jin

In particular, it can effectively learn representations for videos by mixing appearance and long-range motion with an RGB-only input.

Ranked #1 on Action Recognition In Videos on Jester (Gesture Recognition)

Action Recognition In Videos

146


MeteorNet: Deep Learning on Dynamic 3D Point Cloud Sequences

2 code implementations • ICCV 2019 • Xingyu Liu, Mengyuan Yan, Jeannette Bohg

Understanding dynamic 3D environments is crucial for robotic agents and many other applications.

Action Recognition Scene Flow Estimation +1

146


FinEval: A Chinese Financial Domain Knowledge Evaluation Benchmark for Large Language Models

1 code implementation • 19 Aug 2023 • Liwen Zhang, Weige Cai, Zhaowei Liu, Zhi Yang, Wei Dai, Yujie Liao, Qianru Qin, Yifei Li, Xingyu Liu, Zhiqiang Liu, Zhoufan Zhu, Anbo Wu, Xin Guo, Yun Chen

Our work offers a more comprehensive financial knowledge evaluation benchmark, utilizing data of mock exams and covering a wide range of evaluated LLMs.

Multiple-choice

121


RePOSE: Fast 6D Object Pose Refinement via Deep Texture Rendering

1 code implementation • ICCV 2021 • Shun Iwase, Xingyu Liu, Rawal Khirodkar, Rio Yokota, Kris M. Kitani

Furthermore, we utilize differentiable Levenberg-Marquardt (LM) optimization to refine a pose fast and accurately by minimizing the feature-metric error between the input and rendered image representations without the need of zooming in.

Ranked #5 on 6D Pose Estimation using RGB on LineMOD

6D Pose Estimation 6D Pose Estimation using RGB +1


Occlusion-Aware Self-Supervised Monocular 6D Object Pose Estimation

1 code implementation • 19 Mar 2022 • Gu Wang, Fabian Manhardt, Xingyu Liu, Xiangyang Ji, Federico Tombari

6D object pose estimation is a fundamental yet challenging problem in computer vision.

6D Pose Estimation 6D Pose Estimation using RGB +3


CATRE: Iterative Point Clouds Alignment for Category-level Object Pose Refinement

1 code implementation • 17 Jul 2022 • Xingyu Liu, Gu Wang, Yi Li, Xiangyang Ji

While category-level 9DoF object pose estimation has emerged recently, previous correspondence-based or direct regression methods are both limited in accuracy due to the huge intra-category variances in object shape and color, etc.

Object Pose Estimation


REvolveR: Continuous Evolutionary Models for Robot-to-robot Policy Transfer

1 code implementation • 10 Feb 2022 • Xingyu Liu, Deepak Pathak, Kris M. Kitani

We interpolate between the source robot and the target robot by finding a continuous evolutionary change of robot parameters.

Imitation Learning


Sequential Voting with Relational Box Fields for Active Object Detection

1 code implementation • CVPR 2022 • Qichen Fu, Xingyu Liu, Kris M. Kitani

While our voting function is able to improve the bounding box of the active object, one round of voting is typically not enough to accurately localize the active object.

Active Object Detection Imitation Learning +4


Learning from Temporal Spatial Cubism for Cross-Dataset Skeleton-based Action Recognition

1 code implementation • 17 Jul 2022 • Yansong Tang, Xingyu Liu, Xumin Yu, Danyang Zhang, Jiwen Lu, Jie Zhou

Different from the conventional adversarial learning-based approaches for UDA, we utilize a self-supervision scheme to reduce the domain shift between two skeleton-based action datasets.

Action Recognition Self-Supervised Learning +2


SynFacePAD 2023: Competition on Face Presentation Attack Detection Based on Privacy-aware Synthetic Training Data

1 code implementation • 9 Nov 2023 • Meiling Fang, Marco Huber, Julian Fierrez, Raghavendra Ramachandra, Naser Damer, Alhasan Alkhaddour, Maksim Kasantcev, Vasiliy Pryadchenko, Ziyuan Yang, Huijie Huangfu, Yingyu Chen, Yi Zhang, Yuchen Pan, Junjun Jiang, Xianming Liu, Xianyun Sun, Caiyong Wang, Xingyu Liu, Zhaohua Chang, Guangzhe Zhao, Juan Tapia, Lazaro Gonzalez-Soler, Carlos Aravena, Daniel Schulz

This paper presents a summary of the Competition on Face Presentation Attack Detection Based on Privacy-aware Synthetic Training Data (SynFacePAD 2023) held at the 2023 International Joint Conference on Biometrics (IJCB 2023).

Face Presentation Attack Detection valid


KP-RED: Exploiting Semantic Keypoints for Joint 3D Shape Retrieval and Deformation

1 code implementation • 15 Mar 2024 • Ruida Zhang, Chenyangguang Zhang, Yan Di, Fabian Manhardt, Xingyu Liu, Federico Tombari, Xiangyang Ji

In this paper, we present KP-RED, a unified KeyPoint-driven REtrieval and Deformation framework that takes object scans as input and jointly retrieves and deforms the most geometrically similar CAD models from a pre-processed database to tightly match the target.

3D Shape Retrieval Retrieval


Exploring the Regularity of Sparse Structure in Convolutional Neural Networks

no code implementations • 24 May 2017 • Huizi Mao, Song Han, Jeff Pool, Wenshuo Li, Xingyu Liu, Yu Wang, William J. Dally

Since memory reference is more than two orders of magnitude more expensive than arithmetic operations, the regularity of sparse structure leads to more efficient hardware design.


Time-Efficient Mars Exploration of Simultaneous Coverage and Charging with Multiple Drones

no code implementations • 16 Nov 2020 • Yuan Chang, Chao Yan, Xingyu Liu, Xiangke Wang, Han Zhou, Xiaojia Xiang, Dengqing Tang

This paper presents a time-efficient scheme for Mars exploration by the cooperation of multiple drones and a rover.

Navigate Scheduling


KDFNet: Learning Keypoint Distance Field for 6D Object Pose Estimation

no code implementations • 21 Sep 2021 • Xingyu Liu, Shun Iwase, Kris M. Kitani

To address this problem, we propose a novel continuous representation called Keypoint Distance Field (KDF) for projected 2D keypoint locations.

6D Pose Estimation using RGB


StereOBJ-1M: Large-scale Stereo Image Dataset for 6D Object Pose Estimation

no code implementations • ICCV 2021 • Xingyu Liu, Shun Iwase, Kris M. Kitani

We present a large-scale stereo RGB image object pose estimation dataset named the $\textbf{StereOBJ-1M}$ dataset.

6D Pose Estimation using RGB Object +1


Mesure de similarité textuelle pour l’évaluation automatique de copies d’étudiants (Textual similarity measurement for automatic evaluation of students’ answers)

no code implementations • JEP/TALN/RECITAL 2021 • Xiaoou Wang, Xingyu Liu, Yimei Yue

This article describes the participation of the Nantalco team in Task 2 of the Défi Fouille de Textes 2021 (DEFT): automatic grading of student answers against an existing reference.

Sentence Sentence Embeddings


V-MAO: Generative Modeling for Multi-Arm Manipulation of Articulated Objects

no code implementations • 7 Nov 2021 • Xingyu Liu, Kris M. Kitani

Manipulating articulated objects requires multiple robot arms in general.

Object


Tripartite: Tackle Noisy Labels by a More Precise Partition

no code implementations • 19 Feb 2022 • Xuefeng Liang, Longshan Yao, Xingyu Liu, Ying Zhou

Instead, we propose a Tripartite solution to partition training data more precisely into three subsets: hard, noisy, and clean.

Self-Supervised Learning


Classification automatique de questions spontanées vs. préparées dans des transcriptions de l’oral (Automatic Classification of Spontaneous vs. Prepared Questions in Speech Transcriptions)

no code implementations • JEP/TALN/RECITAL 2022 • Iris Eshkol-Taravella, Angèle Barbedette, Xingyu Liu, Valentin-Gabriel Soumah

This work aims to develop a linguistic model that automatically classifies questions taken from transcriptions of recordings in the ESLO2 and ACSYNT corpora into two categories, “spontaneous” and “prepared”.

Classification


HERD: Continuous Human-to-Robot Evolution for Learning from Human Demonstration

no code implementations • 8 Dec 2022 • Xingyu Liu, Deepak Pathak, Kris M. Kitani

The ability to learn from human demonstration endows robots with the ability to automate various tasks.


Deformer: Dynamic Fusion Transformer for Robust Hand Pose Estimation

no code implementations • ICCV 2023 • Qichen Fu, Xingyu Liu, Ran Xu, Juan Carlos Niebles, Kris M. Kitani

Accurately estimating 3D hand pose is crucial for understanding how humans interact with the world.

Hand Pose Estimation


Knowledge Distillation for Efficient Sequences of Training Runs

no code implementations • 11 Mar 2023 • Xingyu Liu, Alex Leonardi, Lu Yu, Chris Gilmer-Hill, Matthew Leavitt, Jonathan Frankle

We find that augmenting future runs with KD from previous runs dramatically reduces the time necessary to train these models, even taking into account the overhead of KD.

Knowledge Distillation


A Survey on Graph Classification and Link Prediction based on GNN

no code implementations • 3 Jul 2023 • Xingyu Liu, Juan Chen, Quan Wen

Traditional convolutional neural networks are limited to handling Euclidean space data, overlooking the vast realm of real-life scenarios represented as graph data, including transportation networks, social networks, and reference networks.

Graph Classification Link Prediction +1


Parallel Attention Interaction Network for Few-Shot Skeleton-Based Action Recognition

no code implementations • ICCV 2023 • Xingyu Liu, Sanping Zhou, Le Wang, Gang Hua

Learning discriminative features from very few labeled samples to identify novel classes has received increasing attention in skeleton-based action recognition.

Action Recognition Skeleton Based Action Recognition


COMPOSER: Scalable and Robust Modular Policies for Snake Robots

no code implementations • 2 Oct 2023 • Yuyou Zhang, Yaru Niu, Xingyu Liu, Ding Zhao

Instead of perceiving the hyper-redundancy and flexibility of snake robots as mere challenges, there lies an unexplored potential in leveraging these traits to enhance robustness and generalizability at the control policy level.

Multi-agent Reinforcement Learning


Iris Liveness Detection Competition (LivDet-Iris) -- The 2023 Edition

no code implementations • 6 Oct 2023 • Patrick Tinsley, Sandip Purnapatra, Mahsa Mitcheff, Aidan Boyd, Colton Crum, Kevin Bowyer, Patrick Flynn, Stephanie Schuckers, Adam Czajka, Meiling Fang, Naser Damer, Xingyu Liu, Caiyong Wang, Xianyun Sun, Zhaohua Chang, Xinyue Li, Guangzhe Zhao, Juan Tapia, Christoph Busch, Carlos Aravena, Daniel Schulz

New elements in this fifth competition include (1) GAN-generated iris images as a category of presentation attack instruments (PAI), and (2) an evaluation of human accuracy at detecting PAI as a reference benchmark.


Diff-Transfer: Model-based Robotic Manipulation Skill Transfer via Differentiable Physics Simulation

no code implementations • 7 Oct 2023 • Yuqi Xiang, Feitong Chen, Qinsi Wang, Yang Gang, Xiang Zhang, Xinghao Zhu, Xingyu Liu, Lin Shao

In this work, we introduce $\textit{Diff-Transfer}$, a novel framework leveraging differentiable physics simulation to efficiently transfer robotic skills.

Q-Learning


Structure-Preserving Instance Segmentation via Skeleton-Aware Distance Transform

no code implementations • 8 Oct 2023 • Zudi Lin, Donglai Wei, Aarush Gupta, Xingyu Liu, Deqing Sun, Hanspeter Pfister

Objects with complex structures pose significant challenges to existing instance segmentation methods that rely on boundary or affinity maps, which are vulnerable to small errors around contacting pixels that cause noticeable connectivity change.

Image Segmentation Instance Segmentation +3


RaSim: A Range-aware High-fidelity RGB-D Data Simulation Pipeline for Real-world Applications

no code implementations • 5 Apr 2024 • Xingyu Liu, Chenyangguang Zhang, Gu Wang, Ruida Zhang, Xiangyang Ji

In robotic vision, a de-facto paradigm is to learn in simulated environments and then transfer to real-world applications, which poses an essential challenge in bridging the sim-to-real domain gap.

