Search Results for author: Zhaoyang Lv

Found 17 papers, 9 papers with code

EgoLifter: Open-world 3D Segmentation for Egocentric Perception

no code implementations • 26 Mar 2024 • Qiao Gu, Zhaoyang Lv, Duncan Frost, Simon Green, Julian Straub, Chris Sweeney

In this paper we present EgoLifter, a novel system that can automatically segment scenes captured from egocentric sensors into a complete decomposition of individual 3D objects.

3D Reconstruction, Object

Aria Everyday Activities Dataset

1 code implementation • 20 Feb 2024 • Zhaoyang Lv, Nicholas Charron, Pierre Moulon, Alexander Gamino, Cheng Peng, Chris Sweeney, Edward Miller, Huixuan Tang, Jeff Meissner, Jing Dong, Kiran Somasundaram, Luis Pesqueira, Mark Schwesinger, Omkar Parkhi, Qiao Gu, Renzo De Nardi, Shangyi Cheng, Steve Saarinen, Vijay Baiyya, Yuyang Zou, Richard Newcombe, Jakob Julian Engel, Xiaqing Pan, Carl Ren

We present Aria Everyday Activities (AEA) Dataset, an egocentric multimodal open dataset recorded using Project Aria glasses.

327

LAVE: LLM-Powered Agent Assistance and Language Augmentation for Video Editing

no code implementations • 15 Feb 2024 • Bryan Wang, Yuliang Li, Zhaoyang Lv, Haijun Xia, Yan Xu, Raj Sodhi

Based on these findings, we propose design implications to inform the future development of agent-assisted content editing.

Video Editing

Project Aria: A New Tool for Egocentric Multi-Modal AI Research

no code implementations • 24 Aug 2023 • Jakob Engel, Kiran Somasundaram, Michael Goesele, Albert Sun, Alexander Gamino, Andrew Turner, Arjang Talattof, Arnie Yuan, Bilal Souti, Brighid Meredith, Cheng Peng, Chris Sweeney, Cole Wilson, Dan Barnes, Daniel DeTone, David Caruso, Derek Valleroy, Dinesh Ginjupalli, Duncan Frost, Edward Miller, Elias Mueggler, Evgeniy Oleinik, Fan Zhang, Guruprasad Somasundaram, Gustavo Solaira, Harry Lanaras, Henry Howard-Jenkins, Huixuan Tang, Hyo Jin Kim, Jaime Rivera, Ji Luo, Jing Dong, Julian Straub, Kevin Bailey, Kevin Eckenhoff, Lingni Ma, Luis Pesqueira, Mark Schwesinger, Maurizio Monge, Nan Yang, Nick Charron, Nikhil Raina, Omkar Parkhi, Peter Borschowa, Pierre Moulon, Prince Gupta, Raul Mur-Artal, Robbie Pennington, Sachin Kulkarni, Sagar Miglani, Santosh Gondi, Saransh Solanki, Sean Diener, Shangyi Cheng, Simon Green, Steve Saarinen, Suvam Patra, Tassos Mourikis, Thomas Whelan, Tripti Singh, Vasileios Balntas, Vijay Baiyya, Wilson Dreewes, Xiaqing Pan, Yang Lou, Yipu Zhao, Yusuf Mansour, Yuyang Zou, Zhaoyang Lv, Zijian Wang, Mingfei Yan, Carl Ren, Renzo De Nardi, Richard Newcombe

Egocentric, multi-modal data as available on future augmented reality (AR) devices provides unique challenges and opportunities for machine perception.

AdaNeRF: Adaptive Sampling for Real-time Rendering of Neural Radiance Fields

1 code implementation • 21 Jul 2022 • Andreas Kurz, Thomas Neff, Zhaoyang Lv, Michael Zollhöfer, Markus Steinberger

However, rendering images with this new paradigm is slow due to the fact that an accurate quadrature of the volume rendering equation requires a large number of samples for each ray.

Novel View Synthesis

236

LiveView: Dynamic Target-Centered MPI for View Synthesis

no code implementations • 11 Jul 2021 • Sushobhan Ghosh, Zhaoyang Lv, Nathan Matsuda, Lei Xiao, Andrew Berkovich, Oliver Cossairt

Existing Multi-Plane Image (MPI) based view-synthesis methods generate an MPI aligned with the input view using a fixed number of planes in one forward pass.

Novel View Synthesis

Neural 3D Video Synthesis from Multi-view Video

1 code implementation • CVPR 2022 • Tianye Li, Mira Slavcheva, Michael Zollhoefer, Simon Green, Christoph Lassner, Changil Kim, Tanner Schmidt, Steven Lovegrove, Michael Goesele, Richard Newcombe, Zhaoyang Lv

We propose a novel approach for 3D video synthesis that is able to represent multi-view video recordings of a dynamic real-world scene in a compact, yet expressive representation that enables high-quality view synthesis and motion interpolation.

Motion Interpolation

233

STaR: Self-supervised Tracking and Reconstruction of Rigid Objects in Motion with Neural Rendering

no code implementations • CVPR 2021 • Wentao Yuan, Zhaoyang Lv, Tanner Schmidt, Steven Lovegrove

We achieve this by jointly optimizing the parameters of two neural radiance fields and a set of rigid poses which align the two fields at each frame.

Neural Rendering, Object

SENSE: a Shared Encoder Network for Scene-flow Estimation

1 code implementation • ICCV 2019 • Huaizu Jiang, Deqing Sun, Varun Jampani, Zhaoyang Lv, Erik Learned-Miller, Jan Kautz

We introduce a compact network for holistic scene flow estimation, called SENSE, which shares common encoder features among four closely-related tasks: optical flow estimation, disparity estimation from stereo, occlusion estimation, and semantic segmentation.

Disparity Estimation, Occlusion Estimation, +3

miniSAM: A Flexible Factor Graph Non-linear Least Squares Optimization Framework

1 code implementation • 3 Sep 2019 • Jing Dong, Zhaoyang Lv

Many problems in computer vision and robotics can be phrased as non-linear least squares optimization problems represented by factor graphs, for example, simultaneous localization and mapping (SLAM), structure from motion (SfM), motion planning, and control.

Benchmarking, Motion Planning, +1

463

Multi-class Classification without Multi-class Labels

1 code implementation • ICLR 2019 • Yen-Chang Hsu, Zhaoyang Lv, Joel Schlosser, Phillip Odom, Zsolt Kira

This work presents a new strategy for multi-class classification that requires no class-specific labels, but instead leverages pairwise similarity between examples, which is a weaker form of annotation.

Classification, General Classification, +1

313

Taking a Deeper Look at the Inverse Compositional Algorithm

1 code implementation • CVPR 2019 • Zhaoyang Lv, Frank Dellaert, James M. Rehg, Andreas Geiger

In this paper, we provide a modern synthesis of the classic inverse compositional algorithm for dense image alignment.

Motion Estimation, regression

153

A probabilistic constrained clustering for transfer learning and image category discovery

no code implementations • 28 Jun 2018 • Yen-Chang Hsu, Zhaoyang Lv, Joel Schlosser, Phillip Odom, Zsolt Kira

The proposed objective directly minimizes the negative log-likelihood of cluster assignment with respect to the pairwise constraints, has no hyper-parameters, and demonstrates improved scalability and performance on both supervised learning and unsupervised transfer learning.

Ranked #1 on Ecg Risk Stratification on ngm

Constrained Clustering, Deep Clustering, +2

Learning Rigidity in Dynamic Scenes with a Moving Camera for 3D Motion Field Estimation

1 code implementation • ECCV 2018 • Zhaoyang Lv, Kihwan Kim, Alejandro Troccoli, Deqing Sun, James M. Rehg, Jan Kautz

Estimation of 3D motion in a dynamic scene from a temporal pair of images is a core task in many scene understanding problems.

Optical Flow Estimation, Scene Flow Estimation, +1

146

Learning to cluster in order to transfer across domains and tasks

1 code implementation • ICLR 2018 • Yen-Chang Hsu, Zhaoyang Lv, Zsolt Kira

The key insight is that, in addition to features, we can transfer similarity information and this is sufficient to learn a similarity function and clustering network to perform both domain adaptation and cross-task transfer learning.

Constrained Clustering, Transfer Learning, +1