Search Results for author: Renjie Li

Found 12 papers, 2 papers with code

PANDORA: Diffusion Policy Learning for Dexterous Robotic Piano Playing

no code implementations17 Mar 2025 Yanjia Huang, Renjie Li, Zhengzhong Tu

We present PANDORA, a novel diffusion-based policy learning framework designed specifically for dexterous robotic piano performance.

Denoising Language Modeling +2

AutoTrust: Benchmarking Trustworthiness in Large Vision Language Models for Autonomous Driving

1 code implementation19 Dec 2024 Shuo Xing, Hongyuan Hua, Xiangbo Gao, Shenzhe Zhu, Renjie Li, Kexin Tian, Xiaopeng Li, Heng Huang, Tianbao Yang, Zhangyang Wang, Yang Zhou, Huaxiu Yao, Zhengzhong Tu

Our findings call for immediate and decisive action to address the trustworthiness of DriveVLMs -- an issue of critical importance to public safety and the welfare of all citizens relying on autonomous transportation systems.

Autonomous Driving Benchmarking +4

Large Spatial Model: End-to-end Unposed Images to Semantic 3D

1 code implementation24 Oct 2024 Zhiwen Fan, Jian Zhang, Wenyan Cong, Peihao Wang, Renjie Li, Kairun Wen, Shijie Zhou, Achuta Kadambi, Zhangyang Wang, Danfei Xu, Boris Ivanovic, Marco Pavone, Yue Wang

To tackle the scarcity of labeled 3D semantic data and enable natural language-driven scene manipulation, we incorporate a pre-trained 2D language-based segmentation model into a 3D-consistent semantic feature field.

3D Reconstruction Attribute

4K4DGen: Panoramic 4D Generation at 4K Resolution

no code implementations19 Jun 2024 Renjie Li, Panwang Pan, Bangbang Yang, Dejia Xu, Shijie Zhou, Xuanyang Zhang, Zeming Li, Achuta Kadambi, Zhangyang Wang, Zhengzhong Tu, Zhiwen Fan

Subsequently, we propose \textbf{Dynamic Panoramic Lifting} to elevate the panoramic video into a 4D immersive environment while preserving spatial and temporal consistency.

4k

Inverse Design of Photonic Crystal Surface Emitting Lasers is a Sequence Modeling Problem

no code implementations8 Mar 2024 Ceyao Zhang, Renjie Li, Cheng Zhang, Zhaoyu Zhang, Feng Yin

By modeling the inverse design of PCSEL as a sequential decision-making problem, RL approaches can construct a satisfactory PCSEL structure from scratch.

Decision Making Reinforcement Learning (RL) +1

Rapid-Motion-Track: Markerless Tracking of Fast Human Motion with Deeper Learning

no code implementations18 Jan 2023 Renjie Li, Chun Yu Lao, Rebecca St. George, Katherine Lawler, Saurabh Garg, Son N. Tran, Quan Bai, Jane Alty

RMT and a range of DLC models were applied to the video data with tapping frequencies up to 8Hz to extract movement features.

Rhythm

Hybrid CNN -Interpreter: Interpret local and global contexts for CNN-based Models

no code implementations31 Oct 2022 Wenli Yang, Guan Huang, Renjie Li, Jiahao Yu, Yanyu Chen, Quan Bai, Beyong Kang

Convolutional neural network (CNN) models have seen advanced improvements in performance in various domains, but lack of interpretability is a major barrier to assurance and regulation during operation for acceptance and deployment of AI-assisted applications.

Feature Correlation

A Comprehensive Review on Deep Supervision: Theories and Applications

no code implementations6 Jul 2022 Renjie Li, Xinyi Wang, Guan Huang, Wenli Yang, Kaining Zhang, Xiaotong Gu, Son N. Tran, Saurabh Garg, Jane Alty, Quan Bai

Deep supervision, or known as 'intermediate supervision' or 'auxiliary supervision', is to add supervision at hidden layers of a neural network.

POViT: Vision Transformer for Multi-objective Design and Characterization of Nanophotonic Devices

no code implementations17 May 2022 Xinyu Chen, Renjie Li, Yueyao Yu, Yuanwen Shen, Wenye Li, Zhaoyu Zhang, Yin Zhang

In this work, we propose the first-ever Transformer model (POViT) to efficiently design and simulate semiconductor photonic devices with multiple objectives.

Endowing Deep 3D Models with Rotation Invariance Based on Principal Component Analysis

no code implementations20 Oct 2019 Zelin Xiao, Hongxin Lin, Renjie Li, Hongyang Chao, Shengyong Ding

Interestingly, the principal component analysis exactly provides an effective way to define such a frame, i. e. setting the principal components as the frame axes.

Object Retrieval

Cannot find the paper you are looking for? You can Submit a new open access paper.