Search Results for author: Zhiwei Jia

Found 15 papers, 7 papers with code

Chain-of-Thought Predictive Control

1 code implementation • 3 Apr 2023 • Zhiwei Jia, Fangchen Liu, Vineet Thumuluri, Linghao Chen, Zhiao Huang, Hao Su

We study generalizable policy learning from demonstrations for complex low-level control tasks (e.g., contact-rich object manipulations).

Imitation Learning

Improving Policy Optimization with Generalist-Specialist Learning

1 code implementation • 26 Jun 2022 • Zhiwei Jia, Xuanlin Li, Zhan Ling, Shuang Liu, Yiran Wu, Hao Su

Generalization in deep reinforcement learning over unseen environment variations usually requires policy learning over a large set of diverse training variations.

Imitation Learning

Learning to Act with Affordance-Aware Multimodal Neural SLAM

1 code implementation • 24 Jan 2022 • Zhiwei Jia, Kaixiang Lin, Yizhou Zhao, Qiaozi Gao, Govind Thattai, Gaurav Sukhatme

With the proposed Affordance-aware Multimodal Neural SLAM (AMSLAM) approach, we obtain more than 40% improvement over prior published work on the ALFRED benchmark and set a new state-of-the-art generalization performance at a success rate of 23.48% on the test unseen scenes.

Efficient Exploration, Test unseen

TRIG: Transformer-Based Text Recognizer with Initial Embedding Guidance

no code implementations • 16 Nov 2021 • Yue Tao, Zhiwei Jia, Runze Ma, Shugong Xu

We propose a 1-D split to address the challenges of complexity and replace the CNN with the transformer encoder to reduce the need for a context modeling module.

Inductive Bias, Scene Text Recognition

LUMINOUS: Indoor Scene Generation for Embodied AI Challenges

1 code implementation • 10 Nov 2021 • Yizhou Zhao, Kaixiang Lin, Zhiwei Jia, Qiaozi Gao, Govind Thattai, Jesse Thomason, Gaurav S. Sukhatme

However, current simulators for Embodied AI (EAI) challenges only provide simulated indoor scenes with a limited number of layouts.

Indoor Scene Synthesis, Scene Generation

IFR: Iterative Fusion Based Recognizer For Low Quality Scene Text Recognition

no code implementations • 13 Aug 2021 • Zhiwei Jia, Shugong Xu, Shiyi Mu, Yue Tao, Shan Cao, Zhiyong Chen

In this paper, we propose an Iterative Fusion based Recognizer (IFR) for low-quality scene text recognition, taking advantage of refined text image input and robust feature representation.

Image Restoration, Scene Text Recognition

ManiSkill: Generalizable Manipulation Skill Benchmark with Large-Scale Demonstrations

3 code implementations • 30 Jul 2021 • Tongzhou Mu, Zhan Ling, Fanbo Xiang, Derek Yang, Xuanlin Li, Stone Tao, Zhiao Huang, Zhiwei Jia, Hao Su

Here we propose SAPIEN Manipulation Skill Benchmark (ManiSkill) to benchmark manipulation skills over diverse objects in a full-physics simulator.

Tracking Based Semi-Automatic Annotation for Scene Text Videos

no code implementations • 29 Mar 2021 • Jiajun Zhu, Xiufeng Jiang, Zhiwei Jia, Shugong Xu, Shan Cao

Moreover, a paired low-quality scene text video dataset named Text-RBL is proposed, consisting of raw videos, blurry videos, and low-resolution videos, labeled by the proposed convenient semi-automatic labeling strategy.

Scene Text Detection, Text Annotation

One-pixel Signature: Characterizing CNN Models for Backdoor Detection

no code implementations • ECCV 2020 • Shanjiaoyang Huang, Weiqi Peng, Zhiwei Jia, Zhuowen Tu

One-pixel signature is a general representation that can be used to characterize CNN models beyond backdoor detection.

Information-Theoretic Local Minima Characterization and Regularization

1 code implementation • ICML 2020 • Zhiwei Jia, Hao Su

Recent advances in deep learning theory have evoked the study of generalizability across different local minima of deep neural networks (DNNs).

Learning Theory

Controllable Top-down Feature Transformer

no code implementations • 6 Dec 2017 • Zhiwei Jia, Haoshen Hong, Siyang Wang, Kwonjoon Lee, Zhuowen Tu

We study the intrinsic transformation of feature maps across convolutional network layers with explicit top-down control.

Data Augmentation, Style Transfer
