Search Results for author: Yu-Jhe Li

Found 24 papers, 3 papers with code

Multi-Person 3D Pose Estimation from Multi-View Uncalibrated Depth Cameras

no code implementations • 28 Jan 2024 • Yu-Jhe Li, Yan Xu, Rawal Khirodkar, Jinhyung Park, Kris Kitani

In order to evaluate our proposed pipeline, we collect three video sets of RGBD videos recorded from multiple sparse-view depth cameras and ground truth 3D poses are manually annotated.

3D Human Pose Estimation 3D Pose Estimation +2

Paper
Add Code

3D-CLFusion: Fast Text-to-3D Rendering with Contrastive Latent Diffusion

no code implementations • 21 Mar 2023 • Yu-Jhe Li, Tao Xu, Ji Hou, Bichen Wu, Xiaoliang Dai, Albert Pumarola, Peizhao Zhang, Peter Vajda, Kris Kitani

We note that the novelty of our model lies in that we introduce contrastive learning during training the diffusion prior which enables the generation of the valid view-invariant latent code.

Contrastive Learning Text to 3D

Paper
Add Code

Azimuth Super-Resolution for FMCW Radar in Autonomous Driving

no code implementations • CVPR 2023 • Yu-Jhe Li, Shawn Hunt, Jinhyung Park, Matthew O’Toole, Kris Kitani

We also propose a hybrid super-resolution model (Hybrid-SR) combining our ADC-SR with a standard RAD super-resolution model, and show that performance can be improved by a large margin.

Autonomous Driving object-detection +2

Paper
Add Code

3D-Aware Encoding for Style-based Neural Radiance Fields

no code implementations • 12 Nov 2022 • Yu-Jhe Li, Tao Xu, Bichen Wu, Ningyuan Zheng, Xiaoliang Dai, Albert Pumarola, Peizhao Zhang, Peter Vajda, Kris Kitani

In the first stage, we introduce a base encoder that converts the input image to a latent code.

Contrastive Learning Image Reconstruction

Paper
Add Code

Domain Adaptive Hand Keypoint and Pixel Localization in the Wild

no code implementations • 16 Mar 2022 • Takehiko Ohkawa, Yu-Jhe Li, Qichen Fu, Ryosuke Furuta, Kris M. Kitani, Yoichi Sato

We aim to improve the performance of regressing hand keypoints and segmenting pixel-level hand masks under new imaging conditions (e. g., outdoors) when we only have labeled images taken under very different conditions (e. g., indoors).

Domain Adaptation Knowledge Distillation

Paper
Add Code

Modality-Agnostic Learning for Radar-Lidar Fusion in Vehicle Detection

no code implementations • CVPR 2022 • Yu-Jhe Li, Jinhyung Park, Matthew O'Toole, Kris Kitani

To mitigate this problem, we propose the Self-Training Multimodal Vehicle Detection Network (ST-MVDNet) which leverages a Teacher-Student mutual learning framework and a simulated sensor noise model used in strong data augmentation for Lidar and Radar.

Autonomous Vehicles Data Augmentation

Paper
Add Code

Cross-Domain Adaptive Teacher for Object Detection

2 code implementations • CVPR 2022 • Yu-Jhe Li, Xiaoliang Dai, Chih-Yao Ma, Yen-Cheng Liu, Kan Chen, Bichen Wu, Zijian He, Kris Kitani, Peter Vajda

To mitigate this problem, we propose a teacher-student framework named Adaptive Teacher (AT) which leverages domain adversarial learning and weak-strong data augmentation to address the domain gap.

Data Augmentation Domain Adaptation +3

170

Paper
Code

Adaptive Unbiased Teacher for Cross-Domain Object Detection

no code implementations • 29 Sep 2021 • Yu-Jhe Li, Xiaoliang Dai, Chih-Yao Ma, Yen-Cheng Liu, Kan Chen, Bichen Wu, Zijian He, Kris M. Kitani, Peter Vajda

This enables the student model to capture domain-invariant features.

Data Augmentation Domain Adaptation +3

Paper
Add Code

Wide-Baseline Multi-Camera Calibration using Person Re-Identification

no code implementations • CVPR 2021 • Yan Xu, Yu-Jhe Li, Xinshuo Weng, Kris Kitani

We address the problem of estimating the 3D pose of a network of cameras for large-environment wide-baseline scenarios, e. g., cameras for construction sites, sports stadiums, and public spaces.

Camera Calibration Person Re-Identification

Paper
Add Code

Visio-Temporal Attention for Multi-Camera Multi-Target Association

no code implementations • ICCV 2021 • Yu-Jhe Li, Xinshuo Weng, Yan Xu, Kris M. Kitani

We propose a inter-tracklet (person to person) attention mechanism that learns a representation for a target tracklet while taking into account other tracklets across multiple views.

Paper
Add Code

Semantics-Guided Representation Learning with Applications to Visual Synthesis

no code implementations • 21 Oct 2020 • Jia-Wei Yan, Ci-Siang Lin, Fu-En Yang, Yu-Jhe Li, Yu-Chiang Frank Wang

Learning interpretable and interpolatable latent representations has been an emerging research direction, allowing researchers to understand and utilize the derived latent space for further applications such as visual synthesis or recognition.

Representation Learning

Paper
Add Code

Semantics-Guided Clustering with Deep Progressive Learning for Semi-Supervised Person Re-identification

no code implementations • 2 Oct 2020 • Chih-Ting Liu, Yu-Jhe Li, Shao-Yi Chien, Yu-Chiang Frank Wang

As a result, our approach is able to augment the labeled training data in the semi-supervised setting.

Clustering Image Retrieval +2

Paper
Add Code

Transforming Multi-Concept Attention into Video Summarization

no code implementations • 2 Jun 2020 • Yen-Ting Liu, Yu-Jhe Li, Yu-Chiang Frank Wang

Video summarization is among challenging tasks in computer vision, which aims at identifying highlight frames or shots over a lengthy video input.

Video Summarization

Paper
Add Code

Learning Shape Representations for Clothing Variations in Person Re-Identification

no code implementations • 16 Mar 2020 • Yu-Jhe Li, Zhengyi Luo, Xinshuo Weng, Kris M. Kitani

To tackle the re-ID problem in the context of clothing changes, we propose a novel representation learning model which is able to generate a body shape feature representation without being affected by clothing color or patterns.

Disentanglement Person Re-Identification

Paper
Add Code

Cross-Resolution Adversarial Dual Network for Person Re-Identification and Beyond

no code implementations • 19 Feb 2020 • Yu-Jhe Li, Yun-Chun Chen, Yen-Yu Lin, Yu-Chiang Frank Wang

Person re-identification (re-ID) aims at matching images of the same person across camera views.

Generative Adversarial Network Person Re-Identification

Paper
Add Code

Cross-Dataset Person Re-Identification via Unsupervised Pose Disentanglement and Adaptation

no code implementations • ICCV 2019 • Yu-Jhe Li, Ci-Siang Lin, Yan-Bo Lin, Yu-Chiang Frank Wang

Person re-identification (re-ID) aims at recognizing the same person from images taken across different cameras.

Ranked #16 on Unsupervised Domain Adaptation on Market to Duke

Disentanglement Person Re-Identification +1

Paper
Add Code

Recover and Identify: A Generative Dual Model for Cross-Resolution Person Re-Identification

no code implementations • ICCV 2019 • Yu-Jhe Li, Yun-Chun Chen, Yen-Yu Lin, Xiaofei Du, Yu-Chiang Frank Wang

Person re-identification (re-ID) aims at matching images of the same identity across camera views.

Generative Adversarial Network Person Re-Identification

Paper
Add Code

Learning Resolution-Invariant Deep Representations for Person Re-Identification

no code implementations • 25 Jul 2019 • Yun-Chun Chen, Yu-Jhe Li, Xiaofei Du, Yu-Chiang Frank Wang

Moreover, the extension of our model for semi-supervised re-ID further confirms the scalability of our proposed method for real-world scenarios and applications.

Image Super-Resolution Person Re-Identification

Paper
Add Code

Dual-modality seq2seq network for audio-visual event localization

2 code implementations • 20 Feb 2019 • Yan-Bo Lin, Yu-Jhe Li, Yu-Chiang Frank Wang

Audio-visual event localization requires one to identify theevent which is both visible and audible in a video (eitherat a frame or video level).

audio-visual event localization

158

Paper
Code

Deep Reinforcement Learning for Playing 2.5D Fighting Games

4 code implementations • 5 May 2018 • Yu-Jhe Li, Hsin-Yu Chang, Yu-Jing Lin, Po-Wei Wu, Yu-Chiang Frank Wang

Deep reinforcement learning has shown its success in game playing.

OpenAI Gym reinforcement-learning +1

Paper
Code

Adaptation and Re-Identification Network: An Unsupervised Deep Transfer Learning Approach to Person Re-Identification

no code implementations • 25 Apr 2018 • Yu-Jhe Li, Fu-En Yang, Yen-Cheng Liu, Yu-Ying Yeh, Xiaofei Du, Yu-Chiang Frank Wang

Person re-identification (Re-ID) aims at recognizing the same person from images taken across different cameras.

Ranked #19 on Unsupervised Domain Adaptation on Duke to Market

Person Re-Identification Transfer Learning +1

Paper
Add Code

Deep Learning for Malicious Flow Detection

no code implementations • 9 Feb 2018 • Yun-Chun Chen, Yu-Jhe Li, Aragorn Tseng, Tsungnan Lin

We also conduct a partial flow experiment which shows the feasibility of real-time detection and a zero-shot learning experiment which justifies the generalization capability of deep learning in cyber security.

Zero-Shot Learning

Paper
Add Code

使用語音評分技術輔助台語語料的驗證 (Using Speech Assessment Technique for the Validation of Taiwanese Speech Corpus) [In Chinese]

no code implementations • ROCLINGIJCLCLP 2013 • Yu-Jhe Li, Chung-Che Wang, Liang-Yu Chen, Jyh-Shing Roger Jang, Ren-Yuan Lyu

Paper
Add Code

台語關鍵詞辨識之實作與比較 (Implementation and Comparison of Keyword Spotting for Taiwanese) [In Chinese]

no code implementations • ROCLINGIJCLCLP 2012 • Chung-Che Wang, Che-Hsuan Chou, Liang-Yu Chen, Yu-Jhe Li, Jyh-Shing Jang, Hsun-Cheng Hu, Shih-Peng Lin, You-Lian Huang

Keyword Spotting

Paper
Add Code

Cannot find the paper you are looking for? You can Submit a new open access paper.