1 code implementation • 17 Apr 2024 • Sherry X. Chen, Yaron Vaxman, Elad Ben Baruch, David Asulin, Aviad Moreshet, Kuo-Chin Lien, Misha Sra, Pradeep Sen
Previous approaches have focused on either fine-tuning pre-trained T2I models on specific datasets to generate certain kinds of images (e. g., with a specific object or person), or on optimizing the weights, text prompts, and/or learning features for each input image in an attempt to coax the image generator to produce the desired result.
no code implementations • 29 Oct 2022 • Seyed Mehdi Iranmanesh, Xiaotong Chen, Kuo-Chin Lien
In this approach, we detect an object bounding box as a pair of keypoints, the top-left corner and the center, using two decoders.
no code implementations • 1 Jan 2022 • Xiaotong Chen, Seyed Mehdi Iranmanesh, Kuo-Chin Lien
In this paper, we present PatchTrack, a Transformer-based joint-detection-and-tracking system that predicts tracks using patches of the current frame of interest.
no code implementations • 10 Sep 2019 • Jing Zhu, Yunxiao Shi, Mengwei Ren, Yi Fang, Kuo-Chin Lien, Junli Gu
To this end, we introduce a new Structure-Oriented Memory (SOM) module to learn and memorize the structure-specific information between RGB image domain and the depth domain.
Ranked #48 on Monocular Depth Estimation on KITTI Eigen split
no code implementations • ICCV 2019 • Jing Zhu, Yi Fang, Husam Abu-Haimed, Kuo-Chin Lien, Dongdong Fu, Junli Gu
Environment perception, including object detection and distance estimation, is one of the most crucial tasks for autonomous driving.