no code implementations • 29 Jul 2024 • Chia-Hao Kao, Cheng Chien, Yu-Jen Tseng, Yi-Hsin Chen, Alessandro Gnutti, Shao-Yuan Lo, Wen-Hsiao Peng, Riccardo Leonardi
MLLMs have extended the success of large language models to modalities (e. g. images) beyond text, but their billion scale hinders deployment on resource-constrained end devices.
no code implementations • 22 Sep 2023 • Chia-Hao Kao, Yi-Hsin Chen, Cheng Chien, Wei-Chen Chiu, Wen-Hsiao Peng
This paper presents a Transformer-based image compression system that allows for a variable image quality objective according to the user's preference.
1 code implementation • ICCV 2023 • Yi-Hsin Chen, Ying-Chieh Weng, Chia-Hao Kao, Cheng Chien, Wei-Chen Chiu, Wen-Hsiao Peng
This work aims for transferring a Transformer-based image compression codec from human perception to machine perception without fine-tuning the codec.
no code implementations • 29 Dec 2022 • Mu-Jung Chen, Hong-Sheng Xie, Cheng Chien, Wen-Hsiao Peng, Hsueh-Ming Hang
Most learned video codecs operate internally in the RGB domain for P-frame coding.