no code implementations • 28 Nov 2023 • Jacob Zhiyuan Fang, Skyler Zheng, Vasu Sharma, Robinson Piramuthu
Regardless of their effectiveness, larger architectures unavoidably prevent the models from being extended to real-world applications, so building a lightweight VL architecture and an efficient learning schema is of great practical value.
no code implementations • 27 May 2023 • Zhongping Zhang, Jian Zheng, Jacob Zhiyuan Fang, Bryan A. Plummer
Using the input image as a control could mitigate these issues, but since these models are trained via reconstruction, a model can simply hide information about the original image when encoding it to perfectly reconstruct the image without learning the editing task.