Search Results for author: Jacob Zhiyuan Fang

Found 2 papers, 0 papers with code

E-ViLM: Efficient Video-Language Model via Masked Video Modeling with Semantic Vector-Quantized Tokenizer

no code implementations | 28 Nov 2023 | Jacob Zhiyuan Fang, Skyler Zheng, Vasu Sharma, Robinson Piramuthu

Regardless of their effectiveness, larger architectures unavoidably prevent the models from being extended to real-world applications, so building a lightweight VL architecture and an efficient learning schema is of great practical value.

Language Modelling, Question Answering +3

Text-to-image Editing by Image Information Removal

no code implementations | 27 May 2023 | Zhongping Zhang, Jian Zheng, Jacob Zhiyuan Fang, Bryan A. Plummer

Using the input image as a control could mitigate these issues, but since these models are trained via reconstruction, a model can simply hide information about the original image when encoding it to perfectly reconstruct the image without learning the editing task.

Image Generation, Image Reconstruction
