Search Results for author: Weili Zeng

Infusion: Preventing Customized Text-to-Image Diffusion from Overfitting

Text-to-image (T2I) customization aims to create images that embody specific visual concepts delineated in textual descriptions.

Referential dialogue is a superset of various vision-language (VL) tasks.

Energy-based models parameterize the unnormalized log-probability of data samples, but there is a lack of guidance on how to construct the "energy".
