no code implementations • 15 Dec 2023 • Zhiqiang Li, Hengrong Lan, Lijie Huang, Qiong He, Jianwen Luo
We hypothesize that a single PW can take a shortcut to reach the diffusion trajectory of PWC, removing the need to begin with Gaussian noise.
no code implementations • 17 Jan 2023 • Hengrong Lan, Lijie Huang, Zhiqiang Li, Jing Lv, Jianwen Luo
We find that dynamically masking a high proportion of the channels, e. g., 80%, yields nontrivial self-supervisors in both image and signal domains, which decrease the multiplicity of the pseudo solution to efficiently reconstruct the image from fewer PA measurements with minimum error of the image.
no code implementations • 8 Dec 2021 • Wenbo Gou, Wen Shi, Jian Lou, Lijie Huang, Pan Zhou, Ruixuan Li
Natural language video localization (NLVL) is an important task in the vision-language understanding area, which calls for an in-depth understanding of not only computer vision and natural language side alone, but more importantly the interplay between both sides.