2 code implementations • 5 Dec 2024 • Jian Han, Jinlai Liu, Yi Jiang, Bin Yan, Yuqi Zhang, Zehuan Yuan, Bingyue Peng, Xiaobing Liu
We present Infinity, a Bitwise Visual AutoRegressive Modeling framework capable of generating high-resolution, photorealistic images following language instructions.
no code implementations • 28 Oct 2019 • Dongdong Yu, Zehuan Yuan, Jinlai Liu, Kun Yuan, Changhu Wang
Instance segmentation is an interesting yet challenging task in computer vision.
no code implementations • NeurIPS 2018 • Chenfei Wu, Jinlai Liu, Xiaojie Wang, Xuan Dong
A chain of reasoning (CoR) is constructed for supporting multi-step and dynamic reasoning on changed relations and objects.
no code implementations • 16 Sep 2018 • Jinlai Liu, Zehuan Yuan, Changhu Wang
Leveraging both visual frames and audio has been shown experimentally to improve large-scale video classification.