no code implementations • CSRR (ACL) 2022 • Yue Wan, Yueen Ma, Haoxuan You, Zhecan Wang, Shih-Fu Chang
Large-scale visual-linguistic pre-training aims to capture generic representations from multimodal features, which are essential for downstream vision-language tasks.
no code implementations • LREC (LAW) 2022 • Daniel Bauer, Tom Longley, Yueen Ma, Tony Wilson
In this paper we explore the use of an NLP system to assist the work of Security Force Monitor (SFM).
no code implementations • 28 Jan 2025 • Yueen Ma, Yuzheng Zhuang, Jianye Hao, Irwin King
3D vision and spatial reasoning have long been recognized as preferable for accurately perceiving our three-dimensional world, especially when compared with traditional visual reasoning based on 2D images.
1 code implementation • 23 May 2024 • Yueen Ma, Zixing Song, Yuzheng Zhuang, Jianye Hao, Irwin King
To this end, we present the first survey on vision-language-action models (VLAs) for embodied AI.
no code implementations • 3 Jul 2023 • Yueen Ma, Dafeng Chi, Jingjing Li, Kai Song, Yuzheng Zhuang, Irwin King
The natural language generation domain has witnessed great success thanks to Transformer models.
1 code implementation • 25 Jun 2022 • Yueen Ma, Zixing Song, Xuming Hu, Jingjing Li, Yifei Zhang, Irwin King
Because the large number of potential concept pairs makes it intractable for data augmentation to fully capture the structural information of the ConcreteGraph, we further introduce a novel Graph Component Contrastive Learning framework to implicitly learn the complete structure of the ConcreteGraph.
1 code implementation • 13 Jan 2022 • Daniel Bauer, Tom Longley, Yueen Ma, Tony Wilson
In this working paper we explore the use of an NLP system to assist the work of Security Force Monitor (SFM).