no code implementations • 28 Mar 2025 • Yizhang Zhu, Runzhi Jiang, Boyan Li, Nan Tang, Yuyu Luo
Text-to-SQL automatically translates natural language queries to SQL, allowing non-technical users to retrieve data from databases without specialized SQL knowledge.
no code implementations • 3 Mar 2025 • Teng Lin, Yizhang Zhu, Yuyu Luo, Nan Tang
The effectiveness of current retrieval-augmented generation (RAG) methods is limited by the LLMs' capacity to aggregate insights from numerous pages.
1 code implementation • 9 Feb 2025 • Xudong Yang, Yizhang Zhu, Nan Tang, Yuyu Luo
Conventional multi-modal multi-label emotion recognition (MMER) from videos typically assumes full availability of visual, textual, and acoustic modalities.
1 code implementation • 26 Dec 2024 • Xudong Yang, Yifan Wu, Yizhang Zhu, Nan Tang, Yuyu Luo
To effectively train AskChart, we design a three-stage training strategy to align visual and textual modalities for learning robust visual-textual representations and optimizing the learning of the MoE layer.
1 code implementation • 12 Jun 2024 • Yizhang Zhu, Shiyin Du, Boyan Li, Yuyu Luo, Nan Tang
Large Language Models (LLMs) have demonstrated impressive capabilities across a range of scientific tasks including mathematics, physics, and chemistry.