Search Results for author: Qian-Wen Zhang

Found 10 papers, 4 papers with code

Pre-training and Fine-tuning Neural Topic Model: A Simple yet Effective Approach to Incorporating External Knowledge

no code implementations ACL 2022 Linhai Zhang, Xuemeng Hu, Boyu Wang, Deyu Zhou, Qian-Wen Zhang, Yunbo Cao

Recent years have witnessed growing interests in incorporating external knowledge such as pre-trained word embeddings (PWEs) or pre-trained language models (PLMs) into neural topic modeling.

Topic Models Word Embeddings

Sequential-NIAH: A Needle-In-A-Haystack Benchmark for Extracting Sequential Needles from Long Contexts

no code implementations7 Apr 2025 Yifei Yu, Qian-Wen Zhang, Lingfeng Qiao, Di Yin, Fang Li, Jie Wang, Zengxi Chen, Suncong Zheng, Xiaolong Liang, Xing Sun

Evaluating the ability of large language models (LLMs) to handle extended contexts is critical, particularly for retrieving information relevant to specific queries embedded within lengthy inputs.

8k

CJEval: A Benchmark for Assessing Large Language Models Using Chinese Junior High School Exam Data

1 code implementation24 Sep 2024 Qian-Wen Zhang, Haochen Wang, Fang Li, Siyu An, Lingfeng Qiao, Liangcai Gao, Di Yin, Xing Sun

Online education platforms have significantly transformed the dissemination of educational resources by providing a dynamic and digital infrastructure.

SNFinLLM: Systematic and Nuanced Financial Domain Adaptation of Chinese Large Language Models

no code implementations5 Aug 2024 Shujuan Zhao, Lingfeng Qiao, Kangyang Luo, Qian-Wen Zhang, Junru Lu, Di Yin

To address these issues, we propose a novel large language model specifically designed for the Chinese financial domain, named SNFinLLM.

Articles Domain Adaptation +3

MAC-SQL: A Multi-Agent Collaborative Framework for Text-to-SQL

1 code implementation18 Dec 2023 Bing Wang, Changyu Ren, Jian Yang, Xinnian Liang, Jiaqi Bai, Linzheng Chai, Zhao Yan, Qian-Wen Zhang, Di Yin, Xing Sun, Zhoujun Li

Our framework comprises a core decomposer agent for Text-to-SQL generation with few-shot chain-of-thought reasoning, accompanied by two auxiliary agents that utilize external tools or models to acquire smaller sub-databases and refine erroneous SQL queries.

SQL Parsing Text to SQL +1

Cannot find the paper you are looking for? You can Submit a new open access paper.