Search Results for author: Yukun Yan

Found 11 papers, 5 papers with code

Say More with Less: Understanding Prompt Learning Behaviors through Gist Compression

1 code implementation • 25 Feb 2024 • Xinze Li, Zhenghao Liu, Chenyan Xiong, Shi Yu, Yukun Yan, Shuo Wang, Ge Yu

It finetunes the compression plugin module and uses the representations of gist tokens to emulate the raw prompts in the vanilla language model.

Language Modelling

Cleaner Pretraining Corpus Curation with Neural Web Scraping

1 code implementation • 22 Feb 2024 • Zhipeng Xu, Zhenghao Liu, Yukun Yan, Zhiyuan Liu, Chenyan Xiong, Ge Yu

The web contains large-scale, diverse, and abundant information to satisfy the information-seeking needs of humans.

Language Modelling

ActiveRAG: Revealing the Treasures of Knowledge via Active Learning

1 code implementation • 21 Feb 2024 • Zhipeng Xu, Zhenghao Liu, Yibin Liu, Chenyan Xiong, Yukun Yan, Shuo Wang, Shi Yu, Zhiyuan Liu, Ge Yu

Retrieval Augmented Generation (RAG) has introduced a new paradigm for Large Language Models (LLMs), aiding in the resolution of knowledge-intensive tasks.

Active Learning Position +2

MatPlotAgent: Method and Evaluation for LLM-Based Agentic Scientific Data Visualization

1 code implementation • 18 Feb 2024 • Zhiyu Yang, Zihan Zhou, Shuo Wang, Xin Cong, Xu Han, Yukun Yan, Zhenghao Liu, Zhixing Tan, Pengyuan Liu, Dong Yu, Zhiyuan Liu, Xiaodong Shi, Maosong Sun

Scientific data visualization plays a crucial role in research by enabling the direct display of complex information and assisting researchers in identifying implicit patterns.

Code Generation Data Visualization

UltraLink: An Open-Source Knowledge-Enhanced Multilingual Supervised Fine-tuning Dataset

1 code implementation • 7 Feb 2024 • Haoyu Wang, Shuo Wang, Yukun Yan, Xujia Wang, Zhiyu Yang, Yuzhuang Xu, Zhenghao Liu, Liner Yang, Ning Ding, Xu Han, Zhiyuan Liu, Maosong Sun

Different from previous works that simply translate English instructions, we consider both the language-specific and language-agnostic abilities of LLMs.

Cross-Lingual Transfer Data Augmentation

UniMem: Towards a Unified View of Long-Context Large Language Models

no code implementations • 5 Feb 2024 • Junjie Fang, Likai Tang, Hongzhe Bi, Yujia Qin, Si Sun, Zhenyu Li, Haolun Li, Yongjian Li, Xin Cong, Yukun Yan, Xiaodong Shi, Sen Song, Yankai Lin, Zhiyuan Liu, Maosong Sun

Although there exist various methods devoted to enhancing the long-context processing ability of large language models (LLMs), they are developed in an isolated manner and lack systematic analysis and integration of their strengths, hindering further developments.

Management

GitAgent: Facilitating Autonomous Agent with GitHub by Tool Extension

no code implementations • 28 Dec 2023 • Bohan Lyu, Xin Cong, Heyang Yu, Pan Yang, Yujia Qin, Yining Ye, Yaxi Lu, Zhong Zhang, Yukun Yan, Yankai Lin, Zhiyuan Liu, Maosong Sun

As GitHub has hosted a multitude of repositories which can be seen as a good resource for tools, a promising solution is that LLM-based agents can autonomously integrate the repositories in GitHub according to the user queries to extend their tool set.

Local Hypergraph-based Nested Named Entity Recognition as Query-based Sequence Labeling

no code implementations • 25 Apr 2022 • Yukun Yan, Sen Song

There has been a growing academic interest in the recognition of nested named entities in many domains.

Named Entity Recognition +2

Zooming Network

no code implementations • 4 Oct 2018 • Yukun Yan, Daqi Zheng, Zhengdong Lu, Sen Song

Structural information is important in natural language understanding.

Natural Language Understanding

Event Identification as a Decision Process with Non-linear Representation of Text

no code implementations • 3 Oct 2017 • Yukun Yan, Daqi Zheng, Zhengdong Lu, Sen Song

We propose scale-free Identifier Network (sfIN), a novel model for event identification in documents.

Reinforcement Learning (RL)

Object-oriented Neural Programming (OONP) for Document Understanding

no code implementations • ACL 2018 • Zhengdong Lu, Xianggen Liu, Haotian Cui, Yukun Yan, Daqi Zheng

We propose Object-oriented Neural Programming (OONP), a framework for semantically parsing documents in specific domains.

document understanding Object +2
