Search Results for author: Kai Lv

Found 14 papers, 7 papers with code

InternLM2 Technical Report

1 code implementation • 26 Mar 2024 • Zheng Cai, Maosong Cao, Haojiong Chen, Kai Chen, Keyu Chen, Xin Chen, Xun Chen, Zehui Chen, Zhi Chen, Pei Chu, Xiaoyi Dong, Haodong Duan, Qi Fan, Zhaoye Fei, Yang Gao, Jiaye Ge, Chenya Gu, Yuzhe Gu, Tao Gui, Aijia Guo, Qipeng Guo, Conghui He, Yingfan Hu, Ting Huang, Tao Jiang, Penglong Jiao, Zhenjiang Jin, Zhikai Lei, Jiaxing Li, Jingwen Li, Linyang Li, Shuaibin Li, Wei Li, Yining Li, Hongwei Liu, Jiangning Liu, Jiawei Hong, Kaiwen Liu, Kuikun Liu, Xiaoran Liu, Chengqi Lv, Haijun Lv, Kai Lv, Li Ma, Runyuan Ma, Zerun Ma, Wenchang Ning, Linke Ouyang, Jiantao Qiu, Yuan Qu, FuKai Shang, Yunfan Shao, Demin Song, Zifan Song, Zhihao Sui, Peng Sun, Yu Sun, Huanze Tang, Bin Wang, Guoteng Wang, Jiaqi Wang, Jiayu Wang, Rui Wang, Yudong Wang, Ziyi Wang, Xingjian Wei, Qizhen Weng, Fan Wu, Yingtong Xiong, Chao Xu, Ruiliang Xu, Hang Yan, Yirong Yan, Xiaogui Yang, Haochen Ye, Huaiyuan Ying, JIA YU, Jing Yu, Yuhang Zang, Chuyu Zhang, Li Zhang, Pan Zhang, Peng Zhang, Ruijie Zhang, Shuo Zhang, Songyang Zhang, Wenjian Zhang, Wenwei Zhang, Xingcheng Zhang, Xinyue Zhang, Hui Zhao, Qian Zhao, Xiaomeng Zhao, Fengzhe Zhou, Zaida Zhou, Jingming Zhuo, Yicheng Zou, Xipeng Qiu, Yu Qiao, Dahua Lin

The evolution of Large Language Models (LLMs) like ChatGPT and GPT-4 has sparked discussions on the advent of Artificial General Intelligence (AGI).

Ranked #5 on Long-Context Understanding on Ada-LEval (BestAnswer)

4k Long-Context Understanding

5,163

Paper
Code

LongWanjuan: Towards Systematic Measurement for Long Text Quality

1 code implementation • 21 Feb 2024 • Kai Lv, Xiaoran Liu, Qipeng Guo, Hang Yan, Conghui He, Xipeng Qiu, Dahua Lin

The quality of training data are crucial for enhancing the long-text capabilities of foundation models.

Language Modelling

Paper
Code

Building Category Graphs Representation with Spatial and Temporal Attention for Visual Navigation

no code implementations • 6 Dec 2023 • Xiaobo Hu, Youfang Lin, Hehe Fan, Shuo Wang, Zhihao Wu, Kai Lv

To this end, an agent needs to 1) learn a piece of certain knowledge about the relations of object categories in the world during training and 2) look for the target object based on the pre-learned object category relations and its moving trajectory in the current unseen environment.

Object Visual Navigation

Paper
Add Code

A Reliable Representation with Bidirectional Transition Model for Visual Reinforcement Learning Generalization

no code implementations • 4 Dec 2023 • Xiaobo Hu, Youfang Lin, Yue Liu, Jinwen Wang, Shuo Wang, Hehe Fan, Kai Lv

Visual reinforcement learning has proven effective in solving control tasks with high-dimensional observations.

Paper
Add Code

CoLLiE: Collaborative Training of Large Language Models in an Efficient Way

1 code implementation • 1 Dec 2023 • Kai Lv, Shuo Zhang, Tianle Gu, Shuhao Xing, Jiawei Hong, Keyu Chen, Xiaoran Liu, Yuqing Yang, Honglin Guo, Tengxiao Liu, Yu Sun, Qipeng Guo, Hang Yan, Xipeng Qiu

This paper introduces CoLLiE, an efficient library that facilitates collaborative training of large language models using 3D parallelism, parameter-efficient fine-tuning (PEFT) methods, and optimizers such as Lion, Adan, Sophia, LOMO and AdaLomo.

381

Paper
Code

AdaLomo: Low-memory Optimization with Adaptive Learning Rate

1 code implementation • 16 Oct 2023 • Kai Lv, Hang Yan, Qipeng Guo, Haijun Lv, Xipeng Qiu

Building on this insight, we introduce the low-memory optimization with adaptive learning rate (AdaLomo), which offers an adaptive learning rate for each parameter.

925

Paper
Code

Full Parameter Fine-tuning for Large Language Models with Limited Resources

1 code implementation • 16 Jun 2023 • Kai Lv, Yuqing Yang, Tengxiao Liu, Qinghui Gao, Qipeng Guo, Xipeng Qiu

Large Language Models (LLMs) have revolutionized Natural Language Processing (NLP) but demand massive GPU resources for training.

925

Paper
Code

Unified Demonstration Retriever for In-Context Learning

1 code implementation • 7 May 2023 • Xiaonan Li, Kai Lv, Hang Yan, Tianyang Lin, Wei Zhu, Yuan Ni, Guotong Xie, Xiaoling Wang, Xipeng Qiu

To train UDR, we cast various tasks' training signals into a unified list-wise ranking formulation by language model's feedback.

In-Context Learning Language Modelling +1

Paper
Code

GHQ: Grouped Hybrid Q Learning for Heterogeneous Cooperative Multi-agent Reinforcement Learning

no code implementations • 2 Mar 2023 • Xiaoyang Yu, Youfang Lin, Xiangsen Wang, Sheng Han, Kai Lv

We firstly define and describe the heterogeneous problems in SMAC.

Q-Learning reinforcement-learning +3

Paper
Add Code

CoNT: Contrastive Neural Text Generation

2 code implementations • 29 May 2022 • Chenxin An, Jiangtao Feng, Kai Lv, Lingpeng Kong, Xipeng Qiu, Xuanjing Huang

We validate CoNT on five generation tasks with ten benchmarks, including machine translation, summarization, code comment generation, data-to-text generation and commonsense generation.

Code Comment Generation Comment Generation +4

419

Paper
Code

FN-Net:Remove the Outliers by Filtering the Noise

no code implementations • 23 Jan 2022 • Kai Lv

When estimating the relationship between two images, it is often disturbed by outliers.

Denoising Pose Estimation

Paper
Add Code

Conservative Distributional Reinforcement Learning with Safety Constraints

no code implementations • 18 Jan 2022 • Hengrui Zhang, Youfang Lin, Sheng Han, Shuo Wang, Kai Lv

Then, CDMPO uses a conservative value function loss to reduce the number of violations of constraints during the exploration process.

Distributional Reinforcement Learning reinforcement-learning +1

Paper
Add Code

Agent-Centric Relation Graph for Object Visual Navigation

no code implementations • 29 Nov 2021 • Xiaobo Hu, Youfang Lin, Shuo Wang, Zhihao Wu, Kai Lv

ACRG is a highly effective structure that consists of two relationships, i. e., the horizontal relationship among objects and the distance relationship between the agent and objects .

Object Relation +1

Paper
Add Code

A region-based descriptor network for uniformly sampled keypoints

no code implementations • 26 Jan 2021 • Kai Lv, Zongqing Lu, Qingmin Liao

By the new descriptor, we can obtain more high confidence matching points without extremum operation.

Paper
Add Code

Cannot find the paper you are looking for? You can Submit a new open access paper.