Search Results for author: Si Shen

Found 5 papers, 1 papers with code

RevOrder: A Novel Method for Enhanced Arithmetic in Language Models

no code implementations6 Feb 2024 Si Shen, Peijun Shen, Danhao Zhu

This paper presents RevOrder, a novel technique aimed at improving arithmetic operations in large language models (LLMs) by reversing the output digits in addition, subtraction, and n-digit by 1-digit (nD by 1D) multiplication tasks.

GSM8K Math

GujiBERT and GujiGPT: Construction of Intelligent Information Processing Foundation Language Models for Ancient Texts

no code implementations11 Jul 2023 Dongbo Wang, Chang Liu, Zhixiao Zhao, Si Shen, Liu Liu, Bin Li, Haotian Hu, Mengcheng Wu, Litao Lin, Xue Zhao, Xiyu Wang

In the context of the rapid development of large language models, we have meticulously trained and introduced the GujiBERT and GujiGPT language models, which are foundational models specifically designed for intelligent information processing of ancient texts.

Model Selection Part-Of-Speech Tagging +2

Increasing Visual Awareness in Multimodal Neural Machine Translation from an Information Theoretic Perspective

no code implementations16 Oct 2022 Baijun Ji, Tong Zhang, Yicheng Zou, Bojie Hu, Si Shen

Multimodal machine translation (MMT) aims to improve translation quality by equipping the source sentence with its corresponding image.

Multimodal Machine Translation Sentence +1

Cannot find the paper you are looking for? You can Submit a new open access paper.