Search Results for author: Shuo Shang

Found 21 papers, 9 papers with code

Position-Aware Depth Decay Decoding ($D^3$): Boosting Large Language Model Inference Efficiency

no code implementations11 Mar 2025 Siqi Fan, Xuezhi Fang, Xingrun Xing, Peng Han, Shuo Shang, Yequan Wang

Experiments on large language models (\ie the Llama) with $7 \sim 70$ billion parameters show that $D^3$ can achieve an average 1. 5x speedup compared with the full-inference pipeline while maintaining comparable performance with nearly no performance drop ($<1\%$) on the GSM8K and BBH benchmarks.

GSM8K Language Modeling +4

More is not always better? Enhancing Many-Shot In-Context Learning with Differentiated and Reweighting Objectives

no code implementations7 Jan 2025 Xiaoqing Zhang, Ang Lv, YuHan Liu, Flood Sung, Wei Liu, Shuo Shang, Xiuying Chen, Rui Yan

Recognizing the lack of multi-task datasets with diverse many-shot distributions, we develop the Many-Shot ICL Benchmark (ICL-50)-a large-scale benchmark of 50 tasks that cover shot numbers from 1 to 350 within sequences of up to 8, 000 tokens-for fine-tuning purposes.

In-Context Learning

RED: Effective Trajectory Representation Learning with Comprehensive Information

no code implementations22 Nov 2024 Silin Zhou, Shuo Shang, Lisi Chen, Christian S. Jensen, Panos Kalnis

Trajectory representation learning (TRL) maps trajectories to vectors that can then be used for various downstream tasks, including trajectory similarity computation, trajectory classification, and travel-time estimation.

Representation Learning Travel Time Estimation

Grid and Road Expressions Are Complementary for Trajectory Representation Learning

1 code implementation22 Nov 2024 Silin Zhou, Shuo Shang, Lisi Chen, Peng Han, Christian S. Jensen

Trajectory representation learning (TRL) maps trajectories to vectors that can be used for many downstream tasks.

Representation Learning

CausalStock: Deep End-to-end Causal Discovery for News-driven Stock Movement Prediction

no code implementations10 Nov 2024 Shuqi Li, Yuebo Sun, Yuxin Lin, Xin Gao, Shuo Shang, Rui Yan

On the other hand, there is substantial noise existing in the news data leading to extracting effective information with difficulty.

Causal Discovery Prediction

MobileVLM: A Vision-Language Model for Better Intra- and Inter-UI Understanding

1 code implementation23 Sep 2024 Qinzhuo Wu, Weikai Xu, Wei Liu, Tao Tan, Jianfeng Liu, Ang Li, Jian Luan, Bin Wang, Shuo Shang

These fine-tuned VLMs may still ignore the relationships between UI pages, neglect the roles of elements in page transitions and lack inter-UI understanding.

Language Modeling Language Modelling

What Affects the Stability of Tool Learning? An Empirical Study on the Robustness of Tool Learning Frameworks

no code implementations3 Jul 2024 Chengrui Huang, Zhengliang Shi, Yuntao Wen, Xiuying Chen, Peng Han, Shen Gao, Shuo Shang

Tool learning methods have enhanced the ability of large language models (LLMs) to interact with real-world applications.

Mobile-Bench: An Evaluation Benchmark for LLM-based Mobile Agents

1 code implementation1 Jul 2024 Shihan Deng, Weikai Xu, Hongda Sun, Wei Liu, Tao Tan, Jianfeng Liu, Ang Li, Jian Luan, Bin Wang, Rui Yan, Shuo Shang

With the remarkable advancements of large language models (LLMs), LLM-based agents have become a research hotspot in human-computer interaction.

Benchmarking

Simulating Financial Market via Large Language Model based Agents

no code implementations28 Jun 2024 Shen Gao, Yuntao Wen, Minghang Zhu, Jianing Wei, YuHan Cheng, Qunzi Zhang, Shuo Shang

In this paper, we propose \textbf{A}gent-based \textbf{S}imulated \textbf{F}inancial \textbf{M}arket (ASFM), which first constructs a simulated stock market with a real order matching system.

Language Modeling Language Modelling +1

Facilitating Multi-Role and Multi-Behavior Collaboration of Large Language Models for Online Job Seeking and Recruiting

no code implementations28 May 2024 Hongda Sun, Hongzhan Lin, Haiyu Yan, Chen Zhu, Yang song, Xin Gao, Shuo Shang, Rui Yan

To this end, we propose MockLLM, a novel applicable framework that divides the person-job matching process into two modules: mock interview generation and two-sided evaluation in handshake protocol, jointly enhancing their performance through collaborative behaviors between interviewers and candidates.

DRE: Generating Recommendation Explanations by Aligning Large Language Models at Data-level

no code implementations9 Apr 2024 Shen Gao, Yifan Wang, Jiabao Fang, Lisi Chen, Peng Han, Shuo Shang

Recommendation systems play a crucial role in various domains, suggesting items based on user behavior. However, the lack of transparency in presenting recommendations can lead to user confusion.

Recommendation Systems

360$^\circ$REA: Towards A Reusable Experience Accumulation with 360° Assessment for Multi-Agent System

1 code implementation8 Apr 2024 Shen Gao, Hao Li, Chengrui Huang, Quan Tu, Zhiliang Tian, Minlie Huang, Shuo Shang

The framework employs a novel 360$^\circ$ performance assessment method for multi-perspective performance evaluation with fine-grained assessment.

Language Modeling Language Modelling +1

"In Dialogues We Learn": Towards Personalized Dialogue Without Pre-defined Profiles through In-Dialogue Learning

no code implementations5 Mar 2024 Chuanqi Cheng, Quan Tu, Shuo Shang, Cunli Mao, Zhengtao Yu, Wei Wu, Rui Yan

Personalized dialogue systems have gained significant attention in recent years for their ability to generate responses in alignment with different personas.

Dialogue Generation

DetermLR: Augmenting LLM-based Logical Reasoning from Indeterminacy to Determinacy

1 code implementation28 Oct 2023 Hongda Sun, Weikai Xu, Wei Liu, Jian Luan, Bin Wang, Shuo Shang, Ji-Rong Wen, Rui Yan

Recent advances in large language models (LLMs) have revolutionized the landscape of reasoning tasks.

Logical Reasoning

Generative and Contrastive Paradigms Are Complementary for Graph Self-Supervised Learning

no code implementations24 Oct 2023 Yuxiang Wang, Xiao Yan, Chuang Hu, Fangcheng Fu, Wentao Zhang, Hao Wang, Shuo Shang, Jiawei Jiang

For graph self-supervised learning (GSSL), masked autoencoder (MAE) follows the generative paradigm and learns to reconstruct masked graph edges or node features.

Contrastive Learning Graph Classification +4

BenchTemp: A General Benchmark for Evaluating Temporal Graph Neural Networks

1 code implementation31 Aug 2023 Qiang Huang, Jiawei Jiang, Xi Susie Rao, Ce Zhang, Zhichao Han, Zitao Zhang, Xin Wang, Yongjun He, Quanqing Xu, Yang Zhao, Chuang Hu, Shuo Shang, Bo Du

To handle graphs in which features or connectivities are evolving over time, a series of temporal graph neural networks (TGNNs) have been proposed.

Diversity Link Prediction +1

CharacterChat: Learning towards Conversational AI with Personalized Social Support

1 code implementation20 Aug 2023 Quan Tu, Chuanqi Chen, Jinpeng Li, Yanran Li, Shuo Shang, Dongyan Zhao, Ran Wang, Rui Yan

In our modern, fast-paced, and interconnected world, the importance of mental well-being has grown into a matter of great urgency.

Cannot find the paper you are looking for? You can Submit a new open access paper.