Search Results for author: Yufan Zhuang

Found 11 papers, 5 papers with code

Text Generation Beyond Discrete Token Sampling

no code implementations20 May 2025 Yufan Zhuang, Liyuan Liu, Chandan Singh, Jingbo Shang, Jianfeng Gao

In standard autoregressive generation, an LLM predicts the next-token distribution, samples a discrete token, and then discards the distribution, passing only the sampled token as new input.

Code Generation Mathematical Reasoning +1

Self-Taught Agentic Long Context Understanding

1 code implementation21 Feb 2025 Yufan Zhuang, Xiaodong Yu, Jialian Wu, Ximeng Sun, Ze Wang, Jiang Liu, Yusheng Su, Jingbo Shang, Zicheng Liu, Emad Barsoum

Answering complex, long-context questions remains a major challenge for large language models (LLMs) as it requires effective question clarifications and context retrieval.

Long-Context Understanding

Data Contamination Can Cross Language Barriers

1 code implementation19 Jun 2024 Feng Yao, Yufan Zhuang, Zihao Sun, Sunan Xu, Animesh Kumar, Jingbo Shang

In addition, we discuss the potential utilization of cross-lingual contamination in interpreting LLMs' working mechanisms and in post-training LLMs for enhanced multilingual capabilities.

Memorization

Learning a Decision Tree Algorithm with Transformers

1 code implementation6 Feb 2024 Yufan Zhuang, Liyuan Liu, Chandan Singh, Jingbo Shang, Jianfeng Gao

Decision trees are renowned for their ability to achieve high predictive performance while remaining interpretable, especially on tabular data.

Meta-Learning

WavSpA: Wavelet Space Attention for Boosting Transformers' Long Sequence Learning Ability

1 code implementation5 Oct 2022 Yufan Zhuang, Zihan Wang, Fangbo Tao, Jingbo Shang

Recent works show that learning attention in the Fourier space can improve the long sequence learning capability of Transformers.

Data-Driven AI Model Signal-Awareness Enhancement and Introspection

no code implementations10 Nov 2021 Sahil Suneja, Yufan Zhuang, Yunhui Zheng, Jim Laredo, Alessandro Morari

AI modeling for source code understanding tasks has been making significant progress, and is being adopted in production development pipelines.

Learning to map source code to software vulnerability using code-as-a-graph

no code implementations15 Jun 2020 Sahil Suneja, Yunhui Zheng, Yufan Zhuang, Jim Laredo, Alessandro Morari

We explore the applicability of Graph Neural Networks in learning the nuances of source code from a security perspective.

Graph Neural Network Vulnerability Detection

Cannot find the paper you are looking for? You can Submit a new open access paper.