no code implementations • 20 May 2025 • Yufan Zhuang, Liyuan Liu, Chandan Singh, Jingbo Shang, Jianfeng Gao
In standard autoregressive generation, an LLM predicts the next-token distribution, samples a discrete token, and then discards the distribution, passing only the sampled token as new input.
1 code implementation • 21 Feb 2025 • Yufan Zhuang, Xiaodong Yu, Jialian Wu, Ximeng Sun, Ze Wang, Jiang Liu, Yusheng Su, Jingbo Shang, Zicheng Liu, Emad Barsoum
Answering complex, long-context questions remains a major challenge for large language models (LLMs) as it requires effective question clarifications and context retrieval.
1 code implementation • 8 Oct 2024 • Yufan Zhuang, Chandan Singh, Liyuan Liu, Jingbo Shang, Jianfeng Gao
Large language models (LLMs) have shown remarkable in-context learning (ICL) capabilities on textual data.
1 code implementation • 19 Jun 2024 • Feng Yao, Yufan Zhuang, Zihao Sun, Sunan Xu, Animesh Kumar, Jingbo Shang
In addition, we discuss the potential utilization of cross-lingual contamination in interpreting LLMs' working mechanisms and in post-training LLMs for enhanced multilingual capabilities.
1 code implementation • 6 Feb 2024 • Yufan Zhuang, Liyuan Liu, Chandan Singh, Jingbo Shang, Jianfeng Gao
Decision trees are renowned for their ability to achieve high predictive performance while remaining interpretable, especially on tabular data.
1 code implementation • 5 Oct 2022 • Yufan Zhuang, Zihan Wang, Fangbo Tao, Jingbo Shang
Recent works show that learning attention in the Fourier space can improve the long sequence learning capability of Transformers.
no code implementations • 10 Nov 2021 • Sahil Suneja, Yufan Zhuang, Yunhui Zheng, Jim Laredo, Alessandro Morari
AI modeling for source code understanding tasks has been making significant progress, and is being adopted in production development pipelines.
no code implementations • 7 Sep 2021 • Yufan Zhuang, Sahil Suneja, Veronika Thost, Giacomo Domeniconi, Alessandro Morari, Jim Laredo
Identifying vulnerable code is a precautionary measure to counter software security breaches.
no code implementations • 25 Nov 2020 • Sahil Suneja, Yunhui Zheng, Yufan Zhuang, Jim Laredo, Alessandro Morari
We measure the signal awareness of models using a new metric we propose- Signal-aware Recall (SAR).
no code implementations • 22 Jun 2020 • Luca Buratti, Saurabh Pujar, Mihaela Bornea, Scott McCarley, Yunhui Zheng, Gaetano Rossiello, Alessandro Morari, Jim Laredo, Veronika Thost, Yufan Zhuang, Giacomo Domeniconi
We explore this hypothesis through the use of a pre-trained transformer-based language model to perform code analysis tasks.
no code implementations • 15 Jun 2020 • Sahil Suneja, Yunhui Zheng, Yufan Zhuang, Jim Laredo, Alessandro Morari
We explore the applicability of Graph Neural Networks in learning the nuances of source code from a security perspective.