no code implementations • NAACL (TextGraphs) 2021 • Jinman Zhao, Gerald Penn, Huan Ling
In this paper, we define an abstract task called structural realization that generates words given a prefix of words and a partial representation of a parse tree.
no code implementations • 23 Feb 2025 • Zhili Feng, Dhananjay Ram, Cole Hawkins, Aditya Rawal, Jinman Zhao, Sheng Zha
The next token prediction loss is the dominant self-supervised training objective for large language models and has achieved promising results in a variety of downstream tasks.
no code implementations • 21 Sep 2024 • Jinman Zhao, Zifan Qian, Linbo Cao, Yining Wang, Yitian Ding, Yulan Hu, Zeyu Zhang, Zeyong Jin
Role-play in large language models (LLMs) enhances their ability to generate contextually relevant and high-quality responses by simulating diverse cognitive perspectives.
1 code implementation • 21 Sep 2024 • Guohui Cai, Ruicheng Zhang, Hongyang He, Zeyu Zhang, Daji Ergu, Yuanzhouhan Cao, Jinman Zhao, Binbin Hu, Zhinbin Liao, Yang Zhao, Ying Cai
Pulmonary nodules are critical indicators for the early diagnosis of lung cancer, making their detection essential for timely treatment.
no code implementations • 21 Sep 2024 • Jinman Zhao, Xueyan Zhang, Xingyu Yue, Weizhe Chen, Zifan Qian, Ruiyu Wang
Current common interactions with language models is through full inference.
1 code implementation • 31 Aug 2024 • Zhiyuan Hu, Yuliang Liu, Jinman Zhao, Suyuchen Wang, Yan Wang, Wei Shen, Qing Gu, Anh Tuan Luu, See-Kiong Ng, Zhiwei Jiang, Bryan Hooi
Large language models (LLMs) face significant challenges in handling long-context tasks because of their limited effective context window size during pretraining, which restricts their ability to generalize over extended sequences.
no code implementations • 8 Aug 2024 • Jiahao Tian, Jinman Zhao, Zhenkai Wang, Zhicheng Ding
This surge in content presents unique challenges for designing effective recommender systems.
no code implementations • 14 Jul 2024 • Zhicheng Ding, Jiahao Tian, Zhenkai Wang, Jinman Zhao, Siyang Li
We evaluate our LLM-based imputation method across various tasks within the recommendation system domain, including single classification, multi-classification, and regression compared to traditional data imputation methods.
no code implementations • 5 Mar 2024 • Jinman Zhao, Xueyan Zhang
We present a comprehensive evaluation of large language models(LLMs)' ability to reason about composition relations through a benchmark encompassing 1, 500 test cases in English, designed to cover six distinct types of composition relations: Positional, Comparative, Personal, Mathematical, Identity, and Other.
no code implementations • 1 Mar 2024 • Jinman Zhao, Yitian Ding, Chen Jia, Yining Wang, Zifan Qian
We investigate the outputs of the GPT series of LLMs in various languages using our three measurement methods.
1 code implementation • NeurIPS 2023 • Tuan Dinh, Jinman Zhao, Samson Tan, Renato Negrinho, Leonard Lausen, Sheng Zha, George Karypis
We find that the presence of potential bugs significantly degrades the generation performance of the high-performing Code-LLMs.
no code implementations • 1 Jun 2023 • Hengzhi Pei, Jinman Zhao, Leonard Lausen, Sheng Zha, George Karypis
To better solve this task, we query a program analyzer for information relevant to a given function call, and consider ways to provide the analyzer results to different code completion models during inference and training.
1 code implementation • 30 Mar 2020 • Seohyun Kim, Jinman Zhao, Yuchi Tian, Satish Chandra
We provide comprehensive experimental evaluation of our proposal, along with alternative design choices, on a standard Python dataset, as well as on a Python corpus internal to Facebook.
Ranked #1 on
Type prediction
on Py150
Type prediction
Value prediction
Software Engineering
1 code implementation • EMNLP 2018 • Jinman Zhao, Sidharth Mudgal, YIngyu Liang
We approach the problem of generalizing pre-trained word embeddings beyond fixed-size vocabularies without using additional contextual information.
no code implementations • 11 Sep 2018 • Jinman Zhao, Aws Albarghouthi, Vaibhav Rastogi, Somesh Jha, Damien Octeau
We address the problem of discovering communication links between applications in the popular Android mobile operating system, an important problem for security and privacy in Android.
no code implementations • NeurIPS 2018 • Lingjiao Chen, Hongyi Wang, Jinman Zhao, Dimitris Papailiopoulos, Paraschos Koutris
Distributed implementations of mini-batch stochastic gradient descent (SGD) suffer from communication overheads, attributed to the high frequency of gradient updates inherent in small-batch training.