no code implementations • 4 Mar 2024 • Fiona Anting Tan, Gerard Christopher Yeo, Fanyou Wu, Weijie Xu, Vinija Jain, Aman Chadha, Kokil Jaidka, Yang Liu, See-Kiong Ng
Drawing inspiration from psychological research on the links between certain personality traits and Theory-of-Mind (ToM) reasoning, and from prompt engineering research on the hyper-sensitivity of prompts in affecting LLMs capabilities, this study investigates how inducing personalities in LLMs using prompts affects their ToM reasoning capabilities.
no code implementations • 27 Feb 2024 • Xi Fang, Weijie Xu, Fiona Anting Tan, Jiani Zhang, Ziqing Hu, Yanjun Qi, Scott Nickleach, Diego Socolinsky, Srinivasan Sengamedu, Christos Faloutsos
Recent breakthroughs in large language modeling have facilitated rigorous exploration of their application in diverse tasks related to tabular data modeling, such as prediction, tabular data synthesis, question answering, and table understanding.
1 code implementation • 1 Feb 2024 • Weijie Xu, Zicheng Huang, Wenxiang Hu, Xi Fang, Rajesh Kumar Cherukuri, Naumaan Nayyar, Lorenzo Malandri, Srinivasan H. Sengamedu
The data generation pipeline is transferable and can be easily adapted for labeled conversation data generation in other domains.
no code implementations • 28 Oct 2023 • Yixin Wan, Fanyou Wu, Weijie Xu, Srinivasan H. Sengamedu
We explore the correlation between the level of hallucination in model responses and two types of sequence-level certainty: probabilistic certainty and semantic certainty.
1 code implementation • 23 Oct 2023 • Weijie Xu, Wenxiang Hu, Fanyou Wu, Srinivasan Sengamedu
Additionally, by exploiting the power of diffusion model, our framework also provides the capability to do topic based text generation.
no code implementations • 6 Jul 2023 • Weijie Xu, Jay Desai, Srinivasan Sengamedu, Xiaoyu Jiang, Francis Iannacci
Across a variety of datasets, S2vNTM outperforms existing semi-supervised topic modeling methods in classification accuracy with limited keywords provided.
no code implementations • 4 Jul 2023 • Weijie Xu, Xiaoyu Jiang, Jay Desai, Bin Han, Fuqin Yan, Francis Iannacci
In text classification tasks, fine tuning pretrained language models like BERT and GPT-3 yields competitive accuracy; however, both methods require pretraining on large text datasets.
no code implementations • 4 Jul 2023 • Tamas Madl, Weijie Xu, Olivia Choudhury, Matthew Howard
Despite progress in differential privacy and generative modeling for privacy-preserving data release in the literature, only a few approaches optimize for machine learning utility: most approaches only take into account statistical metrics on the data itself and fail to explicitly preserve the loss metrics of machine learning models that are to be subsequently trained on the generated data.
1 code implementation • 3 Jul 2023 • Weijie Xu, Xiaoyu Jiang, Srinivasan H. Sengamedu, Francis Iannacci, Jinjin Zhao
Recently, Neural Topic Models (NTM), inspired by variational autoencoders, have attracted a lot of research interest; however, these methods have limited applications in the real world due to the challenge of incorporating human knowledge.
Ranked #1 on Topic Models on 20NewsGroups
no code implementations • 30 Jun 2023 • Weijie Xu, Jinjin Zhao, Francis Iannacci, Bo wang
Generative modeling has been used frequently in synthetic data generation.