Search Results for author: Weijie Xu

Found 10 papers, 3 papers with code

PHAnToM: Personality Has An Effect on Theory-of-Mind Reasoning in Large Language Models

no code implementations • 4 Mar 2024 • Fiona Anting Tan, Gerard Christopher Yeo, Fanyou Wu, Weijie Xu, Vinija Jain, Aman Chadha, Kokil Jaidka, Yang Liu, See-Kiong Ng

Drawing inspiration from psychological research on the links between certain personality traits and Theory-of-Mind (ToM) reasoning, and from prompt engineering research on the hyper-sensitivity of prompts in affecting LLMs capabilities, this study investigates how inducing personalities in LLMs using prompts affects their ToM reasoning capabilities.

Prompt Engineering

Paper
Add Code

Large Language Models(LLMs) on Tabular Data: Prediction, Generation, and Understanding -- A Survey

no code implementations • 27 Feb 2024 • Xi Fang, Weijie Xu, Fiona Anting Tan, Jiani Zhang, Ziqing Hu, Yanjun Qi, Scott Nickleach, Diego Socolinsky, Srinivasan Sengamedu, Christos Faloutsos

Recent breakthroughs in large language modeling have facilitated rigorous exploration of their application in diverse tasks related to tabular data modeling, such as prediction, tabular data synthesis, question answering, and table understanding.

Language Modelling Navigate +1

Paper
Add Code

HR-MultiWOZ: A Task Oriented Dialogue (TOD) Dataset for HR LLM Agent

1 code implementation • 1 Feb 2024 • Weijie Xu, Zicheng Huang, Wenxiang Hu, Xi Fang, Rajesh Kumar Cherukuri, Naumaan Nayyar, Lorenzo Malandri, Srinivasan H. Sengamedu

The data generation pipeline is transferable and can be easily adapted for labeled conversation data generation in other domains.

Paper
Code

Sequence-Level Certainty Reduces Hallucination In Knowledge-Grounded Dialogue Generation

no code implementations • 28 Oct 2023 • Yixin Wan, Fanyou Wu, Weijie Xu, Srinivasan H. Sengamedu

We explore the correlation between the level of hallucination in model responses and two types of sequence-level certainty: probabilistic certainty and semantic certainty.

Dialogue Generation Hallucination

Paper
Add Code

DeTiME: Diffusion-Enhanced Topic Modeling using Encoder-decoder based LLM

1 code implementation • 23 Oct 2023 • Weijie Xu, Wenxiang Hu, Fanyou Wu, Srinivasan Sengamedu

Additionally, by exploiting the power of diffusion model, our framework also provides the capability to do topic based text generation.

Text Generation Topic Models

Paper
Code

S2vNTM: Semi-supervised vMF Neural Topic Modeling

no code implementations • 6 Jul 2023 • Weijie Xu, Jay Desai, Srinivasan Sengamedu, Xiaoyu Jiang, Francis Iannacci

Across a variety of datasets, S2vNTM outperforms existing semi-supervised topic modeling methods in classification accuracy with limited keywords provided.

Language Modelling text-classification +1

Paper
Add Code

KDSTM: Neural Semi-supervised Topic Modeling with Knowledge Distillation

no code implementations • 4 Jul 2023 • Weijie Xu, Xiaoyu Jiang, Jay Desai, Bin Han, Fuqin Yan, Francis Iannacci

In text classification tasks, fine tuning pretrained language models like BERT and GPT-3 yields competitive accuracy; however, both methods require pretraining on large text datasets.

Knowledge Distillation text-classification +1

Paper
Add Code

Approximate, Adapt, Anonymize (3A): a Framework for Privacy Preserving Training Data Release for Machine Learning

no code implementations • 4 Jul 2023 • Tamas Madl, Weijie Xu, Olivia Choudhury, Matthew Howard

Despite progress in differential privacy and generative modeling for privacy-preserving data release in the literature, only a few approaches optimize for machine learning utility: most approaches only take into account statistical metrics on the data itself and fail to explicitly preserve the loss metrics of machine learning models that are to be subsequently trained on the generated data.

Privacy Preserving Synthetic Data Generation

Paper
Add Code

vONTSS: vMF based semi-supervised neural topic modeling with optimal transport

1 code implementation • 3 Jul 2023 • Weijie Xu, Xiaoyu Jiang, Srinivasan H. Sengamedu, Francis Iannacci, Jinjin Zhao

Recently, Neural Topic Models (NTM), inspired by variational autoencoders, have attracted a lot of research interest; however, these methods have limited applications in the real world due to the challenge of incorporating human knowledge.

Ranked #1 on Topic Models on 20NewsGroups

text-classification Topic Classification +1

Paper
Code

FFPDG: Fast, Fair and Private Data Generation

no code implementations • 30 Jun 2023 • Weijie Xu, Jinjin Zhao, Francis Iannacci, Bo wang

Generative modeling has been used frequently in synthetic data generation.

Fairness Synthetic Data Generation

Paper
Add Code

Cannot find the paper you are looking for? You can Submit a new open access paper.