Search Results for author: Xiaoming Zhai

Found 35 papers, 3 papers with code

Efficient Multi-Task Inferencing: Model Merging with Gromov-Wasserstein Feature Alignment

no code implementations12 Mar 2025 Luyang Fang, Ehsan Latif, Haoran Lu, Yifan Zhou, Ping Ma, Xiaoming Zhai

The improvements in micro F1 and per-label accuracy were statistically significant compared to GPT-o1-based merging (p=0. 04, p=0. 01).

Computational Efficiency

Interpreting and Steering LLMs with Mutual Information-based Explanations on Sparse Autoencoders

no code implementations21 Feb 2025 Xuansheng Wu, Jiayi Yuan, Wenlin Yao, Xiaoming Zhai, Ninghao Liu

Large language models (LLMs) excel at handling human queries, but they can occasionally generate flawed or unexpected responses.

Self-Regularization with Latent Space Explanations for Controllable LLM-based Classification

no code implementations19 Feb 2025 Xuansheng Wu, Wenhao Yu, Xiaoming Zhai, Ninghao Liu

In training the classification model, we propose a simple and effective regularizer, by minimizing the similarity between the classifier weights and the identified unintended feature, to remove the impacts of these unintended features toward classification.

Fairness text-classification +1

Fine-tuning ChatGPT for Automatic Scoring of Written Scientific Explanations in Chinese

no code implementations12 Jan 2025 Jie Yang, Ehsan Latif, Yuze He, Xiaoming Zhai

These findings demonstrate the effectiveness of LLMs in automatic scoring within a Chinese context and emphasize the importance of linguistic features and reasoning complexity in fine-tuning scoring models for educational assessments.

Efficient Multi-Task Inferencing with a Shared Backbone and Lightweight Task-Specific Adapters for Automatic Scoring

no code implementations30 Dec 2024 Ehsan Latif, Xiaoming Zhai

The integration of Artificial Intelligence (AI) in education requires scalable and efficient frameworks that balance performance, adaptability, and cost.

Fairness

Can OpenAI o1 outperform humans in higher-order cognitive thinking?

no code implementations7 Dec 2024 Ehsan Latif, Yifan Zhou, Shuchen Guo, Lehong Shi, Yizhu Gao, Matthew Nyaaba, Arne Bewerdorff, Xiantong Yang, Xiaoming Zhai

For scientific reasoning, it achieved near-perfect performance (mean = 0. 99, SD = 0. 12) on the TOSLS,, exceeding the highest human scores of 0. 85, SD = 0. 13 (z = 1. 78).

Logical Reasoning

Foundation Models for Low-Resource Language Education (Vision Paper)

no code implementations6 Dec 2024 Zhaojun Ding, Zhengliang Liu, Hanqi Jiang, Yizhu Gao, Xiaoming Zhai, Tianming Liu, Ninghao Liu

Recent studies show that large language models (LLMs) are powerful tools for working with natural language, bringing advances in many areas of computational linguistics.

Transcending Language Boundaries: Harnessing LLMs for Low-Resource Language Translation

no code implementations18 Nov 2024 Peng Shu, JunHao Chen, Zhengliang Liu, Hui Wang, Zihao Wu, Tianyang Zhong, Yiwei Li, Huaqin Zhao, Hanqi Jiang, Yi Pan, Yifan Zhou, Constance Owl, Xiaoming Zhai, Ninghao Liu, Claudio Saunt, Tianming Liu

Our comparison with the zero-shot performance of GPT-4o and LLaMA 3. 1 405B, highlights the significant challenges these models face when translating into low-resource languages.

Retrieval Translation

Using Generative AI and Multi-Agents to Provide Automatic Feedback

no code implementations11 Nov 2024 Shuchen Guo, Ehsan Latif, Yifan Zhou, Xuan Huang, Xiaoming Zhai

This study investigates the use of generative AI and multi-agent systems to provide automatic feedback in educational contexts, particularly for student constructed responses in science assessments.

Large Language Models for Manufacturing

no code implementations28 Oct 2024 Yiwei Li, Huaqin Zhao, Hanqi Jiang, Yi Pan, Zhengliang Liu, Zihao Wu, Peng Shu, Jie Tian, Tianze Yang, Shaochen Xu, Yanjun Lyu, Parker Blenk, Jacob Pence, Jason Rupram, Eliza Banu, Ninghao Liu, Linbing Wang, WenZhan Song, Xiaoming Zhai, Kenan Song, Dajiang Zhu, Beiwen Li, Xianqiao Wang, Tianming Liu

The rapid advances in Large Language Models (LLMs) have the potential to transform manufacturing industry, offering new opportunities to optimize processes, improve efficiency, and drive innovation.

A Systematic Assessment of OpenAI o1-Preview for Higher Order Thinking in Education

no code implementations11 Oct 2024 Ehsan Latif, Yifan Zhou, Shuchen Guo, Yizhu Gao, Lehong Shi, Matthew Nayaaba, Gyeonggeon Lee, Liang Zhang, Arne Bewersdorff, Luyang Fang, Xiantong Yang, Huaqin Zhao, Hanqi Jiang, Haoran Lu, Jiaxi Li, Jichao Yu, Weihang You, Zhengliang Liu, Vincent Shung Liu, Hui Wang, Zihao Wu, Jin Lu, Fei Dou, Ping Ma, Ninghao Liu, Tianming Liu, Xiaoming Zhai

This study evaluates OpenAI o1-preview's ability to perform higher-order cognitive tasks across 14 dimensions, including critical thinking, systems thinking, computational thinking, design thinking, metacognition, data literacy, creative thinking, abstract reasoning, quantitative reasoning, logical reasoning, analogical reasoning, and scientific reasoning.

Logical Reasoning

Transforming Teachers' Roles and Agencies in the Era of Generative AI: Perceptions, Acceptance, Knowledge, and Practices

no code implementations3 Oct 2024 Xiaoming Zhai

This paper explores the transformative impact of Generative Artificial Intelligence (GenAI) on teachers' roles and agencies in education, presenting a comprehensive framework that addresses teachers' perceptions, knowledge, acceptance, and practices of GenAI.

Unveiling Scoring Processes: Dissecting the Differences between LLMs and Human Graders in Automatic Scoring

no code implementations4 Jul 2024 Xuansheng Wu, Padmaja Pravin Saraf, Gyeonggeon Lee, Ehsan Latif, Ninghao Liu, Xiaoming Zhai

Specifically, we prompt LLMs to generate analytic rubrics that they use to assign scores and study the alignment gap with human grading rubrics.

Logical Reasoning

Realizing Visual Question Answering for Education: GPT-4V as a Multimodal AI

no code implementations12 May 2024 Gyeong-Geon Lee, Xiaoming Zhai

In this paper, chapter II reviews the development of VQA techniques, which primes with the release of GPT-4V.

Question Answering Visual Question Answering

AI and Machine Learning for Next Generation Science Assessments

no code implementations23 Apr 2024 Xiaoming Zhai

The paper begins with a discussion of the Framework for K-12 Science Education, which calls for a shift from conceptual learning to knowledge-in-use.

Multiple-choice

Usable XAI: 10 Strategies Towards Exploiting Explainability in the LLM Era

1 code implementation13 Mar 2024 Xuansheng Wu, Haiyan Zhao, Yaochen Zhu, Yucheng Shi, Fan Yang, Tianming Liu, Xiaoming Zhai, Wenlin Yao, Jundong Li, Mengnan Du, Ninghao Liu

Therefore, in this paper, we introduce Usable XAI in the context of LLMs by analyzing (1) how XAI can benefit LLMs and AI systems, and (2) how LLMs can contribute to the advancement of XAI.

G-SciEdBERT: A Contextualized LLM for Science Assessment Tasks in German

1 code implementation9 Feb 2024 Ehsan Latif, Gyeong-Geon Lee, Knut Neumann, Tamara Kastorff, Xiaoming Zhai

The advancement of natural language processing has paved the way for automated scoring systems in various languages, such as German (e. g., German BERT [G-BERT]).

Language Modelling Large Language Model

Can generative AI and ChatGPT outperform humans on cognitive-demanding problem-solving tasks in science?

no code implementations7 Jan 2024 Xiaoming Zhai, Matthew Nyaaba, Wenchao Ma

We compared the performance of ChatGPT and GPT-4 on 2019 NAEP science assessments with students by cognitive demands of the items.

Gemini Pro Defeated by GPT-4V: Evidence from Education

no code implementations27 Dec 2023 Gyeong-Geon Lee, Ehsan Latif, Lehong Shi, Xiaoming Zhai

This study compared the classification performance of Gemini Pro and GPT-4V in educational settings.

Image Classification Question Answering +1

Knowledge Distillation of LLM for Automatic Scoring of Science Education Assessments

no code implementations26 Dec 2023 Ehsan Latif, Luyang Fang, Ping Ma, Xiaoming Zhai

We compared accuracy with state-of-the-art (SOTA) distilled models, TinyBERT, and artificial neural network (ANN) models.

Knowledge Distillation Mathematical Reasoning

Collaborative Learning with Artificial Intelligence Speakers (CLAIS): Pre-Service Elementary Science Teachers' Responses to the Prototype

no code implementations20 Dec 2023 Gyeong-Geon Lee, Seonyeong Mun, Myeong-Kyeong Shin, Xiaoming Zhai

This research aims to demonstrate that AI can function not only as a tool for learning, but also as an intelligent agent with which humans can engage in collaborative learning (CL) to change epistemic practices in science classrooms.

speech-recognition Speech Recognition

Multimodality of AI for Education: Towards Artificial General Intelligence

no code implementations10 Dec 2023 Gyeong-Geon Lee, Lehong Shi, Ehsan Latif, Yizhu Gao, Arne Bewersdorff, Matthew Nyaaba, Shuchen Guo, Zihao Wu, Zhengliang Liu, Hui Wang, Gengchen Mai, Tiaming Liu, Xiaoming Zhai

This paper presents a comprehensive examination of how multimodal artificial intelligence (AI) approaches are paving the way towards the realization of Artificial General Intelligence (AGI) in educational contexts.

Automatic Scoring of Students' Science Writing Using Hybrid Neural Network

no code implementations2 Dec 2023 Ehsan Latif, Xiaoming Zhai

We also have observed that HNN is x2 more efficient in training and inferencing than BERT and has comparable efficiency to the lightweight but less accurate Naive Bayes model.

regression

Applying Large Language Models and Chain-of-Thought for Automatic Scoring

no code implementations30 Nov 2023 Gyeong-Geon Lee, Ehsan Latif, Xuansheng Wu, Ninghao Liu, Xiaoming Zhai

We found a more balanced accuracy across different proficiency categories when CoT was used with a scoring rubric, highlighting the importance of domain-specific reasoning in enhancing the effectiveness of LLMs in scoring tasks.

Few-Shot Learning Prompt Engineering +1

NERIF: GPT-4V for Automatic Scoring of Drawn Models

no code implementations21 Nov 2023 Gyeong-Geon Lee, Xiaoming Zhai

The results of this study show that utilizing GPT-4V for automatic scoring of student-drawn models is promising.

Few-Shot Learning

Using GPT-4 to Augment Unbalanced Data for Automatic Scoring

no code implementations25 Oct 2023 Luyang Fang, Gyeong-Geon Lee, Xiaoming Zhai

Notably, we found that a varying amount of augmented data (20%-40%) was needed to obtain stable improvement for automatic scoring.

Data Augmentation Language Modelling +1

Fine-tuning ChatGPT for Automatic Scoring

no code implementations16 Oct 2023 Ehsan Latif, Xiaoming Zhai

In this study, we fine-tuned GPT-3. 5 on six assessment tasks with a diverse dataset of middle-school and high-school student responses and expert scoring.

Language Modelling

Elucidating STEM Concepts through Generative AI: A Multi-modal Exploration of Analogical Reasoning

no code implementations21 Aug 2023 Chen Cao, Zijian Ding, Gyeong-Geon Lee, Jiajun Jiao, Jionghao Lin, Xiaoming Zhai

Our study demonstrates the potential of applying large language models to educational practice on STEM subjects.

AGI: Artificial General Intelligence for Education

no code implementations24 Apr 2023 Ehsan Latif, Gengchen Mai, Matthew Nyaaba, Xuansheng Wu, Ninghao Liu, Guoyu Lu, Sheng Li, Tianming Liu, Xiaoming Zhai

AGI, driven by the recent large pre-trained models, represents a significant leap in the capability of machines to perform tasks that require human-level intelligence, such as reasoning, problem-solving, decision-making, and even understanding human emotions and social interactions.

Decision Making Fairness

Context Matters: A Strategy to Pre-train Language Model for Science Education

no code implementations27 Jan 2023 Zhengliang Liu, Xinyu He, Lei Liu, Tianming Liu, Xiaoming Zhai

However, the ideal type of data to contextualize pre-trained language model and improve the performance in automatically scoring student written responses remains unclear.

Language Modeling Language Modelling

Matching Exemplar as Next Sentence Prediction (MeNSP): Zero-shot Prompt Learning for Automatic Scoring in Science Education

1 code implementation20 Jan 2023 Xuansheng Wu, Xinyu He, Tianming Liu, Ninghao Liu, Xiaoming Zhai

Developing models to automatically score students' written responses to science problems is critical for science education.

Sentence

Pseudo AI Bias

no code implementations14 Oct 2022 Xiaoming Zhai, Joseph Krajcik

Pseudo Artificial Intelligence bias (PAIB) is broadly disseminated in the literature, which can result in unnecessary AI fear in society, exacerbate the enduring inequities and disparities in access to and sharing the benefits of AI applications, and waste social capital invested in AI research.

Cannot find the paper you are looking for? You can Submit a new open access paper.