no code implementations • 15 May 2025 • Shuchen Guo, Yun Wang, Jichao Yu, Xuansheng Wu, Bilgehan Ayik, Field M. Watts, Ehsan Latif, Ninghao Liu, Lei Liu, Xiaoming Zhai
This study investigated potential scoring biases and disparities toward English Language Learners (ELLs) when using automatic scoring systems for middle school students' written responses to science assessments.
no code implementations • 20 Apr 2025 • Luyang Fang, Xiaowei Yu, Jiazhang Cai, Yongkai Chen, Shushan Wu, Zhengliang Liu, Zhenyuan Yang, Haoran Lu, Xilin Gong, Yufang Liu, Terry Ma, Wei Ruan, Ali Abbasi, Jing Zhang, Tao Wang, Ehsan Latif, Wei Liu, Wei zhang, Soheil Kolouri, Xiaoming Zhai, Dajiang Zhu, Wenxuan Zhong, Tianming Liu, Ping Ma
Despite substantial progress, open challenges remain in preserving emergent reasoning and linguistic diversity, enabling efficient adaptation to continually evolving teacher models and datasets, and establishing comprehensive evaluation protocols.
no code implementations • 12 Mar 2025 • Luyang Fang, Ehsan Latif, Haoran Lu, Yifan Zhou, Ping Ma, Xiaoming Zhai
The improvements in micro F1 and per-label accuracy were statistically significant compared to GPT-o1-based merging (p=0. 04, p=0. 01).
1 code implementation • 12 Mar 2025 • Ehsan Latif, Xiaoming Zhai
Data privacy remains a critical concern in educational research, requiring strict adherence to ethical standards and regulatory protocols.
no code implementations • 12 Jan 2025 • Jie Yang, Ehsan Latif, Yuze He, Xiaoming Zhai
These findings demonstrate the effectiveness of LLMs in automatic scoring within a Chinese context and emphasize the importance of linguistic features and reasoning complexity in fine-tuning scoring models for educational assessments.
no code implementations • 30 Dec 2024 • Ehsan Latif, Xiaoming Zhai
The integration of Artificial Intelligence (AI) in education requires scalable and efficient frameworks that balance performance, adaptability, and cost.
no code implementations • 7 Dec 2024 • Ehsan Latif, Yifan Zhou, Shuchen Guo, Lehong Shi, Yizhu Gao, Matthew Nyaaba, Arne Bewerdorff, Xiantong Yang, Xiaoming Zhai
For scientific reasoning, it achieved near-perfect performance (mean = 0. 99, SD = 0. 12) on the TOSLS,, exceeding the highest human scores of 0. 85, SD = 0. 13 (z = 1. 78).
no code implementations • 11 Nov 2024 • Shuchen Guo, Ehsan Latif, Yifan Zhou, Xuan Huang, Xiaoming Zhai
This study investigates the use of generative AI and multi-agent systems to provide automatic feedback in educational contexts, particularly for student constructed responses in science assessments.
no code implementations • 11 Oct 2024 • Ehsan Latif, Yifan Zhou, Shuchen Guo, Yizhu Gao, Lehong Shi, Matthew Nayaaba, Gyeonggeon Lee, Liang Zhang, Arne Bewersdorff, Luyang Fang, Xiantong Yang, Huaqin Zhao, Hanqi Jiang, Haoran Lu, Jiaxi Li, Jichao Yu, Weihang You, Zhengliang Liu, Vincent Shung Liu, Hui Wang, Zihao Wu, Jin Lu, Fei Dou, Ping Ma, Ninghao Liu, Tianming Liu, Xiaoming Zhai
This study evaluates OpenAI o1-preview's ability to perform higher-order cognitive tasks across 14 dimensions, including critical thinking, systems thinking, computational thinking, design thinking, metacognition, data literacy, creative thinking, abstract reasoning, quantitative reasoning, logical reasoning, analogical reasoning, and scientific reasoning.
1 code implementation • 8 Jul 2024 • Siva Krishna Ravipati, Ehsan Latif, Ramviyas Parasuraman, Suchendra M. Bhandarkar
Classification of different object surface material types can play a significant role in the decision-making algorithms for mobile robots and autonomous vehicles.
no code implementations • 4 Jul 2024 • Xuansheng Wu, Padmaja Pravin Saraf, Gyeonggeon Lee, Ehsan Latif, Ninghao Liu, Xiaoming Zhai
Specifically, we prompt LLMs to generate analytic rubrics that they use to assign scores and study the alignment gap with human grading rubrics.
1 code implementation • 9 Feb 2024 • Ehsan Latif, Gyeong-Geon Lee, Knut Neumann, Tamara Kastorff, Xiaoming Zhai
The advancement of natural language processing has paved the way for automated scoring systems in various languages, such as German (e. g., German BERT [G-BERT]).
no code implementations • 27 Dec 2023 • Gyeong-Geon Lee, Ehsan Latif, Lehong Shi, Xiaoming Zhai
This study compared the classification performance of Gemini Pro and GPT-4V in educational settings.
no code implementations • 26 Dec 2023 • Ehsan Latif, Luyang Fang, Ping Ma, Xiaoming Zhai
We compared accuracy with state-of-the-art (SOTA) distilled models, TinyBERT, and artificial neural network (ANN) models.
no code implementations • 10 Dec 2023 • Gyeong-Geon Lee, Lehong Shi, Ehsan Latif, Yizhu Gao, Arne Bewersdorff, Matthew Nyaaba, Shuchen Guo, Zihao Wu, Zhengliang Liu, Hui Wang, Gengchen Mai, Tiaming Liu, Xiaoming Zhai
This paper presents a comprehensive examination of how multimodal artificial intelligence (AI) approaches are paving the way towards the realization of Artificial General Intelligence (AGI) in educational contexts.
no code implementations • 2 Dec 2023 • Ehsan Latif, Xiaoming Zhai
We also have observed that HNN is x2 more efficient in training and inferencing than BERT and has comparable efficiency to the lightweight but less accurate Naive Bayes model.
no code implementations • 30 Nov 2023 • Gyeong-Geon Lee, Ehsan Latif, Xuansheng Wu, Ninghao Liu, Xiaoming Zhai
We found a more balanced accuracy across different proficiency categories when CoT was used with a scoring rubric, highlighting the importance of domain-specific reasoning in enhancing the effectiveness of LLMs in scoring tasks.
no code implementations • 16 Oct 2023 • Ehsan Latif, Xiaoming Zhai
In this study, we fine-tuned GPT-3. 5 on six assessment tasks with a diverse dataset of middle-school and high-school student responses and expert scoring.
1 code implementation • 22 Jun 2023 • Ehsan Latif, Ramviyas Parasuraman
The availability of accurate localization is critical for multi-robot exploration strategies; noisy or inconsistent localization causes failure in meeting exploration objectives.
1 code implementation • 26 May 2023 • Ehsan Latif, WenZhan Song, Ramviyas Parasuraman
The results demonstrate significantly higher coverage accuracy and efficiency while reducing costs and overlaps even in high packet loss and low communication range scenarios.
no code implementations • 24 Apr 2023 • Ehsan Latif, Gengchen Mai, Matthew Nyaaba, Xuansheng Wu, Ninghao Liu, Guoyu Lu, Sheng Li, Tianming Liu, Xiaoming Zhai
AGI, driven by the recent large pre-trained models, represents a significant leap in the capability of machines to perform tasks that require human-level intelligence, such as reasoning, problem-solving, decision-making, and even understanding human emotions and social interactions.