no code implementations • 25 Nov 2024 • Fu-Chieh Chang, Pei-Yuan Wu
In this work, we propose that LLMs learn arithmetic by capturing algebraic structures, such as \emph{Commutativity} and \emph{Identity} properties.
no code implementations • 31 Oct 2024 • Fu-Chieh Chang, Yu-Ting Lee, Hui-Ying Shih, Pei-Yuan Wu
This work provides a theoretical framework for understanding the effectiveness of reinforcement learning on CoT reasoning and STaR.
1 code implementation • 19 Sep 2024 • Tzu-Lin Kuo, Feng-Ting Liao, Mu-Wei Hsieh, Fu-Chieh Chang, Po-chun Hsu, Da-Shan Shiu
In real-world applications with Large Language Models (LLMs), external retrieval mechanisms - such as Search-Augmented Generation (SAG), tool utilization, and Retrieval-Augmented Generation (RAG) - are often employed to enhance the quality of augmented generations in dialogues.
no code implementations • 22 Aug 2024 • Yen-Ru Lai, Fu-Chieh Chang, Pei-Yuan Wu
This situation highlights the importance of finding effective ways to use unlabelled data in offline RL, especially when labelled data is limited or expensive to obtain.
no code implementations • 27 Jun 2024 • Jia-Hau Bai, Chi-Ting Liu, Yu Wang, Fu-Chieh Chang, Pei-Yuan Wu
This study uses CAPM (Convex Adversarial Polytope for Maxpool-based CNN) to improve the verified bound for general purpose maxpool-based convolutional neural networks (CNNs) under bounded norm adversarial perturbations.
no code implementations • 1 Feb 2023 • Sing-Yuan Yeh, Fu-Chieh Chang, Chang-Wei Yueh, Pei-Yuan Wu, Alberto Bernacchia, Sattar Vakili
To the best of our knowledge, this is the first result showing a finite sample complexity under such a general model.
no code implementations • 13 Apr 2022 • Fu-Chieh Chang, Yu-Wei Tseng, Ya-Wen Yu, Ssu-Rui Lee, Alexandru Cioba, I-Lun Tseng, Da-Shan Shiu, Jhih-Wei Hsu, Cheng-Yuan Wang, Chien-Yi Yang, Ren-Chu Wang, Yao-Wen Chang, Tai-Chen Chen, Tung-Chieh Chen
Recently, successful applications of reinforcement learning to chip placement have emerged.
no code implementations • 29 May 2019 • Fu-Chieh Chang, Hao-Jen Wang, Chun-Nan Chou, Edward Y. Chang
Performing supervised learning from the data synthesized by using Generative Adversarial Networks (GANs), dubbed GAN-synthetic data, has two important applications.
no code implementations • 25 Jul 2017 • Chun-Nan Chou, Chuen-Kai Shie, Fu-Chieh Chang, Jocelyn Chang, Edward Y. Chang
Deep learning owes its success to three key factors: scale of data, enhanced models to learn representations from data, and scale of computation.