1 code implementation • 19 Sep 2024 • Yi Cui
This paper presents a case study of coding tasks by the latest reasoning models of OpenAI, i. e. o1-preview and o1-mini, in comparison with other frontier models.
Ranked #1 on Code Generation on WebApp1K-React
1 code implementation • 8 Sep 2024 • Yi Cui
This paper presents insights from evaluating 16 frontier large language models (LLMs) on the WebApp1K benchmark, a test suite designed to assess the ability of LLMs to generate web application code.
Ranked #3 on Code Generation on WebApp1K-React
no code implementations • 7 Aug 2024 • Yi Cui, Désiré Kédagni, Huan Wu
We introduce two causal parameters: the local average treatment-controlled direct effect (LATCDE), and the local average instrument-controlled direct effect (LAICDE).
1 code implementation • 30 Jul 2024 • Yi Cui
We introduce WebApp1K, a practical code-generation benchmark to measure LLM ability to develop web apps.
1 code implementation • 16 May 2024 • Yi Cui, Yao Li, Jayson R. Miedema, Sharon N. Edmiston, Sherif Farag, J. S. Marron, Nancy E. Thomas
Automated region of interest detection in histopathological image analysis is a challenging and important topic with tremendous potential impact on clinical practice.
no code implementations • 2 Nov 2022 • Le Xie, Tong Huang, Xiangtian Zheng, Yan Liu, Mengdi Wang, Vijay Vittal, P. R. Kumar, Srinivas Shakkottai, Yi Cui
The transition towards carbon-neutral electricity is one of the biggest game changers in addressing climate change since it addresses the dual challenges of removing carbon emissions from the two largest sectors of emitters: electricity and transportation.
no code implementations • 29 Oct 2022 • Yi Cui, Yao Li, Jayson R. Miedema, Sherif Farag, J. S. Marron, Nancy E. Thomas
Even though we test the experiments on the skin tumor dataset, our work could also be extended to other medical image detection problems, such as various tumors' classification and prediction, to help and benefit the clinical evaluation and diagnosis of different tumors.
no code implementations • 21 Jun 2022 • Yi Cui, Wenfeng Shen, Jian Zhang, Weijia Lu, Chuang Liu, Lin Sun, Si Chen
The generator in IDS-EBGAN is responsible for converting the original malicious network traffic in the training set into adversarial malicious examples.
no code implementations • 21 Oct 2020 • Shutang You, Yilu Liu, Hongyu Li, Shengyuan Liu, Kaiqi Sun, Yinfeng Zhao, Huangqing Xiao, Jiaojiao Dong, Yu Su, Weikang Wang, Yi Cui
Power grid data are going big with the deployment of various sensors.