Search Results for author: Chang Gao

Found 27 papers, 12 papers with code

FreeEval: A Modular Framework for Trustworthy and Efficient Evaluation of Large Language Models

2 code implementations • 9 Apr 2024 • Zhuohao Yu, Chang Gao, Wenjin Yao, Yidong Wang, Zhengran Zeng, Wei Ye, Jindong Wang, Yue Zhang, Shikun Zhang

The rapid development of large language model (LLM) evaluation methodologies and datasets has led to a profound challenge: integrating state-of-the-art evaluation techniques cost-effectively while ensuring reliability, reproducibility, and efficiency.

Fairness Language Modelling +1

CodeShell Technical Report

no code implementations • 23 Mar 2024 • Rui Xie, Zhengran Zeng, Zhuohao Yu, Chang Gao, Shikun Zhang, Wei Ye

Through this process, we have curated 100 billion high-quality pre-training data from GitHub.

8k

Exploring Safety Generalization Challenges of Large Language Models via Code

no code implementations • 12 Mar 2024 • Qibing Ren, Chang Gao, Jing Shao, Junchi Yan, Xin Tan, Yu Qiao, Wai Lam, Lizhuang Ma

The rapid advancement of Large Language Models (LLMs) has brought about remarkable generative capabilities but also raised concerns about their potential misuse.

Code Completion

KIEval: A Knowledge-grounded Interactive Evaluation Framework for Large Language Models

2 code implementations • 23 Feb 2024 • Zhuohao Yu, Chang Gao, Wenjin Yao, Yidong Wang, Wei Ye, Jindong Wang, Xing Xie, Yue Zhang, Shikun Zhang

Automatic evaluation methods for large language models (LLMs) are hindered by data contamination, leading to inflated assessments of their effectiveness.

OpenDPD: An Open-Source End-to-End Learning & Benchmarking Framework for Wideband Power Amplifier Modeling and Digital Pre-Distortion

1 code implementation • 16 Jan 2024 • Yizhuo Wu, Gagan Deep Singh, Mohammadreza Beikmirza, Leo C. N. de Vreede, Morteza Alavi, Chang Gao

With the rise in communication capacity, deep neural networks (DNN) for digital pre-distortion (DPD) to correct non-linearity in wideband power amplifiers (PAs) have become prominent.

Benchmarking

Exploiting Symmetric Temporally Sparse BPTT for Efficient RNN Training

no code implementations • 14 Dec 2023 • Xi Chen, Chang Gao, Zuowen Wang, Longbiao Cheng, Sheng Zhou, Shih-Chii Liu, Tobi Delbruck

Implementing online training of RNNs on the edge calls for optimized algorithms for an efficient deployment on hardware.

Incremental Learning

StrategyLLM: Large Language Models as Strategy Generators, Executors, Optimizers, and Evaluators for Problem Solving

no code implementations • 15 Nov 2023 • Chang Gao, Haiyun Jiang, Deng Cai, Shuming Shi, Wai Lam

Most existing chain-of-thought (CoT) prompting methods suffer from the issues of generalizability and consistency, as they often rely on instance-specific solutions that may not be applicable to other cases and lack task-level consistency in their reasoning steps.

Math

Online Relocating and Matching of Ride-Hailing Services: A Model-Based Modular Approach

no code implementations • 13 Oct 2023 • Chang Gao, Xi Lin, Fang He, Xindi Tang

This study proposes an innovative model-based modular approach (MMA) to dynamically optimize order matching and vehicle relocation in a ride-hailing platform.

JsonTuning: Towards Generalizable, Robust, and Controllable Instruction Tuning

1 code implementation • 4 Oct 2023 • Chang Gao, Wenxuan Zhang, Guizhen Chen, Wai Lam

Instruction tuning has emerged as a crucial process for harnessing the capabilities of large language models (LLMs) by providing explicit task instructions, leading to improved performance in various tasks.

3ET: Efficient Event-based Eye Tracking using a Change-Based ConvLSTM Network

1 code implementation • 22 Aug 2023 • Qinyu Chen, Zuowen Wang, Shih-Chii Liu, Chang Gao

This paper presents a sparse Change-Based Convolutional Long Short-Term Memory (CB-ConvLSTM) model for event-based eye tracking, key for next-generation wearable healthcare technology such as AR/VR headsets.

Pupil Tracking

To Spike or Not To Spike: A Digital Hardware Perspective on Deep Learning Acceleration

1 code implementation • 27 Jun 2023 • Fabrizio Ottati, Chang Gao, Qinyu Chen, Giovanni Brignone, Mario R. Casu, Jason K. Eshraghian, Luciano Lavagno

The power efficiency of the biological brain outperforms any large-scale deep learning (DL) model; thus, neuromorphic computing tries to mimic brain operations, such as spike-based information processing, to improve the efficiency of DL models.

M3Exam: A Multilingual, Multimodal, Multilevel Benchmark for Examining Large Language Models

1 code implementation • NeurIPS 2023 • Wenxuan Zhang, Sharifah Mahani Aljunied, Chang Gao, Yew Ken Chia, Lidong Bing

M3Exam exhibits three unique characteristics: (1) multilingualism, encompassing questions from multiple countries that require strong multilingual proficiency and cultural knowledge; (2) multimodality, accounting for the multimodal nature of many exam questions to test the model's multimodal understanding capability; and (3) multilevel structure, featuring exams from three critical educational periods to comprehensively assess a model's proficiency at different levels.

Easy-to-Hard Learning for Information Extraction

1 code implementation • 16 May 2023 • Chang Gao, Wenxuan Zhang, Wai Lam, Lidong Bing

Information extraction (IE) systems aim to automatically extract structured information, such as named entities, relations between entities, and events, from unstructured texts.

Towards Generalizable and Robust Text-to-SQL Parsing

1 code implementation • 23 Oct 2022 • Chang Gao, Bowen Li, Wenxuan Zhang, Wai Lam, Binhua Li, Fei Huang, Luo Si, Yongbin Li

Text-to-SQL parsing tackles the problem of mapping natural language questions to executable SQL queries.

SQL Parsing Text-To-SQL

A Projection-Based K-space Transformer Network for Undersampled Radial MRI Reconstruction with Limited Training Subjects

no code implementations • 15 Jun 2022 • Chang Gao, Shu-Fu Shih, J. Paul Finn, Xiaodong Zhong

However, non-Cartesian trajectories such as the radial trajectory need to be transformed onto a Cartesian grid in each iteration of the network training, slowing down the training process and posing inconvenience and delay during training.

Data Augmentation MRI Reconstruction

UniGDD: A Unified Generative Framework for Goal-Oriented Document-Grounded Dialogue

1 code implementation • ACL 2022 • Chang Gao, Wenxuan Zhang, Wai Lam

The goal-oriented document-grounded dialogue aims at responding to the user query based on the dialogue context and supporting document.

Multi-Task Learning Response Generation +1

Skydiver: A Spiking Neural Network Accelerator Exploiting Spatio-Temporal Workload Balance

no code implementations • 14 Mar 2022 • Qinyu Chen, Chang Gao, Xinyuan Fang, Haitao Luan

Spiking Neural Networks (SNNs) are developed as a promising alternative to Artificial Neural Networks (ANNs) due to their more realistic brain-inspired computing models.

Image Segmentation Semantic Segmentation

Spiking Cochlea with System-level Local Automatic Gain Control

no code implementations • 14 Feb 2022 • Ilya Kiselev, Chang Gao, Shih-Chii Liu

The bandpass filter gain of a channel is adapted dynamically to the input amplitude so that the average output spike rate stays within a defined range.

regression

Spartus: A 9.4 TOp/s FPGA-based LSTM Accelerator Exploiting Spatio-Temporal Sparsity

no code implementations • 4 Aug 2021 • Chang Gao, Tobi Delbruck, Shih-Chii Liu

The pruned networks running on Spartus hardware achieve weight sparsity levels of up to 96% and 94% with negligible accuracy loss on the TIMIT and the Librispeech datasets.

Speech Recognition

Ranking Items in Large-Scale Item Search Engines with Reinforcement Learning

no code implementations • CUHK Course IERG5350 2020 • Chang Gao

Ranking items in large-scale item search engines such as Amazon and Taobao is a typical multi-step decision-making problem.

Decision Making reinforcement-learning +1

Recurrent Neural Network Control of a Hybrid Dynamic Transfemoral Prosthesis with EdgeDRNN Accelerator

no code implementations • 8 Feb 2020 • Chang Gao, Rachel Gehlhar, Aaron D. Ames, Shih-Chii Liu, Tobi Delbruck

Lower leg prostheses could improve the quality of life of amputees by increasing comfort and reducing the energy needed to locomote, but current control methods are limited in modulating behaviors based upon the human's experience.

EdgeDRNN: Enabling Low-latency Recurrent Neural Network Edge Inference

no code implementations • 22 Dec 2019 • Chang Gao, Antonio Rios-Navarro, Xi Chen, Tobi Delbruck, Shih-Chii Liu

This paper presents a Gated Recurrent Unit (GRU) based recurrent neural network (RNN) accelerator called EdgeDRNN designed for portable edge computing.

Edge-computing

ReCoNet: Real-time Coherent Video Style Transfer Network

8 code implementations • 3 Jul 2018 • Chang Gao, Derun Gu, Fangjun Zhang, Yizhou Yu

Image style transfer models based on convolutional neural networks usually suffer from high temporal inconsistency when applied to videos.

Semantic Segmentation Style Transfer +1

DeepSketch2Face: A Deep Learning Based Sketching System for 3D Face and Caricature Modeling

no code implementations • 7 Jun 2017 • Xiaoguang Han, Chang Gao, Yizhou Yu

This system has a labor-efficient sketching interface that allows the user to draw freehand, imprecise yet expressive 2D lines representing the contours of facial features.

Caricature
