1 code implementation • 17 Nov 2024 • Jintao Zhang, Haofeng Huang, Pengle Zhang, Jia Wei, Jun Zhu, Jianfei Chen
Second, we propose a method to smooth $Q$ and $V$, enhancing the accuracy of attention with INT4 $QK$ and FP8 $PV$.
1 code implementation • 17 Oct 2024 • Jintao Zhang, Mingyue Cheng, Xiaoyu Tao, Zhiding Liu, Daoyu Wang
Time series forecasting is vital in numerous web applications, influencing critical decision-making across industries.
1 code implementation • 3 Oct 2024 • Jintao Zhang, Jia Wei, Haofeng Huang, Pengle Zhang, Jun Zhu, Jianfei Chen
Although quantization has proven to be an effective method for accelerating model inference, existing quantization methods primarily focus on optimizing the linear layer.
no code implementations • 17 Sep 2024 • Mingyue Cheng, Jintao Zhang, Zhiding Liu, Chunli Liu, Yanhu Xie
Intraoperative hypotension (IOH) prediction using Mean Arterial Pressure (MAP) is a critical research area with significant implications for patient outcomes during surgery.
no code implementations • 30 May 2024 • Pengyu Jie, Wanquan Liu, Chenqiang Gao, Yihui Wen, Rui He, Weiping Wen, Pengcheng Li, Jintao Zhang, Deyu Meng
Fully-supervised deep learning methods achieve promising performance with pixel-level annotations but impose a significant annotation burden on experts.
no code implementations • 24 Mar 2024 • Hongfu Guo, Wencheng Zou, Zeyu Zhang, Shuishan Zhang, Ruitong Wang, Jintao Zhang
Manifold regularization model is a semi-supervised learning model that leverages the geometric structure of a dataset, comprising a small number of labeled samples and a large number of unlabeled samples, to generate classifiers.
no code implementations • 21 Sep 2023 • Riko I Made, Jing Lin, Jintao Zhang, Yu Zhang, Lionel C. H. Moh, Zhaolin Liu, Ning Ding, Sing Yang Chiam, Edwin Khoo, Xuesong Yin, Guangyuan Wesley Zheng
Battery health assessment and recuperation play a crucial role in the utilization of second-life Li-ion batteries.
6 code implementations • arXiv 2019 • Jintao Zhang
Therefore, designing lightweight networks with low memory requirement and computational cost is one of the most practical solutions for face verification on mobile platform.
Ranked #2 on Lightweight Face Recognition on AgeDB-30
2 code implementations • 9 May 2019 • Jintao Zhang
In this paper, we are interested in boosting the representation capability of convolution neural networks which utilizing the inverted residual structure.
no code implementations • 9 Nov 2018 • Hongyang Jia, Yinqi Tang, Hossein Valavi, Jintao Zhang, Naveen Verma
Chip measurements show an energy efficiency of 152/297 1b-TOPS/W and throughput of 4. 7/1. 9 1b-TOPS (scaling linearly with the matrix/input-vector element precisions) at VDD of 1. 2/0. 85V.