no code implementations • 17 Jan 2025 • Gansen Hu, Zhaoguo Wang, Jinglin Wei, Wei Huang, Haibo Chen
We propose TARDIS, which enables optimization of LLMs with non-linear activations by partially approximating them with linear functions in frequently occurring input ranges.
no code implementations • 24 Dec 2024 • Rongxin Cheng, Yifan Peng, Yuxin Lai, Xingda Wei, Rong Chen, Haibo Chen
The stateful nature of large language model (LLM) servingcan easily throttle precious GPU memory under load burstor long-generation requests like chain-of-thought reasoning, causing latency spikes due to queuing incoming requests.
no code implementations • 4 Dec 2024 • Yanqi Zhang, Yuwei Hu, Runyuan Zhao, John C. S. Lui, Haibo Chen
Large language models (LLMs) demonstrate exceptional performance but incur high serving costs due to substantial memory demands, with the key-value (KV) cache being a primary bottleneck.
1 code implementation • 10 Jun 2024 • Yixin Song, Haotong Xie, Zhengyan Zhang, Bo Wen, Li Ma, Zeyu Mi, Haibo Chen
By applying our neuron sparsification method to the Mistral and Mixtral models, only 2. 5 billion and 4. 3 billion parameters are activated per inference iteration, respectively, while achieving even more powerful model performance.
1 code implementation • 10 Jun 2024 • Zhenliang Xue, Yixin Song, Zeyu Mi, Xinrui Zheng, Yubin Xia, Haibo Chen
The storage engine provides a fine-grained pipeline mechanism that coordinates cluster-level computation and I/O operations, enhanced by a segmented neuron cache to reduce I/O activities.
no code implementations • 6 May 2024 • Rongxin Cheng, Yifan Peng, Xingda Wei, Hongrui Xie, Rong Chen, Sijie Shen, Haibo Chen
In this paper, we are the first to characterize the trade-off of performance and index size in existing SSD-based graph and cluster indexes: to improve throughput by 5. 7$\times$ and 1. 7$\times$, these indexes have to pay a 5. 8$\times$ storage amplification and 7. 7$\times$ with respect to the dataset size, respectively.
no code implementations • 13 Mar 2024 • Jiafu Chen, Wei Xing, Jiakai Sun, Tianyi Chu, Yiling Huang, Boyan Ji, Lei Zhao, Huaizhong Lin, Haibo Chen, Zhizhong Wang
3D scene stylization refers to transform the appearance of a 3D scene to match a given style image, ensuring that images rendered from different viewpoints exhibit the same style as the given style image, while maintaining the 3D consistency of the stylized scene.
no code implementations • 13 Mar 2024 • Tianyi Chu, Wei Xing, Jiafu Chen, Zhizhong Wang, Jiakai Sun, Lei Zhao, Haibo Chen, Huaizhong Lin
Given that many deterministic conditional image generative models have been able to produce high-quality yet fixed results, we raise an intriguing question: is it possible for pre-trained deterministic conditional image generative models to generate diverse results without changing network structures or parameters?
2 code implementations • 16 Dec 2023 • Yixin Song, Zeyu Mi, Haotong Xie, Haibo Chen
This paper introduces PowerInfer, a high-speed Large Language Model (LLM) inference engine on a personal computer (PC) equipped with a single consumer-grade GPU.
1 code implementation • 12 Sep 2023 • Haibo Chen, Lei Zhao, Jun Li, Jian Yang
To address this issue, we imitate the drawing process of humans and propose a Two-Stage Statistics-Aware Transformation (TSSAT) module, which first builds the global style foundation by aligning the global statistics of content and style features and then further enriches local style details by swapping the local statistics (instead of local features) in a patch-wise manner, significantly improving the stylization effects.
no code implementations • 15 May 2023 • Jinming Du, Yanqun Tang, Xizhang Wei, Jiaojiao Xiong, Jiajun Zhu, Haoran Yin, Chi Zhang, Haibo Chen
Integrated sensing and communication (ISAC) is considered as a promising solution for improving spectrum efficiency and relieving wireless spectrum congestion.
no code implementations • 28 Feb 2023 • Yu Zhou, Haoran Yin, Jiaojiao Xiong, Shiyu Song, Jiajun Zhu, Jinming Du, Haibo Chen, Yanqun Tang
In the high-mobility scenarios of next-generation wireless communication systems (beyond 5G/6G), the performance of orthogonal frequency division multiplexing (OFDM) deteriorates drastically due to the loss of orthogonality between the subcarriers caused by large Doppler frequency shifts.
1 code implementation • 28 Nov 2022 • Zhizhong Wang, Lei Zhao, Zhiwen Zuo, Ailin Li, Haibo Chen, Wei Xing, Dongming Lu
The style encoder, coupled with a modulator, encodes the style image into learnable dual-modulation signals that modulate both intermediate features and convolutional filters of the decoder, thus injecting more sophisticated and flexible style signals to guide the stylizations.
1 code implementation • 6 Dec 2021 • Zhizhong Wang, Lei Zhao, Haibo Chen, Ailin Li, Zhiwen Zuo, Wei Xing, Dongming Lu
In addition, we also introduce a novel learning-free view-specific texture reformation (VSTR) operation with a new semantic map guidance strategy to achieve more accurate semantic-guided and structure-preserved texture transfer.
1 code implementation • NeurIPS 2021 • Haibo Chen, Lei Zhao, Zhizhong Wang, Huiming Zhang, Zhiwen Zuo, Ailin Li, Wei Xing, Dongming Lu
Although existing artistic style transfer methods have achieved significant improvement with deep neural networks, they still suffer from artifacts such as disharmonious colors and repetitive patterns.
1 code implementation • OSDI 2021 • Erhu Feng, Xu Lu, Dong Du, Bicheng Yang, Xueqiang Jiang, Yubin Xia, Binyu Zang, Haibo Chen
Upon these two primitives, our system can scale to thousands of concurrent enclaves with high resource utilization and eliminate the high-cost initialization of secure memory using fork-style enclave creation without weakening the security guarantees.
no code implementations • CVPR 2021 • Haibo Chen, Lei Zhao, Zhizhong Wang, Huiming Zhang, Zhiwen Zuo, Ailin Li, Wei Xing, Dongming Lu
Artistic style transfer is an image editing task that aims at repainting everyday photographs with learned artistic styles.
no code implementations • 24 Feb 2021 • Haibo Chen, Xiansheng Dai, Mingqiang Liu
We construct a class of non-weight modules over the twisted $N=2$ superconformal algebra $\T$.
Representation Theory
no code implementations • 12 Feb 2021 • Jonas Oberhauser, Rafael Lourenco de Lima Chehab, Diogo Behrens, Ming Fu, Antonio Paolillo, Lilith Oberhauser, Koustubha Bhat, Yuzhong Wen, Haibo Chen, Jaeho Kim, Viktor Vafeiadis
Finally, in Sec.
Logic in Computer Science
no code implementations • 16 Jan 2021 • Zhizhong Wang, Lei Zhao, Haibo Chen, Zhiwen Zuo, Ailin Li, Wei Xing, Dongming Lu
Gram-based and patch-based approaches are two important research lines of style transfer.
no code implementations • ICCV 2021 • Haibo Chen, Lei Zhao, Huiming Zhang, Zhizhong Wang, Zhiwen Zuo, Ailin Li, Wei Xing, Dongming Lu
Image style transfer aims to transfer the styles of artworks onto arbitrary photographs to create novel artistic images.
1 code implementation • SOCC 2020 • Tianyi Yu, Qingyuan Liu, Dong Du, Yubin Xia, Bingyu Zang, Haibo Chen
This, however, also presents new challenges including how to efficiently design high-performance serverless platforms and how to efficiently program on the platforms.
no code implementations • 8 Aug 2020 • Zhiwen Zuo, Lei Zhao, Zhizhong Wang, Haibo Chen, Ailin Li, Qijiang Xu, Wei Xing, Dongming Lu
Multimodal image-to-image translation (I2IT) aims to learn a conditional distribution that explores multiple possible images in the target domain given an input image in the source domain.
no code implementations • ICLR 2020 • Zhiwen Zuo, Lei Zhao, Huiming Zhang, Qihang Mo, Haibo Chen, Zhizhong Wang, Ailin Li, Lihong Qiu, Wei Xing, Dongming Lu
Generative Adversarial Networks (GANs) have shown impressive results in modeling distributions over complicated manifolds such as those of natural images.
2 code implementations • CVPR 2020 • Zhizhong Wang, Lei Zhao, Haibo Chen, Lihong Qiu, Qihang Mo, Sihuan Lin, Wei Xing, Dongming Lu
Image style transfer is an underdetermined problem, where a large number of solutions can satisfy the same constraint (the content and style).
no code implementations • 4 Aug 2019 • Dong Cao, Lisha Xu, HaiBo Chen
This model can implement action recognition under zero-shot conditions, and has good recognition performance for untrimmed video data.
no code implementations • 19 Apr 2019 • Dong Cao, Dong-dong Zhang, Haibo Chen
In the process of construction, we propose a task-oriented hybrid construction method based on natural language generation algorithm.
no code implementations • 2 Feb 2019 • Chuzhe Tang, Zhiyuan Dong, Minjie Wang, Zhaoguo Wang, Haibo Chen
In this paper, we demonstrate that the missing consideration of access patterns and dynamic data distribution notably hinders the applicability of learned indexes.
no code implementations • 25 Mar 2016 • Wei Wang, Gang Chen, Haibo Chen, Tien Tuan Anh Dinh, Jinyang Gao, Beng Chin Ooi, Kian-Lee Tan, Sheng Wang
The other is scalability, that is the deep learning system must be able to provision for a huge demand of computing resources for training large models with massive datasets.