Search Results for author: Zichen Fan

Found 5 papers, 1 papers with code

ConSmax: Hardware-Friendly Alternative Softmax with Learnable Parameters

no code implementations31 Jan 2024 Shiwei Liu, Guanchen Tao, Yifei Zou, Derek Chow, Zichen Fan, Kauna Lei, Bangfei Pan, Dennis Sylvester, Gregory Kielian, Mehdi Saligane

Compared to state-of-the-art Softmax hardware, ConSmax results in 14. 5x energy and 14. 0x area savings with a comparable accuracy on a GPT-2 model and the WikiText103 dataset.

Language Modelling Large Language Model

Efficient Computation Sharing for Multi-Task Visual Scene Understanding

1 code implementation ICCV 2023 Sara Shoouri, Mingyu Yang, Zichen Fan, Hun-Seok Kim

Solving multiple visual tasks using individual models can be resource-intensive, while multi-task learning can conserve resources by sharing knowledge across different tasks.

Multi-Task Learning Scene Understanding

Cannot find the paper you are looking for? You can Submit a new open access paper.