Weβre releasing highly optimized GPU kernels for an underexplored class of neural network architectures: networks with block-sparse weights. The kernels allow for efficient evaluation and differentiation of linear layers, including convolutional layers, with flexibly configurable block-sparsity patterns in the weight matrix... (read more)
PDFTASK | DATASET | MODEL | METRIC NAME | METRIC VALUE | GLOBAL RANK | COMPARE |
---|---|---|---|---|---|---|
Sentiment Analysis | CR | Block-sparse LSTM | Accuracy | 92.2 | # 1 | |
Sentiment Analysis | IMDb | Block-sparse LSTM | Accuracy | 94.99 | # 7 | |
Sentiment Analysis | SST-2 Binary classification | Block-sparse LSTM | Accuracy | 93.2 | # 14 | |
Sentiment Analysis | Yelp Binary classification | Block-sparse LSTM | Error | 3.27 | # 7 |