A C-LSTM Neural Network for Text Classification

27 Nov 2015  ·  Chunting Zhou, Chonglin Sun, Zhiyuan Liu, Francis C. M. Lau ·

Neural network models have been demonstrated to be capable of achieving remarkable performance in sentence and document modeling. Convolutional neural network (CNN) and recurrent neural network (RNN) are two mainstream architectures for such modeling tasks, which adopt totally different ways of understanding natural languages. In this work, we combine the strengths of both architectures and propose a novel and unified model called C-LSTM for sentence representation and text classification. C-LSTM utilizes CNN to extract a sequence of higher-level phrase representations, and are fed into a long short-term memory recurrent neural network (LSTM) to obtain the sentence representation. C-LSTM is able to capture both local features of phrases as well as global and temporal sentence semantics. We evaluate the proposed architecture on sentiment classification and question classification tasks. The experimental results show that the C-LSTM outperforms both CNN and LSTM and can achieve excellent performance on these tasks.

PDF Abstract

Results from the Paper


Task Dataset Model Metric Name Metric Value Global Rank Result Benchmark
Sentiment Analysis SST-2 Binary classification C-LSTM Accuracy 87.8 # 69
Sentiment Analysis SST-5 Fine-grained classification C-LSTM Accuracy 49.2 # 21
Text Classification TREC-6 C-LSTM Error 5.4 # 10

Methods