Deep Pyramid Convolutional Neural Networks for Text Categorization

ACL 2017  ·  Rie Johnson, Tong Zhang ·

This paper proposes a low-complexity word-level deep convolutional neural network (CNN) architecture for text categorization that can efficiently represent long-range associations in text. In the literature, several deep and complex neural networks have been proposed for this task, assuming availability of relatively large amounts of training data. However, the associated computational complexity increases as the networks go deeper, which poses serious challenges in practical applications. Moreover, it was shown recently that shallow word-level CNNs are more accurate and much faster than the state-of-the-art very deep nets such as character-level CNNs even in the setting of large training data. Motivated by these findings, we carefully studied deepening of word-level CNNs to capture global representations of text, and found a simple network architecture with which the best accuracy can be obtained by increasing the network depth without increasing computational cost by much. We call it deep pyramid CNN. The proposed model with 15 weight layers outperforms the previous best models on six benchmark datasets for sentiment classification and topic categorization.

PDF Abstract
Task Dataset Model Metric Name Metric Value Global Rank Benchmark
Sentiment Analysis Amazon Review Full DPCNN Accuracy 65.19 # 2
Sentiment Analysis Amazon Review Polarity DPCNN Accuracy 96.68 # 2
Sentiment Analysis Yelp Binary classification DPCNN Error 2.64 # 8
Sentiment Analysis Yelp Fine-grained classification DPCNN Error 30.58 # 6

Results from Other Papers


Task Dataset Model Metric Name Metric Value Rank Source Paper Compare
Text Classification AG News DPCNN Error 6.87 # 9
Text Classification DBpedia DPCNN Error 0.88 # 10

Methods


No methods listed for this paper. Add relevant methods here