Fine-grained Structure-based News Genre Categorization
Journalists usually organize and present the contents of a news article following a well-defined structure. In this work, we propose a new task to categorize news articles based on their content presentation structures, which is beneficial for various NLP applications. We first define a small set of news elements considering their functions (e.g., \textit{introducing the main story or event, catching the reader{'}s attention} and \textit{providing details}) in a news story and their writing style (\textit{narrative} or \textit{expository}), and then formally define four commonly used news article structures based on their selections and organizations of news elements. We create an annotated dataset for structure-based news genre identification, and finally, we build a predictive model to assess the feasibility of this classification task using structure indicative features.
PDF Abstract