prachathai-67k

The prachathai-67k dataset was scraped from the news site Prachathai, excluding articles with fewer than 500 characters of body text (mostly images and cartoons). It contains 67,889 articles with 51,797 tags, published between August 24, 2004 and November 15, 2018.
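The length filter described above can be sketched as follows. This is only an illustration of the stated criterion, not the scraper actually used to build the dataset: the field name `body_text` and the helper names are hypothetical, and it assumes "characters" means Unicode code points as counted by Python's `len`, which the description does not confirm.

```python
MIN_BODY_CHARS = 500  # threshold stated in the dataset description


def keep_article(article: dict) -> bool:
    """Return True if the article body has at least MIN_BODY_CHARS characters.

    `body_text` is a hypothetical field name used for illustration only.
    """
    body = article.get("body_text", "")
    # len() counts Unicode code points, so Thai combining marks
    # (vowels and tone marks) are each counted separately.
    return len(body) >= MIN_BODY_CHARS


def filter_articles(articles):
    """Drop articles whose body text is shorter than the threshold."""
    return [a for a in articles if keep_article(a)]
```

Articles with exactly 500 characters are kept under this reading, since only those with fewer than 500 are excluded.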

Source: prachathai-67k

License


  • Unknown
