UMC005 English-Urdu

UMC005 English-Urdu is a sentence-aligned parallel corpus of English and Urdu texts. It can be used for experiments in statistical machine translation.

The texts come from four different sources:

  • Quran
  • Bible
  • Penn Treebank (Wall Street Journal)
  • EMILLE corpus
Source: UMC005 English-Urdu

License


  • Unknown
