Using crowdsourcing system for creating site-specific statistical machine translation engine

19 Sep 2014 · Alexander Kalinin, George Savchenko ·

A crowdsourcing translation approach is an effective tool for globalization of site content, but it is also an important source of parallel linguistic data. For the given site, processed with a crowdsourcing system, a sentence-aligned corpus can be fetched, which covers a very narrow domain of terminology and language patterns - a site-specific domain. These data can be used for training and estimation of site-specific statistical machine translation engine

PDF Abstract