Decipherment
11 papers with code • 0 benchmarks • 1 dataset
Most implemented papers
Style Transfer from Non-Parallel Text by Cross-Alignment
We demonstrate the effectiveness of this cross-alignment method on three tasks: sentiment modification, decipherment of word substitution ciphers, and recovery of word order.
A Probabilistic Formulation of Unsupervised Text Style Transfer
Across all style transfer tasks, our approach yields substantial gains over state-of-the-art non-generative baselines, including the state-of-the-art unsupervised machine translation techniques that our approach generalizes.
An open dataset for oracle bone script recognition and decipherment
Oracle bone script, one of the earliest known forms of ancient Chinese writing, presents invaluable research materials for scholars studying the humanities and geography of the Shang Dynasty, dating back 3,000 years.
Unsupervised Text Style Transfer using Language Models as Discriminators
Binary classifiers are often employed as discriminators in GAN-based unsupervised style transfer systems to ensure that transferred sentences are similar to sentences in the target domain.
Decipherment of Historical Manuscript Images
European libraries and archives are filled with enciphered manuscripts from the early modern period.
A Grounded Unsupervised Universal Part-of-Speech Tagger for Low-Resource Languages
We also show extrinsically that incorporating our POS tagger into a name tagger leads to state-of-the-art tagging performance in Sinhalese and Kinyarwanda, two languages with nearly no labeled POS data available.
Sign Clustering and Topic Extraction in Proto-Elamite
We describe a first attempt at using techniques from computational linguistics to analyze the undeciphered proto-Elamite script.
Neural Decipherment via Minimum-Cost Flow: from Ugaritic to Linear B
In this paper we propose a novel neural approach for automatic decipherment of lost languages.
Phonetic and Visual Priors for Decipherment of Informal Romanization
Informal romanization is an idiosyncratic process used by humans in informal digital communication to encode non-Latin script languages into Latin character sets found on common keyboards.
Deciphering Undersegmented Ancient Scripts Using Phonetic Prior
We evaluate the model on both deciphered languages (Gothic, Ugaritic) and an undeciphered one (Iberian).