HELFI: a Hebrew-Greek-Finnish Parallel Bible Corpus with Cross-Lingual Morpheme Alignment: An fine-grained and open morpheme-alignment between the most important 20th century Finnish Bible translation and the most relevant source texts.
ISLRM 840-665-876-625-0
Anssi Yli-Jyrä, Josi Purhonen, Matti Liljeqvist, Arto Antturi, Pekka Nieminen, Kari M. Räntilä, Valtter Luoto, authors. (2020). HELFI: a Hebrew-Greek-Finnish Parallel Bible Corpus with Cross-Lingual Morpheme Alignment Proceedings of the Twelfth International Conference on Language Resources and Evaluation (LREC 2020), Marseille, France, 11-16 May 2020. European Language Resources Association (ELRA). Authors' preprint
- Finnish1938 - The Finnish 1933/1938 Bible Translation with lemmas, morphology and cross-lingual morpheme alignment
- Greek1904 - The Nestle 1904 Greek New Testament with lemmas and morphology in a column-oriented file format containing token numbers for cross-lingual alignment, token transliterations and the Greek lemma numbers of the Finnish Ananlytical Bible Concordance.
- Hebrew1008 - The Leningrad Codex Hebrew Bible (the Tanach) with the OSHB lemmas and morphology in a column-oriented file format containing token numbers for cross-lingual alignment, token transliterations and the Hebrew/Aramaic lemma numbers of the Finnish Ananlytical Bible Concordance.
This corpus is under several licenses that cover its copyrighted parts. You must give appropriate credit, provide links to the licenses, and indicate if changes were made.
- CITING - How to cite or attribute the cross-lingual morpheme alignment
- LICENSES - The license and the parts that remain in the public domain or under their current restrictions
- ACKNOWLEDGEMENTS
- BUGS - The Known Bugs in the HELFI corpus
The HELFI corpus is maintained by Anssi Yli-Jyrä ([email protected]).