RNC News

The Old East Slavic corpus was expanded by more than 31 thousand tokens. The update includes, in particular, such literary texts as The Tale of the Destruction of  Rus  and Zadonshchina, as well as official documents: The Church Statute of Prince Yaroslav and legal acts (gramoty) of the 13th-15th centuries from Ukraine, Moldova, Lithuanian-Belarusian lands, Smolensk, Novgorod, Pskov and Moscow. The corpus' vocabulary has been expanded by almost a thousand lexemes, including earlier references to such modern words as чемодан ‘suitcase’, таможенник ‘customs officer’ and странствие ‘wandering’.

The Similar Words widget has appeared in the Word at a glance in the Old Russian Corpus. As in other corpora where the widget is available, the closest semantic associates of a word are generated automatically. The model used to search for associated words within the Old East Slavic corpus, as well as the updated vector space models for the Middle Russian corpus, are available for downloading in the RNC Neural network models section.

Show all