RNC News
The Russian MultiPARC has been expanded and counts almost 300 thousand tokens. It now features Chekhov's play “Three Sisters” staged by four different theaters: Gorky Moscow Art
The parallel corpus was expanded by 3 million tokens. Half of this amount is accounted for by English-language non-fiction texts (popular science and journalistic). In addition, the
The collections in the Accentological and Spoken corpora were updated. We added transcripts of expert talks, oral memories, and everyday dialogic speech. These texts were recorded in
The East Slavic epigraphy corpus now features 86 newly described inscriptions dating from the 11th to the 14th centuries and originating from diverse locations including Lucca, Bethlehem,
The Birchbark letters corpus now features 19 documents from Novgorod and Staraya Russa, found in 2023. They contain more than 300 tokens. In addition, the texts and
Personal accounts are now available on the Corpus website.
Its main task is to enhance the users’ individual workflow. Now you can save queries (in any corpus)
The Old East Slavic corpus was expanded with new texts and grew by 43 thousand token. On the one hand, it includes later texts of the 14th