RNC News

The Word at a glance service in the Main Corpus has been enriched with data on word families. The new widget now shows families of cognate words. For now, this option is only available for words with a single root (e.g. стол, but not пароход) that are manually annotated within the morphemic analysis dictionary. Data on other words will appear in future, but even now you can see interesting connections between words.

As it is a custom already, you see a "Rate" button next to the new widget. Feel free to let us know if you notice any bugs. Thanks to your feedback, we keep improving the neurolinguistic models underlying the Word at a glance service. It is very interesting and important to us what you think about the first version of the word family model.

It has become possible to specify more precisely the conditions of lexico-grammatical search in the Main, National media and Regional corpora. One may set conditions on the distance between words in the search form. Until now, if the specified range included 0 (for example, from -1 to 1), a single token in the results could match both words specified. Now, at the top of the search form, you can select the "word matches excluded" option to remove the zero distance from the range. For example, you can find plural animate nouns conjoined with крестьяне ‘peasants’. Here is the resulting frequency list. Previously, a similar query would also find the word крестьяне alone, without its "neighbors" (since at the zero distance it matches all the conditions for a conjoined noun).

Subscribe to our Telegram channel to follow our updates and receive illustrated corpus instructions.

Show all