Russian National Corpus

This website contains a corpus of the modern Russian language incorporating over 150 million words. The corpus of Russian is a reference system based on a collection of Russian texts in electronic form.

The Corpus is intended for everyone interested in the Russian language and associated fields: professional linguists, language teachers, school and university students, and foreigners learning the language.

News

June 15, 2010
A new version of the deeply annotated corpus of Russian texts, SynTagRus, has been uploaded.

November 18, 2009
The RNC was awarded the special prize of the electronic media competition «Impeccable command of the Russian language in professional activity».

November 18, 2009
The website of the RAS Institute of the Russian Language now hosts four dictionaries based on the RNC: the Grammatical dictionary of Russian neologisms, the New Russian frequency dictionary, the Combinatory dictionary of Russian intensifiers, and the Verbal combinatory dictionary of Russian abstract nouns.

November 18, 2009
A new version of the deeply annotated corpus of Russian texts, SynTagRus, has been uploaded. Compared with the previous version, the corpus has been supplemented with 88 contemporary articles in the popular-science, economic, and political genres, published in Russian newspapers, journals, or magazines in 2007–2008. In addition, certain errors have been detected and corrected. SynTagRus now contains 41,187 tagged sentences.

November 2, 2009
The Educational gateway of the RNC is now available.

November 2, 2009
The poetic corpus has been updated with 18th–19th century texts, including many poetae minores of the 1790s–1830s. A list of the authors is available, with links to their subcorpora.

February 26, 2009
In the Main Corpus, words within idiomatic expressions and beyond them are now searchable. An Advanced Semantic Search is available that allows the user to look for the main and peripheral senses of a word and take into account (partial) word-sense disambiguation.

February 25, 2009
The parallel corpus has been updated: a German-Russian corpus is now available via the common search form for the parallel corpora.

January 12, 2009
The spoken and accentological corpora have been updated. There are now about 4.45 million tokens in the accentological corpus and about 7.8 million tokens in the spoken corpus.

December 25, 2008
The main and poetic corpora have been updated. There are now more than 3 million tokens in the poetic corpus, the 18th-century texts now total 2.6 million tokens, and the texts of the 1900–1950 period have been expanded to 40 million.

December 8, 2008
The English-Russian and Russian-English parallel corpora are searchable again, now on the main site of the RNC and with standardized markup.

November 25, 2008
The English interface for searching and customizing subcorpora is now available for three major corpora (main, spoken, and syntactic).

November 10, 2008
The English search interface for the main subcorpus of the RNC is now available.

October 24, 2008
The Historical Accentological Corpus is now searchable (Russian interface only).

October 3, 2008
The Dictionary of compound lexical units is now available (Russian interface only).

March 26, 2008
The Corpus of Spoken Russian is now searchable.

March 18, 2008
The Deeply Annotated Corpus is now searchable.

March 17, 2008
Welcome to the English webpage of the Russian National Corpus.

Russian National Corpus
© 2003–2010
info@ruscorpora.ru