The national media corpus was inaugurated in 2010 and includes articles from the media since 1983 (Argumenty i Fakty newspaper) until 2021. Significant amounts of available digitalized media texts that are of great interest for monitoring real-time linguistic changes (for example, how the word smartfon appears and becomes habitual in Russian, or how the preposition po increases in usage) cannot be fully included into the main corpus, as this would distort its representativeness with regard of both genre and chronology. There is no such limitation for the separate media corpus. It is the largest subcorpus of the RNC, exceeding the main corpus and approaching the mark of 1 billion word uses.
The national media corpus includes texts from several media, both printed newspapers and digital editions, in roughly equal amounts. The corpus is being updated on an annual basis. Several dozens of millions of words are added every year.