The Similar words widget displays the closest semantic associates of the word. The proximity coefficient of words, which can be seen by hovering the mouse over a word in the Word cloud, is calculated using distributive semantics models based on the actual materials of the main corpus of the RNC. The closer the coefficient value is to 1, the larger the word in the Word cloud is, and the more similar the contexts with this word should be to the contexts with the keyword.

The current version of Similar words works only in the Main, Media, Educational, and some other corpora and only shows semantic associates of the same part of speech for nouns, verbs, adjectives and adverbs. For proper names, toponyms, abbreviations and words that have non-standard spellings or are rarely found in the corpus, similar words are not displayed.
With "Similar Words" widget at the Main corpus users can now examine context-based associated words not just across the entire corpus but within specific historical periods. All texts within the Main Corpus (1700–2020s) have been divided into 11 time spans. Users can view similar words from a single time span, compare word clouds across two different time spans, download a screenshot of the results.

The widget is marked with a special sign "Generated by NeuroRNC". It means, the selection of associates is completely automatical and errors may occur in the lists, for example, incorrectly formed words or word associations that are not intuitively clear.