Dialect corpus
The dialect corpus contains recordings of dialectal speech from different regions of Russia, presented in both a phonologized and a loosely standardized orthography. The morphological, syntactic and lexical peculiarities of these texts are preserved. The subcorpus uses special tags for specifically dialectal morphological features (including those absent from the standard language); in addition, purely dialectal lexemes are supplied with commentary.