Totality - corpus of totalitarian language

Publication

Abstract

The corpus contains samples of news texts from the years 1952, 1969, and 1977, along with a set of ideological books. In total, the corpus comprises more than 15 million words.

The corpus is lemmatized and morphologically tagged, and every text is annotated with information about its source.