Charles Explorer logo
🇬🇧

The InterCorp corpus, release 8

Publication

Abstract

The core of the Czech part of the InterCorp parallel corpus release 8 includes 7 mil. more words than the previous release, and 4 mil. more words in the collections part, namely from the Project Syndicate and VoxEurop sites for 2013-2014. Metadata were added or corrected for hundreds of texts in the core.