Charles Explorer logo
🇬🇧

Building a Data Repository of Spontaneous Spoken Czech

Publication at Faculty of Arts |
2014

Abstract

The paper presents data repository of spontaneous spoken Czech, its design principles and practical solutions adopted during the data collection. The repository is designed as a representation of contemporary spontaneous spoken language used in informal, real-life situations on the area of the whole Czech Republic.

Therefore, it features manual annotation and broad regional coverage with large variety of speakers. The repository data contain both the audio recordings and their transcriptions manually aligned with time stamps.