A System for Syntactic Annotation of Large Czech Corpora

Publication at Faculty of Arts |

2013

Abstract

We present a system of pre-processing and post-processing of linguistic data leading to an improvement of stochastic dependency parsing results. We (( condense }} the data for the stochastic parser, i.e. we reduce the variability of word lemmas and forms in the text.

After the parsing is done, we correct some of the recurrent parsing errors with a rule-based correction system. We achieve a 10,8% relative error reduction.

Keywords

syntax corpus corpus annotation stochastic parsing dependency parsing syntactic annotation

A System for Syntactic Annotation of Large Czech Corpora

Abstract

Keywords

Person