Cross-Tagset Parsing Evaluation for Russian

Publication at Faculty of Mathematics and Physics


Cross-tagset parsing substitutes one annotation layer for another while processing data within a single language. Often, either the native tagger or the dependency parser used in the (pre-)annotation of the gold treebank is not available.

The cross-tagset approach allows one to annotate new texts using freely available tools or tools optimized for the user's needs. We evaluate the robustness of Russian dependency parsing with different morphological and syntactic tagsets in the input and output.

A qualitative analysis of errors shows that the cross-substitution of three morphological tagsets and two syntactic tagsets causes only a mild drop in performance.
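
To make the notion of a drop in performance concrete, the following is a minimal sketch of how two parsing runs could be compared against a gold annotation. It assumes the standard dependency-parsing metrics, unlabeled and labeled attachment score (UAS and LAS), which this abstract does not name. The function `attachment_scores`, the toy arcs, and the relation labels are hypothetical and are not taken from the evaluated treebank or tools.

```python
from typing import List, Tuple

# One token's analysis: (head index, dependency label). Head 0 denotes the root.
Arc = Tuple[int, str]


def attachment_scores(gold: List[Arc], predicted: List[Arc]) -> Tuple[float, float]:
    """Return (UAS, LAS) for one token-aligned pair of analyses.

    UAS counts tokens whose head is correct; LAS additionally requires
    the dependency label to match.
    """
    if len(gold) != len(predicted):
        raise ValueError("gold and predicted must cover the same tokens")
    if not gold:
        raise ValueError("cannot score an empty sequence")
    head_hits = sum(g[0] == p[0] for g, p in zip(gold, predicted))
    full_hits = sum(g == p for g, p in zip(gold, predicted))
    return head_hits / len(gold), full_hits / len(gold)


# Hypothetical toy data: a four-token sentence parsed twice, once with the
# native tagset as input and once with a substituted tagset.
gold = [(2, "nsubj"), (0, "root"), (2, "obj"), (3, "amod")]
native_run = [(2, "nsubj"), (0, "root"), (2, "obj"), (3, "amod")]
substituted_run = [(2, "nsubj"), (0, "root"), (2, "obl"), (2, "amod")]

for name, run in [("native", native_run), ("substituted", substituted_run)]:
    uas, las = attachment_scores(gold, run)
    print(f"{name}: UAS={uas:.2f} LAS={las:.2f}")
```

When the output syntactic tagset differs from that of the gold annotation, the labels would first have to be mapped to a common inventory before LAS is meaningful; UAS depends only on the heads.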