Charles Explorer logo
🇬🇧

Morphological annotation of social media corpora with reference to its reliability for linguistic research

Publication

Abstract

This paper presents the results of the study devoted to the applicability of SOTA methods for morphological corpus annotation (based on GramEval2020) for analytical sociolinguistic research. The study shows that statistically successful technologies of morphosyntactic annotation for such purposes create a number of problems for researchers if they are used purely i.e. without any linguistic knowledge.

In this paper, methods for improving the morphological annotation, successfully implemented in GICR, from the point of view of its reliability are presented.