Charles Explorer logo
🇬🇧

If You Even Don't Have a Bit of Bible: Learning Delexicalized POS Taggers

Publication at Faculty of Mathematics and Physics |
2016

Abstract

Part-of-speech (POS) tagging is sometimes considered an almost solved problem in NLP. Standard supervised approaches often reach accuracy above 95% if sufficiently large hand-labeled training data are available (typically several hundred thousand tokens or more).

However, we still believe that it makes sense to study semi-supervised and unsupervised approaches.