The Extraction of Terms Consisting of Several Words from Texts in Natural Languages Using the Syntactic Patterns

Publication

Abstract

Two problems arise when extracting terms consisting of several words using linguistic methods of text analysis: 1. A linguist typically has no skills in software systems development, yet is required to present linguistic knowledge in the form of software system fragments or constructions in a formal language. 2. Most software developers are not sufficiently qualified in linguistics.

Together, these problems create a semantic gap between the methods of linguistic analysis of texts and their software implementation.

The article presents an approach to extracting terms consisting of several words, based on syntactic patterns and tailored for a linguist. The proposed approach does not require the linguist to acquire additional skills or to use various formal languages to describe syntactic patterns.
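To illustrate the general idea of pattern-based extraction, the following is a minimal sketch and not the system described in the article. It assumes the text has already been tokenized and tagged with parts of speech; the tag names (ADJ, NOUN, and so on), the two patterns, the sample sentence, and the function name extract_terms are all hypothetical choices made for this example.

```python
from typing import List, Sequence, Tuple

# A token is a (word, part-of-speech tag) pair. The tag set is an assumption.
Token = Tuple[str, str]


def extract_terms(tokens: Sequence[Token],
                  patterns: Sequence[Sequence[str]]) -> List[str]:
    """Return every span of words whose tag sequence equals one of the patterns."""
    terms = []
    for start in range(len(tokens)):
        for pattern in patterns:
            end = start + len(pattern)
            if end > len(tokens):
                continue  # the pattern would run past the end of the sentence
            window = tokens[start:end]
            if all(tag == expected
                   for (_, tag), expected in zip(window, pattern)):
                terms.append(" ".join(word for word, _ in window))
    return terms


# Hypothetical syntactic patterns: adjective + noun, noun + noun.
PATTERNS = [("ADJ", "NOUN"), ("NOUN", "NOUN")]

# Hand-tagged sample sentence (illustrative data, not from the article).
SENTENCE = [
    ("the", "DET"), ("syntactic", "ADJ"), ("pattern", "NOUN"),
    ("describes", "VERB"), ("text", "NOUN"), ("analysis", "NOUN"),
]

print(extract_terms(SENTENCE, PATTERNS))
# → ['syntactic pattern', 'text analysis']
```

In this sketch a pattern is just a sequence of tags, so a linguist could in principle list patterns as data without writing matching logic; how the article's prototype actually represents patterns is not stated in the abstract.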

A prototype of the software system was developed. The system allows syntactic patterns to be described without knowledge of a formal language.

Moreover, unlike comparable systems, the developed system allows its syntactic patterns to be used in external systems for text analysis. The server of the prototype provides an interface for creating the syntactic patterns.