Charles Explorer logo
🇬🇧

Text mining as a subsidy for information organization: an exploratory study

Publication

Abstract

Six exploits are presented using the R package available for text mining. These text mining packages can be used to provide subsidies in the construction of subject headers, keywords and/or indexing terms for journal articles.

With textrank, slowraker and rapidraker packages, the coincidence between the keywords offered by the author of the document used as evidence reached 50%, but at the same time the packages offered complementary keywords as relevant subsidies to enrich the terminology focused on information retrieval. With the tm and udpipe packages, the coincidence between the keywords offered by the author of the document used as evidence reached 75%; likewise, both packages offered other perfectly pertinent keywords to enrich the terminology focused on information retrieval.

The only inappropriate package was the RKEA