A Matrix model for XML Data

Publication at Faculty of Mathematics and Physics | 2005

Abstract

We start with a conventional vector space model and extend it with information about the structural properties of an XML data collection C. A term t is represented by a vector of weights, determined by the occurrences of t in the XML structure of C.

To reduce the length of this vector, a form of a DataGuide for C is considered, with a path as the structural unit. An XML document D is then represented by a matrix D of weights.
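
As a rough illustration of the idea, the following Python sketch builds one term-by-path matrix per document. It is only a sketch under stated assumptions, not the construction used in the publication: the weights are raw occurrence counts (the abstract does not specify a weighting scheme), the path summary is simply the set of distinct root-to-element label paths that carry text, and the names `path_term_counts` and `build_matrices` are hypothetical.

```python
import xml.etree.ElementTree as ET
from collections import Counter


def path_term_counts(xml_text):
    """Count (path, term) occurrences in one document.

    A path is the sequence of element labels from the root to the
    element that directly contains the term.
    """
    counts = Counter()

    def walk(elem, prefix):
        path = prefix + "/" + elem.tag
        # Text directly inside this element is attributed to its path.
        for term in (elem.text or "").lower().split():
            counts[(path, term)] += 1
        for child in elem:
            walk(child, path)
            # Text following a child still belongs to the current element.
            for term in (child.tail or "").lower().split():
                counts[(path, term)] += 1

    walk(ET.fromstring(xml_text), "")
    return counts


def build_matrices(collection):
    """Return (paths, terms, matrices) for a list of XML strings.

    Assumption: the distinct text-bearing label paths of the whole
    collection stand in for the DataGuide-style path summary.
    matrices[d][i][j] is the count of terms[i] under paths[j] in document d.
    """
    per_doc = [path_term_counts(doc) for doc in collection]
    paths = sorted({p for c in per_doc for p, _ in c})
    terms = sorted({t for c in per_doc for _, t in c})
    matrices = [[[c[(p, t)] for p in paths] for t in terms] for c in per_doc]
    return paths, terms, matrices


collection = [
    "<book><title>xml data</title><author>smith</author></book>",
    "<book><title>matrix model</title><chapter><title>xml</title></chapter></book>",
]
paths, terms, matrices = build_matrices(collection)

# Collection-level vector for one term: its counts per path, summed over documents.
row = terms.index("xml")
xml_vector = [sum(m[row][j] for m in matrices) for j in range(len(paths))]
```

Because each column is a distinct label path rather than an individual element occurrence, the vector for a term has one entry per path in the summary, which is what keeps it short. In this toy collection, "xml" occurs once under `/book/title` and once under `/book/chapter/title`, and the two occurrences fall into different columns.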