We start with a convenient vector space model and extend it with information about the structural properties of an XML data collection C. We represent a term t by a vector of weights according to the occurrences of t in the XML structure of C.
To reduce the length of this vector, we consider a form of DataGuide for C in which a path serves as the structural unit. An XML document D is then represented by a matrix D of weights.
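
The representation can be illustrated with a minimal Python sketch. It rests on several assumptions that are not taken from the text: the weight is a raw term count, a structural unit is the root-to-element label path, the path summary is built from a single document rather than from the whole collection C, and terms are obtained by lowercasing and splitting on whitespace. The names `label_paths` and `term_path_matrix` are hypothetical.

```python
import xml.etree.ElementTree as ET
from collections import Counter


def label_paths(elem, prefix=""):
    """Yield (path, text) for each element; path is its root-to-element label path."""
    path = f"{prefix}/{elem.tag}"
    # Only the text directly inside the element is used; tail text is ignored.
    yield path, (elem.text or "")
    for child in elem:
        yield from label_paths(child, path)


def term_path_matrix(xml_string):
    """Return (terms, paths, matrix), where matrix[i][j] is the count of terms[i] under paths[j]."""
    root = ET.fromstring(xml_string)
    counts = Counter()
    all_paths = set()
    for path, text in label_paths(root):
        all_paths.add(path)  # each distinct label path appears once, as in a path summary
        for term in text.lower().split():
            counts[(term, path)] += 1
    terms = sorted({term for term, _ in counts})
    paths = sorted(all_paths)
    # Assumed weighting: raw term frequency per path (the text does not fix a scheme).
    matrix = [[counts[(term, path)] for path in paths] for term in terms]
    return terms, paths, matrix


doc = (
    "<book><title>XML retrieval</title>"
    "<chapter><title>vector model</title><p>XML vector</p></chapter></book>"
)
terms, paths, matrix = term_path_matrix(doc)
for term, row in zip(terms, matrix):
    print(term, row)
```

Each row of the resulting matrix is the weight vector of one term, and each column corresponds to one distinct path. Because repeated elements with the same label path share a column, the vectors are no longer than the number of distinct paths, whatever the number of element instances.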