Parallel dynamic batch loading in the M-tree

Publication at Faculty of Mathematics and Physics | 2009

Abstract

Although metric access methods (MAMs) have proved capable of efficient similarity search, their performance needs further improvement because of the extreme growth of data volumes. Since multi-core processors have become widely available, it is justified to exploit parallelism.

However, in light of Gustafson’s law, it is necessary to find tasks suitable for parallelization. M-tree construction could be such a task.
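
For reference, Gustafson’s law is commonly stated as below. The symbols are notation assumed here, not taken from the paper: N is the number of processors, s is the fraction of the parallel run time spent in serial code, and S(N) is the scaled speedup.

\[
S(N) = s + (1 - s)\,N = N - s\,(N - 1)
\]

Under this reading, a task is attractive for parallelization when its serial fraction s stays small as the workload grows, which is the property the abstract looks for in index construction.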

Unfortunately, parallelism within a single object insertion in a hierarchical index structure is limited by node capacity. Running several independent insertions in parallel is much less restrictive.

In this paper, we present a new technique for M-tree construction. The technique postpones the splitting of overfull nodes and thus allows simple parallelization of the construction.
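
The idea of postponing splits can be illustrated with a minimal sketch. It is not the authors' algorithm and not a real M-tree: it assumes a single flat level of leaf nodes, one-dimensional objects with absolute difference as the metric, and a naive sort-and-halve split. All names (`Leaf`, `CAPACITY`, `insert`, `split`, `load_batch`) are hypothetical. It only shows why deferring splits lets independent insertions run concurrently: while a batch is loading, the set of nodes and their pivots does not change, so each insertion needs only a short per-node lock.

```python
from concurrent.futures import ThreadPoolExecutor
from threading import Lock

CAPACITY = 4  # hypothetical node capacity


class Leaf:
    """A toy leaf node: a routing pivot plus the objects assigned to it."""

    def __init__(self, pivot):
        self.pivot = pivot
        self.objects = []
        self.lock = Lock()


def dist(a, b):
    # Assumed metric for the sketch: absolute difference of numbers.
    return abs(a - b)


def insert(leaves, obj):
    # Pivots are fixed during the batch, so choosing a leaf needs no lock.
    leaf = min(leaves, key=lambda node: dist(node.pivot, obj))
    with leaf.lock:
        # The leaf may grow beyond CAPACITY; the split is postponed.
        leaf.objects.append(obj)


def split(leaf):
    # Naive split used only for illustration: sort, halve, and recurse
    # until every resulting leaf respects CAPACITY.
    if len(leaf.objects) <= CAPACITY:
        return [leaf]
    objs = sorted(leaf.objects)
    mid = len(objs) // 2
    result = []
    for part in (objs[:mid], objs[mid:]):
        child = Leaf(pivot=part[len(part) // 2])
        child.objects = part
        result.extend(split(child))
    return result


def load_batch(leaves, batch, workers=4):
    # Phase 1: independent insertions run in parallel, no splits happen.
    with ThreadPoolExecutor(max_workers=workers) as pool:
        list(pool.map(lambda obj: insert(leaves, obj), batch))
    # Phase 2: overfull leaves are split after the whole batch is in place.
    new_leaves = []
    for leaf in leaves:
        new_leaves.extend(split(leaf))
    return new_leaves
```

For example, `load_batch([Leaf(0.0), Leaf(100.0)], [float(i) for i in range(50)])` returns a list of leaves that together hold all 50 objects, none above `CAPACITY`. In CPython, threads will not speed up this CPU-bound toy because of the global interpreter lock; the sketch shows only the structure of the two phases.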

Our experiments confirm that the new technique significantly speeds up M-tree construction and also improves the quality of the index.