A reference set of collocation candidates extracted as surface bigrams from the Prague Depedency Treebank and annotated as collocational or non-collocational.