Introduction, basic concepts and principles
Boolean retrieval
Indexing
Vector space model
Evaluation in information retrieval
Query expansion
Probabilistic information retrieval
Language models for information retrieval
Text classification
Clustering
Web search
Near-duplicate detection
The course introduces modern algorithms and principles used in the field of information retrieval in large data collections. The students will gain practical knowledge and experience with experimentation and evaluation on real data.
A special focus is given to web search.