Welcome!
In this course, we’ll study the theory, design, and implementation of text-based and Web-based information retrieval systems. By the end of the semester you will be able to:
-
•Define and explain the key concepts and models relevant to information storage and retrieval, including efficient text indexing and compression, boolean, vector space, and XML retrieval models, relevance feedback, document clustering and text categorization, Web search, including crawling, indexing, and link-based algorithms like PageRank, and so on.
-
•Design, implement, and evaluate the core algorithms underlying a fully functional IR system, including the indexing, retrieval, and ranking components, as well as advanced algorithms like document clustering and text categorization.
-
•Distinguish between the traditional IR problem domain (its underlying assumptions, approaches, and techniques) and the Web IR problem domain.
-
•Identify the salient features and apply recent research results in information storage and retrieval, including collaborative filtering, adversarial information retrieval, question answering, and social information management.

