Welcome!

In this course, we’ll study the theory, design, and implementation of text-based and Web-based information retrieval systems. By the end of the semester you will be able to:

  1. Define and explain the key concepts and models relevant to information storage and retrieval, including efficient text indexing and compression, boolean, vector space, and XML retrieval models, relevance feedback, document clustering and text categorization, Web search, including crawling, indexing, and link-based algorithms like PageRank, and so on.

  2. Design, implement, and evaluate the core algorithms underlying a fully functional IR system, including the indexing, retrieval, and ranking components, as well as advanced algorithms like document clustering and text categorization.

  3. Distinguish between the traditional IR problem domain (its underlying assumptions, approaches, and techniques) and the Web IR problem domain.

  4. Identify the salient features and apply recent research results in information storage and retrieval, including collaborative filtering, adversarial information retrieval, question answering, and social information management.

CPSC 670: Information Storage and Retrieval

Spring 2008

Texas A&M University

Dept of Computer Science



Course Meetings

HRBB 113

T/Th 11:10am-12:25pm



Teaching Staff


Instructor

James Caverlee


TA

Paul Bogen



Office Hours

Caverlee: T/Th 10am-11am

Bogen: W 1pm-3pm


Grading Scheme

  1. 5% Class Participation

  2. 15% Midterm Exam

  3. 20% Final Exam

  4. 30% Homework(10/10/10)

  5. 30% Project (10/10/10)

 

Photo courtesy of netzkobold.