CS276
My reading notes for CS276: Information Retrieval and Web Search, Spring 2019 from Stanford University.
Mostly contains the summary of the chapters from Introduction to Information Retrieval, along with memos for certain lecture slides and other sources.
Note: To cope with mathematical notations, I've used Pandoc to convert all the Markdown notes into LaTeX pdfs. I think this is the easiest solution at the moment to view math formulas on GitHub. Please let me know if you think there's a better solution.
Week 01: Introduction and boolean retrieval
- IIR Chapter 01: Boolean Retrieval
- IIR Chapter 02: The term vocabulary and postings lists
Week 02: More on inverted index
- IIR Chapter 04: Index construction
- IIR Chapter 05: Index compression
Week 03: Tolerant retrieval, and vector space model
- IIR Chapter 03: Dictionaries and tolerant retrieval
- More on noisy channel
- IIR Chapter 06: Scoring, term weighting, and the vector space model
Week 04: Probabilistic Retrieval
- IIR Chapter 11: Probabilistic information retrieval
Week 05: IR system evaluation, and speedups for free text queries
- IIR Chapter 8: Evaluation in information retrieval
- IIR Chapter 7: Computing scores in a complete search system
Week 06: Text classification using Naive Bayes, kNN, and SVM
- IIR Chapter 13: Text classification and Naive Bayes
- IIR Chapter 14: Vector space classification
- IIR Chapter 15: Support vector machines and machine learning on documents (WIP)
Week 07: Word Embeddings
- Notes from Lecture 14: Distributed word representations for information retrieval
Week 09: Crawling and Question Answering
- Notes from Lecture 19: Question answering