Informationssökning, 7.5hp, HT 2011
Kurskod: 5LN440
Kursplan
Lärare: Jörg Tiedemann (JT), Martin Hassel (MH), Magnus Rosell (MR)
News
- Preliminary schedule for the presentations
- More information about presentations
- Ny version av Labbinstuktionerna för labb 1
- Exempeltenta
Preliminärt schema
Typ |
Datum |
Tid | Lokal |
Lärare |
Innehåll | att läsa |
|---|---|---|---|---|---|---|
| F1 | 2011-09-05 | 10-12 | Turing | JT | Introduction & overview | ch1, ch2 |
| F2 | 2011-09-07 | 10-12 | Turing | MH | IR basics & Link Analysis | ch6, ch8, ch21 |
| F3 | 2011-09-19 | 13-15 | Turing | MH | Web Crawlers & LSA/RI | ch18-ch20, MS |
| L1 | 2011-09-19 | 15-17 | Turing | MH | wget, LSA & RI | deadline: 2011-09-26 |
| F4 | 2011-09-22 | 10-12 | Turing | JT | Dictionaries & tolerant retrieval | ch3 |
| F5 | 2011-09-26 | 10-12 | Turing | JT | Ranked Retrieval | ch6, ch7, mittkursutvärdering |
| F6 | 2011-09-27 | 10-12 | Turing | MR | Clustering | ch16, ch17, MR |
| L3 | 2011-09-27 | 13-16 | Turing | MR | Clustering | |
| F7 | 2011-10-06 | 10-12 | Turing | MH | Text Extraction & Summarization | MH07 (ch 2-4 except 4.4-4.5) |
| L2 | 2011-10-06 | 13-15 | Turing | MH | stemming & regular expressions | |
| F8 | 2011-10-10 | 10-12 | Turing | JT | Text classification | ch13 |
| F9 | 2011-10-12 | 10-12 | Turing | JT | Question answering | JM 23.2, Bouma, Wikipedia |
| Seminar | 2011-10-24 | 10-12 | Turing | JT | Presentations | kursutvärdering |
| Seminar | 2011-10-26 | 10-12 | Turing | JT | Presentations | kursutvärdering |
| Tenta | 2011-11-04 | 8-12 | Bergs- brunnag. 15, sal 1 |
Tentamen |
F=föreläsning, L=laboration.
Chomsky = datasal 9-2043. Turing = datasal 9-2042
Examination
Examination sker genom obligatoriska laborationer och inlämningsuppgifter (måste bli godkända), en muntlig presentation (sista föreläsningstillfälle, 25% av betyget) och en skriftlig tentamen (75% av betyget). Both parts will get a score between 1 and 10 and the average score will be used to determine the final grade for this course. For G you need to get a score of 6 or more and for VG a score of 8 or more. In order to obtain VG for the presentation you need to show a deep understanding of the topic you're presenting and you are required to present it in a pedagogical way that makes it easy for your fellow students to understand this topic.
Kurslitteratur
- Christopher D. Manning, Prabhakar Raghavan and Hinrich Schütze, Introduction to Information Retrieval, Cambridge University Press. 2008. Valda avsnitt. Finns på nätet: länk
- [MS] Magnus Sahlgren: An Introduction to Random Indexing
- [MH07] Martin Hassel: Resource Lean and Portable Automatic Text Summarization, Doctoral Thesis, ch 2-4
- Wikipedia: Question answering
- Gosse Bouma, Ismail Fahmi, Jori Mur, Gertjan van Noord, Lonneke van der Plas, and Jörg Tiedemann: Linguistic Knowledge and Question Answering. In Traitement Automatique des Langues (TAL), 2005/3
Ytterligare material kan tillkomma.
Topics for the presentation (individual or 2 students)
- Claudia Ehrentraut: Index construction (ch4) 24/10
- Max Epstein: Index compression (ch5)
- Arvid Lindahl: Evaluation in IR (ch8)
- Jonathan Karlstrand: Relevance feedback & Query expansion (ch9)
- Amanda Österholm: XML retrieval (ch10)
- Jenny Swedberg: Probabilistic IR & LM for IR (ch11 & ch12)
- Daniel Lindmark: Vector space classification (ch14) 26/10
- Fredrik Norlind: SVM's & machine learning (ch15)
- Your own topic (talk to me first)
Preliminary schedule:
2010-10-24, 10-12 10:15-10:30 Claudia Ehrentraut: Index construction (ch4) 10:35-10:50 Max Epstein: Index compression (ch5) 10:55-11:10 Amanda Österholm: XML retrieval (ch10) 11:15- Kursutvärdering 2010-10-26, 10-12 10:15-10:30 Jonathan Karlstrand: Relevance feedback & Query expansion (ch9) 10:35-10:50 Jenny Swedberg: Probabilistic IR & LM's (ch11,12) 10:55-11:10 Daniel Lindmark: Vector space classification (ch14) 11:15-11:30 Fredrik Norlind: SVM's & machine learning (ch15) 11:35-11:50 Arvid Lindahl: Evaluation in IR (ch8) 11:50- Kursutvärdering
Länkar
- Web Data Mining, Exploring Hyperlinks, Contents and Usage Data. Bing Liu, Springer, December, 2006
- Text REtrieval Conference
- Cross-Language Evaluation
Forum (CLEF)
- Random Indexing
- PhD thesis. Magnus Rosell, 2009: "Text Clustering Exploration - Swedish Text Representation and Clustering Results Unraveled"
