UPPSALA UNIVERSITY : Department of Linguistics and Philology : Eva Forsbom : Course-related material : Thesis-related stuff
Uppsala University
Skip links

Thesis-related stuff

Motivation

The purpose of my thesis is to investigate if and how some automated textlinguistic methods can give more relevant hits in information retrieval, and give coherent summaries that are more query and user adapted than those usually given in information systems.

A lexical cohesion analysis is used as a basis for indexing, searching and a short summary in an information system. The analysis is based on a number of knowledge bases containing linguistic or world knowledge, and the result will mainly depend on what knowledge is available.

By combining the lexical cohesion analysis with a Rhetorical Structure Theory analysis, it should be possible to come to terms with some coherence problems in summaries only based on lexical cohesion analysis. At the same time, the less computationally costly lexical cohesion analysis could reduce the number of possible RST analyses, since it also gives an estimate on how closely sentences are related.

Unfortunately, the RST part turned out to be too wieldy to fit into the thesis, and had to be put off to a later date.

Some of the resources developed for the thesis are also relevant for other projects, e.g. Text and language assessment of mathematics and science and SNK/BLARK - Svensk nationell korpus [Swedish National Corpus]/Basic LAnguage Resource Kit.

Lexical cohesion analysis

Some resources developed for using in lexical cohesion analysis, and related papers and presentations:

Rhetorical Structure Theory and Veins Theory

Some resources developed for using in RST and VT analysis, and related course papers: