Toolbox
  • Printable version
 
Toolbox
LANGUAGES
Language
Personal tools
Categories
Wikipedia Affiliate Button
 

WikiWord thesis

From BrightByte

Jump to: navigation, search

My diploma thesis about a system to automatically build a multilingual thesaurus from wikipedia, "WikiWord", is finally done. I handed it in yesterday. My research will hopefully help to make Wikipedia more accessible for automatic processing, especially for applications natural languae processing, machine translation and information retrieval. What this could mean for Wikipedia is: better search and conceptual navigation, tools for suggesting categories, and more.

Here's the thesis (in German; Some key parts are available in English: Outline of a method for building a multilingual thesaurus from Wikipedia. Also, see WikiWord for more information):

Daniel Kinzler, Automatischer Aufbau eines multilingualen Thesaurus durch Extraktion semantischer und lexikalischer Relationen aus der Wikipedia, Diplomarbeit an der Abteilung für Automatische Sprachverarbeitung, Institut für Informatik, Universität Leipzig, 2008.

The thesis ended up being rather large... 220 pages thesis and 30k lines of code. To get a quick impression, read pages 26-31, they contain a good overview. I'm plannign to write a research paper in english soon, which will give an overview over WikiWord and what it can be used for.

The thesis is licensed under the GFDL, WikiWord is GPL software. All data taken or derived from wikipedia is GFDL.

(no comments yet)