Logo der HdM
Veranstaltungsbeschreibung

113444b Natural Language Processing

Zuletzt geändert:25.09.2017 / von Carlsburg
EDV-Nr:113444b
Studiengänge: Medieninformatik (Bachelor, 7 Semester), Prüfungsleistung im Modul Data Mining in Semester 3 4 6 7
Häufigkeit: nur WS
Dozent:
Sprache: Deutsch
Art: V
Umfang: 2 SWS
ECTS-Punkte: 3
Workload: Lecture:
15 x 2 SWS = 22.5 hours
Pre- and Postprocessing:
15 x 2 SWS = 22.5 hours
Exam preparation:
5 days, 8 hours/day = 40 hours
Overall Workload =85 hours
Prüfungsform:
Beschreibung:

Lecture 1 provides an overview on NLP, it's main goals, challenges and applications. Moreover in this lecture the overall structure of the course and it's goals are described

Lecture 2 is concerned with text preprocessing and normalisation. E.g. how to access data from the web, clean HTML, segment text in sentences and words (tokenization) and transform words to their baseforms (stemming and lemmatization). For this the most important encoding types are discussed. Moreover, the application of regular expressions in NLP is introduced and some popular stemmers are applied.

Lecture 3 introduces algorithms for text classification and methods to measure and evaluate performance of these algorithms

In Lecture 4 is concerned with the use of corpora and lexical resources in NLP.

Lecture 5is concerned with morphological- and lexical analysis, Part of Speech tagging and taggsets

Lecture 6 introduces Markov Chains and Hidden Markov Models and their application in NLP, e.g. for POS-Tagging.

Lecture 7 intoduces chunking (shallow parsing) and how this technique can be applied for Entity-Relaction Extraction. The application of chunking provides answers to questions like which objects are mentioned in a text ? and which relations between these objects are described in the text ?

Lectures 8 parsing techniques for syntactical analysis are presented. Context Free Grammers, their pros and cons and finally feature based grammars are introduced.

Lecture 9 introduces techniques for the extraction of meaning (semantics) from text (semantic analysis). Central in this lecture is the introduction of Lambda-Typen Logic and how the constituentes of natural language sentences can be transformed to this type of logic.

Literatur:
  1. S. Bird, E. Klein, E. Loper; Natural Language Processing with Python; O'Reilly, 2009
  2. C.D. Manning, H. Schütze; Foundations of Statistical Language Processing; MIT Press, 1999
  3. N. Indurkhya, F.J. Damerau (Editors); Handbook of Natural Language Processing; Chapman & Hall/ CRC, 2010


Weitere Literatur finden Sie in der HdM-Bibliothek.
Internet: http://www.hdm-stuttgart.de/~maucher/

Termine

Weitere Termine im HdM-Kalender

Studieninfotag 2017 in Outlook eintragen 22.11.2017 Studieninfotag 2017 mehr

Ringvorlesung zu aktuellen Themen aus der Welt der Medien in Outlook eintragen 22.11.2017 Ringvorlesung zu aktuellen Themen aus der Welt der Medien mehr

Games Day - 20 Jahre Games an der HdM in Outlook eintragen 23.11.2017  - 24.11.2017 Games Day - 20 Jahre Games an der HdM mehr

Kontaktstudium Design Thinking in Outlook eintragen 24.11.2017  - 25.11.2017 Kontaktstudium Design Thinking mehr

International Week in Outlook eintragen 27.11.2017  - 01.12.2017 International Week mehr

Innovation Day in Outlook eintragen 28.11.2017 Innovation Day mehr

34. EXIST-Workshop in Outlook eintragen 28.11.2017  - 29.11.2017 34. EXIST-Workshop mehr

TONI ERDMANN meets ProducersDAY@HdM  in Outlook eintragen 29.11.2017 TONI ERDMANN meets ProducersDAY@HdM mehr

Weitere Termine im HdM-Kalender

© Hochschule der Medien 2017 | Impressum | Hinweise zum Datenschutz Login