FPF:UBKKBK0049 Natural Language Processing I - Course Information
UBKKBK0049 Natural Language Processing I
Faculty of Philosophy and Science in OpavaWinter 2021
- Extent and Intensity
- 0/0/0. 4 credit(s). Type of Completion: z (credit).
- Teacher(s)
- Mgr. Daniel Valenta, Ph.D. (lecturer)
- Guaranteed by
- Mgr. Daniel Valenta, Ph.D.
Institute of The Czech Language and Library Science – Faculty of Philosophy and Science in Opava - Course Enrolment Limitations
- The course is also offered to the students of the fields other than those the course is directly associated with.
- fields of study / plans the course is directly associated with
- Library Science (programme FPF, KNIH-bc)
- Course objectives
- In the introductory part, students get acquainted with the basic concepts of formalized natural language processing such as grammar, semantics, pragmatics, vocabulary. From the application areas, the emphasis is on the automatic text indexing and the linguistic problems involved (recognition, lemmatization and grammatical analysis of the words and multi-word terms, evaluation of semantic relations among them).
- Learning outcomes
- Students will be:
- knowledgeable in the basic terminology and formalisms
- able to define and describe basic terms such as grammar, semantics, pragmatics, vocabulary
- describe and solve the problems of morphology, homonymy, homophony, homography and further linguistic problems - Syllabus
- 1. General background and context. Lexicon, grammar, semantics (definitions of the terms and their mutual relationships).
- 2. Overview of the main application areas (automatic indexing, automatic thesaurus generation, automatic referencing, database/robot/expert system communication, etc., machine and computer-aided translation, data/knowledge bases filling, automated text correction). Connection with other computer science fields.
- 3. Linguistic problems of automatic text indexing. Recognition of the terms and determining the level of their relevance.
- 4. Solving the problem of morphology. Semantic relations among the terms and possibilities of their use. The problem of homonymy.
- 5. Automatization of the creation and maintenance of the thesaurus. Thesaurus as the data structure (implementation of the thesaurus by a suitable type of the database system).
- 6. Automatization of the acquisition of the relevant lexicon. Automatization of finding semantic relationships among the terms.
- Teaching methods
- Interactive lecture, tutorial
- Assessment methods
- Credit:
Active participation at the tutorials min. 75%, pass the written test. - Language of instruction
- Czech
- Further comments (probably available only in Czech)
- Information on the extent and intensity of the course: 8 hod/sem.
- Enrolment Statistics (Winter 2021, recent)
- Permalink: https://is.slu.cz/course/fpf/winter2021/UBKKBK0049