UCJDAJK1 Corpus architecture and encoding

Faculty of Philosophy and Science in Opava
Summer 2021
Extent and Intensity
1/0/0. 0 credit(s). Type of Completion: dzk.
Teacher(s)
prof. Dr. Werner Wegstein (lecturer)
Guaranteed by
prof. Dr. Werner Wegstein
Institute of Foreign Languages – Faculty of Philosophy and Science in Opava
Course Enrolment Limitations
The course is also offered to students of fields other than those with which it is directly associated.
fields of study / plans the course is directly associated with
Course objectives
Students master modern technologies for structuring electronic texts (text encoding) and for building large linguistic data collections, and gain an understanding of the basic software and the associated linguistic tools.
Syllabus
  • 1. The XML standard
    2. 'TEI Guidelines for text encoding' (P4 or P5)
    3. Design, creation and processing of XML documents
    4. Principles of formatting: XSL/XSLT
    5. XML project design: monolingual and multilingual text corpora I-III
    6. Corpus architecture: Design, typology, format, annotation, archiving
    7. Corpus processing I: Statistical analysis of corpus data
    8. Corpus processing II: Concordancing and collocations
Language of instruction
German
Further Comments
The course can also be completed outside the examination period.
Teacher's information
Examination
Recommended reading:
Bradley, Neil: The XML Companion, second edition, Harlow: Pearson Education Ltd., 2000
Sperberg-McQueen, Michael and Lou Burnard. Guidelines for Electronic Text Encoding and Interchange, XML-compatible edition. XML conversion by Syd Bauman, Lou Burnard, Steven DeRose, and Sebastian Rahtz. TEI Consortium, 2001. Available as 'The XML Version of the TEI Guidelines (March 2002)' from http://www.tei-c.org/P4X/ together with tutorials and guides to local practice.
Text Tools and Corpora: Multext. Available from http://www.lpl.univ-aix.fr/projects/multext/
Tony McEnery and Andrew Wilson. Corpus Linguistics. Edinburgh: Edinburgh University Press, 1996 (2nd edition 2001)
John Sinclair. Corpus, Concordance, Collocation. OUP, 1991
Elena Tognini Bonelli. Corpus Linguistics at Work. Amsterdam: John Benjamins, 2001
Catherine N. Ball. Tutorial Concordances and Corpora. Available from: http://www.georgetown.edu/faculty/ballc/corpora/tutorial.html
http://bowland-files.lancs.ac.uk/monkey/ihe/linguistics/contents.htm
The course is also listed under the following terms: Summer 2008, Summer 2009, Summer 2010, Summer 2011, Summer 2012, Summer 2013, Summer 2014, Summer 2015, Summer 2016, Summer 2017, Summer 2018, Summer 2019, Summer 2020, Summer 2022.
  • Permalink: https://is.slu.cz/course/fpf/summer2021/UCJDAJK1