2023-2024 / INFO0943-1

Textual corpora : analytical principles

Duration

30h Th

Number of credits

Master in ancient and modern languages and literatures (120 ECTS): 5 credits
Master in ancient languages and literatures : classics (120 ECTS): 5 credits
Master in linguistics (120 ECTS) (joint-degree programme): 5 credits
Master in ancient languages and literatures : Oriental studies (120 ECTS): 5 credits
Master in ancient languages and literatures : classics (60 ECTS): 5 credits

Lecturer

Dominique Longrée, Julien Perrez

Language(s) of instruction

French language

Organisation and examination

Teaching in the first semester, exam in January

Schedule

Schedule online

Prerequisite and corequisite units

Prerequisite or corequisite units are presented within each program

Learning unit contents

Introduction to the principles of collecting and annotating textual corpora: a historical overview of the development of corpus linguistics; definition of key concepts; discussion of different methods of corpus annotation (metadata, lemmatization, part-of-speech tagging, ...), using both semi-automatic and fully automatic tools and methods; presentation of techniques for transforming a corpus into a textual database; data mining and analysis.

Learning outcomes of the learning unit

The main objective of this course is to introduce first-year students of the Master's degree in Linguistics (specialised focus in automatic text processing and statistical analysis of textual data) to the principles of building and preparing corpora or textual databases, so that they can integrate these principles and techniques into further disciplinary research.

Prerequisite knowledge and skills

None.

Planned learning activities and teaching methods

Lectures and practical exercises

Mode of delivery (face to face, distance learning, hybrid learning)

1st semester

Recommended or required readings

Exam(s) in session

Any session

- In-person

written exam (open-ended questions) AND oral exam

- Remote

written exam (open-ended questions) AND oral exam

Written work / report


Additional information:

Written and oral examination.

Work placement(s)

Organisational remarks and main changes to the course

1h teaching with J. Perrez and 1h teaching with D. Longrée

Contacts

Julien.Perrez@ulg.ac.be
dominique.longree@ulg.ac.be

Association of one or more MOOCs

Items online

eCampus course