Duration
10h Th, 10h Pr
Number of credits
| Bachelor in biology | 2 crédits | |||
| Master in bio-informatics and modelling (120 ECTS) | 2 crédits | |||
| Master in biology of organisms and ecology (120 ECTS) | 2 crédits |
Lecturer
Language(s) of instruction
French language
Organisation and examination
Teaching in the second semester
Schedule
Units courses prerequisite and corequisite
Prerequisite or corequisite units are presented within each program
Learning unit contents
The course is a general introduction to the most often used methods in multivariate statistics (i.e. when one studies several variables simultaneously) in biology. The course entails the following chapters:
- Graphical display and statistical summary of multivariate data
- Multivariate exploratory techniques: principal component analysis, clustering, principal coordinates analysis
- Multiple regression and generalized linear models
Learning outcomes of the learning unit
The methods of multivariate data analysis are taught based on a pragmatic approach. At the end of the course, the sudent should be capable of
- defining a multivariate problem,
- analysing the data,
- interpreting the results.
He/she should also be aware of the limitations of application of the methods.
Prerequisite knowledge and skills
The students must have attended a basic course on descriptive and inferential statistics. The concepts of normal distribution, confidence interval and hypothesis tests are considered as known. Moreover, basic knowledge of the software R is expected.
The methods are presented without emphasizing the mathematical justifications. Nevertheless, the students must have the following background in mathematics: basic linear algebra (vectors, matrices, including the notions of determinant and inverses), linear, exponential and logarithmic functions.
Planned learning activities and teaching methods
Together with the ex-cathedra courses focusing on a theoretical approach, the students will be asked to apply the techniques following the learning process describred below:
- Personal preparation at home in order to get familiar with the script constructed by the professor and her assistants;
- Discussion on the script and the interpretation of the results
- Group discussion on data analyses
Mode of delivery (face-to-face ; distance-learning)
The course counts 20 hours of face-to-face teaching, 10 of which are devoted to ex-cathedra lectures for the theory. During the 10 hours of practicals, the students will first be invited to ask all the questions they have on the scripts and the intepretation of the results. Then, they will discuss in small groups in order to analyse some data. A brief correction will be detailed at the end of the practical, before being put on line.
Recommended or required readings
There are no lecture notes but the slides that will be used for the lectures will be available on eCampus in advance. Also, the scripts of the software R and the statement of the data analyses (and their corrections) will also be displayed on line.
The following textbook (available on-line from the web site of the libraries of ULiège) will be used for most parts of the course (PCA, association measures and principal coordinates analysis, multiple regression and generalized regression):
A.F. Zuur, E.N. Ieno et G.M. Smith, Analysing ecological data, Springer serie (statistics for biology and health)
Assessment methods and criteria
The examination consists in the analysis of some data with the software R. The focus in the marking will be on the interpretation of the results and the appropriate use of the techniques but some attention will also be given to the use of the software R.
During the exam, the students may either use their own laptop or a laptop of the computer room of the maths department.
Work placement(s)
Organizational remarks
The course is organised on the time slots indicated on Celcat. Two groups will be constructed for the practical sessions. The students whose group has an available laptop will have the practical session in a clasic classroom while the others will be invited to work in the computer room of the Maths Departement.
Contacts
Lecturer
Gentiane Haesbroeck
Département de Mathématique (B37, bureau 0/60)
Tél: 04/366.95.94
Email: G. Haesbroeck@ulg.ac.be
Assistant
Sophie Klemkenberg
Email: S.Klenkenberg@uliege.be
Adaptation of teaching commitments following the COVID-19 pandemic for the May-June 2020 session
Teaching methods implemented : distance-learning
Following the decision of ULiège to organise the teaching on-line, the organisation of the lectures and practicals has been adapted as follows:
- Lectures on theory (3 lectures out of 5 have been organised on-line): videos recorded by the professor have been put on line on eCampus, the day before the official date of the lecture at the latest. A forum has been activated on eCampus in order for the professor to answer the questions of the students, directly during the time slot of the lecture,
- As far as the practical are concerned, the exercise sheets were made available before the time slot of the practical and a forum has been activted in order to answer questions on the spot. A detailed written correction is put on eCampus at the end of the practical, togehter with a video explaining the main elements.
Remark: following the query of some students, the practical material (exercises and corrections) were put on line in advance for the practical sessions 3 and 4 scheduled after the spring holidays.
Assessment subjects
The evaluation consists on the application (with the statistical software R) of the techniques taught during the lectures. There is no theory.
The themes that will be considered are the following:
- Multivariate Exploratory Data Analyis and multiple confirmatory analysis
- Principal Component Analysis
- Cluster Analysis and Principal Coordinates Analysis
- Linear Models
- GLM (Logistic and Poisson models)
Assessment methods
The exam is scheduled "on-line" on 22 June.
The data will be put on eCampus on Friday 19 June and the exam questionnaire will be transmitted on 22 June at 9 am.
A document (either a photograph of a handwritten document or a pdf file of a word document) containing the answers to the questions will have to be up-loaded by 2pm on 22 June at the latest. The R code as well as the grphics (if those could not be included in the main document) will also need to be submitted within the same deadline.
A systematic check for plagiat and collaborative work will be made and such behavior will be sanctionned.
Contacts
G. Haesbroeck (G.Haesbroeck@uliege.be)
Adaptation of teaching commitments following the COVID-19 pandemic for the Aug-Sept 2020 session
Assessment subjects
The content is the same as in May/June.
Assessment methods
The same organisation as in May/June will be considered.