| INFO2049-1 | ||||||||||||||
| Web and Text Analytics | ||||||||||||||
|
Duration :
|
||||||||||||||
| 30h Th | ||||||||||||||
|
Number of credits :
|
||||||||||||||
|
||||||||||||||
|
Lecturer :
|
||||||||||||||
| Ashwin Ittoo | ||||||||||||||
|
Language(s) of instruction :
|
||||||||||||||
| English language | ||||||||||||||
|
Organisation and examination :
|
||||||||||||||
| Teaching in the first semester, review in January | ||||||||||||||
|
Units courses prerequisite and corequisite :
|
||||||||||||||
| Prerequisite or corequisite units are presented within each program | ||||||||||||||
|
Learning unit contents :
|
||||||||||||||
| In recent years, we have witnessed the proliferation of data in text format.
The challenge now lies in how to automatically process these huge text collections in order to detect and extract meaningful information from their contents? For e.g. business organizations can extract user opinions from online reviews to improve product or service quality. Objective This course will therefore introduce students to key techniques/algorithms in text/data mining and machine learning. Some topics of natural language processing will also be covered. This course has both a strong theoretical and practical components. The practicals will be done mostly using R and scikit-learn (Python). Structure 1. Vector Space Model and Information Retrieval
2. Basic Text Pre-processing
3. Feature Selection
4. Decision Trees for Text Classification
5. Naïve-Bayes for Text Classification
6.Support Vector Machines (SVM) for Text Classification
7. Language Models
9. Introduction to Neural Language Models and Deep Learning
Depending on available time (and overalll class progress), we may be able to only cover 1 or 2 chapters of chapters 7, 8, 9 Practial Sessions In the practical, students will be taught how to perform basic text mining tasks programmatically, including:
Furthermore, the students are expect to work on a practical project, which will count for ~30-40% of the final grade. These projects will be comprehensive in the sense that they will encompass many of the different aspects taught in the lectures/practicals. Sample project topics include text classification or opinion mining. Projects will be executed in groups of 3 students. |
||||||||||||||
|
Learning outcomes of the learning unit :
|
||||||||||||||
|
||||||||||||||
|
Prerequisite knowledge and skills :
|
||||||||||||||
Students should have a good background:
|
||||||||||||||
|
Planned learning activities and teaching methods :
|
||||||||||||||
| The course carries 5 credits and therefore requires 150 hours of work (1 credit = 30 hours).
Theory lectures = 18-22 hours
|
||||||||||||||
|
Mode of delivery (face-to-face ; distance-learning) :
|
||||||||||||||
|
||||||||||||||
|
Recommended or required readings :
|
||||||||||||||
|
||||||||||||||
|
Assessment methods and criteria :
|
||||||||||||||
| Final written exam: 55%
Final practical project: 35% Practical Exercise: 10% |
||||||||||||||
|
Work placement(s) :
|
||||||||||||||
|
Organizational remarks :
|
||||||||||||||
|
Contacts :
|
||||||||||||||
| Ashwin Ittoo
ashwin.ittoo@ulg.ac.be |
||||||||||||||
|
Items online :
|
||||||||||||||
![]() | Lecture Notes Lecture Notes |
|||||||||||||