| INFO2049-1 | ||
| Web and Text Analytics | ||
|
Durée :
|
||
| 30h Th | ||
|
Nombre de crédits :
|
||
|
Nom du professeur :
|
||
| Ashwin Ittoo | ||
|
Langue(s) du cours :
|
||
| Langue anglaise | ||
|
Organisation et évaluation :
|
||
| Enseignement au deuxième quadrimestre | ||
|
Unités d'enseignement prérequises et corequises :
|
||
| Les unités prérequises ou corequises sont présentées au sein de chaque programme | ||
|
Contenus du cours :
|
||
| In recent years, we have witnessed the proliferation of data in text format. Typical examples include messages on social networks like Facebooks, tweets on Twitter about important topics and events (such as political campaigns and elections) and users opinions on products, brands and companies available from review sites (such as Amazon).
Buried within these huge volumes of texts are meaningful information nuggets, which if detected and extracted can be exploited to support a wide range of activities, especially Business Intelligence. For instance, business organizations use the opinion of users on products from review sites to improve their brand images, products and services. However, the challenge lies in how to automatically process these huge text collections in order to detect and extract meaningful information from their contents? Objective This course is intended to overcome the aforementioned challenge in text processing and analysis. To this aim, it will introduce students to key techniques/algorithms in text/data mining and machine learning. Some topics of natural language processing will also be covered. This course has both a strong theoretical and practical components. The practicals will be done mostly using R. The NLTK Python or Weka Java toolkits may also be used depending on the task at hand. Structure 1. Text Data Represenation
A. Introduction to Opinion Mining and Sentiment Analysis B. Part-of-Speech Tagging and Syntactic Parsing C. Text Classification Practial Sessions In the practical, students will be taught how to perform basic text mining tasks programmatically, including:
Furthermore, the students are expect to work on a practical project, which will count for ~30-40% of the final grade. These projects will be comprehensive in the sense that they will encompass many of the different aspects taught in the lectures/practicals. Sample project topics include text classification or opinion mining. Projects will be executed in groups of 3 students. |
||
|
Acquis d'apprentissage (objectifs d'apprentissage) du cours :
|
||
|
||
|
Savoirs et compétences prérequis :
|
||
| Any course in graduate level mathematics (vector algebra, probability/statistics)
Programming experiene Support will be offered to students
|
||
|
Activités d'apprentissage prévues et méthodes d'enseignement :
|
||
| The course carries 5 credits and therefore requires 150 hours of work (1 credit = 30 hours).
The course will be organized over 10 sessions of 3 hours
|
||
|
Mode d'enseignement (présentiel ; enseignement à distance) :
|
||
|
||
|
Lectures recommandées ou obligatoires et notes de cours :
|
||
Textbook for the course
|
||
|
Modalités d'évaluation et critères :
|
||
| Final written exam: 55%
Final practical project: 35% Practical Exercise: 10% |
||
|
Stage(s) :
|
||
|
Remarques organisationnelles :
|
||
|
Contacts :
|
||
| Dr Ashwin Ittoo
ashwin.ittoo@ulg.ac.be |
||
|
Notes en ligne :
|
||
![]() | Lecture Notes Lecture Notes |
|