Artificial Intelligence

Academic Year 2023/2024 - Teacher: Daniela GIORDANO

Expected Learning Outcomes

1) Knowledge and comprehension: knowledge of the classic Artificial Intelligence (AI) paradigm and of the Machine Learning paradigm for intelligent systems development, understanding of their strengths and weaknesses; knowledge of the operating logic of the main machine learning algorithms (supervised and unsupervised), of deep neural networks and of the recent neural architectures on which the current "Large Language Models" are based.

2) Ability to apply knowledge and comprehension: students will be able to autonomously prototype and validate systems that learn from data, with particular reference to textual data and Natural Language Processing (NLP) and Natural Language Understanding (NLU) tasks. They will be able to carry out the training of a model from scratch, the "fine-tuning" of a pre-trained model, choosing the appropriate metrics for performance evaluation, according to the task to be learned.

3) Autonomy of judgment: students will be able to autonomously evaluate which problems can be solved with machine learning techniques based on the available data; select the most appropriate ones among the available models and architectures for solving the task, and critically interpret the results obtained.

4) Communicative abilities: students will be able to document the problem faced, the characteristics of the dataset, the configuration of the model, and the results obtained, clearly and with proper language. They will also be able to argue their design choices.

5) Learning abilities: starting from the notions learned in the course, students will be able to understand new applications of AI by reading the reference literature in the Digital Humanities sector and to independently try their hand at learning new models and new software libraries

Course Structure

Lectures, practical labs, case study discussion.

Attendance of Lessons

Compulsory attendance.

Detailed Course Content

“Classic” Artificial Intelligence: Definition of intelligent autonomous agent in relation to the environment; Architecture of a cognitive system (perception, memory, reasoning, action, metacognition); Knowledge representation techniques; Types of reasoning: deductive and inductive, analogical, case-based, probabilistic. Uncertainty and inference.
Machine learning: Classification, regression and prediction. Supervised learning (Linear classifiers, K-Nearest Neighbours). Unsupervised learning (clustering). The "deep learning": Convolutional neural networks, outline of recurrent neural networks. Encoder-decoder architectures: the Transformers. BERT-based, GPT-based and BART-based models. Methodology for model creation, training and testing (overfitting and underfitting, dataset split in training, validation and testing, Loss functions, Cross-entropy, Softmax, model evaluation metrics; cross-validation). Self-supervised models. Fine-tuning of pre-trained models. Zero-shot, One-shot, Few-shot learning. Prompt Engineering.
The PyTORCH library for deep learning and the Hugging Face environment. Limits of models (inspectability, comprehensibility, bias in datasets)
Applications: NLP and NLU, Recommender system, Sentiment analysis, Conversational agents (Chatbot)

Textbook Information

For “classic” AI:

Selected chapters from S. Russell, P. Norvig, Artificial intelligence: a modern approach, Upper Saddle River, Pearson, 2020 (4th edition), chapter 1 (pp. 29-39, 63-93), chapter 2 (pp. 95-140), chapter  3(pp. 147-179, 189-199), chapter 24 (pp. 856-878).

 

For machine learning:

- A.W. Trask, Grokking Deep Learning, New York, Manning Publications, 2019 (pp 290)

- L. Tunstall, et alii, Natural Language Processing with Transformers: Building Language Applications with Hugging Face, O’Reilly Media, 2022 (pp. 370)

 

Technical documentation on the machine learning libraries used for the project,  available at HuggingFace site (https://huggingface.co/docs)

 

Please remember that in compliance with art 171 L22.04.1941, n. 633 and its amendments, it is illegal to copy entire books or journals, only 15% of their content can be copied.

For further information on sanctions and regulations concerning photocopying please refer to the regulations on copyright (Linee Guida sulla Gestione dei Diritti d’Autore) provided by AIDRO - Associazione Italiana per i Diritti di Riproduzione delle opere dell’ingegno (the Italian Association on Copyright).

All the books listed in the programs can be consulted in the Library.

VERSIONE IN ITALIANO