
Ph.D. in Computer and Control Engineering (Ingegneria Informatica e dei Sistemi), 37th cycle (2021-2024)
Ph.D. obtained in 2025
Dissertation:
Exploring the Use of Deep Models to Analyze Data in Multimodal Scenarios
Tutors:
Luca Cagliero, Paolo Garza
Research topic
Exploring the use of Deep Natural Language Processing models to analyze documents in cross-lingual and multi-domain scenarios
Biography
My PhD focuses on multimodal learning. I am particularly interested in analyzing social media content and visually rich documents.
Beyond my core expertise in vision-language problems, I am also interested in the audio and video domains, which lets me work on a broader range of multimodal problems.
Teaching
Master of Science
- Data science lab: process and methods. Academic year 2021/22, Data Science and Engineering. Course collaborator
- Data science lab: process and methods. Academic year 2022/23, Data Science and Engineering. Course collaborator
- Data science lab: process and methods. Academic year 2023/24, Data Science and Engineering. Course collaborator
- Data science lab: process and methods. Academic year 2024/25, Data Science and Engineering. Course collaborator
- Deep natural language processing. Academic year 2023/24, Data Science and Engineering. Course collaborator
- Deep natural language processing. Academic year 2024/25, Data Science and Engineering. Course collaborator
Bachelor of Science
- Dati, algoritmi e le frontiere dell'informatica - Intraprendenti (Data, algorithms and the frontiers of computer science). Academic year 2024/25, Aerospace Engineering. Course collaborator
- Basi di dati (Databases). Academic year 2021/22, Computer Engineering. Course collaborator
- Basi di dati (Databases). Academic year 2022/23, Computer Engineering. Course collaborator
Research
Publications
Works published during the Ph.D.
- Vaiani, Lorenzo (2025)
Exploring the Use of Deep Models to Analyze Data in Multimodal Scenarios. Supervisors: Cagliero, Luca; Garza, Paolo. 37th cycle (XXXVII Ciclo), 121 pp.
Doctoral thesis
- Gallipoli, Giuseppe; Papicchio, Simone; Vaiani, Lorenzo; Cagliero, Luca; Miola, Arianna; ... (2024)
Keyword-based Annotation of Visually-Rich Document Content for Trend and Risk Analysis using Large Language Models. In: The Joint Workshop of the 7th Financial Technology and Natural Language Processing (FinNLP), the 5th Knowledge Discovery from Unstructured Data in Financial Services (KDF), and the 4th Economics and Natural Language Processing (ECONLP) Workshop (FinNLP-KD, Turin (ITA), 20 May, 2024, pp. 130-136
Conference proceedings paper
- La Quatra, Moreno; Koudounas, Alkis; Vaiani, Lorenzo; Baralis, Elena; Cagliero, Luca; ... (2024)
Benchmarking Representations for Speech, Music, and Acoustic Events. In: 2024 IEEE International Conference on Acoustics, Speech, and Signal Processing Workshops (ICASSPW), Seoul (KOR), 14-19 April 2024, pp. 505-509. ISBN: 979-8-3503-7451-3
Conference proceedings paper
- Vaiani, Lorenzo; Cagliero, Luca; Garza, Paolo (2024)
Emotion Recognition from Videos Using Multimodal Large Language Models. In: Future Internet, vol. 16. ISSN 1999-5903
Journal article
- Benedetto, Irene; Koudounas, Alkis; Vaiani, Lorenzo; Pastor, Eliana; Cagliero, Luca; ... (2024)
MAINDZ at SemEval-2024 Task 5: CLUEDO-Choosing Legal oUtcome by Explaining Decision through Oversight. In: SemEval-2024 (Workshop of ACL), Mexico City (MEX), 20-21 June, 2024, pp. 997-1005
Conference proceedings paper
- Ding, Yihao; Vaiani, Lorenzo; Han, Caren; Lee, Jean; Garza, Paolo; Poon, Josiah; ... (2024)
3MVRD: Multimodal Multi-task Multi-teacher Visually-Rich Form Document Understanding. In: Association for Computational Linguistics 2024, Bangkok, Thailand and virtual meeting, August 11-16, 2024, pp. 15233-15244
Conference proceedings paper
- D'Amico, Lorenzo; Napolitano, Davide; Vaiani, Lorenzo; Cagliero, Luca (2023)
PoliTo at MULTI-Fake-DetectiVE: Improving FND-CLIP for Multimodal Italian Fake News Detection. In: EVALITA 2023, Parma, Italy, September 7-8, 2023. ISSN 1613-0073
Conference proceedings paper
- Vaiani, Lorenzo; Cagliero, Luca; Garza, Paolo (2023)
PoliTo at SemEval-2023 Task 1: CLIP-based Visual-Word Sense Disambiguation Based on Back-Translation. In: SemEval-2023 (Workshop of ACL), Toronto (CAN), July 9–14, 2023, pp. 1447-1453
Conference proceedings paper
- Koudounas, Alkis; La Quatra, Moreno; Vaiani, Lorenzo; Colomba, Luca; Attanasio, ... (2023)
ITALIC: An Italian Intent Classification Dataset. In: INTERSPEECH 2023, Dublin (Ireland), 20 August - 24 August 2023, pp. 2153-2157
Conference proceedings paper
- Napolitano, Davide; Vaiani, Lorenzo; Cagliero, Luca (2023)
Learning Confidence Intervals for Feature Importance: A Fast Shapley-based Approach. In: Data Analytics solutions for Real-LIfe APplications (DARLI-AP), Ioannina (Greece), March 28-31, 2023. ISSN 1613-0073
Conference proceedings paper
- Benedetto, Irene; Koudounas, Alkis; Vaiani, Lorenzo; Pastor, Eliana; Baralis, Elena; ... (2023)
PoliToHFI at SemEval-2023 Task 6: Leveraging Entity-Aware and Hierarchical Transformers For Legal Entity Recognition and Court Judgment Prediction. In: SemEval-2023 (Workshop of ACL), Toronto (CAN), July 9–14, 2023, pp. 1401-1411
Conference proceedings paper
- Morra, Lia; Azzari, Alberto; Bergamasco, Letizia; Braga, Marco; Capogrosso, Luigi; ... (2023)
Designing Logic Tensor Networks for Visual Sudoku puzzle classification. In: 17th International Workshop on Neural-Symbolic Learning and Reasoning (NeSy 2023), Certosa di Pontignano, Siena (Italy), July 3-5, 2023, pp. 223-232. ISSN 1613-0073
Conference proceedings paper
- Ravagli, Jason; Vaiani, Lorenzo (2022)
JRLV at SemEval-2022 Task 5: The Importance of Visual Elements for Misogyny Identification in Memes. In: International Workshop on Semantic Evaluation (SemEval-2022), Seattle (USA), July 10–15, 2022, pp. 610-617
Conference proceedings paper
- Vaiani, Lorenzo; La Quatra, Moreno; Cagliero, Luca; Garza, Paolo (2022)
ViPER: Video-based Perceiver for Emotion Recognition. In: Multimodal Sentiment Analysis Challenge (MuSe 2022), Lisbon (PT), October 10-15, 2022, pp. 67-73
Conference proceedings paper
- Vaiani, Lorenzo; Koudounas, Alkis; La Quatra, Moreno; Cagliero, Luca; Garza, Paolo; ... (2022)
How Much Attention Should we Pay to Mosquitoes? In: Computational Paralinguistics ChallengE 2022 (ComParE 2022), Lisbon (PT), October 10-14, 2022, pp. 7135-7139. ISBN: 978-1-4503-9203-7
Conference proceedings paper
- Vaiani, Lorenzo; La Quatra, Moreno; Cagliero, Luca; Garza, Paolo (2022)
Leveraging multimodal content for podcast summarization. In: ACM/SIGAPP Symposium on Applied Computing, Virtual, Online, April 25-29, 2022, pp. 863-870
Conference proceedings paper
- Vaiani, Lorenzo; Koudounas, Alkis; La Quatra, Moreno; Cagliero, Luca; Garza, Paolo; ... (2022)
Transformer-based Non-Verbal Emotion Recognition: Exploring Model Portability across Speakers’ Genders. In: Multimodal Sentiment Analysis Challenge (MuSe 2022), Lisbon (PT), October 10, 2022, pp. 89-94
Conference proceedings paper