Workshop, 11 February 2026

Dear colleagues and students,

you are invited to take part in the online workshop (poster attached) “Digital Tools and Corpus-Based Approaches in (Arabic) Sociolinguistics: Methods, Challenges, and Cross-Disciplinary Insights”, organized as part of the SABIRANET project (CUP E63C24001920006; ID SOE2024_0000078).

The workshop will be held online on Wednesday, 11 February 2026, from 9:00 to 12:45.

Link:

Join: https://teams.microsoft.com/meet/39301106414845?p=X2ktWiTNkJYRyP81SK

Meeting ID: 393 011 064 148 45

Passcode: 2TM3yR27

A brief description of the workshop follows.

Digital Tools and Corpus-Based Approaches in (Arabic) Sociolinguistics: Methods, Challenges, and Cross-Disciplinary Insights

Workshop Description

This workshop is intended for students and researchers interested in the intersection of sociolinguistics, corpus-based analysis, and digital tools, with a particular focus on Arabic and its varieties. Held online on February 11, 2026, the workshop aims to bring together scholars from diverse disciplinary backgrounds to reflect on methodological challenges and opportunities in the corpus-based analysis of Arabic sociolinguistic data. While Arabic and its many varieties (Standard and non-standard, spoken and written) are central to the discussion, the workshop also seeks to foster a cross-disciplinary dialogue with researchers who, though not necessarily specialists in Arabic, work with digital tools and corpus methods in sociolinguistics and related fields.

Arabic presents a particularly rich and complex terrain for sociolinguistic inquiry, given its diglossic structure, regional variation, and the interplay of spoken and written forms, often shaped by multilingualism and language contact. These features, combined with the increasing availability of digital data and tools, raise important methodological questions for how we collect, process, and analyze linguistic data.

Participants are invited to explore both practical and theoretical questions related to the use of IT tools, natural language processing, and corpus-based methodologies in sociolinguistic research, with a particular (but not exclusive) focus on Arabic.

Contributions may address, among others, the following topics:

  • Designing and processing corpora involving multiple varieties of Arabic (e.g., Standard, dialectal, mixed-language)
  • Methodological strategies for dealing with mixed data (written/oral, standard/non-standard, monolingual/multilingual)
  • Using corpora to analyze stylistic variation, register shifts, and genre diversity
  • Challenges in annotating and tagging sociolinguistically relevant features in under-resourced languages or dialects
  • Applications of NLP tools to Arabic and implications for sociolinguistic interpretation
  • Visualization and quantification of sociolinguistic phenomena through corpus data
  • Reflections on tool selection, customization, or development in light of specific linguistic and sociolinguistic questions
  • Comparative methodological perspectives: what can be learned from working across different languages and data types?
  • Theoretical implications of corpus-based approaches for analyzing variation, identity, and language practices

Each presentation will be allocated 20 minutes, followed by 10 minutes for discussion.

 

Organizing Committee: Rosa Pennisi (University of Catania)

This workshop is part of the activities of the SABIRANET Project (CUP E63C24001920006; ID SOE2024_0000078), funded by the European Union – NextGenerationEU under Italy’s National Recovery and Resilience Plan (PNRR), Young Researcher 2024 – SoE line, administered by the Italian Ministry of University and Research (MUR).


Publication date: 9 February 2026
