Program

Abjad 2026 Programme

Workshop date Saturday 28 March 2026
Participation Hybrid (in-person and online)
Venue Palais Des Congres (Room: SALLE Les Riad)
Workshop duration 9:00 – 17:45

 

Time Mode Presentation
09:00–09:15 Welcome and overview of the shared tasks
Mo El-Haj, Saad Ezzini, Ahmad Abdelali, Shadi Abudalfa
09:15–10:00 Keynote: Dr. Violetta Cavalli-Sforza
Session 1: Benchmarking
10:00–10:15 Online AraLingBench: A Human-Annotated Benchmark for Evaluating Arabic Linguistic Capabilities of Large Language Models
Mohamad Bilal Zbib, Hasan Abed Al, Kader Hammoud, Ammar Mohanna, Nadine Rizk, Fatima Karnib, Sina Moukaled, Bernard Ghanem
10:15–10:30 Online U-MIRAGE: Benchmarking Chain-of-Thought Reasoning for Urdu Medical QA
Ali Faheem, Faizad Ullah, Muhammad Hammad, Ahmed Hassan, Muhammad Sohaib Ayub, Asim Karim
10:30–11:00 Coffee break
Session 2: Morphology
11:00–11:15 Online AjamiMorph: Zero-Annotation Morphological Discovery for Hausa Ajami via Multi-Method Consensus
Soumedhik Bharati, Shibam Mandal, Prithwish Ghosh, Swarup Kr Ghosh, Sayani Mondal
11:15–11:30 Online Morphological Feature Extraction for Fine-Grained Sorani Kurdish Dialect Identification
Soumedhik Bharati, Shibam Mandal, Subham Majumdar, Swarup Kr Ghosh, Sayani Mondal
11:30–11:45 In-person Murabaa: A Comprehensive Resource Platform for Arabic Morphology
Karim Bouzoubaa, Driss Namly, Hamid Jihad, Rachida Tajmout, Jamal Ezzouaine, Hakima Khamar
11:45–12:00 In-person QAMAR: A Fully Verified Quranic Arabic Morphological Analysis Resource
Sara Faqihi, Karim Bouzoubaa, Rachida Tajmout, Driss Namly
Session 3: Sentiment
12:00–12:15 In-person Rethinking Polarity Detection: When BPE Fails Across Scripts
Manodyna K H, Luc De Nardi
12:15–12:30 Online Enhancing Urdu Sentiment Classification through Instruction-Tuned LLMs
Hasan Faraz Khan, Noor Fatima, Irfan Ahmad
12:30–12:45 Online Improving Models for Sentiment Analysis on Saudi-English Code-Switching Text
Samaher Alghamdi, Paul Rayson, Reem Alotibi
12:45–13:00 In-person Reliability-Guided QUBO Selection for Arabic Sentiment Prediction
Rabab Alkhalifa
13:00–14:00 Lunch break
Session 4: Character set issues
14:00–14:15 Online KazakhOCR: Multimodal Benchmark for Low-Resource Kazakh Script OCR
Henry Gagnier, Sophie Gagnier, Ashwin Kirubakaran
14:15–14:30 Online Character-Level Transformer for Tajik–Persian Transliteration
Arabov Mullosharaf Kurbonovich
14:30–14:45 Online Orthographic Robustness of Persian Named Entity Recognition Models
Henry Gagnier, Sophie Gagnier
14:45–15:00 In-person Code-Switching as a Safety Failure Mode in Large Language Models
Waleed Jamil, Saima Rafi
15:00–15:15 Online AraModernBERT: Transtokenized Initialization and Long-Context Encoder Modeling for Arabic
Omar Elshehy, Omer Nacar, Abdelbasset Djamai, Muhammed Ragab, Khloud Al Jallad, Mona Abdelazim
15:15–15:30 Online From Classical to Contemporary: Evolutionary Analysis & Classification of Urdu Poetry
Noor Fatima, Hasan Faraz Khan, Irfan Ahmad
15:30–16:30 Poster session (coffee break 15:30–16:00) <<<See list of accepted posters below>>>
Session 5: Speech and toxicity
16:30–16:45 In-person Current State of LLMs for Arabic Dialectal Machine Translation
Josef Jon, Rawan Bondok, Ondřej Bojar
16:45–17:00 In-person LLM-to-Speech: A Synthetic Data Pipeline for Training Dialectal Text-to-Speech Models
Ahmed Khamis, Hesham Ali Ahmed
17:00–17:15 Online Parameter-Efficient Adaptation of Self-Supervised Models for Arabic Speech Recognition
Wafa Mohammed Alshehri, Wasfi G. Al-Khatib, Mohammad Ismail Amro
17:15–17:30 Online Optimizer Choice and Calibration for QARiB on Arabic-Script Social Media Offensive Language Detection
Auda Elshokry, Mohammed Alhanjouri
17:30–17:45 Online HACS-TL: Cross-Script Transfer Learning for Hausa Ajami Hate Speech Detection Using Transformer-Based Architecture
Abdulkadir Shehu Bichi, Muqaddar Ali, Prashant Sharma, Ismail Dauda Abubakar

 

List of accepted posters

Paper title Authors
Tarab: A Multi-Dialect Corpus of Arabic Lyrics and Poetry Mo El-Haj
Seeing Words Differently: Visual Embeddings for Robust English-Arabic Machine Translation Mahdi Alshaikh Saleh and Irfan Ahmad
QurSci-Onto: A Hierarchical Ontology and Dataset for Scientific Exegesis in the Quran Ibad-ur-Rehman Rashid, Junaid Hussain and Sadam Al-Azani
OMAN-SPEECH: A Multi-Layer Annotated Speech Corpus for Omani Arabic Dialects Rayyan S. Al Khadhuri, Firas Al Mahrouqi, Salim Al Mandhari, Amir Azad Al-Kathiri, Omar Said Alshahri, Ghassab Mansoor Alsaqr, Badri Abdulhakim Mudhsh and Tarek Fatnassi
Hala Technical Report Building Arabic-Centric Instruction & Translation Models at Scale Hasan Abed Al Kader Hammoud, Mohamad Bilal Zbib and Bernard Ghanem
From Posts to Pressure: An Arabic Dataset about Stress and Mental-Health Monitoring Wajdi Zaghouani, Eman Sedqy Shlkamy and Mabrouka Bessghaier
DeformAR: A Visual Analytics Framework for Evaluation of Arabic Named Entity Recognition Ahmed Mustafa Younes
Back-of-the-Book Index Automation for Arabic Documents Nawal Haidar, Ahmad Kashmar and Fadi Zaraket
ArabicDialectHub: A Cross-Dialectal Arabic Learning Resource and Platform Salem Lahlou
Arabic-Adapted One-Step Speech-to-Diacritized ASR: Evaluation and Error Analysis Osamah A. I. Abduljalil, Dalal Ali, Razan A. Bajaman and Abdullah I. Alharbi
Arabic Dialect Translation with Small LLMs: Enhancing through Reasoning-Oriented Reinforcement Learning Sohaila Abdulsattar and Keith Ross
Arabic Citation Parsing using Part of Speech and Named Entity Recognition Youssef Karout, Hadi Hammoud and Fadi Zaraket
Alkhalil Corpus: An Open-Source Thematic and Lemmatized Corpus for Modern Standard Arabic Samir Belayachi and Azzeddine Mazroui
A Knowledge Graph Based Diagnostic Framework for Analyzing Hallucinations in Arabic Machine Reading Comprehension Najwa Abdullah AlGhamdi, Sadam Al-Azani, Kwabena Nuamah and Alan Bundy
A Hybrid Confidence-Aware Framework for Arabic Toxicity Detection in Social Media Fawzia Zaal Alanazi, Asma Mohammed Alamri, Arwa Bin Saleh and Abdullah I. Alharbi
A Corpus-Based Investigation of Contemporary Arabic Dialects Using the SADA Corpus Ghada Alfattni