[Image credit for Proto-Sinaitic ‘alp 𐤀 used in logo: here (CC-BY-2.5, Author: Pmx)]
Second Workshop on Computation and Written Language (CAWL 2024), to be held in conjunction with LREC-COLING 2024
9:00-9:10 | Organizers | Opening remarks |
9:10-10:10 | Invited speaker: Nizar Habash | On Writing Arabic |
10:10-10:30 | Rayyan Merchant & Kevin Tang | ParsText: A Digraphic Corpus for Tajik-Farsi Transliteration |
10:30-11:00 | Coffee Break | |
11:00-11:30 | Invited talk: Jalal Maleki | Balancing Linguistic Integrity and Practicality: The Design Journey of Dabire, a Romanized Writing System for Persian |
11:30-12:00 | Wieke Harmsen, Catia Cucchiarini, Roeland van Hout & Helmer Strik | A Joint Approach for Automatic Analysis of Reading and Writing Errors |
12:00-12:20 | Luna Peck & Susan Brown | Tool for Constructing a Large-Scale Corpus of Code Comments and Other Source Code Annotations |
12:20-2:00 | Lunch break | |
2:00-2:30 | Rastislav Hronsky & Emmanuel Keuleers | Tokenization via Language Modeling: the Role of Preceding Text |
2:30-2:50 | Kyle Gorman & Brian Roark | Abbreviation across the world's languages and scripts |
2:50-3:20 | Daan van Esch | Now You See Me, Now You Don't: ‘Poverty of the Stimulus’ Problems and Arbitrary Correspondences in End-to-End Speech Models |
3:20-3:40 | Logan Born, M. Willis Monroe, Kathryn Kelley & Anoop Sarkar | Towards Fast Cognate Alignment on Imbalanced Data |
3:40-4:00 | Organizers | SIGWrit business meeting |
4:00-4:30 | Coffee Break | |
4:30-4:50 | Yixia Wang & Emmanuel Keuleers | Simplified Chinese Character Distance Based on Ideographic Description Sequences |
4:50-5:00 | Organizers | Closing remarks, discussion |
The 2024 CAWL workshop is supported by Google.