publications

check out my Google Scholar for more up-to-date works

2026

  1. LREC
    The Limits of Data Scaling: Sub-token Utilization and Acoustic Saturation in Multilingual ASR
    Siyu Liang, Nicolas Ballier, Gina-Anne Levow, and 1 more author
    In Proceedings of the 15th International Conference on Language Resources and Evaluation (LREC 2026), 2026
  2. LREC
    A Sociophonetic Analysis of Racial Bias in Commercial ASR Systems Using the Pacific Northwest English Corpus
    Michael Scott, Siyu Liang, Alicia Wassink, and 1 more author
    In Proceedings of the 15th International Conference on Language Resources and Evaluation (LREC 2026), 2026
  3. EACL-FieldMatters
    Hybrid Neural-LLM Pipeline for Morphological Glossing in Endangered Language Documentation: A Case Study of Jungar Tuvan
    Siyu Liang, Talant Mawkanuli, and Gina-Anne Levow
    In Proceedings of the 5th Workshop on NLP Applications to Field Linguistics (FieldMatters), 2026
  4. EACL-LChange
    The Tonogenesis Continuum in Tibetan: A Computational Investigation
    Siyu Liang and Zhaxi Zerong
    In Proceedings of the 6th International Workshop on Computational Approaches to Language Change (LChange), 2026

2025

  1. EMNLP
    Beyond WER: Probing Whisper’s Sub-token Decoder Across Diverse Language Resource Levels
    Siyu Liang, Nicolas Ballier, Gina-Anne Levow, and 1 more author
    In Proceedings of the 2025 Conference on Empirical Methods in Natural Language Processing, 2025
    🏆 SAC Highlight Award
  2. ACL-FieldMatters
    Breaking the Transcription Bottleneck: Fine-tuning ASR Models for Extremely Low-Resource Fieldwork Languages
    Siyu Liang and Gina-Anne Levow
    In Proceedings of the 4th Workshop on NLP Applications to Field Linguistics (FieldMatters), 2025
  3. ACL-SIGTYP
    Tone in Perspective: A Computational Typological Analysis of Tone Function in ASR
    Siyu Liang and Gina-Anne Levow
    In Proceedings of the 7th Workshop on Research in Computational Linguistic Typology and Multilingual NLP (SIGTYP), 2025

2020

  1. PLC
    Documenting Eynu: A Case Study of Language Contact
    Siyu Liang
    In Proceedings of the 43rd Annual Penn Linguistics Conference, 2020