Siyu Liang

💼 3940 Benton Ln NE, Room 407, Seattle, WA, USA


📷 in Kiama, Australia

August 2025

Photo credit: @Zhenfa

I am a fifth-year PhD candidate in Computational Linguistics at the University of Washington, advised by Drs. Gina-Anne Levow and Richard Wright. My research develops computational methods for multilingual and under-resourced speech grounded in linguistic theory and fairness.

I study how large speech language models represent and generalize across phonetic and structural variation, and design methods to evaluate and improve their performance across resource levels and dialectal variation. As a linguist, I am particularly interested in studying speech variation through both lab phonology and corpus-based approaches. I am also interested in how computational approaches can be applied to linguistic analysis and documentation, and I have worked on Iranian, Tibeto-Burman, Turkic, Nilotic, and areally Southeast Asian languages. I actively conduct fieldwork and collaborate with field linguists to build models and resources in real-world documentation settings.

When I'm not working, you can find me on red-eye flights ✈️, at concerts 🎶, in the mountains 🏞️, and occasionally up the wall 🧗 or under the sea 🤿.

You can reach me at liangsy [@] uw [dot] edu.

news

Feb 12, 2026 2 papers (on sociophonetic analysis of ASR bias and tokenization in multilingual ASR) accepted to LREC 2026.
Jan 23, 2026 2 workshop papers (on automatic IGT for Jungar Tuvan and tonogenesis in Tibetan) accepted to FieldMatters and LChange at EACL 2026. More details forthcoming!
Nov 08, 2025 🧨 Our paper on sub-token level disparity in multilingual speech recognition models was awarded an SAC Highlight at EMNLP 2025!