Yi Xu
Professor of Speech Sciences

Speech Hearing and Phonetic Sciences, Linguistics,
Division of Psychology and Language Sciences, University College London, UK

Haskins Laboratories, USA

My research is primarily concerned with the basic mechanisms of speech production and perception in connected discourse, especially in terms of how multiple layers of communicative meanings can be encoded through a common process of articulation. In particular, I am interested in the production, perception, typology, and modelling and synthesis of speech prosody, as well as the basic mechanisms of coarticulation. I am also concerned with computational modeling of the neural process of speech acquisition. More recently, I have developed an interest in the understanding of emotional expressions in speech from an evolutionary perspective. For more information, please see my Research page.


Common Prosody Platform (CPP) -- Where theories and model of prosody can be directly compared

ProsodyPro is now emotion-friendly: New emotion-relevant measurements

15 new measurements directly relevant to emotional prosody

PENTAtrainer2 -- Analysis and synthesis of intonation via computational modeling

FormantPro -- A Praat script for large-scale systematic analysis of continuous formant movements


Call for international collaborations: Cross-linguistic distribution of Post-Focus Compression (PFC) and its hisotircal origin.



  • Birkholz, P., Martin, L., Xu, Y., Scherbaum, S. and Neuschaefer-Rube, C. (2017). Manipulation of the prosodic features of vocal tract length, nasality and articulatory precision using articulatory synthesis. Computer Speech & Language 41:116-127. Accepted version

  • Xu, Y., Lee, A., Prom-on, S. & Liu, F. (2015). Explaining the PENTA model: A reply to Arvaniti and Ladd. Phonology 32: 505-535. Accepted version

  • Xu, Y. (2015). Speech Prosody -- Theories, models and analysis. To appear in Courses on Speech Prosody. A. R. Meireles (ed.): Cambridge Scholars Publishing. pp. 146-177. Author version

  • Xu, Y. and Prom-on, S. (2014). Toward invariant functional representations of variable surface fundamental frequency contours: Synthesizing speech melody via model-based stochastic learning. Speech Communication 57, 181-208. On-line access

  • Prom-on, S., Birkholz, P. and Xu, Y. (2014) Identifying underlying articulatory targets of Thai vowels from acoustic data based on an analysis-by-synthesis approach, EURASIP Journal on Audio, Speech, and Music Processing 23. doi:10.1186/1687-4722-2014-23.

Research tools


Chandler House, 2 Wakefield Street, University College London, London WC1N IPF, UK