My research is primarily concerned with the basic mechanisms of speech production and perception in connected discourse, especially in terms of how multiple layers of communicative meanings can be encoded through a common process of articulation. In particular, I am interested in the production, perception, typology, and modelling and synthesis of speech prosody, as well as the basic mechanisms of coarticulation. I am also concerned with computational modeling of the neural process of speech acquisition. More recently, I have developed an interest in the understanding of emotional expressions in speech from an evolutionary perspective. For more information, please see my Research page.
Birkholz, P., Martin, L., Xu, Y., Scherbaum, S. and Neuschaefer-Rube, C. (2017). Manipulation of the prosodic features of vocal tract length, nasality and articulatory precision using articulatory synthesis. Computer Speech & Language 41:116-127. Accepted version
Xu, Y., Lee, A., Prom-on, S. & Liu, F. (2015). Explaining the PENTA model: A reply to Arvaniti and Ladd. Phonology 32: 505-535. Accepted version
Xu, Y. (2015). Speech Prosody -- Theories, models and analysis. To appear in Courses on Speech Prosody. A. R. Meireles (ed.): Cambridge Scholars Publishing. pp. 146-177. Author version
Xu, Y. and Prom-on, S. (2014). Toward invariant functional representations of variable surface fundamental frequency contours: Synthesizing speech melody via model-based stochastic learning. Speech Communication 57, 181-208. On-line access
Prom-on, S., Birkholz, P. and Xu, Y. (2014) Identifying underlying articulatory targets of Thai vowels from acoustic data based on an analysis-by-synthesis approach, EURASIP Journal on Audio, Speech, and Music Processing 23. doi:10.1186/1687-4722-2014-23.