COMPUTATIONAL LINGUISTIC MATERIAL FOR VIETNAMESE SPEECH PROCESSING: APPLYING IN VIETNAMESE TEXT-TO-SPEECH

Đồng Văn Phạm

Abstract


This paper proposes a set of high-quality linguistic materials for Vietnamese speech processing that can be used for both Vietnamese text-to-speech (TTS) and automatic speech recognition (ASR). The proposed materials comprise: (1) a pronunciation dictionary adapted from X-SAMPA, and (2) a rule-based grapheme-to-phoneme (G2P) converter for Vietnamese. To test and evaluate them, we built a Vietnamese TTS system on the Merlin engine using these materials and evaluated both the quality of the synthesized speech and the accuracy of its pronunciation. The results indicate that the materials are well suited to further research and development in Vietnamese speech processing.
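To make the idea of a rule-based grapheme-to-phoneme step concrete, the following is a minimal sketch in Python. It is not the rule set or dictionary proposed in the paper: the onset table `ONSET_RULES` is a small, hypothetical, incomplete mapping from Vietnamese onset graphemes to X-SAMPA symbols, and the function names are chosen only for illustration. The sketch separates the tone mark from a syllable using Unicode normalization, then applies longest-match rules to the onset and leaves the rime untouched.

```python
import unicodedata

# Combining marks that encode Vietnamese tones; the level tone "ngang" is unmarked.
TONE_MARKS = {
    "\u0300": "huyen",  # grave accent
    "\u0301": "sac",    # acute accent
    "\u0303": "nga",    # tilde
    "\u0309": "hoi",    # hook above
    "\u0323": "nang",   # dot below
}

# Hypothetical onset rules (grapheme -> X-SAMPA). Illustrative and incomplete;
# these are assumptions, not the paper's actual rule set.
ONSET_RULES = {
    "ngh": "N",
    "ng": "N",
    "nh": "J",
    "th": "t_h",
    "ph": "f",
    "kh": "x",
    "\u0111": "d",  # the letter d with stroke
}


def split_tone(syllable):
    """Return (syllable without its tone mark, tone name)."""
    decomposed = unicodedata.normalize("NFD", syllable.lower())
    tone = "ngang"
    kept = []
    for ch in decomposed:
        if ch in TONE_MARKS:
            tone = TONE_MARKS[ch]
        else:
            kept.append(ch)  # vowel-quality marks (circumflex, breve, horn) stay
    return unicodedata.normalize("NFC", "".join(kept)), tone


def split_onset(base):
    """Apply the longest matching onset rule; return (X-SAMPA onset, remainder)."""
    for grapheme in sorted(ONSET_RULES, key=len, reverse=True):
        if base.startswith(grapheme):
            return ONSET_RULES[grapheme], base[len(grapheme):]
    return "", base  # no rule matched: leave the syllable as written


def analyze(syllable):
    """Return (onset in X-SAMPA, untranscribed remainder, tone name)."""
    base, tone = split_tone(syllable)
    onset, rest = split_onset(base)
    return onset, rest, tone
```

Trying longer graphemes first ensures that a trigraph such as "ngh" is not mistakenly consumed by the shorter rule for "ng". A complete converter would also need rules for the remaining onsets, the glides, nuclei and codas, and dialect-specific choices, none of which are attempted here.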

Keywords


Text-to-speech, Dictionary, Grapheme-to-Phoneme, X-SAMPA, computer coding, speech processing, Vietnamese





DOI: https://doi.org/10.26483/ijarcs.v13i6.6935

Copyright (c) 2023 International Journal of Advanced Research in Computer Science