Print Email Facebook Twitter Evaluating the performance of TDNN-BLSTM on Mandarin read and spontaneous speech Title Evaluating the performance of TDNN-BLSTM on Mandarin read and spontaneous speech Author Chiroşca, Mihail (TU Delft Electrical Engineering, Mathematics and Computer Science) Contributor Feng, S. (mentor) Scharenborg, O.E. (mentor) Jonker, C.M. (graduation committee) Degree granting institution Delft University of Technology Programme Computer Science and Engineering Project CSE3000 Research Project Date 2021-07-01 Abstract A limitation of current ASR systems is the so-called out-of-vocabulary words. The solution to overcome this limitation is to use APR systems. Previous research on Dutch APR systems identified Time Delayed Bidirectional Long-Short Term Memory Neural Network (TDNN-BLSTM) as one of best performing state-of-the-art NN architecture for PR. The goal of this research is to evaluate the performance of the TDNN-BLSTM architecture for phoneme recognition on Mandarin read and spontaneous speech, analyze the differences in performance for the two speech styles as well as compare the results with previous research on Dutch PR.To achieve this goal 4 different NN models of the TDNN-BLSTM architecture were built and trained on Mandarin read and spontaneous speech. The test results of the NN models were used to calculate the phoneme error rate (PER), decomposed PER, and the contribution of individual phonemes to the overall PER. Based on these findings, conclusions are formulated regarding the impact of different languages, speech styles, and the architectural changes on the performance of the TDNN-BLSTM architecture. Subject Phoneme RecognitionSpeech recognitionNeural Networks To reference this document use: http://resolver.tudelft.nl/uuid:dd65b686-0acc-46de-a28f-456ac9aecf32 Part of collection Student theses Document type bachelor thesis Rights © 2021 Mihail Chiroşca Files PDF Evaluating_the_performanc ... ech_3_.pdf 345.39 KB Close viewer /islandora/object/uuid:dd65b686-0acc-46de-a28f-456ac9aecf32/datastream/OBJ/view