Implementation of a multispeaker text-to-speech synthesis on a server

Akhmetzhanov, Nursultan

Implementation of a multispeaker text-to-speech synthesis on a server

Akhmetzhanov, Nursultan (2021)

Avaa tiedosto

Akhmetzhanov_Nursultan.pdf (582.9Kt)

Lataukset:

Akhmetzhanov, Nursultan

2021

Näytä kaikki kuvailutiedot

Julkaisun pysyvä osoite on
https://urn.fi/URN:NBN:fi:amk-2021052812170

Tiivistelmä

The main goal of this thesis was to implement a multispeaker model on a web application, additional goals were to introduce the reader to text-to-speech synthesis and compare the open source multispeaker models.

Most of the information was taken from the original research papers and used to explain the work of the models and compare the models of multispeaker TTS using the MOS (Mean Opinion Score) values from the papers themselves. The development of the web application with multispeaker TTS functionality was based on open-source repositories for TTS.

As a result, the goal was not met due to the complexity of the chosen task, lack of knowledge and experience in deep learning and speech synthesis. The steps in the development of the practical application and the recommendations on how to proceed with the tasks were written and explained.

Kokoelmat

Opinnäytetyöt (Avoin kokoelma)

Implementation of a multispeaker text-to-speech synthesis on a server

Akhmetzhanov, Nursultan (2021)

Avaa tiedosto

Tiivistelmä

Kokoelmat

Samankaltainen aineisto

Speech synthesis : Developing a web application implementing speech technology ﻿

Speech and Language difficulties : An assessment of the parents experiences who have children with speech and language difficulties and services provided in northern part of Finland ﻿

Google Speech -rajapinta mobiilisovelluksessa ﻿

Speech synthesis : Developing a web application implementing speech technology

Speech and Language difficulties : An assessment of the parents experiences who have children with speech and language difficulties and services provided in northern part of Finland

Google Speech -rajapinta mobiilisovelluksessa