TÁVKÖZLÉSI ÉS MÉDIAINFORMATIKAI TANSZÉK
Budapesti Műszaki és Gazdaságtudományi Egyetem - Villamosmérnöki és Informatikai Kar

Speaker Adaptation Based deep neural network - Text to Speech Synthesis

Speech processing has attracted the interest of both scholars and industry during the last few decades. The technique of converting text into artificial speech is known as speech synthesis. It can be utilized in a blind person's speech monitoring system, a web browser, mobile phones, PCs, and laptops. Nowadays, every effort is taken to generate as natural a synthesized sound as possible. Our project aims to create a speaker adaption model that uses a Deep Neural Network to synthesize speech. The project will be completed using Merlin (a speech synthesis toolkit that uses neural networks to create speech).
Kapcsolódó oktatók:
Kapcsolódó tárgyak:
  • Info, BSc, Önálló laboratórium
  • Önálló laboratórium
  • Önálló laboratórium, VIR BSc szakirány
  • Önálló laboratórium 1, Médiainformatika
  • Önálló laboratórium 2, Médiainformatika
  • Önálló laboratórium 1, Info, Msc,
  • Önálló laboratórium 2, Info, MSc,