This projects is regarding high available services hosted in the cloud using different cloud technologies.
The project work includes investigating development and deployment of cloud-native services and increasing their availability, fault tolerance and operating costs and energy.
Automated Machine Learning methods using Reinforcement Learning
My research focuses mainly on:
-Design AutoML related models
-Neural Architecture Search
-Hyperparameters optimization
-Performance and Evaluation optimization
-Adaptive methods
Automatic speech recognition for less-resource language
Focused on Automatic Speech Recognition (ASR)
1.Attempting to explore the implications of data augmentation solution for limited-resource languages.
2.trying to find the optimal data augmentation solution by using Automatic Data Augmentation technical.
Automatic speech recognition for less-resource language
Focused on Automatic Speech Recognition (ASR)
1.Attempting to explore the implications of data augmentation solution for limited-resource languages.
2.trying to find the optimal data augmentation solution by using Automatic Data Augmentation technical.
Automatic speech recognition for low-resource languages
Speech recognition technology has been used for a long time, but recognizing a speech accurately is a very difficult task. In this topic, we mainly use the conformer-ctc model provided by open-source toolkits (Nemo), and fine-tune the model to achieve better training results. If you are interested in automatic speech recognition, and have a good foundation in python, it is highly recommended that you choose this topic.
Automatic speech recognition for low-resource languages
Speech recognition technology has been used for a long time, but recognizing a speech accurately is a very difficult task. In this topic, we mainly use the conformer-ctc model provided by open-sourch toolkits(Nemo), and fine-tune the model to achieve better training results. If you are interested in automatic speech recognition, and have a good foundation in python, it is highly recommended that you choose this topic.
Automatic speech recognition for low-resource languages
Speech recognition technology has been used for a long time, but recognizing a speech accurately is a very difficult task. In this topic, we mainly use the conformer-ctc model provided by open-source toolkits (Nemo), and fine-tune the model to achieve better training results. If you are interested in automatic speech recognition, and have a good foundation in python, it is highly recommended that you choose this topic.
Automatic speech recognition for low-resource languages
Speech recognition technology has been used for a long time, but recognizing a speech accurately is a very difficult task. In this topic, we mainly use the conformer-ctc model provided by open-source toolkits (Nemo), and fine-tune the model to achieve better training results. If you are interested in automatic speech recognition, and have a good foundation in python, it is highly recommended that you choose this topic.
Automatic speech recognition for low-resource languages
Speech recognition technology has been used for a long time, but recognizing a speech accurately is a very difficult task. In this topic, we mainly use the conformer-ctc model provided by open-source toolkits (Nemo), and fine-tune the model to achieve better training results. If you are interested in automatic speech recognition, and have a good foundation in python, it is highly recommended that you choose this topic.
BRAIN2SPEECH: EEG alapú kommunikációs agy-gép interfész deep learning módszerekkel
A beszéd az emberi kommunikáció elsődleges és legfontosabb eszköze. Sokan azonban elvesztették ezt a képességüket betegség vagy egészségkárosodás okán. A kommunikációs agy-gép interfész (BCI) célja, hogy természetes vagy ahhoz közeli kommunikációs csatornát biztosítsanak olyan személyek számára, akik fizikai vagy neurológiai károsodás miatt nem tudnak beszélni. A beszéd valós idejű szintézise közvetlenül a mért idegi aktivitásból (EEG) lehetővé tenné a természetes beszédet, és jelentősen javítaná az életminőségét, különösen a kommunikációban súlyosan korlátozott személyek számára.
A hallgató feladata megismerkedni a BRAIN2SPEECH területtel, majd új típusú neurális hálózat architektúrák (pl. konvolúciós és rekurrens hálózatok) kidolgozása és tanítása több beszélő adataival.
Az önálló munka / diplomaterv a BME Beszédtechnológia és Intelligens Interakciók Laboratóriumában készül. A hallgató feladatának a következőkre kell kiterjednie:
- Tekintse át beszédtechnológiában az electroencephalogram-akusztikum becslés szakirodalmát.
- Vizsgálja meg, milyen típusú neurális hálózatokat alkalmaztak eddig a kommunikációs agy-gép interfész területen!
- Vizsgálja meg különböző neurális hálózat architektúrák (pl. konvolúciós és rekurrens hálózatok, ResNet, SkipNet) alkalmazhatóságát.
- Az elkészült modelleket tesztelje objektív mérőszámokkal és szubjektív teszt keretében!
- Munkáját részletesen dokumentálja!