COMPUTER VISION AND NATURAL LANGUAGE PROCESSING IN MACHINE LEARNING
Computer Vision (CV) and Natural Language Processing (NLP) are two major subfields of machine learning, and both are active areas of research.
These two subfields overlap in tasks such as generating text from images (image2text) and generating images from text (text2image).
Story Visualization, the generation of a sequence of images that depicts a multi-sentence story, has emerged as a new subfield, driven by advances in GANs and diffusion models.
The task of the student(s) is to explore the topic of Story Visualization by investigating and applying state-of-the-art models in the field.
No. of students: 1-3
Contact email: alshouha@edu.bme.hu