Multimodal Emotion Recognition: Integrating Text, Speech, and Facial Expressions for Enhanced Human‐Computer Interaction

dc.contributor.advisorLakatos, Róbert
dc.contributor.authorTakneshan, Mohammad
dc.contributor.departmentDE--Informatikai Kar
dc.date.accessioned2026-02-12T20:12:09Z
dc.date.available2026-02-12T20:12:09Z
dc.date.created2025-11-15
dc.description.abstractThis thesis presents a multimodal emotion recognition system integrating text, audio, and facial expression modalities using attention-based feature-level fusion. The system achieves 71% accuracy for seven-class emotion recognition and 85% for sentiment analysis on the MELD dataset benchmark. The thesis provides a realistic assessment of both the potential and current boundaries of MER systems, establishing a solid foundation for future research while acknowledging important limitations regarding real-world generalization and the need for culturally-specific models.
dc.description.courseProgramtervező informatikus
dc.description.degreeBSc/BA
dc.format.extent55
dc.identifier.urihttps://hdl.handle.net/2437/404525
dc.language.isoen
dc.rights.infoHozzáférhető a 2022 decemberi felsőoktatási törvénymódosítás értelmében.
dc.subjectMultimodal Emotion Recognition
dc.subjectAffective Computing
dc.subjectHuman-Computer Interaction
dc.subjectAttention Mechanisms
dc.subjectTransfer Learning
dc.subjectDeep Learning
dc.subject.dspaceInformatics
dc.subject.dspaceInformatics::Computer Science
dc.titleMultimodal Emotion Recognition: Integrating Text, Speech, and Facial Expressions for Enhanced Human‐Computer Interaction
Fájlok
Eredeti köteg (ORIGINAL bundle)
Megjelenítve 1 - 1 (Összesen 1)
Nincs kép
Név:
thesis.pdf
Méret:
968.88 KB
Formátum:
Adobe Portable Document Format
Leírás:
thesis
Engedélyek köteg
Megjelenítve 1 - 1 (Összesen 1)
Nincs kép
Név:
license.txt
Méret:
1.95 KB
Formátum:
Item-specific license agreed upon to submission
Leírás:
Gyűjtemények