Multimodal Emotion Recognition: Integrating Text, Speech, and Facial Expressions for Enhanced Human‐Computer Interaction

dc.contributor.advisor	Lakatos, Róbert
dc.contributor.author	Takneshan, Mohammad
dc.contributor.department	DE--Informatikai Kar
dc.date.accessioned	2026-02-12T20:12:09Z
dc.date.available	2026-02-12T20:12:09Z
dc.date.created	2025-11-15
dc.description.abstract	This thesis presents a multimodal emotion recognition system integrating text, audio, and facial expression modalities using attention-based feature-level fusion. The system achieves 71% accuracy for seven-class emotion recognition and 85% for sentiment analysis on the MELD dataset benchmark. The thesis provides a realistic assessment of both the potential and current boundaries of MER systems, establishing a solid foundation for future research while acknowledging important limitations regarding real-world generalization and the need for culturally-specific models.
dc.description.course	Programtervező informatikus
dc.description.degree	BSc/BA
dc.format.extent	55
dc.identifier.uri	https://hdl.handle.net/2437/404525
dc.language.iso	en
dc.rights.info	Hozzáférhető a 2022 decemberi felsőoktatási törvénymódosítás értelmében.
dc.subject	Multimodal Emotion Recognition
dc.subject	Affective Computing
dc.subject	Human-Computer Interaction
dc.subject	Attention Mechanisms
dc.subject	Transfer Learning
dc.subject	Deep Learning
dc.subject.dspace	Informatics
dc.subject.dspace	Informatics::Computer Science
dc.title	Multimodal Emotion Recognition: Integrating Text, Speech, and Facial Expressions for Enhanced Human‐Computer Interaction

Megjelenítve 1 - 1 (Összesen 1)

Megjelenítve 1 - 1 (Összesen 1)