Abstract: Emotion recognition from multimodal data has become increasingly vital for affective computing applications such as virtual assistants, social robots, and mental health monitoring. Among ...