Abstract: This paper proposes a multi-modal framework for enhancing robot social perception in human-robot interaction applications. By integrating multiple sensory modalities-such as visual, auditory ...