Visible to the public Biblio

Filters: Author is Li, Haoqi  [Clear All Filters]
2018-12-10
Tseng, Shao-Yen, Li, Haoqi, Baucom, Brian, Georgiou, Panayiotis.  2018.  "Honey, I Learned to Talk": Multimodal Fusion for Behavior Analysis. Proceedings of the 20th ACM International Conference on Multimodal Interaction. :239–243.
In this work we analyze the importance of lexical and acoustic modalities in behavioral expression and perception. We demonstrate that this importance relates to the amount of therapy, and hence communication training, that a person received. It also exhibits some relationship to gender. We proceed to provide an analysis on couple therapy data by splitting the data into clusters based on gender or stage in therapy. Our analysis demonstrates the significant difference between optimal modality weights per cluster and relationship to therapy stage. Given this finding we propose the use of communication-skill aware fusion models to account for these differences in modality importance. The fusion models operate on partitions of the data according to the gender of the speaker or the therapy stage of the couple. We show that while most multimodal fusion methods can improve mean absolute error of behavioral estimates, the best results are given by a model that considers the degree of communication training among the interlocutors.