Multimodal Fusion via Hypergraph Autoencoder and Contrastive Learning for Emotion Recognition in Conversation

About this item

Full title

Multimodal Fusion via Hypergraph Autoencoder and Contrastive Learning for Emotion Recognition in Conversation

Author / Creator

Yi, Zijian; Zhao, Ziming; Shen, Zhishu; Zhang, Tiehua

Publisher

Ithaca: Cornell University Library, arXiv.org

Journal title

arXiv.org, 2024-09

Language

English

Formats

Articles

Scope and Contents

Contents

Multimodal emotion recognition in conversation (MERC) seeks to identify the emotion a speaker expresses in each utterance, offering significant potential across diverse fields. The challenge of MERC lies in balancing speaker modeling and context modeling, covering both long-distance and short-distance contexts, while also handling the complexity of multimodal information fusion. Recent research adopts graph-based methods to model intricate conversational relationships. Nevertheless, most of these methods use a fixed, fully connected structure to link all utterances and rely on convolution to interpret complex context. This approach can increase redundancy in contextual messages and cause excessive graph network smoothing, particularly in long-distance conversations. To address this issue, we propose a framework that dynamically adjusts hypergraph connections using a variational hypergraph autoencoder (VHGAE) and employs contrastive learning to mitigate uncertainty factors during the reconstruction process. Experimental results demonstrate the effectiveness of our proposal against state-of-the-art methods on the IEMOCAP and MELD datasets. We release the code to support the reproducibility of this work at https://github.com/yzjred/-HAUCL....

Authors, Artists and Contributors

Author / Creator

Yi, Zijian
Zhao, Ziming
Shen, Zhishu
Zhang, Tiehua

Identifiers

Primary Identifiers

Record Identifier

TN_cdi_proquest_journals_3088965960

Permalink

https://devfeature-collection.sl.nsw.gov.au/record/TN_cdi_proquest_journals_3088965960

Other Identifiers

E-ISSN

2331-8422

DOI

10.48550/arxiv.2408.00970

How to access this item

Full text available
