Log in to save to my catalogue

Multimodal Emotion Recognition from Raw Audio with Sinc-convolution

Multimodal Emotion Recognition from Raw Audio with Sinc-convolution

https://devfeature-collection.sl.nsw.gov.au/record/TN_cdi_proquest_journals_2928716451

Multimodal Emotion Recognition from Raw Audio with Sinc-convolution

About this item

Full title

Multimodal Emotion Recognition from Raw Audio with Sinc-convolution

Publisher

Ithaca: Cornell University Library, arXiv.org

Journal title

arXiv.org, 2024-02

Language

English

Formats

Publication information

Publisher

Ithaca: Cornell University Library, arXiv.org

More information

Scope and Contents

Contents

Speech Emotion Recognition (SER) is still a complex task for computers with average recall rates usually about 70% on the most realistic datasets. Most SER systems use hand-crafted features extracted from audio signal such as energy, zero crossing rate, spectral information, prosodic, mel frequency cepstral coefficient (MFCC), and so on. More recen...

Alternative Titles

Full title

Multimodal Emotion Recognition from Raw Audio with Sinc-convolution

Authors, Artists and Contributors

Identifiers

Primary Identifiers

Record Identifier

TN_cdi_proquest_journals_2928716451

Permalink

https://devfeature-collection.sl.nsw.gov.au/record/TN_cdi_proquest_journals_2928716451

Other Identifiers

E-ISSN

2331-8422

How to access this item