Abstract
Speech emotion recognition plays an important role in building more intelligent and human-like agents. Due to the difficulty of collecting speech emotional data, an increasingly popular solution is leveraging a related and rich source corpus to help address the target corpus. However, domain shift between the corpora poses a serious challenge, making domain shift adaptation difficult to function even on the recognition of positive/negative emotions. In this work, we propose class-wise adversarial domain adaptation to address this challenge by reducing the shift for all classes between different corpora. Experiments on the well-known corpora EMODB
and Aibo demonstrate that our method is effective even when only a very limited number of target labeled examples are provided.
and Aibo demonstrate that our method is effective even when only a very limited number of target labeled examples are provided.
| Original language | English |
|---|---|
| Title of host publication | Proceedings of IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP-2019) |
| DOIs | |
| Publication status | Published - 2019 |
Research Beacons, Institutes and Platforms
- Manchester Institute for Collaborative Research on Ageing