Speaker identification using time-delay HMEs.

K. Chen, D. Xie, H. Chi

    Research output: Contribution to journal › Article › peer-review

    Abstract

    In this paper, we extend the Hierarchical Mixture of Experts (HME) to temporal processing and apply it to a substantial problem: text-dependent speaker identification. For multiway classification, we propose a generalized Bernoulli density in place of the multinomial logit density to avoid instability during training. A time-delay technique is applied for spatio-temporal processing in the HME, and a scheme is presented for combining multiple time-delay HMEs to perform a multi-scale analysis of the temporal data. Using time-delay HMEs trained with the EM algorithm, together with the combination of multiple time-delay HMEs, the speaker identification system performs well and its training is notably fast. We also address some issues concerning time-delay techniques in the HME.
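
    The abstract does not specify how the time-delay inputs are built. As a rough illustration only, one common reading of a time-delay technique is to stack each feature frame with a fixed number of preceding frames, and to use several delay lengths for a multi-scale view. The sketch below assumes that reading; the function name `time_delay_windows`, the `delay` parameter, and the toy frames are hypothetical and not taken from the paper.

```python
from typing import List, Sequence


def time_delay_windows(
    frames: Sequence[Sequence[float]], delay: int
) -> List[List[float]]:
    """Stack each frame with its `delay` predecessors into one flat vector.

    Assumption: this is a generic time-delay embedding, not necessarily
    the exact construction used by the authors.
    """
    if delay < 0:
        raise ValueError("delay must be non-negative")
    width = delay + 1  # number of consecutive frames per window
    windows: List[List[float]] = []
    for end in range(width, len(frames) + 1):
        window: List[float] = []
        for frame in frames[end - width:end]:
            window.extend(frame)  # concatenate frames in time order
        windows.append(window)
    return windows


# Toy sequence of four 2-dimensional feature frames (made-up values).
frames = [[0.1, 0.2], [0.3, 0.4], [0.5, 0.6], [0.7, 0.8]]

# Two delay lengths give two temporal scales; in a multi-scale setup,
# each scale would feed its own model before the outputs are combined.
short_scale = time_delay_windows(frames, delay=1)
long_scale = time_delay_windows(frames, delay=2)
```

    Under this assumption, each delay length yields inputs for a separate time-delay model, and the combining scheme mentioned in the abstract would then merge the outputs of those models.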
    Original language: English
    Pages (from-to): 29-43
    Number of pages: 14
    Journal: International Journal of Neural Systems
    Volume: 7
    Issue number: 1
    Publication status: Published - Mar 1996