Abstract
In this paper, we explore the Input/Output HMM (IOHMM) architecture for a substantial problem, that of text-dependent speaker identification. For subnetworks modeled with generalized linear models, we extend the IRLS algorithm to the M-step of the corresponding EM algorithm. Experimental results show that the improved EM algorithm yields significantly faster training than the original one. In comparison with the multilayer perceptron, the dynamic programming technique and hidden Markov models, we empirically demonstrate that the IOHMM architecture is a promising way to text-dependent speaker identification. © 1996 Kluwer Academic Publishers.
Original language | English |
---|---|
Title of host publication | Neural Processing Letters|Neural Process Letters |
Pages | 81-89 |
Number of pages | 8 |
Volume | 3 |
Publication status | Published - 1996 |
Event | Proceeding of World Congress on Neural Networks (WCNN-96) - San Diego, U.S.A. Duration: 1 Jan 1824 → … |
Conference
Conference | Proceeding of World Congress on Neural Networks (WCNN-96) |
---|---|
City | San Diego, U.S.A. |
Period | 1/01/24 → … |
Keywords
- EM algorithm
- Input/output HMM
- Speaker identification
- Temporal processing