A variable frame-rate scheme for sinusoidal transform coding

    Research output: Chapter in Book/Report/Conference proceeding › Conference contribution › peer-review

    Abstract

    Sinusoidal transform coding (STC) is known to produce good communication-quality speech at bit rates below 4 kb/s. Discrete all-pole modelling (DAP), which can be more accurate than conventional linear prediction (LP) analysis for voiced speech, is adopted to improve short-term spectral estimation, with a modification to accommodate unvoiced speech in STC. A more robust voicing cut-off frequency, derived by frequency-domain analysis-by-synthesis, divides the whole power spectrum into a lower voiced band and an upper unvoiced band and enhances STC performance. Because the characteristics of speech evolve differently over time, this paper proposes a variable frame-rate coding scheme, motivated by a further investigation of the likely causes of quality loss in reconstructed speech. The scheme improves performance, saves bit rate, and leads to a more flexible and effective STC vocoder.
    Original language: English
    Title of host publication: ICASSP, IEEE International Conference on Acoustics, Speech and Signal Processing - Proceedings
    Place of Publication: USA
    Publisher: IEEE
    Pages: IV/4163
    Volume: 4
    Publication status: Published - 2002
    Event: 2002 IEEE International Conference on Acoustic, Speech, and Signal Processing - Orlando, FL
    Duration: 1 Jul 2002 → …

    Conference

    Conference: 2002 IEEE International Conference on Acoustic, Speech, and Signal Processing
    City: Orlando, FL
    Period: 1/07/02 → …
