Dorran, David and Lawlor, Bob and Coyle, Eugene
(2003)
High quality time-scale modification of speech using a peak alignment overlap-add algorithm (PAOLA).
In:
2003 IEEE International Conference on Acoustics, Speech, and Signal Processing, 2003. Proceedings. (ICASSP '03).
IEEE, pp. 700-703.
ISBN 0780376633
Abstract
The duration of a speech passage can be altered using audio time-scale modification techniques. Time-scale modification can be achieved in the time domain by segmenting the input signal into overlapping frames and recombining the frames with an overlap differing from the analysis overlap. We present a time-scale modification algorithm that uses a simple peak alignment technique to synchronize overlapping synthesis frames. The peak alignment overlap-add (PAOLA) algorithm also takes advantage of waveform properties to ensure a high quality output for the minimum number of iterations. The new algorithm produces a time-scaled output of approximately equal quality to that of an adaptive implementation of the commercially popular synchronised overlap-add (SOLA) algorithm, but offers a computational saving ranging from a factor of 15 (for a time-scale factor of 0.5) to 170 (for a time-scale factor of 1.1).
Item Type: |
Book Section
|
Keywords: |
speech processing; time-domain analysis; synchronisation; speech synthesis; audio signal processing; |
Academic Unit: |
Faculty of Science and Engineering > Electronic Engineering |
Item ID: |
8791 |
Identification Number: |
https://doi.org/10.1109/ICASSP.2003.1198877 |
Depositing User: |
Robert Lawlor
|
Date Deposited: |
11 Sep 2017 15:46 |
Publisher: |
IEEE |
Refereed: |
Yes |
URI: |
|
Use Licence: |
This item is available under a Creative Commons Attribution Non Commercial Share Alike Licence (CC BY-NC-SA). Details of this licence are available
here |
Repository Staff Only(login required)
|
Item control page |
Downloads per month over past year
Origin of downloads