Accomplishments
Line spectral pairs based voice conversion using radial basis function
- Abstract
Voice conversion is a technique which morphs the speaker dependent acoustical cues of the source speaker to those of the target speaker. Speaker dependent acoustical cues are characterized at different levels such as shape of vocal tract and glottal excitation. In this paper, vocal tract parameters and glottal excitations are characterized using line spectral pairs (LSP) and pitch residual, respectively. Strong generalization ability of radial basis function (RBF) is utilized to map the acoustical cues namely, LSP and pitch residual of source speaker to that of target speaker. The subjective and objective measures are used to evaluate the comparative performance of RBF and state of the art Gaussian mixture model (GMM) based voice conversion system. Objective measures and simulation results indicate that the RBF transformation model performed better than GMM model. Subjective evaluations illustrate that the …