English  |  正體中文  |  简体中文  |  Items with full text/Total items : 64178/96951 (66%)
Visitors : 9386935      Online Users : 540
RC Version 7.0 © Powered By DSPACE, MIT. Enhanced by NTU Library & TKU Library IR team.
Scope Tips:
  • please add "double quotation mark" for query phrases to get precise results
  • please goto advance search for comprehansive author search
  • Adv. Search
    HomeLoginUploadHelpAboutAdminister Goto mobile version
    Please use this identifier to cite or link to this item: https://tkuir.lib.tku.edu.tw/dspace/handle/987654321/96043


    Title: Compact Speech Features Based on Wavelet Transform and PCA with Application to Speaker Identification
    Authors: Hsieh, Ching-Tang;Lai, Eugene;Chen, Wan-Chen;Wang, You-Chuang
    Contributors: 淡江大學電機工程學系
    Keywords: 主元件分析;特徵辨識;特徵萃取;多頻帶特徵;小波轉換;Principal Component Analysis;Feature Recognition;Fecture Extraction;Multiband Feature;Wavelet Transform
    Date: 2002-08
    Issue Date: 2014-02-13 11:38:27 (UTC+8)
    Abstract: The main goal of this paper is to find some effective methods to improve the performance of speaker identification system. In speaker identification, we use wavelet transform to decompose the speech signals into several frequency bands and then use cepstral coefficients to capture the individualities of vocal track within the interested bands based on the acoustic characteristic of human ear. In addition, an adaptive wavelet-based filtering mechanism is applied to eliminate the small variation of wavelet coefficients caused by noise. In order to effectively utilize all these multi-band speech features, we propose a modified vector quantization method called multi-layer eigen-codebook vector quantization (MLECVQ) as the identifier. This model uses the multi-layer concept to eliminate the interference between the multi-band coefficients and then uses the principal component analysis (PCA) method to evaluate the codebooks for capturing more details of phoneme character. Experimental results show that the proposed method is better than the GMM+MFCC model on computational cost and recognition performance under clean and noisy speech data evaluations.
    Relation: 第三屆國際中文口述語言處理研討會暨海峽兩岸口語語音處理論壇論文集,頁165-168
    Appears in Collections:[電機工程學系暨研究所] 會議論文

    Files in This Item:

    File SizeFormat
    Compact Speech Features Based on Wavelet Transform and PCA with Application to Speaker Identification_英文摘要.docx16KbMicrosoft Word183View/Open

    All items in 機構典藏 are protected by copyright, with all rights reserved.


    DSpace Software Copyright © 2002-2004  MIT &  Hewlett-Packard  /   Enhanced by   NTU Library & TKU Library IR teams. Copyright ©   - Feedback