English  |  正體中文  |  简体中文  |  全文筆數/總筆數 : 49378/84106 (59%)
造訪人次 : 7367588      線上人數 : 48
RC Version 7.0 © Powered By DSPACE, MIT. Enhanced by NTU Library & TKU Library IR team.
搜尋範圍 查詢小技巧:
  • 您可在西文檢索詞彙前後加上"雙引號",以獲取較精準的檢索結果
  • 若欲以作者姓名搜尋,建議至進階搜尋限定作者欄位,可獲得較完整資料
  • 進階搜尋
    請使用永久網址來引用或連結此文件: http://tkuir.lib.tku.edu.tw:8080/dspace/handle/987654321/34972


    題名: 利用關聯式法則改善文件分類準確度 : 靜態與動態門檻值問題之探討
    其他題名: Improve document classify accuracy by association rule : static threshold and dynamic threshold research
    作者: 洪茂盛;Hung, Mao-sheng
    貢獻者: 淡江大學資訊工程學系碩士在職專班
    蔣定安;Chiang, Ding-an
    關鍵詞: 文件分類;關聯式法則;靜態門檻值;動態門檻值;document classification;association rule;static threshold;dynamic threshold
    日期: 2009
    上傳時間: 2010-01-11 05:50:21 (UTC+8)
    摘要: 在利用關聯式法則(Association Rule)做分類時,一般關聯式法則分類(Association Rule Classification)的信賴度門檻值設定,大多是依據經驗法則來設定單一且固定的信賴值(Confidence)為其門檻值(Threshold value),所以在設定上較為主觀。同時為了提升分類的準確度,依經驗通常選取較高的信賴度當門檻值,但門檻值若設定太高時,則容易使得部分文件因缺乏規則無法判斷其歸屬的類別,而必須利用預設規則(default rule)將這些文件分類成預設類別;若將門檻值降低則可能造成文件分類錯誤而降低分類的效能。因此本論文將針對門檻值問題做相關探討。
    本論文針對信賴度門檻值問題,分兩部分來討論,一為採取靜態門檻值或動態門檻值;另外是採單一門檻值或多重門檻值。動態門檻值的概念是在每次分類後比較準確率是否有提升,決定是否向上修定原始的門檻值;而多重門檻值的概念則是可根據不同的類別,設定不同的門檻值。
    實驗將依據不同的組合來設定門檻值,同時,希望能依據實驗的結果,找出如何能以客觀的方式來設定信賴度門檻值,並提升分類的效能。
    While using association rule for classification, the experience for association classification rules setting is following single and fixed confidence threshold value, hence is comparatively subjective. In order to increase the accuracy of classification, usually choose higher confidence in accordance with experience, but if set the confidence too high, might cause a part of documentations failed to justify the attributes by lacking rules; if set the confidence too low, it may decrease the documentation classification efficiency.
    This thesis focus on the threshold value discussion, which divides into two parts, one is static threshold value, though the training process is quicker and simpler, but during the classification procedure, the accuracy that originally already been improved could probably be influenced by follow-up lower confidence rule, namely this kind of confidence rule accuracy is low than the original threshold value setting, therefore may decrease the documentation classification efficiency, so that this thesis proposes the dynamic threshold value, to determine whether the threshold value is upward revision by after each classified whether comparative improved the accuracy or not, also propose in an objective way to set the confidence threshold value to improve the classification efficiency, this thesis proved by experiment the dynamic threshold value can obtain better classification efficiency than static threshold value.
    顯示於類別:[資訊工程學系暨研究所] 學位論文

    文件中的檔案:

    檔案 大小格式瀏覽次數
    0KbUnknown159檢視/開啟

    在機構典藏中所有的資料項目都受到原著作權保護.

    TAIR相關文章

    DSpace Software Copyright © 2002-2004  MIT &  Hewlett-Packard  /   Enhanced by   NTU Library & TKU Library IR teams. Copyright ©   - 回饋