淡江大學機構典藏:Item 987654321/34972
English  |  正體中文  |  简体中文  |  全文笔数/总笔数 : 64178/96951 (66%)
造访人次 : 9305154      在线人数 : 233
RC Version 7.0 © Powered By DSPACE, MIT. Enhanced by NTU Library & TKU Library IR team.
搜寻范围 查询小技巧:
  • 您可在西文检索词汇前后加上"双引号",以获取较精准的检索结果
  • 若欲以作者姓名搜寻,建议至进阶搜寻限定作者字段,可获得较完整数据
  • 进阶搜寻


    jsp.display-item.identifier=請使用永久網址來引用或連結此文件: https://tkuir.lib.tku.edu.tw/dspace/handle/987654321/34972


    题名: 利用關聯式法則改善文件分類準確度 : 靜態與動態門檻值問題之探討
    其它题名: Improve document classify accuracy by association rule : static threshold and dynamic threshold research
    作者: 洪茂盛;Hung, Mao-sheng
    贡献者: 淡江大學資訊工程學系碩士在職專班
    蔣定安;Chiang, Ding-an
    关键词: 文件分類;關聯式法則;靜態門檻值;動態門檻值;document classification;association rule;static threshold;dynamic threshold
    日期: 2009
    上传时间: 2010-01-11 05:50:21 (UTC+8)
    摘要: 在利用關聯式法則(Association Rule)做分類時,一般關聯式法則分類(Association Rule Classification)的信賴度門檻值設定,大多是依據經驗法則來設定單一且固定的信賴值(Confidence)為其門檻值(Threshold value),所以在設定上較為主觀。同時為了提升分類的準確度,依經驗通常選取較高的信賴度當門檻值,但門檻值若設定太高時,則容易使得部分文件因缺乏規則無法判斷其歸屬的類別,而必須利用預設規則(default rule)將這些文件分類成預設類別;若將門檻值降低則可能造成文件分類錯誤而降低分類的效能。因此本論文將針對門檻值問題做相關探討。
    本論文針對信賴度門檻值問題,分兩部分來討論,一為採取靜態門檻值或動態門檻值;另外是採單一門檻值或多重門檻值。動態門檻值的概念是在每次分類後比較準確率是否有提升,決定是否向上修定原始的門檻值;而多重門檻值的概念則是可根據不同的類別,設定不同的門檻值。
    實驗將依據不同的組合來設定門檻值,同時,希望能依據實驗的結果,找出如何能以客觀的方式來設定信賴度門檻值,並提升分類的效能。
    While using association rule for classification, the experience for association classification rules setting is following single and fixed confidence threshold value, hence is comparatively subjective. In order to increase the accuracy of classification, usually choose higher confidence in accordance with experience, but if set the confidence too high, might cause a part of documentations failed to justify the attributes by lacking rules; if set the confidence too low, it may decrease the documentation classification efficiency.
    This thesis focus on the threshold value discussion, which divides into two parts, one is static threshold value, though the training process is quicker and simpler, but during the classification procedure, the accuracy that originally already been improved could probably be influenced by follow-up lower confidence rule, namely this kind of confidence rule accuracy is low than the original threshold value setting, therefore may decrease the documentation classification efficiency, so that this thesis proposes the dynamic threshold value, to determine whether the threshold value is upward revision by after each classified whether comparative improved the accuracy or not, also propose in an objective way to set the confidence threshold value to improve the classification efficiency, this thesis proved by experiment the dynamic threshold value can obtain better classification efficiency than static threshold value.
    显示于类别:[資訊工程學系暨研究所] 學位論文

    文件中的档案:

    档案 大小格式浏览次数
    0KbUnknown300检视/开启

    在機構典藏中所有的数据项都受到原著作权保护.

    TAIR相关文章

    DSpace Software Copyright © 2002-2004  MIT &  Hewlett-Packard  /   Enhanced by   NTU Library & TKU Library IR teams. Copyright ©   - 回馈