淡江大學機構典藏:Item 987654321/112025
English  |  正體中文  |  简体中文  |  全文筆數/總筆數 : 62797/95867 (66%)
造訪人次 : 3745176      線上人數 : 483
RC Version 7.0 © Powered By DSPACE, MIT. Enhanced by NTU Library & TKU Library IR team.
搜尋範圍 查詢小技巧:
  • 您可在西文檢索詞彙前後加上"雙引號",以獲取較精準的檢索結果
  • 若欲以作者姓名搜尋,建議至進階搜尋限定作者欄位,可獲得較完整資料
  • 進階搜尋
    請使用永久網址來引用或連結此文件: https://tkuir.lib.tku.edu.tw/dspace/handle/987654321/112025


    題名: Resource-Constrained Decision Tree Induction
    資源有限下的決策樹建構
    作者: 吳家齊
    Wu, Chia-Chi
    貢獻者: 淡江大學管科系
    陳彥良
    關鍵詞: data mining;cost-sensitive learning;decision tree;classification
    決策樹;資料探勘;分類;成本感知學習
    日期: 2010-07
    上傳時間: 2017-11-09 10:52:46 (UTC+8)
    摘要: Classification is one of the most important research domains in data mining. Among the existing classifiers, decision trees are probably the most popular and commonly-used classification models. Most of the decision tree algorithms aimed to maximize the classification accuracy and minimize the classification error. However, in many real-world applications, there are various types of cost or resource consumption involved in both the induction of decision tree and the classification of future instance. Furthermore, the problem we face may require us to complete a classification task with limited resource. Therefore, how to build an optimum decision tree with resource constraint becomes an important issue. In this study, we first propose two algorithms which are improved versions of traditional TDIDT(Top-Down Induction on Decision Trees) algorithms. Then, we adopt a brand new approach to deal with multiple resource constraints. This approach extracts association classification rules from training dataset first, and then builds a decision tree from the extracted rules. Empirical evaluations were carried out using real datasets, and the results indicated that the proposed methods can achieve satisfactory results in handling data under different resource constraints.

    分類是資料探勘中一個非常重要的研究領域。在現存的許多分類器當中,決策樹可能是最受歡迎、也最常被使用的分類模型。現有的大多數決策樹演算法皆致力於將分類精確度最大化、將分類錯誤率最小化。然而,在許多現實生活應用中,從以現有資料建立決策樹,到用決策樹分類未來資料的每個過程,都可能包含了各式各樣不同種類的成本或資源消耗。依據我們所面對的問題,我們也有可能需要在有限的資源底下完成分類工作。因此,如何在資源有限下建立出最適用的決策樹是一個很重要的議題。在本研究中,我們首先提出了兩個改良自傳統TDIDT﹝Top-Down Induction on Decision Trees, 由上而下的決策樹建構﹞的演算法。接著,我們採用了一個全新的方法來處理多種資源限制的問題。我們所提出的新方法先從訓練資料集中粹取出所有合法的分類規則,再利用這些粹取出的規則建出一棵決策樹。我們使用實際資料來進行完整的實驗評估。實驗結果顯示,我們提出的方法在不同資源限制下的表現都是令人滿意的。
    顯示於類別:[管理科學學系暨研究所] 學位論文

    文件中的檔案:

    沒有與此文件相關的檔案.

    在機構典藏中所有的資料項目都受到原著作權保護.

    TAIR相關文章

    DSpace Software Copyright © 2002-2004  MIT &  Hewlett-Packard  /   Enhanced by   NTU Library & TKU Library IR teams. Copyright ©   - 回饋