English  |  正體中文  |  简体中文  |  全文筆數/總筆數 : 62830/95882 (66%)
造訪人次 : 4156335      線上人數 : 720
RC Version 7.0 © Powered By DSPACE, MIT. Enhanced by NTU Library & TKU Library IR team.
搜尋範圍 查詢小技巧:
  • 您可在西文檢索詞彙前後加上"雙引號",以獲取較精準的檢索結果
  • 若欲以作者姓名搜尋,建議至進階搜尋限定作者欄位,可獲得較完整資料
  • 進階搜尋
    請使用永久網址來引用或連結此文件: https://tkuir.lib.tku.edu.tw/dspace/handle/987654321/34174


    題名: 一個能發掘更具意義循序樣式的探勘流程
    其他題名: A procedure to discover more meaningful sequential patterns
    作者: 顏志祐;Yen, Chih-yu
    貢獻者: 淡江大學資訊管理學系碩士班
    徐煥智;Shyur, Huan-jyh
    關鍵詞: 資料探勘;循序樣式;樣式成長;信賴度;data mining;sequential pattern;pattern- growth;Confidence
    日期: 2008
    上傳時間: 2010-01-11 04:59:48 (UTC+8)
    摘要: 循序樣式探勘主要是從序列資料庫中,找出與時間相關的行為樣式。過去針對循序樣式探勘所提出的方法中,多半沒有考慮到樣式的可信程度(confidence)。除此之外,探勘循序樣式雖然能夠得到事件發生的先後順序,但對於事件間的時間資訊卻非常有限。
    本篇論文提出一個新的演算法E-PrefixSpan,目的是從序列資料庫中探勘頻繁且更具可信度的關聯規則。我們以PrefixSpan演算法[20]為基礎,利用樣式成長(pattern-growth)[21]的探勘方法,來發掘時間相關的循序樣式。E-PrefixSpan演算法會記錄項目間的時間間隔,並建立映射資料庫來降低資料庫的掃描次數,在產生樣式過程中會依據樣式的可信度,來減少探勘中會產生龐大的樣式數量,同時確保不會造成重要樣式資訊的遺漏。
    我們與現存的循序樣式演算法比較,並說明我們演算法在其他方法上更能補足的地方。效能評估實驗顯示E-PrefixSpan能有效縮減所產生的關聯樣式,更能提供探勘結果額外的時間間隔資訊。
    Sequential pattern mining technique is developed to determine time-related behavior in sequence databases. Most of the previous proposed methods discover frequent subsequences as patterns but do not consider the confidence issue. Besides, although the discovered sequential patterns can reveal the order of events, but the time between events is not well determined.
    This dissertation presents, E-PrefixSpan, a new method for mining frequent and more confident association rules from sequential databases. The method is based on the PrefixSpan[20] algorithm. To take the advantage of the pattern-growth[21] mining approach and discover the time related sequential patterns, E-PrefixSpan records the time-intervals between items and creates projected databases to reduce the times of database scanning. Sequential pattern mining often generates a huge number of rules. To reduce the number of the correlated pattern without information loss, E-PrefixSpan applys the confidence pattern mining technique .
    The proposed approach is compared to existing sequential pattern mining methods to show how they complement each other to discover association rules. Our performance study shows that E-PrefixSpan is a valuable approach to condense the correlated patterns and provide additional time-interval information for sequential pattern.
    顯示於類別:[資訊管理學系暨研究所] 學位論文

    文件中的檔案:

    檔案 大小格式瀏覽次數
    0KbUnknown327檢視/開啟

    在機構典藏中所有的資料項目都受到原著作權保護.

    TAIR相關文章

    DSpace Software Copyright © 2002-2004  MIT &  Hewlett-Packard  /   Enhanced by   NTU Library & TKU Library IR teams. Copyright ©   - 回饋