English  |  正體中文  |  简体中文  |  全文筆數/總筆數 : 64185/96962 (66%)
造訪人次 : 12755982      線上人數 : 9961
RC Version 7.0 © Powered By DSPACE, MIT. Enhanced by NTU Library & TKU Library IR team.
搜尋範圍 查詢小技巧:
  • 您可在西文檢索詞彙前後加上"雙引號",以獲取較精準的檢索結果
  • 若欲以作者姓名搜尋,建議至進階搜尋限定作者欄位,可獲得較完整資料
  • 進階搜尋
    請使用永久網址來引用或連結此文件: https://tkuir.lib.tku.edu.tw/dspace/handle/987654321/52147


    題名: 具有時間限制條件的最長頻繁循序樣式探勘演算法
    其他題名: Maximal sequential patterns mining with timing constraints
    作者: 林師晟;Lin, Shi-cheng
    貢獻者: 淡江大學資訊管理學系碩士班
    周清江
    關鍵詞: 循序樣式;資料探勘;最長頻繁循序樣式;sequential pattern;data mining;maximal frequent sequential pattern
    日期: 2010
    上傳時間: 2010-09-23 16:56:44 (UTC+8)
    摘要: 循序樣式探勘的目的是從資料庫中,尋找頻繁出現且有順序的樣式,通常這些樣式會再被轉換成先前所不知道的、有用的與有價值的資訊。由於資料庫中通常存在大量且長時間的資料,因此循序樣式探勘往往需要花上大量時間。但是一般的頻繁循序樣式探勘演算法無法針對要尋找之頻繁循序樣式的時間條件加以限制,使得探勘得到的頻繁循序樣式太多且不易應用。探勘最長頻繁循序樣式雖可得到意義相同且較精簡的樣式集合,但針對長度為k之最長頻繁循序樣式探勘,不論是以PrefixSpan或是Apriori為基礎的演算法皆必須經過k個回合的探勘。當k值越大,所需的探勘回合數越多,探勘所需時間也越久。
    本研究提出具有時間限制條件的最長頻繁循序樣式探勘演算法,可針對最長頻繁循序樣式所發生的時間加以限制,且具有不需k回合即可找出長度為k之最長頻繁循序樣式探勘的特性,並以實驗證明此演算法在設定時間條件後可以加快最長頻繁循序樣式探勘的速度。最後,我們將演算法應用於探勘車流量紀錄之資料庫,說明加上不同的時間限制條件後,可以得到具有不同時間意義之最長頻繁循序樣式。
    The purpose of frequent sequential pattern mining is to find sequential patterns which occur more frequently than a given threshold. Normally these patterns are then transformed into previously-unknown useful and valuable information. Because of accumulated huge number of records in the database, frequent sequential pattern mining often takes a lot of time. Since most frequent sequential pattern mining algorithms do not have timing constraints, lots of frequent sequential patterns are found. It is difficult to decide which patterns among them are useful. Maximal frequent sequential pattern mining could obtain more compact patterns without losing any results obtained in frequent sequential pattern mining. However, most of these algorithms must complete k rounds to obtain maximal frequent sequential patterns with length k. The longer the maximal frequent sequential patterns, the more rounds the mining requires. The required mining time would be longer accordingly.
    We propose an algorithm which obtains maximal frequent sequential patterns with timing constraints. This algorithm can restrict the occurring time-interval of the obtained maximal sequential patterns. It could obtain maximal frequent sequential patterns with length k in less than k rounds. We demonstrate that the timing constraints could speed up the mining process. Finally, we apply our algorithm to a database of traffic flow records, and illustrate how to obtain maximal frequent sequential patterns with different timing meaning according to selected timing constraints.
    顯示於類別:[資訊管理學系暨研究所] 學位論文

    文件中的檔案:

    檔案 大小格式瀏覽次數
    index.html0KbHTML299檢視/開啟

    在機構典藏中所有的資料項目都受到原著作權保護.

    TAIR相關文章

    DSpace Software Copyright © 2002-2004  MIT &  Hewlett-Packard  /   Enhanced by   NTU Library & TKU Library IR teams. Copyright ©   - 回饋