Tamkang University Institutional Repository: Item 987654321/33908
    Please use this permanent URL to cite or link to this item: https://tkuir.lib.tku.edu.tw/dspace/handle/987654321/33908


    Title: 機率類神經網路在垃圾郵件過濾之應用
    Other title: Application of probabilistic neural network methods to spam filtering
    Author: Wu, Tsung-ho (吳宗和)
    Contributors: Master's Program, Department of Statistics, Tamkang University
    Chen, Ching-hsiang (陳景祥)
    Keywords: Probabilistic neural network; Bayes classifier; CART; Spam; Data mining; Decision tree; Neural network
    Date: 2005
    Upload time: 2010-01-11 04:39:44 (UTC+8)
    Abstract: This study builds a spam-blocking mechanism on the basis of common data mining theory. Email features are extracted with the PHP programming language, and emails are then classified with a probabilistic neural network (PNN), a naïve Bayes classifier, and C&RT (Classification and Regression Tree) so that the three classification models can be compared. Considering two possible situations, the PNN with the smoothing parameter set to 0.01 or 0.1 performs best, followed by the Bayes classifier and C&RT. One-way analysis of variance (ANOVA) and Tukey's honestly significant difference multiple comparison test are also used to compare the models objectively, and the results agree with the earlier conclusion. In addition, risk analysis is used to give users a different perspective on email classification and to assess whether each classification model meets the user's needs. Finally, keyword search on the email subject and sender name is added to build blacklist and whitelist filtering, which is combined with PNN classification to see whether the evaluation criteria improve.
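    The PNN-plus-list filtering described in the abstract can be illustrated with a minimal sketch. It is not the thesis implementation: it assumes a Gaussian Parzen-window kernel, equal class priors, and a whitelist-then-blacklist-then-PNN order of precedence. The feature values, the names `TRAIN_X`, `TRAIN_Y`, `pnn_predict`, and `classify_email`, and the sample list entries are all hypothetical. Only the smoothing parameter values 0.01 and 0.1 come from the abstract.

```python
import math

# Hypothetical toy training set: two numeric features per email.
TRAIN_X = [[0.9, 0.8], [0.8, 0.9], [0.1, 0.2], [0.2, 0.1]]
TRAIN_Y = ["spam", "spam", "ham", "ham"]


def pnn_log_scores(x, train_x, train_y, sigma):
    """Return {label: log of the unnormalised Parzen density estimate at x}."""
    log_kernels = {}
    for xi, yi in zip(train_x, train_y):
        sq_dist = sum((a - b) ** 2 for a, b in zip(x, xi))
        # Log of the Gaussian kernel; the normalising constant is the same
        # for every class, so it is omitted.
        log_kernels.setdefault(yi, []).append(-sq_dist / (2.0 * sigma ** 2))
    scores = {}
    for label, vals in log_kernels.items():
        m = max(vals)
        # Log-mean-exp keeps small sigma values from underflowing to zero.
        scores[label] = m + math.log(sum(math.exp(v - m) for v in vals) / len(vals))
    return scores


def pnn_predict(x, train_x, train_y, sigma=0.1):
    """Pick the class with the largest estimated density (equal priors assumed)."""
    scores = pnn_log_scores(x, train_x, train_y, sigma)
    return max(scores, key=scores.get)


def classify_email(subject, sender, features, whitelist, blacklist_keywords, sigma=0.1):
    """Assumed precedence: sender whitelist, then subject blacklist, then PNN."""
    if sender.lower() in whitelist:
        return "ham"
    if any(word in subject.lower() for word in blacklist_keywords):
        return "spam"
    return pnn_predict(features, TRAIN_X, TRAIN_Y, sigma)


if __name__ == "__main__":
    print(pnn_predict([0.85, 0.75], TRAIN_X, TRAIN_Y, sigma=0.01))
    print(classify_email("Meeting notes", "advisor@example.org", [0.9, 0.9],
                         whitelist={"advisor@example.org"},
                         blacklist_keywords={"free money"}))
```

    Working in the log domain is a design choice for this sketch: with a smoothing parameter as small as 0.01, the raw Gaussian kernel values for distant training points would underflow to zero and every class could tie.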
    Appears in collections: [Department and Graduate Institute of Statistics] Theses and Dissertations

    Files in this item:

    File: (no name listed); Size: 0 Kb; Format: Unknown; Views: 244

    All items in the institutional repository are protected by the original copyright.
