Tamkang University Institutional Repository: Item 987654321/34135


    Please use this permanent URL to cite or link to this item: https://tkuir.lib.tku.edu.tw/dspace/handle/987654321/34135


    Title: 具結構描述之物件比對
    Other title: Detection of duplicates in structured objects
    Author: 林志龍; Lin, Chih-lung
    Contributors: Tamkang University, Department of Information Management, Master's Program
    魏世杰; Wei, Shih-chieh
    Date: 2005
    Uploaded: 2010-01-11 04:57:33 (UTC+8)
    Abstract: We propose a method for detecting duplicates among structured objects. Two factors affect whether a pair of objects should be judged similar: the structure of the objects and the missing values of their elements. The fields of a structured object differ in how much they matter for matching, and missing values limit what is known about an object: the more values are missing, the less we know about the object and the less information is available for comparison. We therefore assign different weights to the fields of the object structure and compute a confidence value to account for missing element values. The similarity and confidence values are then aggregated into a summary similarity value for each pair of objects, which is used to decide whether the two objects are duplicates. Finally, experiments on synthetic address-book and census datasets show that the method effectively improves both precision and recall in duplicate detection.
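The abstract does not give the thesis's formulas, so the following is only a minimal Python sketch of the general idea it describes: per-field weights, a confidence value that drops as values go missing, and an aggregated score. The function names, the use of `difflib.SequenceMatcher` as the field comparator, the definition of confidence as the share of total weight that could be compared, the product rule for combining similarity with confidence, and the 0.8 threshold are all assumptions made for illustration, not the author's method.

```python
from difflib import SequenceMatcher


def field_similarity(a: str, b: str) -> float:
    """String similarity in [0, 1]; an assumed stand-in for the thesis's comparator."""
    return SequenceMatcher(None, a.lower(), b.lower()).ratio()


def compare_records(rec_a: dict, rec_b: dict, weights: dict) -> tuple:
    """Return (similarity, confidence) for two records.

    similarity: weighted mean of field similarities over fields present in both.
    confidence: share of the total field weight that could actually be compared
                (an assumed definition; missing values lower it).
    """
    total_weight = sum(weights.values())
    observed_weight = 0.0
    weighted_sim = 0.0
    for field, weight in weights.items():
        a, b = rec_a.get(field), rec_b.get(field)
        if not a or not b:
            # A missing or empty value on either side: the field is skipped,
            # which reduces confidence rather than similarity.
            continue
        observed_weight += weight
        weighted_sim += weight * field_similarity(a, b)
    if observed_weight == 0:
        return 0.0, 0.0
    return weighted_sim / observed_weight, observed_weight / total_weight


def is_duplicate(rec_a: dict, rec_b: dict, weights: dict, threshold: float = 0.8) -> bool:
    """Assumed aggregation rule: similarity scaled by confidence, then thresholded."""
    similarity, confidence = compare_records(rec_a, rec_b, weights)
    return similarity * confidence >= threshold
```

With hypothetical weights `{"name": 5, "phone": 3, "email": 2}`, two address-book entries that agree on the name but lack a phone number and email on one side get a similarity of 1.0 but a confidence of only 0.5, so under this sketch they are not declared duplicates. That matches the abstract's point that missing values reduce how much can be concluded from a match.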
    Appears in collections: [Department and Graduate Institute of Information Management] Theses and Dissertations

    Files in this item:

    File    Size    Format     Views
    (none)  0Kb     Unknown    277

    All items in the institutional repository are protected by their original copyright.
