English  |  正體中文  |  简体中文  |  Items with full text/Total items : 62830/95882 (66%)
Visitors : 4063620      Online Users : 416
RC Version 7.0 © Powered By DSPACE, MIT. Enhanced by NTU Library & TKU Library IR team.
Scope Tips:
  • please add "double quotation mark" for query phrases to get precise results
  • please goto advance search for comprehansive author search
  • Adv. Search
    HomeLoginUploadHelpAboutAdminister Goto mobile version
    Please use this identifier to cite or link to this item: https://tkuir.lib.tku.edu.tw/dspace/handle/987654321/125169


    Title: JCF: Joint Coarse and Fine-Grained Similarity Comparison for Plagiarism Detection Based on NLP
    Authors: Chang, C. Y.;Jhang, S.-J.;Wu, S.-J.;Roy, D. S.
    Keywords: Natural language processing;TF–IDF;Word2Vec;Coarse and fine grained;Document similarity
    Date: 2023-06-24
    Issue Date: 2024-03-07 12:05:58 (UTC+8)
    Publisher: Springer New York LLC
    Abstract: Document similarity recognition is one of the most important problems in natural language processing. This paper proposes a plagiarism comparison mechanism called JCF. Initially, the TF–IDF scheme is applied to build a bag of words as the representation of the common features of all documents. Then, the plagiarism comparison is carried out in a coarse-grained manner, which speeds up the similarity comparison. Finally, the most similar documents can then be compared in detail based on a fine-grained approach. In addition, the JCF detects plagiarism at both syntax level and semantic-like level. To prevent the distortion of similarity comparison, this paper further develops a similarity restoration approach such that the proposed JCF can obtain both advantages of quickness and accuracy. Performance studies confirm that the proposed JCF outperforms existing studies in terms of precision, recall and F1 score.
    Relation: Journal of Supercomputing 80, p.363-394
    DOI: 10.1007/s11227-023-05472-0
    Appears in Collections:[資訊工程學系暨研究所] 期刊論文

    Files in This Item:

    File Description SizeFormat
    index.html0KbHTML3View/Open

    All items in 機構典藏 are protected by copyright, with all rights reserved.


    DSpace Software Copyright © 2002-2004  MIT &  Hewlett-Packard  /   Enhanced by   NTU Library & TKU Library IR teams. Copyright ©   - Feedback