Research on detection of similarity in student programs: with application to detection of plagiarism


Please use this permanent URL to cite or link to this item: https://tkuir.lib.tku.edu.tw/dspace/handle/987654321/34053

Title:	Research on detection of similarity in student programs: with application to detection of plagiarism
Original title:	學生程式碼相似度之研究 : 以抄襲偵測之應用為例
Author:	Lin, Shih-tang (林世唐)
Contributor:	Master's Program, Department of Information Management, Tamkang University; Wei, Shih-chieh (魏世杰)
Keywords:	Program Comparison; Structure Metric; Plagiarism Detection; Inverse Document Frequency
Date:	2005
Uploaded:	2010-01-11 04:50:17 (UTC+8)
Abstract:	Judging similarity between programs is simpler than judging similarity between text documents, because the grammar of a programming language is more regular than that of natural language. Program similarity detection has many applications; in teaching, it is most often used to detect plagiarism. However, in open-book tests or homework where students may consult example programs, many submissions are modified from the same example. Detection then finds a large number of similar code tiles, but many of them are similar only because they derive from the example itself, and they do not indicate plagiarism. In information retrieval, the IDF (Inverse Document Frequency) concept holds that fragments occurring frequently carry less meaning and fragments occurring rarely carry more. Based on this concept, we propose a new IDF-based method that reduces the influence of high-frequency code tiles and treats low-frequency similar tiles as the ones more likely to indicate plagiarism. We compare it with the traditional non-IDF result using datasets from one open-book programming test and one homework assignment.
Appears in collections:	[Department and Graduate Institute of Information Management] Theses
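
The IDF idea in the abstract can be illustrated with a minimal sketch. It is not the thesis's implementation: the record does not describe how code tiles are extracted, so this sketch assumes a "tile" is simply a window of `k` consecutive tokens, and the names `tiles`, `idf_weights`, and `pair_scores` are hypothetical. For each pair of submissions it reports both the raw count of shared tiles (the non-IDF view) and the sum of their IDF weights, so tiles that appear in every submission, such as those copied from a common example, contribute nothing to the weighted score.

```python
import math
import re
from itertools import combinations


def tiles(source, k=4):
    """Return the set of k-token windows in source (an assumed stand-in for code tiles)."""
    tokens = re.findall(r"[A-Za-z_]\w*|\d+|\S", source)
    return {tuple(tokens[i:i + k]) for i in range(len(tokens) - k + 1)}


def idf_weights(tile_sets):
    """Map each tile to log(N / df), where df is the number of submissions containing it."""
    n = len(tile_sets)
    df = {}
    for tile_set in tile_sets:
        for tile in tile_set:
            df[tile] = df.get(tile, 0) + 1
    return {tile: math.log(n / count) for tile, count in df.items()}


def pair_scores(programs, k=4):
    """For each pair of submissions, return (shared tile count, IDF-weighted score)."""
    names = list(programs)
    tile_sets = {name: tiles(programs[name], k) for name in names}
    idf = idf_weights(list(tile_sets.values()))
    scores = {}
    for a, b in combinations(names, 2):
        shared = tile_sets[a] & tile_sets[b]
        scores[(a, b)] = (len(shared), sum(idf[t] for t in shared))
    return scores


if __name__ == "__main__":
    # Toy data: every submission starts from the same example code;
    # only s1 and s2 also share an unusual extra statement.
    example = "int main ( ) { return 0 ; } "
    programs = {
        "s1": example + "x = y * 42 + z ;",
        "s2": example + "x = y * 42 + z ;",
        "s3": example + "print ( a ) ;",
        "s4": example + "while ( b ) b -- ;",
    }
    for pair, (count, weighted) in pair_scores(programs).items():
        print(pair, count, round(weighted, 3))
```

In this toy data every pair shares the tiles that come from the common example, so the raw count alone flags all pairs. The weighted score stays at zero for pairs whose only overlap is the example and is positive only for the pair that also shares the rarer statement.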


