A machine-learning approach for analyzing document layout structures with two reading orders

doi:10.1016/j.patcog.2008.03.014

淡江大學機構典藏 > 工學院 > 電機工程學系暨研究所 > 期刊論文 > Item 987654321/92967

請使用永久網址來引用或連結此文件: https://tkuir.lib.tku.edu.tw/dspace/handle/987654321/92967

題名:	A machine-learning approach for analyzing document layout structures with two reading orders
作者:	Wu, Chung-Chih;Chou, Chien-Hsing;Chang, Fu
貢獻者:	淡江大學電機工程學系
關鍵詞:	Binary decision;Document layout analysis;Reading order;Support vector machine;Taboo box;Textline;Text region
日期:	2008-10
上傳時間:	2013-10-29 15:30:49 (UTC+8)
出版者:	Kidlington: Pergamon
摘要:	The purpose of document layout analysis is to locate textlines and text regions in document images mostly via a series of split-or-merge operations. Before applying such an operation, however, it is necessary to examine the context to decide whether the place chosen for the operation is appropriate. We thus view document layout analysis as a matter of solving a series of binary decision problems, such as whether to apply, or not to apply, a split-or-merge operation to a chosen place. To solve these problems, we use support vector machines to learn whether or not to apply the previously mentioned operations from training documents in which all textlines and text regions have been located and their identifies labeled. The proposed approach is very effective for analyzing documents that allow both horizontal and vertical reading orders. When applied to a test data set composed of eight types of layout structure, the approach's accuracy rates for identifying textlines and text regions are 98.83% and 96.72%, respectively.
關聯:	Pattern Recognition 41(10), pp.3200-3213
DOI:	10.1016/j.patcog.2008.03.014
顯示於類別:	[電機工程學系暨研究所] 期刊論文

文件中的檔案:

檔案	描述	大小	格式	瀏覽次數
index.html		0Kb	HTML	346	檢視/開啟

在機構典藏中所有的資料項目都受到原著作權保護.

TAIR相關文章

資料載入中.....