English  |  正體中文  |  简体中文  |  全文笔数/总笔数 : 57325/90921 (63%)
造访人次 : 13033813      在线人数 : 273
RC Version 7.0 © Powered By DSPACE, MIT. Enhanced by NTU Library & TKU Library IR team.
搜寻范围 查询小技巧:
  • 您可在西文检索词汇前后加上"双引号",以获取较精准的检索结果
  • 若欲以作者姓名搜寻,建议至进阶搜寻限定作者字段,可获得较完整数据
  • 进阶搜寻

    jsp.display-item.identifier=請使用永久網址來引用或連結此文件: http://tkuir.lib.tku.edu.tw:8080/dspace/handle/987654321/92967

    题名: A machine-learning approach for analyzing document layout structures with two reading orders
    作者: Wu, Chung-Chih;Chou, Chien-Hsing;Chang, Fu
    贡献者: 淡江大學電機工程學系
    关键词: Binary decision;Document layout analysis;Reading order;Support vector machine;Taboo box;Textline;Text region
    日期: 2008-10
    上传时间: 2013-10-29 15:30:49 (UTC+8)
    出版者: Kidlington: Pergamon
    摘要: The purpose of document layout analysis is to locate textlines and text regions in document images mostly via a series of split-or-merge operations. Before applying such an operation, however, it is necessary to examine the context to decide whether the place chosen for the operation is appropriate. We thus view document layout analysis as a matter of solving a series of binary decision problems, such as whether to apply, or not to apply, a split-or-merge operation to a chosen place. To solve these problems, we use support vector machines to learn whether or not to apply the previously mentioned operations from training documents in which all textlines and text regions have been located and their identifies labeled. The proposed approach is very effective for analyzing documents that allow both horizontal and vertical reading orders. When applied to a test data set composed of eight types of layout structure, the approach's accuracy rates for identifying textlines and text regions are 98.83% and 96.72%, respectively.
    關聯: Pattern Recognition 41(10), pp.3200-3213
    DOI: 10.1016/j.patcog.2008.03.014
    显示于类别:[電機工程學系暨研究所] 期刊論文


    档案 描述 大小格式浏览次数



    DSpace Software Copyright © 2002-2004  MIT &  Hewlett-Packard  /   Enhanced by   NTU Library & TKU Library IR teams. Copyright ©   - 回馈