淡江大學機構典藏:Item 987654321/35039
English  |  正體中文  |  简体中文  |  Items with full text/Total items : 62805/95882 (66%)
Visitors : 3986758      Online Users : 500
RC Version 7.0 © Powered By DSPACE, MIT. Enhanced by NTU Library & TKU Library IR team.
Scope Tips:
  • please add "double quotation mark" for query phrases to get precise results
  • please goto advance search for comprehansive author search
  • Adv. Search
    HomeLoginUploadHelpAboutAdminister Goto mobile version
    Please use this identifier to cite or link to this item: https://tkuir.lib.tku.edu.tw/dspace/handle/987654321/35039


    Title: 影片中的文字擷取
    Other Titles: Text extraction on video
    Authors: 陳韻茹;Chen, Yun-ju
    Contributors: 淡江大學資訊工程學系碩士班
    顏淑惠;Yen, Shew-huey
    Keywords: 影片文字偵測;影片文字擷取;黑白穿透量;參考畫格;代表畫格;文字遮罩;video text detection;video text extraction;BWTC (black-white transition count);reference frame;corresponding frame;text mask
    Date: 2007
    Issue Date: 2010-01-11 05:56:24 (UTC+8)
    Abstract: 為了能夠有效管理眾多的影片檔案,本文發展一方法來擷取在影片中具有代表意義的文字。首先,對整體影片進行文字偵測,也就是說,每隔x個畫格從影片的開始至結束都檢查一次。偵測流程當中,不但要偵測該回合有無文字存在,並且要比對畫格間的文字重疊性及文字相似度。對於每個文字區段會紀錄其起始畫格、結束畫格、參考畫格、及代表畫格,並且標示該文字區域位在畫格之所在地。為了讓影片文字偵測結果更為準確,更進一步地進行文字區段間的合併,使得影片最終的文字偵測而得的文字區段段數達到最小值,以期與實際文字區段段數相符。
    文字偵測之後,首先利用型態影像學中的測量學擴張將背景資訊移除。接著應用簡單的長條圖等化法增強影像的對比。然後執行文字擷取以備將來文字辨識之用。
    With the rapid growth of digital technology, videos now play an important role in our life. Due to huge amount of video data, it needs efficient means to access and retrieve them. Text in videos is a powerful source to help us to understand the content of the videos. To achieve this task, we propose a method to extract text in videos. The text detection is achieved by overall video text detection and video clips mergence for same texts. Firstly, at each round,text regionsare roughly labeled by applying Canny edge detecting algorithm to 7 consecutive frames and taking the result of intersection of edge pixels. To determine whether there are the same texts on two frames, the comparison of region overlap and black-white transition count (BWTC) are used. For each text t, the video clip with start/end frame, reference frames, and corresponding frame will be recorded. The mergence of video clips occurs if two consecutive clips have the same text. Text mask Mt is constructed via reference frames of the text t. Text regions are thus refined using text masks. Before text extraction, the similarity of refined text regions is again compared for possible mergence of video clips.
    To accomplish the text extraction,three steps-background removal, contrast enhancement, and binarizaiton-are applied to the correspondence frame of the text. Background is removed by morphological reconstruction. In order to get better binary results, it will be enhanced by multi-stage histogram equalization. Finally, binarization is performed by moving average algorithm.
    Experimental results show that the effectiveness of the proposed method.
    Appears in Collections:[Graduate Institute & Department of Computer Science and Information Engineering] Thesis

    Files in This Item:

    File SizeFormat
    0KbUnknown270View/Open

    All items in 機構典藏 are protected by copyright, with all rights reserved.


    DSpace Software Copyright © 2002-2004  MIT &  Hewlett-Packard  /   Enhanced by   NTU Library & TKU Library IR teams. Copyright ©   - Feedback