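    Title: 影片中的文字擷取
    Alternative title: Text extraction on video
    Author: 陳韻茹; Chen, Yun-ju
    Contributors: Tamkang University, Department of Computer Science and Information Engineering, Master's Program
    顏淑惠; Yen, Shew-huey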
    Keywords: video text detection; video text extraction; BWTC (black-white transition count); reference frame; corresponding frame; text mask
    Date: 2007
    Upload time: 2010-01-11 05:56:24 (UTC+8)
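    Abstract: To manage large numbers of video files effectively, this thesis develops a method for extracting meaningful text from videos. First, text detection is performed over the whole video: a detection round is run every x frames, from the beginning of the video to the end. Each round not only detects whether text is present but also compares text overlap and text similarity between frames. For each text clip, the start frame, end frame, reference frames, and corresponding frame are recorded, and the location of the text region within the frame is marked. To make the detection result more accurate, text clips are then merged so that the final number of detected text clips is minimized, with the aim of matching the actual number of text clips.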
    With the rapid growth of digital technology, videos now play an important role in daily life. Because of the huge amount of video data, efficient means of accessing and retrieving it are needed. Text in videos is a powerful cue for understanding their content, so we propose a method to extract text from videos. Text detection consists of detecting text over the whole video and merging video clips that contain the same text. First, in each round, text regions are roughly labeled by applying the Canny edge detection algorithm to 7 consecutive frames and taking the intersection of the resulting edge pixels. To determine whether two frames contain the same text, region overlap and the black-white transition count (BWTC) are compared. For each text t, the video clip is recorded with its start and end frames, reference frames, and corresponding frame. Two consecutive clips are merged if they contain the same text. A text mask Mt is constructed from the reference frames of the text t, and the text regions are refined using these text masks. Before text extraction, the similarity of the refined text regions is compared again for possible further merging of video clips.
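The detection-round comparison described above can be sketched roughly as follows. This is a minimal illustration, not the thesis's implementation. It assumes the edge maps are already available as 0/1 grids (for example, from a Canny detector). It also assumes BWTC is counted along rows of a binarized region. The function names, the overlap definition, and the thresholds `min_overlap` and `max_bwtc_diff` are all hypothetical.

```python
from typing import List, Tuple

Binary = List[List[int]]         # 2-D grid of 0/1 pixels
Box = Tuple[int, int, int, int]  # (left, top, right, bottom), right/bottom exclusive


def intersect_edge_maps(edge_maps: List[Binary]) -> Binary:
    """Keep only the pixels marked as edges in every frame of the round."""
    rows, cols = len(edge_maps[0]), len(edge_maps[0][0])
    return [
        [int(all(m[r][c] for m in edge_maps)) for c in range(cols)]
        for r in range(rows)
    ]


def bwtc(region: Binary) -> int:
    """Count 0<->1 changes between horizontally adjacent pixels (assumed definition)."""
    return sum(1 for row in region for a, b in zip(row, row[1:]) if a != b)


def box_area(box: Box) -> int:
    return (box[2] - box[0]) * (box[3] - box[1])


def overlap_ratio(a: Box, b: Box) -> float:
    """Intersection area divided by the smaller box's area (assumed definition)."""
    w = min(a[2], b[2]) - max(a[0], b[0])
    h = min(a[3], b[3]) - max(a[1], b[1])
    if w <= 0 or h <= 0:
        return 0.0
    return (w * h) / min(box_area(a), box_area(b))


def same_text(box_a: Box, bwtc_a: int, box_b: Box, bwtc_b: int,
              min_overlap: float = 0.8, max_bwtc_diff: float = 0.2) -> bool:
    """Treat two regions as the same text if they overlap and have similar BWTC.

    Both thresholds are illustrative placeholders, not values from the thesis.
    """
    if overlap_ratio(box_a, box_b) < min_overlap:
        return False
    larger = max(bwtc_a, bwtc_b)
    if larger == 0:
        return True  # neither region has any transitions to compare
    return abs(bwtc_a - bwtc_b) / larger <= max_bwtc_diff
```

Intersecting the edge maps of several consecutive frames favors static overlay text, since moving background edges rarely stay at the same pixel across frames.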
    To accomplish the text extraction, three steps are applied to the corresponding frame of the text: background removal, contrast enhancement, and binarization. The background is removed by morphological reconstruction. To obtain better binarization results, the contrast is enhanced by multi-stage histogram equalization. Finally, binarization is performed by a moving average algorithm.
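The final binarization step can be illustrated with a common form of moving-average thresholding. The abstract does not specify the exact variant, so this sketch makes several assumptions: a zigzag scan over the rows, a window of the last `n` scanned pixels, and a scaling factor `b`. A pixel is set to 1 when it exceeds `b` times the running mean. The function name and default values are hypothetical.

```python
from collections import deque
from typing import Deque, List


def moving_average_binarize(gray: List[List[int]], n: int = 20,
                            b: float = 0.5) -> List[List[int]]:
    """Binarize a grayscale image with a moving-average threshold.

    Pixels are visited in a zigzag order (left to right on even rows,
    right to left on odd rows). Each pixel is compared against b times
    the mean of the last n visited pixels, itself included.
    """
    out = [[0] * len(row) for row in gray]
    window: Deque[int] = deque()
    total = 0
    for r, row in enumerate(gray):
        cols = range(len(row))
        if r % 2 == 1:
            cols = reversed(cols)  # zigzag keeps the window spatially local
        for c in cols:
            value = row[c]
            window.append(value)
            total += value
            if len(window) > n:
                total -= window.popleft()
            mean = total / len(window)
            out[r][c] = 1 if value > b * mean else 0
    return out
```

Because the threshold follows the local mean, this kind of binarization tolerates gradual brightness changes across the text region better than a single global threshold. The polarity (bright or dark text) and the choice of `n` and `b` would need to be tuned to the data.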
    Experimental results demonstrate the effectiveness of the proposed method.
