淡江大學機構典藏:Item 987654321/52402
English  |  正體中文  |  简体中文  |  全文笔数/总笔数 : 62797/95867 (66%)
造访人次 : 3739444      在线人数 : 444
RC Version 7.0 © Powered By DSPACE, MIT. Enhanced by NTU Library & TKU Library IR team.
搜寻范围 查询小技巧:
  • 您可在西文检索词汇前后加上"双引号",以获取较精准的检索结果
  • 若欲以作者姓名搜寻,建议至进阶搜寻限定作者字段,可获得较完整数据
  • 进阶搜寻


    jsp.display-item.identifier=請使用永久網址來引用或連結此文件: https://tkuir.lib.tku.edu.tw/dspace/handle/987654321/52402


    题名: 可讀性分析在特徵選擇上作探討與研究
    其它题名: On the study of feature selection for readability analysis
    作者: 雷珵麟;Lei, Chin-lin
    贡献者: 淡江大學資訊工程學系碩士班
    郭經華
    关键词: 可讀性分析;特徵選取;詞性標記;線性內插法;Readability Analysis;Features Selection;POS tagging;Linear interpolation
    日期: 2010
    上传时间: 2010-09-23 17:36:16 (UTC+8)
    摘要: 隨者網路的發達,我們可以在各大英文教學網站取得學習的閱讀資料,這些閱讀資料的難易度都是由該網站的負責人所定義的,對於其他沒有定義的資料就不知它的閱讀難度,所以我們要找出一個可以區分文章難易程度的方法,讓學習者找出適合自己的閱讀教材。
    本論文為英語為第二外語的使用者找出一些可以分析英語文章難易度的特徵,如句子長度、單字、文法等因素,並把這些特徵結合用以分析兩篇閱讀文章彼此之間的難易度差距,方便讓學習者找出適合自己程度的閱讀資料,以達到學習的效果。本研究和以往相關文獻的差異在於多了文法分析,以往的研究大部分都在於用單字來分辨難度,在此我們利用POS Tagger來分析每句句子的詞性,接著再對句子作切割以找出可以辨別文章可讀性難易度差異的文法因素,最後在和上述的特徵作合併以計算兩篇文章的難度差距。
    As the capability of the internet is growing, we can get reading resources for learning English from popular websites. The readability of these reading resources was confirmed by the person in charge of these websites, but for other resources not confirmed is unknown. Thus we need a method to obtain the readability of these remaining resources to make learners to find their own reading materials.
    Main motivation of this thesis presents features of analyzing difficulty by length of sentences, vocabulary, and grammar for ESL learner. Interpret texts with these features leads to an appropriate differentiation for learner in finding the proper text and generate to the efficiency result. The difference between our research and related works is that our research applies grammar analysis; those works mostly recognize readability with single terms. First we tag the POS of each word with POS tagger, and we cut these sentences to find out the factors which support us in recognizing the readability in grammar, and then combine the features above to calculate the difference of difficulty.
    显示于类别:[資訊工程學系暨研究所] 學位論文

    文件中的档案:

    档案 大小格式浏览次数
    index.html0KbHTML219检视/开启

    在機構典藏中所有的数据项都受到原著作权保护.

    TAIR相关文章

    DSpace Software Copyright © 2002-2004  MIT &  Hewlett-Packard  /   Enhanced by   NTU Library & TKU Library IR teams. Copyright ©   - 回馈