Tamkang University Institutional Repository: Item 987654321/119473
    Please use this permanent URL to cite or link to this item: https://tkuir.lib.tku.edu.tw/dspace/handle/987654321/119473


    Title: 巨量資料之矩陣視覺化
    Other title: Matrix Visualization for Big Data
    Author: 高君豪
    Keywords: Matrix Visualization; Big Data; Exploratory Data Analysis; Symbolic Data Analysis; Generalized Association Plots
    Date: 2018-06-20
    Uploaded: 2020-10-29 12:11:05 (UTC+8)
    Abstract: Innovations in biomedical and industrial techniques, together with the continued development of computer technology, have dramatically changed how data are generated and collected. Data scale tends to grow exponentially while data quality becomes less reliable. Statistical methods for validating and analyzing big data, along with the associated computational techniques, have become important research topics. Visualization and exploratory data analysis (EDA) will play essential roles in deep analytics for big data, yet some problems remain to be solved and some techniques remain to be developed. Most current big data visualization methods focus on dynamic network drawing based on node-link diagrams. They rely mainly on 2D and 3D scatterplots, which do not consume much memory, computing power, or display space; their drawback is the limited number of variables that can be visualized. This work first aims to resolve two potential difficulties in applying matrix visualization (MV) to continuous big data: (1) computation and permutation of proximity matrices; (2) display of big data. We will integrate the strengths of GAP (generalized association plots) and SDA (symbolic data analysis) with Hadoop/Spark computing facilities to address these computation and display problems and to create an environment for matrix visualization of continuous big data. We apply the proposed MV techniques for big data to the 2000 Longitudinal Health Insurance Database (LHID2000) of the National Health Insurance Research Database (NHIRD), published by the National Health Research Institutes (NHRI) in Taiwan. We will then expand the environment from continuous data to binary, categorical, cartographic, and other types of big data, and we expect to face even more challenging difficulties in developing the related techniques.
    Appears in collections: [Department and Graduate Institute of Statistics] Monographs

    Files in this item:

    File                      Description  Size     Format     Views
    index.html                             0Kb      HTML       69
    巨量資料之矩陣視覺化.pdf               35180Kb  Adobe PDF  2

    All items in the institutional repository are protected by original copyright.
