Tamkang University Institutional Repository


Please use this permanent URL to cite or link to this item: https://tkuir.lib.tku.edu.tw/dspace/handle/987654321/117009

Title:	Constructing endophenotypes of complex disease using non-negative matrix factorization and adjusted rand index
Authors:	Wang HM, Hsiao CL, Hsieh Ai-Ru, Chang SW, Fann Cathy SJ
Date:	2012-07-16
Upload time:	2019-09-17 12:11:33 (UTC+8)
Publisher:	Wang et al
Abstract:	Complex diseases are typically caused by combinations of molecular disturbances that vary widely among patients. Endophenotypes, combinations of genetic factors associated with a disease, offer a simplified approach to dissecting complex traits by reducing genetic heterogeneity. Because molecular dissimilarities often exist between patients with indistinguishable disease symptoms, these unique molecular features may reflect pathogenic heterogeneity. To detect molecular dissimilarities among patients and reduce the complexity of high-dimensional data, we explored an endophenotype-identification analytical procedure that combines non-negative matrix factorization (NMF) and the adjusted Rand index (ARI), a measure of the similarity of two clusterings of a data set. To evaluate this procedure, we compared it with a commonly used method, principal component analysis with k-means clustering (PCA-K). A simulation study with a gene expression dataset and genotype information was conducted to examine the performance of our procedure and PCA-K. The results showed that NMF mostly outperformed PCA-K. Additionally, we applied our endophenotype-identification analytical procedure to a publicly available dataset containing data derived from patients with late-onset Alzheimer’s disease (LOAD). NMF distilled information associated with 1,116 transcripts into three metagenes and three molecular subtypes (MS) for patients in the LOAD dataset: MS1 (), MS2 (), and MS3 (). ARI was then used to determine the most representative transcripts for each metagene; 123, 89, and 71 metagene-specific transcripts were identified for MS1, MS2, and MS3, respectively. These metagene-specific transcripts were identified as the endophenotypes. Our results showed that 14, 38, 0, and 28 candidate susceptibility genes listed in the AlzGene database were found for all patients, MS1, MS2, and MS3, respectively. Moreover, we found that MS2 might be a normal-like subtype.
Our proposed procedure provides an alternative approach for investigating the pathogenic mechanism of disease and for better understanding the relationship between phenotype and genotype.
Relation:	PLoS ONE 7(7), p.e40996
DOI:	10.1371/journal.pone.0040996
Appears in collections:	[Department and Graduate Institute of Statistics] Journal Articles
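
The abstract describes the adjusted Rand index (ARI) as a measure of the similarity of two clusterings of a data set. As a rough illustration of that measure only, the sketch below computes the standard ARI from two label lists using the Python standard library. The function name `adjusted_rand_index` and the example labels are hypothetical, and the sketch does not reproduce how the authors used ARI to select metagene-specific transcripts, which this record does not specify.

```python
from collections import Counter
from math import comb


def adjusted_rand_index(labels_a, labels_b):
    """Standard adjusted Rand index between two clusterings of the same items.

    Hypothetical helper for illustration; not taken from the paper.
    """
    if len(labels_a) != len(labels_b):
        raise ValueError("both labelings must cover the same items")
    n = len(labels_a)
    if n < 2:
        # With fewer than two items there are no pairs to disagree on.
        return 1.0

    # Contingency counts: items in cluster i of A that are also in cluster j of B.
    cell_counts = Counter(zip(labels_a, labels_b))
    a_counts = Counter(labels_a)
    b_counts = Counter(labels_b)

    # Number of item pairs that are co-clustered in both, in A, and in B.
    sum_cells = sum(comb(c, 2) for c in cell_counts.values())
    sum_a = sum(comb(c, 2) for c in a_counts.values())
    sum_b = sum(comb(c, 2) for c in b_counts.values())
    total_pairs = comb(n, 2)

    # Expected agreement under random labelings with the same cluster sizes.
    expected = sum_a * sum_b / total_pairs
    max_index = (sum_a + sum_b) / 2
    if max_index == expected:
        # Degenerate case, e.g. a single cluster in both labelings.
        return 1.0
    return (sum_cells - expected) / (max_index - expected)


if __name__ == "__main__":
    # Made-up subtype labels for six patients from two hypothetical clusterings.
    clustering_1 = ["MS1", "MS1", "MS2", "MS2", "MS3", "MS3"]
    clustering_2 = ["A", "A", "B", "B", "C", "B"]
    print(adjusted_rand_index(clustering_1, clustering_2))
```

The index is 1 when the two clusterings group the items identically (cluster names do not matter), is close to 0 when agreement is at chance level, and can be negative when agreement is worse than chance.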

Files in this item:

File	Description	Size	Format	Views
Constructing endophenotypes of complex disease using non-negative matrix factorization and adjusted rand index.PDF		628Kb	Adobe PDF	1
index.html		0Kb	HTML	80

All items in the institutional repository are protected by their original copyright.
