English  |  正體中文  |  简体中文  |  Post-Print筆數 : 27 |  Items with full text/Total items : 95905/126495 (76%)
Visitors : 31778748      Online Users : 392
RC Version 6.0 © Powered By DSPACE, MIT. Enhanced by NTU Library IR team.
Scope Tips:
  • please add "double quotation mark" for query phrases to get precise results
  • please goto advance search for comprehansive author search
  • Adv. Search
    HomeLoginUploadHelpAboutAdminister Goto mobile version
    Please use this identifier to cite or link to this item: http://nccur.lib.nccu.edu.tw/handle/140.119/109731

    Title: 充分維度縮減在基因集分析上的運用
    Authors: 薛慧敏
    Contributors: 統計系
    Keywords: 基因集分析;差異共變;充分維度縮減;非線性相關
    Gene set analysis;Differential coexpression;Sufficient dimension reduction;Non-linear associations
    Date: 2016
    Issue Date: 2017-05-17 16:31:06 (UTC+8)
    Abstract: 在基因微陣列(microarray)實驗中,基因集分析(gene-set analysis, GSA)的目的為檢定多個基因所形成的集合與外顯表現變數(phenotype)的相關顯著性。目前已有多個公開資料庫提供基因組相關資訊。例如分子特徵資料庫(MSigDB)中包含數個系列,其中包括彙整其他基因資料庫以及生物醫學相關學術期刊的結果所定義之基因庫。這些基因集合依據基因之生物功能將基因歸類。當外顯表現變數為二元或類別型態時,文獻上已發表的基因集分析方法多數是偵測基因表現量的平均差異。Cook與Weisberg (1991)曾提出的「切片平均變異法」(sliced average variance estimation)來估計基因資料的充分維度縮減(sufficient dimension reduction)中央子空間(central subspace),若基因集與外顯變數無關,則該空間的維度應當為零。所以我們提出以檢定”中央子空間維度為零”的假設以評估該基因集的顯著性。本方法將可掘取及運用資料中更豐富的資訊,而且本方法將適用於類別、量化的外顯變數資料。運用電腦模擬,我們驗證本方法的有效性。
    Gene set analysis (GSA) aims to evaluate the association between the expression of biological pathways, or a priori defined gene sets, and a particular phenotype. Numerous GSA methods have been proposed to assess the enrichment of sets of genes. However, most methods are developed with respect
    to a specific alternative scenario, such as a differential mean pattern or a differential coexpression. Moreover, a very limited number of methods can handle either binary, categorical, or continuous phenotypes. In this paper, we develop two novel GSA tests, called SDRs, based on the sufficient dimension reduction technique, which aims to capture sufficient information about the relationship
    between genes and the phenotype. The advantages of our proposed methods are that they allow for categorical and continuous phenotypes, and they are also able to identify a variety of enriched gene sets. Through simulation studies, we compared the type I error and power of SDRs with existing GSA methods for binary, triple, and continuous phenotypes. We found that SDR methods adequately control
    the type I error rate at the pre-specified nominal level, and they have a satisfactory power to detect gene sets with differential coexpression and to test non-linear associations between gene sets and a continuous phenotype. In addition, the SDR methods were compared with seven widely-used GSA methods using two real microarray datasets for illustration. We concluded that the SDR methods
    outperform the others because of their flexibility with regard to handling different kinds of phenotypes and their power to detect a wide range of alternative scenarios. Our real data analysis highlights the differences between GSA
    methods for detecting enriched gene sets.
    Relation: MOST 104-2118-M-004-002
    Data Type: report
    Appears in Collections:[統計學系] 國科會研究計畫

    Files in This Item:

    File Description SizeFormat
    104-2118-M-004-002.pdf1691KbAdobe PDF267View/Open

    All items in 政大典藏 are protected by copyright, with all rights reserved.

    社群 sharing

    DSpace Software Copyright © 2002-2004  MIT &  Hewlett-Packard  /   Enhanced by   NTU Library IR team Copyright ©   - Feedback