政大機構典藏-National Chengchi University Institutional Repository(NCCUR):Item 140.119/110783
English  |  正體中文  |  简体中文  |  Post-Print筆數 : 27 |  Items with full text/Total items : 110118/141061 (78%)
Visitors : 46517296      Online Users : 403
RC Version 6.0 © Powered By DSPACE, MIT. Enhanced by NTU Library IR team.
Scope Tips:
  • please add "double quotation mark" for query phrases to get precise results
  • please goto advance search for comprehansive author search
  • Adv. Search
    HomeLoginUploadHelpAboutAdminister Goto mobile version
    Please use this identifier to cite or link to this item: https://nccur.lib.nccu.edu.tw/handle/140.119/110783


    Title: Hi-C實驗資料正規化
    Hi-C data normalization
    Authors: 魏孝全
    Contributors: 薛慧敏
    魏孝全
    Keywords: 染色體捕捉技術
    Hi-C實驗資料
    正規化
    基因特徵偏差
    Chromosome conformation capture
    Hi-C data
    Normalization
    Genome feature
    Date: 2017
    Issue Date: 2017-07-11 11:26:01 (UTC+8)
    Abstract: 本研究探討高通量染色體捕捉技術 (high-throughput chromosome conformation capture, Hi-C) 實驗所產生的關聯矩陣資料之正規化方法。已知該類實驗主要用來測量染色體之間的空間距離,正規化的目的是移除資料中的系統性偏差,本文主要針對基因特徵所造成之偏差。有別於Hu等人 (2012) 所提出的「局部基因特徵正規化法」(local genome feature normalization, LGF法),我們所提出的「二次函數正規化法」(quadratic function normalization, QF法) 建立在更為一般化的二次對數模型與負二項分配假設上。本研究透過模擬實驗以及人類淋巴細胞資料 (GSE18199) 來評估QF法的表現,並且與其他方法比較。在模擬實驗中,我們發現當模型正確時,QF法能有效消除偏差。在實例中,當基因特徵偏差被消除後,則染色體之間的相對距離在重複實驗資料之間有更為一致的結果。另一方面,我們發現實驗所採用的限制酶影響關聯矩陣的結果,而且運用這些正規化方法並不能有效消除限制酶造成的偏差。
    Recently, the high-throughput chromosome conformation capture (Hi-C) experiment is developed to explore the three-dimensional structure of genomics. To assess the chromosomal interaction, a contact matrix is produced from a Hi-C experiment. Very often, systematic technical biases appear in the contact matrix and lead to inadequate conclusions. Consequently, data normalization to remove these biases is essential and necessary prior advanced inference. In this research, we propose the so-called quadratic function normalization method, which is a modification of the local genome feature normalization (Hu et al., 2012) by considering a more general model. Simulation studies are conducted to evaluate the proposed method. When the model assumption holds, the proposed method has adequate performance. Further, a Hi-C data set of a human lymphoblastoid cell GSE18199 is employed for a comparison of our method and two existing methods. It’s observed that normalization improves the reproducibility between experimental replicates. However, the effect of normalization is lean in eliminating the bias of restriction enzymes.
    Reference: 參考資料
    Agard DA, Hiraoka Y, Shaw P, Sedat JW, (1989).Fluorescence microscopy in three dimensions, Methods Cell Biol., 30, 353-377.
    Dekker J, Rippe K, Dekker M, Kleckner N, (2002).Capturing chromosome conformation, Science, 295, 1306-1311.
    Dostie J, Richmond TA, Arnaout RA, Selzer RR, Lee WL, Honan TA, Rubio ED, Krumm A, Lamb J, Nusbaum C, Green RD, Dekker J, (2006).Chromosome Conformation Capture Carbon Copy (5C): A massively parallel solution for mapping interactions between genomic elements, Genome Res., 16, 1299-1309.
    Dudchenko O, Batra SS, Omer AD, Nyquist SK, Hoeger M, Durand NC, Shamim MS, Machol I, Lander ES, Aiden AP, Aiden EL, (2017).De novo assembly of the Aedes aegypti genome using Hi-C yields chromosome-length scaffolds, Science, 356, 92-95.
    Gerstein MB, Kundaje A, Hariharan M, Landt SG, Yan KK, Cheng C, Mu XJ, Khurana E, Rozowsky J, Alexander R, Min R, Alves P, Abyzov A, Addleman N, Bhardwaj N, Boyle AP, Cayting P, Charos A, Chen DZ, Cheng Y, Clarke D, Eastman C, Euskirchen G, Frietze S, Fu Y, Gertz J, Grubert F, Harmanci A, Jain P, Kasowski M, Lacroute P, Leng J, Lian J, Monahan H, O`Geen H, Ouyang Z, Partridge EC, Patacsil D, Pauli F, Raha D, Ramirez L, Reddy TE, Reed B, Shi M, Slifer T, Wang J, Wu L, Yang X, Yip KY, Zilberman-Schapira G, Batzoglou S, Sidow A, Farnham PJ, Myers RM, Weissman SM, Snyder M, (2012).Architecture of the human regulatory network derived from ENCODE data, Nature, 489, 91-100.
    Hu M, Deng K, Selvaraj S, Qin Z, Ren B, Liu JS, (2012).HiCNorm: removing biases
    in Hi-C data via Poisson regression, Bioinformatics, 28, 3131-3133.
    Imakaev M, Fudenberg G, McCord RP, Naumova N, Goloborodko A, Lajoie BR, Dekker J, Mirny LA, (2012).Iterative correction of Hi-C data reveals hallmarks of chromosome organization, Nature Methods, 9, 999-1003.
    Li H, Ruan J, Durbin R, (2008).Mapping short DNA sequencing reads and calling variants using mapping quality scores, Genome Res., 18, 1851-1858.
    Lieberman-Aiden E, van Berkum NL, Williams L, Imakaev M, Ragoczy T, Telling A, Amit I, Lajoie BR, Sabo PJ, Dorschner MO, Sandstrom R, Bernstein B, Bender MA, Groudine M, Gnirke A, Stamatoyannopoulos J, Mirny LA, Lander ES, Dekker J, (2009).Comprehensive mapping of long range interactions reveals folding principles of the human genome, Science, 326, 289-293.
    Lupiáñez DG, Kraft K, Heinrich V, Krawitz P, Brancati F, Klopocki E, Horn D, Kayserili H, Opitz JM, Laxova R, Santos-Simarro F, Gilbert-Dussardier B, Wittler L, Borschiwer M, Haas SA, Osterwalder M, Franke M, Timmermann B, Hecht J, Spielmann M, Visel A, Mundlos S, (2015).Disruptions of topological chromatin domains cause pathogenic rewiring of gene-enhancer interactions, Cell, 161, 1012-1025.
    Simonis M, Klous P, Splinter E, Moshkin Y, Willemsen R, de Wit E, van Steensel B, de Laat W, (2006).Nuclear organization of active and inactive chromatin domains uncovered by chromosome conformation capture–on-chip (4C), Nature Genetics, 38, 1348-1354.
    Sexton T, Cavalli G, (2015). The role of chromosome domains in shaping the functional genome, Cell, 160, 1049–1059.
    Yaffe E, Tanay A, (2011).Probabilistic modeling of Hi-C contact maps eliminates systematic biases to characterize global chromosomal architecture, Nature Genetics, 43, 1059-1065.
    Description: 碩士
    國立政治大學
    統計學系
    104354025
    Source URI: http://thesis.lib.nccu.edu.tw/record/#G0104354025
    Data Type: thesis
    Appears in Collections:[Department of Statistics] Theses

    Files in This Item:

    File SizeFormat
    402501.pdf1210KbAdobe PDF2259View/Open


    All items in 政大典藏 are protected by copyright, with all rights reserved.


    社群 sharing

    著作權政策宣告 Copyright Announcement
    1.本網站之數位內容為國立政治大學所收錄之機構典藏,無償提供學術研究與公眾教育等公益性使用,惟仍請適度,合理使用本網站之內容,以尊重著作權人之權益。商業上之利用,則請先取得著作權人之授權。
    The digital content of this website is part of National Chengchi University Institutional Repository. It provides free access to academic research and public education for non-commercial use. Please utilize it in a proper and reasonable manner and respect the rights of copyright owners. For commercial use, please obtain authorization from the copyright owner in advance.

    2.本網站之製作,已盡力防止侵害著作權人之權益,如仍發現本網站之數位內容有侵害著作權人權益情事者,請權利人通知本網站維護人員(nccur@nccu.edu.tw),維護人員將立即採取移除該數位著作等補救措施。
    NCCU Institutional Repository is made to protect the interests of copyright owners. If you believe that any material on the website infringes copyright, please contact our staff(nccur@nccu.edu.tw). We will remove the work from the repository and investigate your claim.
    DSpace Software Copyright © 2002-2004  MIT &  Hewlett-Packard  /   Enhanced by   NTU Library IR team Copyright ©   - Feedback