GeSICA:Genome Segmentation from Intra-Chromosomal Associations
【摘要】：正Background:Various aspects of genome organization have been explored based on data from distinct technologies,including histone modification ChIP-Seq,3C,and its derivatives. Recently developed Hi-C techniques enable the genome-wide mapping of DNA interactomes, thereby providing the opportunity to study genome organization in detail,but these methods also pose challenges in methodology development. Methods:We propose a two-step strategy in this study.In the first step we categorize the entire human genome into two different states.In the second step we further segregated the regions with some state into more detailed clusters by employing a Markov Clustering algorithm Results:We developed Genome Segmentation from Intra-Chromosomal Associations, or GeSICA,to explore genome organization and applied the method to Hi-C data in human GM06990 and K562 cells.GeSICA calculates a simple logged ratio to efficiently segment the human genome into regions with two distinct states that correspond to rich and poor functional element states.Inside the rich regions,Markov Clustering was subsequently applied to segregate the regions into more detailed clusters.The binding sites of the insulator, cohesion,and transcription complexes are enriched in the boundaries between neighboring clusters,indicating that inferred clusters may have fine organizational features. Conclusions:Our study presents a novel analysis method,known as GeSICA,which gives insight into genome organization based on Hi-C data.GeSICA is open-source and freely available at:http://web.tongji.edu.cn/～zhanglab/GeSICA/.