Survey
* Your assessment is very important for improving the workof artificial intelligence, which forms the content of this project
* Your assessment is very important for improving the workof artificial intelligence, which forms the content of this project
Introduction to Epigenetics/Epigenomics Xiangqin Cui Department of Biostatistics University of Alabama at Birmingham Workshop on Statistical Genetics and Genomics, Southern Regional Meeting in New Orleans, Feb 12, 2009 Epigenetics • Epigenetics refers to the study of changes in the regulation of gene activity and expression that are not dependent on gene DNA sequence. • While epigenetics often refers to the study of single genes or sets of genes, epigenomics refers to more global analyses of epigenetic changes across the entire genome. Epigenetics Mechanisms RNA Interference Gene Expression Histone Modifications DNA Methylation DNA Methylation Hypomethylation Hypermethylation http://www.cellscience.com/reviews7/Taylor1.jpg Natural Roles of DNA Methylation in Mammalian System Imprinting X chromosome inactivation Heterochromatin maintenance Developmental controls Tissue specific expression controls DNA Methylation and Cancer Robertson, Nature Reviews Genetics, Vol6, 597 DNA Methylation and Other Human Diseases -- Imprinting Disorder: • Beckwith-Wiedemann syndrom (BWS) • Prader-Willi syndrome (PWS) • Transient neonatal diabetes mellitus (TNDM) -- Repeat-instability diseases • Fragile X syndrome (FRAXA) • Facioscapulohumeral muscular dystroph -- Defects of the methylation machinery • Systemic lupus erythemtosus (SLE) • Immunodeficiency, centromeric instability and facial anomalies (ICF) syndrome Histone Modifications http://porpax.bio.miami.edu/~cmallery/150/gene/c7.19.4.histone.mod.jpg Histone Modifications http://www.nature.com/nsmb/journal/v14/n11/images/nsmb1337-F1.gif Li e. al. (2007) Cell 128, 707 Histone Modifications in Relation to Gene Transcription Li e. al. (2007) Cell 128, 707 Histone Modifications and Human Diseases Coffin-Lowry syndrome is a rare genetic disorder characterized by mental retardation and abnormalities of the head and facial and other areas. It is caused by mutations in the RSK2 gene (histone phosphorylation) and is inherited as an Xlinked dominant genetic trait. Males are usually more severely affected than females. Rubinstein-Taybi syndrome is characterized by short stature, moderate to severe intellectual disability, distinctive facial features, and broad thumbs and first toes. It is caused by mutations in CREB-binding protein (histone acetylation) RNA Interference (RNAi) http://www.nature.com/ncpneuro/journal/v3/n7/images/ncpneuro0551-f1.jpg siRNA Mediated Heterochromatin Maintenance Technologies for Studying Epigenetics/Epigenomics DNA Methylation Microarray or deep sequencing Irizarry et. Al. (2008) Genome Research 18(5):780 Bisulfite Sequencing Bisulfite Sequencing Global Interrogation of DNA Methylation using Microarrays 5’ promoter Expression array Exon array Splicing array Promoter array Tiling array exon intron 3’ Methylated/Unmethylated DNA Enrichment Methods Restriction Enzyme Based Enrichment Methods Zilberman and henikoff (2007) Development 134, 3959 Methylated/Unmethylated DNA Enrichment Methods Immunoprecipation Based Method Zilberman and henikoff (2007) Development 134, 3959 Hybridization to microarrays Genomic DNA Enrichment for methylated DNA Labeling Labeling Zilberman and henikoff (2007) Development 134, 3959 Technologies for Interrogating Epigenetics/Epigenomics ChIP-chip http://www.nature.com/jid/journal/v125/n2/extref/5603467x1.jpg Technologies for Interrogating Epigenetics/Epigenomics Histone Modifications ChIP-chip Antibody specific to one type of histone modification ChIP-seq Deep sequencing http://www.nature.com/jid/journal/v125/n2/extref/5603467x1.jpg Epigenomics Microarray Data Analyses Major difference between data generated in epigenomics study and expression study is at the utility of the information of probe location in the genome. The adjacent probes are often correlated. Data Analysis steps: Data normalization Statistical Modeling • Modeling • Peak Finding Results Visualization along the genome Normalization • The loess assumption is sometimes violated in ChIP-chip data Normalization Cont. • Quantile Normalization • Variance Stabilizing Normalization • Probe Model Based Normalization Nomalization Based on Probe Sequences Model the probe behavior based on probe sequence for Affymetrix arrays Log(PM) = nucleotide + NucleotideCount2 + log(probeCopy#) + ε Model the probe behavior based on probe GC content for long oligo arrays For each group of probes with a give GC content, the background follows a normal distribution (µ,σ2). Johnson et al (2006) PNAS 103, 12457; Song et al (2007)Genome Biology.8:R178 Other Special Properties GC content affects antibody binding to methylated DNA--- need for normalization against GC content. Pelizzola et al (2008) Genome Research, 18, 1652. Other Special Properties Cont. • Map high intensity probes back to the genome • Locate TF binding location ChIP-DNA Noise Probes Chromosome Smoothing Peak Call • Identify regions that show epigenetic changes. Results Visualization Cisgenome http://www.biostat.jhsph.edu/~hji/cisgenome/ Ji et al (2008) Nature Biotechnology. 26, 1293 Software Packages and Comparisons Johnson et al (2008) Genome Research 18: 393 Array Platform Comparisons Johnson et al (2008) Genome Research 18: 393