* Your assessment is very important for improving the workof artificial intelligence, which forms the content of this project
Download Protein RNA DNA - Molecular Systems Biology
Survey
Document related concepts
Protein moonlighting wikipedia , lookup
List of types of proteins wikipedia , lookup
Deoxyribozyme wikipedia , lookup
Promoter (genetics) wikipedia , lookup
Secreted frizzled-related protein 1 wikipedia , lookup
RNA interference wikipedia , lookup
Non-coding DNA wikipedia , lookup
Genome evolution wikipedia , lookup
Transcriptional regulation wikipedia , lookup
Molecular evolution wikipedia , lookup
Epitranscriptome wikipedia , lookup
Gene regulatory network wikipedia , lookup
Artificial gene synthesis wikipedia , lookup
RNA silencing wikipedia , lookup
Non-coding RNA wikipedia , lookup
Gene expression profiling wikipedia , lookup
Silencer (genetics) wikipedia , lookup
Transcript
THE HUMAN TRANSCRIPTOME The central dogma in molecular biology The central dogma of molecular biology explains that DNA codes for RNA, which codes for proteins. DNA is the molecule of heredity that passes from parents to offspring. It contains the instructions for building RNA and proteins, which make up the structure of the body and carry out most of its functions. RNA TRANSCRIPT TRANSCRIPTION RNA TRANSCRIPTOME A U C G NUCLEOBASES mRNA tRNA POLYPEPTIDE TRANSLATION © 2016 EMBO Press | Coordination by SciLifeLab: T. Alm and F. Henningson Johnson | Concept by M. Uhlén, B.M. Hallström, C. Lindskog, A. Mardinoglu, F. Pontén and J. Nielsen | Illustrations: Molecular Systems Biology ( U. Mackensen) Protein PROTEOME A E L S R Q K T N G M W D H F Y C I P V AMINO ACIDS 32 Several genome-wide transcriptome profiling methods have been used for quantifying global gene expression patterns. Some of these publicly available repositories for transcriptomics data are summarized in the table of QR codes with a focus on datasets from RNA-seq experiments. The resources include repositories for external data, such as Expression Atlas from the European Bioinformatics Institute (EBI) and Gene Expression Omnibus from the National Center for Biotechnology Information (NCBI), as well as repositories with internally generated transcriptome data, such as the GTEx, the Human Protein Atlas, and the Allen Brain Atlas. Human Protein Atlas KTH Royal Institute of Technology (Sweden) Tissue-based RNA data based on surgically removed tissues (RNA-seq) UHLÉN et al, 2015 Tissues & Organs HPA Massachusetts Institute of Technology MIT (USA) Alternative isoform regulation in human tissue transcriptomes GTEx FANTOM A Cerebellum Tonsil Salivary gland Thyroid gland Lung Esophagus Lymph node Heart muscle Liver Breast Adrenal gland Stomach Gallbladder Spleen Duodenum Kidney Small intestine Pancreas Colon Fallopian tube Appendix Ovary Rectum Placenta Smooth muscle Endometrium Prostate Cervix Urinary bladder Vagina Testis Bone marrow Skeletal muscle Blood Adipose tissue Artery Skin Nerve Expression not detected 1021 13 % Expression enriched in tissue 2359 Enhanced expression 3227 Group enriched expression 1208 B Brain GTEx Fallopian tube HPA Testis Skeletal muscle Skin Kidney Liver A 21 22 16 Esophagus Small Adipose intestine tissue Adrenal Lung gland Thyroid Pancreas Heart Stomach Colon Prostate Ovary FANTOM 16 Tissues & Organs CAGE Genome-scale metabolic models for human cells and tissues Proteomics A| Proteomics and transcriptomics data can be used for generating and improving contextspecific biological networks including protein– protein interaction, regulatory, signaling, and metabolic networks in order to gain further insights into the differences in cellular functions across tissues. Genome-scale metabolic models (GEMs) that can be reconstructed directly from proteomics or transcriptomics data are particularly well suited for analyzing biological functions, since they can be applied to examine the metabolic functions associated with a given cell type. B| Timeline of selected human genomic-scale cell-/tissue-specific metabolic models. HMR2 is currently the most comprehensive global reconstruction of human metabolism and this model together with other models of human metabolism has served as a basis for the reconstruction of context-specific GEMs. Metabolomics Tissues & Organs GTEx RNA-seq B GTEx RNA-Seq Atlas Gene Expression Omnibus Tissue-based RNA data based on CAGE Tissue-based RNA data based on post mortem samples (RNA-seq) Repository for RNA expression data (both microarray and RNA-seq) Riken Institute (Japan) Broad Institute (USA) J. Gutenberg University (Germany) YU et al, 2015 KEEN & MOORE, 2015 A reference database for gene expression profiling in normal tissue by next-generation sequencing fantom.gsc.riken.jp www.gtexportal.org KRUPP et al, 2012 medicalgenomics.org/rna_seq_atlas Expression Atlas The European Bioinformatics Institute EMBL-EBI (UK) Repository for RNA expression data (both microarray and RNA-seq) ArrayExpress The European Bioinformatics Institute EMBL-EBI (UK) International functional genomics public data repositories WANG et al, 2008 PETRYSZAK et al, 2015 RUSTICI et al, 2013 www.ncbi.nlm.nih.gov/geo /query/acc.cgi?acc=GSE12946 www.ebi.ac.uk/gxa www.ebi.ac.uk/arrayexpress National Center for Biotechnology Information NCBI (USA) BARRETT et al, 2013 www.ncbi.nlm.nih.gov/geo Illumina Body Map Allen Brain Atlas RNA-seq of 16 human individual tissues An anatomically comprehensive atlas of the adult human brain transcriptome Illumina (USA) RUSTICI et al, 2013 www.ebi.ac.uk/arrayexpress /experiments/E-MTAB-513 Allen Institute (USA) HAWRYLYCZ et al, 2012 human.brain-map.org 2007 Evolution of gene expression The evolution of gene expression levels in mammalian organs BRAWAND et al, 2011 Recon1 EHMN University of Lausanne (Switzerland) 2010 HepatoNet 2013 iAdipocytes www.ncbi.nlm.nih.gov/geo /query/acc.cgi?acc=GSE30352 Scan the QR code with a mobile device to visit the web sites. A| The classification of the human protein-coding transcriptome shows that almost half of the genes are detected in all tissues, while 13% show a mixed expression. The number of tissueenriched genes in the different tissues and the overlap between the Human Protein Atlas consortium (HPA) and the genome-based tissue expression consortium (GTEx) are shown in B. Overall, it is reassuring that there is a significant overlap in the tissue classification of the genes based on the two independent datasets. The fact that similar results are obtained when using fresh frozen tissue (HPA) and postmortem tissue (GTEx) suggests negligible effects of the sampling procedures used by the GTEx consortium on RNA degradation. Spleen Transcriptomics 29 FANTOM Genes Metabolites Recon2 2014 Reactions HMR2 0 1500 3000 4500 6000 7500 9000 Modified from: Uhlén M, Hallström BM, Lindskog C, Mardinoglu A, Pontén F, Nielsen J (2016). Mol Syst Biol. 12(4): 862 To explore the human proteome in more depth, visit www.proteinatlas.org Mixed expression 2484 RNA-seq 22 Classification of all human genes based on tissue expression Genes expressed in all tissues 8385 HPA 45 % Pituitary www.proteinatlas.org AltIso HPA Cerebrum Overview of the tissues and organs analyzed using RNA-seq by the Human Protein Atlas consortium (HPA, dark red), using cap analysis gene expression (CAGE) by the FANTOM consortium (light red), and using RNA-seq by the genome-based tissue expression consortium (GTEx, pink). Altogether, 22 tissues and organs were studied by both the HPA and FANTOM, while 21 tissues overlapped between the HPA and GTEx datasets. Human transcriptome resources GTEx NUCLEOBASES Global transcriptomics analysis of human tissues and organs FANTOM A T C G HPA DNA GENOME The human genome consists of DNA, a molecule that contains the instructions needed to build and maintain cells. These instructions are spelled out in the form of four "base pairs”, organized into approximately 20,000 protein-coding genes. For the instructions to be carried out, DNA must be "read" and transcribed into RNA transcripts. The transcriptome is a collection of all the transcripts present in a cell. Here, we focus on the protein-coding transcriptome to explore when and where each transcript and the corresponding protein is present in the various cells. ROYAL INSTITUTE OF TECHNOLOGY msb.embopress.org /content/12/4/862