Download Protein RNA DNA - Molecular Systems Biology

Survey
yes no Was this document useful for you?
   Thank you for your participation!

* Your assessment is very important for improving the workof artificial intelligence, which forms the content of this project

Document related concepts

Protein moonlighting wikipedia , lookup

List of types of proteins wikipedia , lookup

Deoxyribozyme wikipedia , lookup

Promoter (genetics) wikipedia , lookup

MicroRNA wikipedia , lookup

Secreted frizzled-related protein 1 wikipedia , lookup

RNA interference wikipedia , lookup

Non-coding DNA wikipedia , lookup

Genome evolution wikipedia , lookup

Transcriptional regulation wikipedia , lookup

Molecular evolution wikipedia , lookup

Epitranscriptome wikipedia , lookup

Gene regulatory network wikipedia , lookup

Artificial gene synthesis wikipedia , lookup

RNA silencing wikipedia , lookup

Non-coding RNA wikipedia , lookup

Gene expression profiling wikipedia , lookup

Silencer (genetics) wikipedia , lookup

Gene expression wikipedia , lookup

RNA-Seq wikipedia , lookup

Transcript
THE HUMAN TRANSCRIPTOME
The central dogma in
molecular biology
The central dogma of molecular biology explains that
DNA codes for RNA, which codes for proteins. DNA is
the molecule of heredity that passes from parents to
offspring. It contains the instructions for building RNA
and proteins, which make up the structure of the body
and carry out most of its functions.
RNA
TRANSCRIPT
TRANSCRIPTION
RNA
TRANSCRIPTOME
A U C G
NUCLEOBASES
mRNA
tRNA
POLYPEPTIDE
TRANSLATION
© 2016 EMBO Press | Coordination by SciLifeLab: T. Alm and F. Henningson Johnson | Concept by M. Uhlén, B.M. Hallström, C. Lindskog, A. Mardinoglu, F. Pontén and J. Nielsen | Illustrations: Molecular Systems Biology ( U. Mackensen)
Protein
PROTEOME
A
E
L
S
R
Q
K
T
N
G
M
W
D
H
F
Y
C
I
P
V
AMINO ACIDS
32
Several genome-wide transcriptome profiling methods
have been used for quantifying global gene expression
patterns. Some of these publicly available repositories
for transcriptomics data are summarized in the table
of QR codes with a focus on datasets from RNA-seq
experiments. The resources include repositories for
external data, such as Expression Atlas from the
European Bioinformatics Institute (EBI) and Gene
Expression Omnibus from the National Center for
Biotechnology Information (NCBI), as well as repositories
with internally generated transcriptome data, such as the
GTEx, the Human Protein Atlas, and the Allen Brain Atlas.
Human Protein Atlas
KTH Royal Institute of Technology
(Sweden)
Tissue-based RNA data based on
surgically removed tissues (RNA-seq)
UHLÉN et al, 2015
Tissues & Organs
HPA
Massachusetts Institute of Technology
MIT (USA)
Alternative isoform regulation in
human tissue transcriptomes
GTEx
FANTOM
A
Cerebellum
Tonsil
Salivary gland
Thyroid gland
Lung
Esophagus
Lymph node
Heart muscle
Liver
Breast
Adrenal gland
Stomach
Gallbladder
Spleen
Duodenum
Kidney
Small intestine
Pancreas
Colon
Fallopian tube
Appendix
Ovary
Rectum
Placenta
Smooth muscle
Endometrium
Prostate
Cervix
Urinary bladder
Vagina
Testis
Bone marrow
Skeletal muscle
Blood
Adipose tissue
Artery
Skin
Nerve
Expression
not detected
1021
13 %
Expression
enriched in tissue
2359
Enhanced
expression
3227
Group enriched expression
1208
B
Brain
GTEx
Fallopian
tube
HPA
Testis
Skeletal
muscle
Skin
Kidney
Liver
A
21
22
16
Esophagus
Small
Adipose
intestine
tissue
Adrenal
Lung
gland
Thyroid
Pancreas
Heart
Stomach
Colon
Prostate
Ovary
FANTOM
16
Tissues &
Organs
CAGE
Genome-scale metabolic
models for human cells
and tissues
Proteomics
A| Proteomics and transcriptomics data can be
used for generating and improving contextspecific biological networks including protein–
protein interaction, regulatory, signaling, and
metabolic networks in order to gain further
insights into the differences in cellular functions
across tissues. Genome-scale metabolic models
(GEMs) that can be reconstructed directly from
proteomics or transcriptomics data are particularly
well suited for analyzing biological functions, since
they can be applied to examine the metabolic
functions associated with a given cell type.
B| Timeline of selected human genomic-scale
cell-/tissue-specific metabolic models. HMR2 is
currently the most comprehensive global
reconstruction of human metabolism and this
model together with other models of human
metabolism has served as a basis for the
reconstruction of context-specific GEMs.
Metabolomics
Tissues &
Organs
GTEx
RNA-seq
B
GTEx
RNA-Seq Atlas
Gene Expression Omnibus
Tissue-based RNA data
based on CAGE
Tissue-based RNA data based on
post mortem samples (RNA-seq)
Repository for RNA expression data
(both microarray and RNA-seq)
Riken Institute (Japan)
Broad Institute (USA)
J. Gutenberg University (Germany)
YU et al, 2015
KEEN & MOORE, 2015
A reference database for gene expression
profiling in normal tissue by next-generation
sequencing
fantom.gsc.riken.jp
www.gtexportal.org
KRUPP et al, 2012
medicalgenomics.org/rna_seq_atlas
Expression Atlas
The European Bioinformatics Institute
EMBL-EBI (UK)
Repository for RNA expression data
(both microarray and RNA-seq)
ArrayExpress
The European Bioinformatics Institute
EMBL-EBI (UK)
International functional genomics
public data repositories
WANG et al, 2008
PETRYSZAK et al, 2015
RUSTICI et al, 2013
www.ncbi.nlm.nih.gov/geo
/query/acc.cgi?acc=GSE12946
www.ebi.ac.uk/gxa
www.ebi.ac.uk/arrayexpress
National Center for Biotechnology Information
NCBI (USA)
BARRETT et al, 2013
www.ncbi.nlm.nih.gov/geo
Illumina Body Map
Allen Brain Atlas
RNA-seq of 16 human individual tissues
An anatomically comprehensive atlas of
the adult human brain transcriptome
Illumina (USA)
RUSTICI et al, 2013
www.ebi.ac.uk/arrayexpress
/experiments/E-MTAB-513
Allen Institute (USA)
HAWRYLYCZ et al, 2012
human.brain-map.org
2007
Evolution of gene expression
The evolution of gene expression levels
in mammalian organs
BRAWAND et al, 2011
Recon1
EHMN
University of Lausanne (Switzerland)
2010
HepatoNet
2013
iAdipocytes
www.ncbi.nlm.nih.gov/geo
/query/acc.cgi?acc=GSE30352
Scan the QR code with
a mobile device to visit
the web sites.
A| The classification of the human
protein-coding transcriptome shows
that almost half of the genes are detected
in all tissues, while 13% show a mixed
expression. The number of tissueenriched genes in the different tissues
and the overlap between the Human
Protein Atlas consortium (HPA) and the
genome-based tissue expression
consortium (GTEx) are shown in B. Overall,
it is reassuring that there is a significant
overlap in the tissue classification of the
genes based on the two independent
datasets. The fact that similar results are
obtained when using fresh frozen tissue
(HPA) and postmortem tissue (GTEx)
suggests negligible effects of the sampling
procedures used by the GTEx consortium
on RNA degradation.
Spleen
Transcriptomics
29
FANTOM
Genes
Metabolites
Recon2
2014
Reactions
HMR2
0
1500
3000
4500
6000
7500
9000
Modified from: Uhlén M, Hallström BM, Lindskog C,
Mardinoglu A, Pontén F, Nielsen J (2016).
Mol Syst Biol. 12(4): 862
To explore the human proteome
in more depth, visit
www.proteinatlas.org
Mixed
expression
2484
RNA-seq
22
Classification of all
human genes based
on tissue expression
Genes
expressed in
all tissues
8385
HPA
45 %
Pituitary
www.proteinatlas.org
AltIso
HPA
Cerebrum
Overview of the tissues and
organs analyzed using RNA-seq
by the Human Protein Atlas
consortium (HPA, dark red),
using cap analysis gene
expression (CAGE) by the
FANTOM consortium (light red),
and using RNA-seq by the
genome-based tissue
expression consortium (GTEx,
pink). Altogether, 22 tissues
and organs were studied by
both the HPA and FANTOM,
while 21 tissues overlapped
between the HPA and GTEx
datasets.
Human transcriptome
resources
GTEx
NUCLEOBASES
Global
transcriptomics
analysis of
human tissues
and organs
FANTOM
A T C G
HPA
DNA
GENOME
The human genome consists of DNA, a molecule that contains the instructions needed to build and maintain cells. These instructions are spelled out in the form of four "base pairs”,
organized into approximately 20,000 protein-coding genes. For the instructions to be carried out, DNA must be "read" and transcribed into RNA transcripts. The transcriptome is a
collection of all the transcripts present in a cell. Here, we focus on the protein-coding transcriptome to explore when and where each transcript and
the corresponding protein is present in the various cells.
ROYAL INSTITUTE
OF TECHNOLOGY
msb.embopress.org
/content/12/4/862