Download Pathway analysis

Survey
yes no Was this document useful for you?
   Thank you for your participation!

* Your assessment is very important for improving the workof artificial intelligence, which forms the content of this project

Document related concepts

Pathogenomics wikipedia , lookup

Metagenomics wikipedia , lookup

Gene expression profiling wikipedia , lookup

RNA-Seq wikipedia , lookup

Mir-92 microRNA precursor family wikipedia , lookup

Metabolic network modelling wikipedia , lookup

Transcript
Pathway analysis
Daniel Hurley
Pathway analysis: summary
A popular buzzword…
but what does it mean?
How do you do it?
How do you interpret the
results?
First
What we mean by ‘pathway analysis’
• A ‘pathway’ implies causation, but don’t be fooled!
What we mean by ‘pathway analysis’
• A ‘pathway’ implies causation, but don’t be fooled!
• Most ‘pathway analysis’ actually identifies groups of
functionally similar transcripts.
• Louis’ example:
(http://gather.genome.duke.edu/)
What we mean by ‘pathway analysis’
• A useful paper….
But the conclusion is: lots of tools,
some quite different approaches!
What we mean by ‘pathway analysis’
• Pathway analysis tools like GATHER, DAVID, and
GeneSetDB typically rely on enrichment analyses to tell
us things.
• This set of techniques asks the question ‘of this set of
genes, how many share any particular function, and is
that more than we would expect by chance?’
• Example: the top 200 most differentially-expressed
genes by some ranking (e.g. adjusted p-value)
• Determination of ‘by chance’ is usually done using a
permutation (= Monte Carlo) approach
• Other ‘pathway analyses’ involve signatures of groups
of transcripts (e.g. using Principal Component Analysis)
What we mean by ‘pathway analysis’
But what do we mean by a ‘function’?
Lots of things:
Protein function
Hypothetical
protein function
Chromosomal
location
Metabolic
pathway
association
Disease
association
Daunting
The key
point
What we mean by ‘pathway analysis’
• Pathway analysis can
identify common features
present in a group of
transcripts
• What the output means
depends on the specific
biology under study
• No such thing really as a
‘general’ pathway analysis
• A good place to start is by
finding papers relevant to
the specific biology
What can you do with it?
Some answers:
Get a general picture
of the active functions
in a condition (vs.
control)
Identify conditions
with similar
functions
Investigate the
functions
associated with a
particular gene
Investigate whether
a particular function
is active in a
condition
Differentiate
conditions by their
active functions
Next
Pathway analysis: how you do it
• Begin with a list of transcripts of interest
Pathway analysis: how you do it
• Choose a web-based tool: GATHER, DAVID and
GeneSetDB are good ones to start
But Pathguide.org has 325
pathway links at last count
Pathway analysis: how you do it
• Enter the list of transcripts: with most tools, you will
either paste in gene names or identifiers, or upload a
file
Finally
Pathway analysis: interpreting results
• Basic tools will produce ranked lists of the most
‘enriched’ categories:
GATHER
Pathway analysis: interpreting results
• More sophisticated ones will produce ‘network’
diagrams
DAVID
Ingenuity Pathways
But the interpretation of these
is
Analysis
rather subjective
Summary
• Pathway analysis should probably be called information
enrichment analysis – a more accurate term
• Used prudently, it is a useful tool for exploring the
functional landscape of an experiment
• To make it meaningful, you need to interpret the results
in the context of the specific biology under study
• There are a lot of web-based tools; start with one which
is current and produces a result you value
• To start, you need a set of (transcripts) of interest
• To present the results, you can use a simple table, or a
more complex ‘network’ diagram
• Risk: false-positives are very difficult to identify, and
with enough data you can link any molecular species to
any other species
FIN
Any questions?