Download Tutorial_Part2_incl_results_final - Bioinfo-casl

Survey
yes no Was this document useful for you?
   Thank you for your participation!

* Your assessment is very important for improving the workof artificial intelligence, which forms the content of this project

Document related concepts

CAN bus wikipedia , lookup

Network tap wikipedia , lookup

IEEE 1355 wikipedia , lookup

List of wireless community networks by region wikipedia , lookup

Airborne Networking wikipedia , lookup

Transcript
Tutorial session 2
Network annotation
Exploring PPI networks using Cytoscape
EMBO Practical Course Session 8
Nadezhda Doncheva and Piet Molenaar
Overview
Focus: Network annotation and visualization


Loading and manipulating attributes

Identifier mapping

Mapping data onto the network

Use visuals to convey data
Concepts


Vizmapper
Data


2
Human Neuroblastoma mutated genes list
5/23/2017
Attributes
Nodes and edges can have
attributes associated with
them


Gene expression data

Mass spectrometry data

Protein structure information

Gene Ontology terms, etc.
Cytoscape supports multiple
data types: Numbers, Text,
Logical, Lists...

3
5/23/2017
Loading attributes

Use pre-formatted attribute files

Import attribute from table

Excel file

Comma or tab delimited text
Import attribute from web services


NCBI Entrez Gene

Ensembl Biomart

Use ‘import attribute or expression matrix’

Create attributes manually in the attribute browser
4
5/23/2017
Loading attributes from table (Demo)
5
5/23/2017
Use Case 2.1: Neuroblastoma

Childhood neuro endocrine tumor

Young children

Variable clinical outcome



Low stages

Good prognosis

Numeric changes of chromosomal copy numbers
High stages

Poor prognosis

Structural chromosomal defects (LOH1p / 11q etc)
Few gene defects identified

MYCN amplification (20%)

ALK activation (7%)

CCND1 / PHOX2B / NF1
Use Case 2.1: Neuroblastoma

Poor prognosis

Subgroup (~1/3) characterized by MYCN amplification

Rest unknown
Use case: Assignment 2.1

Whole genome sequence of 86 tumor vs blood

1043 genes with mutations
1.
Load the list of genes
(neuroblastoma_mutated_symbols.txt) as a network
2.
Use the tab separated dataset
(neuroblastoma_mutated_annotations.txt) to map
additional information
1.
8
Make sure the attributes have informative names
5/23/2017
Assignment 2.1 results
Load the list; use the
same importer
1.
1.
Load the annotations
2.
9
No interactions yet
1.
Check text import
settings
2.
Check mapping settings
5/23/2017
Attribute management
Different type of attributes: Strings, Numbers, …
Select attributes for display
Node or Edge ID
10
Specific Attribute Tabs: for Nodes, Edges, and Network
5/23/2017
Tips & Tricks: Root Graph and sessions

”There is one graph to rule them all...”

The networks in Cytoscape are all ”views” on a single
graph.

Changing the attribute for a node in one network will
also change that attribute for a node with the same ID in
all other loaded networks

There is no way to ”copy” a node and keep the same ID
 Make a copy of the session
11
5/23/2017
Identifier mapping

Identifiers (IDs) are ideally unique, stable name or numbers

But: too many IDs and different database records for Gene, DNA,
RNA, Protein

The ID Mapping challenge:


Avoid errors by mapping IDs correctly

Gene names are ambiguous

Excel introduces errors

Problems reaching 100 % coverage
Recommendations (for proteins and genes):

Map everything to Entrez Gene IDs using a spreadsheet

Manually curate missing mappings to achieve 100 % coverage

Be careful of Excel auto conversions
12
5/23/2017
Identifier mapping (Demo)
13
5/23/2017
Use case: Assignment 2.2
1.
Use the Biomart plugin to map UniProt identifiers on
the genes
http://cytoscape.org/manual/Cytoscape2_8Manual.html#Node Name Mapping
14
5/23/2017
Assignment 2.2 results

Use the Ensemble 68 set

Input data type is HGNC
symbols

Import more than just the
UniProt IDs
15
5/23/2017
Data mapping

Mapping of data values associated with graph elements
onto graph visuals
16
5/23/2017
Data mapping



Visual attributes

Node fill color, border color, border width, size, shape, opacity,
label

Edge type, color, width, ending type, ending size, ending color
Mapping types

Passthrough (labels)

Continuous (numeric values)

Discrete (categories)
Visual style
17
5/23/2017
VizMapper
List of Visual Styles
Default Visual
Style editor
List of Data attributes
List of Visual attributes
Mapping definition
18
5/23/2017
Data mapping (Demo)
19
5/23/2017
Tips & Tricks: Data mapping

Avoid cluttering your visualization with too much data

Map the data you are specifically interested in to call out
meaningful differences

Mapping too much data to visual attributes may just
confuse the viewer

Create multiple networks and map different values
20
5/23/2017
Use case: Assignment 2.3
1.
Map the size of the nodes to the number of
occurrences
2.
Map color to the tumor ids
1.
Hint: use a rainbow pattern
http://cytoscape.org/manual/Cytoscape2_8Manual.html#Visual Styles
21
5/23/2017
Assignment 2.3 results

Use gradient


Readily shows higher
number of mutations
Use rainbow

22
Similar names, similar
colors
5/23/2017
Assignment 2.3 results
23
5/23/2017
Exploring expression data

VistaClara plugin

Exploratory data analysis of multi-experiment microarray
studies

A graphical and interactive alternative to the standard attribute
browser
24
5/23/2017
VistaClara (Demo)
25
5/23/2017
Filtering & editing data


Use filters

QuickFind nodes and edges

Index the network based on a node or edge attribute

Dynamic filtering for numerical attributes

Build complex filters using AND, OR, NOT relations

Define topological filters (considers properties of near-by
nodes)
Create subnetworks
26
5/23/2017
Filtering & editing data (Demo)
27
5/23/2017
Use case: Assignment 2.4
1.
What is the gene with most mutations?
2.
Filter the network for genes with more than one
mutation (why?) and save the new network.
3.
Use the Bisogenet plugin...
1.
...to find interactions among these
2.
...to find interactions among these and their first neighbours
(or explore different settings of Bisogenet according to your
taste)
4.
Store your session for later use
http://cytoscape.org/manual/Cytoscape2_8Manual.html#Finding and Filtering Nodes and Edges
http://bio.cigb.edu.cu/biso/BisoGenet_User_Manual.pdf
28
5/23/2017
Assignment 2.4 results
MYCN
1.
1.
2.
29
Frequently amplified; no
additional information
More likely not to be
bystander
5/23/2017
Assignment 2.4 results
MYCN
1.
1.
Frequently amplified; no
additional information
2.
More likely not to be
bystander
3.
Bisogenet
1.
Between: Only large
genes
2.
Neighbours: Promising
hairball
30
5/23/2017
To be continued…

Build, visualize and analyze your own network with
Cytoscape

Network generation

Network annotation and visualization

Network analysis
31

Identify active subnetworks

Analyze Gene Ontology enrichment

Perform topological analysis

Find network clusters

Find network motifs
5/23/2017