Download Permutation to assess the generalizability of the reduction in error

Survey
yes no Was this document useful for you?
   Thank you for your participation!

* Your assessment is very important for improving the workof artificial intelligence, which forms the content of this project

Document related concepts

X-inactivation wikipedia , lookup

Quantitative trait locus wikipedia , lookup

Saethre–Chotzen syndrome wikipedia , lookup

Epistasis wikipedia , lookup

Epigenetics in learning and memory wikipedia , lookup

NEDD9 wikipedia , lookup

Gene therapy of the human retina wikipedia , lookup

Neuronal ceroid lipofuscinosis wikipedia , lookup

Polycomb Group Proteins and Cancer wikipedia , lookup

Epigenetics of neurodegenerative diseases wikipedia , lookup

Genetic engineering wikipedia , lookup

Copy-number variation wikipedia , lookup

Essential gene wikipedia , lookup

Pathogenomics wikipedia , lookup

Public health genomics wikipedia , lookup

Epigenetics of diabetes Type 2 wikipedia , lookup

Vectors in gene therapy wikipedia , lookup

History of genetic engineering wikipedia , lookup

Gene therapy wikipedia , lookup

Minimal genome wikipedia , lookup

Genomic imprinting wikipedia , lookup

Ridge (biology) wikipedia , lookup

RNA-Seq wikipedia , lookup

Therapeutic gene modulation wikipedia , lookup

Nutriepigenomics wikipedia , lookup

Biology and consumer behaviour wikipedia , lookup

Gene nomenclature wikipedia , lookup

The Selfish Gene wikipedia , lookup

Genome evolution wikipedia , lookup

Gene desert wikipedia , lookup

Site-specific recombinase technology wikipedia , lookup

Gene wikipedia , lookup

Epigenetics of human development wikipedia , lookup

Genome (book) wikipedia , lookup

Gene expression programming wikipedia , lookup

Artificial gene synthesis wikipedia , lookup

Gene expression profiling wikipedia , lookup

Designer baby wikipedia , lookup

Microevolution wikipedia , lookup

Transcript
Permutation to assess the generalizability of the reduction in error rate
observed by addition of the ‘histology gene’
To assess whether the reduction seen in error rate was specific to the
selected set of 10 discriminatory genes all 1280 genes were ranked by adding
the squared values of the Kolmogorov Smirnov statistic (unweighted this time
ie 0.0 to 1.0) from the subsets as described above. In this scoring system
high scores in any single subset will effect the gene ranking more than
medium scores across all subsets. From the 20 highest scoring genes
random gene subsets consisting of between 8 and 16 genes were selected so
that both the number of genes and the particular genes used varied.
For each of these predictors we determined the predictor error with and
without the addition of the histology gene. Histograms of the errors are shown
without (S2a) and with (S2b) the histology gene. The distribution of errors for
these random predictors is shifted toward lower errors by incorporation of
information about tumour content from the histology gene.
Five of these
predictors (all unique) gave errors of 9% (4 errors) when combined with the
histology gene. Fig. S2c shows the histogram of paired error differences such
that positive values indicate reduced error by addition of the histology gene.
Of the 1000 predictors, 91% showed either no change or a fall in prediction
error, 49% showed an absolute reduction in error rate of greater than 10%
and 10% showed an absolute reduction in error rate of greater than 20%.