Download Glimmer and GeneMark

Survey
yes no Was this document useful for you?
   Thank you for your participation!

* Your assessment is very important for improving the workof artificial intelligence, which forms the content of this project

Document related concepts

Epigenetics of diabetes Type 2 wikipedia , lookup

Transposable element wikipedia , lookup

Human genome wikipedia , lookup

Epigenetics in learning and memory wikipedia , lookup

Public health genomics wikipedia , lookup

Extrachromosomal DNA wikipedia , lookup

Genetic engineering wikipedia , lookup

Gene nomenclature wikipedia , lookup

Cancer epigenetics wikipedia , lookup

Oncogenomics wikipedia , lookup

Gene desert wikipedia , lookup

Non-coding DNA wikipedia , lookup

Epigenetics of neurodegenerative diseases wikipedia , lookup

Point mutation wikipedia , lookup

Quantitative trait locus wikipedia , lookup

Polycomb Group Proteins and Cancer wikipedia , lookup

Essential gene wikipedia , lookup

Vectors in gene therapy wikipedia , lookup

Genomics wikipedia , lookup

Pathogenomics wikipedia , lookup

Metagenomics wikipedia , lookup

Gene expression programming wikipedia , lookup

RNA-Seq wikipedia , lookup

Genomic imprinting wikipedia , lookup

Site-specific recombinase technology wikipedia , lookup

Nutriepigenomics wikipedia , lookup

Therapeutic gene modulation wikipedia , lookup

Ridge (biology) wikipedia , lookup

History of genetic engineering wikipedia , lookup

Genome (book) wikipedia , lookup

Epigenetics of human development wikipedia , lookup

Gene wikipedia , lookup

Minimal genome wikipedia , lookup

Genome evolution wikipedia , lookup

Biology and consumer behaviour wikipedia , lookup

Designer baby wikipedia , lookup

Microevolution wikipedia , lookup

Gene expression profiling wikipedia , lookup

Helitron (biology) wikipedia , lookup

Artificial gene synthesis wikipedia , lookup

Transcript
Glimmer and GeneMark
Glimmer
• Glimmer is a system for finding genes in microbial
DNA
http://ccb.jhu.edu/software/glimmer/index.shtml
• The system works by creating a variable-length
Markov model from a training set of genes and
then using that model to attempt to identify all
genes in a given DNA sequence.
Glimmer
• Local Installation on burrow.soic.indiana.edu:
– /l/glimmer3.02/bin
• All the relevant code is in
– /l/glimmer3.02/bin/
• I have added E.coli data in the following
directory to play with:
– /tmp/ecoli
Glimmer
• Running Glimmer involves a two-step process
1. Building the model using known genes
– /l/glimmer3.02/bin/build-icm -r run1.icm < /tmp/ecoli/ecoli-genes.fasta
2. Make gene predictions using glimmer3 program
– /l/glimmer3.02/bin/glimmer3 -o50 -g110 -t30 /tmp/ecoli/ecoli.fna run1.icm
run1
• For more details please refer:
– /l/glimmer3.02/glim302notes.pdf
GeneMark
• GeneMark includes a suite of software tools
for predicting protein coding genes in various
types of genomes
http://opal.biology.gatech.edu/
• The algorithms use Hidden Markov models
reflecting the "grammar" of gene organization.
GeneMark
• Local Installation on burrow:
– /l/gmsuite/
• You can run the code for prokaryotic gene
prediction using the following command
– /l/gmsuite/gmsn.pl --prok --format GFF /tmp/ecoli/ecoli.fna
• For more details please refer to:
– /l/gmsuite/README.GeneMarkSuite