* Your assessment is very important for improving the workof artificial intelligence, which forms the content of this project
Download Mfold
Point mutation wikipedia , lookup
Genetic code wikipedia , lookup
Epitranscriptome wikipedia , lookup
Biosynthesis wikipedia , lookup
Biochemistry wikipedia , lookup
Homology modeling wikipedia , lookup
Deoxyribozyme wikipedia , lookup
Artificial gene synthesis wikipedia , lookup
SNP genotyping wikipedia , lookup
Protein structure prediction wikipedia , lookup
Nucleic acid analogue wikipedia , lookup
Bisulfite sequencing wikipedia , lookup
Create a folder “BIO” in your computer Download bioinformatics08.exe from 163.25.97.214 or 163.25.97.215 Decompress bioinformatics08.exe Open bioinformatics08.ppt Graphic Display in GCG Configuring Graphics Languages and Devices Program CodonPreference DotPlot Figure Frames FrameSearch /PLOt GapShow GrowTree HelicalWheel Isoelectric MapPlot Moment PepPlot PileUp PAUPDisplay PlasmidMap PlotFold PlotSimilarity PlotStructure PlotTest PrettyBox Prime StatPlot TestCode WordSearch -PLOt •GIF (Graphics Interchange Format) – GIF87a, GIF89a •HPGL (HP Graphics Language) – ColorPro, HP7470, HP7475, HP7550, HP7580, LaserJet3 •PNG (Portable Network Graphics) – For WWW Browser •PostScript •ReGIS •Sixel •Tektronix •Xwindows – Dowload x-win412.exe Exercise 08-1 Configuring X-windows Double click lecture09/x-windows/x-win412.exe Accept all default settings Start x-win32 Connect to GCG via TELNET gcg 2% go gcg 3% xwindows Use XWindows graphics with what device: Color Workstation Gray Scale Workstation Monochrome Workstation Please choose one ( * COLORWORKSTATION * ) Plotting Configuration set to: Language: xwindows Device: COLORWORKSTATION Port or Queue: GCG_Graphics gcg 4% plottest Nucleic Acid Secondary Structure Stemloop and Mfold In Nucleic acids, inverted repeat sequences may indicate foldback (self pairing)structures. Identifying Inverted Repeats Stemloop Calculating RNA Folding Mfold Displaying of Folding Structures Plotfold/Dotplot STEMLOOP StemLoop finds stems (inverted repeats) within a sequence. You specify the minimum stem length (number of nucleotides in a paired stretch), minimum and maximum loop sizes, and the minimum number of bonds per stem (length of nucleotide sequence between the paired regions). Vertical bars ('|') indicating the base pairs. The associated loop is shown to the right of the stem. If either the stem or loop is too long start size 217 AGGCTGCAGTG AGCCGTGAT 11, 25 to be displayed in its entirety quality |||||| |||| C on the line, then only that 257 TCCGGCCTCAC GTCACCGCG part that fits on the line is shown. The first and last end coordinates of the stem are stem displayed on the left, and the length of the stem (size), the number of bonds in the stem (quality), and the loop size are shown on the right. STEMLOOP Output formats 1) See the stems 2) See the stem coordinates 3) File the stems (*.fld) 4) File the stems as points for DOTPLOT 5) Choose new parameters 6) Get a different sequence Sort stems by: 1) Position 2) Quality 3) Size 221 TGCAGTG AGCCGTG 7, 18 ||||||| 248 ACGTCAC CGCGCTA 14 Loop Start End Size Quality 1 35 54 8 18 *.stem *.pnt DOTPLOT MFOLD Using energy minimization criteria, any predicted "optimal" secondary structure for an RNA or DNA molecule depends on the model of folding and the specific folding energies used to calculate that structure. Different optimal foldings may be calculated if the folding energies are changed even slightly. Because of uncertainties in the folding model and the folding energies, the "correct" folding may not be the "optimal" folding determined by the program. You may therefore want to view many optimal and suboptimal structures within a few percent of the minimum energy. You can use the variation among these structures to determine which regions of the secondary structure you can predict reliably. For instance, a region of the RNA molecule containing the same helix in most calculated optimal and suboptimal secondary structures may be more reliably predicted than other regions with greater variation. Mfold output file: *.mfold MFOLD How to read *.mfold? Survey of optimal and suboptimal foldings A) sub-optimal energy plot B) p-num plot Sampling of optimal and suboptimal foldings C) circles D) domes E) mountains F) squiggles PLOTFOLD Exercise 08-2 Stemloop & X-windows Open the file “stemloop.doc” and follow the steps. gcg2 4% fetch gb:d00063 d00063.gb_pl1 gcg2 5% stemloop d00063.gb_pl1 There are 16 stems. Would you like to 1) See the stems 2) See the stem coordinates 3) File the stems 4) File the stems as points for DOTPLOT 5) Choose new parameters 6) Get a different sequence Q)uit? Please choose one (* 1 *): Try 1-4 Sort stems by: 1) Position 2) Quality 3) Size Q)uit Please choose one (* 1 *): Exercise 08-3 Mfold & Plotfold Open the file “mfold.doc” and follow the steps. gcg2 4% fetch gb:j02061 J02061.gb_vi gcg2 5% mfold j02061.gb_vi j02061.mfold $ Mfold (Linear) MFOLD what sequence ? j02061.gb_vi Begin (* 1 *) ? End (* 121 *) ? What should I call the energy matrix output file (* j02061.mfold *) ? Primer Selection Nucleotide sequences S p e c i f i c i t y - %G C Dimer – Hairpin - Tm Amino Acid sequences Pileup Pretty Prettybox CONSENSUS Nucleotide Amino Acid backtranslate Primer Selection Program-Prime Confirm by BLAST Primer Length Minimum Maximum - ---------------------------------------------PCR Product Length Minimum Maximum - ---------------------------------------------Maximum number of primers or PCR products in output (range 1 thru 2500) Primer DNA concentration (nM) (range .1 thru 500.0) Salt concentration (mM) (range .1 thru 500.0) - ---------------------------------------------Select: forward primers, only reverse primers, only primers on both strands for PCR Set maximum overlap (in base pairs) between predicted PCR products Forward strand primer extension must include position Reverse strand primer extension must include position ---------------------------------------------- Reject duplicate primer binding sites on template Specify primer 3' clamp (using IUB ambiguity codes) ----------------------------------------------Primer % G+C Minimum (range 0.0 thru 100.0) Maximum ----------------------------------------------Primer Melting Temperature (degrees Celsius) Minimum (range 0.0 thru 200.0) Maximum ----------------------------------------------Maximum difference between melting temperatures of two primers in PCR (degrees Celsius) (range 0.0 thru 25.0) ----------------------------------------------Product % G+C Minimum (range 0.0 thru 100.0) Maximum ----------------------------------------------Product Melting Temperature (degrees Celsius) Minimum (range 0.0 thru 200.0) Maximum