Download Searching for Discriminant Fragments of

Survey
yes no Was this document useful for you?
   Thank you for your participation!

* Your assessment is very important for improving the workof artificial intelligence, which forms the content of this project

Document related concepts

Promoter (genetics) wikipedia , lookup

Replisome wikipedia , lookup

Agarose gel electrophoresis wikipedia , lookup

DNA sequencing wikipedia , lookup

Cell-penetrating peptide wikipedia , lookup

DNA barcoding wikipedia , lookup

Molecular cloning wikipedia , lookup

Gel electrophoresis of nucleic acids wikipedia , lookup

Bisulfite sequencing wikipedia , lookup

Ancestral sequence reconstruction wikipedia , lookup

DNA vaccination wikipedia , lookup

Expanded genetic code wikipedia , lookup

Deoxyribozyme wikipedia , lookup

Biochemistry wikipedia , lookup

Genetic code wikipedia , lookup

Cre-Lox recombination wikipedia , lookup

Non-coding DNA wikipedia , lookup

Nucleic acid analogue wikipedia , lookup

Molecular evolution wikipedia , lookup

Community fingerprinting wikipedia , lookup

Artificial gene synthesis wikipedia , lookup

Transcript
Searching for Discriminant Fragments of Cytochrome C Oxidase for Orders of
Hexapoda and A Computer Program DRNAA Designed in VB.Net
齊 心(Hsin Chi),吳行中(Shyng-Jong Wu),黃玉冰(Yu-Bing Huang),蔡宗儒(Tsung-Ju Tsai),
黃秀雯(Hsiu-Wen Huang),陳彥勳(Yen-Shun Chen),林彥廷(Yen-Ting Lin),劉秉純(Ping-Chun Liu),
許文馨(Wen-Hsin Hsu),林庭竹(Ting-Chu Lin),楊曼妙(Man-Miao Yang)
國立中興大學昆蟲學系 (Department of Entomology, National Chung Hsing University, Taichung, Taiwan)
Abstract: We collected more than 250 sequences of cytochrome c oxidase for species of the most orders of Hexapoda from Swiss-Prot
protein knowledgebase. The discriminant fragments of cytochrome c oxidase at the order level were tentatively determined. We
compared the frequency distributions of different amino acids in each sequence within and among orders. The results indicate that
discrimination based on multiple criteria is much reliable than any single one. A computer program DRNAA in Visual BASIC.Net is
designed for this study. Multiple DNA sequences can be plotted in a simple metric space for visual comparison. Dot-plot matching
for amino acid sequences is included.
Introduction With the advance of sequencing technology, DNA and protein databases of numerous organisms were accumulated
quickly, and the evolutionary relationships at different phylogenetic levels formally constructed based on morphology were examined
frequently. Cytochrome c oxidase and the respective DNA sequences have been used in phylogenetic analysis. In this study, we
collected sequences of cytochrome c oxidase of Hexapoda and searched the discriminant fragments by using a computer program.
Results and Discussion We designed a VB.net program DRNAA to plot the dot plots (Maizel and Lenk 1981) and search for the
discriminant fragments (Fig. 1). The discriminant fragments of cytochrome c oxidase at the order level were tentatively determined
(Table 1). The frequency distribution of amino acids vary slightly among orders (Fig. 2). DRNAA plots the DNA sequences in a
simple metric space (Gates 1986) (Fig. 3) and in H-curves (Hamori and Ruskin 1983) (Fig. 4). Because the cytochrome c oxidases
that have been sequenced differ substantially among studies, a proper phylogenetic analysis at the order level is not attainable at this
moment. A more coordinated approach on collecting homologous sequence data will be helpful for molecular systematics. More
coordination is also needed among biologists, information scientists, and computer scientists.
Diplura
40
Comparison of
Ct. canis:
Cr. minerva:
Po. pradoi:
Pu. irritans:
all amino acid sequences:
QLTFFHDHTLMILTMITILVGYVMVTL
FHNHSMLIILLITMLVSYIMGSLFFNK
PLMEQLMFFHNHSMLIILMITILVGYL
INFFHNHSMIIILLITILIGYMMSTLF
Table 1. The discriminant fragments for orders of Hexapoda
02. Diplura
30
20
10
WSA---PLFVWSVFITAILLLLSLPVLAGAITMLLT
0
03. Collembola
GSY---TPAL---GFVFLFTVGGLTGIIL--MFLGVNLTFFPQHFLGLNGMPRRYSDYPDAFTSWN--GFIVWAHHMFTVGMDVDTRAYFTTATMIIAVPTGVKIFSW
04. Archeognatha
SSILGAANFITT
07. Odonata
MNN---PPL---GAV---PLF---VLA---GFG
08. Blattaria
LSMLIRAEL---INMKPINM---YGSQL---YSAPCLW--GGLTGVVLANSSIDIVLHDTYYVVAHFHYVLSMGAVFAIMAGFIQWYPLF--IMFLGVNLTFFPQHFLGLAGMPRRYSDYPDAY
30
Collembola
25
20
15
10
5
0
40
Archeognatha
30
20
10
0
35
Thysanura
30
25
12. Isoptera
20
15
MLIRTELGQPGSLIGDDQIYNVIVTAHAFVMIFFMVMPIMIG--NWLVPLMLGAPDMAFPRMNNMSFWLLPPSLTLLLTSSTVESGAGTGWTVYPPLASG
IAHAGASVDLAIFSLHLAGVSSILGAVNFI---MKPERIPLFVWSVAITALLLLLSLP--GAITMLLTDRNLNTSFFDPAGGGDPILYQHLFWFFGHPEVYILILPGFGM---SHIICHE
SGKKEAFGNLGMIFAM---VWAHHMFTVGMDVDTRAYFTSATMIIAVPTGIKIFSWL
AT---YGTRMTYSAACLWALGFVF
10
5
0
35
Ephemeroptera
30
25
20
15
10
5
0
A
G
I
L
M
F
P
V
N
C
Q
H
S
T
W
Y
R
D
E
K
Fig. 2. Frequency of amino acids of cytochrome
c oxidases of five orders of Hexapoda.
Discriminant fragments:
NHS---IIL---WTI---TIL---ILP---FIALPSLRLLYL
---PSL---RLL---LLY---YLL---LLD---IGHQWYWSY
EY---DFNN---FDSYMI---FRLLDVDNR
Fig. 1. An example of sequence-comparison by using
dot plots.
Fig. 4. An example of H curves of DNA sequences.
References
European Bioinformatics Intitute. 2003. Swiss-Prot database. http://www.ebi.ac.uk/swissprot/ (2003.09.04)
Gates, M. A. 1986. A simple way to look at DNA. Theor. Biol. 119: 319-328.
Hamori, E. and J. Ruskin. 1983. H curves, a novel method of representation of nucleotide series especially suited for long DNA sequences. The Journal of Biological
Chemistry 258(2): 1318-1327.
Maizel, J. V. and R. P. Lenk. 1981. Enhanced graphic matrix analysis of nucleic acid and protein sequences. Proc. Natl. Acad. Sci. USA 78: 7665-7669.
Fig. 3. An example of DNA sequences plotted
in a metric space. (Image source of the
dipluran: http://www.kepu.com.cn/gb/lives
/insect/abc/ abc402_03.html)
If you would like to take a closer look at DRNAA, please call 0932633124 to make an appointment (today
only). DRNAA can be useful for introductory courses. (Free for non-profit use)