Download Exercise #3 There is a file called sequences.txt in the wiki 1. Write a

Survey
yes no Was this document useful for you?
   Thank you for your participation!

* Your assessment is very important for improving the workof artificial intelligence, which forms the content of this project

Document related concepts

Homology modeling wikipedia , lookup

Transcript
Exercise
#3
There
is
a
file
called
sequences.txt
in
the
wiki
1.
Write
a
script
called
perl3a.pl
that
does
the
following:
–Assign
a
name
to
each
sequence
in
sequences.txt
–Names
should
be
in
the
following
format,
YourInitials(sequential
number)...
In
other
words,
my
first
name
would
be
RGH1,
second
RGH2,
through
RGHn
–Your
output
should
be
a
single
record
containing
the
name,
a
tab
(\t)
and
the
sequence.
(RGH1
ATGCG...
RGH2
CAACGG...
....
RGHn
GATCGA..)
–Save
this
output
in
a
file
called
named_sequences.txt
–How
many
records
are
in
the
file?
What
is
the
average
sequence
length?
Save
these
answers
in
a
file
called
sequence_info.txt
I
don’t
have
the
scripts
for
the
next
two
questions
But
I
believe
you
1)can
try
to
write
them…2)
try
using
EMBOSS
•2.
You
have
purchased
a
very
fine
software
package,
which
takes
a
DNA
sequence
and
splits
it
into
groups
of
three
bases
(codons).
Use
the
codons.pl
script
to
generate
a
list
of
codons,
using
sequences.txt
as
the
input
file.
Save
the
results
to
sequence_codons.txt
•3.
This
same
very
fine
software
package
contains
a
utility
to
translate
codons
into
amino
acid
codes
(translate.pl).
You
wish
to
take
your
file
of
sequences
and
create
a
list
of
named
and
translated
protein
sequences.
Conveniently,
you
remember
a
lecture
in
the
distant
past
that
told
you
how
to
do
this
all
in
one
step
from
the
command
line.
Using
sequences.txt
as
your
source
file:
–pipe
the
output
of
codons.pl
to
translate.pl
–pipe
the
output
of
translate.pl
to
perl3a.pl
–save
the
output
of
perl3a.pl
as
sequences_final.txt
–copy
the
command
you
used
to
do
this
to
your
sequence_info.txt
file