Survey
* Your assessment is very important for improving the workof artificial intelligence, which forms the content of this project
* Your assessment is very important for improving the workof artificial intelligence, which forms the content of this project
Exercise #3 There is a file called sequences.txt in the wiki 1. Write a script called perl3a.pl that does the following: –Assign a name to each sequence in sequences.txt –Names should be in the following format, YourInitials(sequential number)... In other words, my first name would be RGH1, second RGH2, through RGHn –Your output should be a single record containing the name, a tab (\t) and the sequence. (RGH1 ATGCG... RGH2 CAACGG... .... RGHn GATCGA..) –Save this output in a file called named_sequences.txt –How many records are in the file? What is the average sequence length? Save these answers in a file called sequence_info.txt I don’t have the scripts for the next two questions But I believe you 1)can try to write them…2) try using EMBOSS •2. You have purchased a very fine software package, which takes a DNA sequence and splits it into groups of three bases (codons). Use the codons.pl script to generate a list of codons, using sequences.txt as the input file. Save the results to sequence_codons.txt •3. This same very fine software package contains a utility to translate codons into amino acid codes (translate.pl). You wish to take your file of sequences and create a list of named and translated protein sequences. Conveniently, you remember a lecture in the distant past that told you how to do this all in one step from the command line. Using sequences.txt as your source file: –pipe the output of codons.pl to translate.pl –pipe the output of translate.pl to perl3a.pl –save the output of perl3a.pl as sequences_final.txt –copy the command you used to do this to your sequence_info.txt file