* Your assessment is very important for improving the workof artificial intelligence, which forms the content of this project
Download Assignment 2
Gene expression profiling wikipedia , lookup
Genetic engineering wikipedia , lookup
Public health genomics wikipedia , lookup
Gene nomenclature wikipedia , lookup
Protein moonlighting wikipedia , lookup
History of genetic engineering wikipedia , lookup
Therapeutic gene modulation wikipedia , lookup
Microevolution wikipedia , lookup
Expanded genetic code wikipedia , lookup
Designer baby wikipedia , lookup
Genetic code wikipedia , lookup
Non-coding DNA wikipedia , lookup
Site-specific recombinase technology wikipedia , lookup
Oncogenomics wikipedia , lookup
United Kingdom National DNA Database wikipedia , lookup
No-SCAR (Scarless Cas9 Assisted Recombineering) Genome Editing wikipedia , lookup
Pathogenomics wikipedia , lookup
Whole genome sequencing wikipedia , lookup
Minimal genome wikipedia , lookup
Genome (book) wikipedia , lookup
Helitron (biology) wikipedia , lookup
Genomic library wikipedia , lookup
Human genome wikipedia , lookup
Artificial gene synthesis wikipedia , lookup
Point mutation wikipedia , lookup
Genome editing wikipedia , lookup
Assignment II Using NCBI databases 1. Find the protein with the accession number: P23367 in the NCBI protein database. (10 points) a. How many amino acids are in the protein? b. What is the function of the protein? 2. Find the gene mutL of Escherichia coli. (15 points) a. How many records did you retrieve in the NCBI Gene database? b. How many mutL genes does one Escherichia coli genome have? 3. Searching for the Homo sapiens g6pd protein in the NCBI protein database will result in records from both RefSeq and GenBank. (10 points) a. How many records are from GenBank, RefSeq, and SwissProt? b. Read about RefSeq and GenBank (e.g., in http://www.ncbi.nlm.nih.gov/RefSeq/RSfaq.html#rsgbdiff). In which database you expect to find more records? Why? 4. Find the tumor suppressor pp32r1 gene (accession number AF008216) in the nucleotide database. (15 points) a. What is the source organism and the chromosome from which the sequence has been obtained? b. At which nucleotide does translation start? c. How many amino acids are in the protein? 5. Using the NCBI cross-database search, find all entries for Human immunodeficiency virus 2 (HIV-2). (15 points) a. In which database will you be able to find how many coding sequences are in its genome? b. How many coding sequences are there? 6. Using the NCBI genome database, find the entry for the genome of Aquifex aeolicus VF5 genome (without plasmids). (10 points) a. What is the GC content of its chromosome? b. What is the length of its genome? 7. Using the NCBI Genome Project database, answer the following questions: (10 points) a. How many chromosomes are in the genome of Saccharomyces cerevisiae? b. How many different Saccharomyces species can be found in the Genome Project database? 8. Mutations on BRCA1 gene have been reported to be associated with the early onset of breast cancer. (15 points) a. How many non-synonymous mutations of human BRCA1 are in SNP database? Examine record rs70953662 in SNP database. b. What are the nucleotide alleles? c. What is the amino acid change in between these alleles?