Download Toward a DNA Solution to the Shortest Common Superstring Problem

Survey
yes no Was this document useful for you?
   Thank you for your participation!

* Your assessment is very important for improving the work of artificial intelligence, which forms the content of this project

Document related concepts
no text concepts found
Transcript
Toward a DNA Solution to the Shortest
Common Superstring Problem
Greg Gloor, Lila Kari, Michelle Gaasenbeek, and Sheng Yu
International Journal on Artificial Intelligence Tools, Vol. 8, No. 4 ,
pp. 385-399, 1999
Abstract
This paper proposes a DNA algorithm for solving an NP-complete
problem—The Shortest Common Superstring Problem by manipulation of
biomolecules.
1. The Shortest Common Superstring Problem
 Input: Alphabet  , finite set R = x1 , x2 ......xn , n  1, of strings
*
from  , and a positive integer k .
*
 Question: Is there a string w   with length w  k such that
each string xi  R, 1  i  n , is a substring of w .
 Example:
If   B, D , R  BDB , DBD , BDD and k =5, the answer to
the problem is “yes”. Indeed, the superstring BDBDD of length
5, contains all the input strings as subsequences.
2. Biomolecular Solution
 DNA algorithm:
 Example:
  B, D
R  BDB , DBD , BDD
k =5
Rule: B->AC
D->TA
Encoding: ACTAAC
TAACTA
ACTATA
In the solution pool, we may find a string : TGATTGATAT,
such that
TGATTGATAT
ACTAAC
TAACTA
ACTATA
The complement of the common superstring : TGATTGATAT
We can find the nucleotide sequence of the common
superstring by PCR. =>TGATTGATAT
ACTAACTATA
Decode the nucleotide sequence =>BDBDD
Created by: Hung-Wei Huang
Date: Sept. 17, 2003
Related documents