Download Supplementary Methods - Word file (146 KB )

Survey
yes no Was this document useful for you?
   Thank you for your participation!

* Your assessment is very important for improving the workof artificial intelligence, which forms the content of this project

Document related concepts

Agarose gel electrophoresis wikipedia , lookup

Gel electrophoresis of nucleic acids wikipedia , lookup

Gel electrophoresis wikipedia , lookup

QPNC-PAGE wikipedia , lookup

List of types of proteins wikipedia , lookup

Deoxyribozyme wikipedia , lookup

Multi-state modeling of biomolecules wikipedia , lookup

Evolution of metal ions in biological systems wikipedia , lookup

Messenger RNA wikipedia , lookup

Oligonucleotide synthesis wikipedia , lookup

Epitranscriptome wikipedia , lookup

Drug design wikipedia , lookup

Photosynthetic reaction centre wikipedia , lookup

Glass transition wikipedia , lookup

Artificial gene synthesis wikipedia , lookup

Biochemistry wikipedia , lookup

Transcript
1
Supplementary Materials and Methods
Design of the automata components
A computer program was developed to design the symbols of the diagnostic molecules.
It generates a random sequence of 6 nucleotides for each disease marker and improves
this random set using a genetic algorithm. We constrained the sequences to contain 75%
CG content in each 4-nucleotides sticky end. All sticky ends derived from the symbols
were checked for complete or partial complementarity. The algorithm renders sequences
with minimal partial complementarity between non-related sticky ends. Several runs
were performed and a set of symbols with best non-overlapping properties was chosen
for diagnostic molecules construction. In the actual diagnostic molecules the 6-bp
symbols were separated by 1-bp spacers to obtain symbols of 7-bp total length.
A computer program was developed to select mRNA activation and deactivation tags,
which were then realized using ssDNA molecules in most of our experiments. It accepts
a set of mRNA sequences of the disease markers for a particular disease and provides
the two most unique short subsequences for each of these markers which also contained
a partial FokI recognition site (preferentially, first three nucleotides: 5'-TCC) to
facilitate the strand exchange. We use Hamming distance1, which is a number of
nucleotides that need to be changed to obtain one sequence from another, as the
uniqueness criterion and assume that specific interaction of each transition molecule
with its regulatory tag depends only on the uniqueness of its regulatory sticky end. The
lengths of the tags were adjusted to have a melting temperature of ~25 oC, using a
simplified assumption to determine Tm of a sequence. In a model ssDNA regulatory tags
are separated by a linker ~40 nt long, designed to have minimum interaction with other
ssDNA sequences in the system. Each tag sequence was used as a template for the
design of the transition molecules. The complete set of oligonucleotides, comprising the
2
automaton and the model disease markers, was tested for cross-interactions using the
Oligonucleotide Modeling Platform (DNA Software, Inc.) software tool for possible
flaws in the design.
Construction of the automata components
All deoxyribonucleotides employed for automata construction were ordered from
Sigma-Genosys or from the Weizmann Institute DNA synthesis unit, PAGE-purified to
homogeneity and quantified by GeneQuant instrument (Pharmacia). Non-labeled
double-stranded components were prepared by annealing 1000 pmol of each single
strand in 10 l of 50 mM NaCl, by heating to 94 oC and slow cooling down in a PCR
machine block. Diagnostic molecules employed for the experiments in Fig. 3b and
Supplementary Fig. S4 were prepared by annealing of 1000 pmol of their singlestranded components, with 3 pmol of an antisense oligonucleotide phosphorylated by
Redivue [-32P] ATP (~3000 mCi/mmol, 3.33 pmol/l, Amersham-Pharmacia).
Fluorescently labelled diagnostic inputs employed for parallel diagnosis experiment
(Fig. 3c) were prepared by annealing non-labelled sense strand of the input and either
FAM- or Cy5 5'-labelled antisense strands. Diagnostic molecules with drug release and
drug suppressor release moieties were prepared by ligation, employing 32P-labeled 5'
terminus of one of the oligonucleotides to introduce internal label in the single-stranded
loop.
Preparation of PPAP2B↓GSTP5↓PC: The oligonucleotides for the construction of the
drug-release diagnostic molecule were RL.21
(CCGAGGCGGTGCGCGACGCTCGAGCCTCGACGCTCGTTGGTATTG) and RL.22 (32PCACATCCAACGAGCGTCGAGCGTCGAGCGTCGCGCACCGCC).
The ligation was afforded by the
bridging oligonucleotides RL.25
(CTCGACGCTCGTTGGATGTGCAATACCAACGAGCGTCGAGCGTCGAGCGTCGCGCACCGCCTCGG)
Twenty pmol of RL.22 oligonucleotide (out of 1000 pmol) were 32P-labelled with 5 l
3
of [-32P] ATP (~3000 mCi/mmol, 3.33 pmol/l, Amersham) in 50 l reaction
containing T4 Polynucleotide Kinase Buffer and 20 u of T4 Polynucleotide Kinase
(New England Biolabs). After 1 hour at 37 oC, 20 u of T4 Polunucleotide Kinase in T4
Ligase Buffer were added, the volume was increased to 165 l and the reaction
continued for additional hour at 37 oC. Double stranded block was prepared by
annealing of 1000 pmol of RL.21 and 1200 pmol of RL.25. For ligation, 1000 pmol of
the labeled RL.22 oligonucleotide was mixed with the annealed block and ligated using
1,600 u of Taq Ligase (New England Biolabs) in 1 ml of Taq Ligase buffer at 55 oC for
18 hours. The ligation products were ethanol-precipitated, resuspended in TE buffer, pH
8.0 and separated using 12% denaturing PAGE (40 cm x 1.5 mm). The correct-length
ligation product was excised from the gel and extracted using standard methods. The
product was refolded prior to use. Drug suppressor-release molecule was constructed by
the identical protocol using the oligonucleotides RL.23
(CCGAGGCGGTGCGCGCGAGGCGCGAGGCGCGAGGCCCATGTGCAATAC), RL.24 (32PCAACGCACATGGGCCTCGCGCCTCGCGCCTCGCGCGCACCGCC)
and the auxiliary oligonucleotide
RL.27
(CGCGAGGCCCATGTGCGTTGGTATTGCACATGGGCCTCGCGCCTCGCGCCTCGCGCGCACCGCCTCGG).
Preparation of PPAP2BGSTP1PIM1HEPSINPC: The oligonucleotides for the
construction of the inputs were: RL.5-50
(CCGAGGCGGTGCGCGCAGGGCGGGTGGCGACGCTCGACGCTCGACGCTCG)
and RL.3-51 (32P-
TTGGTATTGCACATCCAACGAGCGTCGAGCGTCGAGCGTCGCCACCCGCCCTGCGCGCACCGCC).
They
were ligated with the help of a bridging oligonucleotide RL.25n
(GGATGTGCAATACCAACGAGCGTCGAGCGTCGAGCGTCGCCACCCGCCCTGCGCGC). Twenty pmol
of the RL.3-51 oligonucleotide were 32P-labeled; 1000 pmol of the same substrate were
phosphorylated with PNK in T4 DNA Ligase buffer with 1 mM ATP. For ligation, 1000
pmol of the RL.3-51 (mixture of 32P-labeled and phosphorylated substrates), R.5-50 and
RL.25n (bridge) oligonucleotides were mixed and ligated by 2,000 u of Taq Ligase
4
(New England Biolabs) in 1 ml of Taq Ligase buffer at 60 oC for 2 hours. The ligation
products were ethanol-precipitated, resuspended in TE buffer, pH 8.0 and separated
using 8% denaturing PAGE (40 cm x 1.5 mm). The correct-length ligation product was
excised from the gel and extracted using standard methods. The product was refolded
prior to use. Note that according to our observation, ligation product migrates much
faster than could be expected from its length, probably due to its stem-loop structure.
Regulation by mRNA
We used an mRNA transcribed from a pTRI-Xef 1 ~1900 bp DNA template provided
with the MEGAScript T7 kit (Ambion) as a generic mRNA disease marker. mRNA
sequence was folded using mFold server v 3.0 (URL:
http://www.bioinfo.rpi.edu/applications/mfold/old/rna/) and visually examined to find
sequences of low secondary structure. mRNA was synthesized using MEGAScript T7
kit and quantified by GeneQuant (Pharmacia). mRNA was refolded in solution by
heating to 70 oC and slow cooling down prior to regulation experiments. Transition
molecules were designed to match these sequences and screened to determine the most
effective activation and inactivation tags of the mRNA. These were identified at the
locations around 600 nt and 1500 nt. Transition molecules were built from fluorescently
labeled oligonucleotides to facilitate their identification. A mixture of 0.25 M active
negative and 0.25 M inactive positive transition molecules and 0.25 M of the 5'→3'
oligonucleotide for positive transition were incubated in 10 l of NEB4 (New England
Biolabs) buffer at 37 oC for 20 min with varying amounts of mRNA and analyzed by
native acrylamide gel (15%). Transition molecules involved in the experiment described
in Supplementary Fig. S3a were similar but not identical to the fluorescently labeled
molecules used in direct visualization of the regulation process presented in Fig. 3a. The
only difference was converting the negative transition to positive transition and vice
versa, by introduction and removal of spacers between the FokI sites and the state-
5
symbol recognition sticky ends, respectively, for technical reasons (Table S1). To
improve the regulation pattern, negative transition molecule was taken at 0.5 M while
positive transition molecule was at 1 M concentration.
Table S1. Molecules involved in single-step computation with pTRI-Xef mRMA
Symbol
Positive (Yes → Yes) transition
Negative (Yes → No) transition
CAGGGCC
TTCCTCATATCTTTTCTGACTGTATGGGGGATGCC
ACATACCCCCTACGGGTCC
CTGAGGATG
GCGACGAAAGACTCCTACGTCC
+
GGAAATATTCATTTGGTTTTCGCTGCTTTCTGAGCAG
ACTCA
Diagnostic computations
Diagnostic computations consist of three steps: 1) mixing the active negative and
inactive positive transition molecules for each diagnosed marker and the diagnostic
molecule(s); 2) equilibrating the software component with the mixture of ssDNA
oligonucleotides or mRNA molecules representing the molecular disease markers and 3)
processing of the diagnostic string by the hardware enzyme. For each symbol of
diagnostic string, we combine transitions in the following manner: if its marker is under
expressed in a disease, we mix 1 M of active positive transition molecule and 1 M of
inactive negative transition molecule. For a marker over-expressed in a disease, we mix
1 M of active negative transition molecule and 1 M of inactive positive transition
molecule. For some transition molecules, inactivated only by high marker
concentrations, we add 1 M of the protecting oligonucleotide (see Fig. 2 and
Supplementary Fig. S2 and their legends) to improve regulation (namely, for each pair
of transitions in the SCLC diagnosis and for PPAP2B and GSTP5-related transitions in
the PC diagnosis). All other components except FokI, including the diagnostic string
6
molecules (1 M), neutral transition molecules (1 M each), Yes- and No-verification
transition molecules (2 M each) and NEB4x10 buffer are admixed at this stage. A
mixture of model ssDNA or mRNA molecular markers is prepared in parallel, with each
marker at either zero (normal state for over-expressed gene and disease state for underexpressed gene) or 3 M concentration (normal state for under-expressed gene and
disease state for over-expressed gene). Both mixtures are thoroughly mixed to a total
volume of 9 l and incubated at 15 oC for ssDNA markers or at 37 oC for mRNA
markers for 20 min. Following equilibration, the computation is initiated by adding 1 l
of FokI enzyme (New England Biolabs) solution, either at concentration equal to the
total concentration of active transition molecules or at 5.4 M. Typical reaction
proceeded for 30 min at 15 oC, but for shorter diagnostic strings (2 symbols) incubation
times were shortened to 15 min. The reaction was quenched by addition of 1 volume of
formamide loading buffer. Samples were analyzed by denaturing PAGE (15%)
following denaturation at 94 oC for 5 min. In this assay, Yes and No outputs are
represented by 17-nt and 15-nt long bands, respectively. Radioactive gels were exposed
to Imaging Plates (Fuji) and scanned on PhosphorImager (Fuji). In the parallel
diagnoses experiment (Fig. 3c), the diagnostic molecules were labeled with FAM and
Cy5 at the 5' of their antisense strands. The gels were scanned by Typhoon 9400
instrument (Amersham Pharmacia).
Controlled drug production
Drug production (Fig. 4a) was tested with PPAP2BGSTP1PIM1HEPSIN drug
release diagnostic string (0.5 M). Different diagnostic outputs were obtained by
transition regulation with a desease marker in one of the symbols, while the rest of the
7
symbols were processed by preformed positive transition molecules (1 M each).
Absolute transition molecules concentration of 1 M was taken for HEPSIN ssDNA
model regulation and 0.25 M for pTRI-Xef mRNA regulation. The samples were
analyzed by denaturing PAGE (15%). For regulated drug release experiments (Fig. 4b)
equal amounts of diagnostic string molecules PPAP2BGSTP1 with drug-release and
drug-suppressor release moieties (0.5 M each) were mixed with 1 M of
B
GSTP1
 Yes transition molecule and varying ratios of Yes  Yes and
Yes PPAP2
1
 No at 1 M total concentration to model different diagnostic outcomes. In
Yes GSTP
all experiments Yes- and No-verifying transition molecules were added at 2 M each
and FokI enzyme at ~ 4.3 M in 10 l final volume. The mixtures were incubated at 15
o
C for 30 min, quenched with EDTA, mixed with loading buffer and analyzed by native
PAGE (20%). To control drug:drug suppressor ratio at a constant confidence level in a
positive diagnosis (Fig. 4c), varying amounts of diagnostic string molecules
PPAP2BGSTP1 with drug-release and drug-suppressor release moieties (1 M total
B
 Yes transition molecule and 0.6
concentration) were mixed with 1 M of Yes PPAP2
1
GSTP1
 Yes and 0.4 M of Yes  No transition molecules to obtain 50%
M of Yes GSTP
confidence in the positive diagnosis. Yes- and No-verifying transition molecules were
added at 2 M each and FokI enzyme at ~ 4.3 M in 10 l final volume. The mixtures
were incubated at 15 oC for 30 min, quenched with EDTA, mixed with loading buffer
and analyzed by native PAGE (20%).
Molecular components of the computer and the disease markers
DNA sequences of the oligonucleotides used for the construction of the molecular
computer components are shown in Tables S2-S7. The coloring of the nucleotides
reflects their function, as described in the main text. X stands for AAGAGCTAGAGTC
in the sense strand and for its complementary sequence in the antisense strand.
8
Table S2. Transition molecules for SCLC diagnostic moiety
Yes → Yes transition
Symbol
ASCL1↑
GCAGGGC
GRIA2↑
CCGAGGC
INSM1↑
GGTGCGC
PTTG1↑
CGGAGGC
CDKN2A↑
CGAGGGC
CCATGGATGTC
GCTCGGCCGGTACCTACAGCGTC
+
GAGACCGGCGAGCCGGCCATGGAGTTCGC
CCAAGGATGAC
TTTCCGTGGGTTCCTACTGGGCT
+
TGCCGCATAAAGGCACCCAAGGAAAACCCGA
AGCGGATGCG
TGGCTCTCTTCGCCTACGCCCAC
+
ACGCCACACCGAGAGAAGCGGAGACGGG
GGTGGATGGG
AAGGGTACCACCTACCCGCCT
+
CAGATTGGATTCCCATGGTGGAGAGGCG
GAGGGATGGC
GAGACCTCCCTACCGGCTC
+
CCTCAAATCCTCTGGAGGGACCGCCG
Yes → No transition
No → No
transition
GGCGGGGAGGTGAAGGGATG
CCACTTCCCTACCGTC
-
TGCATTCCCCCTTCGGGATG
GGGAAGCCCTACGGCT
XGGATGCC
XCCTACGGCTCC
ACCTTGGCGCACTCGGGATG
CGTGAGCCCTACCCAC
XGGATGCC
XCCTACGGACGC
ATAGGCATCATCTGAGGCAGGATG
TAGACTCCGTCCTACGCCT
XGGATGCC
XCCTACGGCTCC
GAGCACTTAGCGAATGTGGATG
CGCTTACACCTACGCTC
XGGATGCC
XCCTACGGTCCC
Table S3. ssDNA models for SCLC markers
Marker
ASCL1
GRIA2
INSM1
PTTG1
CDKN2A
Sequence (5'→3')
GAACTCCATGGCCGGCTCGCCGGTCTCATTACAATGCTGCAAACTAAGAATCTCAACGGTCCCTTCACCTCCCCGCC
TCTGGTTTTCCTTGGGTGCCTTTATGCGGCAAACAATGCTGCAAACTAAGAATCTCAACTCTCCCGAAGGGGGAATG
CA
CGTCTCCGCTTCTCTCGGTGTGGCGTGCCAGCAGATTACAATGCTGCAAACTAAGAATCTCAACTCCCGAGTGCGCC
AAGGT
CCTCTCCACCATGGGAATCCAATCTGCCAGCAGATTACAATGCTGCAAACTAAGAATCTCAACGTTCCTGCCTCAGA
TGATGCCTAT
GCGGTCCCTCCAGAGGATTTGAGGAAAAAAAATTACAATGCTGCAAACTAAGAATCTCAACAAAAAAAGTGCCACAT
TCGCTAAGTGCTC
9
Table S4. Transition molecules for PC diagnostic moiety
Symbol
PPAP2B↓
CCGAGG
GSTP1↓
GGTGCGC
PIM1↑
GCAGGGC
HEPSIN↑
GGGTGGC
Yes → Yes transition
TACTGTCTGATGAGATTGGATGGC
TACTCTAACCTACCGGGCT
ATATGCTGAGAGCAGGGGGATGGC
GTCCCCCTACCGCCAC
GCCGGATGCC
GTCGTGTCGGCCTACGGCGTC
+
TATTTCTCCCAGCACAGCCGGAGTCCGC
GTTGGATGGC
GGACCGCAACCTACCGCCCA
+
CGGCTACCCTGGCGTTGGAGCGCGG
Yes → No transition
CACCAGGATG
GTCCGGTGGTCCTACGGCT
+
CTATGCAGCAGGCCACCAGGGCTCCGA
No → No
transition
-
GATCTGGATG
TCCTCTAGACCTACCCAC
+
TCAGCGAAGGAGATCTGGTCTGGTG
XGGATGCC
XCCTACGGACGC
CTAAAGGAGGCAGAAAAAAGGATG
GTCTTTTTTCCTACCGTC
XGGATGCC
XCCTACGGTCCC
TTCGCACGTCCAGCTCGGATG
AGGTCGAGCCTACCCCA
XGGATGCC
XCCTACGGCACC
Table S5. ssDNA models for PC markers.
Marker
PPAP2B
GSTP1
PIM1
HEPSIN
Sequence (5'→3')
GCTCTCCAATCTCATCAGACAGTAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAGCCCTGGTGGCCTGC
TGCATAG
GCGTTCCCCCTGCTCTCAGCATATAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAGACCAGATC
TCCTTCGCTGA
TGCCCTTTTTTCTGCCTCCTTTAGAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAGGACTCCGGCTGT
GCTGGGAGAAATA
ACTCCGAGCTGGACGTGCGAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAGCGCTCCAACGCCA
GGGTAGCCG
Table S6. Diagnostic strings for SCLC and PC
Diagnostic string
Structure
ASCLGRIAINSMPTTGSCLC
GCAGGGCCCGAGGCGGTGCGCCGGAGGCAAAATTTACCGATTAAGTTGGA
CCGGGCTCCGCCACGCGGCCTCCGTTTTAAATGGCTAATTCAACC
PPAP2BGSTP1PIMHEPSINPC
CCGAGGCGGTGCGCGCAGGGCGGGTGGCAAAATTTACCGATTAAGTTGGA
CCGCCACGCGCGTCCCGCCCACCGTTTTAAATGGCTAATTCAACC
PTTG1CDKN2ASCLC
CGGAGGCCGAGGGCAAAATTTACCGATTAAGTTGGA
CCGGCTCCCGTTTTAAATGGCTAATTCAACC
PIM1HEPSINPC
GCAGGGCGGGTGGCAAAATTTACCGATTAAGTTGGA
CCGCCCACCGTTTTAAATGGCTAATTCAACC
10
Table S7. Molecules related to drug administration
Molecule
Yes-verifying transition molecule
No-verifying transition molecule
Drug-release moiety
Drug-suppressor release moiety
Structure
GGTCAGCAGCTGAGGATGCC
CCAGTCGTCGACTCCTACGGCTGC
GGTCAGCAGCTGAGGATGCC
CCAGTCGTCGACTCCTACGGCTCC
TATT
GACGCTCGACGCTCGACGCTCGTTGG
G
CTGCGAGCTGCGAGCTGCGAGCAACC
G
TACA
AATA
GCGAGGCGCGAGGCGCGAGGCCCATGTGC
C
CGCTCCGCGCTCCGCGCTCCGGGTACACG
C
CAA
Supplementary references
1. Hamming, R. W. Error Detecting and Error Correcting Codes. Bell Syst. Tech. J. 29,
147-160 (1950).