Construction of promoter vector pYPKa_Z_TDH3 and terminator vector pYPKa_E_TDH3¶

This notebook describe the construction of E. coli vectors pYPKa_Z_TDH3 and pYPKa_E_TDH3 with the same insert for which PCR primers are also designed.

The insert defined below is cloned in pYPKa using the blunt restriction enzymes ZraI and EcoRV in two different plasmids. The insert cloned in ZraI will be used as a promoter, while in the EcoRV site the insert will be used as a terminator.

pYPKa_Z and pYPKa_E plasmids

The pydna package is imported in the code cell below. There is a publication describing pydna as well as documentation available online. Pydna is developed on Github.

In [1]:

from pydna.readers import read
from pydna.parsers import parse
from pydna.parsers import parse_primers
from pydna.design import primer_design
from pydna.amplify import pcr
from pydna.amplify import Anneal

The vector backbone pYPKa is read from a local file.

In [2]:

pYPKa = read("pYPKa.gb")

Both restriction enzymes are imported from Biopython

In [3]:

from Bio.Restriction import ZraI, EcoRV

The vector is linearized with both enzymes.

In [4]:

pYPKa_ZraI  = pYPKa.linearize(ZraI)
pYPKa_EcoRV = pYPKa.linearize(EcoRV)

The insert sequence is read from a local file. This sequence was parsed from the ypkpathway data file.

In [5]:

ins = read("TDH3.gb")

Primers for the terminator promoter need specific tails in order to produce a SmiI and a PacI when cloned in pYPKa in the EcoRV cloning position.

In [6]:

fp_tail = "ttaaat"
rp_tail = "taattaa"

Primers with the tails above are designed in the code cell below.

In [7]:

ins = primer_design(ins)
fp = fp_tail + ins.forward_primer
rp = rp_tail + ins.reverse_primer

The primers are included in the new_primer.txt list and in the end of the pathway notebook file.

In [8]:

print(fp.format("fasta"))
print(rp.format("fasta"))
with open("new_primers.txt", "a+") as f:
    f.write(fp.format("fasta"))
    f.write(rp.format("fasta"))

>fw698 TDH3
ttaaatATAAAAAACACGCTTTTTC

>rv698 TDH3
taattaaTTTGTTTGTTTATGTGTGTTT

PCR to create the insert using the newly designed primers.

In [9]:

prd = pcr(fp, rp, ins)

The PCR product has this length in bp.

In [10]:

len(prd)

Out[10]:

A figure of the primers annealing on template.

In [11]:

prd.figure()

Out[11]:

      5ATAAAAAACACGCTTTTTC...AAACACACATAAACAAACAAA3
                             ||||||||||||||||||||| tm 50.8 (dbd) 56.4
                            3TTTGTGTGTATTTGTTTGTTTaattaat5
5ttaaatATAAAAAACACGCTTTTTC3
       ||||||||||||||||||| tm 48.8 (dbd) 55.3
      3TATTTTTTGTGCGAAAAAG...TTTGTGTGTATTTGTTTGTTT5

A suggested PCR program.

In [12]:

prd.program()

Out[12]:

Taq (rate 30 nt/s) 35 cycles             |711bp
95.0°C    |95.0°C                 |      |Tm formula: Biopython Tm_NN
|_________|_____          72.0°C  |72.0°C|SaltC 50mM
| 03min00s|30s  \         ________|______|Primer1C 1.0µM
|         |      \ 50.9°C/ 0min21s| 5min |Primer2C 1.0µM
|         |       \_____/         |      |GC 34%
|         |         30s           |      |4-12°C

The final vectors are:

In [13]:

pYPKa_Z_TDH3 = (pYPKa_ZraI  + prd).looped().synced(pYPKa)
pYPKa_E_TDH3 = (pYPKa_EcoRV + prd).looped().synced(pYPKa)

The final vectors with reverse inserts are created below. These vectors theoretically make up fifty percent of the clones. The PCR strategy below is used to identify the correct clones.

In [14]:

pYPKa_Z_TDH3b = (pYPKa_ZraI  + prd.rc()).looped().synced(pYPKa)
pYPKa_E_TDH3b = (pYPKa_EcoRV + prd.rc()).looped().synced(pYPKa)

A combination of standard primers and the newly designed primers are used for the strategy to identify correct clones. Standard primers are listed here.

In [15]:

p = { x.id: x for x in parse_primers("standard_primers.txt") }

Diagnostic PCR confirmation¶

The correct structure of pYPKa_Z_TDH3 is confirmed by PCR using standard primers 577 and 342 that are vector specific together with the TDH3fw primer specific for the insert in a multiplex PCR reaction with all three primers present.

Two PCR products are expected if the insert was cloned, the sizes depend on the orientation. If the vector is empty or contains another insert, only one product is formed.

Expected PCR products sizes from pYPKa_Z_TDH3:¶

pYPKa_Z_TDH3 with insert in correct orientation.

In [16]:

Anneal( (p['577'], p['342'], fp), pYPKa_Z_TDH3).products

Out[16]:

[Amplicon(1645), Amplicon(1477)]

pYPKa_Z_TDH3 with insert in reverse orientation.

In [17]:

Anneal( (p['577'], p['342'], fp), pYPKa_Z_TDH3b).products

Out[17]:

[Amplicon(1645), Amplicon(879)]

Empty pYPKa clone.

In [18]:

Anneal( (p['577'], p['342'], fp), pYPKa).products

Out[18]:

[Amplicon(934)]

Expected PCR products sizes pYPKa_E_TDH3:¶

pYPKa_E_TDH3 with insert in correct orientation.

In [19]:

Anneal( (p['577'], p['342'], fp), pYPKa_E_TDH3).products

Out[19]:

[Amplicon(1645), Amplicon(1396)]

pYPKa_E_TDH3 with insert in reverse orientation.

In [20]:

Anneal( (p['577'], p['342'], fp), pYPKa_E_TDH3b).products

Out[20]:

[Amplicon(1645), Amplicon(960)]

The cseguid checksum for the resulting plasmids are calculated for future reference. The cseguid checksum uniquely identifies a circular double stranded sequence.

In [21]:

print(pYPKa_Z_TDH3.cseguid())
print(pYPKa_E_TDH3.cseguid())

Z8LFAVHm3ruuq_dov31lqpmfeqA
8zg87HoFsdPFl5Ao-Sup64AGvFs

The sequences are named based on the name of the cloned insert.

In [22]:

pYPKa_Z_TDH3.locus = "pYPKa_Z_TDH3"[:16]
pYPKa_E_TDH3.locus = "pYPKa_Z_TDH3"[:16]

Sequences are stamped with the cseguid checksum. This can be used to verify the integrity of the sequence file.

In [23]:

pYPKa_Z_TDH3.stamp()
pYPKa_E_TDH3.stamp()

Out[23]:

cSEGUID_8zg87HoFsdPFl5Ao-Sup64AGvFs

Sequences are written to local files.

In [24]:

pYPKa_Z_TDH3.write("pYPKa_Z_TDH3.gb")
pYPKa_E_TDH3.write("pYPKa_E_TDH3.gb")

pYPKa_Z_TDH3.gb

pYPKa_E_TDH3.gb

Download pYPKa_Z_TDH3 ¶

In [25]:

import pydna
reloaded = read("pYPKa_Z_TDH3.gb")
reloaded.verify_stamp()

Out[25]:

cSEGUID_Z8LFAVHm3ruuq_dov31lqpmfeqA

Download pYPKa_E_TDH3 ¶

In [26]:

import pydna
reloaded = read("pYPKa_E_TDH3.gb")
reloaded.verify_stamp()

Out[26]:

cSEGUID_8zg87HoFsdPFl5Ao-Sup64AGvFs