Mirzadeh K, Martínez V, Toddo S, Guntur S, Herrgård MJ, Elofsson A, Nørholm MH, Daley DO
ACS Synth. Biol. 4 (9) 959-965 [2015-09-18; online 2015-05-15]
Protein production in Escherichia coli is a fundamental activity for a large fraction of academic, pharmaceutical, and industrial research laboratories. Maximum production is usually sought, as this reduces costs and facilitates downstream purification steps. Frustratingly, many coding sequences are poorly expressed even when they are codon-optimized and expressed from vectors with powerful genetic elements. In this study, we show that poor expression can be caused by certain nucleotide sequences (e.g., cloning scars) at the junction between the vector and the coding sequence. Since these sequences lie between the Shine-Dalgarno sequence and the start codon, they are an integral part of the translation initiation region. To identify the most optimal sequences, we devised a simple and inexpensive PCR-based step that generates sequence variants at the vector-coding sequence junction. These sequence variants modulated expression by up to 1000-fold. FACS-seq analyses indicated that low GC content and relaxed mRNA stability (ΔG) in this region were important, but not the only, determinants for high expression.