Previous Article | Next Article ![]()
Infection and Immunity, April 2002, p. 1896-1908, Vol. 70, No. 4
0019-9567/02/$04.00+0 DOI: 10.1128/IAI.70.4.1896-1908.2002
Copyright © 2002, American Society for Microbiology. All Rights Reserved.
P27 Is Not Related to Other Stx Phage Genomes, but the Modular Genetic Structure Is Conserved
Institut für Hygiene und Mikrobiologie der Universität Würzburg, Würzburg, Germany
Received 21 September 2001/ Returned for modification 20 November 2001/ Accepted 5 January 2002
|
|
|---|
P27, isolated from the Shiga toxin-producing Escherichia coli patient isolate 2771/97.
P27 is integrated as a prophage in the chromosomal yecE gene. This integration generates identity segments of attL and attR sites with lengths of 11 nucleotides. The integrated prophage genome has a size of 42,575 bp. We identified 58 open reading frames (ORFs), each with a length of >150 nucleotides. The deduced proteins of 44 ORFs showed significant homologies to other proteins present in sequence databases, whereas 14 putative proteins did not. For 29 proteins, we could deduce a putative function. Most of these are related to the basic phage propagation cycle. The
P27 genome represents a mosaic composed of genetic elements which are obviously derived from related and unrelated phages. We identified five short linker sequences of 22 to 151 bp in the
P27 sequence which have also been detected in a couple of other lambdoid phages. These linkers are located between functional modules in the phage genome and are thought to play a role in genetic recombination. Although the overall DNA sequence of
P27 is not highly related to other known phages, the data obtained demonstrate a typical lambdoid genome structure. |
|
|---|
P27 and the phage morphology were characterized. Furthermore, it showed that only 1 out of 11 Stx2e-producing STEC isolates harbors an infectious Stx2e-encoding phage. From the other isolates no Stx phages could be induced (42).
The aim of the present study was to determine the nucleotide sequence of Stx2e-encoding phage
P27, to describe its genome organization, and to compare it with that of other Stx phages. Furthermore, we were interested in the general genetic localization of stx2e genes in E. coli.
|
|
|---|
transductants lysogenized with
P27. E. coli DH5
(GibcoBRL, Eggenstein, Germany) was used as the host for cloning experiments. Stx2e-producing strains 55/89 (O2:H5), 3054/97 (O60:H2), EH186 (O60:H-), EH60 (O101:H9), 24059/97 (ONT:H-), 3229/98 (O60:H-), and 3357/98 (O60:H-) were isolated from patients with diarrhea; strains 24066/97 (ONT:H-) and 26275/97 (ONT:H-) were from asymptomatic carriers; and strains 2392/98 (O60:H-) and 626/98 (O26:H-) originated from porcine meat. The two strains isolated from the feces of pigs, E57 (O138:K81) and ED-53 (O101:H-), were described earlier (42). Plasmid pK18 was used as a cloning vector (53). Plasmid pIM10 contains functional recA of E. coli (21) and was kindly provided by Gabriele Blum-Oehler and Jörg Hacker, Würzburg, Germany.
Preparation of
P27 phage DNA.
A freshly grown single colony of the lysogen T21(pIM10) was suspended in 200 ml of Luria-Bertani broth and incubated with vigorous shaking until an optical density at 600 nm of 0.5 was reached. After adjusting the culture with mitomycin C (Sigma-Aldrich, Deisenhofen, Germany) to a final concentration of 0.5 µg/ml, the flask with the bacterial suspension was wrapped in aluminum foil and incubated overnight. The phage particles were separated from the cell debris by centrifugation (23,000 x g, 30 min, 4°C) followed by filtration through a funnel filter (Filtrak, Niederschlag, Germany). To remove bacterial nucleic acids, DNase I and RNase A (Sigma-Aldrich) were added to a final concentration of 0.5 µg/ml (each). After an incubation at 37°C for 45 min, sodium chloride was added to a final concentration of 5.8% (wt/vol) and dissolved and the solution was incubated on ice for 1 h. After a centrifugation step (23,000 x g, 10 min, 4°C), the phage particles were precipitated by adjusting the supernatant to 10% polyethylene glycol 8000 (wt/vol). After dissolving the polyethylene glycol 8000 at room temperature, the mixture was incubated on ice for 1 h. Phage particles were harvested by centrifugation (25,000 x g, 30 min, 4°C). The resulting phage pellet was dried at room temperature and dissolved in 1 ml of SM buffer (5.8 g of NaCl, 2 g of MgSO4·7H2O, 50 ml of 1 M Tris-HCl [pH 7.5], and 0.1 g of gelatin in a final volume of 1 liter of double-distilled H2O). To release phage DNA, 1 ml of twofold-concentrated proteinase K buffer (20 ml of 1 M Tris-HCl [pH 8.0], 20 ml of 0.5 M EDTA [pH 8.0], and 100 ml of 10% sodium dodecyl sulfate [wt/vol] in a final volume of 1 liter of double-distilled H2O) and proteinase K (Merck GmbH, Darmstadt, Germany) at a final concentration of 20 mg/ml were added and the suspension was incubated for 1 h at 56°C. The lysate was deproteinized twice by phenol-chloroform (1:1 [vol/vol]) extraction. Then the phage DNA was collected by ethanol precipitation. The resulting pellet was washed with ethanol (70% [vol/vol]) and dried at room temperature. Finally, the DNA was dissolved in 200 µl of double-distilled H2O.
DNA techniques.
DNA sequencing was performed with an automated DNA sequencer (ABI Prism 377; Perkin-Elmer/Applied Biosystems, Weiterstadt, Germany) by using an ABI Prism BigDye Terminator Cycle Sequencing kit (Perkin-Elmer/Applied Biosystems). Random EcoRI, BamHI, SalI, SphI, KpnI, and PstI fragments of
P27 were cloned in pK18, and initial sequence information of these clones was obtained with universal and reverse primers for pUC/M13 vectors. These sequences were used to create customized oligonucleotides for primer walking. Genomic DNA of phage
P27 was used as the template for primer walking.
PCR was conducted with the GeneAmp 9600 PCR System (Perkin-Elmer/Applied Biosystems). Briefly, one freshly grown colony of the respective strain was suspended in 50 µl of 0.85% sodium chloride solution. The amplification reaction was performed in a total volume of 50 µl containing 5 µl of bacterial suspension, 30 pmol of each primer, 5 µl of 10-fold Taq DNA polymerase buffer, 3 µl of MgCl2 stock solution, and 2 U of Amplitaq DNA polymerase (Perkin-Elmer/Applied Biosystems).
Whole bacterial DNA was isolated as described by Heuvelink et al. (31). DNA-DNA hybridization was performed according to the manufacturer's recommendation (Roche Diagnostics, Mannheim, Germany). An int probe was generated by PCR with primers 2771-12 (5'-GAAAACCATCTGGGCAC-3') and 2771-21 (5'-GCACCACATCAAGGTTC-3'). The generation of PCR-derived hybridization probes was described earlier (56). PCR and sequence analysis of aroE were performed with primers aro+1 (5'-CGGATGAGCTTACTGAAC-3') and aro-2 (5'-CAATAAATGCCTGGATGAATGAG-3'). Oligonucleotide primers were purchased from Sigma-ARK GmbH, Darmstadt, Germany.
Sequence assembly and analysis.
Sequence data were assembled into contigs by using PREGAP 4 and GAP 4 of the Staden package (65). Searches for open reading frames (ORFs) and predictions of translation start positions were performed online with the WebGeneMark.hmm software (5). Searches for homologous DNA and protein sequences were conducted with the BLAST software (1) against the nonredundant GenBank database (http://www.ncbi.nlm.nih.gov/blast/blast/). Predicted
P27 protein sequences were also compared with Prosite (10) and the Protein Families (Pfam) database for conserved motifs (4).
P27 protein sequences were further compared with proteins found in BLAST searches by using the program GeneDoc, version 2.6.01 (www.psc.edu/biomed/genedoc). Transmembrane regions were predicted with TMpred (32). tRNA genes and tRNA structure were determined with tRNAscan-SE (36). A compendium of online tools at the German Cancer Research Center (DKFZ), Heidelberg, Germany (http://genome.dkfz-heidelberg.de/) was used to perform further DNA analysis.
Nucleotide sequence accession number.
The complete sequence of bacteriophage
P27 has been deposited in the EMBL and GenBank database libraries and given the accession no. AJ298298.
|
|
|---|
P27 genome.
In order to determine the complete nucleotide sequence of
P27, we isolated phage DNA from the E. coli transductant T21(pIM10) (42). DNA fragments of
P27 restricted with either EcoRI, BamHI, SalI, SphI, KpnI, or PstI were randomly cloned in pK18, and initial sequences were determined by using universal and reverse primers for pUC/M13 vectors. Using this sequence information, walking primers were designed to determine the whole sequence. For this purpose, isolated genomic DNA of
P27 was used as a template.
The assembly of all sequence information obtained revealed a linear phage genome and did not result in a circular permutation of the phage sequence. The absence of direct terminal repeats upon sequence assembly indicated that
P27 DNA is restricted by a site-specific mechanism for DNA packaging and not by using a head full mechanism as postulated for E. coli O157:H7 Stx phage 933W (52). In phage
, site-specific packaging results in the formation of cohesive ends (cos sites) (18). In the prophage state, cos sites are covalently joined together. We used two primers (Fig. 1b) to amplify the region containing the putative cos sites of the
P27 prophage. We compared the DNA sequence of this PCR product with those of the corresponding regions of the DNA isolated from free phage particles (Fig. 1a). From the latter, we revealed runoff sequences where the sequencing reaction stopped abruptly, indicating the end of the template. Alignment of the sequences of the PCR product and phage ends revealed a gap of 10 nucleotides in the latter sequence (Fig. 1). This demonstrates that
P27 possesses cos sites and that these ends contain 3' single-strand extensions (5'-CGCCCGCCCC-3').
![]() View larger version (38K): [in a new window] |
FIG. 1. Scheme of the sequencing procedure used for the identification of cos sites in the P27 genome. (a) Sequencing primer binding sites were selected close to the ends of the linear P27 genome to obtain runoff sequences. Electropherograms of the runoff sequences are presented. (b) In the upper part, the DNA sequence of the corresponding region sequenced from the prophage is shown. The sequencing procedure is explained in the lower part. Black arrows indicate binding sites and the directions of primers. Waves depict cycle sequence reactions that have been performed directly from the phage DNA template (a) and from a PCR product (b). Arrowheads with bars indicate the stops of sequencing reactions. The phage attachment site (attP) and the attL and attR sites generated by insertion of the phage are highlighted by boxes. Underlined nucleotides were added by the Taq polymerase independent of the template.
|
P27 in the bacterial chromosome, we looked for a DNA fragment at the prophage-host chromosome border containing both phage DNA and bacterial DNA. Integration of lambdoid phages into the bacterial chromosome generally occurs by site-specific recombination between the phage attP and the bacterial attB sites (12). As a result of integration, the prophage is flanked by short direct repeats (identity segments of attL and attR), separating both phage and host chromosome. One of the phage-host border sequences is generally neighbored by a phage integrase (int) gene (11).
We restricted bacterial DNA of E. coli 2771/97 and phage DNA of
P27 with EcoRV and hybridized the restriction fragments with a probe specific for the
P27 integrase gene (int). This revealed a fragment of 2.2 kb in the genomic DNA of E. coli 2771/97 and a 3.0-kb fragment in the DNA isolated from phage particles. Due to the integration event, changes of restriction fragment lengths have been expected. We performed an inverse PCR approach for further analysis. Chromosomal DNA of E. coli 2771/97 was digested with EcoRV, and restriction fragments were treated with T4 DNA ligase and used as templates for PCR. Two primers were designed, which are orientated in opposite directions outwards the integrase gene. This revealed a PCR product with the expected length of 2.2 kb which was subsequently sequenced. By sequence analysis of the PCR product, we identified the phage integrase gene close to a gene identical to chromosomal E. coli yecE (Fig. 2). The prophage is integrated in the 5' region of the yecE gene. According to WebGeneMark.hmm, the phage provides yecE with a new translational start. This putatively leads to the production of a truncated YecE, missing its 31 N-terminal amino acids.
![]() View larger version (22K): [in a new window] |
FIG. 2. Determination of the phage integration site. (a) The attP site of P27 is depicted, including parts of ORF L58 and int. (b) The corresponding attB site in the chromosomal yecE gene is shown. (c) Depiction of the DNA sequence of one of the borders of the integrated prophage. According to WebGeneMark.hmm, integration of the prophage into yecE leads to a truncated protein. The upper lines are the nucleotide sequences, and the lower lines are the amino acid sequences. The numbers below the amino acid sequence mark the amino acid positions of the respective protein. yecE' labels the truncated amino acid sequence, asterisks indicate stop codons, and arrows indicate the transcription orientation. Black boxes in the nucleotide sequences indicate the identity segment of attachment sites.
|
P27 is the yecE gene. We were able to determine the identity segments of the attL and attR sites with the sequence 5'-CACGCAGTTAA-3' (Fig. 2). The yecE gene has not been reported as an integration site for phages before. yecE is located in a gene cluster flanking the aspartate tRNA ligase (aspS) gene in E. coli K-12 and E. coli O157 (6, 49). The gene order is aspS, yecD, yecE, yecN, yecO, yecP, and bisZ, with the latter encoding a biotin sulfoxide reductase 2 (6). For the yec genes, no functions have yet been described. However, orthologous genes have been annotated in Salmonella enterica serovar Typhimurium and Yersinia pestis, and also the deduced amino acid sequence is significantly homologous to a putative cytoplasmic protein found in S. enterica serovar Typhi, Y. pestis, and Vibrio cholerae El Tor (27, 45, 46). YecE contains sequence motifs which categorize this protein as a member of the Pfam family DUF72 (4), which includes proteins of Bacillus subtilis and some members of the Archaea.
DNA characteristics and search for ORFs.
The DNA sequence of the
P27 prophage consists of 42,575 bp starting with attL and ending with attR (Fig. 3; Table 1). The overall G+C content of the
P27 genome is 49.3 mol%.
![]() View larger version (19K): [in a new window] |
FIG. 3. Schematic illustration of the P27 genome. (a) Boxes depict ORFs determined by WebGenemark.hmm. ORFs directed in rightward orientation are drawn above the black line; ORFs directed in the leftward direction are drawn below the line. Boxes in dark gray represent ORFs encoding proteins described in Results. Boxes in light gray mark ORFs, the deduced proteins of which show homology to known proteins which are described only in Table 1. Empty boxes label ORFs encoding proteins without known homologies. (b) Black boxes and vertical bars represent regions with nucleotide sequence homologies to other sequences, including linkers. Homology values are depicted by boxes. late reg., late regulation.
|
|
View this table: [in a new window] |
TABLE 1. Characteristics of P27 ORFs and deduced proteins
|
P27 proteins (Table 1).
Description of selected ORFs.
The deduced protein of L01 is closely related to members of the integrase-recombinase family (3). The highest similarities are found to putative integrases of Stx-converting phages VT1-Sakai (74), CP-933V (49), 933W (52), and VT2-Sakai (37) (Table 1). The deduced amino acid sequence of L02 demonstrates significant sequence similarity to the putative excisionases of phages 933W and VT2-Sakai (Table 1). The protein deduced from ORF L11 is homologous to various repressor proteins of the
CI type (54). The most related protein is the corresponding regulator of phage P22, there termed C2 (71). The
CI repressor inhibits almost all phage gene expression by binding to DNA operator sequences. For this purpose, CI proteins of many bacteriophages contain strong helix-turn-helix (HTH) motifs in their amino-terminal domains (24). Analysis of the deduced amino acid sequence of L11 with the program of Dodd and Egan (17) indicated a strong HTH motif starting at position 17. Similar to the situation in
, the putative cI gene of
P27 is flanked by putative operator sequences OL and OR on both sides. Each of the operators consists of three adjacent suboperators with imperfect dyad symmetry (OL1, bp 5997-6010; OL2, bp 6026-6039; OL3, bp 6061-6074; OR3, bp 6776-6789; OR2, bp 6800-6813; OR1 bp 6832-6845).
The L14 protein is highly homologous to the Roi proteins of bacteriophages HK620, H-19B, VT2-Sa, HK022, HK97, and 933W (16, 33, 37, 43, 52). In HK022, Roi causes phage propagation dependent on the integration host factor (IHF). Roi binds to a sequence within its own coding gene (14), which is closely linked to an IHF binding site. The search with an IHF binding consensus sequence (5'-AATCAANNNNTTA-3') revealed a putative binding site within the L14 coding sequence (base pair positions 8028 to 8040).
DNA replication in
involves a replication complex including phage proteins gp O and gp P and a variety of host proteins, including primase (DnaG) and helicase (DnaB) (69). Phage
contains four 20-bp direct repeat sequences in the coding region of the O gene. These repeats are recognized by the O protein to initiate replication (22). In the O gene coding region of phage VT2-Sakai, two 30-bp direct repeats were identified which are thought to be involved in the initiation of replication in this phage (37). The search of direct repeats in the
P27 region associated with phage replication revealed two 17-bp direct repeats (5'-AAAACATGATCCGCAAG-3') which are located in ORF L17 at positions 10816 to 10832 and 10840 to 10856, respectively. The N-terminal part of the L17 protein contains a putative HTH motif (similar to the SMART family HTH_GNTR), and this region is homologous to a putative replication protein (BAB35699) from E. coli O157:H7 (26). These findings together with the occurrence of a direct repeat let us suggest that the gene product of L17 might be involved in
P27 DNA replication.
L21 encodes a protein homologous to Q-like proteins of phages CP-933N, APSE-1, H-19B, and 933W (Table 1) (43, 49, 52, 72). In
, the Q gene encodes a product of the delayed early region. Q functions as a transcription antiterminator that regulates expression of late-phase genes by modifying transcription complexes by binding to a DNA sequence (qut) near a late promoter pR' (20).
The region from L22 to L31 of the
P27 genome was already the subject of a recent publication and was described previously in detail (42). It includes the genes encoding an adenine-specific DNA methylase (L24), the Stx2e subunits (L25 and L26), and a lysis cassette including two holins (L28 and L29) and an endolysin (L30).
The L34 protein demonstrates weak homology to a putative holin of Bacillus phage
105 (Table 1).
We have shown that
P27 creates cohesive ends during DNA packaging and that these cos sites extend from base pair positions 21806 to 21815 in the prophage. The two ORFs following downstream can be the
P27 terminase components for the small (L35) and the large (L36) subunits according to their location and homology search results. In lambdoid phages, the terminase complex interacts with the portal of the prohead, consisting of an oligomer of the portal protein, for example, 12 subunits in phage HK97 (29). According to Blast results, L38 may be the portal protein of phage
P27. Once packaging is complete, another protein interacts with the DNA-filled head in order to complete the head. In
, one of these is gpW, a small (68 amino acids) highly positive charged protein with an isoelectric point (pI) of 10.8, forming either a plug at the connector to prevent ejection of the DNA, or binding directly to the DNA (50). In the
P27 genome, we detected a corresponding ORF (L37), putatively encoding a small protein (60 amino acids) with a calculated pI value of 9.6 and thereby comparable to the pI of gpW. Despite lacking overall sequence similarity to gpW-like proteins, L37 could be the head-tail joining protein of
P27.
The deduced protein of ORF L40 shows homology to the major capsid proteins of phages found in gram-negative and -positive bacterial species such as Mesorhizobium loti, Caulobacter crescentus (AAK24747), Haemophilus influenzae, E. coli (AAG55944) and Streptomyces spp. (NP_047927). The deduced protein of L39 is homologous to the head maturation proteases of HK022 and HK97 (33), which are necessary to cleave the N termini from assembled head proteins.
L44 encodes a protein which is homologous to the tail sheath protein of phages FLUMU and Mu (Table 1) (68). Phages with contractile tails contain such sheath proteins. The existence of a tail sheath protein in
P27 is indicative of the presence of a contractile tail. This is in contrast to most other lambdoid phages, where flexible noncontractile tails are common. None of the other Shiga toxin-converting phages investigated so far contains such a sheath protein.
The protein determining the length of the tail in phage
is the tape measure protein. By deletion or formation of small duplications of genes encoding tape measure proteins of phage
(28, 34, 35) and lactococcal phage TP901-1 (48), it was shown that the length of the phage tail was proportional to the size of the respective tape measure protein. The L47 protein demonstrates homology to the tape measure protein of a phage detected in Xylella fastidiosa (Table 1). In phage TP901-1, a deletion mutant with a reduction in the length of the tape measure protein shows also a comparative reduction of the tail length (48). With a length of 649 amino acids, the deduced tape measure protein of
P27 has about the same length as the deletion product in TP901-1 and the tail of
P27 has the corresponding length of about 95 nm.
According to BlastP results, the
P27 ORF L54 encodes long side tail fibers and L55, a protein putatively needed for the assembly of the fibers (Table 1). Interestingly, the deduced protein of ORF L56 also demonstrates sequence similarity to phage tail fiber proteins. The sequence similarity is located in the N-terminal part of the L56 protein, and the remaining part is not homologous to known proteins. This raises the question of whether
P27 has two sets of tail fibers to broaden its host range as discussed for the phage K1-5 (59).
Genome structure.
The overall genetic organization of phage
P27 is similar to that of other lambdoid phages. The left-most part of
P27 is devoted to site-specific integration and excision (Fig. 3; Table 1). Downstream there is a region putatively responsible for immunity. Following a cluster of genes with proposed functions in phage replication, the late-phase region starts with a gene encoding a Q-like antiterminator. A lysis cassette and genes for DNA packaging, as well as head and tail genes, are located downstream of the antiterminator gene (Fig. 3). The genes encoding Stx2e are located between the genes encoding the late antiterminator Q and the lysis cassette. Most of the deduced proteins of ORFs L44 to L55 demonstrate homology to proteins of phage Mu and show the same gene order (data not shown). With the exception of the area encoding Stx2e, the
P27 genome has only low-level sequence homology to those of other phages. However, five short sequences attracted attention because they could be detected with high sequence identities in other lambdoid phages. These regions range from 22 to 151 bp, and we refer to them as linkers 1 to 5 (Table 2; Fig. 3).
|
View this table: [in a new window] |
TABLE 2. Length and location of linker sequences of P27 and comparison with those of other bacteriophages
|
P27 and 933W (Fig. 4a) demonstrates regions of high homology exclusively in the linker sequences (Fig. 4) and in the Shiga toxin region, including the preceding tRNA genes. Comparison of
P27 and H-19B (Fig. 4c) demonstrates only low homologies in the stx region. Whereas stx2 (933W) and stx1 (H19-B) share approximately 59% overall nucleotide sequence identity, stx2 (933W) and stx2e (
P27) share 90% overall identity and stx1 and stx2e share 60% overall identity. Interestingly, Stx-negative phage CP-933N (Fig. 4b) and
P27 contain two regions of higher homology. The first one comprises the tRNA genes, and the second one comprises L23 (Fig. 4).
![]() View larger version (9K): [in a new window] |
FIG. 4. Dot blot matrix of the stx2e-flanking region of P27 (genes Q to L32) and corresponding regions of phages 933W (a), CP-933N (b), and H-19B (c). At every point in the matrix where the two sequences are identical a dot is placed. A diagonal stretch of dots indicates regions where the two sequences are similar. Dots with circles represent linkers 3 (left side) and 4 (right side).
|
P27 from STEC 2771/97, described its stx-flanking region, and demonstrated that only 2771/97, out of 11 Stx2e-producing E. coli strains, harbored an inducible Stx2e phage (42). Here we used the sequence information obtained from
P27 and performed PCR and sequencing approaches to characterize the stx2e-flanking regions of the 10 remaining strains mentioned above and three additional stx2e-positive E. coli strains.
Seventeen PCR primers (Table 3) were used in multiple PCR approaches to amplify and analyze the region between Q and R of all strains included in this study. This strategy is depicted in Fig. 5. To confirm the identity of the PCR products, initial sequencing was performed with the same primers as employed for PCR. In 12 of 13 strains, we found an association between stx2e and R. Interestingly, in only 2 of 13 strains, stx2e was linked to
P27 Q-like sequences. In all strains,
P27-like sequences extend from stx2e to the methylase gene (L24).
|
View this table: [in a new window] |
TABLE 3. Oligonucleotide primers used for characterization of stx2e-flanking regions
|
![]() View larger version (30K): [in a new window] |
FIG. 5. Schematic illustration of the PCR and sequencing approach used for characterization of stx2e-flanking regions of different STEC isolates. (a) Map of the stx2e-flanking region of P27 spanning from Q to R. (b) Bars in light gray indicate the regions analyzed by PCR and initial sequencing with the same primers as employed for PCR. Black bars between dotted vertical lines depict regions which have been completely sequenced and used for the phylogenetic analysis depicted in Fig. 6. Numbers on black bars show percent DNA identity of the fragments compared to the corresponding fragment of P27. (c) Size and location of PCR products and applied primers.
|
P27 segment demonstrated a high degree of homology which is depicted in Fig. 5. To investigate clonal relationships among the strains, we performed a phylogenetic tree analysis. The same was performed for a host housekeeping gene. A 415-bp fragment of aroE which encodes the shikimate 5-dehydrogenase was amplified (2), sequenced, and analyzed from all investigated strains. The phylogenetic trees constructed from phage sequences and aroE sequences are similar (Fig. 6) and distribute most of the investigated strains in two large clusters. These clusters are serotype dependent. Whereas cluster 1 contains human isolates with serotype ONT:H- and one isolate with serotype O101:H-, cluster 2 mostly contains isolates of serotype O60:H-. This observation let us suggest that phages and housekeeping genes have undergone a coevolution and indicates an uptake of the Stx2e phages early in the evolution of this particular STEC group.
![]() View larger version (41K): [in a new window] |
FIG. 6. Phylogenetic analyses and comparison of sequences obtained from the 1,123-bp 5' stx2e-flanking region and the housekeeping gene aroE from particular STEC isolates. We used GrowTree (HUSAR software package) to create a phylogenetic tree from a distance matrix created by the program by using the unweighted pair group method with arithmetic averages. The left side contains aroE sequences (a), and the right side contains phage sequences (b).
|
|
|
|---|
Here we present the complete genome sequence of Stx2e-encoding phage
P27 and compare it with that of other lambdoid phages. Albeit the homology to other phages is low on DNA sequence level, the functional structure of lambdoid phages is conserved in
P27.
Bacteriophage
P27 contains a small genome. It consists of only 42,575 bp of DNA compared with 48,502 bp of phage lambda (58 versus 66 ORFs, respectively). There are some major differences in the gene composition of both phages. The longest ORF in
P27 comprises 1,950 bp, whereas the longest
ORF consists of 3,399 bp. However, the average ORF length of 662 to 664 bp is similar in both phage genomes. It appears that
P27 lacks some genes that are present in
. Whereas 30 genes are located between int and Q in phage
,
P27 harbors only 18 ORFs in this region. This region is approximately 3 kb shorter than the corresponding
region. We have not been able to find nin-like or ren-like genes as well as an N-like gene in
P27. Also, the region between the endolysin genes and int, which contains the machinery for head and tail structures, is shorter in
P27 (20,760 bp) than in
(approximately 27,000 bp). However, the region including Q and the endolysin gene is 6,402 bp in
P27 and only 2,541 bp in
. Although the overall genetic structure is conserved in both phages, they differ in the presence or absence of particular genes or gene groups.
Lambdoid bacteriophages are composed of functional genetic modules which carry the information for recombination, replication, host lysis, head and tail proteins, and assembly of phage particles (11). These genetic modules can be exchanged between lambdoid bacteriophages by homologous or illegitimate recombination. Hendrix et al. (30) have hypothesized that all lambdoid bacteriophages existing in nature in different habitats may serve as a gene pool for the development of new bacteriophages. Lambdoid phages appear to be genetic mosaics with different members sharing different sets of genes (7, 11, 66). The genome of lambdoid phages has been divided into 11 major segments of functionally clustered genes (13). Thus, they have a modular type of genomic organization. The order of these modules is identical in different lambdoid phages, but the corresponding module-encoded proteins can be heterogeneous.
We detected five short sequences in the range from 22 to 151 bp in the
P27 genome which were also found in other phages. Casjens et al. (13) suggested that such short regions of homology (e.g., conserved sites such as promoters or terminators) are recombination hot spots. According to Shen and Huang (61), for homologous regions smaller than 23 to 27 bp, recombination becomes inefficient. Thus, the five DNA segments of
P27 regions, referred to as linkers 1 to 5, may be large enough to be involved in homologous recombination between different phages. A similar phenomenon has been suggested for lactococcal bacteriophages (7). Four of the linkers are located between functional modules, at sites where recombination between two phages would make sense in order to move whole functional modules. A new phage arising after exchange of complete functional modules is likely to form a viable phage: interacting proteins are transmitted together and genes that encode DNA binding proteins remain linked with the DNA elements they are adapted to. For example, a recombination event at linker 3 would transfer the late regulator Q together with its site of action (qut) and a functional lysis cassette to a new descendant. A recombination at linker 4 would transfer a cos site together with the DNA packaging machine and phage head and tail proteins. On the other hand, an exchange at linker 5 would combine a phage receptor specificity carried in the phage tail region with a new integration site, attP, for prophage integration.
We suggest that exchanges at these linkers have occurred in the evolution of phage
P27. Evidence for such events may be found in the G+C distribution of the
P27 genome. From attL to linker 2, the G+C content is 44.29 mol%; from linker 2 to linker 3, the G+C content increases to 50.57 mol%. In the region between linker 3 and linker 4, the G+C content decreases to 45.93 mol%, and the last area following linker 4 comprises a G+C content of 52.05 mol% (data not shown).
Due to the fact that experiments failed to obtain Stx2e-transducing phages from Stx2e-producing E. coli strains, this variant was thought to be chromosomally encoded rather than phage borne (38, 73). With the isolation of phage
P27 (42), this opinion had to be modified. E. coli strain 2771/97 was the only Stx2e producer out of 11 test strains capable of producing an Stx2e-converting phage (42). We were interested in the genetic background of stx genes from the remaining 10 strains. Therefore, we investigated stx2e-flanking regions of these 10 and another 3 stx2e-containing strains. Using sequence information obtained from phage
P27, we analyzed the linkage of stx2e genes and
P27-like genes by PCR. In all but one strain (55/89), the stx2e genes are located at the same distance, upstream of a
P27-like endolysin, representing a part of the phage lysis cassette. In the other direction, two strains show a linkage between a
P27-like late antiterminator Q and the toxin genes (24059/97 and E57). In all other stains,
P27-like sequences can at least be traced to the start of the DNA methylase. Thus, it becomes clear that stx2e genes seem to be generally located on the genome of
P27-related phages at the same positions shown for all other toxin variants investigated so far.
The reason why we could not isolate infectious phage particles from most of the strains is at present not known. It is conceivable that parts of the respective prophages are deleted, as shown in Shigella dysenteriae (39), that insertion elements prevent phage propagation as suggested for VT-2 Sakai (37), or that defective phage particles are produced that are not able to successfully infect indicator strains. We sequenced a DNA fragment of 1,123 bp from all stx2e-positive strains which contains a part of the methylase gene and the tRNA coding area. This was performed in order to investigate the clonal relationship between the phage sequences found upstream of the investigated stx2e genes. For most of the strains, the sequences are very similar. Interestingly, the resulting phylogenetic tree correlates well with that of aroE sequences. The similar phylogenetic profile lets us suggest that the prophages were acquired by their bacterial host early in the evolution of this specific STEC group, and the related trees indicate specific adaptation of phages to their hosts. STEC can be isolated from a large number of sources. They seem to have adapted to specific ecological niches. Since stx genes seem to occur ubiquitously in E. coli, selection advantages of stx carriers may be postulated. Some studies demonstrated that mobile genetic elements can impose fitness costs on the bacteria carrying them. Therefore, it may not be plausible that genes on mobile genetic elements, once they have been acquired, will be sequestered as chromosomal genes. Following a hypothesis of Smith (64), mobile genetic elements of obligate pathogenic bacteria prevent a population of pathogens from losing their pathogenicity genes and help to regain the virulence of so-called cheater strains (64). Future work on Stx phages is needed to fully understand the role of these mobile genetic elements in STEC evolution.
|
|
|---|
and higher organisms. Blackwell Scientific Publications & Cell Press, Cambridge, Mass.
This article has been cited by other articles:
| |||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Copyright © 2009 by the American Society for Microbiology. For an alternate route to Journals.ASM.org, visit: http://intl-journals.asm.org | More Info»