Previous Article | Next Article ![]()
Infection and Immunity, November 2003, p. 6298-6306, Vol. 71, No. 11
0019-9567/03/$08.00+0 DOI: 10.1128/IAI.71.11.6298-6306.2003
Copyright © 2003, American Society for Microbiology. All Rights Reserved.
Gordon Stevenson, and Peter R. Reeves*
School of Molecular and Microbial Biosciences, University of Sydney, Sydney, Australia
Received 20 December 2002/ Returned for modification 31 March 2003/ Accepted 5 August 2003
|
|
|---|
|
|
|---|
The pathogenesis of EIEC and Shigella involves invasion of mucosal epithelial cells of the large intestine (1, 34). Virulence in Shigella strains is dependent on the presence of a large 210- to 230-kb plasmid. The virulence plasmids pWR100 in S. flexneri serotype 5, pMYSH6000 in S. flexneri serotype 2a, and pSS120 in S. sonnei, together with those of other Shigella bacteria, have been shown to carry determinants for invasiveness and the ability to cause disease. These large plasmids are collectively termed pINV plasmids (13), which are also present in EIEC strains. The cell invasion capacity of Shigella-EIEC is determined by a cluster of 38 genes within a 32-kb segment of the pINV plasmid, often referred to as the entry or invasion region, which includes genes for invasins, molecular chaperones, motility, regulation, and a specialized type III secretion apparatus (32).
In a previous study three pINV genes, ipgD, mxiC, and mxiA, were sequenced from strains representing a range of Shigella isolates and two EIEC isolates and showed that the plasmid exists in two related but clearly distinct sequence forms (pINV A and pINV B) (20). The phylogenetic relationships of the plasmid and chromosomal genes of Shigella strains are largely consistent. The cluster 1 and cluster 3 strains tested have pINV A and pINV B plasmids, respectively. However, of the three cluster 2 organisms, S. boydii serotypes 9 and 15 have pINV A while S. boydii serotype 11 has a pINV B plasmid. S. dysenteriae serotypes 8 and 10 and S. sonnei and also EIEC strains, none of which fall into the three clusters, were all found to have either pINV A or pINV B, except for S. dysenteriae serotype 1, which has a distinct pINV form. The outlier organisms other than S. dysenteriae serotype 1 must have obtained the plasmid relatively recently, after divergence of the two forms.
The invasion plasmid from S. flexneri serotype 5 strain M90T has been sequenced by two groups (9, 50). The two sequences are essentially identical but use different numbering and often different gene names. We use the numbering and gene names of Buchrieser et al. (9). The plasmid contains 93 segments, totaling 58 kb, which are homologous to known or putative insertion sequences (ISs), suggesting a remarkable history of IS-mediated acquisition of DNA. Analysis of the GC content, position, and function of non-IS-related genes indicates that the plasmid contains blocks of genes of various origins. There are three partition systems, two functional and one remnant, and the GC content of the replication system genes is different from that of the partition genes, indicating yet another source for this region (9). It appears that pINV was assembled from several different plasmids (9).
In this study we sequenced the non-IS-related coding regions of an A form of pINV from S. flexneri serotype 6 (designated the F6 plasmid) to compare with the sequenced S. flexneri serotype 5 pINVB form (designated the F5 plasmid). The divergence ranged from 0 to 5.56%, with most genes differing by 0 to 1.0%, consistent with divergence levels in housekeeping genes. However, one group of genes and a few isolated genes showed much higher levels of divergence. The comparison indicated that divergence in these genes is driven by selection pressure, which aids our understanding of virulence variation and mechanisms of pathogenesis.
|
|
|---|
DNA sequencing strategy. pINV plasmid DNA was prepared as described previously (42). Primers were designed to amplify overlapping 1-kb segments for each non-IS region based on S. flexneri serotype 5 pINV (9).
PCR and DNA sequencing. The following cycling profile was used: initial denaturation at 94°C for 2 min and then 35 cycles of 94°C for 15 s, 55 to 60°C for 30 s, and 72°C for 1 min with a final extension at 72°C for 5 min. The annealing temperature was between 55 and 60°C depending on primer pair. PCR products were purified with the Wizard purification system (Promega). Samples were sequenced using dye terminator technology (Perkin-Elmer) through the Sydney University and Prince Alfred Hospital Macromolecule Analysis Center and with an automated 377 DNA sequencer (Applied Biosystems).
DNA sequence editing and analysis. Sequences were edited using the PHRED, PHRAP, and CONSED programs (http://bozeman.mbt.washington.edu) (12). Sequence comparisons were done using MULTICOMP (39). Open reading frame (ORF) analysis and database searches were done using ORF finder and BLAST search facilities at the National Center for Biotechnology Information (http://www.ncbi.nlm.nih.gov). The synonymous and nonsynonymous substitution rates were calculated using a program provided by Li (23).
|
|
|---|
|
View this table: [in a new window] |
TABLE 1. Gene coding regions of pINV F6
|
|
View this table: [in a new window] |
TABLE 2. Sequence variation of pINV between F6 and F5 and between F2A and F5 plasmids
|
![]() View larger version (18K): [in a new window] |
FIG. 1. Plot of percentage nucleotide differences between F5 and F6 pINV genes. The nonsynonymous percentages were plotted above the percentage of total variation. The genes on the x axis are in map order, allowing clustering of variation to be visualized. A bar above a group of gene names indicates that these genes are in the same inter-IS region (Table 1).
|
Values of Ks, Ka, and the ratio Ks/Ka are also given in Table 2. Ks is a measure of the rate of synonymous substitutions, being base substitutions without amino acid substitution, and Ka is a similar measure for substitutions which cause amino acid substitution. Synonymous substitutions are generally neutral or of very low adaptive value. Many amino acid substitutions are deleterious and some are neutral, but, critically, adaptive mutations are almost always in the latter category (18). The ratio Ks/Ka is often used as an indicator of neutral or adaptive variation. All genes are subjected to purifying selection to remove deleterious mutations, and it appears that this applies to most nonsynonymous substitutions, giving for an E. coli-Salmonella enterica comparison an average Ks/Ka ratio of 10 to 20 for most genes (45). Genes subject to selection for change have a lower value for Ks/Ka, as selection for adaptive amino acid substitutions increases Ka, which lowers the Ks/Ka ratio. In contrast, when strong conservation of amino acid sequences is required for biological function, Ks greatly exceeds Ka.
Of the 80 genes sequenced, 11 have only synonymous substitutions and 9 have only nonsynonymous substitutions, while for the others Ks/Ka ranges from 0.37 to 8.8. For nine genes Ks/Ka is under 1.0. Apart from the genes with only synonymous substitutions, there are only three genes, spa24, ipgB2, and spa29, with a ratio above 8, which is close to the range for housekeeping genes. Spa24 and Spa29 are putative inner membrane proteins (6). For the majority (35 genes) the ratio is between 1 and 8, and it is difficult to give a cutoff that indicates which are under selection pressure for change. For this discussion we use the value of 4.0 as the cutoff, which we consider to be conservative, given that most genes have Ks/Ka ratios between 10 and 20 in E. coli-S. enterica. We categorize the 30 genes with Ks/Ka ratios between 1 and 4 as being under positive selection pressure and the 5 genes with Ks/Ka ratios between 4 and 8 as being intermediate and not easily categorized. Therefore, of the 80 genes, a total of 48, including 9 with nonsynonymous changes only, 9 with Ks/Ka ratios less than 1, and 30 with Ks/Ka ratios between 1 and 4, appear to be under positive selection according to the Ks/Ka ratio. This is a very high proportion, indicating the adaptive nature of invasive virulence and its functional complexity. For many genes the total number of substitutions is very low, but the overall picture is of many genes under selection pressure for change.
Genes for secreted proteins. The patterns of variation in secreted proteins are of particular interest. Many of these proteins are effector molecules interacting with host cells, and such proteins have been reported to be more diverse in other type III secretion systems (19, 33). There are at least 25 genes encoding proteins secreted by the Mxi-Spa secretion apparatus (9), of which we were able to sequence 23. The average sequence difference is 0.87%, which is not much higher than average for the genes sequenced. However, five genes (ospB, ospC2, ospD2, ospF, and mxiL) have only nonsynonymous substitutions and another four (ipaH9.8, ospC1, ipaA, and ipaD) have a Ks/Ka ratio less than 1.0, an indication that these genes are under strong positive selection pressure for amino acid variation.
The ipa genes (ipaA to ipaD) encode proteins which are invasins (27, 34). They are humoral immunogens, the predominant antigens recognized by sera from human patients and animal models (15, 38, 49), and so it is not surprising to find that all are under positive selection pressure with Ks/Ka ratios for ipaA and ipaD less than 1.0 and for ipaC and ipaB around 2.5. IpaC and IpaB proteins are inserted into the host membranes to form a pore for translocation of other invasins into target cells (5). Variation in these genes may be due to adaptation for optimal interaction with the host cell.
Two of the secreted proteins are toxins. OspD2 is annotated as shET2-2, an enterotoxin, by Venkatesan et al. (50), and OspD3 (SenA) is a known enterotoxin (29). Both genes appear to be under positive selection. ospD2 has no synonymous substitutions with a single amino acid substitution, and ospD3 has a Ks/Ka ratio of 1.28. However, the enterotoxin gene ospD3 (senA) is found in only 75% of EIEC strains and 83% of Shigella strains (29).
However, three genes (ipgC, ipgA, and spa32) encoding secreted proteins have no nonsynonymous differences. Spa32 is involved in regulation or assembly of the secretion apparatus (24, 48). IpgC is a cytoplasmic chaperone for IpaC and IpaB (27). The function of IpgA is not precisely known, but it is a putative chaperone. These proteins interact with other pINV proteins rather than host components, and this probably accounts for the lack of diversifying selection acting on these particular secreted proteins.
Genes for essential virulence functions (nonsecreted proteins). There are a number of genes for nonsecreted proteins that are essential in invasion and show strong selection pressure for amino acid variation.
Three genes, icsA, virK, and icsP, are needed for Shigella to move within the host cell (31). All three have a low Ks/Ka ratio. IcsA is an outer membrane protein required for actin-based movement within the host cell. There are nine substitutions in icsA with eight changing the amino acid. VirK is required for proper production or localization of IcsA, although its precise function is not known. virK shows high levels of divergence. IcsP is an outer membrane protease for cleavage of IcsA. Four of the five changes in icsP are nonsynonymous.
The Shigella type III secretion system is composed of a basal body and an external needle (6, 47). The needle interacts with the host cell to deliver effector proteins. MxiH and MxiI are the major and minor needle components, respectively. The mxiH gene, only 252 bp in size, has the highest variation (5.556%) with a Ks/Ka ratio of 2.22, while mxiI has two changes with one being nonsynonymous. Variation in these genes may confer a selective advantage because of direct interaction with host cells. It has been observed that, when MxiH was overexpressed, the type III secretion machinery protruded longer needles than did the wild type, and the bacteria invaded the host cells much more efficiently than the wild type did (47). However, genes encoding proteins for the basal body, mxiD, mxiG, and mxiJ, also show low Ks/Ka ratios, for which we have no explanation.
Virulence-associated regulatory genes under purifying selection. In contrast to the genes discussed above that are under selection pressure for change, several genes known to be associated with regulation of virulence are strongly conserved. spa32 has no variation at all while virF, virB, ipgC, and mxiE all have only synonymous substitutions. Spa32 and IpgC have been discussed above. VirF is a transcription activator of the AraC family required for expression of genes of the entry region and icsA (31, 41). It positively controls synthesis of VirB, a DNA-binding protein which binds to promoter regions of virulence genes for activation of transcription (2). MxiE is a transcriptional regulator (17, 26) for at least six secreted pINV proteins (OspB, OspC1, OspE2, OspF, VirA, and IpaH9.8). This observation of regulatory genes being subject to purifying selection also gives indirect support to the conclusion above that the changes observed in other genes are a result of positive selection pressure, rather than accumulation of mildly deleterious mutations.
Genes absent in the F6 plasmid. A few genes were confirmed as absent in the F6 plasmid by hybridization (data not shown) after PCR failed to generate a product. These include sepA, orf85, orf85b, orf94, stbA, and stbB. The whole stbA and stbB region is likely absent. We discuss stbA and stbB in the replication section and sepA below, but the functions of the other three genes are unknown.
It is interesting that sepA is absent, as an S. flexneri serotype 5 sepA mutant exhibited attenuated virulence in the rabbit ligated ileal loop model (3). SepA is a serine protease of the immunoglobulin A1 protease family and the major protein secreted by S. flexneri serotype 5 (3). SepA is not required for entry into cultured epithelial cells, nor for intercellular dissemination (3), but might be involved in invasion and destruction of the host intestinal epithelium (4). We surveyed some other Shigella strains and found sepA to be variably present. Among pINV A plasmids it is present in those of S. dysenteriae serotype 10 and S. boydii serotype 9 but absent from those of S. boydii serotype 14 and S. dysenteriae serotype 3, while it is absent from the S. sonnei plasmid, the only other pINV B form tested (data not shown).
There are three genes, impCAB, that are known to be absent in S. flexneri serotype 5 (50) but present in some other strains (40). Southern hybridization showed them to be absent in S. flexneri serotype 6 (data not shown).
Comparison of replication, partition, and transfer regions. Several regions of the plasmid are involved in replication, partition, stable maintenance, or transfer (9, 50). In general genes in these regions have lower levels of difference than do those in other regions.
We sequenced the whole replication region which includes oriR (the origin of replication), repA (required for initiation of replication at oriR), repB (copB) (repressor of transcription of repA), tapA (upstream of repA, encoding leader peptide required for translation of repA), and copA (coding for antisense RNA that binds to leader region of repA for copy number control). The only sequence differences were four single base substitutions in the oriR region of 362 bases. We also sequenced downstream of oriR to the IS. Substantial differences were observed from the start of repA4, a pseudogene. The F5 plasmid has a segment of 220 bp after repA4, followed by IS 91.04 and part of IS 1294, whereas the F6 plasmid has a unique 577-bp segment after repA4 followed by part of IS 1294, which precedes the IS 1294 in the F5 plasmid by 285 bp. Either this part of the sequence has undergone recombination, or one or both have undergone rapid change. However, there appear to be no functional genes in either the F6 or F5 sequence.
(i) Plasmid partition genes. Low-copy-number plasmids generally have more than one system for stable maintenance of the plasmid (14). These may affect partitioning or cause postsegregational killing (PSK), the latter being a "fail-safe" mechanism (14). pINV F5 has genes for two partition systems, parAB and stbAB, and two PSK systems, ccdAB and mvpTA (9, 50). Note a possible source of confusion through use of the name stb for different genes. The genes now known as mvpA and mvpT were previously called STBORF1 and STBORF2. The stbA and stbB genes referred to in this paper are the second of two partition systems and were first observed by Buchrieser et al. (9) when the plasmid genome was sequenced.
The parA and parB genes of S. flexneri serotypes 5 and 6 are identical, but the stbAB genes are absent in S. flexneri serotype 6. The stbAB genes are also absent in S. sonnei but present in S. dysenteriae serotypes 1 and 4 and S. boydii serotype 1, with presence or absence found in both pINV A and pINV B forms. For the two PSK systems in S. flexneri serotype 5, the ccdA and ccdB genes are identical in S. flexneri serotype 6, while for mvpT and mvpA, which are encoded on the opposite strand of trbH, mvpT has only one substitution (nonsynonymous), and mvpA has two substitutions (one nonsynonymous).
The only reported functional difference between the two pINV forms is incompatibility. An early study (25) showed that the Shigella pINV plasmids belong to two incompatibility groups, later found to correspond to pINV A and pINV B (20). The complete sequence comparison allows us to look for substitutions affecting incompatibility. There are several factors that determine plasmid incompatibility. pINV belongs to the IncFII family (46), in which the major incompatibility determinant is the copA (inc) RNA (14). However, copA is identical in S. flexneri serotypes 5 and 6 and so cannot be the determinant of their incompatibility difference.
We consider next the possibility that mvpAT is responsible for the difference in compatibility. Compatibility was tested by introducing a plasmid (pMSYH6610) with kanamycin resistance and the compatibility region of the S. flexneri serotype 2a plasmid into Shigella strains with selection for kanamycin. The mvpAT system of the S. flexneri serotype 2a pINV B is present in plasmid MSYH6610 and in the compatibility test is retained in all surviving cells. When MSYH6610 is introduced into Shigella strains with kanamycin selection, the pINV plasmids of S. flexneri serotype 2a, 1b, or 2b (and presumably S. flexneri serotype 5) are lost, whereas pINV B of S. flexneri serotype 6 is stably retained. One would expect plasmids with the same copy number system to be lost under these conditions unless there was a PSK system operating on S. flexneri serotype 6 and a sufficient number of cells to retain two copies for these to maintain growth in the presence of kanamycin. MvpT is the toxin and MvpA is the antidote (44), so survival of pINV F6 indicates that it has an MvpT toxin that is not inactivated by the S. flexneri serotype 2a MvpA. It seems at first sight unlikely, as each differs from the S. flexneri serotype 2a form by only one amino acid residue. However, the F plasmid homologue is compatible with S. flexneri serotype 2a pINV (same form as that of S. flexneri serotype 5), and there are only four amino acid differences between MvpA of the F plasmid and that of S. flexneri serotype 5 and two differences in MvpT (37). One or more of these substitutions must determine the difference in incompatibility between F and pINV B. It is also possible that pINV of S. flexneri serotype 6 has a PSK system that is not present at all on pINV from S. flexneri serotype 2a, which would not have been detected by our use of only S. flexneri serotype 5-based PCR primers.
(ii) The plasmid transfer (tra) region. It is known from tests on several S. flexneri strains that pINV is unable to initiate conjugation (43). However, there is a partial tra region in pINV F5, including the trbH, traI, traX, and finO genes and part of traD. This is only about 25% of the total tra operon as seen in the K-12 F factor, and as virtually the whole is needed for function, the remaining genes presumably have no function. We obtained sequences from trbH, traI, traX, and finO genes. There are four substitutions in trbH, of which three are nonsynonymous, giving a ratio of 0.613, but as trbH presumably has no role in the plasmid, divergence may result from selection on the mvpAT genes on the opposite strand. traI is nonfunctional in both F5 and F6. In comparison with the traI gene of the F plasmid, there are a deletion of seven bases and an insertion of seven bases in F5 and F6, respectively, in a region flanked by two palindromic sequences that disrupted the reading frame. traI also has 21 point mutation substitutions between F5 and F6. The traI gene in F6 suffered further damage with a point mutation of C to T at position 956, resulting in a stop codon. This indicates that the traI gene was inactivated independently after the divergence of the two plasmids. traX has a single synonymous substitution while finO has three substitutions, of which two are nonsynonymous.
ORFs of unknown function. Ten of the ORFs sequenced have no known function (orf13, orf22, orf131a, orf131b, orf136, orf137, orf186, orf201, orf47, and orf48), and two (orf169a and orf169b) are probably involved in plasmid maintenance (9). orf48, orf131b, and orf136 have no substitutions while all or the majority of substitutions in orf131a, orf169a, and orf169b are synonymous. However, orf13 and orf47 are clearly under positive selection pressure. All five substitutions in orf13 are nonsynonymous, and two of the three substitutions in orf47 are also nonsynonymous with a Ks/Ka ratio less than 1. These ORFs may play an important role in virulence, making them candidates for functional analysis.
The only gene to have suffered a frameshift mutation is orf137, which is flanked by IS elements on both sides. It may be part of a gene that has been disrupted by an IS element, in which case the frameshift observed could be part of a continuing process of gene degradation.
Comparison of two pINV B plasmids. The pINV B plasmid of S. flexneri serotype 2a (plasmid F2A) has now been sequenced as part of a genome sequence (16). We compared it with the B-form pINV F5 plasmid to enhance our understanding of pINV evolution (Table 2). S. flexneri serotypes 2a and 5 belong to the same cluster based on chromosomal gene trees (36), and divergence is relatively recent. All 80 genes that we sequenced in F6 are all present in F2A. Of those, 37 are identical in F5 and F2A, 17 have a single base substitution, and 24 have from two to four substitutions. traI and ospC3 have much bigger differences. traI has eight single nucleotide substitutions, a 10-bp deletion, and an inversion at a palindromic sequence. ospC3 has 23 substitutions, with part of the sequence being similar to ospC2, most likely as a result of inter- or intrachromosomal recombination. If we consider traI and ospC to have undergone 10 events and 1 event, respectively, we have a total of 76 polymorphisms (65 if traI and ospC are excluded), to be compared with 494 polymorphic nucleotide sites in the F5-F6 comparison. The much greater divergence of F5 and F6 relative to F5 and F2A over 80 genes strongly supports the recognition of two forms of pINV, based originally on the study of only three genes (20).
We also looked at the number of synonymous and nonsynonymous substitutions between F2A and F5. The 11 genes that are identical in F5 and F6 have between them only one (nonsynonymous) substitution in F2A. The 11 genes with synonymous changes only in F5-F6 have 10 substitutions with three being nonsynonymous. The nine genes that have only nonsynonymous F5-F6 substitutions have a total of 10 F5-F2A substitutions, all nonsynonymous. For the genes with a Ks/Ka ratio of 4 or less, we excluded ospC3 from the calculation, but for the 38 other genes in this category 250 of 402 F5-F6 substitutions and 36 of 59 F5-F2A substitutions are nonsynonymous. For the genes with a Ks/Ka ratio above 4 in the F5-F6 comparison, 12 of 37 F5-F6 substitutions are nonsynonymous, but two of two F5-F2A substitutions are nonsynonymous. The pattern of synonymous and nonsynonymous substitutions in the F5-F2A comparison of two pINV B plasmids mirrors the pattern observed in the F5-F6 comparison with the possible exception of the last category where the number of F5-F2A substitutions is too low to be meaningful. It appears that genes thought to be under positive selection in the F5-F6 comparison have similar characteristics in the F2A-F5 comparison.
Variation in the invasion gene homologues in S. enterica. The invasion ability of S. enterica is determined by a cluster of genes many of which are homologous to the invasion genes of pINV. Eight have been compared in multiple strains of S. enterica (7, 22). The Ks/Ka ratio of invE (pINV mxiC), invA (mxiA), spaP (spa24), and spaQ (spa9) is similar to that of housekeeping genes while invH (no pINV homologue), spaM (spa13), spaN (spa32), and spaO (spa33) have a much lower Ks/Ka ratio. It was concluded (7, 22) that proteins that are membrane bound or membrane associated are relatively conserved whereas those that are exposed to the extracellular environment are hypervariable, reflecting the action of diversifying selection. Our data with a much larger number of genes also show that in general proteins exposed to the extracellular environment are more variable, but it is interesting that, for the seven homologues of the S. enterica genes used, the genes can differ in Ks/Ka ratio in E. coli and S. enterica. In fact only three genes, spaP (spa24), spaQ (spa9), and spaO (spa33), are consistent in the two species. spaM and spaN have a low Ks/Ka ratio in S. enterica, while there is only one (synonymous) change in spa13 and no change in spa32 between F5 and F6 pINV plasmids, and mxiA and mxiC have low Ks/Ka ratios of 3.55 and 1.97, respectively, while those of the S. enterica homologues are similar in this regard to housekeeping genes.
Spa32 and SpaN share only 15% amino acid identity, but the function of Spa32 can be complemented by SpaN (48), whereas there is known functional difference between these homologues (11). Spa32 is implicated in the release of the Ipa proteins but not their surface presentation. In contrast, SpaN is involved in both the secretion and the surface presentation of the Sip proteins (10). However, the earlier observation that Spa32, unlike SpaN, is not secreted to the culture supernatant (51) may not be entirely correct, as it was recently found that Spa32 is translocated through the type III secretion machinery into the medium (48). This may explain the difference in selection pressures between Spa32 and SpaN. It seems likely that there are functional differences for the other three homologues to account for the opposing trends.
Concluding comments. The pINV plasmid of Shigella-EIEC occurs in two major forms as shown earlier by distribution of sequence forms of three genes sequenced from a range of strains. We have now sequenced 80 genes from the A-form plasmid, pINV F6, enabling us to make a detailed comparison of representative pINV A and pINV B plasmids. The three genes, ipgD, mxiA, and mxiC, studied previously (20) are shown to be representative of the range in level of difference between pINV A and pINV B plasmids. Generally genes for plasmid maintenance vary less than virulence-associated genes, those encoding secreted proteins that interact with host cells being particularly variable.
Many genes appear to be under selection pressure for change. In addition some of the F5 genes are absent in F6. Some of the plasmid-encoded proteins are immunogens, and for them selection may be in part at least for avoidance of the host immune system. More interesting is the possibility that differences may reflect ongoing adaptation to the Shigella niche, or selection for variants of that niche. Shigella strains infect only humans and as a group are a major human pathogen. Their mode of pathogenesis is thought to have become effective only 35,000 to 270,000 years ago, and its antecedents are not known. The high proportion of genes under selection pressure in the pINV plasmid that encodes many of the functions characteristic of this mode of pathogenesis may reflect continuing adaptation to achieve an optimal interaction with the human host. It is also possible that the many serotypes are adapted to different variants of the general niche occupied, and that the selection relates to competition between different Shigella clones which differ in details of their adaptation to the niche. The observation that senA (29), a toxin gene apparently under selection pressure for change, and sepA, known to play a role in virulence in S. flexneri serotype 2a (3, 4), are absent in some Shigella strains supports the hypothesis that Shigella strains differ in the details of their adaptation to their mode of pathogenesis. A similar explanation may account for the observation that genes under selection pressure differ between E. coli and S. enterica.
As noted previously (20), not much has been reported on variation in virulence of Shigella clones, but only a few serotypes are responsible for many of the cases and there are also perceived differences between Shigella and EIEC clones. It could be interesting to relate variation in virulence within and between Shigella and EIEC strains to variation in the genes under selection pressure and genes present or absent in the pINV plasmids. This may help us to better understand the pathogenesis and epidemiology of Shigella-EIEC infections. Comparative functional studies of the genes that differ may point to the factors that give S. flexneri, S. sonnei, and S. dysenteriae serotype 1, for example, their distinctive disease characteristics and global distributions.
Incompatibility between the two plasmids was deduced to be due to very small differences: a single amino acid substitution in mvpA being the only difference likely to be involved. Its role in gene flow-recombination between the two forms and plasmid survival is not clear.
pINV plasmids are nonconjugative (25, 43). The substitution patterns in the incomplete tra region indicate that the remaining genes might be functional. This raises questions of transfer of pINV to other E. coli strains since many such events are presumed to have occurred to give rise to Shigella-EIEC strains of diverse genetic backgrounds (20). The remaining tra region genes may be involved in this process, complemented by genes on other plasmids in the recipient cells, leading to pINV transfer.
We thank the anonymous referees for comments and suggestions.
Present address: School of Biotechnology and Biomolecular Sciences, The University of New South Wales, Sydney, New South Wales 2052, Australia. ![]()
|
|
|---|
This article has been cited by other articles:
| ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Copyright © 2009 by the American Society for Microbiology. For an alternate route to Journals.ASM.org, visit: http://intl-journals.asm.org | More Info»