Previous Article | Next Article ![]()
Infection and Immunity, July 2006, p. 3715-3726, Vol. 74, No. 7
0019-9567/06/$08.00+0 doi:10.1128/IAI.00239-06
Copyright © 2006, American Society for Microbiology. All Rights Reserved.
Departments of Pathobiology,1 Medicine, University of Washington, Seattle, Washington,2 Department of Obstetrics, Gynecology, and Reproductive Sciences, University of California, San Francisco, San Francisco, California,3 Atelier de Bioinformatique and Institut Pasteur, Paris, France4
Received 13 February 2006/ Returned for modification 23 March 2006/ Accepted 7 April 2006
|
|
|---|
|
|
|---|
At 580 kb, the M. genitalium G-37T genome is one of the smallest of any self-replicating cellular organism (9, 17, 49), yet 4% of the fully sequenced genome consists of repeated elements (43) designated as MgPa repeats or MgPar sequences (17). These repeat sequences are so named because they are composed of partial, incomplete copies of the mgpB gene, which encodes the MgPa protein that mediates attachment to various cell types, including the ciliated epithelium of human fallopian tubes (8). Although there is only a single mgpB expression site, there are nine distinct MgPar sequences found throughout the genome (17). The MgPa protein is antigenic and elicits an immune response in animal models (25, 37) and in women with tubal factor infertility (6). The presence of MgPar sequences and the immunogenicity of the MgPa protein suggests that that recombination between mgpB and MgPar sequences could be a mechanism that promotes MgPa antigenic diversity, potentially allowing for persistence of infection through mechanisms such as immune evasion (10, 17, 42). This hypothesis is supported by the heterogeneity of mgpB sequences observed between different M. genitalium clinical isolates in which variability could be explained by recombination between the mgpB expression site and MgPar sequences (42); however, to date, the extent of mgpB heterogeneity within a single M. genitalium strain has yet to be evaluated.
In the current study, we further define the architecture and sequence homology of the MgPar regions in relation to the mgpB gene, determine the heterogeneity of mgpB within the type strain of M. genitalium G-37T cultured in Seattle (G37-S), and demonstrate extensive mgpB sequence variability within a single M. genitalium strain in a persistently infected woman. The results of our analysis are consistent with extensive intrastrain mgpB heterogeneity both in vitro and in vivo and suggest that short sequences within the mgpB and MgPar sequences participate in recombination. These findings further suggest that this mechanism contributes to the persistence and survival of M. genitalium in the human reproductive tract.
(These data were presented, in part, at the 16th biennial meeting of the International Society for Sexually Transmitted Disease Research [ISSTDR], Amsterdam, The Netherlands, 10 to 13 July 2005.)
|
|
|---|
|
View this table: [in a new window] |
TABLE 1. Genome location, previous nomenclature, and accession numbers for the complete MgPar sequences of M. genitalium G-37T
|
Amplification of mgpB from M. genitalium G37-S. Designated regions of the mgpB gene were PCR amplified using the conditions and primers listed in Table 2. For consistency throughout the report, all base pair locations are listed relative to the presumed mgpB translational start, rather than the start of the MgPa operon (containing mgpA, mgpB, and mgpC), the numbering system that has been used previously (42).
|
View this table: [in a new window] |
TABLE 2. Primers used to amplify various regions of the mgpB gene
|
Cloning of PCR products, sequencing, and in silico analyses. Following PCR, resultant amplicons were concentrated using the MinElute PCR purification kit (QIAGEN, Valencia, CA) and ligated into the pCR2.1-Topo vector (Invitrogen, Carlsbad, CA) by following the manufacturer's standard protocols. Ligation reaction mixtures were transformed into TOP10-competent Escherichia coli cultures (Invitrogen), which were plated onto Luria agar media (36) containing 100 µg/ml ampicillin and 40 µg/ml 5-bromo-4-chloro-3-indolyl-ß-D-galactopyranoside (X-Gal). Well-isolated, white colonies from each transformation were inoculated into 2.5 ml of Luria broth (36) with 100 µg/ml ampicillin. Following overnight incubation at 37°C shaking, plasmids were isolated from the E. coli cultures using the QIAprep spin miniprep kit (QIAGEN), restriction enzyme digested with EcoRI, and analyzed by agarose gel electrophoresis to confirm insertion of the PCR product into the cloning vector. Plasmids containing inserts of the appropriate size were sequenced at the University of Washington Biochemistry DNA Sequencing Facility (http://depts.washington.edu/biowww/dna/) in the forward and reverse directions using the M13F or M13R primer provided in the Topo cloning kit (Invitrogen). The ClustalW program (http://www.ch.embnet.org/software/ClustalW.html) was used to align the resulting sequences, which were manually adjusted to correct for sequencing and/or alignment errors. To determine if mgpB sequences were heterogeneous at the amino acid level or contained in-frame stop codons, all nucleotide sequences were submitted to the European Bioinformatics Institute's (EMBL-EBI) EMBOSS Transeq program (http://www.ebi.ac.uk/emboss/transeq), where the DNA sequence was translated using both the mycoplasma genetic code (in which UGA encodes a tryptophan instead of a translational stop) (26) and the appropriate reading frame; the predicted amino acid sequences were then aligned using the ClustalW program referenced above. Because alternative sequences identified within mgpB are flanked by sequences of identity, regions of deviation for alternative sequences were defined by the first and last divergent nucleotide present. For our analysis of variation in the mgpB gene of M. genitalium G37-S, five PCRs were performed for each repeat region where five of the resulting transformed colonies were selected for sequencing, producing a total of 25 sequences analyzed for each repeat region.
Analysis of mgpB heterogeneity in cervical specimens from a persistently infected woman. Cervical samples collected from women persistently infected with M. genitalium were used to analyze mgpB sequence variation in vivo. These samples were collected longitudinally approximately every 2 months from a cohort of 300 women monitored for up to 3 years to assess correlates of sexually transmitted infections among female commercial sex workers in Nairobi, Kenya (Cohen et al., submitted). Although these women resided in or near Kariobangi, a slum area inside Nairobi, Kenya, their clients included men from Nairobi, other parts of Kenya, and other countries (C. R. Cohen, unpublished data), suggesting strains detected in these women were not confined to a limited geographic region. All women gave written informed consent, and the study was approved by the Institutional Review Board of the University of Washington and the Ethical Review Committee at Kenyatta National Hospital. Cervical samples were tested for M. genitalium by our standard diagnostic PCR assay (16), and the samples from one woman with a long-term (21 months) M. genitalium infection, designated as participant no. 10139, were evaluated further. In addition, eight other women also identified as having a persistent M. genitalium infection (more than five M. genitalium-positive cervical samples over the course of 1 year) were chosen for strain typing of M. genitalium as described below. For PCR amplification, DNA was purified from 150 µl of cervical sample and inhibitors were removed using the MasterPure DNA purification kit (Epicenter) according to the manufacturer's standard protocol for the isolation of DNA from fluid samples. Samples were concentrated fivefold by rehydrating isolated DNA in 30 µl of 10 mM Tris, 1 mM EDTA, pH 8.0. Each PCR assay was performed as described above for G37-S, with the exception that 6 µl of purified DNA (representing 30 µl of the original cervical sample) was used in a final volume of 50 µl. Due to the limited volume of cervical specimens available for analysis, only one PCR assay from each time point was performed in the analysis of the mgpB repeat region B from participant no. 10139; from each PCR, 10 transformed colonies were isolated and sequenced so that among the four cervical samples amplified, a total of 40 sequences were evaluated.
M. genitalium strain typing. The strain typing method developed by Jensen, et al. (29) was used to confirm that the culture of G37-S used for these analyses was not contaminated with other M. genitalium strains and to identify and differentiate M. genitalium strains among the cervical samples from nine persistently infected women. This strain typing system has been shown to have a high discriminatory index (0.95) for unrelated strains (21) and is based on single nucleotide polymorphisms that occur within the first 500 bases of mgpB, a region of the gene that is present in a single copy in the genome and is thus relatively conserved. All strain typing was performed using primer set 1F/1R (Table 2). Primer set 2F/2R was also used to further evaluate the M. genitalium strain present in cervical samples from participant no. 10139. PCR amplification, cloning, and sequencing were each performed as described above using conditions outlined in Table 2. Due to limited sample volume, as mentioned above, in the analysis of no. 10139, one PCR assay per time point was performed with five clones per PCR sequenced. Similarly, for the remaining persistently infected women, one PCR was performed with three clones per PCR sequenced.
Nucleotide sequence accession number. All sequence data presented in this study have been submitted to the GenBank database and are available, as indicated in either Table 1 or any of the relevant figures, under accession numbers DQ248067 to DQ248104.
|
|
|---|
![]() View larger version (21K): [in a new window] |
FIG. 1. Schematic drawing of M. genitalium G-37T MgPar sequences showing their genome location, architecture, and the location of homologous sequences in the mgpB gene. (A) Location of the full-length mgpB expression site and nine MgPar sequences in the M. genitalium G-37T genome. (B) Homology between the mgpB gene and the nine MgPar sequences in M. genitalium G-37T, as determined by pairwise alignment. Vertical dotted lines within mgpB define fragments A through J described by Dallo and Baseman (10). Shaded black boxes (labeled B, EF, and G) represent mgpB repeat regions identified in the analysis presented in Fig. 2. Below, each of the MgPar sequences (MgPar 1 to 9) is listed on the left with their corresponding size. Sequences within each MgPar site with homology to mgpB are shown diagrammatically by a colored line; the location of each line corresponds to the location in the mgpB gene in which homology was observed, numbered according to the GenBank entry (Table 1). For example, MgPar 1 has homology to repeat regions B, EF, and G between bp 12 to 445, 445 to 1038, and 2010 to 2170, respectively. The remainder of MgPar 1 is not homologous to sequences found within the mgpB gene. Sequence alignments for the mgpB and MgPar sequences diagrammatically pictured in panel B are presented in Fig. S1 to S3 in the supplemental material.
|
We next used our newly identified complete MgPar sequences to perform pairwise alignments with the mgpB gene and determined that sequences in mgpB repeat regions B, EF, and G are homologous to seven, eight, and five of the different MgPar sites, respectively (Fig. 1B). In addition, the MgPar sequences homologous to repeat regions B, EF, and G are colinear with the sequences in the mgpB expression site yet lack the intervening sequences that correspond to those regions that are unique to the mgpB gene. The MgPar sequences are not themselves predicted to serve as alternative, functional expression sites because they not only contain AT-rich sequences that contain stop codons in every frame but also because they lack over half of the mgpB expression site sequence that would be required for translation of a complete MgPa protein (42). MgPar sequences with homology to repeat regions B, EF, and G of mgpB are 79 to 90% identical to mgpB, including both strictly identical sequences that would be necessary for efficient homologous recombination (47, 59) as well as divergent sequences, highlighting their potential to generate mgpB heterogeneity via homologous recombination.
To validate the location of repeated sequences within mgpB, we analyzed the mgpB expression site for repetition elsewhere in the chromosome (not limiting the search to MgPar sequences) using a modification of the computer algorithm developed by Rocha and Blanchard (44). Consistent with previous studies (10, 42) and our pairwise alignment (Fig. 1B), sequence repetition was concentrated within the three distinct regions, B, EF, and G, while the remaining mgpB sequences were exclusive to the expression site (Fig. 2). These findings are different from a previously published in silico analysis in which an additional repeat region was identified between regions B and EF (44). This extra peak of repetition was not detected in this current study due to improvements in the analysis that made it more precise (see Materials and Methods).
![]() View larger version (15K): [in a new window] |
FIG. 2. Regions of mgpB sequence repetition in the M. genitalium G-37T genome. The computer-based algorithm of Rocha and Blanchard (44) was modified and used to probe the G-37T genome for sequences with 100% identity to overlapping 23-bp segments of the mgpB gene. Sequences of the mgpB gene found in multiple copies are concentrated within three regions that overlap with repeat regions B (bp 575 to 1016), EF (bp 2292 to 2876), and G (bp 3305 to 3550). Small arrows illustrate the locations of primers listed in Table 2.
|
The in silico analysis described above also identified a sharp peak of repetition at bp 2964 (Fig. 2) that was found to result from seven consecutive "TAG" repeats that have identity to several other "TAG" trinucleotide repeats found elsewhere in the genome. A BLAST search of the M. genitalium G-37T genome revealed that this extended trinucleotide repeat (seven or more consecutive "TAG" repeats) was also present in the mgpC gene and MgPars 2, 8, and 9, each containing 11, 16, 10, and 9 "TAG" repeats, respectively. In addition, regardless of the number of "TAG" repeats at all five locations (mgpB, mgpC, and MgPars 2, 8, and 9) each trinucleotide repeat was flanked 5' by "TGA" and 3' by "TAC." While the "TAG" codon encodes a translational stop in mycoplasmas, in silico translation of the trinucleotide repeat sequence in mgpB suggests that the corresponding MgPa protein would contain a run of seven serine residues ("AGT"). It is interesting, however, that a shift in the reading frame via slipped strand mispairing, for example, could change the expressed amino acid sequence to include seven valine residues ("GTA") or seven stop codons ("TAG").
In vitro heterogeneity of the mgpB gene in M. genitalium G37-S. To determine the extent of mgpB sequence variation in M. genitalium G37-S, the stock of G-37 grown and maintained in Seattle, repeat regions B, EF, and G were PCR amplified from a single culture using primer sets 3F/3R, 4F/4R, and 5F/5R (Fig. 2), and the resulting amplicons were cloned into E. coli to separate individual PCR products, sequenced, and assessed for variability (Fig. 3). Three different sequences (sequences 1-1, 1-2, and 1-3) were identified in the clones analyzed from repeat region B (Fig. 3B). Sequence 1-1 was identical to the published G-37T mgpB sequence (identified in 23 of 25 clones), but sequences 1-2 and 1-3 (identified in one clone each) diverged over regions that instead had identity to MgPar 7 or 8, respectively (Fig. 3B and 3D). While variability in repeat region EF was not detected in this analysis, two different sequences were identified from repeat region G; sequence 3-1 (identified in 13 of 25 clones) was identical to the published G-37T mgpB sequence and sequence 3-2 (identified in 12 of 25 clones) contained divergent sequences identical to MgPar 3 (Fig. 3C). Variation of mgpB region B within the strain G-37 has been previously published for a separately maintained stock of G-37 (Fig. 3B, G37-DK) in which 45 bases had identity to MgPar 8 in place of the published G-37T sequence (42). The sequences of repeat region B are shown in the alignment in Fig. 3D in which the apparent insertion of MgPar sequences into the mgpB expression site can be better appreciated. Together, these results are consistent with multiple, independent recombination events between the mgpB and MgPar sequences.
![]() View larger version (82K): [in a new window] |
FIG. 3. Heterogeneity of the mgpB gene, determined by PCR amplification, cloning, and sequencing of M. genitalium G37-S. (A) Diagram of the mgpB gene showing the locations of repeat regions B, EF, and G. (B) Sequence heterogeneity identified in repeat region B. Sequence 1-1 was identical to the mgpB gene of the published M. genitalium G-37T sequence (17), while alternative sequences 1-2 and 1-3 contained 294 and 209 bases that were identical to MgPar 7 (103 to 391 bp) and MgPar 8 (65 to 263 bp), respectively, in place of the published mgpB sequence. The region B sequence for G37-DK (accession number X91075) (42) contains 45 bases that have identity to MgPar 8 (326 to 370 bp). (C) Sequence heterogeneity identified in the clones analyzed for repeat region G. Sequence 3-1 was identical to the M. genitalium G-37T published mgpB sequence, while sequence 3-2 contains 115 bp identical to MgPar 3 (474 to 592 bp) in place of the published mgpB sequence. (D) Nucleotide sequence alignment for repeat region B between the published sequences for G-37T mgpB, 1-1, 1-2, 1-3, G37-DK, and MgPars 7 and 8. The sequence heterogeneity is flanked by regions of sequence identity, and the conserved sequence at bp 838 to 900 is presumably where the recombination event between region B and MgPar 8 that generated sequence 1-3 and G37-DK took place. Bases identical to the G-37T sequence are highlighted in gray, while sequences identical to MgPars 7 and 8 are highlighted in aqua and pink, respectively. GenBank accession numbers for alternative G37-S sequences are as follows: 1-2, DQ248067; 1-3, DQ248068; 3-2, DQ248069. The accession numbers for MgPars 7 and 8 are listed in Table 1.
|
The development and evolution of mgpB repeat region B sequence heterogeneity in a woman persistently infected with M. genitalium. The presence of nine MgPar sequences in the G-37T genome, as well as the heterogeneity of mgpB repeat regions B and G in a single culture of M. genitalium G37-S, prompted us to investigate the extent of mgpB heterogeneity in vivo. To accomplish this, we evaluated the scope and temporal relationships of mgpB sequence heterogeneity in mgpB repeat region B in the context of a persistent M. genitalium infection. A woman (participant no. 10139) from which 15 consecutive M. genitalium-positive cervical samples were obtained over a 21-month period was chosen for this analysis. Accordingly, repeat region B of the mgpB gene was amplified and sequenced from the DNA isolated from the first two (4 October 2001 and 14 February 2002) and last two (13 December 2002 and 9 July 2003) cervical samples collected to provide a longitudinal "snapshot" of the location and extent of in vivo mgpB region B diversity.
Among the 40 mgpB repeat region B sequences analyzed from participant no. 10139, 17 unique sequences (listed as A through Q in Fig. 4) were identified: three sequences (A to C) were identified at the first time point (4 October 2001), five (D to H) at the second time point (14 February 2002), three (I to K) at the third time point (13 December 2002), and six (L to Q) at the final time point (9 July 2003). Remarkably, in our analysis, none of the complete mgpB region B sequences were identified at more than one time point. For example, sequence G was identified in five different clones, all of which were derived from the 14 February 2002 cervical sample; this sequence was never identified in the analysis of the other three cervical samples (Fig. 4A). All sequences identified in this analysis contained regions with absolute identity to G-37T mgpB as well as regions that diverged from this published sequence (Fig. 4A). In some cases, these alternative sequences were identical to G-37T MgPar sequences (as noted in the legend to Fig. 4A), while others were not (and are designated "novel"). While all mgpB region B sequences share common elements (for example, sequences C, E to I, K, N, P, and Q all contained a 5' sequence shown in Fig. 4A), each sequence is unique when mgpB region B is considered as a whole. Closer examination of these sequences revealed that sequences are chimeric and most likely originated from multiple recombination events with MgPar sequences (Fig. 4B). Several internal mgpB region B sequences are consistent with a temporal change in prevalence, as exemplified by the predominance and then absence of novel sequence 3 after the emergence, then dominance, of novel sequence 4 (Fig. 4A and C).
![]() View larger version (73K): [in a new window] |
FIG. 4. Repeat region B mgpB sequences identified in cervical specimens from participant no. 10139, a woman persistently infected with M. genitalium. A total of 40 sequences (10 per cervical sample) were analyzed, and 17 unique variants (A to Q) were detected in the four time points examined. (A) Schematic representation of the 17 sequences identified. White regions are homologous to the published G-37T sequence, while areas shaded as indicated by the key diverge from the mgpB sequence and instead have homology to G-37T MgPar sequences or have no homology to any sequences in the G-37T genome (designated "novel"). Block sizes shown in the key represent five nucleotides each. Sequence I contained an extra adenosine base (shown with an A) at bp 878, which is predicted to alter the reading frame and introduce three in-frame stop codons, indicated with red arrows. The schematically depicted sequences are presented in more detail in panels B and C. (B) Sequence alignment of mgpB bp 620 to 709 of G-37T; patient no. 10139 sequences A, B, D, E, and I; and the G-37T MgPar sequences showing the chimeric nature of mgpB sequences and possible explanations for their generation. Sequence A or B could have initially recombined with MgPar 4 to generate sequence D, which then could have recombined with novel sequence 1 to generate sequence E or I. (C) The shift in mgpB sequences over the course of a persistent infection exemplified by an alignment between sequences A through Q and the published G-37T mgpB sequence between bp 767 to 816. Bases with identity to the G-37T mgpB sequence are highlighted in gray, while bases identified as novel sequences 3 and 4 are shaded in red and yellow, respectively. The horizontal arrows at the top and bottom of panels A and C, respectively, indicate the conserved sequence targeted by primer 2R (Fig. 2; Table 2) used in the strain typing experiments. Sequences A through Q have been deposited in GenBank under accession numbers DQ248070 through DQ248086, respectively.
|
In vivo mgpB variability occurs within a single M. genitalium strain. To investigate whether the mgpB heterogeneity identified in samples from participant no. 10139 was the result of infection by one or multiple M. genitalium strains, strain typing was performed on the same samples used to perform the repeat region B analysis (Fig. 4A). Primers 1F/1R (Fig. 2) were used to analyze the first 500 bp of the mgpB gene, which is present only once in the genome. In this experiment, all 20 sequences analyzed (five per time point x four time points) from participant no. 10139 were identical and clearly distinguishable from sequences obtained from eight other persistently infected women sampled at similar dates in the same cohort of women in Nairobi (Fig. 5). In addition, the longitudinal evaluation of seven women with infections lasting 10 months or longer has demonstrated that the persistence of one infecting strain over the course of a long-term M. genitalium infection, as seen with participant no. 10139, is not uncommon in this cohort of women (Cohen et al., submitted). These results are consistent with the conclusion that participant no. 10139 was infected with a single M. genitalium strain that is different from other strains circulating among this cohort during this study period.
![]() View larger version (34K): [in a new window] |
FIG. 5. Strain typing of M. genitalium sequences derived from cervical samples from women persistently infected with M. genitalium, suggesting that participant no. 10139 was infected with a single strain that can be distinguished from other strains present in this population during the same time period. Participant identification numbers and dates of sample collection are shown to the left. Dashed lines indicate sequence identity to the published G-37T sequence, between bp 221 and 380 of mgpB. Bases shaded in black are those previously shown to distinguish M. genitalium strains, which therefore serve as the basis of the strain typing system (29). For participant no. 10139, five clones were sequenced per cervical sample, while a minimum of three clones were sequenced from other persistently infected women; all sequences obtained from a given woman were identical. Sequences (with GenBank accession numbers) are as follows: 10139, DQ248092; 10046, DQ248087; 10081, DQ248088; 10089, DQ248089; 10090, DQ248090; 10114, DQ248091; 10163, DQ248093; 10177, DQ248094; 10274, DQ248095.
|
260 bp of the heterogeneous region B from a cervical sample collected from participant no. 10139 at an intermediate time point (25 April 2002). In this analysis, five different sequences were obtained (data not shown). All contained the heterogeneity characteristic of bp 575 to 817 of sequences D, G, H, I, J, and K (Fig. 4) linked to the same characteristic strain typing sequence of the infecting M. genitalium strain (Fig. 5 and data not shown). These results, including the conservation of the same strain typing sequence over the period of observation, confirms that participant no. 10139 was persistently infected for almost 2 years with a single strain of M. genitalium and that the apparent evolution of mgpB repeat region B heterogeneity observed was not the result of coinfection with multiple strains. |
|
|---|
The availability of a fully sequenced M. genitalium genome allowed us to expand on the number and architecture of the MgPar sequences, as well as their potential to generate variants of mgpB within a single strain. The M. genitalium G-37T strain available from ATCC was derived from a clonal isolate (57, 58); however, in the analysis of our stock culture of M. genitalium G37-S, we identified variant mgpB repeat region B and G sequences that were identical to sequences within identified MgPar sequences, consistent with recombination between the mgpB expression sites and these putative MgPar donor sites. Strain typing confirmed that the G37-S stock was not contaminated with other M. genitalium strains (data not shown), indicating that mgpB variability developed within this strain during in vitro passage of the organism. The diversity of mgpB sequences from M. genitalium cultured under laboratory conditions, however, may not be representative of its diversity in vivo due to the absence of the selective pressures pathogens may experience within a host, such as immune pressure, and/or to the possible selection of individual M. genitalium clones either at the time of isolation or during the unknown number of subcultures since its initial isolation in 1981 (57). Given this limitation, we evaluated the heterogeneity and variability of mgpB sequences from a woman persistently infected with a single strain of M. genitalium, and while limited sample volume prevented the performance of multiple PCRs, we remarkably identified a minimum of 17 different mgpB region B variants. Many of the regions containing divergent sequences in these variants were identical to the MgPar sequences published for M. genitalium G-37T, which is consistent with the conservation of at least some MgPar sequences between strains. However, other mgpB sequences differed from the reported mgpB and MgPar sequences of G-37T, suggesting that in other strains, MgPar sequences may diverge from those of the published M. genitalium genome. While extensive MgPar heterogeneity, including the identification of sequences not present in the G-37T genome, has been detected in other M. genitalium strains (S. L. Iverson-Cabral and P. A. Totten, unpublished data), further studies are needed to determine the variability within sequences and, possibly, numbers of MgPar sequences among and within M. genitalium strains. Any such findings would emphasize the potential for, and generation of, population diversity within this species.
The mgpB region B sequences identified in our analysis of the persistently infected woman suggests that sequential multiple recombination events occurred over the course of infection, although at present, we cannot determine if these mgpB variants were present in the initial inoculum or whether they were generated by recombination during infection. The chimeric nature of mgpB sequences and possible evolution over time is exemplified in the alignment shown in Fig. 4. The mgpB hybrid sequences detected in vivo are consistent with the pattern of mgpB heterogeneity within G37-S and the previously published G37-DK (42) in which different portions of MgPar 8 have recombined into a single mgpB repeat region at least two different ways (Fig. 3D). These in vitro and in vivo data demonstrate the occurrence of multiple recombination events between mgpB and the MgPar sequences, which would enhance the potential for MgPa diversity.
Heterogeneity and antigenic variation of immunodominant proteins has been characterized in other sexually transmitted pathogens, including Treponema pallidum and Neisseria gonorrhoeae. In T. pallidum, heterogeneity has been shown to accumulate within distinct variable regions of the tprK gene, with alternative sequences linked to putative tprK donor sequences found elsewhere in the genome (5). For N. gonorrhoeae, variant pilin proteins are expressed when silent, alternative pilS sequences recombine into the pilE expression site (38, 39). The variability of pilS sequences and the ongoing recombination between pilE and pilS sites allow N. gonorrhoeae an almost unlimited repertoire of pilin protein antigenic epitopes, which is dramatically increased by the variability and number of pilS sequences between N. gonorrhoeae strains (19). Pilin proteins help facilitate host cell attachment, and different pilE sequence variants have been associated with differences in host cell tropism (31), while antigenic and phase variation of another surface-expressed antigen, the opacity proteins (using mechanisms that differ from the pilE/pilS system described above), is responsible for the induction of different signaling pathways in target host cells (12, 20). It is currently unknown if MgPa variability similarly alters M. genitalium host cell tropism, facilitates different cell signaling pathways, and/or allows the organism to escape the host immune response.
The architecture and sequence similarity between MgPar and mgpB sequences strongly suggests that alternative sequences in mgpB are generated by homologous recombination. This process has been better characterized in other bacterial species, including E. coli and N. gonorrhoeae, in which recombination is dependent on the length and degree of sequence identity. In E. coli, homologous recombination requires flanking sequences containing a minimum of 20 bp of identity, with 25 to 100 bp as an average range (47, 59). In N. gonorrhoeae, antigenic variation occurs by homologous recombination between pilS and pilE sequences, as described above (38, 39), with variant sequences bordered by sequences of identity that range from 3 to 68 bp in length (24). Similar to these variant pilE sequences, we identified stretches of homologous sequences that range from less than 10 bp to more than 60 bp (Fig. 3D and 4B; see also Fig. S1 to S3 in the supplemental material) that flank variant mgpB sequences. These regions of identical sequences are not only found at the termini of mgpB repeat regions but also located within the repeat regions that flank internal regions of divergence, each of which could independently recombine to create an almost unlimited potential for mgpB heterogeneity and, therefore, MgPa diversity.
While very little is known about recombination in mycoplasma species, annotation of the M. genitalium genome suggests that the components for basic recombination are present, including homologues of the RecA, RuvA, RuvB, RecU, and PolA proteins (17, 45). A comparative analysis of the homologous recombination components present in various bacteria genomes suggests that the recombination machinery in M. genitalium is reduced and lacks important initiation enzymes such as RecBCD, RecQ, or RecFOR (4, 45). The lack of such important presynaptic proteins has also been observed for other bacterial pathogens, including the highly recombinant Helicobacter pylori (45, 54), suggesting that for many organisms, presynaptic functions are performed by other enzymes not identified in current genome annotations. Interestingly, in M. genitalium, the mgpA gene (MG190 in TIGR database) found upstream of mgpB and mgpC in the MgPa operon has a phosphoesterase motif and homology to the RecJ protein in E. coli (2). While both mgpB and mgpC are repeated throughout the genome (10, 42), the sequences within mgpA are not. It is tempting to speculate that mgpA provides the exonuclease activity required to produce a free 3' single-stranded DNA to which RecA can bind for the initiation of homologous recombination (1). The mechanisms and requirements for recombination in the reduced M. genitalium genome, including the accessory proteins involved, needs to be evaluated not only to elucidate the recombination machinery of M. genitalium and other Firmicutes but also to clarify the requirements for recombination in other bacterial pathogens, such as H. pylori, that appear to lack large portions of a complete recombination system.
Regardless of how recombination occurs, the variant mgpB sequences identified in our analysis of a persistently infected woman imply that sequences change over time, presumably in response to immune pressure. As can be seen in the example given in Fig. 4C, novel sequence 4 was not identified at earlier time points but emerged in the third cervical sample and came to dominate sequences identified in the final sample. This coincides with the apparent elimination of novel sequence 3, which was predominant in earlier specimens. These observations, as well as our analysis of mgpB heterogeneity in vitro, is in agreement with the hypothesis that MgPa antibodies produced during persistent infection select new mgpB variants generated via mgpB-MgPar recombination until they in turn are eliminated by the immune system. Clearly, antibodies against the MgPa protein are induced during M. genitalium infection (6, 25, 37; Iverson-Cabral and Totten, unpublished), yet the antigenic epitopes of MgPa, the biologic activity of anti-MgPa antibodies, and the contribution of antibodies to the generation of protein diversity have not been established. The hypothesis of immune evasion relies on the assumption that variable MgPa regions (i.e., those encoded by mgpB regions B, EF, or G) correspond with immunogenic epitopes, and consistent with this model, Opitz and Jacobs (40) identified three regions of the MgPa protein that overlap with mgpB repeat regions B, EF, and G, which mediate attachment and are therefore assumed to be surface exposed. In addition to facilitating survival under immune pressure, variation of this dominant attachment protein may also contribute to colonization in different anatomical niches, promote evasion of uptake and killing by professional phagocytes, and/or enhance transmission of infection.
The architecture and repetition of mgpB sequences in M. genitalium are very similar to the P1 gene in the respiratory pathogen Mycoplasma pneumoniae, in which portions of the P1 gene are repeated throughout the M. pneumoniae genome (46, 60); the P1 protein of M. pneumoniae is the homologue of MgPa of M. genitalium (11, 37). Based on these similarities, one would presume that P1 antigenic variation occurs in M. pneumoniae (46); however, only minimal P1 heterogeneity has been documented among M. pneumoniae isolates with strains classified into one of two groups (50, 51). Subsequent restriction fragment length polymorphism analysis of over 200 clinical M. pneumoniae strains identified additional subtypes within these two groups (13, 14, 33). The sequencing of variant P1 genes indicated that sequence variation could be explained by recombination, with the repeated elements found outside the P1 expression site (13, 14, 33). Furthermore, M. pneumoniae strains expressing different P1 variants were shown to differ in their ability to colonize the respiratory tract of guinea pigs, suggesting that pathogenicity may be influenced by P1 variation (15). It is reasonable that MgPa expression similarly influences M. genitalium colonization, survival, and persistence in the genital tract, although future experiments will be required to establish this possibility. As the mgpB diversity reported here is much more extensive than what has been previously observed in M. pneumoniae, the study of M. genitalium mgpB sequence heterogeneity and its effect on persistence in vivo may serve as a model for future studies of M. pneumoniae P1 diversity and its relationship with chronic respiratory infection.
As indicated throughout this discussion, much remains to be discovered about the pathogenicity of M. genitalium, including the mechanisms and consequences of antigenic variation. This study provides compelling evidence that M. genitalium persistence is due to antigenic variation of the immunodominant MgPa protein. Our results are consistent with the recombination of alternative MgPar sequences into the mgpB expression site both in a strain of M. genitalium cultured in vitro as well as among sequences identified from a strain in a persistently infected woman. We hypothesize that the almost unlimited generation of mgpB diversity contributes to the chronic nature of M. genitalium infection. Future studies establishing the surface exposure of MgPa, the nature of mgpB-MgPar recombination, the contribution of immune response to the generation and selection of alternative mgpB sequences in vivo, the requirements for homologous recombination, and possible differences in binding and/or host cell signaling will promote a better understanding of the sophisticated mechanisms of persistence, survival, and pathogenesis of this genomically challenged organism.
We thank Gwen Wood, Arturo Centurion-Lara, and Steven Moseley for critical reading of the manuscript, George Kenny for assistance with M. genitalium growth, and Sheila Lukehart for scientific discussion and suggestions.
Supplemental material for this article may be found at http://iai.asm.org/. ![]()
|
|
|---|
This article has been cited by other articles:
| |||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Copyright © 2009 by the American Society for Microbiology. For an alternate route to Journals.ASM.org, visit: http://intl-journals.asm.org | More Info»