Previous Article | Next Article ![]()
Infection and Immunity, October 2003, p. 5650-5661, Vol. 71, No. 10
0019-9567/03/$08.00+0 DOI: 10.1128/IAI.71.10.5650-5661.2003
Copyright © 2003, American Society for Microbiology. All Rights Reserved.
and Ning Zhi
Department of Veterinary Biosciences, College of Veterinary Medicine, The Ohio State University, Columbus, Ohio 43210
Received 26 March 2003/ Returned for modification 8 May 2003/ Accepted 7 July 2003
| ABSTRACT |
|---|
|
|
|---|
70-type promoter was found upstream of tr1. The p44 genes include a central hypervariable region flanked by conserved regions. The hypervariable region sequence in the p44 expression locus was duplicated and, regardless of the expression status, conserved at another locus in both low- and high-passage cell cultures of strain NY-37. No significant differences in the hypervariable region were found when we compared p44 sequences, at the level of cDNA, within the expression locus and within other loci in the genomes of strains NY-37 and HZ. Similarly, in cDNA isolated from patients and from assorted cultures of strains NY-31, NY-36, and NY-37, hypervariable regions of 450 deduced amino acid sequences of various p44s within each strain were found to be identical, as were those of p44 sequences in the genome of strain HZ. These data suggest that variations in p44 sequences at the level of the p44 expression locus occur through unidirectional conversion of the entire (nonsegmental) p44 hypervariable region including flanking regions with a corresponding sequence copied from one of the conserved donor p44 genomic loci. The data suggest that the P44 antigenic repertoire within the hypervariable region is restricted. | INTRODUCTION |
|---|
|
|
|---|
p44 is homologous to msp2 from Anaplasma marginale, a bovine intraerythrocytic agent (43). It was reported that the segmental gene conversion of multiple msp2s at a single expression locus confers unlimited variation of expressed msp2 chimeras or mosaics (2, 5, 6). Recently, Barbet et al. proposed the same mechanism for the expression of variable p44s in A. phagocytophilum (3). If segmental gene conversion occurs, the sequences of the hypervariable regions of p44 in the expression locus would be chimeras or mosaics derived from more than two donor p44s located elsewhere within the same genome. This is inconsistent with our previous data showing that the hypervariable region sequences in several p44 paralogs cloned from different genomic loci are identical in the corresponding regions of p44 cDNAs (43, 44). In the present study, we tried to resolve this conundrum by testing the msp2 (p44) segmental gene conversion theory in various strains used in our previous studies. Our data suggest that, instead of segmental gene conversion, the entire hypervariable region is replaced to alter p44 expression. Understanding the versatility in the structure and expression of surface component P44s may provide insights into the adaptive capabilities of A. phagocytophilum in mammalian and tick hosts and host clearance mechanisms.
| MATERIALS AND METHODS |
|---|
|
|
|---|
Reverse transcription-PCR (RT-PCR) analysis. Total RNA was extracted from 5 x 106 A. phagocytophilum-infected HL-60 cells (80 to 100% infection) with the TRIzol reagent (Invitrogen). The RNA was further purified with the RNeasy minikit (Qiagen, Valencia, Calif.). After DNase I treatment, the isolated RNA (5 µg) was heated at 70°C for 10 min. Samples were then subjected to reverse transcription at 42°C for 50 min in a 20-µl reaction mixture containing 0.5 mM (each) deoxynucleotide triphosphate (dNTP), 200 U of SuperScript II reverse transcriptase (Invitrogen), 200 ng of random hexamers, and 3 mM MgCl2. PCR was performed in a 50-µl reaction mixture containing 4 µl of the cDNA product, 10 pmol of each primer (primer sets 1, 12, 13, 14, and 15, as shown in Table 1), 0.2 mM (each) dNTP, 5 U of Taq DNA polymerase, and 1.5 mM MgCl2. PCR conditions included 3 min of denaturation at 94°C, followed by 35 cycles consisting of 1 min of denaturation at 94°C, 1 min of annealing at 55°C, and 1 min of extension at 72°C. PCR products were purified from a gel and cloned into a pCRII vector (Invitrogen). Twenty to 56 cDNA clones were randomly selected from the transformants and sequenced on an ABI 373XL Stretch DNA sequencer with the ABI PRISM BigDye Terminator Cycle Sequencing Reaction kit.
|
5'RACE. The experiment involving 5' rapid amplification of cDNA ends (5'RACE) was performed according to the protocol provided by the manufacturer (Invitrogen). DNase I-treated total RNA (3 µg) was reverse transcribed with Superscript II and gene-specific primers at 42°C for 50 min (primer sets 2, 3, 4, and 5, as shown in Table 1). For cDNA synthesis of long transcripts (>3 kb), the DNase I-treated total RNA (5 µg) was reverse transcribed by using a Thermotranscript kit (Invitrogen) according to the manufacturer's recommended protocol at 54°C for 50 min with a gene-specific primer (primer set 6, as shown in Table 1). The cDNA was tailed by using terminal transferase to add cytidine or adenosine residues at the 3' end and then amplified by PCR with a second gene-specific primer and either an oligo(dG)- or oligo(dT)-linked amplification primer. The PCR conditions included 35 cycles of 1 min of denaturation at 94°C, 1 min of annealing at 57°C, and 1 min of extension at 72°C. Primary PCR products were further amplified by a nested gene-specific primer and the amplification primer without an oligo(dG) or oligo(dT) anchor. The secondary PCR products were purified and cloned. Inserts of 20 clones were sequenced for each sample.
Genomic Southern blot analysis.
Genomic DNA of A. phagocytophilum was extracted from purified organisms as described elsewhere (29). Briefly, samples were subjected to lysis with sodium dodecyl sulfate, pronase digestion, phenol-chloroform extraction, and ethanol precipitation. The purified genomic DNA was then digested with either EcoRI, PstI, XbaI, or SpeI. PCR products amplified with primer sets 7, 8, and 9 (Table 1) were labeled with [
-32P]dCTP by using Ready-To-Go DNA labeling beads (Amersham Pharmacia Biotech), and the labeled PCR products were then used as DNA probes. Hybridization was carried out under high-stringency conditions (65°C) in Rapid hybridization buffer (Amersham Pharmacia Biotech), as described elsewhere (29). After being washed, the membrane was exposed to Hyperfilm to visualize the locations of the hybridized bands (Amersham Pharmacia Biotech).
Genome walking. Based on the unique sequences of identified p44 transcripts, unknown 5'- and 3'-end flanking regions were amplified by using a Universal GenomeWalker kit (Clontech Laboratories, Palo Alto, Calif.) or by multiplex restriction site PCR (2, 16, 40). The primers used for genome walking (primer sets 10, 11, 16, and 17) are shown in Table 1. The PCR products were cloned into a pCRII vector or a pCR-XL-TOPO vector and subjected to DNA sequencing.
Western blot analysis. The P44-28 oligopeptide REKSGNTNTKPQOND, Pep28, was synthesized (Alpha Diagnostic, San Antonio, Tex.). Antiserum against Pep28 was generated by immunizing rabbits with keyhole limpet hemocyanin-conjugated Pep28 (ProSci, Inc., Poway, Calif.). P44-1 and P44-18 oligopeptides and antisera were generated as described elsewhere (43). The monoclonal antibody 5C11 recognizes the N-terminal conserved region of P44 (17), and Western blot analysis was performed as described elsewhere (21, 33).
Sequence analysis. The protein signal sequence was analyzed with the SignalP program (www.cbs.dtu.dk). Sequence assembling, alignments, and analysis were done with SeqMan, MegAlign, and MapDraw programs in the DNAStar software. The genome of A. phagocytophilum strain HZ was sequenced at The Institute for Genomic Research, and the sequences are available at http://www.tigr.org. The GenBank accession number of the A. phagocytophilum strain HZ unfinished whole genome is NC_004351.
Nucleotide sequence accession numbers. The GenBank accession numbers for the p44 expression loci in A. phagocytophilum strains are as follows: NY-37, AY137510 and AY319262 (p44-28 and p44-1 in the expression locus, respectively); NY-36, AY319262; NY-31, AY319264; HZ, AY319265 (p44-18 in the expression locus). The accession number for the p44-1/p44-18 gene locus is AY151054, and that for the p44-50/p44-28 pseudogene locus is AY151053.
| RESULTS |
|---|
|
|
|---|
To identify the p44 expression locus, we analyzed the upstream sequences of the major transcript species by 5'RACE in strains NY-31, NY-36, NY-37, and HZ. p44-28 was the major mRNA species in the low-passage cell cultures of strains NY-31 and NY-36 and in the high-passage cell culture of strain NY-37, p44-1 was the major species in the low-passage culture of strain NY-37, and p44-18 was the major species in the high-passage cell culture of strain HZ (passage numbers >100). It was found that all upstream sequences up to 199 bp from the start codon of each transcript species were identical, except for a 1-bp difference in the upstream sequence of p44-18 in strain HZ. These results suggest that all p44 major transcript species in the four A. phagocytophilum strains were expressed at identical genomic loci. To confirm this hypothesis, Southern blot analysis was performed on genomic DNA from each strain using a DNA probe specific to the common p44 upstream sequence (primer set 7; Table 1). The DNA used in these experiments was derived from the same samples from which RNA was extracted for 5'RACE analysis. The genomic DNA from the four strains (HZ, NY-31, NY-36, and NY-37) was digested with either EcoRI, PstI, or XbaI. The relative position of the probe (bp -199 to -42) is shown in Fig. 1. The DNA probe hybridized to a single fragment in samples of genomic DNA treated with each of these restriction enzymes, and hybridization patterns for the four strains were identical (Fig. 1). The hybridization patterns indicated that this genetic locus was conserved in the various human isolates of A. phagocytophilum, including one isolated 5 years earlier (HZ). Since the ank gene sequences of A. phagocytophilum isolates derived from the northeastern United States were shown to be distinct from Minnesota and Wisconsin A. phagocytophilum isolates (25), we examined whether this expression locus was present in the genomes of three A. phagocytophilum strains isolated in Minnesota by DNA PCR using primers specific to the p44 expression locus (primer sets 12 and 18; Table 1 and Fig. 2). The target region corresponding to the p44 expression locus in each of the three Minnesota isolates was amplified (data not shown). Barbet et al. also showed that A. phagocytophilum strains from New York and Wisconsin had the same p44 expression locus (3). These findings strongly suggest that the same p44 expression locus is present in all strains of A. phagocytophilum, regardless of their geographic origins.
|
|
The p44 expression locus also included a truncated recA (265 bp) with a start codon (ATG) 294 bp downstream of p44 and a stop codon (TAA). This fragment encoded the N-terminal 88 amino acid residues of the full-length RecA (357 amino acids) and is located elsewhere in the genome. The full-length recA gene was highly expressed by all four A. phagocytophilum strains in HL-60 cell cultures, as determined by RT-PCR (data not shown). The full-length recA in A. phagocytophilum strain HZ was highly homologous to recA in Escherichia coli (BlastP E-value of e-108). The A. phagocytophilum truncated and full-length RecAs each had an ATP binding site consensus sequence GPESSGKT, which corresponds to (G/A)XXXXGK(T/S) of the Walker A box, also referred to as the P loop (24). The putative DNA binding sites were located at amino acid residues 150 to 176 and 190 to 227, which were highly conserved in A. phagocytophilum and E. coli RecA proteins. Because DNA binding activity is essential for the function of RecA proteins (24) and because the recA fragment downstream of the p44 expression locus lacked DNA binding sequences, this RecA fragment is likely to be nonfunctional.
We designated the area consisting of tr1, omp-1X, omp-1N, and a downstream transcriptionally active p44 the p44 expression locus (GenBank accession number AY137510). Barbet et al. (3) independently identified omp-1N, the p44 paralog, and the 3' terminus recA truncated in A. phagocytophilum strains NY-18, Webster, and HGE2 and in a strain from HGE patient no. 2. The sequence alignment of this region showed that the upstream and downstream sequences of p44 paralogs were 99.6 to 99.9% identical to the respective sequences in strains NY-31, NY-36, NY-37, and HZ.
Transcription at the p44 expression locus. In the Florida and South Idaho strains of A. marginale, msp2 transcripts were shown to be polycistronically transcribed with ORF4 (OpAG3), ORF3 (OpAG2), and ORF2 (OpAG1) (2, 22). Because tr1, omp-1X, om-1N, and p44 were found in the same transcriptional orientation and because their intergenic spaces were relatively short, we considered the possibility that these genes could be polycistronically transcribed. To examine this possibility, we used RT-PCR to investigate whether transcripts of four sets of two genes adjacent to the p44 expression locus, including their intergenic spaces, were present the NY-31, NY-36, and NY-37 strains of A. phagocytophilum cultivated in HL-60 cells. As shown in Fig. 3A, all transcripts in the tr1-omp-1X, omp-1X-omp-1N, and omp-1N-p44 regions were cotranscribed, including the intergenic spaces. In contrast, no transcript from the ndk-tr1 region was detectable. The major 3-kb fragment was also detected by 5'RACE using specific primers located in the 5' upstream region of p44 in the expression locus, confirming polycistronic expression from the tr1 promoter (Fig. 3B). These results indicate that the four tandem genes (tr1, omp-1X, omp-1N, and a p44 paralog) are polycistronically transcribed under certain conditions. In addition, several minor fragments, ranging from 1.3 to 2.5 kb, were also detected, suggesting that additional promoters might be present within and upstream of omp-1N. Barbet et al. showed that omp-1N (p44ESup1) and a p44 (msp2) paralog were coexpressed (3).
|
70-type consensus promoter sequences of E. coli (30), were found upstream of the initiation site. This site differs from the polycistronic transcriptional start site of A. marginale, which is located 74 bp 5' to the ATG initiation codon of ORF4 (OpAG3) (2). Barbet et al. identified a putative promoter region 80 bp upstream of omp-1N (p44ESup1) (3) that closely resembles the promoter identified in the msp2 expression site of A. marginale.
|
To identify the genomic loci of the major p44 cDNA species, we compared the Southern blotting patterns of genomic DNA isolated from low-passage cultures of strain NY-37 (
10 passages) with those of DNA isolated from high-passage cultures (>20 passages). Three probes, a p44 expression locus probe, a p44-1-specific probe, and a p44-28-specific probe, were used for this purpose. In both low- and high-passage cultures, the expression locus probe hybridized to a single DNA fragment (Fig. 5A), showing that the p44 expression locus was stably located in the genome during the culture passages. In contrast, the p44-1-specific probe hybridized to two fragments in the early culture passages: one corresponding to the p44 expression locus (Fig. 5B) and another corresponding to the p44-1/p44-18 locus that was previously identified in the HZ strain (43). However, in the high-passage culture, the p44-1 band in the p44 expression locus disappeared and a single band in the p44-1/p44-18 locus remained (Fig. 5B). Upon hybridization with the p44-28-specific probe, two DNA fragments were detected in PstI and XbaI digestions of genomic DNA from the low-passage culture: one in the p44 expression locus (weakly hybridized bands in Fig. 5C) and another in an unknown locus (strongly hybridized bands in Fig. 5C). A single overlapping band was detected in the EcoRI digestion of DNA from the low-passage culture (Fig. 5C). In the high-passage culture, the weakly hybridized band in the p44 expression locus became much stronger in relative density (Fig. 5C). The p44-28 paralog within the unknown locus appeared to be transcriptionally inactive, since the p44-28 transcript was not detected despite its presence at an unknown locus within the genome in the low-passage culture. This unknown p44-28 locus was previously detected by genomic Southern blot analysis in five strains of A. phagocytophilum at similar relative densities (21). These results indicate that, during cell culture passages, (i) p44-1 and p44-28 were stably present, regardless of their expression status, and (ii) the genetic population of A. phagocytophilum with p44-1 in the p44 expression locus was replaced with the population of A. phagocytophilum with p44-28 in the expression locus.
|
Sequence alignment of the new locus revealed that the sequences of two p44-28 hypervariable regions and flanking regions (411 bp, extending from position 31 to 541 of p44-28p) in the transcriptionally active p44-28 of the p44 expression locus and the inactive p44-28p found in the p44-50/p44-28 pseudogene locus were identical (Fig. 6). Nucleotide sequences external to these regions differed between the expression and pseudogene loci. These results suggest a nonreciprocal and nonsegmental gene conversion, whereby the entire central hypervariable region of p44-28 in the expression locus and a portion of the flanking sequences were derived from p44-28p in the donor locus.
|
To obtain the sequences of the p44 hypervariable region in the expression locus, DNA PCR was performed with expression locus-specific primers (primer set 18 in Table 1 and Fig. 2). Of a total of 21 clones sequenced, four different p44 species were identified: p44-18, p44-14, p44-2a, and p44-60. Donor p44-2a and donor p44-14 were found to be complete genes, whereas donor p44-18 and p44-60 were pseudogenes. Sequence alignment of four pairs of the p44 species in the expression locus and in their respective donor loci in strain HZ revealed that each pair had identical sequences within the hypervariable region and within a portion of the flanking regions. These identical flanking sequences extended to different lengths in each pair (Fig. 7), and sequence variations were observed within each pair external to the putative recombined sequences (Fig. 7). These results suggest that (i) the entire p44 hypervariable region in the expression locus was derived from the identical region in the donor locus and (ii) the recombination for each p44 species included a part of the flanking 5' and 3' conserved regions.
|
We also considered the possibilities that the various p44 species might exhibit minor differences in sequence and/or that they might be differentially affected by temporal factors. Therefore, we compared the predicted amino acid sequences of all p44 clones that were identified by RT-PCR, 5'RACE, and p44 expression locus DNA PCR in the NY-31, NY-36, and NY-37 strains. We made this comparison at three different time points: in patients (when the cultures were isolated), during low passages of cell culture after isolation, and during high passages of cell culture. In strains NY-36 and NY-37, the p44 types had identical sequences in the central hypervariable region and parts of flanking sequences and no chimera sequence was found. In strain NY-31, alignment of 150 predicted amino acid sequences from a total of 13 different p44 species showed that the sequences of all p44 species were identical except for that of p44-23. A total of 15 p44-23 cDNA clones were sequenced, and this species was continuously expressed by strain NY-31 at all three time points. Two of these cDNA clones, one p44-23 cDNA derived from a low-passage culture and one of five p44-23 cDNAs derived from a high-passage culture, included a sequence (encoding FVQFAKAVGVSHPS) which bordered the 5' conserved region and the central hypervariable region (Fig. 8). The same block of sequence was found in the p44-15 transcript in the blood from the patient. This sequence differed from that encoding IVQFANAVKISSPE found in the same region of 13 other p44-23 cDNA clones derived from the blood of patient NY31 and from high-passage cell cultures. No p44-15 transcript was detected in cell cultures with the NY-31 strain.
|
| DISCUSSION |
|---|
|
|
|---|
Although the 5' and 3' regions of p44 paralogs are relatively conserved, they are not identical. Therefore, we propose that heteroduplex formation and resolution occur at the 5' and 3' conserved regions in each recombination event. Sequences of p44 hypervariable regions appear to be too diverse to form heteroduplexes in these regions. After heteroduplex resolution, the conserved regions, rather than the hypervariable regions, appear to create chimera or mosaic sequences that include sequences derived from the conserved regions of the new p44 species and the previous sequence in the p44 expression locus. The key finding of this study is that sequences of the hypervariable region, the expression locus, and the corresponding donor locus are conserved within the same A. phagocytophilum strains. Despite extensive analyses, we did not identify any chimera or mosaic sequences within the hypervariable region of p44 cDNA or p44 paralogs in the expression locus. We also identified a short block of shared amino acid sequences encoded by two different p44 species in two different strains of A. phagocytophilum; the coding sequences are outside of the three hypervariable domains in the central hypervariable region (21). This is similar to the findings of Barbet et al., which suggested segmental gene conversion of p44 in A. phagocytophilum (3).
Our extensive comparison of p44 sequences within the same strains revealed a short block of p44-15 sequence within the p44-23 cDNA of the NY-31 strain. One possible explanation for the sequence diversity within this region is the formation of heteroduplexes combined with mismatch base repairing. It is possible that p44-23 recombined at the expression locus to replace p44-15 through homologous recombination. Such a recombination could have included the small DNA fragment within the 5'- and 3'-end conserved region, so that the new p44-23 in the expression locus retained this short block of p44-15. Insertion and deletion of such a short sequence are known to occur during RecA-mediated heteroduplex formation and mismatch base repairing in E. coli, as described by DasGupta and Radding (10) and Bianchi and Radding (4) and in more recent review articles (19, 20). Sequence variations in the msp2 expression locus could potentially also result from heteroduplex formation between two partially homologous sequences (multigene family) with base pair mismatches and bubbles. Since msp2 pseudogenes have short blocks of homologous regions within the hypervariable region (6), the several patterns of bubbles corresponding to the same heteroduplex and their resolution may create complex patchworks, as previously described (18). It is also possible that there is another p44-23 paralog at a different genetic locus that has a hypervariable region sequence identical to that of p44-23 with this block of p44-15 sequence at the border, since we previously found three bands in the genomic Southern blot analysis of NY-31 strain using the p44-23 probe (21). In fact, duplicated p44 paralogs are present in the A. phagocytophilum chromosome. We previously cloned p44-2a and p44-2b, which have identical nucleotide sequences in the central hypervariable region and a part of 5'- and 3'-end conserved regions in two different genomic loci of strain HZ (43, 44). In addition we found that two copies of p44-16, p44-7, and p44-15 are present in the non-p44 expression locus.
Thus, although p44 and msp2 have similar gene structures and expression loci, different mechanisms appear to have evolved to create sufficient antigenic diversity. The reason for the differences may lie in the number and the nature of the hypervariable region sequence of the paralog. The A. phagocytophilum genome includes more than 80 paralogs of p44, whereas only 8 to 10 msp2s are predicted in A. marginale (7). Thus, the number and sequence diversity of p44 paralogs may have been sufficient for A. phagocytophilum to evade host immune surveillance by a simple gene conversion, but the mechanism may have required modification for A. marginale to create sufficient antigenic diversity. In agreement with this hypothesis, to date we have identified 59 different p44 cDNA sequences, all of which have been found in the preliminary A. phagocytophilum genome sequencing database (www.tigr.org).
Interestingly, despite the presence of 5' and 3' conserved flanking sequences, no promiscuous recombination was detected within the nonexpressed reserve p44 genes. Since genomic loci of individual p44 paralogs within the same strain were conserved, this suggests that recombination was restricted to the single p44 expression locus. Previous genomic Southern blot analyses with probes specific for each of the p44 paralogs showed identical patterns among A. phagocytophilum strains isolated from HGE patients from New York in 1995 and 2000, suggesting that DNA sequences and genomic loci of individual p44 orthologs are also conserved among different strains and over a time period of at least 5 years in this geographic locus (21). The present study further confirmed that the p44 hypervariable region sequences are conserved among four strains. A few base variations observed within the hypervariable region of p44s may be the result of point mutations. Point mutations and gene conversion are not mutually exclusive and partially account for vlsE variation in Borrelia burgdorferi (36).
Our present results suggest that nonsegmental gene conversion is the primary mechanism for variable p44 gene expression by A. phagocytophilum at the p44 expression site. If this is true, the next question would be what recombination pathway is involved in changing p44s in the expression locus. Our analysis of the preliminary A. phagocytophilum genome sequence data revealed that A. phagocytophilum has genes homologous to E. coli recA, recF, recO, recR, recO, and recJ genes that could be used for the RecA-dependent RecF homologous recombination pathway (14, 19, 23). We did not find recBCD or recE homologs in the A. phagocytophilum genome, indicating that A. phagocytophilum might lack the RecBCD and RecE homologous recombination pathways. We did find the ruvABC and recG genes, which are used to resolve Holliday junctions generated by homologous recombination (19) within the A. phagocytophilum genome. Thus, the mechanism of recombination of A. phagocytophilum may be RecF dependent, similar to antigenic variation in Neisseria gonorrhoeae pilin (26).
No direct evidence for msp2 or p44 recombination has yet been found in either A. marginale or A. phagocytophilum. Unlike the cloning of vlsE in B. burgdorferi (41), the cloning of A. phagocytophilum p44 phenospecies (p44 expression locus genospecies) has been difficult. However, if there is no recombination and all P44 variants observed to date by us and others (3, 8, 13, 21, 43, 44) are due to selection alone, various P44 phenospecies may have been eliminated from each stock of these strains after continuous passages in different environments, and they would not be expected to reappear. In contrast, in one patient's blood (strain NY-37), p44-28 was the major transcript (21), but once this strain was isolated in the HL-60 cell line, the p44-1 transcript became the predominant species in the NY-37 isolate and the p44-28 transcript disappeared. However, in later culture passages, the p44-28 transcript once again became the predominant transcript and p44-1 mRNA disappeared. Similar observations have been reported during the transmission of A. phagocytophilum strain HZ from ticks to horses (44). Without recombination to explain this phenomenon, each isolate of A. phagocytophilum would need to maintain >80 different p44 genotypes at the expression locus, such that >80 different genetic clones would be present in each strain under all circumstances, which would clearly be inefficient. It is more plausible that A. phagocytophilum evolved this mechanism of antigenic variation to maintain the ability to express diverse p44 antigenic repertoires. Of course, differences in growth rate and selection also contribute to the overall dominance of any particular P44 phenotype.
It has been proposed that, in A. marginale, the msp2 variant change is driven by the humoral immune pressure of the cattle (6, 11). However, our present data and those previously reported by others (3, 15) showed that changes in p44 variants occur in cell culture in the absence of antibodies. It will be of considerable interest to determine the factors that influence the dominance of particular phenotypes under identical cell culture conditions.
Other novel findings of the present study are the polycistronic expression of the p44 expression locus from the
70-type promoter sequence upstream of tr1. In addition, our 5'RACE results suggest that additional promoters may exist within and upstream of omp-1X. Barbet et al. reported that a promoter with significant nucleotide sequence identity to the msp2 expression site promoter of A. marginale exists upstream of omp-1N (p44ESuP1) (3). Taken together, the p44 expression locus appears to be regulated by multiple promoters.
A. phagocytophilum p44s and A. marginale msp2s (31), along with E. chaffeensis (the etiologic agent of human monocytic ehrlichiosis) omp-1s (27, 29) and Ehrlichia canis (the etiologic agent of canine ehrlichiosis) p30s (27, 28, 38), have been identified as immunodominant major outer membrane protein multigene families in the family Anaplasmataceae. In both Ehrlichia spp., 22 paralogous genes are arranged in tandem at a single locus and are regulated by both polycistronic and monocistronic transcription (27, 29). In Anaplasma spp., the paralogs are dispersed throughout the genomes, and their primary transcription seems to primarily depend on a specific expression locus. Previous studies by others (2, 3) revealed that this locus is linked to omp-1 orthologs. Our study showed that the omp-1-linked expression locus of A. phagocytophilum is arranged in tandem downstream from the tr1 gene, similar to those of the Ehrlichia spp. (27, 28). This implies that the expression loci of major outer membrane protein multigene families of pathogenic bacteria in the family Anaplasmataceae evolved from a common locus in a bacterium ancestral to both Ehrlichia and Anaplasma spp. More-detailed analyses of these loci should facilitate our understanding of the evolutionary relationship between these two genera. Understanding the molecular mechanisms underlying regulation of p44 and msp expression represents an important first step toward developing therapies and vaccines against HGE.
| ACKNOWLEDGMENTS |
|---|
| FOOTNOTES |
|---|
Present address: Laboratory of Environmental Microbiology, Institute of Environmental Sciences, University of Shizuoka, Shizuoka 422-8526, Japan. ![]()
| REFERENCES |
|---|
|
|
|---|
| 1. | Bakken, J. S., J. Krueth, C. Wilson-Nordskog, R. L. Tilden, K. Asanovich, and J. S. Dumler. 1996. Clinical and laboratory characteristics of human granulocytic ehrlichiosis. JAMA 275:199-205.[Abstract] |
| 2. | Barbet, A. F., A. Lundgren, J. Yi, F. R. Rurangirwa, and G. H. Palmer. 2000. Antigenic variation of Anaplasma marginale by expression of MSP2 mosaics. Infect. Immun. 68:6133-6138. |
| 3. | Barbet, A. F., P. F. M. Meeus, M. Belanger, M. V. Bowie, J. Yi, A. M. Lundgren, A. R. Alleman, S. J. Wong, F. K. Chu, U. G. Munderloh, and S. D. Jauron. 2003. Expression of multiple outer membrane protein sequence variants from a single genomic locus of Anaplama phagocytophilum. Infect. Immun. 71:1706-1718. |
| 4. | Bianchi, M. E., and C. M. Radding. 1983. Insertions, deletions and mismatches in heteroduplex DNA made by recA protein. Cell 35:511-520.[CrossRef][Medline] |
| 5. | Brayton, K. A., D. P. Knowles, T. C. McGuire, and G. H. Palmer. 2001. Efficient use of a small genome to generate antigenic diversity in tick-borne ehrlichial pathogens. Proc. Natl. Acad. Sci. USA 98:4130-4135. |
| 6. | Brayton, K. A., G. H. Palmer, A. Lundgren, J. Yi, and A. F. Barbet. 2002. Antigenic variation of Anaplasma marginale msp2 occurs by combinatorial gene conversion. Mol. Microbiol. 43:1151-1159.[CrossRef][Medline] |
| 7. | Brown, W. C., K. A. Brayton, C. M. Styer, and G. H. Palmer. 2003. The hypervariable region of Anaplasma marginale major surface protein 2 (MSP2) contains multiple immunodominant CD4+ T lymphocyte epitopes that elicit variant-specific proliferative and IFN-gamma responses in MSP2 vaccinates. J. Immunol. 170:3790-3798. |
| 8. | Caspersen, K., J. H. Park, S. Patil, and J. S. Dumler. 2002. Genetic variability and stability of Anaplasma phagocytophila msp2 (p44). Infect. Immun. 70:1230-1234. |
| 9. | Chen, S. M., J. S. Dumler, J. S. Bakken, and D. H. Walker. 1994. Identification of a granulocytotropic Ehrlichia species as the etiologic agent of human disease. J. Clin. Microbiol. 32:589-595. |
| 10. | DasGupta, C., and C. M. Radding. 1982. Polar branch migration promoted by recA protein: effect of mismatched base pairs. Proc. Natl. Acad. Sci. USA 79:762-766. |
| 11. | French, D. M., T. F. McElwain, T. C. McGuire, and G. H. Palmer. 1998. Expression of Anaplasma marginale major surface protein 2 variants during persistent cyclic rickettsemia. Infect. Immun. 66:1200-1207. |
| 12. | Hardalo, C. J., V. Quagliarello, and J. S. Dumler. 1995. Human granulocytic ehrlichiosis in Connecticut: report of a fatal case. Clin. Infect. Dis. 21:910-914.[Medline] |
| 13. | IJdo, J. W., C. Wu, S. R. Telford III, and E. Fikrig. 2002. Differential expression of the p44 gene family in the agent of human granulocytic ehrlichiosis. Infect. Immun. 70:5295-5298. |
| 14. | Ivancic-Bace, I., P. Peharec, S. Moslavac, N. Skrobot, E. Salaj-Smic, and K. Brcic-Kostic. 2003. RecFOR function is required for DNA repair and recombination in a RecA loading-deficient recB mutant of Escherichia coli. Genetics 163:485-494. |
| 15. | Jauron, S. D., C. M. Nelson, V. Fingerle, M. D. Ravyn, J. L. Goodman, R. C. Johnson, R. Lobentanzer, B. Wilske, and U. G. Munderloh. 2001. Host cell-specific expression of a p44 epitope by the human granulocytic ehrlichiosis agent. J. Infect. Dis. 184:1445-1450.[CrossRef][Medline] |
| 16. | Johnston, S. L., M. Strausbauch, G. Sarkar, and P. J. Wettstein. 1995. A novel method for sequencing members of multi-gene families. Nucleic Acids Res. 23:3074-3075. |
| 17. | Kim, H. Y., and Y. Rikihisa. 1998. Characterization of monoclonal antibodies to the 44-kilodalton major outer membrane protein of the human granulocytic ehrlichiosis agent. J. Clin. Microbiol. 36:3278-3284. |
| 18. | Kourilsky, P. 1983. Genetic exchanges between partially homologous nucleotide sequences: possible implications for multigene families. Biochimie 65:85-93.[Medline] |
| 19. | Kowalczykowski, S. C., D. A. Dixon, A. K. Eggleston, S. D. Lauder, and W. M. Rehrauer. 1994. Biochemistry of homologous recombination in Escherichia coli. Microbiol. Rev. 58:401-465. |
| 20. | Kuzminov, A. 1999. Recombinational repair of DNA damage in Escherichia coli and bacteriophage lambda. Microbiol. Mol. Biol. Rev. 63:751-813. |
| 21. | Lin, Q., N. Zhi, N. Ohashi, H. W. Horowitz, M. E. Aguero-Rosenfeld, J. Raffalli, G. P. Wormser, and Y. Rikihisa. 2002. Analysis of sequences and loci of p44 homologs expressed by Anaplasma phagocytophila in acutely infected patients. J. Clin. Microbiol. 40:2981-2988. |
| 22. | Lohr, C. V., K. A. Brayton, V. Shkap, T. Molad, A. F. Barbet, W. C. Brown, and G. H. Palmer. 2002. Expression of Anaplasma marginale major surface protein 2 operon-associated proteins during mammalian and arthropod infection. Infect. Immun. 70:6005-6012. |
| 23. | Lovett, S. T., R. L. Hurley, V. A. Sutera, Jr., R. H. Aubuchon, and M. A. Lebedeva. 2002. Crossing over between regions of limited homology in Escherichia coli. RecA-dependent and RecA-independent pathways. Genetics 160:851-859. |
| 24. | Lusetti, S. L., and M. M. Cox. 2002. The bacterial RecA protein and the recombinational DNA repair of stalled replication forks. Annu. Rev. Biochem. 71:71-100.[CrossRef][Medline] |
| 25. | Massung, R. F., J. H. Owens, D. Ross, K. D. Reed, M. Petrovec, A. Bjoersdorff, R. T. Coughlin, G. A. Beltz, and C. I. Murphy. 2000. Sequence analysis of the ank gene of granulocytic ehrlichiae. J. Clin. Microbiol. 38:2917-2922. |
| 26. | Mehr, I. J., and H. S. Seifert. 1998. Differential roles of homologous recombination pathways in Neisseria gonorrhoeae pilin antigenic variation, DNA transformation and DNA repair. Mol. Microbiol. 30:697-710.[CrossRef][Medline] |
| 27. | Ohashi, N., Y. Rikihisa, and A. Unver. 2001. Analysis of transcriptionally active gene clusters of major outer membrane protein multigene family in Ehrlichia canis and E. chaffeensis. Infect. Immun. 69:2083-2091. |
| 28. | Ohashi, N., A. Unver, N. Zhi, and Y. Rikihisa. 1998. Cloning and characterization of multigenes encoding the immunodominant 30-kilodalton major outer membrane proteins of Ehrlichia canis and application of the recombinant protein for serodiagnosis. J. Clin. Microbiol. 36:2671-2680. |
| 29. | Ohashi, N., N. Zhi, Y. Zhang, and Y. Rikihisa. 1998. Immunodominant major outer membrane proteins of Ehrlichia chaffeensis are encoded by a polymorphic multigene family. Infect. Immun. 66:132-139. |
| 30. | Oliver, D. 1985. Protein secretion in Escherichia coli. Annu. Rev. Microbiol. 39:615-648.[CrossRef][Medline] |
| 31. | Palmer, G. H., G. Eid, A. F. Barbet, T. C. McGuire, and T. F. McElwain. 1994. The immunoprotective Anaplasma marginale major surface protein 2 is encoded by a polymorphic multigene family. Infect. Immun. 62:3808-3816. |
| 32. | Petrovec, M., S. L. Furlan, T. A. Zupanc, F. Strle, P. Brouqui, V. Roux, and J. S. Dumler. 1997. Human disease in Europe caused by a granulocytic Ehrlichia species. J. Clin. Microbiol. 35:1556-1559.[Abstract] |
| 33. | Rikihisa, Y., S. A. Ewing, and J. C. Fox. 1994. Western immunoblot analysis of Ehrlichia chaffeensis, E. canis, or E. ewingii infections in dogs and humans. J. Clin. Microbiol. 32:2107-2112. |
| 34. | Rikihisa, Y., S. A. Ewing, J. C. Fox, A. G. Siregar, F. H. Pasaribu, and M. B. Malole. 1992. Analyses of Ehrlichia canis and a canine granulocytic Ehrlichia infection. J. Clin. Microbiol. 30:143-148. |
| 35. | Rikihisa, Y., N. Zhi, G. P. Wormser, B. Wen, H. W. Horowitz, and K. E. Hechemy. 1997. Ultrastructural and antigenic characterization of a granulocytic ehrlichiosis agent directly isolated and stably cultivated from a patient in New York State. J. Infect. Dis. 175:210-213.[Medline] |
| 36. | Sung, S. Y., J. V. McDowell, and R. T. Marconi. 2001. Evidence for the contribution of point mutations to vlsE variation and for apparent constraints on the net accumulation of sequence changes in vlsE during infection with Lyme disease spirochetes. J. Bacteriol. 183:5855-5861. |
| 37. | Telford, S. R., III, J. E. Dawson, P. Katavolos, C. K. Warner, C. P. Kolbert, and D. H. Persing. 1996. Perpetuation of the agent of human granulocytic ehrlichiosis in a deer tick-rodent cycle. Proc. Natl. Acad. Sci. USA 93:6209-6214. |
| 38. | Unver, A., N. Ohashi, T. Tajima, R. W. Stich, D. Grover, and Y. Rikihisa. 2001. Transcriptional analysis of p30 major outer membrane multigene family of Ehrlichia canis in dogs, ticks, and cell culture at different temperatures. Infect. Immun. 69:6172-6178. |
| 39. | van Dobbenburgh, A., A. P. van Dam, and E. Fikrig. 1999. Human granulocytic ehrlichiosis in western Europe. N. Engl. J. Med. 340:1214-1216. |
| 40. | Weber, K. L., M. E. Bolander, and G. Sarkab. 1998. Rapid acquisition of unknown DNA sequence adjacent to a known segment by multiplex restriction site PCR. BioTechniques 25:415-419.[Medline] |
| 41. | Zhang, J. R., J. M. Hardham, A. G. Barbour, and S. J. Norris. 1997. Antigenic variation in Lyme disease borreliae by promiscuous recombination of VMP-like sequence cassettes. Cell 89:275-285.[CrossRef][Medline] |
| 42. | Zhi, N., N. Ohashi, and Y. Rikihisa. 2002. Activation of a p44 pseudogene in Anaplasma phagocytophila by bacterial RNA splicing: a novel mechanism for post-transcriptional regulation of a multigene family encoding immunodominant major outer membrane proteins. Mol. Microbiol. 46:135-145.[CrossRef][Medline] |
| 43. | Zhi, N., N. Ohashi, and Y. Rikihisa. 1999. Multiple p44 genes encoding major outer membrane proteins are expressed in the human granulocytic ehrlichiosis agent. J. Biol. Chem. 274:17828-17836. |
| 44. | Zhi, N., N. Ohashi, T. Tajima, J. Mott, R. W. Stich, D. Grover, S. R. Telford III, Q. Lin, and Y. Rikihisa. 2002. Transcript heterogeneity of the p44 multigene family in a human granulocytic ehrlichiosis agent transmitted by ticks. Infect. Immun. 70:1175-1184. |
This article has been cited by other articles: