Previous Article | Next Article ![]()
Infection and Immunity, October 2004, p. 5574-5581, Vol. 72, No. 10
0019-9567/04/$08.00+0 DOI: 10.1128/IAI.72.10.5574-5581.2004
Copyright © 2004, American Society for Microbiology. All Rights Reserved.
Department of Veterinary Biosciences, College of Veterinary Medicine, The Ohio State University, Columbus, Ohio,1 Division of Viral and Rickettsial Diseases, Centers for Disease Control and Prevention, Atlanta, Georgia,2 Department of Veterinary Pathology, Veterinary Teaching Hospital, University of Liverpool, Leahurst, Neston, Wirral, United Kingdom,3 Vector Ecology Laboratory, Louis Calder Center, Fordham University, Armonk, New York4
Received 28 February 2004/ Returned for modification 20 May 2004/ Accepted 7 July 2004
|
|
|---|
|
|
|---|
Gene duplications underlie the diversification of genes, often resulting in the development of novel gene functions or pseudogenes that may help the organism adapt to new environments (28). The p44 multigene family of A. phagocytophilum consists of more than 100 nonidentical polymorphic paralogs, which encode P44s, 44-kDa major outer membrane proteins that are exposed on the cell surface (41). Anti-P44 antibodies can protect mice from experimental infection with A. phagocytophilum (16), and a recombinant P44 protein induces proinflammatory cytokines in human leukocytes in vitro (17); thus, P44s are likely to be important in the pathogenesis of A. phagocytophilum in its mammalian hosts. The diversity of paralogous p44 genes and the surface proteins they encode among strains may reflect differences among geographic and host-specific variants. Thus, several studies attempted to use p44 for typing A. phagocytophilum strains (4). However, due to the presence of a large number of diverse p44 paralogs per genome, it has been difficult to delineate strain-specific variations. Further characterization of this gene and protein family may help to clarify some of the mechanisms of functional divergence under the selection pressures of different environments and correlate A. phagocytophilum strains.
All p44 genes contain a central hypervariable region (hv) of approximately 280 bp, flanked by conserved 5'- and 3'-terminal sequences of 100 to 500 bp. Most p44 paralogs lack the translational start codon, ATG. The hv has four short conserved segments that divide it into three extremely diverse domains (20). Among several strains isolated from New York and Massachusetts, the hv sequences of each p44 ortholog are conserved across strains (12, 19, 20). Therefore, each individual p44 paralog or ortholog can be identified by its signature hv nucleotide sequence (1, 19, 20, 41). Similarly, the genomic loci of several p44 paralogs are conserved in several human isolates from New York State, as demonstrated by Southern blot analysis (19, 20, 41). Published data show that diverse p44 paralogs are expressed by the same strain or by different strains growing in different hosts (human, mouse, horse, and ticks) and under different environmental conditions (1, 12, 14, 19, 20, 40, 42). A. phagocytophilum has a unique polymorphic p44 expression locus (1, 19), and it has been proposed that the expression of all p44 paralogs takes place in the same locus by segmental gene conversion (1). However, it has also been shown that some p44s can be expressed at their own genomic loci (40, 41).
In the present study, we examined conservation of p44s among A. phagocytophilum strains. Since the genomic locus consisting of two tandem p44s, p44-1 and p44-18, was found to be strain polymorphic, we further examined the structure, geographic polymorphism, and transcription of this genomic locus. At this genomic locus the HZ strain (isolated from a human patient in New York State in 1995) expresses the p44-18 pseudogene but not the full-length gene p44-1 when cultivated in HL-60 cells at 37°C. The same expression pattern, which utilizes a novel RNA splicing mechanism, also occurs in vivo in infected horses (40). Here we studied the sequence of the p44-1/18 locus in 14 strains isolated from different geographical areas and from different mammalian and tick hosts and the p44 transcription at this locus in several strains.
|
|
|---|
|
View this table: [in a new window] |
TABLE 1. A. phagocytophilum isolates
|
Total DNA from nymphal Ixodes scapularis was purified as follows. Each tick was homogenized in a sterile glass depression slide with a glass rod. The homogenate was placed in a 1.5-ml tube with 75 µl of 10 mM Tris-EDTA buffer, pH 8.0, and 25 µl of proteinase K solution (20 mg/ml) was added. The mixture was incubated at 56°C for 30 min. To inactivate proteinase K, the sample was heated to 100°C for 60 min. After centrifuging at 2,000 x g for 10 min, the supernatant was transferred to a new 1.5-ml tube, and StrataClean resin was added following the manufacturer's instructions to separate DNA from proteins, including restriction enzymes and some modification enzymes (Stratagene, La Jolla, Calif.).
Reverse transcription-PCR and DNA-PCR. Reverse transcription-PCR was performed as described previously (20). Total RNA was extracted from 5 x 106 A. phagocytophilum-infected HL-60 cells (90 to 100% infection) with the RNeasy mini kit (Qiagen). After DNase I treatment, the isolated RNA (5 µg) was heated to 70°C for 10 min. Samples were then reverse transcribed at 42°C for 50 min in a 20-µl reaction mixture containing a 0.5 mM concentration of each deoxynucleoside triphosphate, 200 U of SuperScript II reverse transcriptase (Invitrogen), 200 ng of random hexamers, and 3 mM MgCl2. PCR was performed in a 50-µl reaction mixture containing 4 µl of the cDNA product or 5 µl of DNAs purified as above, 10 pmol of each primer, a 0.2 mM concentration of each deoxynucleoside triphosphate, 5 U of Taq DNA polymerase, and 1.5 mM MgCl2.
PCR conditions included 3 min of denaturation at 94°C, followed by 35 cycles consisting of 30 s of denaturation at 94°C, 1 min of annealing at 54°C, and 1 min of extension at 72°C. Gene-specific primers 13 to 15 are listed in Table 2. A BLAST search of the GenBank and A. phagocytophilum HZ strain databases showed that no other locus in the A. phagocytophilum genome anneals with these primer pairs. DNA PCR products were purified from the gel and cloned into a pCRII vector (Invitrogen). Twenty cDNA clones were randomly selected from the transformants and sequenced on an ABI 373XL Stretch DNA sequencer, with the ABI Prism BigDye terminator cycle sequencing reaction kit.
|
View this table: [in a new window] |
TABLE 2. Oligonucleotide primers for PCR and Southern blot analysis
|
-32P]dCTP with Ready-To-Go DNA labeling beads (Amersham Pharmacia Biotech) to make various p44 probes. Genomic DNA PCR products from various strains of A. phagocytophilum were subjected to electrophoresis on a 1% agarose gel and transferred to a positively charged nylon membrane (Amersham Pharmacia Biotech). Hybridization was carried out under high-stringency conditions (65°C) in rapid hybridization buffer (Amersham Pharmacia Biotech) as described elsewhere (20). After being washed, the membrane was exposed to Hyperfilm (Amersham Pharmacia Biotech) to visualize the locations of the hybridized bands. Nucleotide sequence accession numbers. The sequences identified in this study have been assigned the following GenBank accession numbers: p44-1/p44-18 locus of strain HZ, AY151054; p44-61/p44-18 locus of strain MN-1, AY319831; p44-61/p44-18 locus of strain MN-2, AY299122; and the p44-OS locus of strain OS, AY541008. The genome of A. phagocytophilum HZ was sequenced at the Institute for Genomic Research, and the sequences are available at www.tigr.org.
|
|
|---|
![]() View larger version (28K): [in a new window] |
FIG. 1. PCR and hybridization analysis of p44s in different strains. (A) Specificity of p44 PCR followed by p44 species-specific hybridization. p44-1, p44-2, p44-18, and p44-28 were amplified by gene-specific primer sets 2 to 5 (Table 2). The amplified products were resolved on 1% agarose gels containing ethidium bromide. After agarose gel electrophoresis, the amplicons were transferred to a nylon membrane and hybridized with 32P-labeled p44-1, p44-2, p44-18, and p44-28 probes. (B) p44 PCR and hybridization of different A. phagocytophilum human isolates from New York and Minnesota with four gene-specific probes. The strain names and probes used are shown above each panel. (C) p44 PCR of DNA from seven individual nymphal ticks from New York State and hybridization with four gene-specific probes.
|
DNA specimens (5 µl) from 7 of 45 nymphal ticks collected in Westchester County, N.Y., were p44-PCR positive with a p44 gene-specific primer pair (set 1 in Table 2). Since there are more than 100 copies of p44 per A. phagocytophilum genome (www.tigr.org) (21) and this PCR could detect
10 copies of p44, most likely PCR-negative ticks were not infected with A. phagocytophilum. All A. phagocytophilum genomes from the seven individual ticks had all four p44 species (Fig. 1C). Thus, our method was sensitive enough to detect p44 species present in A. phagocytophilum genomes from individual field-collected nymphal ticks. The A. phagocytophilum strains present in these ticks contained the same p44 species as human isolates from New York State, perhaps reflecting a geographic type.
Characterization of the p44-1/18 locus in northeastern United States isolates. In strain HZ, p44-1 is a single and complete gene that is 1,242 bp long. It occurs in tandem with a downstream p44-18 pseudogene (762 bp), which overlaps p44-1 by 22 bp with a frame shift of 1 bp (40). This tandem locus is referred to as p44-1/18. Since both p44-1 and p44-18 were detectable in all New York human and tick strains but only p44-18 was detectable in Minnesota human isolates, we analyzed the p44-1/18 genomic locus in various strains. In addition to the HZ strain, this locus had previously been identified by Southern blotting in strains NY-31, NY-36, and NY-37 (20). We designed a p44-1/18 locus-specific primer pair (set 6 in Table 2 and Fig. 2A) based on the region of the strain HZ p44-1/18 locus, starting 133 bp upstream of p44-1 and ending 77 bp downstream of the p44-18 pseudogene. The p44-1/18 locus was amplified with locus-specific primer pairs and sequenced for LL, a 1996 New York State isolate, and NY-31, NY-36, and NY-37, isolated in 2000. Sequencing results (2,230 bp) showed that the p44-1/18 loci of these four New York isolates were 100% identical to that of strain HZ isolated in 1995 (Fig. 2A). These results suggest that the p44-1/18 genomic locus has been conserved among the New York human isolates for at least 5 years.
![]() View larger version (18K): [in a new window] |
FIG. 2. Comparison of p44-1/18 genomic loci in geographically different isolates. (A) Schematic diagram of p44-1/18 genomic loci. p44-1, p44-18, p44-61, p44-MRK, and p44-OS hv regions are shown by different fill patterns. Flanking conserved regions are shown as gray bars. The p44-1 and p44-18 overlap region in northeastern isolates is shown as a small grid. p44-61 and p44-18 are separated by a 37-bp intergenic space in the p44-61/18 locus in Minnesota strains (white box). Primer set 6 was used to amplify the p44-1/18 locus in all northeastern strains; primer set 7 was used to amplify the p44-1/18 locus equivalents in the Minnesota, MRK, and OS strains. Primer sets 13 to 15 were used in transcriptional analysis of this locus. Amplicons are shown above the genetic map; the in-frame stop codon in p44-OS is shown by an asterisk. (B) Sequence comparison of the p44-1 (p44-61) and p44-18 junction in different United States isolates. Nucleotides shown on a black background are identical in the Minnesota, MRK, and Trustom strains. The boxed nucleotides (TCT) shown are the first in-frame codon of p44-18. The boxed nucleotides TAA are the stop codon for p44-1 in northeastern strains and for p44-61 in Minnesota and MRK strains. The boxed nucleotides with black background indicate the first in-frame codon (AGG) of p44-18. Gaps are shown by dashes.
|
2,230 bp) were conserved and showed an average of 99.6% identity to the locus in strain HZ (Fig. 2). In the Gaillard and AVK-HLPA1 isolates, p44-1 and p44-18 overlapped by 22 bp, as in strain HZ. The p44-1 and p44-18 junctional genomic region of the Trustom isolate differed from those of the other six strains by 30 bp, including an 8-bp insertion (AGTATTAT) (Fig. 2B).
Characterization of the p44-1/18 genomic locus in Minnesota strains and a California horse isolate, MRK.
This locus also existed in all three Minnesota strains, MN-1, MN-2, and MN-9 (19), as detected by p44-1/18 locus-specific PCR (primer set 7 in Table 2) and DNA sequencing. The locus (
2,000 bp) showed 99.7 to 99.9% identity among strains MN-1, MN-2, and MN-9. Notably in this locus, p44-1 was replaced by a new p44 species, p44-61, in all Minnesota strains, but p44-18 was conserved (Fig. 2 and 3). p44-61 was 1,167 bp long, with an ATG start codon and a TAA stop codon (Fig. 3). p44-61 and p44-1 showed only 83% nucleotide and amino acid identity. This explains why p44-1 was undetectable in the Minnesota strains but p44-18 was detectable by PCR hybridization analysis (Fig. 1B). The p44-18 ortholog in the Minnesota strains was 708 bp long and was located 37 bp downstream from p44-61 (Fig. 2) rather than overlapping the upstream cistron, as in HZ (40) and most other northeastern strains. There was an additional polymorphism in this locus. The three Minnesota strains had 30 bp, including an 8-bp insertion fragment, that were 100% identical to that of the Trustom (Rhode Island) strain (Fig. 2B). The p44-18 orthologs were completely conserved (base sequences 100% identical) between MN-2 and MN-9 and 99.9% identical between MN-1 and MN-2 and between MN-1 and MN-9. The p44-18 of the three Minnesota strains had 96% base sequence identity with HZ p44-18.
![]() View larger version (69K): [in a new window] |
FIG. 3. Sequence comparison of p44-61hv, p44-20hv, and p44-23hv in the Minnesota and MRK strains corresponding to the p44-61 region of Fig. 2. The chimeric regions are marked by gray and black backgrounds. The underlined sequence was conserved among the three p44s. Arrows indicate the gene-specific primers used to amplify the p44-20 hv and p44-23 hv. The locations of primer sets 8 and 9 (p3761-p44-20R and p44-20F-p4183) (Table 2), used to amplify the p44-20 hv, and primer sets 10 and 11 (p3761-p44-23R and p44-23F-p4183) (Table 2), used to amplify the p44-23 hv, are shown.
|
Alignment of p44-61 with previously identified p44 homologs suggested that this p44 is derived from two other p44 family members, p44-23 and p44-20, which were originally identified in all New York State strains, including HZ, LL, NY-31, NY-36, and NY-37 (20, 36, 43). To determine whether p44-20 and p44-23 exist in the Minnesota and MRK strains, these two p44 species were amplified with gene-specific primer sets 8 to 11 (Table 2 and Fig. 3) in the Minnesota and MRK strains and sequenced. The p44-20hv and p44-23hv sequences were present and identical between the Minnesota and MRK strains and had 95 to 98% identity to p44-20 and p44-23 of strain HZ at the nucleotide level. Sequence comparison showed that the hv region of p44-61 was a chimera of partially duplicated p44-20 and p44-23 present within the same genome of each strain. Specifically, in both the Minnesota and MRK strains, nucleotides 543 to 739 of p44-61 were identical to those of p44-20, and nucleotides 724 to 843 of p44-61 were identical to those of p44-23. In each strain, p44-61, p44-20, and p44-23 showed 17 bp of absolutely conserved sequence within the hv regions (nucleotides 724 to 740, GGTAAAAACTGGCCTAG; amino acid sequence GKNWP) (Fig. 3). This sequence included the absolutely conserved GCCTAG (amino acid sequence WP) in one of the four highly conserved regions within the hv region in p44 homologs (20).
p44-1/18 genomic locus in an ovine strain from the United Kingdom. A. phagocytophilum (previously known as Ehrlichia [Cytoecetes] phagocytophila) causes tick-borne fever in sheep and cattle and is transmitted by Ixodes ricinus ticks in the United Kingdom and mainland Europe (13). The p44-1/18 locus was also amplified in strain OS with locus-specific primer set 7 (Table 2). The downstream 55-bp sequences upstream of the 3' primer of primer set 7 were 100% identical between HZ and all 14 U.S. and Scotland OS strains (Fig. 2). The upstream primer of pair 6 (Table 2) did not amplify the p44-OS locus, indicating that its 5'-flanking region was different from those of the U.S. strains. The PCR product was a single band of approximately 1,200 bp. Sequencing revealed only one p44, p44-OS, in this genomic locus in strain OS. p44-OS was a truncated p44 paralog with an in-frame stop codon at 233 bp. p44-OS was 1,068 bp long, including the stop codon; it had 74.7% identity to strain HZ p44-1 at both the nucleotide and amino acid levels. p44-OS was not found by a BLAST search of the A. phagocytophilum HZ genome database (www.tigr.org) or the GenBank database. p44-1 and p44-18 were undetectable in strain OS by PCR (primer sets 2 and 4 in Table 2).
Transcription at the p44-1/18 genomic locus varies among strains. p44-1/18 in strain HZ is a unique genomic locus that can undergo posttranscriptional splicing (40). Only the p44-18 transcript is expressed at this genomic locus in strain HZ-infected HL-60 cells at 37°C because transcribed RNA is spliced to remove a part of p44-1; at 24°C, on the other hand, primarily p44-1 is expressed (40). We studied transcription from the p44-1/18 locus in the three New York strains from 2000, because we did not find p44-18 expression in these strains in our previous study (20) despite the presence of an identical p44-1/18 locus between New York strains and strain HZ.
By reverse transcription-PCR with primer pairs specific to this locus and the individual genes within it (primer sets 13 to 15 in Table 2) (40), p44-1 was expressed by all three New York strains growing in HL-60 cells at 37°C (Fig. 4A). The p44-18 transcript was undetectable in these three isolates with both locus-specific and p44-generic primers (primer sets 1 and 15 in Table 2; Fig. 2, Fig. 4A). p44-61 was expressed at this genomic locus in the three Minnesota strains cultured in HL-60 cells at 37°C (Fig. 4B). These results suggest that transcription of p44-1/18 varies among strains. The results were in agreement with our previous observation (40) that p44s can be expressed in genomic loci distinct from the polycistronic p44 expression locus (1, 19).
![]() View larger version (29K): [in a new window] |
FIG. 4. Reverse transcription-PCR analysis of p44 expression in the p44-1/18 and p44-61/18 loci. (A) p44-1 and p44-18 expression in the p44-1/18 locus of strains NY-31, NY-36, and NY-37. (B) p44-61 and p44-18 expression in the p44-61/18 locus of strains MN-1, MN-2, and MN-9. The DNA template control included 0.1 ng of genomic DNA from each strain. RT+ and RT indicate the presence and absence of reverse transcriptase, respectively.
|
|
|
|---|
So far, the 16S rRNA gene sequences of A. phagocytophilum strains infecting humans have been identical. However, divergent (98.8%) 16S rRNA gene sequences were found in ticks and animals (25, 27, 33). Generally, human isolates can infect laboratory mice, but Massung et al. (24) reported that strains isolated from wild deer cannot infect laboratory mice. Analysis of the p44 profiles of these wild deer isolates would facilitate better understanding of the functional divergence of p44 genes. The diversity in host mammalian species adaptation suggests genetic heterogeneity among strains. Our present study clearly demonstrates diversity among A. phagocytophilum strains at the p44-1/18 genomic locus. Intraspecies variation of p44 genes may contribute to our understanding of the role of p44s in immunoprotection, pathogenesis, and perhaps the mammalian host specificity of A. phagocytophilum.
The classic model for the evolution of a multigene family involves gene duplication followed by divergence. One mechanism of p44 molecular evolution discovered in this study was recombination among conserved hv sequences with a short conserved sequence within the hv region. The identical sequence found within the same genome suggests recombination within the same genome after the emergence of p44-20 and p44-23. Although it appears that p44-1 and p44-18 are under the same transcriptional control (operon), the present study indicated that each p44 paralog may have evolved at different rates, suggesting that each p44 may be under different selective pressure. For example, p44-1 appeared to be evolving more rapidly than p44-18 and perhaps p44-2, p44-28, p44-20, and p44-23. p44-18 was conserved at this genomic locus in all United States strains. The shared p44-61/18 in the Minnesota and MRK strains indicates common ancestry of the human and equine strains in the western and northwestern United States. Although the corresponding genomic locus was found in the European sheep isolate OS, neither p44-1 nor p44-18 was detected. Thus, the OS and U.S. strains may have diverged much earlier, before the divergence of the western and midwestern strains from the northeastern strains in the United States These results suggest that the p44-1/18 locus exemplifies independent evolution of p44 genes in geographically separate A. phagocytophilum strains.
The unique p44-1/18 genomic loci may thus be useful as markers to distinguish strains from different geographic areas. Other methods used to identify genetic strains have proven problematic. Although the 16S rRNA, groESL, and ank genes show sequence variability among isolates from humans, ticks, wild animals, horses, sheep, and goats (5, 8, 15, 18, 22, 23, 37), pulsed-field gel electrophoresis failed to distinguish seven North American strains from diverse states and host species (9). Restriction fragment length polymorphism analysis of PCR-amplified p44 genes has shown p44 polymorphism among different isolates; however, multiple p44s per genome generate overlapping banding patterns that might make regional strain typing difficult (3). Our results suggest that p44-1/18 and corresponding genomic loci differ significantly among geographically separated strains and are conserved within each region; thus, p44-1/18 has the requisite properties of a geographic strain marker for A. phagocytophilum.
The expression of p44 at an additional genomic locus different from the polymorphic p44 expression locus (1, 19) may result in more flexible control of transcription than a single expression site. The present study further confirmed p44 transcription at the p44-1/18 genomic locus in six additional strains. The p44-1/18 locus is self-expressing, suggesting that it is under selective pressure different from that of paralogs expressed from the common expression locus. Further comparative analysis of other self-expressing p44s may provide insights into differential p44 evolution.
|
|
|---|
This article has been cited by other articles:
| |||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Copyright © 2009 by the American Society for Microbiology. For an alternate route to Journals.ASM.org, visit: http://intl-journals.asm.org | More Info»