Previous Article | Next Article ![]()
Infection and Immunity, September 2006, p. 5272-5283, Vol. 74, No. 9
0019-9567/06/$08.00+0 doi:10.1128/IAI.00546-06
Copyright © 2006, American Society for Microbiology. All Rights Reserved.
Patricia Antalis,1
John Gladitz,1
Azad Ahmed,1
Bethany Dice,1
Benjamin Janto,1
Richard Dopico,1
Randy Keefe,1
Jay Hayes,1
Sandra Johnson,1
Sujun Yu,1
Nathan Ehrlich,1,
Jennifer Jocz,1,
Laura Kropp,1,
Ray Wong,1,
Robert M. Wadowsky,3
Malcolm Slifkin,4,¶
Robert A. Preston,1
Geza Erdos,1,
J. Christopher Post,1,2
Garth D. Ehrlich,1,2* and
Fen Z. Hu1,2
Center for Genomic Sciences, Allegheny-Singer Research Institute, Pittsburgh, Pennsylvania 15212,1 Department of Microbiology and Immunology, Drexel University College of Medicine, Allegheny Campus, Pittsburgh, Pennsylvania 15212,2 Children's Hospital of Pittsburgh, Pittsburgh, Pennsylvania 15213,3 Clinical Microbiology Laboratory, Department of Pathology, Allegheny General Hospital, Pittsburgh, Pennsylvania 152124
Received 4 April 2006/ Returned for modification 25 May 2006/ Accepted 23 June 2006
|
|
|---|
1.4 kb) from this library demonstrated that 348 (10.8%) of the clones were unique with respect to all genomic sequences of the P. aeruginosa prototype strain, PAO1. Hypothetical translations of the open reading frames within these unique sequences demonstrated protein homologies to a number of bacterial virulence factors and other proteins not previously identified in P. aeruginosa. PCR and reverse transcription-PCR-based assays were performed to analyze the distribution and expression patterns of a 70-open reading frame subset of these sequences among 11 of the clinical strains. These sequences were unevenly distributed among the clinical isolates, with nearly half (34/70) of the novel sequences being present in only one or two of the individual strains. Expression profiling revealed that a vast majority of these sequences are expressed, strongly suggesting they encode functional proteins. |
|
|---|
Many P. aeruginosa clinical isolates are multiply resistant to antibiotics at concentrations achievable in vivo through systemic administration, and single or combination antimicrobial therapy fails in 5% to 70% of instances, depending on the series. The basic mechanisms of antibiotic resistance, including alteration of the drug target, prevention of drug access to the target, and drug inactivation, mostly arise via the acquisition of exogenous genetic material through horizontal gene transfer (HGT) mechanisms (29, 58). These studies suggested that other genes associated with virulence may also be commonly exchanged among the pseudomonads and even among phylogenetically diverse genera via similar mechanisms. Studies of multiple clinical isolates of both the gram-negative Haemophilus influenzae and the gram-positive Streptococcus pneumoniae demonstrated that both of these naturally transformable obligate human pathogens have a population-based supragenome that is multiple times the size of a given bacterium's genome. Moreover, it has been shown that the infecting populations of these species are polyclonal in nature and that recombination takes place among the strains during infection, leading to the development of novel strains (17, 33, 44, 45, 47-49; J. Hogg et al., unpublished data). We wished to determine if the distributed genome hypothesis (DGH) (15) and its associated virulence corollary were also applicable to bacterial species that had adapted to a broader array of ecological niches.
The natural population structure of P. aeruginosa infections is not fully understood with regard to genotypic clonality, but there is consensus that this organism can form heterogeneous three-dimensional biofilms and that its life cycle is complex, composed of multiple discrete stages including planktonic, reversible attachment, irreversible attachment, biofilm growth and maturation, and active dispersion wherein planktonic showers vent from mature biofilm structures (40, 61). It is also clear that P. aeruginosa biofilms are composed of multiple types of differentiated cells (6, 23, 40) even when grown in vitro from a clonal population. Thus, these cells have the intrinsic capacities for development and differentiation, phenomena long believed to be unique to eukaryotes (38). Clearly, their ability to undergo myriad phenotypic transformations provides the population as a whole with profound resistances to both host defenses and antibiotics under clinical conditions (9, 11, 14, 15, 43). A further layer of complexity may result if natural infecting populations are polyclonal. Such a scenario would provide for continuous reassortment of genic characters among strains during the infectious process that would result in the production of new strains, some of which would have a reproductive advantage in the host environment.
It has long been known that the various P. aeruginosa strains display different phenotypic characters, but the mechanisms underlying these differences could not definitively be ascribed to expression or genomic heterogeneity. Serotyping of P. aeruginosa has been used for nearly a century to differentiate among strains (3), and multiple schemata have been developed to aid in this process. Liu et al. have identified 20 P. aeruginosa serotypes based on the structure of the lipopolysaccharide (30, 31), but this method lacks sufficient discriminating power, identifies differences at only a very limited set of loci, and is not applicable to rough-type colonies. Thus, more comprehensive methods of strain discrimination using genomic fingerprinting approaches have been developed including, restriction fragment length polymorphism typing, pulsed-field gel electrophoresis, and PCR-based methods. Each of these techniques is capable of discriminating among large numbers of strains (52); however, none of these methods is capable of identifying the individual coding elements that compose a genome, and so it is not possible to distinguish between allelic and genic differences using these techniques.
Comparative whole-genome DNA sequence analysis provides the most comprehensive means to identify differences among bacterial isolates. The complete genomic sequences of two P. aeruginosa strains have been determined: PAO1, the prototype lab strain (54), and UCBPP-PA14, a pathogenic strain (53). Comparative evaluations of these strains have revealed that each strain contains unique genes with respect to the other. Whole-genome comparisons, while ideal, are currently impractical under most circumstances, especially for organisms with larger genomes, but the random sequencing of clones from multiple strains compared against a fully sequenced prototype strain can provide an estimate of the genomic plasticity within a species (44, 45). Recently, Spencer et al. partially sequenced (average coverage of
0.5x) the genomes of three P. aeruginosa strains, two CF isolates and one aquatic isolate, wherein they showed that at least 10% of genes were novel with respect to PAO1.
The current study was designed to estimate the degree of genomic plasticity (i.e., genic differences, as opposed to allelic differences) among human pathogenic strains of P. aeruginosa. This was accomplished through the random sequencing of clones from a highly redundant, pooled genomic library prepared from 11 low-passage clinical isolates obtained from patients with otorrhea (n = 6) or other chronic infections (n = 5) and 1 clinical strain from the American Type Culture Collection (ATCC).
|
|
|---|
TOP10 One Shot Escherichia coli competent cells (Invitrogen Corp., California) were grown in LB broth or on LB agar. Ampicillin (50 µg/ml) or kanamycin (30 µg/ml) was supplemented in the culture medium for clone selection.
Isolation of bacterial genomic DNA and total RNA. Genomic DNA was extracted essentially as described elsewhere (4). Briefly, cells were collected by centrifugation from 100-ml overnight cultures and resuspended in TE buffer (10 mM Tris-HCl, pH 8.0; 1 mM EDTA, pH 8.0). Bacteria were lysed by the addition of sodium dodecyl sulfate (SDS; Invitrogen) to a final concentration of 0.5%. The lysates were then digested with RNase A (50 µg/ml; Gentra Systems, Inc., Minneapolis, MN) at 37°C for 1 h and then with proteinase K (100 µg/ml; Invitrogen) at 37°C for 1 h. Cetyltrimethylammonium bromide (Sigma) was added to a final concentration of 1%, and the mixture was incubated at 65°C for 20 min. Following a chloroform-isoamyl alcohol (24:1) extraction, the DNA was precipitated from the aqueous phase with 0.6 volume of isopropanol. The DNA was then pelleted by centrifugation and washed with 70% ethanol. After air drying, the pellets were resuspended in TE buffer at 65°C for 1 h. The purified DNA was tested for quantity and quality by UV absorbance at 260/280 nm (DU650 spectrophotometer; Beckman) and agarose gel electrophoresis, respectively.
Total bacterial RNAs were extracted using the hot phenol method (10). Briefly, stationary-phase cultures (optical density at 600 nm of 4) were treated with a cell stop solution (1 volume of buffer-saturated phenol, pH 6.6, and 19 volumes of 100% ethanol), and the bacteria were pelleted at 5,500 x g for 5 min at 4°C. The pellet was resuspended in cell resuspension solution (10 mM KCl, 5 mM MgCl2, 10 mM Tris, pH 7.5) followed by the addition of fresh hot phenol solution (five parts hot phenol buffer [400 mM NaCl, 40 mM EDTA, 1% ß-mercaptoethanol, 1% SDS, 20 mM Tris pH 7.5] and one part buffer-saturated phenol [pH 6.6; Ambion, Inc., Austin, TX]), heated at 95°C for 2 min, and centrifuged at 20,000 x g for 15 min at room temperature. The aqueous layer was recovered and extracted, once with acid phenol-chloroform (5:1) and then twice with chloroform, followed by centrifugation at 20,000 x g for 15 min at room temperature. The RNA was precipitated with isopropanol, placed on dry ice for 10 min, and centrifuged at 16,000 x g for 15 min at 4°C. The RNA pellet was washed twice with 70% ethanol and resuspended in 1x TE buffer at 37°C for 30 min followed by treatment with TURBO DNase (Ambion). RNA quality was checked using an Agilent 2100 Bioanalyzer with the RNA 6000 Nano assay kit (Agilent Technologies, Palo Alto, CA).
Construction of the pooled genomic library.
A pooled genomic library was constructed using equimolar quantities of DNA from each of 12 clinical isolates of P. aeruginosa as described previously (16, 46). Briefly, the genomic DNA from each isolate was hydrodynamically fragmented to an average size of
1.5 kb using the HydroShear apparatus (GeneMachines, San Carlos, CA) according to the manufacturer's instructions. Aliquots (10 µg each) of the sheared DNA preparations were pooled, end repaired, ligated into the vector pCR4Blunt-TOPO, and transformed into E. coli TOP10 competent cells according to the manufacturer's protocol (Invitrogen Corp., Carlsbad, CA). The transformants were plated onto 22-cm2 LB agar plates (Genetix, Christchurch, United Kingdom) at densities between 2,000 and 3,000 colonies per tray and incubated at 37°C for 16 h. A total of 241,152 bacterial colonies were picked and arrayed using the CGS multifunctional three-axis Q-bot (Genetix Limited, United Kingdom). Arrayed clones were chosen randomly for further analysis.
DNA sequencing. Plasmid DNA templates for sequencing were prepared using the QIAprep Miniprep kits (QIAGEN Inc., Valencia, CA) on a Beckman FX robot (Beckman Instruments), digested with EcoRI (Invitrogen), and analyzed on ethidium bromide-stained 1% agarose gels in Tris-acetate-EDTA buffer. Only constructs containing inserts of >0.5 kb were used as sequencing templates. Dideoxy sequencing was performed as described for both LiCor IR2 Gene ReadIR instruments (LiCor, Inc., Lincoln, Nebr.) and Beckman CEQ 2000 XL automated capillary electrophoresis sequencing instruments (Beckman Coulter, Inc., Fullerton, CA) (44). Confirmatory sequencing was performed using an ABI 3730xl DNA analyzer (Applied Biosystems Inc., Foster City, CA) in which the sequencing reactions were prepared in 3-µl volumes, using the Parallab 350 nanoliter genomic work station (Brooks Automation, Inc., Chelmsford, MA). For Parallab-based sequencing, the reaction mixtures consisted of the following: (i) 1.4 µl of plasmid template (approximately 100 ng DNA); (ii) 0.5 µl of 10-pmol/µl primer; and (iii) 1.1 µl from the BigDye Terminator v3.1 cycle sequencing kit (Applied Biosystems). Reaction aliquots of 500 nl were then thermal cycled and purified within the Nano-Pipetter of the Parallab. Cycling conditions were as follows: 35 cycles with a 0-s denaturation step at 96°C, a 0-s annealing step at 50°C, and a 60°C extension step for 45 s. The sequencing reactions were then run on the ABI 3730xl and analyzed using the ABI Analysis software v.5.1.
DNA sequence analysis. Sequencing results were analyzed and contigs were determined using Sequencher (version 4.1.4; Gene Codes Corporation, Ann Arbor, MI). DNA sequence similarity searches utilizing the BLASTn and BLASTx algorithms (2) were performed using the CGS high-speed BLAST cluster (J. Gladitz, G. Erdos, and S. Yu, unpublished data) that is automatically updated on a weekly basis from the National Center for Biotechnology Information website (http://www.ncbi.nlm.nih.gov/). This system, including a custom-designed software package, automatically performs vector trimming, sequence quality checks, and BLAST homology searches, permitting fast and accurate analyses of thousands of clones daily.
PCR-based gene distribution studies. Primer pairs were designed using Oligo 6.65 (Molecular Biology Insights, Cascade, CO) and Vector NTI 9.0 (Invitrogen) for the performance of distribution studies among the parent strains for the 70 unique genes. Primer sequences are available on the CGS website (Supplementary Table 1 at www.centerforgenomicsciences.org). Genomic DNAs isolated from each of the clinical strains were used as amplification templates to determine which clinical strains possessed each of the non-PAO1 sequences. Positive controls for each genomic template DNA were also performed using primers specific for the P. aeruginosa polyphosphate kinase (ppk) and glyceraldehyde-3-phosphate dehydrogenase (GAPDH) genes (see Supplementary Table 1 at the CGS website). Amplifications were performed in a volume of 12.5 µl containing 0.3 units of Taq DNA polymerase, 50 ng of template DNA, 5 pmol of each primer, 1.5 mM MgCl2, and 0.4 mM deoxynucleoside triphosphates and using the Eppendorf MasterTaq kit (Brinkman Instruments, Inc., Westbury, NY). PCR was performed using a suite of Perkin-Elmer 9600 thermal cyclers programmed as follows: 1 cycle at 95°C for 2 min; 35 cycles at 94°C for 30 s, 55°C for 30 s, and 72°C for 30 s; 1 cycle for 7 min at 72°C; and then a 4°C hold. The PCR products were electrophoresed through 1.75% agarose gels, stained with ethidium bromide, photographed with a Kodak Image Station 440 UV light scanner (NEN Life Science Products, Inc., Boston, MA), and analyzed with Kodak 1D Image Analysis software (Eastman Kodak Company, Rochester, NY).
|
View this table: [in a new window] |
TABLE 1. Distribution of non-PAO1 DNA sequences and RNA transcripts among P. aeruginosa clinical strains via PCR and RT-PCR
|
Southern blot analysis.
P. aeruginosa genomic DNAs were isolated from liquid cultures as described above, digested with EcoRI, and electrophoresed into 1% agarose gels. Each gel also contained one lane with a pool of the unique plasmid clones for which we were probing, which served as a positive control. The DNAs were transferred to positively charged nylon membranes by capillary blotting using 0.4 M NaOH (Amersham Pharmacia Biotech, Buckinghamshire, United Kingdom) according to the method of Southern (51). Probes for the unique genes under study were produced by PCR-based amplification of the corresponding plasmid inserts, followed by purification of the amplimers by using the QIAquick PCR purification kit (QIAGEN). Radioactive labeling of the probes was performed using the random primer DNA labeling system (Invitrogen) according to the manufacturer's instructions. Probes were purified via gel exclusion chromatography (G-50 Sephadex columns; Roche Diagnostics, Indianapolis, IN). Specific activity was measured in a Bioscan QC4000XER counter (Bioscan, Washington, D.C.). A 30-min prehybridization of the filters was carried out at 42°C (5x SSC [1x SSC is 0.15 M NaCl plus 0.015 M sodium citrate], 5x Denhardt solution, 50% [wt/vol] formamide, 1% [wt/vol] SDS) with heat-denatured sheared salmon sperm DNA (Sigma) added immediately before incubation. Hybridization of the novel gene probes to the blotted DNAs was accomplished by adding
2 x 107 dpm of heat-denatured probe to each prehybridization reaction mixture and incubating at 42°C overnight. Following hybridization, the membranes were washed in 2x SSC-0.1% (wt/vol) SDS for 5 min at room temperature three times, then in 0.2x SSC-0.1% (wt/vol) SDS at room temperature for 15 min, then in 0.2x SSC-0.1% (wt/vol) SDS at 42°C for 15 min, and finally in 0.1x SSC-0.1% SDS at 68°C for 15 min. The membranes were then autoradiographed using Kodak XAR film.
Nucleotide sequence accession numbers. The 70 novel nucleotide sequences reported in this paper have been deposited in GenBank with the following accession numbers: CL422968 to CL422970, CL422972 to CL422973, CL422976 to CL422983, CL440257 to CL440259, CL440261 to CL440265, CL440267 to CL440270, CL440272 to CL440280, and DU708885 to DU708920.
|
|
|---|
96% contained plasmids. Sequencing of 3,453 randomly chosen clones demonstrated that
93% contained inserts of >100 bp, suggesting a functional library of
216,000 clones. The average insert size was 1.4 kb, resulting in a library of
3.0 x 108 bp of DNA, which translates to
4x coverage of each of the 12 genomes, assuming a genome size of 6.2 Mb. The average readable length for each sequencing reaction was
650 bases. For clone sequences that were longer than 1.3 kb, primer walking was used to sequence the interior of the fragment. Identification of novel (non-PAO1) sequences from clinical isolates. The nucleotide sequences from the 3,214 clones containing inserts of >100 bp were analyzed by BLAST against the reference PAO1 genome (54) to identify novel sequences contained within the genomes of the 12 clinical isolates. A total of 89.2% (2,866/3,214) of the clones revealed a minimum of 350 bp of contiguous homology (at least 80% nucleotide identity) to PAO1 and were classified as PAO1-like; however, most of these sequences showed allelic differences, including small indels and point mutations. The remaining 348 (10.8%) sequences were classified as novel and did not contain any blocks of nucleotide homology (>100 bases) when compared with all known sequences in the GenBank database at the time of the initial search. BLASTx searches using conceptual translations of the novel sequence ORFs identified varying degrees of similarity to proteins from throughout the prokaryotic domain (see Supplementary Table 2 at the CGS website). Some of the novel sequences displayed very low levels of similarity to any known proteins and thus may encode novel peptides with new functions not previously recognized, whereas other sequences showed relatively higher similarity to known virulence factors or other structural or metabolic proteins of known function. Seventy high-quality sequences were randomly selected for genic distribution studies among the component strains that make up the pooled library to gauge the degree of interstrain genomic plasticity. Among these unique sequences, we identified a range of hypothetical genes by ORF analysis that are predicted to encode phage proteins or virulence-related determinants, as well as proteins associated with cellular metabolism. Below we describe a subset of these putative novel gene products.
|
View this table: [in a new window] |
TABLE 2. Summary of distribution of non-PAO1 DNA sequences and RNA transcripts among P. aeruginosa clinical strains
|
(i) Phage and phage-like sequences.
Hypothetical phage genes were the most prevalent class of novel clones, with 20/70 (29%) of the unique sequences demonstrating some level of protein similarity to known or putative phage genes (Table 1; see also Supplementary Table 2 at the CGS website). Bacteriophages play a critically important role in HGT among the pseudomonads, as they are incompetent for natural transformation (38, 42). Over 60 temperate bacteriophages have been isolated from P. aeruginosa (1, 57), all of which can integrate their genomes as well as transduce genes from their previous hosts into their new host's genome via transposition. Clone L001_PA_0515_A13 contains four ORFs with between 84 and 94% amino acid identity to the Pseudomonas phage D3; however, the nucleotide homology was too low in all cases to identify a match. This phage contains an O-acetylase gene (encoded by ORF28) that adds an acetyl group to the lipopolysaccharide O-antigen which results in a serotype switch from O5 to O16 (27) in the presence of D3-encoded
-polymerase inhibitor and a ß-polymerase gene (34). The ORFs represented in our clone are ORF21, ORF22, ORF25, and ORF26, suggesting that ORFs 23 and 24 have been deleted. We also identified six ORFs (24, 53, 59, 60, 62, and 63) from our random clone sequencing that showed limited protein similarity to the phage F116. Clone L001_PA_0166_M20 appears to be a chimeric gene that resulted from recombination between two known integrases, (ORF100) from bacteriophage pf1 and an integrase present in PA0728. Clone L001_PA_0022_J20 contains two ORFs with 86% and 88% amino acid identity to proteins of phage B3, and clone L001_PA_0021_F02 contains an ORF whose hypothetical translation would produce a protein with 95% amino acid identity to the CP7 gene of the plasmid pKLC102, which can integrate into the genome of P. aeruginosa C strains (24) at the tRNALys gene. Clone L001_PA_0018_I13 contains an ORF whose highest similarity (70% amino acid identity) is to the NLP/P60 amidohydrolase gene in Pseudomonas syringae. This protein acts on
-glutamyl-containing substrates such as peptidoglycans (41); however, most of the identified protein homologs for this gene are phage tail assembly proteins, such as amidase protein K of phage lambda. Moreover, the adjacent ORF in this clone shows similarity to another phage tail protein, suggesting that both ORFs are probably phage encoded.
(ii) Novel soxR-like gene. Clone L001_PA_0002_J02 contains an ORF that encodes a hypothetical protein with homology (59% amino acid identity, 75% similarity) to the putative SoxR protein of P. putida KT2440 (see Supplementary Table 2 at the CGS website). This gene and its transcripts were identified in 3 out of 11 of the clinical strains. In PAO1 there is another soxR gene (PA2273) that controls a six-gene regulon in response to oxidative stress (36); this gene is also required for virulence in a mouse model of intrapulmonary infection and is highly induced in P. aeruginosa strain PAK in the mouse burn model (20). The soxR-like clone identified here has only a short region (nucleotides 92 to 168) of limited nucleotide homology (75%) to the A2273 gene, indicating that it is most likely a unique gene within the same gene family.
Orthologs of this gene in other bacterial species, including E. coli, also encode transcriptional regulators associated with superoxide responsiveness (22) which in turn activate another transcriptional regulator, soxS, that activates a regulon that includes dozens of oxidative and nitric oxide stress response genes (12, 39). P. aeruginosa does not have a soxS homolog, suggesting that other genes, perhaps soxR homologs, may function in additional oxidative stress pathways in this species (25).
(iii) hlyD-like gene.
Clone L001_PA_0020_A06 contains an ORF encoding a hypothetical protein that shares similarity to the putative HlyD protein (50% amino acid identity, 64% similarity) of the marine oligotroph Sphingopyxis alaskensis RB2256. The hlyD gene product in E. coli is known to be part of an oligomeric protein that also contains the products of the hlyB and -C genes and that is essential for the secretion of
-hemolysin, a cytotoxin encoded by the hlyA gene (37). There are numerous hlyB-, -C-, and -D-like genes throughout the bacterial domain that are associated with various membrane transporter mechanisms for the secretion of extracellular proteins (19), and it is likely that this novel member of this gene family is part of a unique transport complex.
(iv) alkS-like gene. The hypothetical translation of the ORF in L001_PA_0613_O07 produces a protein with 99% amino acid identity to AlkS of Pseudomonas putida. AlkS is a regulator of the gene cluster alkBFGHJKL, which encodes a group of proteins involved in alkane utilization (7, 55). The alk genes were first characterized in P. putida (strain Gpo1), and since then multiple species have been found to contain the alkane hydroxylase system (50, 56, 60). However, only a few isolated genes from this regulon have been found in any of the P. aeruginosa strains, and alkS has never been described. The sequence of our putative alkS gene is only 44.8% GC, which is in stark contrast to the 67% GC content of the P. aeruginosa genome overall, and the alk genes in P. putida only have GC contents of between 44 and 47%, versus 61.5% for the P. putida genome. Collectively, these data suggest that this gene family may have recently entered the pseudomonal supragenome; this is further supported by the observation that of the 11 clinical strains examined, only W28869 possesses this gene and its transcript.
(v) Novel nhaA-like sequence. Clone L001_PA_0607_O20 contains a partial ORF that could encode an NhaA-like Na+/H+ antiporter. Like the putative alkS gene, this gene was found only in strain W28869. The greatest degree of homology was observed with Nocardioides sp. strain JS614 (48% amino acid identity, 63% similarity). Na+/H+ antiporters are ubiquitous membrane proteins in nearly all cells, from bacteria to humans, that play a major role in pH and Na+ homeostasis by exchanging Na+ (or Li+) for H+ (35). Several Na+/H+ antiporters have been recognized in prokaryotic cells, including NhaA, NapA, NhaP, NhaC, NhaD, and NhaB. In P. aeruginosa, Na+/H+ antiporter genes nhaB and nhaP correspond to PA1820 and PA3887 in the PAO1 genome separately (GenBank) (28).
(vi) PAPS reductase gene-like sequence. Clone L001_PA_0014_F23 harbors an ORF which has limited protein homology (27% amino acid identity, 46% similarity) to the phosphoadenosine 5'-phosphosulfate (PAPS) reductase proteins of Trichodesmium erythraeum IMS101. There are two systems for assimilatory sulfate reduction: (i) plants, algae, and phototrophic bacteria utilize adenosine 5'-phosphosulfate (APS), and (ii) chemotrophic bacteria and fungi use PAPS. APS reductase and PAPS reductase enzymes are key elements of their respective metabolic systems (8, 26). It was shown previously that P. aeruginosa uses a plant-like APS rather than PAPS for its sulfate assimilation (5). Although the PAO1 cysH gene (PA1756) was annotated as a PAPS reductase, it shows no observable homology to the F23 clone, which does have a conserved SGGKDS motif that is seen in many PAPS reductases, including one from P. flluorescens. These observations suggest that some P. aeruginosa strains may be able to use both systems; this gene and its transcript were found only in the Pitt E strain.
Distribution of novel (non-PAO1) DNA sequences. PCR and RT-PCR-based assays were carried out to analyze the distribution and expression patterns for 70 of the novel DNA sequences among the 11 clinical strains (Table 2). Genomic DNA and RNA from the reference strain PAO1 were used as negative controls. The P. aeruginosa polyphosphate kinase (ppk) and GAPDH genes were used as positive controls and were successfully amplified from all 11 clinical strains and PAO1. All distribution and expression assays were performed in quadruplicate, and a positive call required that at least 3/4 of the independent results were positive. Our PCR-based distribution analyses are theoretically limited by the fact that divergent alleles of the novel genes that vary significantly at the primer-binding sites could produce a false-negative result, which would lead to an underestimation of their penetration into the pseudomonal gene pool. Thus, we performed Southern hybridizations for six of the clones as a test of the PCR results. Overall, these two methods were highly corroborative, but there were a few minor differences (Table 3). Nevertheless, it remains possible that genes encoding surface-exposed antigens, which come in contact with the host immune system, could display heterogeneity beyond what could be detected even by Southern hybridization analyses.
|
View this table: [in a new window] |
TABLE 3. Comparison of Southern blot hybridization and PCR data for gene distribution
|
|
View this table: [in a new window] |
TABLE 4. Frequencies of P. aeruginosa strains showing unique sequences
|
Pairwise comparisons of the 11 clinical strains using the PCR results were performed for all 70 novel sequences under study (Table 5). The greatest differences in genic content were observed between the strain pairs Pitt E and M18858 and Pitt E and W28869, both of which differed at 42 loci. The smallest degree of difference was observed between strains W27912 and W27931, which differed at only six loci. The Pitt E strain, which has the greatest number of the novel sequences, displays the greatest number of differences (315) when compared against all other strains. Conversely, the Pitt A strain has the fewest genic differences (191) when compared against all other strains (Table 6). A similar comparison using the RT-PCR results demonstrated that Pitt E and W28869 have the largest number (39) of gene expression differences. Interestingly, strains M18851 and M18858 had identical expression patterns for the 70 novel sequences. Overall, Pitt E and W28869 are the two strains most different from other strains, which is consistent with the fact that these two strains have the highest number (eight each) of unique DNA sequences.
|
View this table: [in a new window] |
TABLE 5. Pairwise tabulations of unique gene content and expression among 11 P. aeruginosa clinical isolates
|
|
View this table: [in a new window] |
TABLE 6. Summary of pairwise comparisons for each strain based on unique gene content and transcripts among 11 P. aeruginosa clinical isolates
|
11% of the clones as being novel with respect to the PAO1 genome, i.e., a BLASTn search revealed no identifiable homologs. A BLASTx search of 117 of these novel sequences employing hypothetical translations of the ORFs within these clones (see Supplementary Table 2 at the CGS website) revealed that 4 had their greatest similarity to PAO1 genes, 17 were most similar to known or putative phage genes, 29 showed their greatest similarity to genes from other P. aeruginosa strains, and another 28 had their greatest similarity to various other species within the pseudomonas family, including Ralstonia, Burkholderia, Azotobacter, and Rhodopseudomonas species among others. This last finding suggests that there are not significant barriers to HGT among the pseudomonads generally. Even more surprising is the finding that 39 of the 117 novel genes had their greatest similarity to genes from a very wide range of organisms, including archea and eukaryotes. This of course does not mean that these genes had their origins in these species identified, only that they likely did not have their origin within the pseudomonads.
Spencer et al. surveyed three sequenced P. aeruginosa genomes (0.5-fold coverage) and found that 11.6% of their high-quality sequence was unique with respect to the PAO1 genome (53). Taken together with our findings, it would appear that globally P. aeruginosa strains vary by
11% on average. The genic distribution of the 70 selected sequences examined in the current study demonstrate the nonuniform distribution of genes within the population-based supragenome. These findings support the virulence corollary to the DGH that says chronic pathogens use gene distribution, recombination, and natural selection during infection as supravirulence factors that contribute to persistence in the face of host defense mechanisms. As further support of this hypothesis, we identified a number of novel sequences that, based upon protein similarities, would appear to encode highly variant forms of known virulence factors in P. aeruginosa and other pathogenic bacterial species. Nineteen of the 70 novel sequences were identified exclusively in only 1 of the 11 strains; therefore, these genes may be useful as strain-specific markers in epidemiological investigations and for studying gene flow in polyclonal animal model studies.
We previously presented evidence that extensive genomic plasticity exists among both gram-negative and gram-positive obligate respiratory pathogens (44, 45), and in the current study we have extended these findings to include opportunistic pathogens. Comparing the genomic plasticity characteristics among clinical strains of P. aeruginosa, S. pneumoniae, and H. influenzae, we found 48.5% of the novel P. aeruginosa sequences present in only one or two strains; this was in contrast to 37.9% for S. pneumoniae and only 3.8% for H. influenzae. Thus, P. aeruginosa, despite the fact that it is not naturally competent, displays the greatest genomic plasticity and the highest number of unfixed genes. This suggests that other HGT mechanisms must also play important roles in diversity generation. P. aeruginosa is a robust biofilm former and is associated with numerous medical biofilms, including those found with otorrhea, chronic obstructive pulmonary disease, and CF. P. aeruginosa biofilms contain high concentrations of DNA within their extracellular matrices (59), and this observation combined with the fact that the levels of HGT in biofilms have been demonstrated to be several orders of magnitude higher than in planktonic cells of the same strain make it likely that much of the strain evolution that occurs in vivo takes place during the biofilm phase of the bacterial life cycle. It is interesting to speculate that P. aeruginosa, which can form pili or pseudopili (13, 21), may be able to directly take up DNA from within the biofilm environment. Regardless of the mechanism of diversity generation, it is clear that P. aeruginosa is genomically highly plastic. The data presented here will become a base for future in vivo polyclonal animal studies for testing the DGH.
Present address: University of Pittsburgh, Pittsburgh, PA 15213. ![]()
Present address: Carnegie-Mellon University, Pittsburgh, PA 15213. ![]()
Present address: Pennsylvania State University, State College, PA 16802. ![]()
|
|
|---|
This article has been cited by other articles:
| |||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Copyright © 2009 by the American Society for Microbiology. For an alternate route to Journals.ASM.org, visit: http://intl-journals.asm.org | More Info»