Infection and Immunity, May 2007, p. 2645-2647, Vol. 75, No. 5
0019-9567/07/$08.00+0 doi:10.1128/IAI.01317-06
Copyright © 2007, American Society for Microbiology. All Rights Reserved.

University of Maryland, Baltimore, Maryland 21201,1 Veterans Affairs Maryland Health Care System, Baltimore, Maryland 21201,2 National Microbial Pathogen Data Resource, Chicago, Illinois3
Received 16 August 2006/ Returned for modification 21 October 2006/ Accepted 14 December 2006
|
|
|---|
|
|
|---|
The V. cholerae NRT36S genome was sequenced by the company 454 Life Sciences. Genomic DNA was extracted with a QIAamp DNA Mini kit (QIAGEN, Valencia, CA). 454 Life Sciences employed a different sequencing strategy from conventional shotgun sequencing, as described in detail elsewhere (15). Briefly, genomic DNA was randomly sheared to small fragments and ligated to common adaptors. Single fragments were attached to beads in an emulsion. Amplification by PCR was done in the emulsion and produced
107 copies of the fragments per bead. After removal of the emulsion, the beads were deposited on a fiber optic slide. The DNAs were sequenced using a pyrosequencing protocol. Sequencing was performed on a Genome Sequencer 20 system. The four sequencing runs generated 1,082,967 reads and output 104,531,256 bp of sequence. The estimated coverage depth was 26x. The draft genome consisted of 174 large contigs plus the superintegron region, with a total length of 4,079,433 bp. The average GC content for the draft genome was 47.5%. The genome was annotated by the National Microbial Pathogen Data Resource and is available online (http://anno-2.nmpdr.org/umd/FIG/index.cgi).
We compared the genome of V. cholerae NRT36S to the published genome of V. cholerae N16961, a serogroup O1 isolate, using the program MUMMER 3.0 (using the default settings of the program) (Fig. 1) (12). The genome sizes were comparable, at approximately 4.1 megabases. About 3.5 megabases (89%) of sequence was common to both genomes. Substantial differences between the genomes of V. cholerae NRT36S and N16961 were noted, especially in the genes for pathogenesis, and surface polysaccharides and in the superintegrons. The sequences identified in only one isolate by MUMMER (cutoff, 70% nucleotide identity) were considered strain specific.
![]() View larger version (19K): [in a new window] |
FIG. 1. Comparison of V. cholerae N16961 and NRT36S genomes. The two closed rings represent two chromosomes of N16961. Chromosome I is on the left, and chromosome II is on the right. The white areas in the rings are sequences conserved between the two genomes. Gray areas are sequences specific to N16961, with labels inside the rings. Black areas outside the rings are sequences specific to NRT36S, with labels outside the rings. VSP-I and VSP-II, Vibrio seventh pandemic islands I and II; LPS, lipopolysaccharide; VPI-1 and VPI-2, Vibrio pathogenic islands 1 and 2; CTX, cholera toxin genes; PTS, phosphoenolpyruvate phosphotransferase system; CPS, capsule polysaccharide; TTSS, type III secretion system; TRH, thermostable direct hemolysin; SI, superintegron.
|
prophage (VC1452 to VC1478), which encodes CT in V. cholerae O1 (20), was absent from NRT36S. The genome of CTX
also carries genes responsible for phage morphogenesis and its insertion into the host genome. None of these genes was present in NRT36S. The toxin-coregulated pilus (TCP), the major colonization factor for V. cholerae O1 (7, 19), was also missing from NRT36S. TCP is encoded in a 39.5-kb region of the V. cholerae genome designated Vibrio pathogenic island 1 (VC0819 to VC0845) (10). Together with the TCP gene, the genes carried in Vibrio pathogenic island 1 were absent from NRT36S. Another genomic island, described as Vibrio pathogenic island 2 for V. cholerae O1 (VC1758 to -1803), was altered in NRT36S (Fig. 2). VC1759 to -1772 encode a restriction modification system, and VC1773 to -1787 encode enzymes, including a sialidase (neuraminidase; EC 3.2.1.18), involved in the transport and utilization of sialic acid. VC1788 to -1803 encode phage proteins (8). Neuraminidase was found to be an important virulence factor that can increase the amount of the human receptor for CT (5a). Most of Vibrio pathogenic island 2 (VC1759 to -1772 and VC1788 to -1803) was absent from NRT36S, but an internal section (VC1773 to -1787), the sialic acid-related region, was present. This interesting mosaic structure suggests that this region of the genome is a hot spot for lateral gene transfer. |
View larger version (13K): [in a new window] |
FIG. 2. Mosaic pattern of Vibrio pathogenic island 2. The three regions in the Vibrio pathogenic island were described previously (8). White boxes are conserved regions between N16961 and NRT36S. Shaded boxes indicate genes specific to N16961. The hatched box indicates genes specific to NRT36S.
|
The genes related to pathogenesis in NRT36S were different from those in N16961. In a previous study (16), Morris et al. suggested that the heat-stable enterotoxin NAG-ST and the ability to colonize were the major factors contributing to the occurrence of disease in human volunteers ingesting NRT36S. NAG-ST causes fluid accumulation in a suckling mouse model (1, 16, 17). NAG-ST was not present in N16961. Interestingly, the NAG-ST gene was located in the superintegron region of the NRT36S genome (described below).
The colonization factor(s) in NRT36S was not identified in previous studies. We found four pilus systems in NRT36S. First, a type 1 pilus assembly system was identified to be encoded by an 8-kb sequence specific to the genome of NRT36S. Second, a pilus system in NRT36S related to a mating pilus system in Salmonella enterica serovar Typhi (2) was identified. Third, a type IV pilus system, which was a mannose-sensitive hemagglutinin described by Jonson et al. (9), was present in both NRT36S and N16961. The protein sequences of the mannose-sensitive hemagglutinin systems shared 60 to 99% identity between the two isolates. Fourth, another type IV pilus system, which was described previously (5), was also conserved in NRT36S and N16961. None of the pilus systems identified in NRT36S was similar to TCP, which is a type IV pilus.
We also identified a type III secretion system in a 48-kb gene cluster specific to NRT36S. It is highly similar to the one described for another non-O1/non-O139 V. cholerae isolate, AM-19226 (4), sharing 99% sequence similarity. The location of the type III secretion system in NRT36S was next to the homolog of VC1758, while in N16961, the genes next to VC1758 were designated as Vibrio pathogenic island 2. In NRT36S, we also found two additional potential toxin gene clusters that were not found in N16961. The first was an exotoxin A precursor gene. This gene had homologs in the non-O1/non-O139 V. cholerae isolates AM-19226 (4) and V51 (http://www.ncbi.nlm.nih.gov/). The predicted protein product is related to an NAD-dependent ADP-ribosyltransferase of Pseudomonas aeruginosa. Second, there were two RTX toxin genes present in NRT36S. The first one was similar to rtxA of N16961, sharing 99% amino acid identity (AAI). The second RTX toxin gene was similar to the RTX toxin gene in Aeromonas salmonicida (E value = 0; score = 640) and was very divergent from the rtxA gene found in N16961.
We identified two additional phage-like gene clusters in NRT36S. One prophage-like gene cluster was 33 kb long and was located adjacent to genes which are on chromosome II in N16961. The other cluster was 6 kb long and showed similarity to the filamentous bacteriophages KSF-1
and VGJ
. Whether these two prophages play a role in virulence remains unknown. Altogether, the putative virulence genes and phage sequences made up 33% of the strain-specific sequences in NRT36S.
Besides the genes related to toxin and colonization, V. cholerae N16961 and NRT36S differed in the genes associated with the synthesis of surface polysaccharide, which might also contribute to their virulence and survival. These genes made up 9% and 13% of the strain-specific sequences in N16961 and NRT36S, respectively. The genes for O and K antigens in the two isolates were entirely different. N16961 is a serogroup O1 isolate and has no capsule; its O antigen biogenesis region has been identified (14), occurring between gmhD and rjg in chromosome II. NRT36S is a serogroup O31 isolate and has a capsule in addition to the O31 antigen. The O antigen and capsule shared the same genetic locus in NRT36S, also located between gmhD and rjg (3a). Despite the similar locations, the gene contents of this locus in N16961 and NRT36S were very different.
A superintegron exists in all Vibrio genomes examined so far (3, 6, 13), and NRT36S was no exception. The superintegron is a highly variable region. Our sequencing strategy generated only short reads (an average length of 96.5 bp) that could not read through the 128-bp Vibrio cholerae repeat and caused problems when assembling the superintegron of NRT36S. Therefore, our knowledge of the superintegron region is not complete. Among the genes we could confirm to be in the superintegron region, most of them encoded hypothetical proteins, as expected. Several genes were homologous to the recognized genes in the N16961 superintegron: genes for a killer protein, four putative acetyltransferases, and some hypothetical proteins were conserved in the superintegrons of both NRT36S and N16961. There were also several other genes recognized to be specific to the NRT36S superintegron. As mentioned above, the superintegron encoded NAG-ST, which may be the major toxin of NRT36S. We also identified a quinone oxidoreductase gene and a hydrolase gene in the NRT36S superintegron.
In addition to the genes for toxins, pili, and surface polysaccharide and those in the superintegron, the differences between N16961 and NRT36S extended to other genes in other functional categories, such as chemotaxis, transportation and metabolism, and transcriptional regulation. These genes made up 32% and 24% of the strain-specific sequences in N16961 and NRT36S, respectively.
To gain insight into the variations between homologous genes in V. cholerae, we also compared the proteomes of the two isolates by BLASTP (settings for BLASTP, no filter and E values of <1e10). All best-hit pairs were identified for the two genomes. "Best-hit pair" was defined as a pair in which one gene in one genome found the other in the other genome as its best match and vice versa. Genes with 85% or more AAI that covered at least 70% of the full length were considered conserved. Eighty-four percent of the genes were conserved between the two genomes, <1% of the genes had low AAI (between 21% and 84%), another 1% of the genes represented paralogs in the genome, and the other 14% of genes from NRT36S did not match those in N16961. Genes that had <85% AAI were considered strain specific.
Our genome analysis revealed extensive variation among pathogenic strains of V. cholerae. Sixteen percent of genes were not conserved between the two genomes, suggesting the occurrence of lateral gene transfer (with the presence of prophages raising the possibility of phage-mediated gene transfer). NRT36S and the O1 strain N169061 clearly have entirely different sets of virulence-associated genes. The observed variations in their surface polysaccharides and membrane transportation systems may reflect adaptation to different niches during their life cycles. The function of the superintegron remains cryptic. The NAG-ST gene is the first functional virulence gene found in the superintegron, which strongly suggests that the superintegron is a mechanism by which members of this species can import exogenous genes and convert them for their own use.
Published ahead of print on 5 February 2007. ![]()
|
|
|---|
This article has been cited by other articles:
| |||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Copyright © 2009 by the American Society for Microbiology. For an alternate route to Journals.ASM.org, visit: http://intl-journals.asm.org | More Info»