Previous Article | Next Article 
Infection and Immunity, October 1998, p. 4611-4623, Vol. 66, No. 10
0019-9567/98/$04.00+0
Copyright © 1998, American Society for Microbiology. All rights reserved.
DNA Sequencing and Analysis of the
Low-Ca2+-Response Plasmid pCD1 of Yersinia
pestis KIM5
Robert D.
Perry,1,*
Susan C.
Straley,1
Jacqueline D.
Fetherston,1
Debra J.
Rose,2
Jason
Gregor,2 and
Frederick
R.
Blattner2
Department of Microbiology and Immunology,
University of Kentucky, Lexington, Kentucky
40536-0084,1 and
Department of
Genetics, University of Wisconsin, Madison, Wisconsin
537062
Received 14 April 1998/Returned for modification 19 June
1998/Accepted 10 July 1998
 |
ABSTRACT |
The low-Ca2+-response (LCR) plasmid pCD1 of the plague
agent Yersinia pestis KIM5 was sequenced and analyzed for
its genetic structure. pCD1 (70,509 bp) has an IncFIIA-like replicon
and a SopABC-like partition region. We have assigned 60 apparently
intact open reading frames (ORFs) that are not contained within
transposable elements. Of these, 47 are proven or possible members of
the LCR, a major virulence property of human-pathogenic
Yersinia spp., that had been identified previously in one
or more of Y. pestis or the enteropathogenic yersiniae
Yersinia enterocolitica and Yersinia
pseudotuberculosis. Of these 47 LCR-related ORFs, 35 constitute a
continuous LCR cluster. The other LCR-related ORFs are interspersed
among three intact insertion sequence (IS) elements (IS100
and two new IS elements, IS1616 and IS1617) and
numerous defective or partial transposable elements. Regional
variations in percent GC content and among ORFs encoding effector
proteins of the LCR are additional evidence of a complex history for
this plasmid. Our analysis suggested the possible addition of a new Syc- and Yop-encoding operon to the LCR-related pCD1 genes and gave no
support for the existence of YopL. YadA likely is not expressed, as was
the case for Y. pestis EV76, and the gene for the
lipoprotein YlpA found in Y. enterocolitica likely is a
pseudogene in Y. pestis. The yopM gene is
longer than previously thought (by a sequence encoding two leucine-rich
repeats), the ORF upstream of ypkA-yopJ is discussed as a
potential Syc gene, and a previously undescribed ORF downstream of
yopE was identified as being potentially significant. Eight
other ORFs not associated with IS elements were identified and deserve
future investigation into their functions.
 |
INTRODUCTION |
The human-pathogenic yersiniae
Yersinia pestis, which causes plague, and Yersinia
pseudotuberculosis and Yersinia enterocolitica, which
primarily cause gastrointestinal disease, have a ca. 70-kb plasmid that
encodes a complex virulence property called the low-Ca2+
response (LCR) (35, 45, 87, 88, 120, 121). The LCR was
discovered in Y. pestis growing in vitro, where the bacteria respond to the absence of Ca2+ at 37°C by the strong
expression and secretion of a virulence protein called V antigen (now
also called LcrV). In certain media, this is accompanied by a growth
response termed restriction in which the yersiniae undergo an orderly
metabolic shutdown and cease growth (23, 25, 42, 51, 119).
It is now known that under these in vitro LCR-inductive conditions, the
yersiniae maximally induce the transcription, translation, and
secretion of a set of virulence proteins called Yops
(Yersinia outer proteins) in addition to LcrV. The operons
encoding these proteins and other similarly regulated operons on the
LCR plasmid have been referred to as the LCR stimulon (LCRS) (83,
107). Millimolar concentrations of Ca2+ permit a full
growth yield at 37°C, weaker expression of LcrV and Yops, and
essentially no secretion of these proteins (83, 107). There
is only very weak, basal expression of Yops and LcrV at environmental
temperatures and no secretion: the LCR is designed to function within a
mammal. In addition to the presence or absence of Ca2+,
other environmental inputs, such as Mg2+, Cl
,
Na+, glutamate, nucleotides, and anaerobiosity, modulate
the LCR (23, 42, 60, 118, 119). The molecular basis of these
effects has not been determined, but these elements of environmental
modulation could be important in adjusting virulence protein expression
and secretion in response to the wide range of niches that yersiniae are expected to encounter during an infection (106).
It is believed that the absence of Ca2+ mimics an
unidentified signal that yersiniae receive when they are in contact
with a mammalian cell (33, 83). The LCR plasmid encodes a
type III secretion system called Ysc, for Yop secretion
(68), that is dedicated to the secretion of Yops, LcrV, and
some regulatory proteins in the LCR; cell contact causes this system to
be locally activated at the interface between the bacterium and the
eukaryotic cell. Environmentally regulated inner and outer gates of the
Ysc (LcrG and LcrE [also called YopN], respectively) then open,
permitting the secretion of negative regulatory proteins (a key one
being LcrQ, also called YscM). This allows full transcriptional
activation of LCRS operons by an AraC-like activator protein, LcrF.
Yops are secreted locally, without processing. The secretion mechanism recognizes two signals: one in the first 45 nucleotides of the yop mRNA and one related to a domain that has been found for
some Yops to bind a specific Yop chaperone (Syc), also
encoded by the LCR plasmid (29, 112). Some Yops (YopB, YopD,
and YopK) serve as components of a mechanism for targeting effector
Yops directly into the eukaryotic cell. The effector Yops (YopE, YopH,
YpkA [Yersinia protein kinase], YopM, and probably YopJ)
then act on their intracellular target molecules and derange cellular
signaling and cytoskeletal functions. LcrV has a bifunctional role in
the LCR: it is a regulatory protein, acting at the levels of Yop
secretion and targeting, and it has a role as a potent antihost protein (77, 79, 83). LcrV is the only LCRS protein that is secreted in large amounts into the surrounding medium by yersiniae in contact with eukaryotic cells (79). It is the only LCRS protein that has been shown to have an effect when given by itself to mice (77); all others require delivery by the Ysc machinery from yersiniae in intimate contact with mammalian cells.
The overall effect of the LCR is a profound immunosuppression,
resulting from the paralysis of innate defenses at the site of
infection and the failure to mobilize an effective cell-mediated immune
response. Y. pestis, and also the enteropathogenic yersiniae in immunocompromised individuals, grows unchecked in the lymphoid system in a fulminant disease that has a high mortality if not treated
with appropriate antibiotics (26, 83). In contrast, without
the LCR plasmid, these bacteria are completely avirulent (33,
83).
It is now apparent that several other important pathogens have
virulence systems with many striking similarities to the LCR; however,
the LCR is the best characterized of these and remains a prototype for
investigations at the forefront of molecular pathogenesis. The fact
that the LCR was plasmid borne greatly facilitated its characterization, which began in the early years of the existence of
molecular genetic techniques. The early studies delineated a large
cluster of genes necessary for the Ca2+ dependence of
Y. pestis growth (47, 87, 116, 117). Subsequent studies with the enteropathogenic yersiniae as well as Y. pestis resulted in the following picture for the layout of genes
involved in the LCR (83). The Ca2+ dependence
region turned out to encode LCR regulatory proteins and the enormously
complex Ysc type III secretion mechanism, which is comprised of at
least 22 gene products. This LCR cluster now includes the immediately
adjacent Yop-targeting and secretion control operon
lcrGVH-yopBD and is 25.7 kb in size. Only part of this had
been sequenced for Y. pestis (lcrDR,
lcrGVH-yopB', yscN'OPQRS,
ysc[A]B-F, and lcrF
[7, 37, 49, 53, 76, 82, 84, 89]), but the similarities
of LCR-related genes among the three species of human-pathogenic
yersiniae have been so high that information from one species has been
assumed to apply to the others (83). There are Yops within
the LCR cluster (e.g., YopB, YopD, and LcrE [which proved to be the
same as YopN] and YscH, which apparently is a Yop
[3]), but the effector Yops were found to be scattered
outside the cluster (104). The only Yop genes in Y. pestis that had been sequenced were yopM and
yopE, with the divergently transcribed SycE-encoding gene (40, 61).
There are two other LCR-associated genes outside the LCR cluster. YlpA
is a lipoprotein studied only for Y. enterocolitica; it is a
member of the LCRS group, but a knockout of its gene did not affect
virulence in an intravenous mouse model (30). Although not a
true LCR member, YadA is an adhesin whose gene is activated for
transcription by LcrF (99). It is enormously important for virulence of Y. enterocolitica, and in both Y. enterocolitica and Y. pseudotuberculosis, it serves as
an adhesin that can promote the productive contact with eukaryotic
cells, resulting in Ysc activation and Yop targeting (see, e.g.,
reference 15). Interestingly, its gene in Y. pestis EV76 has a frameshift mutation that would effectively
abolish expression of the protein (98).
The high homology of the LCR cluster and yop genes from the
three pathogenic Yersinia species includes a similar genetic
organization. In addition, the replication and partitioning functions
of the family of LCR plasmids are also highly conserved (6, 12, 32, 68, 83, 87, 88, 108). Sequencing of the replication region of
the Y. enterocolitica LCR plasmid pYVe439-80 demonstrated that it possessed all the essential components of an IncFIIA replicon (108). Plasmid pYVe439-80 is maintained at an estimated
seven copies per cell (32). Although the replicon region of
this plasmid has 68% homology with the IncFIIA replicon of plasmid
R100, the two plasmids are compatible. This may be due to differences
in the nucleotide sequences in the stem-loop structures of the
antisense RNA encoded by copA (64, 108). The
observed incompatibility of LCR plasmids with derivatives of the F
plasmid appears to be due to a shared partitioning system (6, 12,
40).
Despite all the similarities within the LCR plasmid family, there are
also significant differences. Plasmids from Y. pseudotuberculosis and Y. pestis are more closely
related than those from Y. enterocolitica. Y. enterocolitica
plasmid regions outside the LCRS, replication, and partitioning
function regions do not hybridize to analogous regions from Y. pseudotuberculosis or Y. pestis LCR plasmids
(88). In addition, the location of the replicon in relation
to the LCR cluster and the organization of the yopE-yadA
region on Y. enterocolitica plasmids differ from those of
LCR plasmids from the other two yersiniae (12, 32, 68, 83).
DNA sequencing has revealed two distinct types of lcrV that
differ in a hypervariable region between amino acids (aa) 225 and 232 in the translated protein. Serotype O:8 Y. enterocolitica
strains synthesize one type, while Y. pestis, Y. pseudotuberculosis, and Y. enterocolitica serotypes O:3, O:9, and O:5,27 produce the other (94). Motin et al.
(76) identified a species-specific difference in the
lcrV genes from Y. pestis and Y. enterocolitica. These differences likely account for the ability
of some anti-LcrV antibodies to protect against infections with
Y. pestis and Y. pseudotuberculosis but not
against Y. enterocolitica infections (75, 94).
Finally, the more stringent growth restriction observed under
LCRS-inducing conditions for Y. pestis compared to that for
the enteropathogenic yersiniae appears to be due to undetermined
differences in the LCR plasmid itself (104). Consequently,
it is likely that additional significant differences may reside within
the LCR family of plasmids.
To identify new potential virulence factors, discover any further
differences between Y. pestis LCRS genes and those
previously sequenced for the enteropathogenic yersiniae, and identify
other undefined traits of this family of virulence plasmids, we have sequenced one entire LCR plasmid. We chose to sequence the LCR plasmid
pCD1 of Y. pestis KIM5 (for Kurdistan Iran man) because it
has several advantages over other strains. Derivatives of Y. pestis KIM have been directly demonstrated to retain
high-virulence characteristics in both mammals and fleas (8, 22,
52, 83). These derivatives are widely used in research and thus
are more genetically characterized than other strains. The vast
majority of investigations on the regulation, physiological
characteristics, and virulence properties of the Y. pestis
LCRS have been performed with derivatives of strain KIM. Finally,
nearly all previous DNA sequence information on the Y. pestis LCRS components (~16 kb) has been derived from pCD1 of
strain KIM5 (7, 22, 37, 49, 61, 82-84, 89).
Our analysis of the DNA sequence of pCD1 has revealed a set of LCRS
genes very similar to those sequenced for the enteropathogenic yersiniae, a potential new Yop and Yop chaperone, two new insertion sequences (ISs), the IncFIIA replication region, and SopABC
partitioning functions. We have also identified IS element remnants
scattered throughout the plasmid that suggest that pCD1 has undergone
numerous insertional events as well as genetic recombinations and
rearrangements during its history.
 |
MATERIALS AND METHODS |
Bacterial strains, media, and plasmid isolation.
Y.
pestis KIM5 is conditionally avirulent due to deletion of the
102-kb pgm locus; it possesses all three prototypical
Y. pestis plasmids, i.e., the 9.5-kb pPCP1, ~70-kb pCD1,
and ~100-kb pMT1 (83). Plasmid pCD1 was isolated from
Y. pestis KIM5 by alkaline lysis followed by precipitation
with polyethylene glycol (13, 55). Since pCD1 has no
selectable marker, a mixture of pCD1 and pBR322 was transformed into
Escherichia coli HB101 (47). Transformants were
selected for the ampicillin resistance encoded by pBR322
(5). Five hundred transformants were transferred to
nitrocellulose membranes and hybridized against pCD1 radioactively labeled by nick translation. One transformant containing both pCD1 and
pBR322 was identified. This isolate was cured of pBR322 by fusaric acid
selection (17). pCD1 appears to be stably maintained in
E. coli HB101, and this transformant has been stored in
buffered glycerol at
70°C. Plasmid DNA from E. coli
HB101(pCD1) cells grown in Luria broth was isolated by alkaline lysis
(13) followed by further purification with polyethylene
glycol (55). This purified pCD1 DNA was used in subsequent
sequencing.
DNA sequencing.
Libraries were prepared from nebulized,
size-fractionated plasmid DNA (63) in the M13 Janus vector
(24). DNA templates were purified from random library clones
(81), and sequences were collected by using
dye-terminator-labeled fluorescent cycle sequencing Prism reagents and
ABI377 automated sequencers (Applied Biosystem Division of
Perkin-Elmer). Sequences were assembled into contigs by the SeqMan II
program (DNASTAR), and clones were selected for sequencing from the
opposite end to fill in coverage, resolve ambiguities, and close gaps
(24). Final coverage was about eightfold.
In several instances, the sequence contradicted previously published
sequences for the yersiniae or yielded unexpected results. To ensure
that this did not result from mutations to pCD1 during carriage in
E. coli, we sequenced these regions by using pCD1 isolated
from the conditionally virulent Y. pestis KIM5
(83) or pJIT7, a recombinant plasmid containing the
IS1616 region adjacent to sopAB (Fig.
1).

View larger version (35K):
[in this window]
[in a new window]
|
FIG. 1.
Genetic organization of the LCR plasmid, pCD1, of
Y. pestis KIM5. The map shows significant ORFs and
transposable elements as well as replication and partitioning regions.
For genes on the outside of the map, transcription proceeds clockwise.
Genes on the inside are transcribed from the complementary strand. The
oriR and sopC regions are not transcribed. The
placement of transposable elements (solid boxes) does not indicate the
direction of transcription. A d indicates a defective element, while a
p indicates a partial element. Most IS remnants are not denoted on this
figure.
|
|
Annotation.
We identified open reading frames (ORFs)
encoding at least 50 aa, using Geneplot or GeneQuest (DNASTAR) programs
to display start codons (including GUG), stop codons, and codon usage
statistics plots for each reading frame. Codon usage analysis helped to
predict ORFs. It was assessed in the program by second- and third-order statistical comparisons (20) with a matrix built from all
available sequences for Yersinia species. Although this
matrix was more useful than one derived from E. coli genes,
it was necessarily constructed from a relatively small data set and is
no doubt imperfect. Generally, in the absence of experimental data, the
start codon farthest upstream was used to annotate the ORF start
(14). ORFs with products smaller than 50 aa were included if
codon usage statistics showed a high score. ORF amino acid sequences
were searched against SWISS-PROT 34 by using the BLOSUM26 matrix, with the DeCypher II System (TimeLogic Inc.). In the first pass, the best
hit was automatically saved as an annotation for each ORF, and then
known genes and putative functions were assigned for individual ORFs by
inspection of the search output.
Subsequent searches of DNA and protein databases for mobile genetic
elements, DNA features, and amino acid sequence similarities were
performed by using BLAST (4). Analysis and manipulation of
DNA sequence data were performed by using programs in the Genetics Computer Group software package version 9.0.
Nucleotide sequence accession number.
The entire sequence of
pCD1 from Y. pestis KIM5 has been deposited in the GenBank
database and assigned accession number AF074612.
 |
RESULTS AND DISCUSSION |
General description.
From our DNA sequencing, we have
constructed a genetic map of the Y. pestis KIM5 pCD1 plasmid
(Fig. 1), which is 70,509 nucleotides in length. Table
1 lists significant ORFs and their
primary characteristics. Of the 61 ORFs in Table 1 (excluding the
ylpA and 'yadA pseudogenes), 8 have GTG starts
(repA, yscD, yscJ, yscW,
Orf7, Orf42, Orf43, and Orf74) and 3 have TTG starts (Orf54, Orf61, and
Orf73). We have omitted most IS element remnants and partial ORFs that
appear to be nonfunctional due to IS-related events or other deletions and rearrangements.
The LCR-related genes are organized as was proposed in a
low-resolution, composite map assembled from numerous studies
(83). This consists of a large Ca2+ dependence
region (yscM to yopD, LCR cluster) that encodes
primarily secretion and regulatory functions, with genes encoding YopK, effector Yops (YpkA, YopJ, YopH, YopM, and YopE), and their chaperones (SycE and SycH) being scattered around the rest of pCD1. The locations and organization of LCR-related genes, as well as the partitioning and
replication regions, of pCD1 closely resemble those of the best-characterized LCR plasmid in Y. pseudotuberculosis,
pIB1 (12, 95). In contrast, pYVe O:9, from serotype O:9
Y. enterocolitica, has several notable differences. Compared
to those of pCD1 and pIB1, the LCR cluster of pYVe O:9 is in the
opposite orientation; this is also the case for the
sopA-to-sycE region (Fig. 1) (12, 32,
68). In addition, yopM and yopH are located
on opposite sides of the LCR cluster in pCD1 compared to pYVe O:9.
While yopH and sycH are located adjacent to each
other in Y. enterocolitica, they are separated by over 20 kb
in pCD1. Finally, ylpA and yopK are located near
the partitioning region (sopABC) of pCD1 but near
ypkA and the origin of replication in pYVe O:9 (Fig. 1)
(12, 32, 68). As noted by others, there is no simple, single
mechanism to explain the scrambled locations of these genes among the
LCR family of plasmids (12).
We identified a number of intact, defective, and partial IS elements in
pCD1. The site of an IS100 insertion, an element with numerous copies in the Y. pestis genome (36, 83),
was confirmed and refined. Two new IS elements, which we have named
IS1616 and IS1617, were discovered (Fig. 1) and
were registered through Esther Lederberg, Plasmid Reference Center,
Stanford, Calif. In addition, numerous IS element remnants were
identified; these partial ISs cluster primarily in four regions of pCD1
(discussed below).
It is curious that IS100 is near one end of the
yscM-to-yopD LCR cluster and that two partial
IS285 elements bound this same region (Fig. 1). The type III
secretion system and regulatory genes, exemplified by this LCR cluster,
are widespread among bacterial pathogens and have been suggested as a
possible pathogenicity island (PAI) (67). PAI hallmarks
include carriage of virulence genes, a distinct GC content compared to
that of the host bacterium, a discrete genetic unit often flanked by
direct repeats, association with tRNA genes and/or insertion sequences,
the presence of mobility genes (transposase genes, etc.), instability,
and absence in less pathogenic strains (48). An additional
requirement of a chromosomal location (48) may be somewhat
artificial (67) given the large sizes of many virulence
plasmids. Although the LCR cluster does have IS elements associated
with it, we failed to detect any tRNA genes anywhere on pCD1. In
addition, the LCR cluster does not contain genes for effector Yops
(except lcrV). Finally, the GC content of this region
(44.8%) matches that of the entire plasmid (Table
2) and is similar to the 46 to 47% GC
content of the genome of Y. pestis (9, 83).
However, there are intriguing differences in the GC contents of other
regions and genes within pCD1 (Table 2). The intact and large partial
IS elements all have GC contents significantly higher than that of the
Y. pestis genome. In addition, the scattered effector Yop
genes have a wide range of GC contents (Table 2) (discussed below). The
numerous IS remnants, varied GC content, and scattered yop
genes suggest that pCD1 has undergone multiple DNA incursions,
rearrangements, and deletions. These alterations may have eliminated or
disrupted some of the classic features of a PAI surrounding the LCR
cluster.
Replication and partition functions.
Sequencing confirmed that
pCD1 has an IncFIIA replication system (Fig.
2). For the IncFIIA resistance plasmid R1
and its close relatives (R100 and R6), a quite detailed analysis of
this replication system has been performed. RepA (formerly RepA1) is
required for replication at the origin of replication
(oriR). Transcription from PrepA is
repressed by CopB (RepB in Fig. 2 is its homologue); constitutive
transcription from the PcopB promoter
(PrepB in Fig. 2) results in a long mRNA
encoding RepB, Tap, and RepA that is the source of most of the RepA
protein in E. coli (110). Posttranscriptional expression of RepA is controlled by copA, which lies within
the leader region of RepA mRNA and encodes an antisense RNA transcribed in the opposite direction compared to RepA mRNA transcription. CopA and
its RepA mRNA target site, termed CopT, possess complex secondary
stem-loop structures that form a loop-loop RNA-RNA complex. This
interaction completely blocks translation of the adjacent gene,
tap, which encodes a 24-aa protein. Translation of
tap is required for translation of repA (16,
64).

View larger version (29K):
[in this window]
[in a new window]
|
FIG. 2.
Genetic organization of the replication and partitioning
regions of pCD1. (A) The replication region contains four genes
(repB, copA, tap, and repA)
and an origin of replication (oriR). Black boxes indicate
the promoter regions for repB and repA; the
promoter region for copA is not shown. (B) The
par region possesses two putative trans-acting
genes (sopA and sopB) and a putative DNA binding
site (sopC). Arrowheads within the promoter region show the
locations and orientations of short, imperfect repeats. The six 45-bp
direct repeats that compose sopC are shown as open boxes,
with the locations of five 16-bp inverted repeat structures designated
by arrowheads. RBS, ribosome binding site. For both panels, arrows
indicate the directions of transcription of individual genes, while
locations within pCD1 are indicated by numbers that follow the
numbering system in Fig. 1.
|
|
The pCD1 replication region showed highest homology to the LCR plasmid
pYVe439-80 from Y. enterocolitica (108) and the
virulence plasmid from Salmonella enteritidis
(93); consequently, we followed the nomenclature used for
these two plasmid replicons (Fig. 2). Y. pestis RepB
(homologue of CopB) had 100% identity to RepB of Y. enterocolitica and 69.1% identity and 80.3% similarity to
Salmonella RepB. Y. pestis RepA was 99.3%
identical (100% similar) and 86.5% identical (92.4% similar),
respectively, to the Y. enterocolitica and S. enteritidis gene products. For Tap, the homologies were 95.8%
identity (100% similarity) with Y. enterocolitica Tap and 68% identity (72% similarity) with S. enteritidis Tap
(93, 108). Similar to other IncFIIA replication systems, a
DnaA binding site and OriR region lie downstream of repA in
pCD1 (Fig. 2).
The copy number and incompatibility of IncFIIA plasmids are determined
by the loop-loop interactions between CopA antisense RNA and the CopT
mRNA region (16, 64). However, neither the copy number nor
the incompatibility characteristics of pCD1 can be empirically
determined from the sequence. Sequence changes in the stem and/or loop
affect copy number by altering complex formation rates and alter
incompatibility specificity (16, 64, 108, 110). However,
since the yersinial CopA antisense RNAs are 100% identical (reference
108 and this study), the copy number of seven copies
per chromosome determined for the Y. enterocolitica LCR
plasmid is probably valid for pCD1. Different CopA stem sequences have
been proposed as the reason for compatibility of various IncFIIA
plasmids, including pYVe439-80 and R100 (108). Thus, it is
likely that pCD1 and pYVE439-80 will also have the same incompatibility
characteristics.
As expected from DNA hybridization experiments and plasmid
incompatibility testing (6, 12, 40), the partitioning region of pCD1 is a homologue of the sopABC system of the F plasmid
(Fig. 2). The pCD1 SopA protein shows 68.8% identity and 83%
similarity to F plasmid SopA, while the SopB homologues are 48.9%
identical and 64.7% similar. We propose the methionines encoded at bp
52730 and 53896 in the sopA and sopB ORFs of
pCD1, respectively, as the initiating methionines. The amino acid
sequences after these methionines have strong similarity to the
experimentally determined N-terminal amino acid sequences of F plasmid
SopA and SopB (73, 74). The sopC regions of both
plasmids retain some structural similarities but are less homologous
than the other components of this system. The LCR plasmid
sopC region has six 45-bp tandem nearly perfect direct
repeats, with five of the direct repeats possessing a 16-bp inverted
repeat structure (TGGGACCGTGGTCCCA) (Fig. 2). The F plasmid
sopC possesses 12 43-bp tandem direct repeats. While the
direct repeats are not highly homologous, the inverted repeat
structures are identical except for the two central, unpaired
nucleotides (reference 73 and this study). Finally, the pCD1 sopABC promoter region has four possible 5- to 6-bp
imperfect repeats that show some similarity in sequence and location to imperfect repeats in the F plasmid sopABC promoter region
(Fig. 2) (50). These structural and amino acid sequence
similarities suggest that the pCD1 partitioning system functions in a
manner analogous to that of the F plasmid sopABC system.
In the F plasmid system, SopA binds to the four repeated sequences in
the promoter and acts to repress transcription of the operon. SopB
appears to enhance, in an unknown manner, binding of SopA to the
promoter region. SopB dimers directly bind to the direct repeats of
sopC; however, a single sopC direct repeat is sufficient for proper partitioning. In addition to its function as a
repressor, SopA may also act, by an unknown mechanism, in the
partitioning process. One model has sopC serving a
centromere-like function, with DNA wrapping around a core of Sop
proteins bound to sopC (10, 11, 50, 73).
Insertion elements.
Several mobile genetic elements have been
found in the pathogenic yersiniae, and most of them are present on LCR
plasmids as well as on the chromosome (27, 38, 40, 70, 78, 80, 90). ISs known to be associated with the LCR plasmid of Y. pestis include IS100 and IS285 (38, 83,
87). Additional elements are found on the LCR plasmid of Y. enterocolitica but are not present on the Y. pestis
plasmid (40, 78). Sequence analysis of pCD1 from Y. pestis KIM5 revealed the presence of three complete insertion
elements and numerous partial IS elements. Complete and partial IS
elements with >85% identity at the DNA sequence level were considered
to be the same as previously described IS elements. For the remaining
elements, the element with the highest database match at the amino acid
sequence level was considered the closest relative. Only complete IS
elements were given new IS number designations.
An intact copy of IS100 is located downstream of
yopH in pCD1 (Fig. 1 and Table 2). There are numerous copies
of IS100 throughout the genome of Y. pestis KIM
strains (36); the IS100 element (bp 12609 to
14562) in pCD1 is 100% identical in size and nucleotide sequence to a
copy of IS100 present on the pesticin plasmid of Y. pestis EV76-6 (66). IS100, which appears to
have inserted within the relic of another insertion element, is flanked
by a 5-bp direct repeat (Fig. 3). Five-
and seven-base-pair duplications have been found flanking other
IS100 elements in Y. pestis (36, 38,
85).

View larger version (28K):
[in this window]
[in a new window]
|
FIG. 3.
Diagrammatic representation of four clusters of intact,
defective, and partial IS elements in pCD1. A p or
p indicates that only part of the IS element is present. A
d indicates an IS element that is likely defective. Arrows
show the presence of inverted repeats within IS elements. Each type of
IS element is represented by a unique design that is maintained in each
of the four panels. In panel D, the yadA'/'yadA pseudogene
is represented as yadA. Numbers correspond to nucleotide
positions in Fig. 1.
|
|
IS1616 is a new 1,254-bp insertion element located at bp
50753 to 51987, between ylpA and the sopABC
partitioning region. The inverted repeats at the ends of
IS1616 are 40 bp long and contain nine mismatches. No direct
repeats flanking this element were detected. While some elements do not
generate a direct repeat upon transposition, the absence of direct
repeats could be indicative of changes in the flanking DNA as a result
of mutations that have occurred over time. There are three ORFs within
IS1616. The first ORF (OrfA, bp 50825 to 51142) is predicted
to encode a protein of 105 aa with a pI of 12.6. A second ORF, encoding
186 aa (OrfB, bp 51064 to 51624), overlaps OrfA in the
1 frame. An
additional 101 aa (orfC, bp 51625 to 51930), which may have
originally been encoded as part of the second ORF, are encoded in the
same frame just past the stop codon at bp 51622 for OrfB. A potential
ribosome frameshift site, AAAAAG (bp 51078 to 51083), is
located within the coding region of OrfA. Translational frameshifting
occurs in a number of different bacterial ISs (28). Most
frameshift signals consist of a heptanucleotide sequence; however, in
some IS elements a tetramer can function as a frameshift site (28, 34). Whether the hexanucleotide in IS1616 is a
functional frameshift signal is unknown. IS1616 has the
closest amino acid sequence homology to IS1236 from
Acinetobacter calcoaceticus. The region of homology includes
all of the IS1616 ORFs and spans the stop codon between OrfB
and OrfC. IS1236 is a member of the IS3 family and contains two ORFs that could be expressed as a fusion protein as a
result of translational frameshifting (46). Remnants of IS1616 are present elsewhere on pCD1 (Table
3 and Fig. 3). A second, presumably
defective, copy of IS1616 (IS1616d) is present at
bp 70056 to 782 of pCD1. IS1616d contains terminal inverted repeats but is not flanked by direct repeats. An intact OrfA gene is
present in IS1616d; however, the downstream ORF has been
disrupted by the inversion of a 227-bp fragment corresponding to the
complement of nucleotides 51561 to 51787 of IS1616. The
inverted segment is flanked by 7-bp inverted repeats in
IS1616d. In IS1616, three of the bases in the
downstream "inverted repeat" have been changed. When corrected for
the inversion, the nucleotide sequence of IS1616d is 99%
identical to that of IS1616 and does not possess the stop codon found between OrfB and OrfC of IS1616. While it seems
unlikely that either copy of IS1616 is functional, we have
designated the element at bp 70056 to 782 as IS1616d in the
figures and tables to distinguish it from the element downstream of
ylpA. Insufficient sequence information precludes the
identification of an IS1616 element downstream of
ylpA in Y. enterocolitica or Y. pseudotuberculosis. However, the remnant of IS1616
located downstream of yadA'/'yadA is present in Y. pseudotuberculosis pIB1 as well as in Y. pestis pYV019
but is absent from a similar site in Y. enterocolitica (100).
IS1617 is a new 1,214-bp element, with inverted repeats of
39 and 40 bp containing 13 mismatches, located downstream of
sycH (Fig. 3). The 5 bases flanking each end of
IS1617 are identical in four of five positions. Like
IS1616, this element belongs to the IS3 family
and contains two overlapping ORFs with OrfB in the
1 frame relative
to OrfA. OrfA could encode an 88-aa protein (bp 62202 to 62468, complement), while OrfB is open for 289 aa (bp 61369 to 62238, complement). A potential translational frameshift window of
AAAAAAG is present in OrfA. IS1617 is more
closely related to IS1222 from Enterobacter
agglomerans (102) and to ISD1 found in
Desulfovibrio vulgaris (43) than to
IS1616. A remnant of IS1617 is present downstream
of yopJ in pCD1 as well as in Y. pseudotuberculosis pIB1 (Table 3) (44, 59).
In addition to these elements, there are several remnants of other ISs
present on pCD1 (Table 3). Portions of IS285 are found at
either end of the main LCR gene cluster. The segment upstream of
IS100 is 86% identical to the 3' end of IS285
and includes one copy of the inverted repeat (Fig. 3A). The
IS285 remnant located downstream of yopD is 100%
identical to the first 214 bp of IS285 and also contains a
copy of the inverted repeat (Fig. 3B). However, these two regions
together do not make an intact IS. The IS285 segment
upstream of the LCR gene cluster is present in the same position in
pIB1 from Y. pseudotuberculosis YPIII, but the downstream remnant is absent (19).
Part of an IS21 element (IS21p) is located in the
region between sycE and sycH (Fig. 3). This
segment, which is 90% identical in nucleotide sequence to
IS21 (91), contains one copy of the inverted
repeat, an intact istA gene (bp 58849 to 60021), and the 5'
end of istB (bp 60021 to 60418). The Y. pestis
IS21p element appears to have disrupted part of an
IS1222 element that is 96% identical in nucleotide sequence
to IS1222 from E. agglomerans (102).
Both IS1222 inverted repeats are present; however, all of
OrfA and the 5' end of OrfB are missing. The IS1222 remnant is not closely related to the new Y. pestis IS elements.
Although IS1617 does have amino acid sequence homology to
IS1222-encoded products, it is not homologous at the
nucleotide level.
Upstream of the yadA'/'yadA pseudogene are the remains of an
element that is 85% identical in nucleotide sequence to
Tn1000 from E. coli (21). The Y. pestis Tn1000p remnant (Fig. 1 and Table 3) contains
one copy of the inverted repeats and an intact tnpR gene (bp
63519 to 64070, complement). The
tnpA gene (bp 64234 to
66549) is missing the nucleotides encoding aa 189 to 415. This relic of
Tn1000 is only distantly related (69.4%) to a
Tn3 homologue found on the LCR plasmid of Y. enterocolitica 29979 (53a). Tn2502, which
confers arsenic resistance and also contains a defective
tnpA gene, is present downstream of yadA on the
LCR plasmid of low-virulence strains of Y. enterocolitica (78). Although Tn2502 and the pCD1
Tn1000 remnant are located in the same general region, they
are unrelated.
Several IS remnants are found in the vicinity of yopM (Fig.
3B). Upstream of yopM, there is a portion of
IS285 and a region that has amino acid sequence homology to
the IS1600 transposase from Mycobacterium
fortuitum (65). A similar IS1600-like
sequence is present downstream of yopM, as is a segment that
is related to the transposase from the Rhizobium meliloti
element ISRm3 (Fig. 3 and Table 3) (113).
There are four regions (termed ISD1-like) with homology to
ISD1 from D. vulgaris (Fig. 3; Table 3). One
segment includes sequences containing genes termed lcrS and
lcrT in Y. enterocolitica (92). Other
investigators have noted similarity between lcrS and IS ORFs
(27). Sequence analysis indicates that the lcrST region is part of an IS element. Consequently we refer to
lcrS as OrfA and to lcrT as OrfB. OrfA (bp 14987 to 15253, complement) is 67% identical and 90.9% similar in amino
acid sequence to OrfA of ISD1 (43). In Y. pestis, OrfB is longer than lcrT of Y. enterocolitica due to a frameshift mutation and an 11-bp insertion
(92). OrfB (bp 14571 to 14951, complement) has the highest
homology at the amino acid level to OrfB of ISD1
(43) but is apparently truncated by the insertion of
IS100 (Fig. 3A). The homology to ISD1 OrfB is
continued in a second ISD1-like segment located upstream of IS100 (Fig. 3A). These two segments may have been part of an
ISD1-like element that was disrupted by IS100.
The partial ISD1-like element that remains possesses one
copy of an inverted repeat which matches the ISD1 repeat in
25 of 44 residues (43). A potential frameshift site
(AAAAAAAC, bp 14992 to 14999, complement) is found within OrfA and could yield an OrfAB transframe protein. Two additional regions with amino acid sequence homology to OrfB from ISD1
are located downstream of yopK and Tn1000p,
respectively (Table 3). While all of the ISD1-like OrfB
remnants are related at the amino acid sequence level, only these last
two regions have homologous nucleotide sequences, suggesting that they
were derived from an IS element different from that for the other
ISD1-like remnants.
A final IS remnant, containing sequences related to IS1327
from Erwinia herbicola (62), is located
downstream of the pCD1 sopABC partitioning region. The
sequence showing similarity is fairly small, encompassing only 350 bp,
and appears to be the only copy of this type on pCD1 (Table 3).
There are several regions that contain clusters of IS elements or
remnants (Fig. 3). Two of these IS clusters have already been
discussed, the group including IS100 (Fig. 3A) and the
remnants around yopM (Fig. 3B). A third collection of IS
elements is found in the vicinity of sycH. One of the new IS
elements, IS1617, is located downstream of sycH,
while the IS21p remnant, contained within an
IS1222p remnant, separates sycE from
sycH (Fig. 3C). The defective yadA'/'yadA gene
also appears to be surrounded by remnants of IS elements (Fig. 3D;
Table 3). While there are several interesting correlations with GC
content (Table 2), the significance of any of these IS clusters is
unknown. They could represent preferred insertion sites for the
respective elements. Alternatively, these groupings may simply
delineate regions that are nonessential for virulence or plasmid
maintenance.
Several of the IS elements appear to have inserted into other mobile
genetic elements. Thus, IS100 disrupted an
ISD1-like element (Fig. 3A), and IS21p may have
inserted into an IS1222p homologue (Fig. 3C). It is
difficult to determine if other genes were disrupted by IS
transposition. Over time the sequences flanking the ISs or remnants may
have diverged so that no apparent ORF remains. However, there are at
least two identifiable ORFs whose functions were probably disrupted by
IS insertions. These include an ORF encoding a nuclease at positions
782 to 1033 and one encoding a helicase at nucleotides 69568 to 70055. Only portions of these genes remain, to one side of an IS remnant,
suggesting that additional genomic rearrangements have occurred since
the initial insertion of the IS element.
The nuclease remnant is 94% similar to the carboxy-terminal 82 aa of
an endonuclease encoded by the LCR plasmid of Y. enterocolitica 15673 (53a) and 73% similar to a
plasmid-encoded endonuclease from Salmonella typhimurium and
E. coli. In E. coli and Salmonella this gene is located within a region containing sequences involved in
conjugation; however, it is not required for conjugation (86, 114). At this time, the function of this endonuclease is unknown. The portion of helicase remaining in Y. pestis is 66.7%
similar to residues 1306 to 1467 of TraI from E. coli. TraI
functions during conjugation, not only to unwind but also to nick the
DNA at oriT (115).
The presence of these gene remnants in Y. pestis, both of
which are associated with conjugation, as well as the genetic linkages of the replication and partitioning functions of pCD1 raise some interesting questions about potential origins of the LCR plasmids. It
is possible that at one time the plasmid was capable of conjugation but
that the transfer functions were subsequently lost or mutated.
Virulence-related genes.
The LCR genes previously sequenced
for Y. enterocolitica and Y. pseudotuberculosis
were all present in Y. pestis (with the exception of
yscM2, which appears to be unique to Y. enterocolitica [101]). Within previously
described operons, the gene order is conserved and the ORFs generally
have the same length (three exceptions are discussed below). As
anticipated, the homology is high: usually
95% and often 98 to 100%
identity. Two Ysc components, YscG and YscE, and one effector Yop,
YopJ, had 94% identity (95% similarity) to the corresponding proteins
of Y. enterocolitica, due to differences scattered within
the predicted proteins. Detailed studies will be required to determine
if these differences translate into significant functional differences
such as arise from the heterogeneity in LcrV (see the introduction).
The largest difference between effector Yop proteins of Y. pestis and Y. enterocolitica was in YopM (93% identity
[94% similarity] to YopM of Y. enterocolitica), discussed below.
For the genes for the effector Yops and other non-Ysc LCR proteins that
are scattered in the regions flanking the central uninterrupted LCR
cluster, the base composition varies considerably, from 33.5% GC for
yopK to 51.1% for yopE (Table 2), which is suggestive of possible multiple events of Yop gene acquisition by the
LCR plasmid. For example, even though ypkA and
yopJ lie within an operon in Y. pseudotuberculosis (44), and hence likely also in
Y. pestis, they differ significantly in composition. Unlike other LCR operons within the main LCR cluster, which tend to contain tightly packed cistrons, there is a 397-bp spacing between
ypkA and yopJ, which itself has 32.2% GC. The
DNA upstream of ypkA (including the partial IS sequence) is
45.8% GC, and that downstream from yopJ (including the
partial IS sequence) is 46.7% GC. These considerations suggest the
possibility that yopJ, together with the intercistronic
region, may have been acquired independently of ypkA by the
LCR plasmid. Likewise, yopK and the sequence between it and
yopT are significantly lower in GC content than are
sycT and yopT farther up or than the sequence
downstream from yopK, including ylpA. lcrV lies
within the cluster of LCR secretion regulation-related genes, but
because it also has a virulence role as a secreted protein, it is worth
reiterating (89) that it also has a relatively low GC
content of 37.7%, as contrasted to yopB and yopD
in the same operon, which have 46.7 and 43.1% GC, respectively. These
results potentially reflect an interesting evolution for the set of
virulence properties on this plasmid.
Previously studied loci.
The YpkA- and YopJ-encoding operon in
Y. pestis KIM5, like that in Y. pseudotuberculosis (44), also has a small ORF (Orf7) upstream, but in Y. pestis, this ORF is shorter by 7 residues and is spaced from ypkA by 17, compared to 6 bp. In
these two yersiniae these ORFs are identical up to the sequence
encoding the C-terminal 10 residues (17 in Y. pseudotuberculosis), where they become very different. An in-frame
deletion within this ORF in Y. pseudotuberculosis did not
affect expression or secretion of YpkA or YopJ or virulence in mice,
leaving it without any obvious role in the LCR or in virulence, and it
was speculated that this ORF in fact is not translated (44).
The Orf7 product is interesting because it has properties very much
like those of the Yop chaperones (Sycs). Its molecular mass of 15,747 Da is similar to the sizes of other described Sycs (111),
and like other Sycs, it is predicted to be acidic (pI of 4.39) and to
have an amphipathic character. Intriguingly, the database search
revealed some homology of this predicted ORF to an ORF upstream of the
gene encoding the Avr-like protein HrmA in Pseudomonas
syringae pv. syringae (a plant pathogen that secretes virulence
proteins by a type III secretion mechanism) (1, 2). These
similarities suggest that we should revisit the role of Orf7 to
reassess its expression and possible role as a Syc for YopJ, which now
is believed to be targeted into eukaryotic cells and has been shown to
cause apoptosis of macrophages by Y. pseudotuberculosis and
Y. enterocolitica (71, 72). Deletion of this ORF
might not have had an effect on virulence, because abolishing YopJ
itself did not affect virulence in mice infected orally by Y. pseudotuberculosis (44).
YopM is a leucine-rich repeat (LRR) protein (58) previously
noted to be encoded by a gene with a relatively low percent GC content
containing a number of exact, directly repeated sequences and inverse
complement sequences (61). YopM was reported to be 41,556 Da
with a pI of 4.06 (61) but now is seen to be larger, at
46.21 kDa, and to have a predicted pI of 4.23. This new DNA sequence
result has been confirmed independently (77a). It is likely
that the original analysis was confounded by priming in different,
directly repeated regions. YopM is now predicted to have 15 instead of
13 LRRs (77a). Interestingly, the sequence reported for
yopM of Y. enterocolitica O:9 (18),
which is predicted to encode a protein with 13 LRRs, now differs from
the Y. pestis sequence by exactly 2 LRRs, and the difference
between the presently predicted Y. pestis YopM and that of
Y. enterocolitica comes in a region where the gene contains
exact repeats. This kind of genetic structure might be expected to be
prone to duplications and deletions, and indeed, yopM in
different Yersinia strains varies in size in the
LRR-encoding region (77a). Functional studies are needed to
determine the significance of this variation.
YopL was designated as the ca. 15-kDa product of a two-cistron operon,
yopKL, based on its elimination, along with YopK, from outer
membrane fractions of Y. pseudotuberculosis 43 (serotype III) carrying a version of pCD1 of Y. pestis KIM5 that had
Mu dI(Ap lac) inserted in yopK; it was the only
observed protein species eliminated by a downstream Mu dI(Ap
lac) insertion (103). YopL has not been found in
Y. pseudotuberculosis or in Y. enterocolitica, although in all three yersiniae there is a spacing of ca. 500 bp
between yopK and the downstream monocistronic operon
encoding YlpA (30, 54). This spacing is 496 bp in Y. pestis KIM and 482 bp in the sequence from Y. enterocolitica (30); the two sequences of this region
are highly similar except for a 14-bp insertion (consisting of a
directly duplicated 7-bp sequence) 301 bp after the stop codon for
YopK. Although this 496 bp could be large enough to encode YopL, we
found that it contains two ORFs that show amino acid homology to the IS
element ISD1. Accordingly, the sequence analysis of pCD1
does not support the existence of yopL, and the identity of
the second protein eliminated in expression by the Mu d insertion in
yopK is not known.
The predicted translation initiation site for YlpA is an uncommon
valine codon and lies 40 aa downstream of the initiating methionine
predicted for Y. enterocolitica YlpA (30). This
surprising finding led us to check further and confirm the sequence in
this region by sequencing directly from pCD1 in Y. pestis
KIM5. The Y. enterocolitica ylpA 5' end is indeed present in
Y. pestis KIM5, but there is an extra A in a stretch of
seven A's beginning at bp 50070 which is responsible for shifting the
reading frame in the pCD1 gene. This 7-A stretch is also present in the
pCD1 sequence directly read from Y. pestis KIM5.
Accordingly, we believe that YlpA likely is not expressed in Y. pestis, and if it is, it would not be a secreted lipoprotein as it
is in Y. enterocolitica: the C residue at the beginning of
the mature protein (bp 50020) lies well upstream of the stretch of
seven A's. There are now several instances in which insertions or
frameshifts in Y. pestis have abolished expression of genes
that are important for virulence in the enteropathogenic yersiniae.
These differences are thought to represent adaptations to the
vector-borne transmission for Y. pestis and are important
for the greater disseminative character of Y. pestis
(98). It will be interesting to test the significance of
YlpA's loss in Y. pestis.
The yadA'/'yadA ORF in Y. pestis KIM5 has the
same 1-bp deletion as in Y. pestis EV76 (98),
showing that this potentially virulence-enhancing loss of a prominent
fibrillar adhesin occurred prior to the divergence of the orientalis
(strain EV76) and mediaevalis (KIM5) biotypes of Y. pestis.
Potential virulence-related new ORFs.
Fourteen ORFs are not
obviously associated with IS elements and either have products with no
significant similarity to proteins in the database with known functions
or have features suggesting a virulence-related role. These are ORFs
that deserve future study as potentially having virulence or
virulence-accessory functions.
Orf75 (Table 1; Fig. 1) lies just 1 bp downstream of yopE
and lacks an obvious ribosome binding site or upstream promoter. The
ORF could encode an 11,192-Da protein with at least one likely transmembrane domain and a noncleavable signal sequence. Its expression conceivably is translationally coupled to that of yopE,
suggesting that it could be a member of the LCR. yopE has
been called monocistronic, based on its estimated transcript size (750 bases in Y. pseudotuberculosis [41]). The
presence of this ORF has not been noted in the literature, even though
the beginning of Orf75 is present in the sequences previously submitted
for Y. pseudotuberculosis yopE (40, 41), Y. enterocolitica O:9 (69), and Y. pestis EV76
(40). Interestingly, it is intact but is separated from
yopE by an insertion element in Y. enterocolitica
O:8 strain 8081 (40). At high doses, a Y. pseudotuberculosis mutant containing an insertion in this ORF did
not show a loss of virulence in mice infected orally (40). Given that YopE's importance in virulence was determined with polar
insertion mutants (40, 41, 54, 95-97, 104, 105), the
significance of this ORF needs to be thoroughly tested.
While preparing this paper, we learned that two new ORFs we found in
Y. pestis have been designated yopT and
sycT in Y. enterocolitica (56).
sycT and yopT are arranged in what appears to be
a bicistronic operon 500 bp upstream and on the opposite strand from
yopK (Fig. 1). These genes indeed have properties suggesting
that they encode a Yop and associated Syc. sycT is predicted
to encode an acidic 15.42-kDa peripheral protein (Table 1). The
database search brought up weak homology with SycE (with which there is
22% identity). A multiple alignment of SycT with SycE, LcrH (SycD),
and SycH shows the greatest similarity toward the C termini of the
proteins, as previously demonstrated in a comparison of SycE and
LcrH/SycD (111). YopT is predicted to be a peripheral
36.31-kDa basic protein (Table 1). It shows 36.7% identity in residues
98 to 322 with the C terminus (residues 648 to 874) of a surface
antigen in Haemophilus somnus that is associated with serum
resistance (31). The regulation, mechanism of action, and
role in plague of YopT should be investigated.
Orf42 through Orf44, immediately downstream of tyeA (Fig.
1), have been noted to exist in Y. enterocolitica
(109). Orf42 has been sequenced for Y. pseudotuberculosis, and a polar insertion near its 3' end caused a
calcium-independent growth phenotype (39), typical of
mutations in genes necessary for the functioning of the type III
secretion system. As this mutation was complemented by DNA lacking a
complete lcrD/yscV gene (downstream of Orf44), this
phenotype likely was not caused by disruption of lcrD/yscV. For this reason and because of the location (within the LCR cluster and
downstream of tyeA, which is involved in Yop secretion
control [57]), we speculate that one or more of Orf42
through Orf44 have a role(s) in secretion or secretion control.
Orf5 (Fig. 1 and Table 1) lies isolated from other virulence-related
genes, within a gap between the origin region and an IS1236
remnant. It may not prove to be virulence related.
Orf59, Orf60, and Orf61 (Fig. 1) lie between yopM and
sycT. Orf59 is closest to yopM (242 bp away), on
the opposite strand, and is predicted to encode a ca. 4-kDa soluble
acidic protein (Table 1), significantly smaller than typical Sycs.
Orf60 and Orf61 lie 875 bp from Orf59, are separated by 272 bp, and are divergently oriented. Both are predicted to encode membrane-associated proteins with mildly basic pIs that hence do not resemble typical Sycs
(acidic, soluble, ca. 16 kDa) or Yops (soluble). Orf60 has an uncommon
translation initiation codon (leucine) (Table 1).
Orf73 and Orf74 (Fig. 1) lie in the vicinity of yopE. Their
predicted proteins are 10- to 11-kDa soluble acidic proteins that show
high similarity to unknown proteins of similar lengths in Mycobacterium tuberculosis; however, neither ORF has a
common translation initiation codon (leucine [Orf73] and valine
[Orf74]). Both ORFs are predicted to be transcribed in the same
direction, with Orf74 overlapping Orf73 by 8 bp (Table 1).
Orf84 and Orf85 (Fig. 1) occupy the region between IS1617
and Tn1000p. They are separated by 139 bp and would be
transcribed in the same direction. Both ORFs appear to encode soluble
proteins, with the product of Orf84 predicted to be basic and that of
Orf85 predicted to be acidic (Table 1).
Summary.
Our analysis of the pCD1 DNA sequence has identified
an IncFIIA replicon and Sop-like partitioning system necessary for
plasmid maintenance. We noted the insertion sites of IS100,
Tn1000p, IS21p, and several partial
IS285 elements as well as two new IS elements, IS1616 and IS1617. In addition, there are
numerous IS element remnants clustered in four regions of the 70-kb
plasmid. We found no evidence for the existence of yopL,
and, in Y. pestis, ylpA and yadA are
pseudogenes. Although regulatory and secretory components of the LCR
constitute a contiguous LCR cluster, elements suggesting that this
region is a PAI were not identified. Genes for effector Yops are
scattered throughout the plasmid and have widely varying GC contents,
indicative of multiple gene acquisition events. This observation
coupled with the presence of IS remnants from only distantly related
microorganisms suggests that a very complex history of DNA acquisition,
insertions, deletions, and rearrangements was required for assembly of
pCD1.
We failed to find genes with similarities to putative virulence factors
that are not potential members of the LCR. However, we did identify
eight ORFs of unknown function (Orfs 5, 59, 60, 61, 73, 74, 84, and
85). The products of Orfs 7, 42, 43, 44, and 75, as well as YopT and
its chaperone SycT, are potential new members of the LCR virulence
system. Sequence analysis of Orf7 suggests that its product could be a
chaperone for YopJ. Clearly, experimental analysis of all of these Orfs
is required to determine if they are LCR members or non-LCR virulence
determinants.
We corrected the sequence of yopM, showing that its product
has two additional LRR repeats that are absent in Y. enterocolitica. While most LCR-related Y. pestis gene
products showed 98% identity to their analogous Y. enterocolitica gene products, YopJ, YscG, and YscE were ~94%
identical to Y. enterocolitica products. It will be
necessary to determine whether any of the differences in YopM, YopJ,
YscG, and YscE and the lack of a functional YlpA gene product are
involved in the differing levels of virulence among the pathogenic
yersiniae.
 |
ACKNOWLEDGMENTS |
We thank G. Plunkett III for help with the codon usage matrix and
IS elements, N. T. Perna for performing the initial database searches and organizing their output, and G. F. Mayhew and the technical staff of the Wisconsin Genome Project for DNA sequencing. We
thank G. R. Cornelis for informing us of the yopT and
sycT designations used in an in-press manuscript from his
research group.
This work was supported by Public Health Service (PHS) grant P01
HG01428 to F.R.B. R.D.P. and J.D.F. were supported by PHS grants
AI25098 and AI33481. S.C.S. was supported by PHS grants AI21017 and
AI41668.
 |
FOOTNOTES |
*
Corresponding author. Mailing address: Department of
Microbiology and Immunology, MS415 Medical Center, University of
Kentucky, 800 Rose St., Lexington, KY 40536-0084. Phone: (606)
323-6341. Fax: (606) 257-8994. E-mail:
rperry{at}pop.uky.edu.
Editor:
V. A. Fischetti
 |
REFERENCES |
| 1.
|
Alfano, J. R., and A. Collmer.
1997.
The type III (Hrp) secretion pathway of plant pathogenic bacteria: trafficking harpins, Avr proteins, and death.
J. Bacteriol.
179:5655-5662[Free Full Text].
|
| 2.
|
Alfano, J. R.,
H.-S. Kim,
T. P. Delaney, and A. Collmer.
1997.
Evidence that the Pseudomonas syringae pv. syringae hrp-linked hrmA gene encodes an Avr-like protein that acts in an hrp-dependent manner within tobacco cells.
Mol. Plant-Microbe Interact.
10:580-588[Medline].
|
| 3.
|
Allaoui, A.,
R. Schulte, and G. R. Cornelis.
1995.
Mutational analysis of the Yersinia enterocolitica virC operon: characterization of yscE, F, G, I, J, K required for Yop secretion and yscH encoding YopR.
Mol. Microbiol.
18:343-355[Medline].
|
| 4.
|
Altschul, S. F.,
W. Gish,
W. Miller,
E. W. Myers, and D. J. Lipman.
1990.
Basic local alignment search tool.
J. Mol. Biol.
215:403-410[Medline].
|
| 5.
|
Ausubel, F. M.,
R. Brent,
R. E. Kingston,
D. D. Moore,
J. G. Seidman,
J. A. Smith, and K. Struhl (ed.).
1987.
Current protocols in molecular biology.
John Wiley & Sons, New York, N.Y.
|
| 6.
|
Bakour, R.,
Y. Laroche, and G. Cornelis.
1983.
Study of the incompatibility and replication of the 70-kb virulence plasmid of Yersinia.
Plasmid
10:279-289[Medline].
|
| 7.
|
Barve, S. S., and S. C. Straley.
1990.
lcrR, a low-Ca2+-response locus with dual Ca2+-dependent functions in Yersinia pestis.
J. Bacteriol.
172:4661-4671[Abstract/Free Full Text].
|
| 8.
|
Bearden, S. W.,
J. D. Fetherston, and R. D. Perry.
1997.
Genetic organization of the yersiniabactin biosynthetic region and construction of avirulent mutants in Yersinia pestis.
Infect. Immun.
65:1659-1668[Abstract].
|
| 9.
|
Bercovier, H.,
H. H. Mollaret,
J. M. Alonso,
J. Brault,
G. R. Fanning,
A. G. Steigerwalt, and D. J. Brenner.
1980.
Intra- and interspecies relatedness of Yersinia pestis by DNA hybridization and its relationship to Yersinia pseudotuberculosis.
Curr. Microbiol.
4:225-229.
|
| 10.
|
Biek, D. P., and J. Shi.
1994.
A single 43-bp sopC repeat of plasmid mini-F is sufficient to allow assembly of a functional nucleoprotein partition complex.
Proc. Natl. Acad. Sci. USA
91:8027-8031[Abstract/Free Full Text].
|
| 11.
|
Biek, D. P., and J. Strings.
1995.
Partition functions of mini-F affect plasmid DNA topology in Escherichia coli.
J. Mol. Biol.
246:388-400[Medline].
|
| 12.
|
Biot, T., and G. R. Cornelis.
1988.
The replication, partition and yop regulation of the pYV plasmids are highly conserved in Yersinia enterocolitica and Y. pseudotuberculosis.
J. Gen. Microbiol.
134:1525-1534[Medline].
|
| 13.
|
Birnboim, H. C., and J. Doly.
1979.
A rapid alkaline extraction procedure for screening recombinant plasmid DNA.
Nucleic Acids Res.
7:1513-1523[Abstract/Free Full Text].
|
| 14.
|
Blattner, F. R.,
G. Plunkett III,
C. A. Bloch,
N. T. Perna,
V. Burland,
M. Riley,
J. Collado-Vides,
J. D. Glasner,
C. K. Rode,
G. F. Mayhew,
J. Gregor,
N. W. Davis,
H. A. Kirpatrick,
M. A. Goeden,
D. J. Rose,
R. Mau, and Y. Shao.
1997.
The complete genome sequence of Escherichia coli K-12.
Science
277:1453-1474[Abstract/Free Full Text].
|
| 15.
|
Bliska, J. B.,
M. C. Copass, and S. Falkow.
1993.
The Yersinia pseudotuberculosis adhesin YadA mediates intimate bacterial attachment to and entry into HEp-2 cells.
Infect. Immun.
61:3914-3921[Abstract/Free Full Text].
|
| 16.
|
Blomberg, P.,
H. M. Engdahl,
C. Malmgren,
P. Romby, and E. G. H. Wagner.
1994.
Replication control of plasmid R1: disruption of an inhibitory RNA structure that sequesters the repA ribosome-binding site permits tap-independent RepA synthesis.
Mol. Microbiol.
12:49-60[Medline].
|
| 17.
|
Bochner, B. R.,
H.-C. Huang,
G. L. Schieven, and B. N. Ames.
1980.
Positive selection for loss of tetracycline resistance.
J. Bacteriol.
143:926-933[Abstract/Free Full Text].
|
| 18.
|
Boland, A.,
M.-P. Sory,
M. Iriarte,
C. Kerbourch,
P. Wattiau, and G. R. Cornelis.
1996.
Status of YopM and YopN in the Yersinia Yop virulon: YopM of Y. enterocolitica is internalized inside the cytosol of PU5-1.8 macrophages by the YopB, D, N delivery apparatus.
EMBO J.
15:5191-5201[Medline].
|
| 19.
|
Bölin, I., and H. Wolf-Watz.
1988.
The plasmid-encoded Yop2b protein of Yersinia pseudotuberculosis is a virulence determinant regulated by calcium and temperature at the level of transcription.
Mol. Microbiol.
2:237-245[Medline].
|
| 20.
|
Borodovsky, M., and J. McIninch.
1993.
Genmark: parallel gene recognition for both DNA strands.
Comput. Chem.
17:123-133.
|
| 21.
|
Broom, J. E.,
D. F. Hill,
G. Hughes,
W. A. Jones,
J. C. McNaughton,
P. A. Stockwell, and G. B. Petersen.
1995.
Sequence of a transposon identified as TN1000 ( ).
DNA Sequence
5:185-189[Medline].
|
| 22.
|
Brubaker, R. R.
1991.
Factors promoting acute and chronic diseases by yersiniae.
Clin. Microbiol. Rev.
4:309-324[Abstract/Free Full Text].
|
| 23.
|
Brubaker, R. R., and M. J. Surgalla.
1964.
The effect of Ca++ and Mg++ on lysis, growth, and production of virulence antigens by Pasteurella pestis.
J. Infect. Dis.
114:13-25[Medline].
|
| 24.
|
Burland, V.,
D. L. Daniels,
G. Plunkett III, and F. R. Blattner.
1993.
Genome sequencing on both strands: the Janus strategy.
Nucleic Acids Res.
21:3385-3390[Abstract/Free Full Text].
|
| 25.
|
Burrows, T. W., and G. A. Bacon.
1956.
The basis of virulence in Pasteurella pestis: an antigen determining virulence.
Br. J. Exp. Pathol.
37:481-493[Medline].
|
| 26.
|
Butler, T.
1995.
Yersinia species (including plague), p. 2070-2078.
In
G. L. Mandell, J. E. Bennett, and R. Dolin (ed.), Principles and practice of infectious diseases, 4th ed., vol. 2. Churchill Livingstone, New York, N.Y.
|
| 27.
|
Carniel, E.,
I. Guilvout, and M. Prentice.
1996.
Characterization of a large chromosomal "high-pathogenicity island" in biotype 1B Yersinia enterocolitica.
J. Bacteriol.
178:6743-6751[Abstract/Free Full Text].
|
| 28.
|
Chandler, M., and O. Fayet.
1993.
Translational frameshifting in the control of transposition in bacteria.
Mol. Microbiol.
7:497-503[Medline].
|
| 29.
|
Cheng, L. W.,
D. M. Anderson, and O. Schneewind.
1997.
Two independent type III secretion mechanisms for YopE in Yersinia enterocolitica.
Mol. Microbiol.
24:757-765[Medline].
|
| 30.
|
China, B.,
T. Michiels, and G. R. Cornelis.
1990.
The pYV plasmid of Yersinia encodes a lipoprotein, YlpA, related to TraT.
Mol. Microbiol.
4:1585-1593[Medline].
|
| 31.
|
Cole, S. P.,
D. G. Guiney, and L. B. Corbeil.
1993.
Molecular analysis of a gene encoding a serum-resistance-associated 76 kDa surface antigen of Haemophilus somnus.
J. Gen. Microbiol.
139:2135-2143[Medline].
|
| 32.
|
Cornelis, G. R.
1994.
Yersinia pathogenicity factors, p. 243-263.
In
J. L. Dangl (ed.), Bacterial pathogenesis of plants and animals. Molecular and cellular mechanisms. Springer-Verlag, New York, N.Y.
|
| 33.
|
Cornelis, G. R., and H. Wolf-Watz.
1997.
The Yersinia Yop virulon: a bacterial system for subverting eukaryotic cells.
Mol. Microbiol.
23:861-867[Medline].
|
| 34.
|
Farabaugh, P. J.
1996.
Programmed translational frameshifting.
Annu. Rev. Genet.
30:507-528[Medline].
|
| 35.
|
Ferber, D. M., and R. R. Brubaker.
1981.
Plasmids in Yersinia pestis.
Infect. Immun.
31:839-841[Abstract/Free Full Text].
|
| 36.
|
Fetherston, J. D., and R. D. Perry.
1994.
The pigmentation locus of Yersinia pestis KIM6+ is flanked by an insertion sequence and includes the structural genes for pesticin sensitivity and HMWP2.
Mol. Microbiol.
13:697-708[Medline].
|
| 37.
|
Fields, K. A.,
G. V. Plano, and S. C. Straley.
1994.
A low-Ca2+ response (LCR) secretion (ysc) locus lies within the lcrB region of the LCR plasmid in Yersinia pestis.
J. Bacteriol.
176:569-579[Abstract/Free Full Text].
|
| 38.
|
Filippov, A. A.,
P. N. Oleinikov,
V. L. Motin,
O. A. Prostsenko, and G. B. Smirnov.
1995.
Sequencing of two Yersinia pestis IS elements, IS285 and IS100.
Contrib. Microbiol. Immunol.
13:306-309[Medline].
|
| 39.
|
Forsberg, Å.,
A.-M. Viitanen,
M. Skrunik, and H. Wolf-Watz.
1991.
The surface-located YopN protein is involved in calcium signal transduction in Yersinia pseudotuberculosis.
Mol. Microbiol.
5:977-986[Medline].
|
| 40.
|
Forsberg, Å., and H. Wolf-Watz.
1990.
Genetic analysis of the yopE region of Yersinia spp.: identification of a novel conserved locus, yerA, regulating yopE expression.
J. Bacteriol.
172:1547-1555[Abstract/Free Full Text].
|
| 41.
|
Forsberg, Å., and H. Wolf-Watz.
1988.
The virulence protein Yop5 of Yersinia pseudotuberculosis is regulated at transcriptional level by plasmid-pIB1-encoded trans-acting elements controlled by temperature and calcium.
Mol. Microbiol.
2:121-133[Medline].
|
| 42.
|
Fowler, J. M., and R. R. Brubaker.
1994.
Physiological basis of the low calcium response in Yersinia pestis.
Infect. Immun. |