0022-538X/88/041186-08$02.00/0
Copyright © 1988, American Society for Microbiology
Nucleotide
Sequence of
the Tail Sheath Gene of
Bacteriophage
T4
and Amino
Acid
Sequence
of Its Product
FUMIO
ARISAKA,1*
TATSUSHINAKAKO,1
HIDEOTAKAHASHI,2
ANDSHIN-ICHI
ISHII1
Department of Biochemistry,
Faculty of
PharmaceuticalSciences, Hokkaido University,Sapporo,'
and Institute ofApplied
Microbiology, University of
Tokyo, Bunkyo-ku, Tokyo
113,2
Japan
Received 18May 1987/Accepted 1 January 1988
The nucleotide sequence of gene 18 ofbacteriophage T4 was determinedby the Maxam-Gilbertmethod, partiallyaidedbythedideoxymethod. To confirm the deduced amino acidsequence of the tail sheathprotein (gpl8) that isencodedby gene18, gpl8wasextensively digested by trypsin orlysylendopeptidase and subjected to reverse-phase high-performance liquid chromatography. Approximately40peptides, whichcover88% of the primary structure, werefractionated, theamino acidcompositionsweredetermined,and the correspond-ing sequences in DNA wereidentified. Furthermore, the amino acid sequences of 10 of the 40 peptides were determined by a gas phase protein sequencer, includingN- and C-terminal sequences. Thus, thecomplete amino acid sequence of gp18, which consists of 658 amino acids with a molecular weight of71,160, was determined.
Themoststriking morphological change in bacteriophage T4uponinfectionis the contraction of its tail sheath.Thetail sheathis 95nmlong and 20 nmin diameterandconsistsof 144 protomers of the sheath protein, gpl8, arranged in 24 annuli(5, 13, 15). The structureof the tail sheath has been studied in detail by three-dimensional
image
reconstruction from electron micrographs ofnegatively stainedspecimens
and hydrated, nonstained specimens by cryoelectron mi-croscopy(1, 20,30).Contraction
of the sheath istriggered by
conformational change in thebaseplate
uponinfection,
and thechange inarrangementofthe sheathprotomers is trans-mitted likea wave toward thehead-proximal
endofthetail (24). Theheadand the inner tuberotateduring
contraction with respect to the baseplate fixed on the outer membrane of Escherichia coli(6). The resultant contractedsheathis38 nm long and 25 nmindiameter. Thecontracted sheath isvery stable; it doesnotdissociate with8 M urea or 6 Mguanidine
hydrochloride, but it does dissociate when boiled in sodium dodecyl sulfate (33).The morphological pathway of the tailstructure hasbeen workedoutbyKing and co-workers (14, 15).Theassociation of monomeric sheath
protein
gpl8 with the tube-baseplate initiates as soon as tube protein monomer gpl9 starts topolymerize
onthebaseplate. The length of the tail sheath is determined by the tube, whose length is determined by an elongatedprotein,
possibly gp48 (9). There is a variety of missense, as wellas nonsense, mutations in gene 18. They should be useful in studying the structure-functionrelation-ship
of the protein. Furthermore, gpl8 can be readily iso-lated in sizable quantity as a monomeric form which can reassemble onto the tube-baseplate to form the extended sheath(2, 34). Takingadvantage of this system, we wish to elucidatethemechanismof sheath contraction at the molec-ular level. As afirststeptoward this goal, we determined the primary structure ofgpl8.MATERIALSANDMETHODS
Enzymes and reagents. DNase I and trypsin (bovine pan-creas) were purchased from Sigma Chemical Co. Lysyl endopeptidase, a lysine-specific serine protease from
*Correspondingauthor.
"Achromobacter lyticus" M467-1, was
purchased
from Wako Junyaku Co. Polynucleotide kinase and all of the restriction enzymes used were purchased from Takara ShuzoCo. Methanesulfonic acid(4 M)waspurchasedfrom PierceChemicalCo., and all otherreagentswereanalytical
grade, fromeither Nakarai Chemicals or WakoJunyakuCo. Nutrient broth, tryptone, yeast extract,Bacto-Agar,
and CasaminoAcidswere purchased from Difco Laboratories.Medium and buffer. M9A medium was used togrow E. coli. Itcontained, perliter ofwater, 6 gofNa2HPO4,3 gof KH2PO4, 0.2 g of
MgSO4.
7H20, 0.5 g ofNaCl, 1 g of NH4C1,4gofglucose, and10 gofCasamino Acids. Phages werestoredinB + buffer,which contained,perliterofwater, 7 g of Na2HPO4, 4 g of NaCl, and 3 g of KH2PO4, supplemented with 1 mMMgSO4 foruse.Phage, plasmids, and bacterial strains. T4D.23amH11 is fromourcollection.T4D.18am mutants(Fig. 1)werekindly
supplied
byW.B.Wood andJ.King.E.coliBE wasusedas anonpermissive
host for ambermutant phages, andE. coli CR63 was thepermissive
host. E. coli JM103 was used to propagate M13phages andtopreparereplicative-form DNA ofM13 phage. Three E. coli BE strains, which carry plas-mids p662, p655, and p664, respectively, were kind gifts fromTom Mattson. Eachofthe threeplasmids contains an EcoRI fragment which forms approximately one-third of gene 18(Fig. 1).Markerrescue test. MarkerrescueofT4D.18am mutants by plasmids was performed asdescribed by Mattson et al. (21). The results are also shown in Fig. 1.
DNA preparations for sequencing. DNA preparations for theMaxam-Gilbert method have been described by Christ-ensenandYoung(8). DNA was 5'end labeled with polynu-cleotide kinase, and the strands were then separated by polyacrylamide gel electrophoresis, or alternatively, the double-strandedDNAwas cut with a restrictionenzyme and the desired fragment was purified by polyacrylamide gel electrophoresis.
DNAsequencing procedures. The Maxam-Gilbert method was done essentially as described by Maxam and Gilbert (22). The dideoxy method was carried out as described by Messing (23).
Preparation of
gpl8. Gpl8
was prepared as described by1186
on November 10, 2019 by guest
http://jvi.asm.org/
z
E
0 V-It X f
Lu LuzZL u zz
E E EE E EE
a a a.a
als
662-I* 655 664
[markerl IamB81S,NG185 amNG275,NG302:amNG375.NG403
Lrescue NG317 NG474.NG591 NG154
FIG. 1. Markerrescueof18ammutations. Markerrescue
exper-iments were carried out asdescribed by Mattson et al. (21). The relative positions of some of the mutations were measured by
standardcrossing experiments with17amNG178or19amE1137and
areshown above the horizontalline. TheextentsofT4 DNAcarried
by plasmids p662, p655, andp664 areindicated above the marker rescueresults.
Tschoppetal. (34). Monomeric gpl8 thuspreparedwasused
within a week. Tails, which are the immediate source of
monomeric gpl8, could be stored for several months as a
solution with 0.02% sodium azide at 4°C without loss of activity as judged by the reassociation ability of isolated
gpl8. Freezing tails forstoragecauses somedecrease in the
yield of monomeric gpl8.
For Edmandegradation of whole protein, gpl8wasfurther
purified by sodium dodecyl sulfate-gel electrophoresis as
describedby Hunkapilleretal. (11).
Proteolytic digestion of gpl8. Monomeric gpl8 in 1 mM phosphate buffer, pH 7.0,was heat denaturedat90°C for 10 min, and trypsin, treated with TPCK (L-1-tosylamide-2-phenylmethyl chloromethyl ketone) was added ataweight
ratio of 1/100. Afterincubation at37°C for 2 h, trypsin was
furtheraddedataweight ratio of 1/100 and incubatedat37°C for 12 h. The reaction was stopped by heating in boiling
waterfor 3 min, and after cooling, trichloroacetic acid was
added to a final concentration of 5%. After removal of
precipitates by centrifugation at2,300 x g for 15 min, the
trichlororoacetic acid wasremovedby ether extraction and
water was removed by evaporation. The gpl8 digest was dissolvedinwater atabout 4mg/ml, and insoluble materials were removed with a 0.45-,um (pore size) membrane filter (Millipore Corp.) before application to high-performance liquid chromatography (HPLC).
HPLC of gpl8 digests. Peptides were analyzed and sepa-rated on a reverse-phase HPLC system (Hitachi 655 or Waters 600) equipped with a gradient former, a degasser (Erma ERC-3510), and UV and fluorescence monitors (Hi-tachi 638 and JASCO FP110, respectively). A column of Nucleosil SC18 (0.4by 25 cm) wasused. Water was deion-ized and doubly glass distilled before use. Twenty microli-tersofthesamplewasfirstanalyticallyexamined, and then 200
RI
wasapplied and peaks were manually fractionated.Amino acid analysis. The amino acid compositions of peptides, aswellasofthewhole protein,weredetermined, afterhydrolysis with4 Mmethanesulfonic acidat115°C for 24h as described bySimpson etal. (29), on anamino acid analyzer (Hitachi 835).
Sequencedetermination ofpeptides and protein. The amino acidsequencesofpeptidesand theproteinweredetermined on a gas phase protein sequencer (Applied BioSystems 470A). Usually, 0.5 to 1 nmol ofsample was applied and analyzed. Phenylthiohydantoin derivatives of amino acids were analyzedand identifiedby HPLC.
RESULTS
DNAsequence determination of gene 18. Three plasmids, p662, p655, and p664, were used to sequence gene 18. The sequencing strategy is shown inFig. 2b. Theorientation of the 0.6-kilobase EcoRI fragment from
p655
was originally decidedonthe basisof theopenreading frame and then later confirmed bypeptide
analysis of gp18 as described inthe nextsection. Because of thedifficulty encountered in finding appropriate restriction sites in the 5'-end region ofgene 18 by the Maxam-Gilbert method, the dideoxy method using M13phage wasalso used (Fig. 2b).As aresult, asequence ofapproximately 2,000 bases thatcoverall ofgene18,the 3' end ofgene 17, and the 5' end ofgene 19 wasdetermined662 655 664
a) | 11.5 kb) (0. 6 kb) (3.4 kb)
m X S Z Zm o - m m
O Og.~v:00.\ \/~9~~t S~ ~ ~~~~~~~~~10 -' - .
.ac-!n
LLXLL..'
XI~/7V.-g17
- .I I
* * -4
^~ aleD - - A;
6 zo
*-.
;-o -0.
-r
n-
..g
.... .T-16
i1 1A
I-9
_T-A2
MnffEU_
FIG. 2. DNAsequencingofgene18 andpeptide analysisof itsproduct. (a)Restrictionmapof thegene18region. (b)Orientations and
lengths ofnucleotide sequences obtained (arrows). Arrows with closed circles represent sequence information from the Maxam-Gilbert
method,andarrows withopencircles arefromdideoxysequencing usingM13 clones. (c)Assignmentoftryptic andlysylendopeptidase
fragmentsofgp18.Locations of thepeptidesidentifiedaredrawn incomparisonwith the DNAsequence. Boxes abovethe continuous line
arefortryptic peptides,and those under the lineareforlysylendopeptidase-digested peptides.Openboxesrepresent peptidesidentifiedby
aminoacidcomposition only,and closed boxesrepresentthoseidentifiedbyaminoacidsequenceaswell.
b)
c)
, ...El I I 11
--T-24LI _,LI -
on November 10, 2019 by guest
http://jvi.asm.org/
sid
oflimne
17 startslilogael
10 20GMSMV-~~~~~~~ ~~IML.C A I ~~~. ilL
10
'~20
30 40 50 60 4 70 480 90 100 110 1201 Hapl Sacl
TAAA30
K40
F~
IQ~
V T N E VD
T & D Y7rG'G
ACKG-A-UI;-TG-dGGYAAATCAGGGIG-cci',c.-T-,TCGATTAAAGTTTAC
iATG
ATTrFAAT-ATAAATACTTTTGGTCAACCAACCGC,TGAAAC,TGCTGMAf
130 A40 150 160 170 180 190 200 210 220 230 240
Hasill Hapl
F M S A M N 7? Lk Y G N D L R V8 R'A V D R D T A KI9N S S P I A G N I T.O Ij5
i
TTTTdTcGc--d_fTTTTCGAGAAGC-A-A-AGTCT-l'GT"-TGGAACGCAAAAf-ArdC--f-TMAAAtGi-h
MM250 260 270 280 290 300 310 320 330 340 350 360
K'YV12 140
T P G S N Y11R V GD K IT..KL V SD D IE T E G Kvit T E V D AD G K'IK K&'! N
TACCGAfAATA
-(GT~GGGAAA~ATAGTAATAGTTAATAATAATA=AAf-f-R-i-C-i-Y-;AAATAA'-ATA
370 380 390 400 410 420 430 440 450 460 470 4811
IPT A K
~~i1'5l0
~IA K ~ IP1_. L G SNWTj.Y. S I1§ L.k j40 500 50 520 530 540 550 560 570 580 59i 0
490510
~~~~~~~~~~~~~~~~~~~~~~~~Pat
0TGGAKAAT
ATfAD_A
Ci'.LA
210-TANT
J-K-'KAT'G-ARA-AATCGAAGCTGCTATGACAGCGGTTGACTTTCAAGCRAAATCTTAAAAAATAT
E CAGGAGTAGTAGC610 4620 630 640 650 660 670 680 690 70 AI~1 2
2in
240350
EcoRI 280Y.PGEL'oDK11E_EI SKI Y A ILIG A S L L P Y P G G G T R A S T
~I 730 740 750 760 770 780 790 800 810 820 830 840
Hasll
270 2891 290 KR 309
iJLJ
A K A V F G Y G P j T D 5jQ Y An V RRNf D A VISSVL LRiKG
YMfi(~AdIGTTM7CCK-f-TcrdA-t fAfMTGAAWf_Y_fUTA_G_T& _T &MGTOGAAAA
850 860 870
4F880
890 900 910 920 930 940 950 960HinflI
310 1 320 330 340
YaSNI I D Jr. A K'IG G S E Y I F A T A Q N W P E G F S G I L T L S G G L
970 9804aq 990 1000 1010 1020 1030 IOyJ1 1050 1060 1070 1080
D3LM31A
0 370 380S S N A E V' A GDLMEAWD F F A D R E S V D V Q L F I A G S C A G E S L E
1090 1100
1110
IIl2f
1130 114041150
1160 1170 11804II 91
1200Hindlill Hinf I Hapl HinfI
TA ST V~39
1,
VS I DCLVIV C 410 42TAS VQK .
I)LVVRtv
RI D C S P P R FI,VVi GIi.
Lv &L .AV D1210 1220 1230 1240 1250 $ 1260 1270 1280 1290 1I300 1310 1320
Fokl
£20 440 480 1EcoRI 8
1330 1340 1350 1360 1370 1380 1390 1400 1410 1420 1430 1440 NW48 V P L7 AK,C'D I A G L. C A R4?? DN VI I L ~N I . L (I.irr~~~~~~~~~~~~~~~~~C1TAACThTATTAALN Vjj 1450 1460 4 47 180 1490 1500 150 12 1530 1540 4i550 1560
PvuII Foki PvuII Hinf
IA IY.j.l 4 E.EA I N P V T _GY
1570 1580 1590 1000 1610
16T1
-ff-3
1640 1650 1660 1671) 1680 DIIo501TNSKIY7RIN V R RiJr..N UL,8I N I G R F ELIAN±- A,k~ .1JT0'.I F16911 41700 1710 1720 1730 1740 1750 1760 14770 1780 1790 1800
Sau3AI *Taq
T,590
K1A
600R4
V810 820Y,
iAL L G,.Y'L.YI L L i NLN.T.1!JIT PI SiV DJLAlL.E V
1810 1820 1830 1840 1850 1860 1870 1880 1890 1900
19111
1924i
630 640 880 8858 madWlgmmaII
ATFY I QP A
R'~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~S
I NY!I T L N FV A TAT ~ D.FfLA L T G.~AGGCXCTTAC T C GGM-TtAGATTrTCGATGAGdAAGGT TTGCTTTAATACGGTGCATT
1930 FokI194%Hapil 1950 1960 1970 1980 1990 2000 2010 2020 2030 2040
Pi V,"' 2 IlrmgmlM F V D D
FIG. 3. Sequence ofgene 18 and its product. T4 late promoter consensus sequences TATAAATA for genes 18 and 19 are doubly underlined by solid lines, and the putative Shine-Dalgarno sequences for the twogenes are underlined bybroken lines. Arrows pointed
downwardalongthe aminoacidsequenceindicate thecleavagesitesbytrypsinorlysylendopeptidase,and thosepointed upwardalongthe nucleotidesequenceindicate the restriction sites. The amino acids identified byanalysisof thepeptidesare underlinedby single bars, and those identified by amino acid sequenceanalysis as wellare underlined by arrows. The N-terminal sequence for the whole protein was
determinedfrom Thr 1toPro 5. Twopredictedstem-and-loopstructuresnearthe 5' endof thetwopromotorsbetween genes 18and 19are
markedbyarrows.Theirstabilizationenergies,calculated bythe method of Ooi and Takanami(25), are -19.4kcal(1cal =4.184J) (near
pl)
and -15.4 kcal(near P2), respectively.(Fig.
3). The 3' endof gene 17 has beensequenced by
D.Description
ofthe DNA sequence.Figure
3shows the DNA Powell,J.Franklin, F.Arisaka,and G.Mosig
(manuscript
in sequencedeterminedasdescribed above. As indicated in thepreparation),
and the 5' end of gene 19 has also beenfigure,
the T4phage
late promoter consensus sequence,determined
(3).
TATAAATA (8), and aputative Shine-Dalgarno
sequence,on November 10, 2019 by guest
http://jvi.asm.org/
TI j.-3
10 20 30 40 50
FIG. 4. Isolation of tryptic peptides ofgpl8by reverse-phase HPLC. A Nucleosil5C18column(0.4 by 25 cm) was used, with elution with
a4 to 36% acetonitrile linear gradient in 0.1% trifluoroacetic acid at 1 ml/min. The lower and upper elution profiles representA210 and fluorescence emission (excitation at 286
nn,
emission at 356 nm) from tryptophan residues, respectively.Thepeaks indicated by arrows were fractionated and rechromatographed under the same conditions except for 10 mM ammonium formic acid, pH 6.7, instead of 0.1% trifluoroacetic acid.TAAaGAGG, are upstream from the initiation codon for gene 18; the capital letters in the latter sequence are com-plementarytothe 3' endof the 16S rRNA. The significance oftwolate promotersand thepredicted two stem-and-loop structures between genes 18 and 19 are addressed in the Discussion.
Analysis of tryptic fragments ofgpl8. Purified gpl8 was heatdenatured, digested with trypsin, and then
subjected
to reverse-phase HPLC (Fig. 4). A number of well-separatedpeaks
werefractionated and further purified by rechromatog-raphyonreverse-phase HPLC. Theiramino acidsequences were determined by a gas phase protein sequencer; the resultsareindicated byarrows inthe sequence(Fig.3).
All of the sequences were located on the DNA sequence.Peptide
T-24is theN-terminal
one;the N-terminal threonine was further confirmed by Edman degradation ofthe wholepolypeptide
chain.Peptide
T-16correspondstothejunction sequence between DNAfragments
655 and 664 (Fig. 2b).Peptide K-15, corresponding
tothejunction
between 662 and 655, was also found in the lysylendopeptidase digest
ofgpl8,
as describedbelow.When a protein sequence is deduced from the DNA sequence, itis essentialtodetermineatleasttheN-terminal and C-terminal sequences of itsgene
product
to obtain the completeprimary
structure oftheprotein
unambiguously.
Although the N-terminalpeptide
was found among thetryptic
fragments
ofgpl8
separated by HPLC(Fig.
4), the C-terminalpeptide
wasnot.Wetherefore triedtoisolatethe C-terminalpeptide
fromthetryptic digest
ofgpl8
by
affinity
chromatography, in which theability of immobilized anhy-drotrypsin to bind peptides specifically with arginine or lysineattheir C terminiwas used (12, 17). We thusidentified T-A2 as the C-terminal peptide (Fig. 2c). The complete primary structure of gpl8, consisting of 658 amino acid residues withamolecular weightof71,160, wasdetermined (Fig. 3). Excellent agreement of the amino acid sequence withtheamino acid composition oftheproteinwas obtained (Table
1).
A predicted secondary structure ofgp18
by the method ofChou and Fasman (7)is shown inFig. 5.Peptide mapping of gpl8 by lysyl endopeptidase. One-dimensional peptide mapping of gpl8 was carried out on reverse-phase HPLC. Theobjective of thepeptidemapping wastwofold. (i)We intendedtoconfirm furthertheprimary structureofgpl8, and (ii) for future study of thehigher-order structure of gpl8, it is desirable to establish a system by which one can readily identify a peptide whose chemical nature has beenaltered by chemical
modification
or amino acid substitution bymoutation.
Lysyl endopeptidase wasused because of its
high
specificity
forlysine
residues and because it produces fewerpeptides than doestrypsin
(ap-proximately 30 instead of more than 50). gpl8 wasS-carboxymethylated
with monoiodoacetic acid under dena-turing andreducing
conditions anddigested by lysyl
endopeptidase asdescribedinMaterials
andMethods.
The resultantdigestwas subjectedtoreverse-phase
HPLC, and
all of the major
peaks
were fractionated. After further purification byrechromatography,
theaminoacid composi-tionof eachpeptide
wasdetermined andtheposition
of theon November 10, 2019 by guest
http://jvi.asm.org/
TABLE 1. Amino acidcompositionof gpl8a
Expectedno.
of residues Amino acid from nucleotide
sequence (amino acid
analysis)
Asx... Thr...
Ser...
Glx... Pro ...
Gly...
Ala...
Cys... Val...
Met ...
Ile...
Leu ...
Tyr...
Phe...
His...
Lys...
Trp...
Arg...
80(83.9)
55(54.0) 47(44.8)
55(59.9) 23 (22.8) 56(57.1)
67(70.8)
5(5.0)
50(39.5) 6(5.8)
50(41.1)
42(43.0) 31(30.7)
24(24.0)
1(1.0)
31(31.0)
7(5.5)
28 (28.6)
a
gpl8
hadatotal of 658residues andamolecularweightof71,160.peptides along the correspondingDNA sequence was iden-tified. The results are shown in Fig. 6 and Table 2. The identifiedpeaks cover88% ofthe totalprimary structure of gpl8 (Fig. 2c).
DISCUSSION
The complete nucleotide sequence of gene 18 and the deduced amino acid sequence, most of which was confirmed by peptide analysis, have been presented. Concerning the nucleotide sequence, a T4 phage late promoter consensus sequence was found 29 bases upstream from the initiation codon. Since Stahlet al. (31)have demonstratedthat genes 17 and 18 have no polarity effect, it is very likely that the consensus sequence is the true late promoter for gene 18. The stop codon for gene 17 was in the late promoter
TABLE 2. Assignmentoflysylendopeptidasefragments
in HPLC'
Peptide Residueno.
K-1... 1-10
K-2... 11-31
K-3... 32-41
K-4... 42-89
K-5... 90-114
K-6... 115-118
K-7... 119-129
K-8... 130-138
K-9... 139-141
K-10... 142-148
K-11... 149-152
K-12... Notrecovered
K-13... 155-185
K-14... 186-214
K-15... 216-232
K-16... 233-240
K-17... 241-245
K-18... 246-26
K-19... 266-297
K-20... Notrecovered
K-21... 302-316
K-22... Notrecovered
K-23... 391-454
K-24... 455-459
K-25... 460-503
K-26... 504-538
K-27... 539-560
K-28... 561-568
K-29... 569-595
K-30... 596-658
a These dataaretaken fromFig.6.
sequence for gene 18 and no terminatorlike structure was foundnearthe 3' end of gene 17.Ontheotherhand,twolate promoter consensus sequences were found between genes 18 and 19. Originally, we expected to find no promoter sequencebetweengenes 18and19because ofthesignificant polarity effect between thesetwogenes(31). AnS1mapping experiment forthat region ofthe DNA was carried out to elucidate the roles of the two late promoter consensus sequences and theterminatorlike structureatthe 3' end of
o.,,,oooo
,2
. D? .b)I-S
-oo:~o000a-> i
'°°°oo °°oooo .
CS 5~~~~~~~~~C
"'-0.
CSCD Ca. cS
3 *2)e
-FIG. 5. PredictedCof gtseCo r u r
Fa~-In.5 rdingomin(bseeondiryscrussin),resopecivly areiecndiatey barrctreows.8wsetiae ytemehdo huan amn()
Sybls ,a eixndctean.,,8set; ,,8tr; rmide.Te oatosoffvecseiersius r lo
seunceV oooost h T-o T-idn oan fsm ucetd-idn rtis(lada)adteE adoh
Fa-IG. . Prdominc(b;seodrDstrctusin,resop18ctively areondiarstuturearrows.iatdb temthdo Coof ndFsan(
-A^'A
on November 10, 2019 by guest
http://jvi.asm.org/
FIG. 6. Peptide mapping ofgpl8by lysyl endopeptidase. Anelutionprofile ofreverse-phaseHPLCasmeasured byabsorptionat210nm
is shown. Conditionswereasdescribedin the legendtoFig. 4,exceptthatthe peptideswereelutedataspeedof 0.5ml/min.All ofthemajor
peakswerefractionated,and theaminoacidcompositionsweredetermined. Theidentified peptides intheprimarystructurearelisted in Table
2.
each late promoter sequence. It revealed that both of the
consensus sequences serve as weakpromoters butmost of the transcripts arise from further upstream of the two
promoters,possibly from the promoterforgene 18 (3).
Whenwestartedthepeptide analysis of gpl8,weassumed
that theN-terminal aminoacid residuewasblocked, because
Brenneretal. (5) couldnotdetecttheN-termitialaminoacid
by the 2,4-dinitrophenol method. Since we identified the
N-terminal peptide with which Edman degradation was
feasible,wereexamined the2,4-dinitrophenol methodonthe
whole protein and found 2,4-dinitrophenol-threonine after hydrolysis (data not shown). We used monomeric gpl8 for N-terminal analysis instead of the contracted sheath which they used. This might explain why they did not detect the N-terminal residue. Contracted sheath is known to be
ex-tremely stable(33). Recently, Ben-Bassat and Bauer (4) have presentedanempirical rule for the removal of the N-terminal
methionine based on the second amino acid residue. The
removal of N-terminal methionine in gpl8agreeswith their rule thatit isremoved when the second residue is threonine. The proposed rule also agrees with the fact that the N-terminal methionine ofgpl9 is retained when the second residue isphenylalanine (3).
Among thethreepeptides notrecovered(Table 3; Fig. 6), peptide 317-390containsmostof thesequencenotconfirmed bypeptide analysis.Thispeptideisconsidered tobe hydro-phobic and appears to aggregate and precipitate during protease digestion. Attempts to recover this peptide with reasonable yield have been unsuccessful. However, it should be noted that where confirmation of the nucleotide
sequence by peptide analysis was not carried out, the
nucleotide sequence wasdeterminedinbothdirections (Fig. 2b).
Determination of the primary structure is the first step toward elucidation ofthe structure-function relationship of the tail sheath protein. Twenty-nine mutant strains of T4
with
distinct phenotypes are present in gene 18. They include ts (growth at 30°C but not at420C;
two strains), cs (growth at 37°C but not at 25°C; three strains), hs (heat labile;fourstrains), CBW (caninfect inthe presenceof high concentration ofpolyethylene glycol in the medium; three strains),and amber mutants(17strains).Identificationof the mutation sites in gene 18 or gpl8 will give us information concerningroles oftheamino acid residues at the mutation sites. Forexample, it is likely that CBW mutants have analtered
aminoacid residue locatedonthe surface of the tail sheath,where it mustinteractwith the tail fibers. Also,the peptidemapproducedby lysylendopeptidase
willhelpus toidentify peptideswhich contain asubstituted amino acid or thosewhichhave achemically modifiedamino acid residue.
TABLE 3. Molecularweightsofsheathand tubeproteins
Molwt(103)of: Phage Sheath Tube
protein protein Reference
T4 71 19 Thisstudy
SPOl 60 12 26
Mu 55 13 10
P2 46 20 19
PS17 36 18 28
on November 10, 2019 by guest
http://jvi.asm.org/
a 1 ) gpl8 Bovine ATPase E.coli ATPase
358 W D F F D R E S V V D V Q L F I A G S 376
248 E Y F R D Q E G - Q D V L L F I D N I 259
229 E K F R D - E G - R D V L L F V D N I 244
a 2) gpl8 524 V T G T G G D G Y V L Y G D K T A T S V P S P D F R I N V R R L 555
EGF-R 693 V L G S G A F G T V Y K G L W I P E G E KVK -I P V A I K E L 723 ADH 197 V FGL G G V G L S V I N G C K A A GAA R I -I G V D I N 228
* * * * * *
b ) gpl8 ' T V K Y V S D D I E T E G - K I T E V D A D G K I K K I N 143
calmodulin 76 N K D T D S E E E I R E A F R V F D K D G D G Y I S A A E 114 troponin C 86 D A K G K S E E E L A E C F R I F D R N A D G Y I D A E E 124 parvalbuuin 79 D A R D L S V K E T K T LN A A G D K D G D G G I G A D E 101
FIG. 7. Sequences in gpl8homologoustonucleotide-bindingsites(al,reference18; a2,reference32)and the EF hand(b,reference16). Abbreviations: EGR-R,epidermalgrowthfactor;ADH, alcoholdehydrogenase.
Some
characteristic
features ofgpl8
drawn from the primary structure may be pointed out. There are five Cys residues, and all of them are in the C-terminal half ofthemolecule
(Fig. 5). Preliminary experiments have indicated that two ofthem form a disulfide bond(data
notshown).
There is a
se4uence
in the N-terminalregion
with five consecutive serine residues(Fig.
3). Thisregion
is consid-eredto assume anundefinedconformation,
possibly
located onthe surfaceofthe
protein. Also,thereisonlyonehistidine residue in the total of 658 residues.Recently, it was reported that gpl8 is a
GTP-binding
protein (27). We therefore searched forhomologytonucle-otide-binding
proteins. Some homologywasfound(Fig.
7al anda2). Further study is necessary to assess the significance of thehomology and the role ofGTP in the sheathprotein.
Thereis alsoaregionhomologous
toCa2'-binding
proteins,
the so-called EFhand(16;Fig.7b).
Thesignificance
of this homology is not clear, but Ca2+ might be important to stabilize the native structure ofgpi8.Ithasbeenreported thatgpl8has 2 to3% hexosamine(5, 34).Amino acidanalysis of gpl8 has shownthat it has some amino sugar whose elution time in the analyzer does not coincide with that of any commonly encountered amino sugarsuchasglucosamine, galactosamine, ormannosamine
(R. Muramatsu,
F.
Arisaka,andS. Ishii, unpublished data). However, we didnot detect this amino sugar in the amino acid analyses of peptides inthe present study, which cov-ered88% ofthe
totalprimarystructure. Inparticular, itwas apparentthatthere are nocovalently attached amino sugars in those peptides whose sequences were determined by Edmandegradationin a protein sequencer. The structure of theamino sugar and its mode of linkageto theprotein remain tobe
elucidated.Anotherapproach to
elucidation
of the structure-function relationship ofthe sheath protein would be to compare the primary structure ofT4with those of other phages. Molec-ular weights of contractiletail sheath proteins from a number ofbacteriophages are listed in Table 3. Although tail tube proteins have relatively constant molecular weights, the molecularweightsof the sheath proteins vary considerably. Wepostulate thatthese sheath proteins share a homologous domainwhich
playsan important role in sheath contraction. Recently, tail sheathand tube genes from Pseudomonas aeruginosaphage
PS17 have been cloned (T. Shinomiya, personalcommunication). Also, the sequence of the genome ofphage 186, closely related to phage P2, will be determined (J. B. Egan,personal communication). We therefore expectthat theprimarystructuresofatleasttwophagesother than T4 will soon be available. The sequence data will be very useful in
studying
the structure-functionrelationship
of the sheath proteinand may provide insightsinto the evolution-aryrelationship amongthese phages.ACKNOWLEDGMENTS
WethankT.Mattsonforprovidingplasmids662,655, and 664, G. Mosig for sending data before publication, T. Senba and
co-workers,NikkakiCo., Tokyo, andN.Ochi andco-workers,Chugai Pharmaceutical Co., Tokyo, for determination of the amino acid
sequencesofpeptides. Thanks arealso due FrederickA. Eiserling and RobertL. Dudafor valuablecomments onthemanuscript.
This work was supported in part by grants-in-aid for scientific research from the Ministry ofEducation, Science,and Culture of
Japan.
LITERATURECITED
1. Amos, L. A., and A. Klug. 1975. Three dimensional image reconstructions of thecontractile tail of theT4bacteriophage.J. Mol. Biol. 99:51-73.
2. Arisaka, F., j. Tschopp, R. van Driel, and J. Engel. 1979. Reassembly of the bacteriophageT4tail from thecore-baseplate
and the monomericsheathprotein P18:acooperative associa-tionprocess.J.Mol.Biol. 132:369-386.
3. Arisaka, F., L. Ishimoto, G. Kassavetis, T. Kumazaki, and S. Ishii. 1988.Nucleotidesequenceof the tail tubestructural gene
ofbacteriophageT4.J. Virol.62:882-886.
4. Ben-Bassat, A., and K. Bauer.1987. Amino-terminalprocessing ofproteins.Nature(London)326:315.
5. Brenner, S., G. Streisinger, R. W. Horne, S. P. Champe, L.
Barnett, S. Benzer, and M. W. Rees. 1959. Structural compo-nentsof bacteriophage. J. Mol. Biol. 1:281-292.
6. Caspar, D. L. D. 1980. Movement and self-control in protein assemblies. Quasi-equivalence revisited. Biophys. J. 32:103-135.
7. Chou, P.Y., and G. D. Fasman. 1978. Empirical predictions of protein conformation. Annu. Rev. Biochem.47:251-276. 8. Christensen, A. C., and E. T. Young. 1982. T4 late transcripts
areinitiatednear aconservedDNAsequence. Nature (London)
299:369-371.
9. Duda, R. L., and F. A. Eiserling. 1982.Evidencefor an internal component of the bacteriophage T4D tail core: a possible
length-determiningtemplate. J. Virol.43:714-720.
10. Giphart-Gassler,M., C. Wiffelman, and J. Reeve. 1981. Struc-turalpolypeptides andproducts oflate genes of bacteriophage Mu.J. Mol. Biol. 145:139-163.
11. Hunkapiller,M. W., E. Lujan, F. Ostrander, and L. E. Hood. 1983. Isolation ofmicrogram quantities ofproteins from
on November 10, 2019 by guest
http://jvi.asm.org/
acrylamide gels for amino acid sequence analysis. Methods Enzymol. 91:227-236.
12. Ishii,S., H. Yokosawa, T. Kumazaki, andI. Nakamura. 1983. Immobilized anhydrotrypsin as a specific affinity adsorbent for trypticpeptides. Methods Enzymol. 91:378-383.
13. Kellenberger, E., and E. Boy de la Tour. 1964. On the fine structure of normal and "polymerized" tail sheath of phage T4. J. Ultrastruct.Res. 11:545-563.
14. Kikuchi, Y., and J. King. 1975. Genetic control of bacteriophage
T4baseplate morphogenesis. III. Formation of the central plug andoverall assembly pathway. J. Mol. Biol. 99:695-716.
15. King, J., and N. Mykolajewycz. 1973. Bacteriophage T4 tail assembly: proteins of sheath, core and baseplate. J. Mol. Biol. 75:339-358.
16. Kretsinger, R. H., and C. E. Nockolds. 1973. Carp muscle calcium-binding protein. II. Structure determination and general description. J. Biol. Chem. 248:3313-3326.
17. Kumazaki, T., T. Nakako, F. Arisaka, and S. Ishii. 1986. A Novel method for selective isolation of C-terminal peptides from tryptic digests ofproteins byimmobilized anhydrotrypsin: application to structural analyses of the tail sheath and tube proteins from bacteriophage T4. Proteins Struct. Funct. Genet. 1:100-107.
18. Leberman, R., and U. Egner. 1984. Homologies in theprimary
structureofGTP-binding proteins: the nucleotide-binding site of EF-Tuandp21. EMBO J. 3:339-341.
19. Lengyel, J. A., R. N. Goldstein, M. Marsh, and R. Calendar. 1974.Structure of bacteriophage P2 tail. Virology 62:161-174. 20. Lepault, J.,and K. Lenard. 1985. Three-dimensional structure
ofunstained, frozen-hydrated extended tails of bacteriophage T4.J. Mol. Biol. 182:431-443.
21. Mattson, T., G. vanHowe,A.BoUle,G.Selzer,and R.Epstein. 1977. Genetic identification of cloned fragments of bacterio-phageT4DNAandcomplementationbysomeclonescontaining earlyT4genes.Mol. Gen. Genet. 154:319-326.
22. Maxam,A. M.,and W. Gilbert. 1980. Sequencing end-labeled
DNAwithbase-specific chemical cleavages. Methods Enzymol.
65:499-560.
23. Messing, J.1983. New M13cloning vectors. Methods Enzymol. 101:20-78.
24. Moody,M. F. 1973. Sheath of bacteriophage T4. III. Contrac-tion mechanism deduced from partially contracted sheaths. J.
Mol. Biol. 60:613-635.
25. Ooi, T., and M. Takanami. 1981. A computer method for construction of secondary structure from polynucleotide
se-quence: possible structure of the bacterial replication origin. Biochim. Biophys.Acta655:221-229.
26. Parker,M. L., and F.A.Eiserling. 1983. Bacteriophage SPOl
structureandmorphogenesis. I. Tailstructureandlength regu-lation. J. Virol. 46:239-249.
27. Serysheva, I. I., A. I. Tourkin, S. Y. Venyaminov, and B. F. Poglazov. 1984. On the presence of guanosine phosphate in the tail of bacteriophage T4. J. Mol. Biol. 179:565-569.
28. Shinomiya, T., andS. Shiga. 1979. Bactericidalactivity of the tail of Pseudomonas aeruginosabacteriophage PS17. J. Virol. 32:958-967.
29. Simpson, R. J., M. R. Neuberger, and T.-Y. Liu.1976.Complete amino acid analysis of proteins from a single hydrolysate. J. Biol. Chem. 251:1936-1940.
30. Smith, P. R.,U.Aebi, R.Josephs, and M. Kessel. 1976. Studies of thestructureof theT4bacteriophage tail sheath.J.Mol. Biol. 106:243-275.
31. Stahl, F. W., J. M.Crasemann, C. Yegian, M. M. Stahl, and A. Nakata. 1970. Thelinkage map ofbacteriophage T4. Genetics 64:157-170.
32. Sternberg, M. J. E., and W. R. Taylor. 1984. Modelling the ATP-binding site of oncogene products, theepidermal growth factor receptor and related proteins. FEBS Lett. 175:387-392.
33. To, C. M.,E.Kellenberger,and A.Eisenstark. 1969. Disassem-bly of T-evenbacteriophage into structural parts and subunits. J. Mol. Biol. 46:493-511.
34. Tschopp, J., F. Arisaka, R. van Driel, and J. Engel. 1979. Purification, characterization and reassembly of the
bacterio-phageT4Dtail sheathprotein P18. J. Mol. Biol. 128:281-286.