FASTA-SWAP or FASTA-PAT Search Results
FASTA-SWAP and FASTA-PAT (FASTA-based pattern database search tools) are modified versions of the FASTA sequence database search tool.
See the FASTA-SWAP and FASTA-PAT Help Page for a detailed program description.
Istvan Ladunga, Brent A. Wiese, and Randall F. Smith (1995)
Human Genome Sequencing Center, Baylor College of Medicine, Houston, TX 77030.
FASTA-SWAP searches a protein pattern database
FASTA-SWAP version 1.0, Dec. 1995,
FASTA version 2.0u August, 1995
Please cite: I. Ladunga, B. Wiese & R.F. Smith (1996), submitted
W.R. Pearson & D.J. Lipman PNAS (1988) 85:2444-2448
/tmp/fastapat.seq.6927 : 345 aa
>GI||SP|Q|DBP_HUMAND-BINDINGPROTEIN(DBP)(ALBUMINDB
X-BINDINGPRTEINTAXREBMARPVSDRTPAxxxxxxxxxxxxxxxxxxGLRSLLQGTS
KPKEPASCLLKEKERKxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxx
xxxxxxERTLPFGDVEYVDLDAFxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxx
xxxxxxxxxxxxxxxxGTASGHRAGLTSRDTPSPVDPDTVEVLMTFEPDPADLALSSIPG
HETFDPRRHRFSEEELKPQPIMKKARKIQVPEEQKDEKYWSRRYKNNEAAKRSRDARRLK
ENQISVRAAFLEKENALLRQEVVAVRQELSHYRAVLSRYQAQHGAL
vs PIMA Database library
searching /local/dot5/sl_home/beauty/seqdb/comb-db/pima 11 library
one = represents 12 library sequences
for inset = represents 1 library sequences
z-opt E()
< 20 222 0 :===================
22 38 0 :====
24 30 0 :===
26 27 0 :===
28 26 2 :*==
30 19 10 :*=
32 29 40 :===*
34 38 109 :==== *
36 93 224 :======== *
38 198 370 :================= *
40 294 516 :========================= *
42 468 630 :======================================= *
44 604 695 :=================================================== *
46 707 708 :==========================================================*
48 687 678 :========================================================*=
50 609 619 :===================================================*
52 623 544 :=============================================*======
54 507 465 :======================================*====
56 435 388 :================================*====
58 361 319 :==========================*====
60 331 258 :=====================*======
62 222 207 :=================*=
64 235 165 :=============*======
66 166 130 :==========*===
68 125 102 :========*==
70 126 80 :======*====
72 88 63 :=====*==
74 72 49 :====*=
76 51 38 :===*=
78 51 30 :==*==
80 35 23 :=*=
82 34 18 :=*=
84 25 14 :=*=
86 25 11 :*==
88 9 8 :*
90 17 6 :*=
92 18 5 :*= :====*=============
94 10 4 :* :===*======
96 12 3 :* :==*=========
98 9 2 :* :=*=======
100 8 2 :* :=*======
102 9 1 :* :*========
104 6 1 :* :*=====
106 3 1 :* :*==
108 4 1 :* :*===
110 3 0 := *===
112 2 0 := *==
114 1 0 := *=
116 2 0 := *==
118 2 0 := *==
>120 11 0 := *===========
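The scale stated in the histogram legend (one = per 12 library sequences) can be checked against the rows above. The following is a minimal Python sketch, not the program's own code. It assumes, based only on the rows shown here, that bar lengths are rounded up and that `*` marks the expected count on the same scale. `render_row` is a hypothetical helper name.

```python
import math


def render_row(observed, expected, per_symbol=12):
    """Render one histogram bar: '=' for observed counts, '*' at the expected count.

    Assumption (inferred from the output above, not from the FASTA source):
    both counts are divided by per_symbol and rounded up.
    """
    n_obs = math.ceil(observed / per_symbol)
    n_exp = math.ceil(expected / per_symbol)
    width = max(n_obs, n_exp)
    # Pad with spaces when the expected marker lies beyond the observed bar.
    bar = ["="] * n_obs + [" "] * (width - n_obs)
    if n_exp > 0:
        bar[n_exp - 1] = "*"
    return "".join(bar)


print(render_row(222, 0))    # row "< 20"
print(render_row(26, 2))     # row "28"
print(render_row(687, 678))  # row "48"
```

Under these assumptions the sketch reproduces the bars of the rows it was checked against; the inset (one = per 1 library sequence) would correspond to `per_symbol=1`.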
2169542 positions in 12664 patterns
statistics extrapolated from 7538 to 7538 patterns
Kolmogorov-Smirnov statistic: 0.0977 (N= 29) at 50
results sorted and z-values calculated from opt score
7727 scores better than 1 saved, ktup: 1, variable pamfact
gap penalties: -100,-10
joining threshold: 83, optimization threshold: 50, width: 32
scan time: 0:05:43
The best scores are: opt z-sc E(7538)
6367 d-beta-hydroxybutyrate precursor bdh dehydrogenas 883 154.1 0.00671
10569 alternatively hypothetical 77.5 spliced kd prote 872 148.4 0.01402
8390 lymphoid-restricted membrane protein 791 136.3 0.06589
7147 antigen c-terminal bbg clone 2.1 1.1 749 132.4 0.11
8199 hypothetical bblf2 protein 745 128.9 0.17
8099 embryonic nuclear protein lin-14 form a b1 736 128.0 0.19
8276 hypothetical 128.6 kd protein zk1098.10 in chromo 764 127.5 0.20
5807.2 300 interspersed kd repeat antigen ag231 707 126.4 0.24
1653 probable repa replication-associated replication 712 125.8 0.25
8516 hypothetical surface-layer 125 80k kd protein pre 727 124.1 0.32
4861 triadin kda junction-specific back sarcoplasmic m 675 122.6 0.38
8476 beta-lactamase hypothetical regulatory protein 2 664 118.8 0.63
8410 no title 660 117.8 0.71
8344 len: 393 cai: mitochondrial 0.17 outer membrane 4 663 117.2 0.76
4598 major minor capsid protein 10a 10b 657 117.0 0.78
6791 dtaf tsm1 ii protein 150 gene product 684 114.9 1.03
11380 hypothetical trwc protein 649 113.4 1.24
6096 crtj regulatory repressor protein 640 112.5 1.39
2681 beta-adaptin adaptin protein clathrin complex bet 635 111.0 1.69
4810 5 iif transcription chain alpha factor tfiif subu 631 109.9 1.96
3332 heat heavy shock 70 chain heat-shock binding prot 620 109.9 1.96
4703 dynactin 117 150 kd dynein-associated isoform pro 645 109.5 2.06
8289 epidermal eps8 growth protein factor receptor kin 640 109.1 2.17
8838 h probable dehydrogenase region ltdh methotrexate 597 108.3 2.39
8492 lmp1 lmp2 gene product 612 107.7 2.58
7391 hypothetical lactococcin a in protein secretion l 601 106.2 3.14
3138 defective fc chorion-1 proteins fc106 fc125 fc177 593 105.8 3.29
8548 yopd protein 584 105.8 3.30
5939 69 autoantigen kd p69 594 104.7 3.79
5176 histone-binding nuclear autoantigenic hgv2 protei 605 104.3 4.01
4573 element insertion is421 hypothetical 47 is186 41 580 104.3 4.01
9186 pes4 pab-like protein 580 103.8 4.24
8288 macrogolgin rat gcp360 657 103.7 4.34
8676 alpha-helical coiled coil protein tlpa 575 103.6 4.37
8248 hypothetical orf3 protein 3 579 103.1 4.64
8306 yd9395.16 cdc1 gene len: 491 cai: 0.13 584 103.0 4.73
5970 cfxy cfxyc protein plasmid 560 103.0 4.74
8078 von vwf pre-pro-polypeptide willebrand -22 factor 579 102.9 4.80
8325 hap4 aa transcriptional 1-554 activator 587 102.8 4.87
8291 hypothetical orf 79.4 ykr090w kd protein in prp16 595 102.6 4.95
10618 d2045.2 596 102.4 5.12
8683 gravity gene cdna clone expressed gsc381 in callu 543 101.5 5.70
8272 regulatory protein rim1 584 101.5 5.70
8680 retrovirus-related gag polyprotein homolog transp 571 101.3 5.86
8621 hypothetical lipopolysaccharide core protein 555 101.2 5.94
8469 citrate beta beta-subunit lyase chain acyl subuni 553 101.1 6.05
4596 dna-directed homolog rna polymerase cds kd 30 e4l 547 100.7 6.31
8338 endoglucanase a endo-1 precursor 4-beta-glucanase 555 100.7 6.37
9529 b2 hypothetical protein 513 100.5 6.52
3694 major paraflagellar rod protein component pfr par 569 99.8 7.17
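The rows of the best-scores list above can be read mechanically, because the last three fields are always the opt score, the z-score, and the E-value, even when the description itself contains numbers. The following is a minimal Python sketch. `parse_hit` is a hypothetical helper, and the 0.05 cutoff is an illustrative assumption, not a threshold used by the program.

```python
def parse_hit(line):
    """Split one 'best scores' row into id, description, opt, z-score and E-value."""
    # Split from the right so numbers inside the description are left alone.
    head, opt, z_score, e_value = line.rsplit(None, 3)
    parts = head.split(None, 1)
    return {
        "id": parts[0],  # kept as text: ids such as "5807.2" are not integers
        "description": parts[1] if len(parts) > 1 else "",
        "opt": int(opt),
        "z": float(z_score),
        "e": float(e_value),
    }


rows = [
    "6367 d-beta-hydroxybutyrate precursor bdh dehydrogenas 883 154.1 0.00671",
    "7147 antigen c-terminal bbg clone 2.1 1.1 749 132.4 0.11",
]
hits = [parse_hit(row) for row in rows]

# Illustrative cutoff only (an assumption, not part of the program output).
significant = [hit["id"] for hit in hits if hit["e"] < 0.05]
print(significant)
```

Splitting from the right avoids writing a pattern for the free-text description, which is truncated and irregular in this listing.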
6367 d-beta-hydroxybutyrate precursor bdh dehydrogenas (334 aa)
initn: opt: 883 z-score: 154.1 E(): 0.0067
Smith-Waterman score: 883; 12.5% identity in 288 aa overlap
21.2% noncontradicting positions, 8.7% class identity
10 20 30 40
GI||SP XBINDINGPRTEINTAXREBMAR-----------PVSDRTPAXXXXXXXXXXXXXXXXXX
P ++ RE aR-----------P RT a
6367 lSrLPGKaLSaCDRENGaRrpLLlgpaSFiPdgRRTYaSaAdaaggKAVLVTGCDS
f q t v t ht fyst s it t q epvss
10 20 30 40 50
50 60 70 80 90 100
GI||SP GLRSLLQGTSKPK---EPASCLLKEKERKXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX
G L K--- A CLlKek
6367 GFGFSLAKHLHSKGFLVFAGCLlKdqGdaGVrELDSLnSDRLRTiQLNVcrSEEVEKaVe
m ek hd k k v fn v g
60 70 80 90 100 110
110 120 130 140 150 160
GI||SP XXXXXXXXXXXXXXXXXXXERTLPFGDVEYVDLDAFXXXXXXXXXXXXXXXXXXXXXXXX
FG+VE+ l+ +
6367 dcrfeledPEKGMWGLVNNAGISTFGEVEFTSlETYKqVAEVNLWGTVRmTKSFLPLiRR
tvpsgpkg m e t l
120 130 140 150 160 170
170 180 190 200 210 220
GI||SP XXXXXXXXXXXXXXXXXXXXXXXXXXXXXGTASGHRAGLTSRDTPSPVDPDTVEVLMTFE
A -- L P V VE------
6367 AKGRVVNISSMLGRMANPARSPYCITKFGVEAFSD--CLRYEMhPLGVKVSVVE------
y
180 190 200 210 220
230 240 250 260 270 280
GI||SP PDPAD-LALSSIPGHETFDPRRHRFSEEELKPQPIMKKARKIQVPEE-QKDEKYWSRRYK
--P +-+A +S+ E + eE-- P+ + K K E - K E Y s
6367 --PGNFIAATSLYnPErIQAIAKKMWdE--LPEVVRKDYGKKYFDEKIAKMETYCnSGST
s s e s
230 240 250 260 270 280
290 300 310 320 330 340
GI||SP NNEAAKRSRDARRLKENQISVRAAFLEKENALLRQEVVAVRQELSHYRAVLSRYQAQHGA
+ + + + +
6367 DTSpVInAVTHALTAaTPYTRYHPMDYYWWLRMQiMTHlPGAISDkIYIr
s d t v f m h
290 300 310 320 330 340
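The summary lines of the alignment above (pattern 6367) can be turned into counts. In the summaries shown in this output, the noncontradicting percentage is close to the sum of identity and class identity (12.5 + 8.7 = 21.2 here). Whether the program defines it exactly that way is an assumption. The following is a minimal Python sketch, with `parse_summary` as a hypothetical helper.

```python
import re


def parse_summary(identity_line, class_line):
    """Extract the percentages and overlap length from the two summary lines."""
    identity, overlap = re.search(
        r"([\d.]+)% identity in (\d+) aa overlap", identity_line
    ).groups()
    noncontra, class_id = re.search(
        r"([\d.]+)% noncontradicting positions, ([\d.]+)% class identity",
        class_line,
    ).groups()
    return float(identity), int(overlap), float(noncontra), float(class_id)


identity, overlap, noncontra, class_id = parse_summary(
    "Smith-Waterman score: 883; 12.5% identity in 288 aa overlap",
    "21.2% noncontradicting positions, 8.7% class identity",
)

# Number of identical positions implied by the percentage and overlap length.
identical_positions = round(identity / 100 * overlap)

# Tolerance allows for rounding of the printed percentages.
sums_match = abs(identity + class_id - noncontra) < 0.15
print(identical_positions, sums_match)
```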
10569 alternatively hypothetical 77.5 spliced kd prote (650 aa)
initn: opt: 872 z-score: 148.4 E(): 0.014
Smith-Waterman score: 872; 6.1% identity in 342 aa overlap
20.2% noncontradicting positions, 14.0% class identity
10 20 30 40 50
GI||SP XBINDINGPRTEINTAXREBMARPVSDRTPAXXXXXXXXXXXXXXXXXXGLRSLL
t i a re + r l
10569 WaraDLqnLQRELDAdaieianrqdeSenSRKrLaeqsreFKKnePEdlrnnVakiiKqf
kkf tq tvtvlkdketl lq s itetkk lt eklkq npll sy
10 20 30 40 50 60
60 70 80 90 100 110
GI||SP QG-----TSKPKEPASCLLKEKERKXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX
Qg-----t + Ke ll er
10569 QrEIDaLsqRSKeaEaallnVYerLidaPDPqPALDLGQqLQlklqrLgdIdTdnqeLrE
g n tk fs kvffd kk sev v l ssvek hk e eskk k
70 80 90 100 110 120
120 130 140 150 160
GI||SP XXXXXXXXXXXXXXXERTLPFGDVEYVDLD-AFXXXXXXXXXXXXXXXXXXXXXXXXXXX
E T+ dl+-+
10569 kieelndelAeyanqEVTIKaLKerirdlEQSsaKnqAeriaLaKeQeinndfaEKeRnl
tlsyyekkf kvkdy t sklley tl tl ktlt e t klqstwe g kw
130 140 150 160 170 180
170 180 190 200 210 220
GI||SP XXXXXXXXXXXXXXXXXXXXXXXXXXGTASGHRAGLTSRDTPSPVDPDTVEVLMTFEPDP
t s t d d +E++mT
10569 qErqadllkqLEEAEhnVQeqnkALEakrseniDiegngnEdgdaeanqiEMimTriara
k temsttsk tk slqt ktitklf lktkyd ettqkndek vs dleey
190 200 210 220 230 240
230 240 250 260 270 280
GI||SP ADLALSSIPGHETFDPRRHRFSEEELKPQPIMKKARKIQVPEEQKDEKYWSRRYKNNE-A
A ET r + l + KA +----EQ E - sr k E-a
10569 NQRAElaqrEaETlrariYqlanrneqLagaiaKApDV----EQAIEV-LsraelEtELa
vvtq l tqeql ssekhsle ssqlq t tessk v h
250 260 270 280 290
290 300 310 320 330 340
GI||SP AKRSRDARRLKENQISVRAAFLEKENALLRQEVVAVRQELSHYRAVLSRYQAQHGAL
AK a enq -- A leke + q+Ls A Y + l
10569 AKEreiaQLeednarL--qASleqeRensahaInqLeqQLnakVAESESYnSeLeqlrrK
lkln vsevql s ytkl kstssq se ke ssv k t ktvee
300 310 320 330 340 350
GI||SP
10569 LnnqaDYneiKeELnaLKkiEFapnEdagdnDaaSEDKNDnplEslLLeaNrkLQaenAa
kgys ekv k si sm gvs gdstq ir ktf vs sk ks stl e
360 370 380 390 400 410
8390 lymphoid-restricted membrane protein (534 aa)
initn: opt: 791 z-score: 136.3 E(): 0.066
Smith-Waterman score: 791; 11.5% identity in 322 aa overlap
18.9% noncontradicting positions, 7.5% class identity
10 20 30 40 50
GI||SP XBINDINGPRTEINTAXREBMARPVSDRTPAXXXXXXXXXXXXXXXXXXGLR-----SLL
++ P sd +p G -----Sll
8390 EpeDGALDVkRqcqCPgPTedpilGqnLldCiRMNdDqSmdENGaerfcpESll
vk t ghk l sgssp te sg t e p te vghvys ps
10 20 30 40 50
60 70 80 90 100 110
GI||SP QGTSKPKEPASCLLKEKERKXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX
Q P s e
8390 QlReYlsqPlprqTSSsdgTiTSSdpGldILnMAScDLDrnpLCeKEEdaRaASamIEAQ
s g stl sseh tes v es se h g cks k et s pt
60 70 80 90 100 110
120 130 140 150 160 170
GI||SP XXXXXXXXXXERTLPFGDVEYVDLDAFXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX
-----fgD vd -A
8390 GTSlAhDNaA-----fqDsTSkdV-AKaalnLEAgEElrTiEnggKehApGdseiSmlPk
p p i yg y vg tisq k pe t ehk gs s etvv pp v
120 130 140 150 160
180 190 200 210 220 230
GI||SP XXXXXXXXXXXXXXXXXXXXGTASGHRAGLTSRDTPSPVDPDTVEVLMTFEPDPADLA--
A R L +----- D T E + E DlA--
8390 asVKlVNfrQSENTSANEKEVEAEFLRLSLGlK-----CDWFTLEKRVKLEERSRDlAEE
tt s vq f w
170 180 190 200 210 220
240 250 260 270 280
GI||SP ------LSSIPGHETFDPRRHRFSEEELKPQPIMKKARKIQVPEEQKDEKYWSRRYKNNE
------ +s+ E+ P---- Ee+ + Q I+KK K v Q + SR
8390 NLKKEITNcLKLLESLTP----LCEdDNQAQEIiKKLEKSIklLSQCaARVASRAEMLGA
s e v vf t
230 240 250 260 270
290 300 310 320 330 340
GI||SP AAKRSRDARRLK------ENQISVRAAFLEKENALLRQEVVAVRQELSHYRAVLSRYQAQ
+ SR +r ------EN + A----KE A L a Q + Q
8390 INQESRVSrAVEVMIQHVENLKRMYA----KEHAELEdLKQaLLQNdRSFNpLeDdDDCQ
k e v e s p e
280 290 300 310 320 330
GI||SP HGAL
8390 IKKRSaSLNSKPSSLRRVTIASLPRNiGNaGlVaGMENNDRFSRRSSSWRILGsKQgEHR
s l v m s t s
340 350 360 370 380 390
7147 antigen c-terminal bbg clone 2.1 1.1 (323 aa)
initn: opt: 749 z-score: 132.4 E(): 0.11
Smith-Waterman score: 749; 26.8% identity in 123 aa overlap
29.3% noncontradicting positions, 2.4% class identity
190 200 210 220 230
GI||SP XXXXXXXXXXXXXXXGTASGHRAGLTSRDTPSPVDPDTVEVLMTFEPDPA---DLALSSI
PS-VD + -E m E +P ---+ S I
7147 fEGEGEQSVIPEAEPsfEGEGEQSVIPEAEPS-VDGEG-EQSmIPEAEPTiEGEGEQSVI
v tv v f
180 190 200 210 220 230
240 250 260 270 280 290
GI||SP PGHETFDPRRHRFSEEELKPQ--PIMKKARKIQVPEEQKDEKYWSRRYKNNEAAKRSRDA
P E---P EE P+--P + ++----PE EK---- K AAK +R +
7147 PEaE---PSVEPAGEEPVIPEAEPSVEPVK----PEVDDIEK----PVKVAKAAKVARSV
v
240 250 260 270 280
300 310 320 330 340
GI||SP RRLKENQISV-RAAFLEKENALLRQEVVAVRQELSHYRAVLSRYQAQHGAL
+ K v-r A KE L +Q+ QE +
7147 KAAKKAAKKlarKARqrKErKLKKQQEEQAQQESAEq
vsk kk k h
290 300 310 320 330
8199 hypothetical bblf2 protein (521 aa)
initn: opt: 745 z-score: 128.9 E(): 0.17
Smith-Waterman score: 745; 18.4% identity in 136 aa overlap
20.6% noncontradicting positions, 2.2% class identity
100 110 120 130 140 150
GI||SP XXXXXXXXXXXXXXXXXXXXXXXXXXXXXERTLPFGDVEYVDLDAFXXXXXXXXXXXXXX
R L VE F
8199 ELRRSGGLIAMLADAAEKDLFDLSFRTRDRRLLSAARVEDEQGLIFQPLFPAQVVCQSCS
100 110 120 130 140 150
160 170 180 190 200 210
GI||SP XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXGTASGHRAGLTSRDTPSPVDP
R + T +P DP
8199 GDDGRDQQPPPVDGFGSEMEGEQTCPHAQRHSESPGQLDVYIRTPRGDVFTYSTETPDDP
160 170 180 190 200 210
220 230 240 250 260 270
GI||SP DTV---EVL--MTFEPDPADLALSSIPGHETFDPRRHRFSEEELKPQPIMKKARKIQVPE
V---++L--+T+E---+DL+ S D RRHR S L P
8199 SPVPFRDILRPVTYE---VDLVSSDGATGRGGDARRHRVSLKILEPAGGFESWLVNSWSM
220 230 240 250 260 270
280 290 300 310 320 330
GI||SP EQKDEKYWSRRYKNNEAAKRSRDARRLKENQISVRAAFLEKENALLRQEVVAVRQELSHY
R + A + + +
8199 AGGGLYAFLRSIYASCYANHRGTKPIFYLLDPELCPGGSDFQPYVPGFPFLPIHYVGRAR
280 290 300 310 320 330
8099 embryonic nuclear protein lin-14 form a b1 (475 aa)
initn: opt: 736 z-score: 128.0 E(): 0.19
Smith-Waterman score: 879; 22.2% identity in 221 aa overlap
23.5% noncontradicting positions, 1.4% class identity
110 120 130 140 150
GI||SP XXXXXXXXXXXXXXXXXXXXXXXXERTLPFGDVEYVDL--DAFXXXXXXXXXXXXXXXXX
+ GD + +D --D
8099 QGTDDQTVKWIGPSSVDSNGQKTDSSAASAGDNQNIDVIGDGSESPTSSNHSAQEIALMT
120 130 140 150 160 170
160 170 180 190 200 210
GI||SP XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXGTASG-HRAGLTSRDTPSPVDPDT
GT +G- +AG R PV+ D
8099 SQQTFLNALKDSSFLFTNPVPTVETAPPLRVAPPINGTTNGTAKAGGPERKPRKPVNDDI
180 190 200 210 220 230
220 230 240 250 260
GI||SP V----------EVLMTFEPDPADLALSSIPGHETFDPRRHRFSEEELKPQPIM-KKARKI
V----------E + FE -P+ A++S P---TF P---- SE+++ Q I -KK +
8099 VKIVRNQDLSEENISMFEI-PVPKAIASDP---TFRP----VSEQQIIQQIIQGKKYEEM
240 250 260 270 280
270 280 290 300 310 320
GI||SP QVPE--EQKDEKYWSRRYKNNEAAKRSRDARRLKENQISVRAAFLEKEN-ALLRQEVVAV
+V E-- Q K ---------A KR R +--+Q +V--A L N-A L +
8099 EVGECMIQLCKKL---------AEKRVFGPRLM--SQTTV--AGLNHSNYANLPIKGICY
290 300 310 320 330
330 340
GI||SP RQELSHYRAVLSRYQAQHGAL
Q --R VL
8099 IQHVC--RKVLYDKFENEEDFWDKFREAMRKLAARCRRVRHAKKTKHNREEAQAEMLSKR
340 350 360 370 380 390
8276 hypothetical 128.6 kd protein zk1098.10 in chromo (1120 aa)
initn: opt: 764 z-score: 127.5 E(): 0.2
Smith-Waterman score: 902; 16.9% identity in 326 aa overlap
18.7% noncontradicting positions, 1.8% class identity
30 40 50 60 70 80
GI||SP VSDRTPAXXXXXXXXXXXXXXXXXXGLRSLLQGTSKPKEPASCLLKEKERKXXXXXXXXX
V+ LQ-T +E A--L K+ E K
8276 VNVLEALDLAYLERDEQTAELEMLKEDNEQLQ-TQYEREKA--LRKQTEQKYIEIEDTLI
40 50 60 70 80 90
90 100 110 120 130 140
GI||SP XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXERTLPFGDVEYVDLDAFXX
--E L-----E+ L
8276 GQNKELDKKIESLESIMRMLELKAKNATDHASRLEEREV--EQKL-----EFDRLHERYN
100 110 120 130 140
150 160 170 180 190 200
GI||SP XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXGTA---SGH
G +---S H
8276 TLLRTHVDHMERTKYLMGSEKFELMQNMPLPNMQLRNKMGMAASVDASSIRGVSDLISAH
150 160 170 180 190 200
210 220 230 240 250
GI||SP RAGLTSRDTPSPVDPDTVEVLMTF----EPDPADLALSSIPGHETFDPRRHRFSEEELKP
T+ D + F----EP P D+ SS ------D + E--P
8276 MTQSTTMDVNLANHITNEDWQDEFSSDIEPSPRDIPQSSA------DALTSPITTKE--P
210 220 230 240 250
260 270 280 290
GI||SP QPIMKKARKIQVPEEQKDEKYWSRRYKNN---------------EAAKRSRDA----RRL
P A Q EE+ DE NN---------------E A D ---- R
8276 TPKREAASPKQSEEEEADETTSVDPKENNDLLGADLTGNLVDPAEFASAVNDTFIGMGRE
260 270 280 290 300 310
300 310 320 330 340
GI||SP KENQISVRAAFLEKENAL-------------LRQEVVAVRQELSHYRAVLSRYQAQHGAL
EN I + L+ NAL-------------L E + R E V + Q Q
8276 VENLIKENSELLDMKNALNIVKNDLINQVDELNSENMILRDENLSRQMVSEKMQEQITKH
320 330 340 350 360 370
GI||SP
8276 EEEIKTLKQKLMEKENEQEEDDVPMAMRKRFTRSEMQRVLMDRNAYKEKLMELEESIKWT
380 390 400 410 420 430
5807.2 300 interspersed kd repeat antigen ag231 (283 aa)
initn: opt: 707 z-score: 126.4 E(): 0.24
Smith-Waterman score: 887; 26.4% identity in 129 aa overlap
26.4% noncontradicting positions, 0.0% class identity
180 190 200 210 220 230
GI||SP XXXXXXXXXXXXXXXXGTASGHRAGLTSRDTPSPVDPDTVEVLMTFEPDPADLALSSIPG
T T PV-- T E T EP ++ + G
5807.2 IEEPVTTQEPVTIEEPVTTQEPVTTQEPVTTQEPV--TTQEPVTTQEPVTVEEHIDEKKG
30 40 50 60 70 80
240 250 260 270 280 290
GI||SP HETFDPRRHRFSEE-ELKPQPIMKKARKIQVPEEQKDEKYWSRRYKNN-EAAKRSRDARR
E + SEE-E K + KK+ ++ K++K K +-E++K + D +
5807.2 SEGDNISLSSLSEETEEKSHTKKKKSSWLKFGRGNKNDKKSKNEKKPSLESVKQNADEQK
90 100 110 120 130 140
300 310 320 330 340
GI||SP LK--ENQISVRAAFLEKENALLRQEVVAVR-----QELSHYRAVLSRYQAQHGAL
+--++QISV A-----+++ QE A -----QEL+ + +
5807.2 EQPTDSQISVNA-----QDSVTIQEPTATQEPPTTQELTATQEPTTTQETVTEQEPTTTQ
150 160 170 180 190
GI||SP
5807.2 ETVTAQEPITTQEPVTAQEPVTTQELIATQEPSTTQEHADEKKASEGDNISLSRLSEETE
200 210 220 230 240 250
1653 probable repa replication-associated replication (357 aa)
initn: opt: 712 z-score: 125.8 E(): 0.25
Smith-Waterman score: 717; 6.8% identity in 280 aa overlap
28.2% noncontradicting positions, 21.4% class identity
30 40 50 60 70 80
GI||SP VSDRTPAXXXXXXXXXXXXXXXXXXGLRSLLQGT-SKPKEPA-SCLLKEKERKXXXXXXX
+ +q T-sk E A-SC+ e k
1653 nnqerncaindiekRK VaEHNalIqSiAKMdKTalqMFELAVSCInTdalPennaifLl
eekkqlqqlqelss v dk s m q psk d enp kdhivy s
kk vvlt s t v e t
p k
x
10 20 30 40 50 60
90 100 110 120 130 140
GI||SP XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX-ERTLPFGDVEYVDLDA
-er lP+ Ve d d
1653 KrdLFaFFdVddadKhrrFKqAianMQeQAfFrIranaarGiemrrIlPiPtVeWadYnD
ee k e ssns tsq e vnl k y n qedqnl fkfen v y y k ns d
k s k s t ek q ksekdk y yks t h
s t f e kvey t
x x x x x
70 80 90 100 110 120
150 160 170 180 190 200
GI||SP FXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXGTASGH
G
1653 dVlIrFnraIMPYLInLnanFsqhaiSdiaeLNSKYSiILYrWlaMnYnQfEHYqaK Ga
e k q dqd d kne tkykl elqk l k fs q s y sn n
k m e hee e q k s i g
t sph e m y
x x x
130 140 150 160 170 180
210 220 230 240
GI||SP RAGLTSRDTPSPVDP---------DTVEVLMTF---EPDPADLALSSIPGHETFDPRRHR
R d sP p---------DTv F---e d al I h F ----
1653 RraaQlEaYrnPrIiirdLRdeiTDTindhrrFdrlnrriiKnaidEInanThFnV----
tee v n kd p kmke eil mdeyqq qnfendvl dple tdh s k
kk d s s pvs vfm vks kh ph hw esvk qf
v s t s t p ts ky i v e
x x x x x
190 200 210 220 230 240
250 260 270 280 290 300
GI||SP FSEEELKPQPIMKKARKIQVPEEQKDEKYW--SRRYKNNEAAKRSRDARRLKENQISVRA
f ee k i I kDe Y -- r y eaak dar lk +
1653 elreiqaraainhIqFHIeKKaradDnnYKrnnraaqdaeaanaradarlna qalankf
fydkkkkgrslds v t rnwk es lddqdyiedkerkeqnqndvll esmdspy
s e s gt i v k e kq lg kq sekedq yk v vq
t k v m g vt kt te t kl vt e
x x x x x
250 260 270 280 290 300
310 320 330 340
GI||SP AFLEKENALLRQEVVAVRQELSHYRAVLSRYQAQHGAL
l en Ll a l l + l
1653 TrlLinnmLigandianqaiiaeLarnlYPlYdelkderGenaledHldYiarK
kk lehf lfmlelmdidlllg qesv k hksvell idgvkk ms vrd
m m ss spt mt kktmv k v kfm l t ss
s yyy f p y
x
310 320 330 340 350
8516 hypothetical surface-layer 125 80k kd protein pre (717 aa)
initn: opt: 727 z-score: 124.1 E(): 0.32
Smith-Waterman score: 755; 13.1% identity in 336 aa overlap
21.4% noncontradicting positions, 8.3% class identity
10 20 30
GI||SP XBINDINGPRTEINTAXREBMARPVSDRTPAXXXXXXXX
+ G +t I T--------P s t +
8516 LqEplnriSaTNFTLDGKAYFGNVVMGAGNKsVILT--------PYssSaLSlGDHKLTV
s svenl s t tt t v
10 20 30 40 50
40 50 60 70 80 90
GI||SP XXXXXXXXXXGLRSLLQGTSKPKEPASCLLKEKERKXXXXXXXXXXXXXXXXXXXXXXXX
L S + t A +
8516 SgaKDfAeFVSLNSTHEFkVVEDKEAPTikEATATLETVTLTFSEDiDMDTVKASNVYWK
vv y g t vt v
60 70 80 90 100 110
100 110 120 130 140 150
GI||SP XXXXXXXXXXXXXXXXXXXXXXXXXXERTLPFGDVE-YV-DLDAFXXXXXXXXXXXXXXX
E+TLP G V+-YV-D+ +
8516 SGDSKKEASEFERIADNKYKFVFKGaEKTLPTGKVDVYVEDiKDYSDNKIAKDTKVTVTP
s v
120 130 140 150 160 170
160 170 180 190 200 210
GI||SP XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXGTASGHRAGLTSRDTPSPVDPD
T S D + D
8516 EIDQTRPEVRKVTalDEKTIKVTFSKTVDgEsAeKaGNYTikDKDdKVVSVDKVTVDSKD
sv k t i t vt g
180 190 200 210 220 230
220 230 240 250 260 270
GI||SP TVEVLMTFEPDPADLALSSIPGHETFDPRRHRFSEEELKPQPIM-----KKARKIQVPEE
+ V++-------DL G T + +---+ K M-----K R +--E
8516 SKSVII-------DLYSKVSVGENTITIKNVK---DATKLNNTMLDYTGKFTRSDK--EG
240 250 260 270 280
280 290 300 310 320 330
GI||SP QKDEKYWSRRYKNNEAAKRSRDARRLKENQISVRAAFLEKENALLRQEVVAVRQELSHYR
k E------ N AK + +--LK n+ A+ + N L r--+ Q Ls
8516 PdfE------hVINADAKAKKVV--LKFnKKMDAASLADsSNYLVr--IndTLQTLsdDV
ky t d y k dg te
290 300 310 320 330
340
GI||SP AVLSRYQAQHGAL
A LS +
8516 ATLSVSNDATVVTITFAETIKGnDVVFAsGKaISGSGKaNVnELQVlGVKDTSGNVHdKF
d t t v h m k
340 350 360 370 380 390
4861 triadin kda junction-specific back sarcoplasmic m (220 aa)
initn: opt: 675 z-score: 122.6 E(): 0.38
Smith-Waterman score: 675; 10.8% identity in 195 aa overlap
15.4% noncontradicting positions, 4.6% class identity
10 20 30 40 50 60
GI||SP XBINDINGPRTEINTAXREBMARPVSDRTPAXXXXXXXXXXXXXXXXXXGLRSLLQGTSK
+ +A E A+ +R - ++ Q T K
4861 EKHEEPAKSTKKEHAAPSEKQAKAeIERK-EEVSAASTKKAVPAKKEEKTTKTVEQETRK
k
10 20 30 40 50
70 80 90 100 110
GI||SP PK-EPASCLLKEKE-RKXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX
K- S+ LK+KE- K
4861 EKPGKISSVLKDKELTKEKEVKVPASLKEKGSETKKDEKTSKPEPQIKKEEKPGKEVKPK
60 70 80 90 100 110
120 130 140 150 160 170
GI||SP XXXXXXXERTLPFGDVEYVDLDAFXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX
P D+ + A
4861 PPQPQIKKEEKPEQDIMKPEKTALHGKPEEKVLKQVKAVTTEKHVKPKPAKKAEHQEKEP
120 130 140 150 160 170
180 190 200 210 220 230
GI||SP XXXXXXXXXXXXXXXXXGTASGHRAGLTSRDTPSPVDPDTVEVLMTFEPDPADLALSSIP
T SG + S
4861 PSIKTdKPKSTSKGMPEVTESGKKKIEKSEKEIKVPARRES
e
180 190 200 210 220 230
8476 beta-lactamase hypothetical regulatory protein 2 (312 aa)
initn: opt: 664 z-score: 118.8 E(): 0.63
Smith-Waterman score: 761; 16.0% identity in 318 aa overlap
19.5% noncontradicting positions, 3.5% class identity
10 20 30 40 50
GI||SP XBINDINGPRTEINTAXREBMARPVSDRTPAXXXXXXXXXXXXXXXXXXGLR--SLLQGT
E + R D G+ --+ L +
8476 MLNSESLLRELRDALHEGGLTGSFLVRDLYTGEELGIDPDTELPTA
10 20 30 40
60 70 80 90 100 110
GI||SP SKPKEPASCLLKEKERKXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX
S K P + E+ R
8476 SLVKLPLALATLERIRLGEVDGAQQIEVAPGRITTPGPTGLSRFRHPARVAVDDLLYLST
50 60 70 80 90 100
120 130 140 150 160 170
GI||SP XXXXXXX-----ERTLPFGDVEYVDLDAFXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX
-----E T P + V F
8476 SVSDGTASDALFEITPPAQVEQMVREWGFRDLTVRHSMRELSETPAERFESADAHLAHAL
110 120 130 140 150 160
180 190 200 210
GI||SP XXXXXXXXXXXXXXXXXXXXXX-GTA---------------SGHRAGLTSRDTPSPVDPD
-GTA---------------+G R G TSR P-P
8476 AISAGTSGRGHRVPQLDVARANTGTARAFVDLLEALWAPVLTGPRPGRTSRALP-PePAA
k
170 180 190 200 210 220
220 230 240 250 260
GI||SP TVEVLMT-------FEPDPADLAL--SSIPGHETFDPRRHRFSEEELKPQPIMKKA--RK
LM+------- PD A A --SS G--T RH E + A--
8476 RLRELMAANLLRHRLAPDFASDAATWSSKTG--TLLNLRHEVGVVEHADGQVFAVAVLTE
230 240 250 260 270 280
270 280 290 300 310 320
GI||SP IQVPEEQK--DEKYWSRRYKNNEAAKRSRDARRLKENQISVRAAFLEKENALLRQEVVAV
QVP + +-- E +------++A+R RD--RL+E
8476 SQVPADSQPGAEALMA------QVARRLRD--RLREW
290 300 310 320 330
330 340
GI||SP RQELSHYRAVLSRYQAQHGAL
8476
340 350
8410 no title (330 aa)
initn: opt: 660 z-score: 117.8 E(): 0.71
Smith-Waterman score: 750; 26.7% identity in 116 aa overlap
26.7% noncontradicting positions, 0.0% class identity
190 200 210 220 230 240
GI||SP XXXXXXXXXGTASGHRAGLTSRDTPSPVDPDTVEVLMTFEPDPADLALSSIPGHET--FD
+ D VE L -----P D S P E --F
8410 LDYADFADDSEEIKDEDVDHQTSDLENNNNDKVEGLA-----PKDQTTSYEPVDEVPEFI
180 190 200 210 220
250 260 270 280 290 300
GI||SP PRRHRFSEEELKPQPIMKKARKIQVPEEQ--KDEKYWSRRYKNNE-AAKRSRDARRLKEN
+EEE---Q + K I E+Q--K E +R +E-AA + +
8410 DDADSVNEEE---QTVDKNEDAITKDEQQVVKKEVDLTRPSAPSEPAAAEHKSYTKDELT
230 240 250 260 270 280
310 320 330 340
GI||SP QISVRAAFLEKENALLRQEVVAVRQELSHYRAVLSRYQAQHGAL
+I RA+ +E+ L + + A E
8410 KIMDRASKIEQIQKLAKYAISALNYEDLPTAKDELTKALDLLNSI
290 300 310 320 330
8344 len: 393 cai: mitochondrial 0.17 outer membrane 4 (393 aa)
initn: opt: 663 z-score: 117.2 E(): 0.76
Smith-Waterman score: 664; 15.1% identity in 345 aa overlap
16.5% noncontradicting positions, 1.4% class identity
10 20 30
GI||SP XBINDINGPRTEINTAXREBMARPVSDRTPAXXXXXXXX
G R E N + S- +Pa
8344 MSSRIIVGSAALAAAITASIMVREQKAKGQRREGNVSAYYNGQEYGS-SAPaQLGKLHNI
p
10 20 30 40 50
40 50 60 70 80 90
GI||SP XXXXXXXXXXGLRSLLQGTSKPKEPASCLLKEKERKXXXXXXXXXXXXXXXXXXXXXXXX
+LL + K +E A -----K K
8344 KQGIKEDALSLKDALLGVSQKAREEAP-----KVTKRVISPEEDAQTRKQLGQKAKDSSS
60 70 80 90 100 110
100 110 120 130 140 150
GI||SP XXXXXXXXXXXXXXXXXXXXXXXXXXERTLPFGDVEYVDLDAFXXXXXXXXXXXXXXXXX
R---F --E VD +
8344 QSIFNWGFSEAERRKAIAIGEFDTAKKR---FE--EAVDRNEKELLSTVMREKKAALDRA
120 130 140 150 160
160 170 180 190 200 210
GI||SP XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXGTASGHRAGLTSRDTPSPVDPDTV
T ----------------D +T
8344 SIEYERYGRARDFNELSDKLDQQERNSNPLKRLLKNNTG----------------DANTE
170 180 190 200 210
220 230 240 250 260 270
GI||SP EVLMTFEPDPADLALSSIPGHETFDPRRHRFSEEELKPQPIMKKARKIQVPEEQKDEKYW
E D A --G E + + S E Q + +KI+-------EK W
8344 EAAARSVQGWGDTAQEF--GREELEEAKRNASSEPSEAQKRLDELKKIK-------EKGW
220 230 240 250 260
280 290 300 310 320
GI||SP SRRYK--NNEAAKRSRDARRLKENQISVRAAFLEK----------ENA--LLRQEVVAVR
K-- +E R AR L +--AA L K----------EN+-- L + V
8344 FGYNKGEQSEQQIAERVARGLEGWGET--AAQLSKDEMDDLRWNYENSKKQLDKNVSDAM
270 280 290 300 310 320
330 340
GI||SP QELSHYRAVLSRYQAQHGAL
LS + L Y ++ +
8344 DSLSKAKEDLKQYGSHWWSGWTSKVDNDKQALKDEAQKKYDEALKKYDEAKNKFKEWNDK
330 340 350 360 370 380
4598 major minor capsid protein 10a 10b (344 aa)
initn: opt: 657 z-score: 117.0 E(): 0.78
Smith-Waterman score: 657; 13.8% identity in 305 aa overlap
21.0% noncontradicting positions, 7.2% class identity
30 40 50 60 70 80
GI||SP MARPVSDRTPAXXXXXXXXXXXXXXXXXXGLRSLLQGTSKPKEPASCLLKEKERKXXXXX
+ lRS+ G S------+ k
4598 GDKLALFLKVFGGEVLTAFARTSVTmpRHMlRSIaSGKS------AQFPViGRTqAAYLa
ts v s l k k
30 40 50 60 70
90 100 110 120 130 140
GI||SP XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXERTLPFGDVEYVDLD
E T G--E +
4598 PGENLDDKRKDIKHTEKVIhIDGLLTADVLIYDIEDAMNHYDVRaEYTaQLG--ESLAMA
t s s
80 90 100 110 120 130
150 160 170 180 190 200
GI||SP AFXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXGTASG
A T +-
4598 ADGAVLAEiAGLcNledgsNENIEGLGkaTVieltqPnkaaLTDqVaLGKaIIAaLTiA-
l v vpsky tp lttvk ttgs p e e q k
140 150 160 170 180 190
210 220 230 240 250
GI||SP HRAGLTSRDTPSP-----VDPDT-VEVLMTFEPDPADLALSSIPGHETFDPRRHRFSEEE
-RA LT P+ ----- dPD+- +L + P+ A+ a P t --R E
4598 -RAaLTKNYVPAADRtFYcdPDnYSAILAALMPNAANYaALiDPErGsI--RNVMGFEVV
s v tt s q l k t
200 210 220 230 240
260 270 280 290 300 310
GI||SP LKPQPIMKKARKIQ--VPEEQKDEKYWSRRYKNNEAAKRSRDARRLKENQISVRAAFLEK
P+ A -- P QK + n +Ak --++ L +V + L-k
4598 EVPHLTAGGAGdaREdaPadQKHaFPAnkgeGnVKVAlD--NViGLFqHRSAVGTVKL-r
tt gt tg v tsst t k v m k
250 260 270 280 290 300
320 330 340
GI||SP ENALLRQEVVAVRQELSHYRA--VLSRYQAQHGAL
+ AL R----A R---++y A--++++Y HG L
4598 DLALER----ARR---ANfQADQIIAKYAMGHGGLRPEAAGAiVl
y v f
310 320 330 340
6791 dtaf tsm1 ii protein 150 gene product (1039 aa)
initn: opt: 684 z-score: 114.9 E(): 1
Smith-Waterman score: 684; 20.1% identity in 134 aa overlap
25.4% noncontradicting positions, 5.2% class identity
190 200 210 220 230 240
GI||SP XXXXXXXXXXXXXXGTASGHRAGLTSRDTPSPVDPDTVEVLMTFEPDPADLALSSIPGH-
a l s dp T e t + s + G+-
6791 STDdepqlnnshmfnncicnSarlWfPCVDlladkcTWrLEFsVdrnmkaigcgeLiGQN
ektwmwsvytstgeyes ssy v sfdeps e t pklvtnvstsk l
190 200 210 220 230 240
250 260 270 280 290
GI||SP ------ETFDPRRHRFSEEELKPQPIMK---KARKIQVPEEQKDEKYWSRRYKNNEAAKR
------E D H - EEE KP ++K---K ++ EE K+ K--S+ ++N+
6791 GEESEKEKEDTPEHD-EEEEGKPARVIKDEDKDSNLKNDEEGKNSK--SKDAQDNDEEEE
250 260 270 280 290
300 310 320 330 340
GI||SP SRDARRLKENQISVRAAFLEKENALLRQEVV-----AVRQELSHYRAVLSRYQAQHGAL
++ e R E N LR +V-----+ +EL H + +
6791 EGESDEEEeEGEEERRNIEESNNPSLRDVIVCCSEYSNIKELPHPIDLTKKKCIFQIINP
g
300 310 320 330 340 350
GI||SP
6791 VAPHHIGWAIGAFNSWSLPLesimspDardeteedklrenVcannnaladddieididPh
ivpptv lekkvfhysvstp vdpvidtmvgqfgsyvi i
360 370 380 390 400 410
11380 hypothetical trwc protein (508 aa)
initn: opt: 649 z-score: 113.4 E(): 1.2
Smith-Waterman score: 763; 4.2% identity in 262 aa overlap
24.4% noncontradicting positions, 20.2% class identity
10 20 30 40 50
GI||SP XBINDINGPRTEINTAXREBMARPVSDRTPAXXXXXXXXXXXXXXXXXXGLRS-
v d Pa lR -
11380 nRdmqQlqkiaEKAKnarcaliGDkAQllaiEaGrPadIAYqlraAdiaTAhMrEiqRQK
t lfe tlslv gchvvym t tksv d k fe lsqe gmq t s vl
150 160 170 180 190 200
60 70 80 90
GI||SP ----------LLQGTSKPKEPASCLLK------EKERKXXXXXXXXXXXXXXXXXXXXXX
----------L+ T As Lk------E e
11380 nPELKKIaqELMMSTPaaadrAlsqLerngdViEIenhhdRraaISDSVQKiAEaYcALk
d tv esvgk sts kqikw t kssve kgp v h i s
210 220 230 240 250 260
100 110 120 130 140 150
GI||SP XXXXXXXXXXXXXXXXXXXXXXXXXXXXERTLPFGDVEYVDLDAFXXXXXXXXXXXXXXX
e G e vd
11380 ldERdrTlIaaaTNEaRreINqAIRiiRqGlGeaGqGeefdTllrmDsRLTdAErrHSkN
pe tn v vsg n qt e vv e k tl k ifvt tvlv t q lh p
270 280 290 300 310 320
160 170 180 190 200 210
GI||SP XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXGTASGHRAGLTSRDTPSPVDPD
g g+r l P ---
11380 YqVGdVirlnrdYakTGLQRGELYRVseTnhdnrlliiedgDgQrKVinldlMPkr---e
t h vqpenq lt vk gpgknttvlgeh k n lqfsp th t
330 340 350 360 370 380
220 230 240 250 260 270
GI||SP TVEVLMTFEPDPADLALSSIPGHETFDPRRHRFSEEELKPQPIMKKARKIQVPEEQKDEK
tvev---f p+ a+la s i D r + e lk + K rki v + +
11380 kiel---fqrEraeiasgDiiriTrnDKerdLaagdrlrVtaVNKadrkiTaldGKrEHL
tvsv yhp tthlqvs tlkw ks hlg vnhesmk vh eehtv vts s
390 400 410 420 430 440
280 290 300 310 320 330
GI||SP YWSRRYKNNEAAKRSRDARRLKENQISVRAAFLEKENALLRQEVVAVRQELSHYRAVLSR
n + + + + + + Ra
11380 nsdLnqdqnqHiDhaYAsTVHglQGaTadriLILLDaHaelrnTrrDllYVAiiRarhqa
sve ptpkpl v yn t ss l sqsv s nssst mk vy vs stfev
450 460 470 480 490 500
6096 crtj regulatory repressor protein (459 aa)
initn: opt: 640 z-score: 112.5 E(): 1.4
Smith-Waterman score: 640; 8.3% identity in 339 aa overlap
23.3% noncontradicting positions, 15.0% class identity
10 20 30 40
GI||SP XBINDINGPRTEINTAXREBMARP---VSDRTPAXXXXXXXXXXXXX
+ rebMa P--- r a
6096 aLqrlaPDLlaDiiasAaDIaLlVSqerVVreVMaNPqhgSaerlaaWqGarLeqllsaE
s psvs vr lvtt c s v pgg es v hfp fgqfse e rp sevftp
10 20 30 40 50 60
50 60 70 80 90
GI||SP XXXXXGLRSLLQGTSKPKEP-----ASCLLKEKERKXXXXXXXXXXXXXXXXXXXXXXXX
lR-L G ----EP-----a L r
6096 SaaKlrnR-LadGl----EPGRGSlalELnHaDaraFelPiRYiihRlgaDrsiLliGRD
vq fel se p vqv t i pds tf v tlt spe gtl ml
70 80 90 100 110
100 110 120 130 140 150
GI||SP XXXXXXXXXXXXXXXXXXXXXXXXXXERTLPFGDVEYVDLDAFXXXXXXXXXXXXXXXXX
E + v v d
6096 lrPiAEVQQQLVaAQLAlERDYEaQREiETRYRVlLdahraPlliVSMSTGRIaDLNlAA
mq l k m t m v evspd mvl v s
120 130 140 150 160 170
160 170 180 190 200 210
GI||SP XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXGTASGHRAGLTSRDTPSPVDPDTV
gt s + ltsR + v---tV
6096 aaliGatRadLidAaiaQEldGRRRGEFlEnlaniAasdpaaaVEllaRRSrrrl---lV
glml gv qe lg pvg fe m tmtkl gteslgp vti qkkv t
s
180 190 200 210 220 230
220 230 240 250 260 270
GI||SP EVLMTFEPDPADLALSSIPGHETFDPRRHRFSEEELKPQPIMKKARKIQVPEEQKDEKYW
-v F L L i e+ pr sE--l k i ----D
6096 -tarlFRAAGdRLLLCrideAdArrprgDdlsE--nlaRLfheGiDaiVFl----DADGT
vptv e qlgp e tqtvv etv lse ylk v gm s
240 250 260 270 280
280 290 300 310 320 330
GI||SP SRRYKNNEAAKRSRDARRLKENQISVRAAFLEKENALLRQEVVAVRQ--ELSHYRAVLSR
R -- NeA Da l q A FL + + Lr + Vr -- L hY L+
6096 IRa--ANdAFLnlTDagSaAairGRSiADFLaRGaVDLrVLiDnVrRiGqLRhYaTRLnT
g e ym ss l lvq f s s n l s k t h l v t
290 300 310 320 330 340
340
GI||SP YQAQHGAL
a + a
6096 DFaGQiaaEiSATlldDRarPliaLViRDsnrADamRRPimaggainEgaRNVMqlVGna
s vtv l wfh et tlv v tsl tt vpptmvsd pl em ys
350 360 370 380 390 400
2681 beta-adaptin adaptin protein clathrin complex bet (519 aa)
initn: opt: 635 z-score: 111.0 E(): 1.7
Smith-Waterman score: 682; 6.2% identity in 288 aa overlap
25.3% noncontradicting positions, 19.1% class identity
40 50 60 70 80 90
GI||SP XXXXXXXXXXXGLRSLLQGTSKPKEPASCLLKEKERKXXXXXXXXXXXXXXXXXXXXXXX
++ + l e Rk
2681 ikDaqDpNPLIRalAiRTMgcIRVDKIlEYicePLRrcLhDdnaYVRKTAaiCVAKLhdi
vt ce s cm v sm t let kt k edp vv fql
y
20 30 40 50 60 70
100 110 120 130 140 150
GI||SP XXXXXXXXXXXXXXXXXXXXXXXXXXXERTLPFGDVEYVDLDAFXXXXXXXXXXXXXXXX
rt ++ d da
2681 nadlcedqGflddLrnaiaDSNPlViANatAALiEIan dqdaqnLldlnaqninqlLlA
skqmvvel vveq kdlld m v rv s ne shpnsd semiqshvskf t
t s v s h msgvp vs kpvs
t s s s t
80 90 100 110 120 130
160 170 180 190 200 210
GI||SP XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXGTASGHRAGLTSRDTPSPVDPDT
----Ta a -------s V t
2681 LNECTEWariiILdcLanYnaKDdrEAQdIcdRi----TarLaHaN-------aAVVLaa
gqvf gs ge mp el s ie v ph q v p st
t s s s s s
140 150 160 170 180
220 230 240 250 260
GI||SP VEVLMTF------EPD----------PADLALSSIPGHETFDPRRHRFSEEELKPQPIMK
v Vlm f------e d----------pa +L S p + p r+ e P + k
2681 iKVimrllnllqidldfnalilKrLapalVsLlSaePElQYVaLrNIriIlqKrPdiLkq
v lvkfmemppkessscnmlm k sspf t m gp m p k nl ve y el th
ss y yygt t s p k
s t v
190 200 210 220 230 240
270 280 290 300 310
GI||SP KAR----KIQVPEEQKDEK--YWSRRYKNNEAAKRSRDARRLKENQISVRAAFLEKENAL
r----K P K EK-- R a+ LKE + v F k
2681 EiriFfVKfNDPiYVKLEKiDIliRLanqaNiaQ lLaELKEYAmEydpdFVrrAirA
lkv y y l l mv vdps lk v s t veve sk vq
m s t
250 260 270 280 290 300
320 330 340
GI||SP LRQEVVAVRQELSHYRAVLSRYQAQHGAL
l q + v E s r v
2681 igrcaIKy EqfaercldiLLdLiqTrqntikddacisirDilRhcPn Kqecai
lsqlg v psvskvvst e le kvdyvvqecivvlc lf ky g yvsiv
s k v
310 320 330 340 350
4810 5 iif transcription chain alpha factor tfiif subu (563 aa)
initn: opt: 631 z-score: 109.9 E(): 2
Smith-Waterman score: 631; 5.9% identity in 270 aa overlap
22.6% noncontradicting positions, 16.7% class identity
10 20 30 40 50
GI||SP XBINDINGPRTEINTAXREBMARPVSDRTPAXXXXXXXXXXXXXXXXXXGLRSLLQGTS
R Nt r + r ---- a q
4810 SganVqEfkiRVPrNmpKrhniMaF----NAadnVnFaqWrnarlERdnnaKeir qEEd
sqs t yvv k ps kyhl r tlk d st nqvkm elsn kmy m e
tt sv m f
10 20 30 40 50
60 70 80 90 100 110
GI||SP KPKEPASCLLKEKERKXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX
Pk A + k R
4810 qPefGAGSEfNRdqREEaRRKKfGIiarefrpdaQPWiLrVnGKaGrKfKGireGGVgEN
m ks y kl s y vlkkykved l k g s k y vkk t
t
60 70 80 90 100 110
120 130 140 150 160 170
GI||SP XXXXXXERTLPFGDVEYVDLDAFXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX
-p G +E l
4810 aaffiFTqa-aDGAiEAfPlhnWYNFqPiarhrsLsAEEAEqEfeRRnKVlNhFsiMqrr
tsyyv hc p f y vse t lqkykt t e wg k m y tl lqk
t v
120 130 140 150 160 170
180 190 200 210 220 230
GI||SP XXXXXXXXXXXXXXXXGTASGHRAGLTSRDTPSPVDPDTVEVLMTFEPDPADLALSSIPG
g ra r s-------e +t d Dl lsS
4810 RLr DqdqdqDedEaggg EKaa rrKak-------dLrIhDldd DlEdeSdae
k eeeee pe ekli rg kk ks e k t mee s ls tes
vg k g t s m s
180 190 200 210
240 250 260 270 280 290
GI||SP HETFDPRRHRFSEEELKPQ---PIMKKARKIQVPEEQKDEKYWSRRYKNNEAAKRSRDAR
e e Kpq---P+ K arK + DE + E +
4810 dae dqEderrgKaqgKaPLaKGarKKKrKrdsDDEAlEdSDDGDeEGrEmDYMSD
ens ee ggddk pkk g k gd k kgv f e f q v
e g ssip v k
v
220 230 240 250 260 270
300 310 320 330 340
GI||SP RLKENQISVRAAFLEKENALLRQEVVAVRQELSHYRAVLSRYQAQHGAL
q e + + v + +
4810 eSSSddEePEgKakaaqqdedmKGlaEqda SdEEeddEKap eEdadeEee
g eq l s dpepkeekgp vd esd s see ks k eeeg kk
t se p i p v e p k
280 290 300 310 320
3332 heat heavy shock 70 chain heat-shock binding prot (416 aa)
initn: opt: 620 z-score: 109.9 E(): 2
Smith-Waterman score: 620; 5.8% identity in 330 aa overlap
33.3% noncontradicting positions, 27.6% class identity
10 20 30 40 50
GI||SP XBINDINGPRTEINTAXREBMARPVSDR-TPAXXXXXXXXXXXXXXXXXXGLRSLLQ
n eb+a p dR-tPa l q
3332 IGidlGnTsacia inranradiiaNdaGaRaiPaalafs drdrlhGdaAlnqaarnpq
vhf c yssvg vfqdgdvevvp ed n tt sivsyt eeeqyi gq kqsriihve
t t skh q pq d v y v gg evv l ylpls k
ypk k l e t v s
10 20 30 40 50
60 70 80 90 100 110
GI||SP GT---SKPKEPASCLLKEKERKXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX
T--- k s ke k
3332 nTiinarriiGRlfaDkcaqkdcangacaVendndrgryqirgkn eeqnealnpddi
s vfdfkdll kpd pqdvskikelkfr iekdgklkveldttg ggeekimsveev
vkv qf ssg ev ymshsppq v gkvpf issy lkt lft
f v lw yl v kvy
x
60 70 80 90 100 110
120 130 140 150 160 170
GI||SP XXXXXXXXXXXXERTLPFGDVEYVDLDAFXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX
e l e V f
3332 aarhlnrlKeaAeadiGeaannaViTVPanFndeQrqAlgaaaaaaGlnilriInEPsaa
srlifskm li hdyl hdikdv l fd gek ks tkdsgrii fdvvql h tsr
vsmv t tt sv skvte v ty s s t e gk q f s
yp k v
120 130 140 150 160 170
180 190 200 210 220 230
GI||SP XXXXXXXXXXXXXXXXXXXXXXGTASGHRAGLTSRDTPSPVDPDTVEVLMTFEPDPADLA
G---G ts + vd evl T--------a
3332 aiAhaedr pgegarNiliadcG---GgrhdaaiiairnGifrilaT--------a
ll ygiqq tfgkdk vvvfkl islslsvlevnd myevks n
s lgk sk es f ttf v ms ds v t
t f kv s t
180 190 200 210
240 250 260 270 280 290
GI||SP LSSIPGHETFDPR--RHRFSEEELKPQPIMKKARKIQVPEEQKDEKYWSRRYKNNEAAKR
s G e fd r-- h se + k q k k sr n eaaKr
3332 gndniGgdnldrrLanhlaaeFqrlnnanirqn araiarLraaaeaaKr
hdlhl vedftne ldyfiei kkkhqhdlsgt kkslrk krncsrg h
tstsw gh dt vq vs fki p i mm mnei it k
t t e y k v k vs tss v
220 230 240 250 260
300 310 320 330
GI||SP SRDARRLKENQISV--------------RAAFLEKENALLRQEVVAVRQELSHYRAVLSR
s --l qisv--------------Ra f e n l rq avrq l
3332 qLSn--agqanceiDSLadGqDfdaninRarfEelandlFaqcieairqaiadaeldaad
s s lssvqifv ie i yhclvs mky lvnikv rkflkfvdellrqtgfkkdq
t t st sls f f ses t csp nssssp ekv k k tpl
t v y yt k tt s tt
270 280 290 300 310 320
340
GI||SP YQAQHGAL
a
3332 InallL eGGssriPKirqniq dilnaralnadnnanEaaaiGAAiqAaiim
ddvv t vtft lqklle elfggqnklnsippd lips le rlls
he v vtt k f ppkd k vv w v g v
k y v y s
330 340 350 360 370
4703 dynactin 117 150 kd dynein-associated isoform pro (882 aa)
initn: opt: 645 z-score: 109.5 E(): 2.1
Smith-Waterman score: 673; 9.6% identity in 333 aa overlap
23.4% noncontradicting positions, 13.8% class identity
10 20 30 40 50
GI||SP XBINDINGPRTEINTAXREBMARPVSDRTPAXXXXXXXXXXXXXXXXXXGLRSLLQGT
b + p e +b T l l +
4703 srgaAPalleaqKeeanLraQlaDLeEKLETLrqrRnEDKarLrEldKhKIQlEQlQEfr
tpfv mvpsps tsee qd vr t kik s ek k fe m f v wk
p t g s l y
20 30 40 50 60 70
60 70 80 90 100 110
GI||SP SKPKEPASCLLKEKERKXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX
sK e + L ke r
4703 sKiqeaQAdLQrrLlaaeqEaqdaieaKeahaqedarharahrdahldqedareaAdsLQ
t mmgq s ke krgke skeglgg grlhegmgdlsdriemgtsgkgmgegr et
e g g t qymg t nv i k
k
80 90 100 110 120 130
120 130 140 150 160 170
GI||SP XXXXXXXERTLPFGDVEYVDLDAFXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX
Er------vey D
4703 qEldalKEr------ideLemDLEiLraEiqnK GgDgaa SsYqlKQLEq
l vess k vey tt l kh mee s spg t ef e
v s v
140 150 160 170
180 190 200 210 220
GI||SP XXXXXXXXXXXXXXXXXGTASGH----------RAGLTS----RDTPSPVDPDTVEVLMT
h----------r lts----r+ s- d E +
4703 QNaRLKdaLVRlRDLSahdKqdhqKLqKqlEkKrqEleelrrqrErLq-aeidqaEaiia
i et m sse heiv s em m ns vtsveqtk k s eklkel ktvd
l t v vs s
180 190 200 210 220 230
230 240 250 260 270 280
GI||SP FEPDPADLALSSIPGHETFDPRRHRFSEEELKPQPIMKKARKIQVPEEQKDEKYWSRRYK
+ +D AL + Et r ---e E K + ++ eE de s r
4703 dLqEQVDAALGAEEMVEqLadrnl---nLEdKVreLrEeiadLEAlnEmndqLqEnarEl
e k m tekkm d e kl e tvgq me vhee v snh t
t e
240 250 260 270 280
290 300 310 320 330 340
GI||SP NNEAAKRSRDAR-RLKENQISVRAAFLEKENALLRQEVVAVR-QELSHYRAVLSRYQAQH
e A -r kE q v AA----------qE va r-Q + yR + q q
4703 ELdLREqLDlAaaakrEaqrrrdAA----------qETiaDrdQTIkKfRqLtahLnDqn
e e m ngrvk vekeve i vy yq v y e vqk q vl
g l
290 300 310 320 330
GI||SP GAL
L
4703 rELrnrneanaerqqQdPp EiiDfKqkFAEsKAharAIdmqLRQiElaQANrHmq
t mdqqsssekesl p s tf y im t ytk eve m vq e vs
ts v k s
340 350 360 370 380 390
8289 epidermal eps8 growth protein factor receptor kin (822 aa)
initn: opt: 640 z-score: 109.1 E(): 2.2
Smith-Waterman score: 740; 14.3% identity in 307 aa overlap
17.9% noncontradicting positions, 3.6% class identity
30 40 50 60 70 80
GI||SP TPAXXXXXXXXXXXXXXXXXXGLRSLLQGTSKPKEPA---SCLLKEKERKXXXXXXXXXX
+ S E A---S K+K R
8289 SiLALVCKEPTQnKPDLHLFQCDEVKANLISEDIESAISDSKGGKQKRRPdALRMIanAD
v s e sk
150 160 170 180 190 200
90 100 110 120 130
GI||SP XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXERTLPFGDVEY--------V
E+ + + E --------+
8289 PgIPPPPRAPAPaPPGTVTQVDVRSRVAAWSAWAADQGDFEKPRQYHEQEETPEMMAARI
s v
210 220 230 240 250 260
140 150 160 170 180
GI||SP DLDA-----------FXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX
D D+-----------F
8289 DRDVQILNHILDDIEFFITKLQKAAEAFSELSKRKKnKKgKRKGPGEGVLTLRAKPPPPD
s s
270 280 290 300 310 320
190 200 210 220 230
GI||SP XXXXXXXXXGTASGHRAGLTSR-DTPSPVD-------PDTVEVLMTFEPDPADLALSSIP
A L S - PS D-------P + V T P+ A +LS +
8289 EFlDCFQKFKHGFNLLAKLKSHIQNPSAaDLVHFLFTPLNMVVQATGGPELASSVLSPLL
v s
330 340 350 360 370 380
240 250 260 270 280
GI||SP GHETFDPRRHRFSEEELKPQPIM----KKARKIQVPEEQKDEKY-------WSRRYKNNE
+T D + + eE k +---- KaR -+ P EQ Y-------W N
8289 nKDTiDFLNYTanadERqLWMSLGdsWmKaRA-EWPKEQFIPPYVPRFRNGWEPPMLNFM
t v vtge k gt v v
390 400 410 420 430 440
290 300 310 320 330 340
GI||SP AAKRSRDARRLKENQISVRAAFLEKENALLRQEVVAVRQELSHYRAVLSRYQAQHGAL
A D L E+ +V +e L E V S G
8289 GApmEQDlYQLAESVANVAEHQRKQdiKRLSTEHSnVSdYhPADGYAfSSniYhRGpHaD
tt m es s e p y sm t s l
450 460 470 480 490 500
GI||SP
8289 qGEAAmaFKpTpNrqiDRNYdalKTQPKKYAKSKYDFVARNnSELSVlKDDiLEILDDRr
h vp s s hhv epv s m v k
510 520 530 540 550 560
8838 h probable dehydrogenase region ltdh methotrexate (287 aa)
initn: opt: 597 z-score: 108.3 E(): 2.4
Smith-Waterman score: 597; 9.6% identity in 270 aa overlap
20.7% noncontradicting positions, 11.1% class identity
10 20 30 40 50 60
GI||SP XBINDINGPRTEINTAXREBMARPVSDRTPAXXXXXXXXXXXXXXXXXXGLRSLLQGTSK
+ + A r+ A s L T
8838 TaPTaPVALVTGAAKRLGrSIAEaLHAEGYaVCLHYHRSAAdAnaLaATLN
s v s g t e st s
10 20 30 40 50
70 80 90 100 110 120
GI||SP PKEPASCLLKEKERKXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX
+ P S + + +
8838 ARRPNSAITVQADLSNVATApfSeaDGSaPVTLFsRCaaLVaACYmHWGRCDVLVNNASS
sv gt v t se d t
60 70 80 90 100 110
130 140 150 160 170
GI||SP XXXXX-------ERTLPFGDVEYVDLDAFXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX
-------e GD E + a
8838 FYPTPLLRnDadegepcVGDrEalEtAaADLFGSNAIAPYFLIKAFAqRsadpraaqRGT
k egghgss k sm v t h vrhtsqes
120 130 140 150 160 170
180 190 200 210 220
GI||SP XXXXXXXXXXXXXXXXXXXXXXGTASGHRAGLTSRDT----PSPVDPDTVEVLMTFEPDP
A g GLT +----p + + V ++ pD-
8838 nYSIiNMVDAMTnQPLLGYTiYTMAKeALEGLTRSAALELApLQIRVNGVgPGLSVLpD-
s v s m g s s v
180 190 200 210 220 230
230 240 250 260 270 280
GI||SP ADLALSSIPGHETFDPRRHR-FSEEELKPQPIM---KKARKIQVPEEQKDEKYWSRRYKN
-D+ s gh P +R- S eE I --- KA+ I + D Y R
8838 -DMPfaVqEdhRrKVPLYQRnSSAaEVSDVVIFLCSpKAKYITGTCiKVDGGYSLTRA
ps w gy s d e s v
240 250 260 270 280
290 300 310 320 330 340
GI||SP NEAAKRSRDARRLKENQISVRAAFLEKENALLRQEVVAVRQELSHYRAVLSRYQAQHGAL
8838
290 300 310 320 330 340
8492 lmp1 lmp2 gene product (479 aa)
initn: opt: 612 z-score: 107.7 E(): 2.6
Smith-Waterman score: 879; 12.4% identity in 370 aa overlap
18.6% noncontradicting positions, 6.2% class identity
10 20 30
GI||SP XBINDINGPRTEINTAXREBMARPVSDRTPAXX
N I +TE+ S+ T +
8492 ADNLAKSIKEQLNNSVSNANTLSAKLTDKDNTIQQAKTELEKEVQKAnQAIKSNNTASMQ
d
100 110 120 130 140 150
40 50 60 70 80 90
GI||SP XXXXXXXXXXXXXXXXGLRSLLQGTSKPKEPASCLLKEKERKXXXXXXXXXXXXXXXXXX
-----L+ +K KE -----K E K
8492 SAKSSLDAKVAEITKK-----LETFNKDKEA-----KFNELKQTRNQIQEFINTNKNNPN
160 170 180 190 200
100 110 120 130 140 150
GI||SP XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXE--RTLPFGDVEYVDLDAFXXXXXXXXX
E-- +L + + V D---
8492 YSELISQLTSKRDSKNSVTDSSNKSDIESANTELKQALAKANADKVQAD---NLAKSIKE
210 220 230 240 250 260
160 170 180 190 200 210
GI||SP XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXGTASGHRAGLTSRDTP
TAS + A -------
8492 QLNNSVSNANTLSAKLTDKDNTIQQAKTELEKEVQKANQAIKSNNTASMQSAK-------
270 280 290 300 310
220 230 240 250 260
GI||SP SPVDPDTVEV---LMTFEPDPA----DLALSSIPGHETFDPRRHRFSEEELKPQPIMKKA
S D E+---L TF D ----+L + +E + ++ + EL Q K+
8492 SSLDAKVAEITKKLETFNKDKEAKFNELKQTRNQIQEFINTNKNNPNYSELISQLTSKRD
320 330 340 350 360 370
270 280 290 300
GI||SP RKIQVPEEQKDEKYWSRRYKNNE---------AAKRSRD--ARRLKE---NQIS----VR
K V + S--- N E---------A K s D--ar lKe---n is----+R
8492 SKNSVTDSSNKSDIES---ANTELKQALakAnAdKsqaDNearpiKndLnnkienanpiR
nt k k vsi llksl eq qssvsefgtl
380 390 400 410 420 430
310 320 330 340
GI||SP AAFLEKENALLRQEVVAVRQELS-HYRAVLSRYQAQHGAL
a l + l q +El - A+ s a al
8492 nanlsdidnkiqqaKneLaeElqKAnQAIKnNnsaSkQaaKdS
stkftwksstlett tk ek vt d s pts m sl s
440 450 460 470
7391 hypothetical lactococcin a in protein secretion l (458 aa)
initn: opt: 601 z-score: 106.2 E(): 3.1
Smith-Waterman score: 706; 25.4% identity in 130 aa overlap
25.4% noncontradicting positions, 0.0% class identity
230 240 250 260 270
GI||SP TFEPDPADLALSSIPGHETFDPRRHRFSEEELKPQPIMKKARKIQVPEEQK---DEKY--
A ++ EL Q +K+Q Q+---+EK --
7391 IENNLKEGEAVKENSLLLKYNGTPEQTQLSELLTQKKQALDKKVQLDLLQRSLTNEKNEF
70 80 90 100 110 120
280 290 300 310
GI||SP -------WSRRYKNNEAAKRSRDARRLKENQ---------------ISVRAAFL------
------- + + N EA +S +A K NQ---------------I +A L------
7391 PTADSFGYEKSFENYEAQVKSLEATIQKSNQAVEDQNKSTESQKQAIQNQVATLQQAIQN
130 140 150 160 170 180
320 330 340
GI||SP --EKENALLRQEVVAVRQE---LSHYRAVLSRYQAQHGAL
--E ENA V+--Q+---LS+Y +----YQAQ+ L
7391 YSEIENAVSSGGGVS--QDNPYLSQYNS----YQAQQATLEADLKNQKNPDETAKQAAKS
190 200 210 220 230
GI||SP
7391 QEESLKSQFLSGLASSKDSLKSQIQSFNVQESSLTGSNAYDNSQSSQILTLKSQALSASN
240 250 260 270 280 290
3138 defective fc chorion-1 proteins fc106 fc125 fc177 (390 aa)
initn: opt: 593 z-score: 105.8 E(): 3.3
Smith-Waterman score: 593; 14.7% identity in 217 aa overlap
20.7% noncontradicting positions, 6.0% class identity
110 120 130 140 150 160
GI||SP XXXXXXXXXXXXXXXXXERTLPFGDVEYVDLDAFXXXXXXXXXXXXXXXXXXXXXXXXXX
+ + l+
3138 LQPeAAASrVVLVLADDATAKaRVaRQNPPlNPLGQLMNWPALPQDFQLPSMDLGPQVGS
t k t v p
110 120 130 140 150 160
170 180 190 200 210 220
GI||SP XXXXXXXXXXXXXXXXXXXXXXXXXXXGTASGHRAGLTSRDTPSPVDPDTVEVLM-----
+A+ A + D P+ PD+ + +-----
3138 FLAQLPaMPaiPgiLGAAAPVPAPAPAPAAaPPlAPAPAADPPAAPVPDAaQPAILGqAA
p tv sl t p p e
170 180 190 200 210 220
230 240 250 260 270
GI||SP -----TFEPDPADLALSSIPGHE--TFDPRRHRFSEEELKPQ--PIMKKARKIQVPEEQK
-----TF- +Pa+ Ss+ G+ --TF P F- +++ Q--P M A --Q
3138 LQNAFTF-lNPaNFDASgLLGQSaPTFAPPNlDF-VAQMQRQFFPGMTPA --QPAaAGT
f s s v f p
230 240 250 260 270
280 290 300 310 320 330
GI||SP DEKYWSRRYKNNEAAKRSRDARRLKENQISVRAAFLEKENALLRQEVVAVRQELSHYRAV
D----+ +E+ R -+a +E Q+ +++A-LE E
3138 D----AqASDISEVRVRP-EaPYSQEAQMKIKSA-LEMEQERQQ
l d
280 290 300 310
340
GI||SP LSRYQAQHGAL
QAQ
3138 QAQVKDQEQVPLLWFrMPTTQNQDATaEKTLEdLRVEAKLRAFERQVIaELRMLQ
h e h s
320 330 340 350 360
8548 yopd protein (306 aa)
initn: opt: 584 z-score: 105.8 E(): 3.3
Smith-Waterman score: 705; 13.5% identity in 311 aa overlap
17.4% noncontradicting positions, 3.9% class identity
10 20 30 40
GI||SP XBINDINGPRTEIN-TAXREBMARPVSDRTPAXXXXXXXXX
+ g Ei -T A + +
8548 MTINIKTDSPIITTGSQLDAITTETVgQSGEiKKTEDTRHEAQAIKSSEASLSRSQVPEL
k v
10 20 30 40 50 60
50 60 70 80 90 100
GI||SP XXXXXXXXXGLRSLLQGTSKPKEPASCLLKEKERKXXXXXXXXXXXXXXXXXXXXXXXXX
L S QG LL E RK
8548 IKPSQGINVALLSKSQGDLNGTLSILLLLLELARKAREMGLQQRDIENKATIsAQKEQVA
t
70 80 90 100 110 120
110 120 130 140 150 160
GI||SP XXXXXXXXXXXXXXXXXXXXXXXXXERTLPFGDVEYVDLDAFXXXXXXXXXXXXXXXXXX
-------------AF
8548 EMVSGAKLMIAMAVVSGIMAATSTVAS-------------AFSIAKEVKIVKQEQILNSN
130 140 150 160
170 180 190 200 210
GI||SP XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXGTASGHRAGLTS---RDTPSPVDPD
A ++ L ---R T S +
8548 IAGRdQLIDTKMQQMgNaGDKAVSREDIGRIWKPEQVADQNKLALLDKEFRMTDSKANAF
e s i
170 180 190 200 210 220
220 230 240 250 260 270
GI||SP TVEVLMTFEPDPADLALSSIPGHETFDPRRHRFSEEELKPQPIMKKARKIQVPEEQKDEK
----+P- +A S+I H-------+ +S+ E K + + ---I E QK E
8548 NAAT----QP-LGQMANSAIQVH-------QGYSQAEVKEKEVNAS---IAANEKQKAEE
230 240 250 260 270
280 290 300 310 320 330
GI||SP YWSRRYKNNEAAKRSRDARRLKENQIS-----VRAAFLEKENALLRQEVVAVRQELSHYR
+--Y +N ----+D+ RL E +S-----++AAF
8548 AMN--YNDNFM----KDVLRLIEQYVSSHTHAMKAAFGVV
280 290 300 310 320
340
GI||SP AVLSRYQAQHGAL
8548
330
5939 69 autoantigen kd p69 (483 aa)
initn: opt: 594 z-score: 104.7 E(): 3.8
Smith-Waterman score: 594; 24.2% identity in 99 aa overlap
33.3% noncontradicting positions, 9.1% class identity
210 220 230 240 250 260
GI||SP TPSPVDPDTVEVLMTFEPDPADLALSSIPGHETFDP-RRHRFSEEELKP-QPIMKKARKI
+ T + + HE+F - + F+ --LK -Q MKK ---
5939 RCNLLSHMLATYQTTLLHFWEKTSHTMAAIHESFKGYQPYEFTT--LKSLQDPMKKL---
230 240 250 260 270
270 280 290 300 310 320
GI||SP QVPEEQKDEKYWSRRYKNNEAAKRSRDARRLKENQISVRAAFLEKENALLRQEVVAVRQE
---- +K+EK s r n eAa-- + r L----IS-----LE EN-- r+E + + E
5939 ----VEKEEKKKinrrEnrdAa--aQEPrQL----IS-----LEEEN--QrKESSscqkE
ssqq ste v v s h tfkt
280 290 300 310 320
330 340
GI||SP LSHYRAVLSRYQAQHGAL
--++vlS
5939 dG--KSilSalDKgSaddACSGPIDELLDmKpEEGACLGPmAGTPEPEgaDKDDLLLLnE
e vp sv s tht v s v sg s
330 340 350 360 370
5176 histone-binding nuclear autoantigenic hgv2 protei (704 aa)
initn: opt: 605 z-score: 104.3 E(): 4
Smith-Waterman score: 681; 5.2% identity in 325 aa overlap
27.7% noncontradicting positions, 22.5% class identity
10 20 30 40
GI||SP XBINDINGPRTEINTAXREBMARPVSDRTPAXXXXXXXXXXXXXXXXX
Eb++r t a
5176 EaaDAEeeKSVSGTDVQEEcrEq gqEKQGEVIVrI EKPkEaSEEQPgtTLeKdnTAVE
vp kg hk k ve s t v vv g qg
140 150 160 170 180 190
50 60 70 80 90 100
GI||SP XGLRSLLQGTSKPKEPASCLLKEKERKXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX
sl- T KP + kEk
5176 VEAEpl-DaTaKPVDVGGdEPeEqmaTSeNEaGKAVL qQLVGQdVPPaEESPeVqTEaa
sv p v h k kvv g p e e v m t te
200 210 220 230 240 250
110 120 130 140 150 160
GI||SP XXXXXXXXXXXXXXXXXERTLPFGDVEYVDLDAFXXXXXXXXXXXXXXXXXXXXXXXXXX
e t d e dl a
5176 easadeagdeaSrdpeqdapglgndgasndaeaagdQaeidPqplaErliETKdgdeleE
kvtdvlkissv eksgmeksvkpeppevkglstlve kpse etsi kst eksgsk
v vp t vsk p vv tsvk k v g l
260 270 280 290 300 310
170 180 190 200 210 220
GI||SP XXXXXXXXXXXXXXXXXXXXXXXXXXXGTASGHRAGLTSRDTPSPVDPDTVEVLMTFEPD
t s a ts---p pv tv -t E
5176 KtrAeeaanQ EaKLpidekeaaedgmaeeaaqgaeeek---qadkeneaandd-ddEre
vd kltps t spesspgegeksdsksdekeksks iektvqitnkee kq km
v v t k s vet vs skt t pppv ks v pv tp s
v v s
320 330 340 350 360
230 240 250 260 270 280
GI||SP PADLALSSIPGHETFDPRRHRFSEEELKPQPIMKKARKIQVPEEQKD-EKYWSRRYKNNE
ss g e d seE + K+ pee +-e W
5176 edeeeeedneenedddeenagaeeEnPNDSVLENKSeqEndeddigNlqLAWdMLdLaKi
sqkkvgsestgepeesgtsdkssk k lp eepeevs me e e c t
emp st s ggtk est m s t v
t t
370 380 390 400 410 420
290 300 310 320 330 340
GI||SP AAKRSRDARRLKENQISVRAAFLEKENALLRQEVVAVRQELSHYRAVLSRYQAQHGAL
Kr +---Kenq v a l --l Ev Q + + a Ls- Q qh
5176 ifKrq qs---KeaqqkaAQaHqKLGE--lciEsqNhiQAiedFqaCLn-iQedhLeahD
ly kh et tnklmv c l vgl ve yp vge le s l kql pet
k y sv s s v ey
v
430 440 450 460 470 480
GI||SP
5176 RlLAEThYnLGLAYqfnkrhdnAiaqfqqaidViEaRmamLneqieaaeGnle de
k y q gyesqyee lehysksle l n vdv tkllkenv ekt is
h s s k vs ts g k m ss ssv
v
490 500 510 520 530
4573 element insertion is421 hypothetical 47 is186 41 (354 aa)
initn: opt: 580 z-score: 104.3 E(): 4
Smith-Waterman score: 875; 16.3% identity in 343 aa overlap
18.7% noncontradicting positions, 2.3% class identity
10 20 30 40 50
GI||SP XBINDINGPRTEINTAXREBMA--RPVSDRTPAXXXXXXXXXXXXXXXXXXG
I G E++T+ R A--R R A
4573 MNYSHDNWSAILAHIGKPEELDTSARNAGALTRRREIRDAATLLRLGLAYGPGGM SLRE
10 20 30 40 50
60 70 80 90 100
GI||SP LRSLLQGTSKPKEPASCLLKE--KERKXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX
+ Q LLK --
4573 VTAWAQLHDVATLSDVALLKRLRNAADWFGILAAQTLAVRAAVTGCTSGKRLRLVDGTAI
60 70 80 90 100 110
110 120 130 140 150 160
GI||SP XXXXXXXXXXXXXXXXXERTLPFGDVEYVD------LDAFXXXXXXXXXXXXXXXXXXXX
T F D E D------LD F
4573 SaPGGGsAEWRLHMGYDPHTCQFTDFELTDSRDAERLDRFAQTADEIRIADRGFGSRPEC
g t
120 130 140 150 160 170
170 180 190 200 210 220
GI||SP XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXGTASGHRAGLTSRDTPSPVDPDTVEVL
G G G T---V
4573 IRSLAFGEADYIVRVHWRGLRWLTAEGMRFDMMGFLRGLDCGKNGETT---VMIGNSGNK
180 190 200 210 220 230
230 240 250 260 270
GI||SP MTFEPDPADLALSSIPGHETFDPRRHRFSEEELK---PQPIMKKAR-----KIQVPE---
+ P PA L S+P + + SE K--- Q A ----- PE---
4573 KAGAPFPARLIAVSLPPEKALISKTRLLSENRRKGRVVQAETLEAAGHVLLLTSLPEDEY
240 250 260 270 280 290
280 290 300 310
GI||SP --EQKDEKYWSRRYKNNEAAKRSR-----DARRLKENQIS-------VRAAFLEKENALL
--EQ + Y R- + A KR +-----DA R KE +++------- AAFL +
4573 SAEQVADCYRLR-WQIELAFKRLKSLLHLDALRAKEPELAKAWIFANLLAAFLIDDII
300 310 320 330 340 350
320 330 340
GI||SP RQEVVAVRQELSHYRAVLSRYQAQHGAL
4573
360 370 380
9186 pes4 pab-like protein (381 aa)
initn: opt: 580 z-score: 103.8 E(): 4.2
Smith-Waterman score: 587; 7.4% identity in 285 aa overlap
21.1% noncontradicting positions, 13.7% class identity
10 20 30 40 50 60
GI||SP XBINDINGPRTEINTAXREBMARPVSDRTPAXXXXXXXXXXXXXXXXXXGLRSLLQGTSK
+ E m r + r p+ G + -----k
9186 LFIGnLheTVTEEmLrgIFKrYqSFeSAKVCrDflTKKSLGhGYLNF-----e
d ks t kk k p v i sv y k
10 20 30 40
70 80 90 100 110
GI||SP PKEPASCLLKE--------KERKXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX
Ke A kE--------kE k
9186 DKndAEkAreElNYTkfnGqEirIMPSlrNTlFRKNiGTNVFFSNLPLnNPqLTTRsFYd
ee s mk f vvf k vk mk t f e l v l
50 60 70 80 90 100
120 130 140 150 160 170
GI||SP XXXXXXXXXXXXXERTLPFGDVEYVDLD--AFXXXXXXXXXXXXXXXXXXXXXXXXXXXX
er G V-Y d d--A
9186 imirYGniLSClLdrRKnIGFV-YFdndisARNVIKkYNNqeFFGnKIiCGiHFDKEVRs
tfse kv k es d edekt m ts k l l t
110 120 130 140 150 160
180 190 200 210 220 230
GI||SP XXXXXXXXXXXXXXXXXXXXXXXXXGTASGHRAGLTSRDTPSPVDPDTVEVLMTFEPDPA
+ G a + + S ---+t+ v---------
9186 rPnFekrKkridadiiIEdEqlannnhSKGnnarSKNIYSSSQ---NsIli---------
v e ttq smlgsetv k lslsekl ddke t fv
170 180 190 200 210
240 250 260 270 280
GI||SP DLALSSIPGHETFDPRRHRFSEEELKPQPIMKKARKIQVPEEQKDEKYWS-RRYKNNEAA
---- ++P T d FSe ----PI----+ i + e qk w+- YKN e +
9186 ----KNLPsdTTrddiLnfFSeiG----PI----KSifiSnaqankphkAFVTYKNeedS
ti qeev dy tv vyl ektkvtylw sse
220 230 240 250 260
290 300 310 320 330 340
GI||SP KRSRDARRLKENQISVRAAFLEKENALLRQEVVAVRQELSHYRAVLSRYQAQHGAL
k++ + k+ q + + + +
9186 eKAqKrlNnfiFrnhkilVgraqDKeeraqFIesnKisklfLeNLSanCNKEFikqLChQ
k i cy kty kgktlw tpgk pvhnk gtq kttvy k fv lsy l
270 280 290 300 310 320
8288 macrogolgin rat gcp360 (3267 aa)
initn: opt: 657 z-score: 103.7 E(): 4.3
Smith-Waterman score: 1078; 13.3% identity in 347 aa overlap
20.5% noncontradicting positions, 7.2% class identity
10 20 30 40
GI||SP XBINDINGPRTEINTAXREBMARPVSDRTPAXXXXXXXXX
Te TA E-------+ t a
8288 dLrrSlnALQEEnQdLSKEIeSlKVSISQLTrQlTALqE-------EGaLalYHAQLrVr
s qm fd k g k f e v h t gv k k
2600 2610 2620 2630 2640
50 60 70 80 90 100
GI||SP XXXXXXXXXGLRSLLQGTSKPKEPASCLLKEKERKXXXXXXXXXXXXXXXXXXXXXXXXX
l S + t E C KE +K
8288 EEEVqrLsAalSSSQKRiadLqEELVCVQKEAaKKVgEIEDKLKrELKHLHHnAGIMRNE
hk t lf tve e s s k d
2650 2660 2670 2680 2690 2700
110 120 130 140 150 160
GI||SP XXXXXXXXXXXXXXXXXXXXXXXXXERTLPFGDVEYVDLDAFXXXXXXXXXXXXXXXXXX
------- E L E dL A
8288 TETAEERVAELARDLVE-------MEQKLLmVTKENKdLTAQIQaFGrSMSSLQnSRDHA
t g s k d
2710 2720 2730 2740 2750
170 180 190 200 210 220
GI||SP XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXGTASGHRAGLTSRDTPSPVDPDTVE
a+- Ts ++ S + +
8288 nEELddLKrKYDASLKELAQLKerqdLnRErDAlLSqaA-FplNsTeENilSrLEKLNQQ
t se k gqgl g s v et sm t s ss h
2760 2770 2780 2790 2800 2810
230 240 250 260 270
GI||SP VLMTFEPDPADLALSSIPGHETFDPRRHRFSEEELKPQPI-------MKKARKIQVPEEQ
l ---D L LSS-- E + Fs Q -------+ K RK--- EE
8288 LiSK---DEQLLHLSS--qLEdShNQVQSFsKAMaSLQNERDHLWNELEKFRK---SEEG
l e s y t t
2820 2830 2840 2850 2860
280 290 300 310 320 330
GI||SP KDEKYWSRRYKNNEAAKRSRDARRLKENQISVRAAF--LEKENALLRQEVVAVRQELSHY
K------ R aa ++ LK S --L KE L Q+ + QE++
8288 K------QRSAAqSaasSPAEVQSLKKAMSSLQNDRDRLLKELKNLQQQYLQiNQEITEL
p pst m
2870 2880 2890 2900 2910 2920
340
GI||SP R---AVLSRYQAQHGAL
r---A L yQ q Al
8288 rPLKAQLQEsQDqTKAlQiMqEELRQENLSWQHELdQLRmEKnSWEiHERRMKEQYLMAI
h y k f m k h v s l
2930 2940 2950 2960 2970 2980
8676 alpha-helical coiled coil protein tlpa (345 aa)
initn: opt: 575 z-score: 103.6 E(): 4.4
Smith-Waterman score: 915; 20.2% identity in 218 aa overlap
24.8% noncontradicting positions, 4.6% class identity
110 120 130 140 150 160
GI||SP XXXXXXXXXXXXXXXXXXXXXERTLPFGDVEYVDLDAFXXXXXXXXXXXXXXXXXXXXXX
L EY--- A
8676 LQAEGRNITGFALRNQVGGGNPTRLRQIcDEY---QASQSTVVTEPVAELPVEVAEEVKA
w
20 30 40 50 60 70
170 180 190 200 210 220
GI||SP XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXGTASGHRAGLTSR---DTPSPVDPDTVEV
A+G + R---D+ VD-D E
8676 VSAALSERITQLATELNDKAVRAAERRVAEVTRAAGEQTAQAERELADAAQTVD-DLEEK
80 90 100 110 120 130
230 240 250 260 270
GI||SP LMTFEP--DPADLALSSIPGHETFDPRRHRFSEEELKPQPIMKKARKIQVPEEQKDEKYW
L + --D LAL S----E H +LK Q E +++K
8676 LDELQDRYDSLTLALES----ERSLRQQHDVEMAQLKERLAAAEENTRQREERYQEQKTV
140 150 160 170 180
280 290 300 310 320 330
GI||SP SRRYKNNEAA--KRSRDARRLKENQISVRAAFLEKENALLRQEVVAVRQEL-SHYRAVLS
N E A--K +R+ + QIS A E R V L-S+ A S
8676 LQDALNAEQAQHKNTREDLQKRLEQISAEANARTEELKSERDKVNTLLTRLESQENALAS
190 200 210 220 230 240
340
GI||SP RYQAQHGAL
Q-QH A
8676 ERQ-QHLATRETLQQRLEQAIADTQARAGEIALERDRVSSLTARLESQEKASSEQLVRMG
250 260 270 280 290 300
8248 hypothetical orf3 protein 3 (417 aa)
initn: opt: 579 z-score: 103.1 E(): 4.6
Smith-Waterman score: 824; 16.4% identity in 323 aa overlap
22.6% noncontradicting positions, 6.2% class identity
10 20 30 40 50 60
GI||SP XBINDINGPRTEINTAXREBMARPVSDRTPAXXXXXXXXXXXXXXXXXXGLRSLLQGTSK
RP SD T G R G +
8248 mLEAPRPLSDPTCLARRSDVSIVAERRVVAQVGVRPVVAGLAE
v
10 20 30 40
70 80 90 100
GI||SP PKE--------------PASCLLKEKERKXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX
E--------------P + ER+
8248 QVEEWEREDDAQGPTDGPEGGVRRWSERQRELGARRAVAVPGQRRVTPSYTAPDRVDLRT
50 60 70 80 90 100
110 120 130 140 150 160
GI||SP XXXXXXXXXXXXXXXXXXXERTLPFGD--VEYVDLDAFXXXXXXXXXXXXXXXXXXXXXX
E ----GD--V YV --A
8248 AEDVPLTPVLAARVAALVEEH----GDHLVRYVA--ARLRDAEWAYWARAEDVAQDVWLD
110 120 130 140 150
170 180 190 200 210 220
GI||SP XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXGTASGHRAGLTSRDTPSPVDPDTVEVLMT
T + R R--- P D + EVL -
8248 VARGRVPELLEEVPDLPWPRLAAAAKWSLLDNTRTRRRREWLVR---VPEDRSADEVLE-
160 170 180 190 200 210
230 240 250 260 270 280
GI||SP FEPDPADLALSSIPGHETFDPRRHRFSEEELKP---QPIMKKARKIQVPEEQKDEKYWSR
--------AL+ -PG ++ E E P--- P R P Q+-E R
8248 --------ALAG-PGPDSTVCAVDELMEPEPEPGGWAPACYAERIAALPPRQR-EVLELR
220 230 240 250 260
290 300 310 320 330 340
GI||SP RYKNNEAAKRSRDARRLKENQISVRAAFLEKENALLRQEVVAVRQELSHYRAVLSRYQAQ
--- E A RL + SV A AL V-VRQ L
8248 C---TEGMTTPAIAARLGISRQSVDRALRFAVTALGSPGTV-VRQRSRGAGQPLPAGWER
270 280 290 300 310
GI||SP HGAL
8248 VLDRLPNRTQRDVVRLRAGGASFGELGEQLGLHRGYAHELYTRALRSLREMVQDQRLDPV
320 330 340 350 360 370
8306 yd9395.16 cdc1 gene len: 491 cai: 0.13 (491 aa)
initn: opt: 584 z-score: 103.0 E(): 4.7
Smith-Waterman score: 584; 12.7% identity in 244 aa overlap
17.2% noncontradicting positions, 4.5% class identity
40 50 60 70 80 90
GI||SP XXXXXXXXXXXXXXGLRSLLQGTSKPKEPASCLLKEKERKXXXXXXXXXXXXXXXXXXXX
+ S K +K
8306 KSRSKKRLRIYWRYISIVWILWLGLISYYESVVVKRAMKKCQWSTWEDWPEGAESHRVGL
40 50 60 70 80 90
100 110 120 130 140 150
GI||SP XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXERTLPFGDVEYVDLDAFXXXXXXXXXXXXX
R Y D D+
8306 FADPQIMDEYSYPGRPQIVNYFTRVIVDHYHRRNWKYVQYYLDPDSNFFLGDLFDGGRNW
100 110 120 130 140 150
160 170 180 190 200 210
GI||SP XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXGTASGHRAGLTSRDTPSPVD
+S +R +T S D
8306 DDKQWIKEYTRFNQIFPKKPLRRTVMSLPGNHDIGFGDTVVESSLQRFSSYFGETSSSLD
160 170 180 190 200 210
220 230 240 250 260 270
GI||SP P--DTVEVLMTFE-PDPADLALSSIPGH--ETFDPRRHRFSEEELKPQPIMKKARKIQVP
-- T L T - D + S +P +--+ F H L P+ + ------P
8306 AGNHTFVLLDTISLSDKTNPNVSRVPRQFLDNFAMGSHPLPRILLTHVPLWRD------P
220 230 240 250 260
280 290 300 310 320 330
GI||SP EEQKDEKYWSRRYKNNEAAKRSRDARRLKENQISVRAAFLEKENALLRQEVVAVRQELSH
E+Q + --R K ++ EN IS + L + Q
8306 EQQTCGQL--RESKEPFPIQKGHQYQTVIENDISQEILTKIQPEILFSGDDHDHCQISHS
270 280 290 300 310 320
340
GI||SP YRAVLSRYQAQHGAL
Y AQ
8306 YPFQGKTKNAQEITVKSCAMNMGISRPAIQLLSLYNPSDLTMVNAGGEYASKTYQTELCY
330 340 350 360 370 380
5970 cfxy cfxyc protein plasmid (254 aa)
initn: opt: 560 z-score: 103.0 E(): 4.7
Smith-Waterman score: 604; 11.7% identity in 248 aa overlap
16.9% noncontradicting positions, 5.2% class identity
10 20 30 40 50
GI||SP XBINDINGPRTEINTAXREBMARPVSDRTPAXXXXXXXXXXXXXXXXXXGLRSLLQ--
I D++G ++ tA -- A + G L++--
5970 MQALIFDVDGTLADTEsAH--LQAFNAAFAEVGLDWhWDAPLYTRLLKVAGGKERLMHYW
t y
10 20 30 40 50
60 70 80 90 100 110
GI||SP GTSKPKEPASCLLKEKERKXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX
P E C KE---------
5970 RMVDPEEARGCKVKE---------TIDAVHAIKTRHYAERVGAGGLPLRPGIARLIaEAG
d
60 70 80 90 100
120 130 140 150 160 170
GI||SP XXXXXXXXXERTLPFGDVEYVDLDAFXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX
T P------ +LDA
5970 EAGLPLAIATTTTP------ANLDALLQAhLGADWRrRFAAIcDAGTTAIKKPAPDVYLA
p g g
110 120 130 140 150 160
180 190 200 210 220
GI||SP XXXXXXXXXXXXXXXXXXXGTASGHRAGLTSRDTPSPVDPDTVEVLMTFEP---------
---+a+G RA ++- P+ V P t +FE ---------
5970 VLERLGLEaGDCLAIED---SaNGLRAARAA-GIPTVVTPaaFSAQDSFEGALLVLPHLG
g g tt
170 180 190 200 210
230 240 250 260 270 280
GI||SP DPADLALSSIPGHETFDPRRHRFSEEELKPQPIMKKARKIQVPEEQKDEKYWSRRYKNNE
DPa+ +PG
5970 DPaEPMPQHVPGAAnRWADLAALRAWHHGTLIEAT
g h
220 230 240 250 260 270
8078 von vwf pre-pro-polypeptide willebrand -22 factor (436 aa)
initn: opt: 579 z-score: 102.9 E(): 4.8
Smith-Waterman score: 791; 22.6% identity in 164 aa overlap
28.0% noncontradicting positions, 5.5% class identity
180 190 200 210 220 230
GI||SP XXXXXXXXXXXXXXXXXXXXXXGTASGHRAGLTSRDTPSPVDPDTVEVLMTFEPDPADLA
A gL T Pv p T V t EP D
8078 CqACrEPGgLVVPPTdaPigpTTlYVEDipEPPLHDFh
e q s eg vss s ts y
10 20 30
240 250 260 270
GI||SP LSSIPGHETFDPRRHRFSEEE---LKP--------QPIMKKARKIQVPE---------EQ
S + r SE+E---LK -------- I +K ++ V E---------e
8078 CSRLLDLVFLLDGSSrLSEaEFEVLKaFVVdMMErLrISQKriRVAVVEYHDGSHAYIeL
k d v g h h wv g
40 50 60 70 80 90
280 290 300 310 320
GI||SP KDEKYWS--RR------YKNNEAAKRSRDARRLKENQISVRAAFLEKE-NALLRQEVVAV
KD K S--RR------Y +e+A S-++ + QI + E - ALL----
8078 KDRKRPSELRRIaSQVKYAGSqVASTS-EVLKYTLFQIFgKIDRPEASRIALL----LMA
t e s
100 110 120 130 140 150
330 340
GI||SP RQELSHYRAVLSRYQAQHGAL
QE s l RY
8078 SQEPqRlaRNlVRYVQGLKKKKVIVIPVGIGPHAnLKQIrLIEKQAPENKAFVlSgVDEL
s ms f s h f s
160 170 180 190 200 210
8325 hap4 aa transcriptional 1-554 activator (554 aa)
initn: opt: 587 z-score: 102.8 E(): 4.9
Smith-Waterman score: 587; 13.7% identity in 249 aa overlap
15.3% noncontradicting positions, 1.6% class identity
10 20 30 40 50
GI||SP XBINDINGPRTEINTAXREBMARPVSDRTPAXXXXXXXXXXXXXXXXXXGLRS
+ PV S
8325 MTAKTFLLQASASRPRSNHFKNEHNNIPLAPVPIAPNTNHHNNSSLEFENDGSKKKKKSS
10 20 30 40 50 60
60 70 80 90 100
GI||SP LLQGTSK----PKEPASCLLKEKERKXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX
L TSK----P P
8325 LVVRTSKHWVLPPRPRPGRRSSSHNTLPANNTNNILNVGPNSRNSSNNNNNNNIISNRKQ
70 80 90 100 110 120
110 120 130 140 150 160
GI||SP XXXXXXXXXXXXXXXXERTLPFGDVEYVDLDAFXXXXXXXXXXXXXXXXXXXXXXXXXXX
E+ +-- D Y ---AF
8325 ASKEKRKIPRHIQTIDEKLI--NDSNYL---AFLKFDDLENEKFrSSASSISSPSYSSPS
h
130 140 150 160 170
170 180 190 200 210 220
GI||SP XXXXXXXXXXXXXXXXXXXXXXXXXXGTASGHRAGLTSR---DTPSPVD-PDTVEVLMTF
-----T + H + LT ---D+ S V -P T + -
8325 FSSYRNRKKSEFMDDESCTDVE-----TIAAHNSLLTKNHHIDSSSNVHAPPTKKSKLN-
180 190 200 210 220
230 240 250 260 270 280
GI||SP EPDPADLALSSIPGHETFDPRRHRFSEEELKPQPIMKKARKIQVPEEQKDEKYWSRRYKN
--D L+LSS T P + L + I KA P + S
8325 --DFDLLSLSSTSSSATPVPQLTKDLNMNLNFHKIPHKASFPDSPADFSPADSVSLIRNH
230 240 250 260 270 280
290 300 310 320 330 340
GI||SP NEAAKRSRDARRLKENQISVRAAFLEKENALLRQEVVAVRQELSHYRAVLSRYQAQHGAL
+ + N+I F E V + S Q+
8325 SLPTNLQVKDKIEDLNEIKFFNDFEKLEFFNKYAKVNTNNDVNENNDLWNSYLQSMDDTT
290 300 310 320 330 340
8291 hypothetical orf 79.4 ykr090w kd protein in prp16 (706 aa)
initn: opt: 595 z-score: 102.6 E(): 5
Smith-Waterman score: 603; 13.1% identity in 251 aa overlap
16.3% noncontradicting positions, 3.2% class identity
10 20 30 40 50
GI||SP XBINDINGPRTEINTAXREBMARPVSDRTPAXXXXXXXXXXXXXXXXXXGLRSLLQG
+ + R + S +G
8291 TKPRNPFSSQRNASTGSLQASVKSPPITRQRNVSAAPSVPVTMKSAYTASSKSAYSSVKG
30 40 50 60 70 80
60 70 80 90 100 110
GI||SP TSKPKEPASCLLKEKERKXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX
S P -- L ER+
8291 ESDIYPPP--VLENSERRSVTPPKNSNFTSSRPSDISRSISRPSERASQEDPFRFERDLD
90 100 110 120 130 140
120 130 140 150 160 170
GI||SP XXXXXXXXERTL--PFGDVEYVDLDAFXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX
R -- + E+ D F
8291 RQAEQYAASRHTCKSPANKEFQAADNFPFNFEQEDAGNTEREQDLSPIERSFMMLTQNDT
150 160 170 180 190 200
180 190 200 210 220 230
GI||SP XXXXXXXXXXXXXXXXXXXXGTASGHRAGLTSRDTPS---PVDPDTVEVLMTFEPDPA-D
G -- + +S + S--- D + +E L- FEPDP -
8291 ASVVNSMNQTDNRGVLDQKLGK--EQQKEESSIEYESEGQQEDENDIESL-NFEPDPKLQ
210 220 230 240 250 260
240 250 260 270 280 290
GI||SP LALSSIPGHETFDPRRHRFSEEELKPQPIMKKARKIQVPEEQKDEKYWSRRYKNNEAAKR
+ L + P ++ F ++----EE +P---K I V E +
8291 MNLENEPLQDDFPEAKQ----EEKNTEP---KIPEINVTRESNTPSLTMNALDSKIYPDD
270 280 290 300 310
300 310 320 330 340
GI||SP SRDARRLKENQISVRAAFLEKENALLRQEVVAVRQELSHYRAVLSRYQAQHGAL
+ Q S ++ L + +
8291 NFSGLESSKEQKSPGVSSSSTKVEDLSLDGLNEKRLSITSSENVETPYTATNLQVEQLIA
320 330 340 350 360 370
10618 d2045.2 (759 aa)
initn: opt: 596 z-score: 102.4 E(): 5.1
Smith-Waterman score: 596; 6.8% identity in 307 aa overlap
23.5% noncontradicting positions, 16.6% class identity
30 40 50 60 70 80
GI||SP MARPVSDRTPAXXXXXXXXXXXXXXXXXXGLRSLLQGTSKPKEPASCLLKEKERKXXXXX
L s +Q k ke +s---k e
10618 lRKAeciWLLiyIQsLahLkaksl---nnndILGAVqq
f slv sv y gk pevvs kcse hl
10 20 30
90 100 110 120 130 140
GI||SP XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXERTLPFGDVEYVDLD
E T g +d
10618 aFargLadrDEFiQDsaArGlgiVYeiadgdLKeglVegLlgslaESTAGSakrSaTgId
r mdf ten s vs k msl glggsp ksm ks mktft gst e k s
40 50 60 70 80 90
150 160 170 180 190
GI||SP AFXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX-----
-----
10618 GSVSEdTeLFEkGqLnsTPgdGkisTYqdiLnLASdlndPaLVYKFMqLArhnAlWnSrk
e k p v gt tg slt kel t evgq d s kss t s km
100 110 120 130 140 150
200 210 220 230 240
GI||SP GTASGHRAGLT--SRDTPSPVDPDTVEVLMT------FEPDPADL-ALSSIPGHETFDPR
G A G A l+--s + D t L+ ------f+Pd a -a+ sI g t d-r
10618 GaAhGlGAilSenalEEiLLKDqqtaKqLiPKLfRfRfDPdqaVqraMkdIWniLiad-r
i f f lm kssk l epyf k v y y y fvk sgs ts gt tpe s
160 170 180 190 200 210
250 260 270 280 290
GI||SP RHRFSE------EELKPQPIMKKARK--------IQVPEEQKDEKYWSRRYKNNEAAKRS
se------+EL p k R -------- q q ek+ k eaa R
10618 kntidefaNdIadELLcalanrEwRVREaaclALldLirgqdqeemhekileileaalRt
slvvslyf e lk pgmtdk y ssts sq lqshptvkfskmmpkywtmif v
220 230 240 250 260 270
300 310 320 330 340
GI||SP RDARRLKENQISVRAAFLEKENALLRQEVVAVRQELSHYRAVLSRYQAQHGAL
rD --+Ke---SVR a ++l + v ++ e s + + a
10618 rDD--iKd---SVREaadraadsiaKiiaRSIDlekgtNptKanEiLanaLPalidqggL
m v e vgtkfttvls llv vgssv sv sk f dvi fvwgphi
280 290 300 310 320
GI||SP
10618 nSdaeanrrFaLslliDLtKhaggaiKPfiadLIpdlidafSenEhqViNYLAaraanqn
k tvkevsn c ttvl v sspkql ytpk ylfmtlv si ps l lnsnqyq
330 340 350 360 370 380
8683 gravity gene cdna clone expressed gsc381 in callu (203 aa)
initn: opt: 543 z-score: 101.5 E(): 5.7
Smith-Waterman score: 543; 13.0% identity in 138 aa overlap
15.9% noncontradicting positions, 2.9% class identity
10 20 30 40
GI||SP XBINDINGPRTEINTAXREBMARPVSDRTPAXXXXXXXXXXXXXX
D G ++T r+ R + T +
8683 IRHENGEQRETHLVEEARGDHQGRVRMLSTRArQTTGRNIWLSTTSLWRVSLNSRPSCLY
c
10 20 30 40 50 60
50 60 70 80 90 100
GI||SP XXXXGLRSLLQGTSKPKEPASCLLKEKERKXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX
L SL G SK --+SC ----+
8683 QRERHLTSLTPGRSKTT--SSCTYT----RVFIMDNCEELIQSGSALSRALLILKTFLNI
70 80 90 100 110
110 120 130 140 150 160
GI||SP XXXXXXXXXXXXXXXXXXXXERTLPFGDVEYVDLDAFXXXXXXXXXXXXXXXXXXXXXXX
R GD D + F
8683 SREMLSRTRSEGDPQEPCEEVRGALLGDRREQDYNKFYEAFSKNLKLGIHEDSTNRTKIA
120 130 140 150 160 170
170 180 190 200 210 220
GI||SP XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXGTASGHRAGLTSRDTPSPVDPDTVEVLMTF
8683 ELLRYHSTKSGGELTSLKDYVTRDEGGPE
180 190 200 210 220 230
8272 regulatory protein rim1 (628 aa)
initn: opt: 584 z-score: 101.5 E(): 5.7
Smith-Waterman score: 693; 14.0% identity in 356 aa overlap
16.0% noncontradicting positions, 2.0% class identity
10 20 30 40 50
GI||SP XBINDINGPRTEINTAXREBMARPVSDRTPAXXXXXXXXXXXXXXXXXXGLRSLL
+N NG--T RE ++ +D + + R +
8272 MVPLEDLLNKENG--TAAPQHSRESIVENGTDVSNVTKKDGLPSPNLSKRSSDCSKRPRI
10 20 30 40 50
60 70 80 90 100 110
GI||SP QGTSKPKEPASCLLKEKERKXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX
T+ -----+ Lk +E
8272 RCTTE-----AIGLnGQEDERMSPGSTSSSCLPYHSsSHLNTPPYDLLGASAVSPTTpSS
k t s
60 70 80 90 100 110
120 130 140 150 160 170
GI||SP XXXXXXXXXXERTLPFGDVEYVDLDAFXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX
+ P GD----D DA
8272 SDSSSSSPLAQAHNPAGD----DDDADNDGDSEDITLYCKWDNCGMIFNQPELLYNHLCH
120 130 140 150 160
180 190 200 210 220 230
GI||SP XXXXXXXXXXXXXXXXXXXXGTASGHRAGLTSR-DTPSPVDPDTVEVLMTFEPDPADLAL
T + R +TS - P P P DL--
8272 DHVGRKSHKNLQLNCHWGDCTTKTEKRDHITSHLRVHVPLKPFGCSTCSKKFKRPQDL--
170 180 190 200 210 220
240 250 260 270 280 290
GI||SP SSIPGHETFDPRRHRFSEEELKPQPIMKKARKIQVPEEQKDEKYWSRRYKNNEAAKRSRD
------- + H S LK + K K --- +K+ S + A+ S
8272 -------KKHLKIHLESGGILKRKRGPKlGSKRT---SKKNKScASDAVSSCSASVPSaI
w s g
230 240 250 260 270
300 310 320 330 340
GI||SP ARRLKENQISVR---------AAFLEKENALLRQEVVAVRQ----ELSHYRAVLSRYQAQ
A K S ---------+ L + Q ++ Q----ELS+Y+ V---Y Q
8272 AGSFKSHSTSPQILPPLPVGISQHLPSQQQQQQQRAISLNQLCSDELSQYKPV---YSPQ
280 290 300 310 320 330
GI||SP HGAL
A
8272 LSARLQTILPPLYYNNGSTVSQGANSrSMNVYEDGCSNKTIANATQFFTKLSRNMTNNYI
q
340 350 360 370 380 390
8680 retrovirus-related gag polyprotein homolog transp (455 aa)
initn: opt: 571 z-score: 101.3 E(): 5.9
Smith-Waterman score: 757; 9.9% identity in 314 aa overlap
18.5% noncontradicting positions, 8.6% class identity
10 20 30
GI||SP XBINDINGPRTEINTAXREBMARPVSDRTPAXX
N I Gp---------- A VS T-+
8680 AWRQaAIYAYELFKnYngSsAHYQAVaiiRNKIrGa----------AGALLVSHNT-VLN
s p pk t qfl k p
130 140 150 160 170
40 50 60 70 80 90
GI||SP XXXXXXXXXXXXXXXXGLRSLLQGTSKPKEPASCLLK---EKERKXXXXXXXXXXXXXXX
LR L Qg ++ cL++---e E+K
8680 FDAILARLDCTYSDKTSLRLLRQcLElVrQGdlcLMQYYDdVEKKLTLVTNKIVMsHdQE
g m t emp e t e
180 190 200 210 220 230
100 110 120 130 140 150
GI||SP XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXERTLPFGDVEYVDLDAFXXXXXXXX
+--LP + +A
8680 GADLLNaEVRaDALHAFISGLrKaLRAVVFPAQPKD--LPSALALAREAEASIERSMFAN
r d k p
240 250 260 270 280
160 170 180 190 200 210
GI||SP XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXGTASGHRAGLTSRDT
g g+----t +Dt
8680 SYAKAlqdRAqiaannKnRaQGKqTRFNKdEQnQdRNPHFtKRqKnGPngQ----anKDn
vee hsgesg s f p e g e v p g qp tp t
290 300 310 320 330 340
220 230 240 250 260 270
GI||SP PSPVDPDTVEVLMTFEPDPADLALSSIPGHETFDPRRHRFSEEELKPQPIMKKARKIQVP
------------t P P + SS----------R R e qp A K
8680 Q------------aqAPQPMEVDSSS----------RFRQrpehyqrqaNESNAfKRrNS
te ktttvsnhp p k
350 360 370 380
280 290 300 310 320 330
GI||SP EEQKDEKYWSRRYKNNEAAKRSRDARRLKENQISVRAAFLEKENALLRQEVVAVRQELSH
e R + A s+da E + le EN + +
8680 SdRSTGqRRQRlNNVVQEApdqKdakeEfEKaAKaAVEEidSENEYAPgDDSiNFLGnaP
e p v sks epvt y t e le s l gt
390 400 410 420 430 440
340
GI||SP YRAVLSRYQAQHGAL
L+
8680 GcRsLNDGWLGE
f t
450
8621 hypothetical lipopolysaccharide core protein (298 aa)
initn: opt: 555 z-score: 101.2 E(): 5.9
Smith-Waterman score: 555; 11.8% identity in 306 aa overlap
15.7% noncontradicting positions, 3.9% class identity
10 20 30 40 50
GI||SP XBINDINGPRTEINTAXREBMARPVSDRTPAXXXXXXXXXXXXXXXXXXGL-RSLLQGTS
+ E T R + + + + +- S+ Q +
8621 MSAIENIVISMENATERRKHITKQFESKNLSFSFFNAYTYQSINQSINQSINQSINQSIN
10 20 30 40 50 60
60 70 80 90 100 110
GI||SP KPKEPASCLLK--EKERKXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX
+ ++++L --E R
8621 QSINQSNSILHNIEESRILTKGEKGCLISHFLLWNKCVNENLEYLKIFEDDVILGENAEV
70 80 90 100 110 120
120 130 140 150 160 170
GI||SP XXXXXXXXERTLPFGDVEYVDLDAFXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX
F D+ + L+ F
8621 FLNQNEWLKTRFDFNDIFIIRLETFLRPVKLEKQTKIPPFNSRNFDILKSTHWGTAGYII
130 140 150 160 170 180
180 190 200 210 220 230
GI||SP XXXXXXXXXXXXXXXXXXGTASGHRAGLTSRDTPSPVDPDTVEVLMTFEPDPADLALSSI
-----S + VD D V---++ +PA---- I
8621 SQGAAKYVIEYLKNIP-----SDEIVAVDELIFNKLVDVDNYIV---YQLNPA----ICI
190 200 210 220
240 250 260 270 280 290
GI||SP PGHETFDPRRHRFSEEELKPQPIMKKARKIQVPEEQKDEKYWSRRYKNNEAA----KRSR
++ + S E Q----K KI --- +K K R K n ----K+ +
8621 QELQANQSKSVLTSGLEKERQ----KRpKIR---KKKTLKQRLTRIKEnIIRALNRKKWK
s d
230 240 250 260 270 280
300 310 320 330 340
GI||SP DARRLKENQISVRAAFLEKENALLRQEVVAVRQELSHYRAVLSRYQAQHGAL
+ R+KE Q + F+
8621 EQQRIKEMQGKEIVRFM
290 300 310 320 330
8469 citrate beta beta-subunit lyase chain acyl subuni (289 aa)
initn: opt: 553 z-score: 101.1 E(): 6
Smith-Waterman score: 553; 16.9% identity in 136 aa overlap
23.5% noncontradicting positions, 6.6% class identity
110 120 130 140 150 160
GI||SP XXXXXXXXXXXXXXXXXXXERTLPFGDVEYVDLDAFXXXXXXXXXXXXXXXXXXXXXXXX
E + + L AF
8469 GSTKLMAAIESALGVVNAVEIARASPRLAAIALAAFDYVMDMGTSRGDGTELFYARCAVL
120 130 140 150 160 170
170 180 190 200 210 220
GI||SP XXXXXXXXXXXXXXXXXXXXXXXXXXXXXGTASGHRAGLTSRDTPSPVDPDTVEVL-MTF
------A + A d S V+P +E L- +
8469 HAARVAGIAAYDVVWSDINNEEGFL------AEANLAKNLGFnGKSLVNPRQIELLHQVY
d
180 190 200 210 220 230
230 240 250 260 270 280
GI||SP EPDP--ADLALSSIPGHETFDPRRHRFSEEELKPQ--PIMKKARKIQVPEEQKDEKYWSR
P --+D AL I E + R K --PI+ ARK+
8469 APTRKEVDHALEVIAAAEEAETRGLGVVSLNGKMIDGPIIDHARKVVALSASGIRD
240 250 260 270 280 290
290 300 310 320 330 340
GI||SP RYKNNEAAKRSRDARRLKENQISVRAAFLEKENALLRQEVVAVRQELSHYRAVLSRYQAQ
8469
300 310 320 330 340 350
4596 dna-directed homolog rna polymerase cds kd 30 e4l (259 aa)
initn: opt: 547 z-score: 100.7 E(): 6.3
Smith-Waterman score: 547; 11.7% identity in 239 aa overlap
15.9% noncontradicting positions, 4.2% class identity
10 20 30 40 50
GI||SP XBINDINGPRTEINTAXREBMARPVSDRTPAXXXXXXXXXXXXXXXXXXG-LR
++ + T RE ++ V D G- +
4596 MENVYISSYSSNEQTSMAVaATnIRELLSQYVDDANLEDLIEWAMEKSSKYYIKNIGNTK
t d
10 20 30 40 50 60
60 70 80 90 100 110
GI||SP SLLQGTSKPKEPASCLLKEKERKXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX
S ++ T + K+ +
4596 SNIEETKFESKNNIGIEYSKDSRNKLSYRNKPfIATNLEYKTLCDMIKGTSGTEKEFLRY
s
70 80 90 100 110 120
120 130 140 150 160
GI||SP XXXXXXXXXXXXXERTLPFGDVEYVD----LDAFXXXXXXXXXXXXXXXXXXXXXXXXXX
DV Y D----Ld
4596 LLFGIKCIKKGVEYNIDKIKDVSYNDYFNVLnEKYNTPCPNCKSRNTTPMMIQTRAADEP
d
130 140 150 160 170 180
170 180 190 200 210 220
GI||SP XXXXXXXXXXXXXXXXXXXXXXXXXXXGTASGHR----AGLTSRDTPSPVDPDTVEVLMT
T S H ---- + + PSP--P++ E ---
4596 PLVRHACRDCKQHFKPPKFRAFRNLNVTTQSIHeNKEITEILPDNNPSP--PESPEP---
k
190 200 210 220 230
230 240 250 260 270 280
GI||SP FEPDPADLALSSIPGHETFDPRRHRFSEEELKPQPIMKKARKIQVPEEQKDEKYWSRRYK
-- P D L +----TFD ++E
4596 --ASPIDDGLIRa----TFDRNDEPPEDDE
v
240 250 260 270 280
8338 endoglucanase a endo-1 precursor 4-beta-glucanase (327 aa)
initn: opt: 555 z-score: 100.7 E(): 6.4
Smith-Waterman score: 555; 44.7% identity in 38 aa overlap
44.7% noncontradicting positions, 0.0% class identity
260 270 280 290 300 310
GI||SP SEEELKPQPIMKKARKIQVPEEQKDEKYWSRRYKNNEAAKRSRDARRLKENQISVRAAFL
S + + R Y NE--KR R +RR E---S R----
8338 SYLLGDNGSKKSYVVGFSKNGANAPSRPHHRGYYANE--KRWRRSRRCSE---SSR----
270 280 290 300 310
320 330 340
GI||SP EKENALLRQEVVAVRQELSHYRAVLSRYQAQHGAL
-KE AL R +
8338 -KEQALGRYDCWRLY
320 330 340
9529 b2 hypothetical protein (106 aa)
initn: opt: 513 z-score: 100.5 E(): 6.5
Smith-Waterman score: 528; 27.5% identity in 91 aa overlap
27.5% noncontradicting positions, 0.0% class identity
230 240 250 260 270
GI||SP EPDPADLALSSIPGHETFDPRRHRFSEEELKPQPIMKKARKIQVPEEQ------KDEKYW
K I IQ E ------+D
9529 MPSKLALIQELPDrIQTAVEAAMGMSYQDAPNN
p
10 20 30
280 290 300 310 320
GI||SP SRRYKNNEAA-------KRSRDARRLKENQISVRAAFLE---KENALLRQEVVAVRQELS
RR +N A------- SR + L E V--A+LE--- E A E + ELS
9529 VRRDLDNLHACLNKAKLTVSRMVTSLLEKPSVV--AYLEGKAPEEAKPTLEERLRKLELS
40 50 60 70 80 90
330 340
GI||SP HYRAVLSRYQAQHGAL
H
9529 HSLPTTGSDPPPAKL
100
3694 major paraflagellar rod protein component pfr par (561 aa)
initn: opt: 569 z-score: 99.8 E(): 7.2
Smith-Waterman score: 767; 11.6% identity in 310 aa overlap
19.0% noncontradicting positions, 7.4% class identity
30 40 50 60 70 80
GI||SP SDRTPAXXXXXXXXXXXXXXXXXXGLRSLLQGTSKPKEPASCLLKEKERKXXXXXXXXXX
+ Q ts K+ L + E
3694 LiaDKFRiIgqcEdENqaFgrIqdVQKqanQEsaaiKDAKRRLKQrCEdDLrniHDAIQK
vq l skt e kp sk he ksf tsqm h t khl
s s
230 240 250 260 270 280
90 100 110 120 130 140
GI||SP XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXERTLPFGDVEYVDLDAFXXX
ER L E d
3694 ADlEDAEAmKRhAanKEKSdrfIrENeDrQdEaWrrIQdLERqLQrLGTERFdEVKRRIE
m t f tq eky q l k e t nk e v k e
h
290 300 310 320 330 340
150 160 170 180 190 200
GI||SP XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXGTASGHRAGL
+A R
3694 EnDREEKRrVEYqQFLdVagQHKKLLELsVYNCDLAiRCiGllEEiVaEGCaAiKaRHDK
i k s e cs t l m mm l s s v s
m t v
350 360 370 380 390 400
210 220 230 240
GI||SP TSRDTPSPVDPDTVEVLMTF--------------EPDPADLALSSIPGH-------ETFD
Ts d E L F--------------E ++ + H-------ETFD
3694 TnddLaaLRLdVHqEhLEaFRrLYlTLGqLiYKKEKRlEEIDRNIRTTHIQLEFaiETFD
sqe gd q k y y m k s v m cv
e s
410 420 430 440 450 460
250 260 270 280 290
GI||SP PRRHRFS-------------EEELKPQPIMKKARKIQVPEEQKDEKYWSRRYKNNEAAKR
P s-------------EEEL ----M K + q Ee k s r -
3694 PNAKlHaDaKKdLYrLRaqVEEELa----MLKdKqAqALEeFgesEdalnraghrfqhP-
k s k e k qg e e m k m kpt evsgqcwidvvp
m e
470 480 490 500 510 520
300 310 320 330 340
GI||SP SRDARRLKENQISVRAAFLEKENALLRQEVV---AVRQELSHYRAVLSRYQAQHGAL
---a e + R+ E L +qE V---A R+El
3694 ---adEnndenldRRSKMVEYRaHLaKqEEVKIAAEREEiKRa
ce veegvmt s t e l s
s
530 540 550 560 570
Library scan: 0:05:43 total CPU time: 0:06:55
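
The per-hit summary lines in the listing above (entry line, "initn/opt/z-score/E()" line, "Smith-Waterman score" line, and "noncontradicting positions" line) follow a regular layout, so they can be pulled into a table for sorting or filtering. The following is a minimal sketch in Python, not part of FASTA-SWAP itself. The names Hit, parse_hits, and sums_match are hypothetical, and the regular expressions are assumptions based only on the lines shown in this output (the initn value is blank here and is ignored). The check that "noncontradicting positions" is roughly the sum of "identity" and "class identity" is an observation from the hits in this report, not a documented definition.

```python
import re
from dataclasses import dataclass
from typing import List


@dataclass
class Hit:
    """One library hit, as summarized in the four lines above its alignment."""
    entry_id: str
    description: str
    length: int
    opt: int
    z_score: float
    e_value: float
    sw_score: int
    identity: float          # percent
    overlap: int             # residues
    noncontradicting: float  # percent
    class_identity: float    # percent


# Patterns are assumptions inferred from this report's layout only.
HEADER = re.compile(r"^(\d+)\s+(.*?)\s*\((\d+) aa\)$")
STATS = re.compile(
    r"^initn:\s*(\d*)\s*opt:\s*(\d+)\s+z-score:\s*([\d.]+)\s+E\(\):\s*(\S+)$"
)
SW = re.compile(
    r"^Smith-Waterman score:\s*(\d+);\s*([\d.]+)% identity in (\d+) aa overlap$"
)
CLASSES = re.compile(
    r"^([\d.]+)% noncontradicting positions,\s*([\d.]+)% class identity$"
)


def parse_hits(text: str) -> List[Hit]:
    """Collect every run of four consecutive lines that matches a hit summary."""
    lines = [ln.strip() for ln in text.splitlines()]
    hits: List[Hit] = []
    for i in range(len(lines) - 3):
        h = HEADER.match(lines[i])
        s = STATS.match(lines[i + 1])
        w = SW.match(lines[i + 2])
        c = CLASSES.match(lines[i + 3])
        if not (h and s and w and c):
            continue  # alignment rows and rulers fall through here
        hits.append(
            Hit(
                entry_id=h.group(1),
                description=h.group(2),
                length=int(h.group(3)),
                opt=int(s.group(2)),
                z_score=float(s.group(3)),
                e_value=float(s.group(4)),
                sw_score=int(w.group(1)),
                identity=float(w.group(2)),
                overlap=int(w.group(3)),
                noncontradicting=float(c.group(1)),
                class_identity=float(c.group(2)),
            )
        )
    return hits


def sums_match(hit: Hit, tol: float = 0.15) -> bool:
    """True if identity + class identity is within tol of noncontradicting."""
    return abs(hit.identity + hit.class_identity - hit.noncontradicting) <= tol


# Two summaries copied from the report above.
SAMPLE = """\
10618 d2045.2 (759 aa)
initn: opt: 596 z-score: 102.4 E(): 5.1
Smith-Waterman score: 596; 6.8% identity in 307 aa overlap
23.5% noncontradicting positions, 16.6% class identity
8338 endoglucanase a endo-1 precursor 4-beta-glucanase (327 aa)
initn: opt: 555 z-score: 100.7 E(): 6.4
Smith-Waterman score: 555; 44.7% identity in 38 aa overlap
44.7% noncontradicting positions, 0.0% class identity
"""

if __name__ == "__main__":
    for hit in sorted(parse_hits(SAMPLE), key=lambda x: x.e_value):
        print(hit.entry_id, hit.z_score, hit.e_value, sums_match(hit))
```

Matching on four consecutive lines, rather than on the entry line alone, keeps the sketch from mistaking ruler or alignment rows for hit headers. A hit whose percent identity is high over a very short overlap (for example entry 8338, 44.7% over 38 aa) can then be filtered on the overlap field rather than on identity alone.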
Kim C. Worley, Human Genome Center, Baylor College of Medicine
kworley@bcm.tmc.edu