Haemophilus influenzae Genome
Sequence Search
>Query : 328 aa
vs library
using protein matrix
520611 residues in 1706 sequences
statistics exclude scores greater than 73
mean initn score: 27.4 (7.00)
mean init1 score: 26.9 (5.85)
1706 scores better than 1 saved, ktup: 2, variable pamfact
joining threshold: 28
The best scores are: initn init1 opt
HI0959 beta-hexosaminidase (exoII) {Vibrio furnissii} 815 330 851
HI0712 hemoglobin-binding protein {Haemophilus ducreyi 61 38 46
HI0182 sugar kinase, putative {Streptomyces coelicolor 60 60 60
HI1643 conserved hypothetical protein {Escherichia col 59 59 66
HI1699 lipopolysaccharide biosynthesis protein, putati 59 47 48
HI0604 adenylate cyclase (cyaA) {Haemophilus influenza 58 39 43
HI1023 transketolase 1 (tktA) {Escherichia coli} 57 42 47
HI0658 ABC transporter, ATP-binding protein {Escherich 56 42 47
HI0110 seryl-tRNA synthetase (serS) {Escherichia coli} 56 56 59
HI0723 TRK system potassium uptake protein (trkH) {Esc 56 42 48
HI0270 conserved hypothetical protein {Escherichia col 54 41 48
HI0444 DNA topoisomerase III (topB) {Escherichia coli} 54 36 52
HI0936 cytochrome C-type biogenesis {Escherichia coli} 53 43 43
HI1001 inner membrane protein, 60 kDa (yidC) {Escheric 53 43 52
HI1377 exodeoxyribonuclease I (sbcB) {Escherichia coli 52 42 42
HI0183 amino acid carrier protein, putative {Bacillus 52 37 39
HI1677 conserved hypothetical protein {Escherichia col 52 43 43
HI0859 ATP-dependent Clp protease, ATPase subunit (clp 52 37 48
HI1104 transporter protein {Acinetobacter calcoaceticu 51 51 64
HI1248 hypothetical protein 51 41 45
HI1620 hypothetical protein 51 51 52
HI0558 glucose-6-phosphate 1-dehydrogenase (zwf) {Haem 50 31 34
HI1534 conserved hypothetical protein {Borrelia burgdo 50 50 67
HI1130 conserved hypothetical protein {Escherichia col 50 50 51
HI1731 conserved hypothetical protein {Escherichia col 50 37 41
HI0053 zinc-type alcohol dehydrogenase {Bacillus subti 50 37 39
HI1692 molybdenum ABC transporter, permease protein (m 50 39 39
HI1655 antigen {Pasteurella haemolytica} 49 35 38
HI1315 hypothetical protein 49 35 49
HI1664 conserved hypothetical protein {Mycoplasma pneu 49 39 49
HI0583 2',3'-cyclic-nucleotide 2'-phosphodiesterase (c 49 38 49
HI0072 conserved hypothetical protein {Escherichia col 48 48 57
HI0676 integrase/recombinase (xerC) {Escherichia coli} 48 48 54
HI1011 conserved hypothetical protein {Escherichia col 48 39 39
HI1311 phenylalanyl-tRNA synthetase, alpha subunit (ph 48 48 48
HI1541 protease IV (sppA) {Escherichia coli} 48 35 35
HI1258 transcription-repair coupling factor (mfd) {Esc 47 34 54
HI1478 transposase (muA) {Bacteriophage mu} 47 37 37
HI0913 ribosomal protein S2 (rpS2) {Escherichia coli} 47 37 39
HI0942 exodeoxyribonuclease V, gamma chain (recC) {Esc 47 33 41
HI1254 conserved hypothetical protein {Escherichia col 47 35 52
HI0977 cell filamentation protein (fic) {Escherichia c 47 47 52
HI1551 biotin synthesis protein BioC, putative {Escher 47 33 36
HI0144 glucose kinase, putative {Escherichia coli} 46 46 53
HI0811 argininosuccinate lyase (argH) {Escherichia col 46 37 37
HI0691 glycerol kinase (glpK) {Escherichia coli} 46 33 40
HI0763 transcriptional regulator (nadR) {Escherichia c 46 33 39
HI1739 transcriptional activator (metR) {Escherichia c 46 33 51
HI1217 transferrin-binding protein, putative {Neisseri 46 35 43
HI0549 dimethyladenosine transferase (ksgA) {Escherich 46 46 47
HI0959 beta-hexosaminidase (exoII) {Vibrio furnissii}
47.9% identity in 338 aa overlap
10 20 30 40 50 60
Query MGPLWLDVEGCELTAEDREILAHPTVGGVILFARNYHDNQQLLALNTAIRQAAKRPILIG
:..: .:..: ::..:. :.:.:: :.:.:::.::.....:. .: ..:: .:.:.::.
HI0959 MSSLLIDLKGKELEQEEVELLSHPLVAGLILFTRNFENREQIQELIRSVRQRVKKPLLIT
10 20 30 40 50 60
70 80 90 100 110
Query VDQEGGRVQLSRR-VQQDPCAQLYARSDNGTQ---LAEDGGWLMAAELIAHDIDLSFAPV
::::::::: : :. : .. . ..:. .:...:: ::::.:: :::::::::
HI0959 VDQEGGRVQRFRDGFTMLPSMQAFQETLSATEQVSFAKEAGWQMAAEMIALDIDLSFAPV
70 80 90 100 110 120
120 130 140 150 160 170
Query LDKGFDCRAIGNRAFGDDVQTVLTYSSAYMRGMKSVGMATTGKHFPGHGAVIADSHLETP
:: : .:::::.:.:..::..... ..:.. ::. .:::.::::::::: :.::::::::
HI0959 LDLGHECRAIGDRSFSSDVKSAVNLATAFIDGMHQAGMASTGKHFPGHGHVLADSHLETP
130 140 150 160 170 180
180 190 200 210 220 230
Query YDER---DSIADDMTIFRAQIEAGILDAMMPAHVIYPHYDAQPASGSPYWLKQVLRQELG
::.: . ...:. :.. :... X::.:::::::...:.:::::: ::::..::..:.
HI0959 YDDRTKEEIFSGDLQPFQQLISQNKLDAIMPAHVIYSQCDSQPASGSKYWLKEILRKKLN
190 200 210 220 230 240
240 250 260 270 280
Query FQGIVFSDDLSMEGAAIMGGPAERAQQSLDAGCDMVLMCNKRESAVAVLDQLP-------
:::..:::::.:.::..::. .::....:.::::..:.::.::....:::.: v
HI0959 FQGTIFSDDLGMKGAGVMGNFVERSKKALNAGCDLLLLCNEREGVIQVLDNLKLTENQPH
250 260 270 280 290 300
290 300 310 320
Query -ISVVPQAQSLLKQQQFTYRELKATERWKQAYQALQRLIDAHS
.^ .. :::.:.. .....: ...::. .:: :. .
HI0959 FMARQARLQSLFKRRVINWNDLISDQRWRLNYQKLADIQSRWLDIQAAKND
310 320 330 340 350
HI0712 hemoglobin-binding protein {Haemophilus ducreyi}
12.8% identity in 47 aa overlap
160 170 180 190 200 210
Query HFPGHGAVIADSHLETPYDERDSIADDMTIFRAQIEAGILDAMMPAHVIYPHYDAQPASG
.. ...... .. .. .. :. . .
HI0712 FKKFGPKDYVYGSKYSKPADYTDCTYNSDCYKKNFKDNLALLLRKTDYKHHSYNLGLNLD
700 710 720 730 740 750
220 230 240 250 260 270
Query SPYWLKQVLRQELGFQGIVFSDDLSMEGAAIMGGPAERAQQSLDAGCDMVLMCNKRESAV
.. X:. :. . :X..
HI0712 PTDWLRVQLKYANGFRAPTSDEIYMTFKHPQFSIQPNTDLKAETSKTKEVAFTFYKNSSY
760 770 780 790 800 810
HI0182 sugar kinase, putative {Streptomyces coelicolor}
35.7% identity in 42 aa overlap
10 20 30 40 50
Query MGPLWLDVEGCELTAEDREILAHPTVGGVILFARNYHDNQQLLALNTAIRQAAKR
X:.: : .... ..: :: ..:: :...
HI0182 ERVPTPKTDYEEWLNTIVDLVNRADEKFGEVGTVGLGVPGFVNQQTGLAEIANIRVADNK
40 50 60 70 80 90
60 70 80 90 100 110
Query PILIGVDQEGGRVQLSRRVQQDPCAQLYARSDNGTQLAEDGGWLMAAELIAHDIDLSFAP
::: ... :X
HI0182 PILCDLSTRLGREVRAENDANCFALSEAWDTENQQYSTVLGLILGTGFGGGFVLNGKVHS
100 110 120 130 140 150
HI1643 conserved hypothetical protein {Escherichia coli}
25.0% identity in 44 aa overlap
170 180 190 200 210 220
Query VIADSHLETPYDERDSIADDMTIFRAQIEAGILDAMMPAHVIYPHYDAQPASGSPYWLKQ
.X: .: ...:::....: ..: .
HI1643 LIFIAIIAVLANYLGSTDFSHHYHISALIIAILLGMAIGNTIYPQFSSQVEKGVLFAKGT
10 20 30 40 50 60
230 240 250 260 270 280
Query VLRQELGFQGIVFSDDLSMEGAAIMGGPAERAQQSLDAGCDMVLMCNKRESAVAVLDQLP
.:X... . :. ..
HI1643 LLRAGIVLYGFRLTFGDIADVGLNAVVTDAIMLISTFFLTALLGIRYLKMDKQLVYLTGA
70 80 90 100 110 120
HI1699 lipopolysaccharide biosynthesis protein, putative {Neisseria meningitidi
s}
27.3% identity in 22 aa overlap
10 20 30
Query MGPLWLDVEGCELTAEDREILAHPTVGGVILF
.X: . .::...: . ....X
HI1699 YSVNKLFKKIKKHYTVYPNYKNIVSNIEPISLWDNQIDCEIDGEVSFFIGQPLLNTKEEN
150 160 170 180 190 200
40 50 60 70 80 90
Query ARNYHDNQQLLALNTAIRQAAKRPILIGVDQEGGRVQLSRRVQQDPCAQLYARSDNGTQL
HI1699 ISLIKKLKDQIPFDYYFPHPAEDYRVDGVNYVESELIFEDYVFKHLSNXKIIIYTFFSSV
210 220 230 240 250 260
HI0604 adenylate cyclase (cyaA) {Haemophilus influenzae}
42.1% identity in 19 aa overlap
270 280 290 300 310 320
Query AGCDMVLMCNKRESAVAVLDQLPISVVPQAQSLLKQQQFTYRELKATERWKQAYQALQRL
X::. .:::X.. .. :
HI0604 INLTTDPTSKVEEVLTGISSRDLFSFGSLEQSLVGSIDFTYRNVWNEIRTLHFEGQNAIL
570 580 590 600 610 620
Query IDAHS
HI0604 LALKVLSNKIYRGVNRPDSIQVYCYSERYRQDLRQLVMGLVNRCVSIQVGDIQQPCQTSR
630 640 650 660 670 680
HI1023 transketolase 1 (tktA) {Escherichia coli}
31.8% identity in 22 aa overlap
90 100 110 120 130 140
Query TQLAEDGGWLMAAELIAHDIDLSFAPVLDKGFDCRAIGNRAFGDDVQTVLTYSSAYMRGM
X:. :.:: ..:X..... ..
HI1023 AAYRESVLPAAVTKRVAIEAGIADFWYKYVGFNGRVIGMNSFGESAPADQLFKLFGFTVE
600 610 620 630 640 650
150 160 170 180 190 200
Query KSVGMATTGKHFPGHGAVIADSHLETPYDERDSIADDMTIFRAQIEAGILDAMMPAHVIY
HI1023 NVVAKAKEIL
660
HI0658 ABC transporter, ATP-binding protein {Escherichia coli}
27.3% identity in 44 aa overlap
260 270 280 290 300 310
Query PAERAQQSLDAGCDMVLMCNKRESAVAVLDQLPISVVPQAQSLLKQQQFTYRELKAT---
.:. ... ... :: ... .. .:.:.
HI0658 AASLLHGLGFSQEETIQPVKAFSGGWRMRLNLAQALLCPSDLLLLDEPTNHLDLDAVIWL
130 140 150 160 170 180
320
Query ERWKQAYQALQRLIDAHS
X:: .::. :X.
HI0658 ERWLVQYQGTLVLISHDRDFLDPIVTKILHIENQKLNEYTGDYSSFEVQRATKLAQQTAM
190 200 210 220 230 240
HI0110 seryl-tRNA synthetase (serS) {Escherichia coli}
34.3% identity in 35 aa overlap
210 220 230 240 250 260
Query IYPHYDAQPASGSPYWLKQVLRQELGFQGIVFSDDLSMEGAAIMGGPAERAQQSLDAGCD
. . : X::. . ..:.::.. : :. .
HI0110 PCFRSEAGSYGRDTRGLIRMHQFDKVEMVQIVDPDKSMEALEELTGHAEKVLQLLNLPYR
270 280 290 300 310 320
270 280 290 300 310 320
Query MVLMCNKRESAVAVLDQLPISVVPQAQSLLKQQQFTYRELKATERWKQAYQALQRLIDAH
.::.X.
HI0110 KVLLCTGDMGFGSCKTYDLEVWVPAQNTYREISSCSNMWDFQARRMQARCKAKGDKKTRL
330 340 350 360 370 380
HI0723 TRK system potassium uptake protein (trkH) {Escherichia coli}
26.1% identity in 23 aa overlap
280 290 300 310 320
Query RESAVAVLDQLPISVVPQAQSLLKQQQFTYRELKATERWKQAYQALQRLIDAHS
..... X:: ..:..:X...
HI0723 DIDNLPPFIGLLLVISAVIGGCGGSTTGGLKAIRTLILWKQIDRELHSLIHPNLVQPIRI
330 340 350 360 370 380
HI0723 GKNRLAPRMIESIWAFFIIFILVYWGCVFAVILCGMNTFDAMGAVFATLTNAGPGLGFIH
390 400 410 420 430 440
HI0270 conserved hypothetical protein {Escherichia coli}
30.8% identity in 26 aa overlap
260 270 280 290 300 310
Query RAQQSLDAGCDMVLMCNKRESAVAVLDQLPISVVPQAQSLLKQQQFTYRELKATERWKQA
.. ...X:::....:X. .: :
HI0270 HPKCLAENAIRAIDLGSHGIDLNCGCPSKTVNGSNGGAALLKQPELIYRATQALRRAVPS
80 90 100 110 120 130
320
Query YQALQRLIDAHS
HI0270 EFPVSVKVRLGWDDISQAFEIADAVEQGGATEITVHGRTKADGYRADRINWKKISEVRER
140 150 160 170 180 190
HI0444 DNA topoisomerase III (topB) {Escherichia coli}
15.1% identity in 86 aa overlap
10 20 30
Query MGPLWLDVEGCELTAEDREILAHPTVGGVIL
..X: ..:X ..: . . .... ..
HI0444 FQPKDFFEVQAWVNPESKEEKTPEKSTALFSALWQPSKACEDYQDDDGRVLSKGLAEKVV
220 230 240 250 260 270
40 50 60 70 80
Query FARNYHDNQQLLALNTAIRQAAKRPILIGVD----QEGGRVQLSRRVQQDPCAQLYARSD
.. .... .. ...:. .. :. ... ... : .: .. :.:..::.
HI0444 --KRITNQPAEVTEYKDVREKETAPLPYSLSALQIDAAKRFGMSAQAVLDTCQRLYETHR
280 290 300 310 320 330
90 100 110 120 130 140
Query NGTQLAEDGGWLMAAELIAHDIDLSFAPVLDKGFDCRAIGNRAFGDDVQTVLTYSSAYMR
HI0444 LITYPRSDCRYLPEEHFAERHNVLNAISTHCEAYQVLPNVILTEQRNRCWNDKKVEAHHA
340 350 360 370 380 390
HI0936 cytochrome C-type biogenesis {Escherichia coli}
55.6% identity in 9 aa overlap
70 80 90 100 110 120
Query RVQLSRRVQQDPCAQLYARSDNGTQLAEDGGWLMAAELIAHDIDLSFAPVLDKGFDCRAI
X::. ..:X
HI0936 FPIITAILILMVIVLSIRKGQFDRTLLIRCGWLLIPSLILAGLMIWQQLRNNSALHFHAF
400 410 420 430 440 450
130 140 150 160 170 180
Query GNRAFGDDVQTVLTYSSAYMRGMKSVGMATTGKHFPGHGAVIADSHLETPYDERDSIADD
HI0936 AFVLLTLAIWLLFVTLWQNWRQIRLSQFGMILAHCGVAIVTIGAVMSGYFGSEIGVRLAP
460 470 480 490 500 510
HI1001 inner membrane protein, 60 kDa (yidC) {Escherichia coli}
22.2% identity in 27 aa overlap
70 80 90 100 110 120
Query VQLSRRVQQDPCAQLYARSDNGTQLAEDGGWLMAAELIAHDIDLSFAPVLDKGFDCRAIG
X:... .. . .: . . .:X...:
HI1001 TDPTQQKVMNFMPLVFMFFFLWFPSGLVLYWLVSNLITIAQQQLIYRGLEKKGLHSRKK
490 500 510 520 530 540
130 140 150 160 170 180
Query NRAFGDDVQTVLTYSSAYMRGMKSVGMATTGKHFPGHGAVIADSHLETPYDERDSIADDM
HI1377 exodeoxyribonuclease I (sbcB) {Escherichia coli}
36.0% identity in 25 aa overlap
280 290 300 310 320
Query KRESAVAVLDQLPISVVPQAQSLLKQQQFTYRELKATERWKQAYQALQRLIDAHS
X:. : ... . ..::::...:X
HI1377 DKRILELLFHYRARHFYKTLTRAEQIKWKKYRQNKLEKSAVEFEASLQRLVEXHSDNSEK
400 410 420 430 440 450
HI1377 LSLLQQVYEYGIKLLG
460 470
HI0183 amino acid carrier protein, putative {Bacillus subtilis}
22.2% identity in 18 aa overlap
60 70 80 90 100 110
Query AKRPILIGVDQEGGRVQLSRRVQQDPCAQLYARSDNGTQLAEDGGWLMAAELIAHDIDLS
X: ....... .. :X.
HI0183 NALQYHIGEFGAHFLAFILLLFAYSSIIGNYAYAESNIRFIKNKPWLVLLFRLMVLFFVY
350 360 370 380 390 400
120 130 140 150 160 170
Query FAPVLDKGFDCRAIGNRAFGDDVQTVLTYSSAYMRGMKSVGMATTGKHFPGHGAVIADSH
HI0183 FGAVRSGNVVWNFADTVMAVMAIINLIAILMLSPIVWKLMKDYQRQLKEGKTPEFKIDEY
410 420 430 440 450 460
HI1677 conserved hypothetical protein {Escherichia coli}
38.5% identity in 13 aa overlap
290 300 310 320
Query DQLPISVVPQAQSLLKQQQFTYRELKATERWKQAYQALQRLIDAHS
X:. : ... .:X.
HI1677 EMEHDEAGQDVEVIKSLTNNCTPPADACFSWKALYSGINEFIDDLMHHIHLENNILFPRV
160 170 180 190 200 210
HI1677 LNEK
220
HI0859 ATP-dependent Clp protease, ATPase subunit (clpB) {Escherichia coli}
16.7% identity in 60 aa overlap
10 20 30
Query MGPLWLDVEGCELTAEDREILAHPTVGGVI
.. . .: :. . . .:......:. ..
HI0859 RAGLSDPNRPIGSFLFLGPTGVGKTELCKTLAKFLFDSEDAMVRIDMSEFMEKHSVSRLV
590 600 610 620 630 640
40 50 60 70 80 90
Query LFARNYHDNQQLLALNTAIRQAAKRPILIGVDQEGGRVQLSRRVQQDPCAQLYARSDNGT
...: . .. :..:.:. . . ::..
HI0859 GAPPGYVGYEEGGYLTEAVRRRPYSVILLDEVEKAHADVFNILLQVLDDGRLTDGQGRTV
650 660 670 680 690 700
HI1104 transporter protein {Acinetobacter calcoaceticus}
20.3% identity in 64 aa overlap
150 160 170 180 190 200
Query YSSAYMRGMKSVGMATTGKHFPGHGAVIADSHLETPYDERDSIADDMTIFRAQIEAGILD
.. .:. ....:..:.. :. :.. .
HI1104 IGWRGMFLVGIFPAFVAWFLRSHLHEPEIFTQKQTALSTQSSFTDKLRSFQLLIKDKATS
170 180 190 200 210 220
210 220 230 240 250 260
Query AMMPAHVIYPHYDAQPASGSPYWLKQVLRQELGFQGIVFSDDLSMEGAAIMGGPAERAQQ
. . :. . .. : X: . :...::X
HI1104 KISLGIVVLTSVQNFGYYGIMIWLPNFLSKQLGFSLTKSGLWTAVTVCGMMAGIWIFGQL
230 240 250 260 270 280
270 280 290 300 310 320
Query SLDAGCDMVLMCNKRESAVAVLDQLPISVVPQAQSLLKQQQFTYRELKATERWKQAYQAL
HI1104 ADRIGRKPSFLLFQLGAVISIVVYSQLTDPDIMLLAGAFLGMFVNGMLGGYGALMAEAYP
290 300 310 320 330 340
HI1248 hypothetical protein
20.0% identity in 75 aa overlap
140 150 160 170 180 190
Query GDDVQTVLTYSSAYMRGMKSVGMATTGKHFPGHGAVIADSHLETPYDERDSIADDMTIFR
:::: : .:.:.: .... . .. .....
HI1248 NLHQIQNNSIKAGTTLIFASFVYGVLHALGPGHGKFIIASYLST-HESQLKQSTILSLLS
50 60 70 80 90 100
200 210 220 230 240 250
Query AQIEAGILDAMMPAHVIYPHYDAQPASGSPYWLKQVLRQELGFQGIVFSDDLSMEGAAIM
. ... . . . :. . ... . :.^::... ^ : : : v v
HI1248 SLMQGIVAITATTLLVVVLNLSSRYFKLSQLWLERTALLLLVFLGCYWIWQGLRAYRKKA
110 120 130 140 150 160
260 270 280 290 300 310
Query GGPAERAQQSLDAGCDMVLMCNKRESAVAVLDQLPISVVPQAQSLLKQQQFTYRELKATE
HI1248 KLAIKSLNPLPLHEKSAVKNNRTFQPNTCSCGHQHLPSPTQTAQATNLKSQFLVILTIGM
170 180 190 200 210 220
 |
| |
|
 |
|
|