Haemophilus influenzae Genome

Sequence Search

 >Query : 328 aa
 vs  library
 using protein matrix

 520611 residues in  1706 sequences
 statistics exclude scores greater than 73
 mean initn score:  27.4 (7.00)
 mean init1 score:  26.9 (5.85)
 1706 scores better than 1 saved, ktup: 2, variable pamfact
 joining threshold: 28
The best scores are:                                  initn init1   opt
HI0959 beta-hexosaminidase (exoII) {Vibrio furnissii}   815   330   851
HI0712 hemoglobin-binding protein {Haemophilus ducreyi   61    38    46
HI0182 sugar kinase, putative {Streptomyces coelicolor   60    60    60
HI1643 conserved hypothetical protein {Escherichia col   59    59    66
HI1699 lipopolysaccharide biosynthesis protein, putati   59    47    48
HI0604 adenylate cyclase (cyaA) {Haemophilus influenza   58    39    43
HI1023 transketolase 1 (tktA) {Escherichia coli}         57    42    47
HI0658 ABC transporter, ATP-binding protein {Escherich   56    42    47
HI0110 seryl-tRNA synthetase (serS) {Escherichia coli}   56    56    59
HI0723 TRK system potassium uptake protein (trkH) {Esc   56    42    48
HI0270 conserved hypothetical protein {Escherichia col   54    41    48
HI0444 DNA topoisomerase III (topB) {Escherichia coli}   54    36    52
HI0936 cytochrome C-type biogenesis {Escherichia coli}   53    43    43
HI1001 inner membrane protein, 60 kDa (yidC) {Escheric   53    43    52
HI1377 exodeoxyribonuclease I (sbcB) {Escherichia coli   52    42    42
HI0183 amino acid carrier protein, putative {Bacillus    52    37    39
HI1677 conserved hypothetical protein {Escherichia col   52    43    43
HI0859 ATP-dependent Clp protease, ATPase subunit (clp   52    37    48
HI1104 transporter protein {Acinetobacter calcoaceticu   51    51    64
HI1248 hypothetical protein                              51    41    45
HI1620 hypothetical protein                              51    51    52
HI0558 glucose-6-phosphate 1-dehydrogenase (zwf) {Haem   50    31    34
HI1534 conserved hypothetical protein {Borrelia burgdo   50    50    67
HI1130 conserved hypothetical protein {Escherichia col   50    50    51
HI1731 conserved hypothetical protein {Escherichia col   50    37    41
HI0053 zinc-type alcohol dehydrogenase {Bacillus subti   50    37    39
HI1692 molybdenum ABC transporter, permease protein (m   50    39    39
HI1655 antigen {Pasteurella haemolytica}                 49    35    38
HI1315 hypothetical protein                              49    35    49
HI1664 conserved hypothetical protein {Mycoplasma pneu   49    39    49
HI0583 2',3'-cyclic-nucleotide 2'-phosphodiesterase (c   49    38    49
HI0072 conserved hypothetical protein {Escherichia col   48    48    57
HI0676 integrase/recombinase (xerC) {Escherichia coli}   48    48    54
HI1011 conserved hypothetical protein {Escherichia col   48    39    39
HI1311 phenylalanyl-tRNA synthetase, alpha subunit (ph   48    48    48
HI1541 protease IV (sppA) {Escherichia coli}             48    35    35
HI1258 transcription-repair coupling factor (mfd) {Esc   47    34    54
HI1478 transposase (muA) {Bacteriophage mu}              47    37    37
HI0913 ribosomal protein S2 (rpS2) {Escherichia coli}    47    37    39
HI0942 exodeoxyribonuclease V, gamma chain (recC) {Esc   47    33    41
HI1254 conserved hypothetical protein {Escherichia col   47    35    52
HI0977 cell filamentation protein (fic) {Escherichia c   47    47    52
HI1551 biotin synthesis protein BioC, putative {Escher   47    33    36
HI0144 glucose kinase, putative {Escherichia coli}       46    46    53
HI0811 argininosuccinate lyase (argH) {Escherichia col   46    37    37
HI0691 glycerol kinase (glpK) {Escherichia coli}         46    33    40
HI0763 transcriptional regulator (nadR) {Escherichia c   46    33    39
HI1739 transcriptional activator (metR) {Escherichia c   46    33    51
HI1217 transferrin-binding protein, putative {Neisseri   46    35    43
HI0549 dimethyladenosine transferase (ksgA) {Escherich   46    46    47


HI0959 beta-hexosaminidase (exoII) {Vibrio furnissii}
  47.9% identity in 338 aa overlap

               10        20        30        40        50        60
Query  MGPLWLDVEGCELTAEDREILAHPTVGGVILFARNYHDNQQLLALNTAIRQAAKRPILIG
       :..: .:..: ::..:. :.:.:: :.:.:::.::.....:. .:  ..:: .:.:.::.
HI0959 MSSLLIDLKGKELEQEEVELLSHPLVAGLILFTRNFENREQIQELIRSVRQRVKKPLLIT
               10        20        30        40        50        60

               70         80        90          100       110      
Query  VDQEGGRVQLSRR-VQQDPCAQLYARSDNGTQ---LAEDGGWLMAAELIAHDIDLSFAPV
       :::::::::  :      :. : .. . ..:.   .:...:: ::::.:: :::::::::
HI0959 VDQEGGRVQRFRDGFTMLPSMQAFQETLSATEQVSFAKEAGWQMAAEMIALDIDLSFAPV
               70        80        90       100       110       120

        120       130       140       150       160       170      
Query  LDKGFDCRAIGNRAFGDDVQTVLTYSSAYMRGMKSVGMATTGKHFPGHGAVIADSHLETP
       :: : .:::::.:.:..::..... ..:.. ::. .:::.::::::::: :.::::::::
HI0959 LDLGHECRAIGDRSFSSDVKSAVNLATAFIDGMHQAGMASTGKHFPGHGHVLADSHLETP
              130       140       150       160       170       180

        180          190       200       210       220       230   
Query  YDER---DSIADDMTIFRAQIEAGILDAMMPAHVIYPHYDAQPASGSPYWLKQVLRQELG
       ::.:   . ...:.  :.. :... X::.:::::::...:.:::::: ::::..::..:.
HI0959 YDDRTKEEIFSGDLQPFQQLISQNKLDAIMPAHVIYSQCDSQPASGSKYWLKEILRKKLN
              190       200       210       220       230       240

           240       250       260       270       280             
Query  FQGIVFSDDLSMEGAAIMGGPAERAQQSLDAGCDMVLMCNKRESAVAVLDQLP-------
       :::..:::::.:.::..::. .::....:.::::..:.::.::....:::.:  v     
HI0959 FQGTIFSDDLGMKGAGVMGNFVERSKKALNAGCDLLLLCNEREGVIQVLDNLKLTENQPH
              250       260       270       280       290       300

         290       300       310       320                
Query  -ISVVPQAQSLLKQQQFTYRELKATERWKQAYQALQRLIDAHS        
        .^  .. :::.:.. .....: ...::. .:: :. .             
HI0959 FMARQARLQSLFKRRVINWNDLISDQRWRLNYQKLADIQSRWLDIQAAKND
              310       320       330       340       350 

HI0712 hemoglobin-binding protein {Haemophilus ducreyi}
  12.8% identity in 47 aa overlap

     160       170       180       190       200       210         
Query  HFPGHGAVIADSHLETPYDERDSIADDMTIFRAQIEAGILDAMMPAHVIYPHYDAQPASG
                                     .. ......   .. ..  .. :.   . .
HI0712 FKKFGPKDYVYGSKYSKPADYTDCTYNSDCYKKNFKDNLALLLRKTDYKHHSYNLGLNLD
       700       710       720       730       740       750       

     220       230       240       250       260       270         
Query  SPYWLKQVLRQELGFQGIVFSDDLSMEGAAIMGGPAERAQQSLDAGCDMVLMCNKRESAV
       .. X:.  :. . :X..                                           
HI0712 PTDWLRVQLKYANGFRAPTSDEIYMTFKHPQFSIQPNTDLKAETSKTKEVAFTFYKNSSY
       760       770       780       790       800       810       

HI0182 sugar kinase, putative {Streptomyces coelicolor}
  35.7% identity in 42 aa overlap

                    10        20        30        40        50     
Query       MGPLWLDVEGCELTAEDREILAHPTVGGVILFARNYHDNQQLLALNTAIRQAAKR
                                     X:.: : .... ..:  ::  ..:: :...
HI0182 ERVPTPKTDYEEWLNTIVDLVNRADEKFGEVGTVGLGVPGFVNQQTGLAEIANIRVADNK
          40        50        60        70        80        90     

          60        70        80        90       100       110     
Query  PILIGVDQEGGRVQLSRRVQQDPCAQLYARSDNGTQLAEDGGWLMAAELIAHDIDLSFAP
       ::: ...   :X                                                
HI0182 PILCDLSTRLGREVRAENDANCFALSEAWDTENQQYSTVLGLILGTGFGGGFVLNGKVHS
         100       110       120       130       140       150     

HI1643 conserved hypothetical protein {Escherichia coli}
  25.0% identity in 44 aa overlap

        170       180       190       200       210       220      
Query  VIADSHLETPYDERDSIADDMTIFRAQIEAGILDAMMPAHVIYPHYDAQPASGSPYWLKQ
                                     .X: .:  ...:::....: ..:  .    
HI1643 LIFIAIIAVLANYLGSTDFSHHYHISALIIAILLGMAIGNTIYPQFSSQVEKGVLFAKGT
      10        20        30        40        50        60         

        230       240       250       260       270       280      
Query  VLRQELGFQGIVFSDDLSMEGAAIMGGPAERAQQSLDAGCDMVLMCNKRESAVAVLDQLP
       .:X... . :. ..                                              
HI1643 LLRAGIVLYGFRLTFGDIADVGLNAVVTDAIMLISTFFLTALLGIRYLKMDKQLVYLTGA
      70        80        90       100       110       120         

HI1699 lipopolysaccharide biosynthesis protein, putative {Neisseria meningitidi
          s}
  27.3% identity in 22 aa overlap

                                           10        20        30  
Query                              MGPLWLDVEGCELTAEDREILAHPTVGGVILF
                                     .X: .  .::...: . ....X        
HI1699 YSVNKLFKKIKKHYTVYPNYKNIVSNIEPISLWDNQIDCEIDGEVSFFIGQPLLNTKEEN
          150       160       170       180       190       200    

             40        50        60        70        80        90  
Query  ARNYHDNQQLLALNTAIRQAAKRPILIGVDQEGGRVQLSRRVQQDPCAQLYARSDNGTQL
                                                                   
HI1699 ISLIKKLKDQIPFDYYFPHPAEDYRVDGVNYVESELIFEDYVFKHLSNXKIIIYTFFSSV
          210       220       230       240       250       260    

HI0604 adenylate cyclase (cyaA) {Haemophilus influenzae}
  42.1% identity in 19 aa overlap

           270       280       290       300       310       320   
Query  AGCDMVLMCNKRESAVAVLDQLPISVVPQAQSLLKQQQFTYRELKATERWKQAYQALQRL
                                     X::.   .:::X.. .. :           
HI0604 INLTTDPTSKVEEVLTGISSRDLFSFGSLEQSLVGSIDFTYRNVWNEIRTLHFEGQNAIL
           570       580       590       600       610       620   

                                                                   
Query  IDAHS                                                       
                                                                   
HI0604 LALKVLSNKIYRGVNRPDSIQVYCYSERYRQDLRQLVMGLVNRCVSIQVGDIQQPCQTSR
           630       640       650       660       670       680   

HI1023 transketolase 1 (tktA) {Escherichia coli}
  31.8% identity in 22 aa overlap

      90       100       110       120       130       140         
Query  TQLAEDGGWLMAAELIAHDIDLSFAPVLDKGFDCRAIGNRAFGDDVQTVLTYSSAYMRGM
                                     X:. :.:: ..:X.....   ..       
HI1023 AAYRESVLPAAVTKRVAIEAGIADFWYKYVGFNGRVIGMNSFGESAPADQLFKLFGFTVE
         600       610       620       630       640       650     

     150       160       170       180       190       200         
Query  KSVGMATTGKHFPGHGAVIADSHLETPYDERDSIADDMTIFRAQIEAGILDAMMPAHVIY
                                                                   
HI1023 NVVAKAKEIL                                                  
         660                                                       

HI0658 ABC transporter, ATP-binding protein {Escherichia coli}
  27.3% identity in 44 aa overlap

           260       270       280       290       300       310   
Query  PAERAQQSLDAGCDMVLMCNKRESAVAVLDQLPISVVPQAQSLLKQQQFTYRELKAT---
                                     .:. ... ... :: ... .. .:.:.   
HI0658 AASLLHGLGFSQEETIQPVKAFSGGWRMRLNLAQALLCPSDLLLLDEPTNHLDLDAVIWL
      130       140       150       160       170       180        

              320                                                  
Query  ERWKQAYQALQRLIDAHS                                          
       X::  .::.   :X.                                             
HI0658 ERWLVQYQGTLVLISHDRDFLDPIVTKILHIENQKLNEYTGDYSSFEVQRATKLAQQTAM
      190       200       210       220       230       240        

HI0110 seryl-tRNA synthetase (serS) {Escherichia coli}
  34.3% identity in 35 aa overlap

       210       220       230       240       250       260       
Query  IYPHYDAQPASGSPYWLKQVLRQELGFQGIVFSDDLSMEGAAIMGGPAERAQQSLDAGCD
                                     . . : X::. . ..:.::.. : :.  . 
HI0110 PCFRSEAGSYGRDTRGLIRMHQFDKVEMVQIVDPDKSMEALEELTGHAEKVLQLLNLPYR
            270       280       290       300       310       320  

       270       280       290       300       310       320       
Query  MVLMCNKRESAVAVLDQLPISVVPQAQSLLKQQQFTYRELKATERWKQAYQALQRLIDAH
       .::.X.                                                      
HI0110 KVLLCTGDMGFGSCKTYDLEVWVPAQNTYREISSCSNMWDFQARRMQARCKAKGDKKTRL
            330       340       350       360       370       380  

HI0723 TRK system potassium uptake protein (trkH) {Escherichia coli}
  26.1% identity in 23 aa overlap

          280       290       300       310       320              
Query  RESAVAVLDQLPISVVPQAQSLLKQQQFTYRELKATERWKQAYQALQRLIDAHS      
                                     .....   X::  ..:..:X...       
HI0723 DIDNLPPFIGLLLVISAVIGGCGGSTTGGLKAIRTLILWKQIDRELHSLIHPNLVQPIRI
      330       340       350       360       370       380        

HI0723 GKNRLAPRMIESIWAFFIIFILVYWGCVFAVILCGMNTFDAMGAVFATLTNAGPGLGFIH
      390       400       410       420       430       440        

HI0270 conserved hypothetical protein {Escherichia coli}
  30.8% identity in 26 aa overlap

        260       270       280       290       300       310      
Query  RAQQSLDAGCDMVLMCNKRESAVAVLDQLPISVVPQAQSLLKQQQFTYRELKATERWKQA
                                     ..    ...X:::....:X. .:  :    
HI0270 HPKCLAENAIRAIDLGSHGIDLNCGCPSKTVNGSNGGAALLKQPELIYRATQALRRAVPS
             80        90       100       110       120       130  

        320                                                        
Query  YQALQRLIDAHS                                                
                                                                   
HI0270 EFPVSVKVRLGWDDISQAFEIADAVEQGGATEITVHGRTKADGYRADRINWKKISEVRER
            140       150       160       170       180       190  

HI0444 DNA topoisomerase III (topB) {Escherichia coli}
  15.1% identity in 86 aa overlap

                                            10        20        30 
Query                               MGPLWLDVEGCELTAEDREILAHPTVGGVIL
                                     ..X:   ..:X  ..: . .   .... ..
HI0444 FQPKDFFEVQAWVNPESKEEKTPEKSTALFSALWQPSKACEDYQDDDGRVLSKGLAEKVV
         220       230       240       250       260       270     

              40        50        60            70        80       
Query  FARNYHDNQQLLALNTAIRQAAKRPILIGVD----QEGGRVQLSRRVQQDPCAQLYARSD
         ..  .... ..  ...:. .. :.  ...    ... :  .: ..  :.:..::.   
HI0444 --KRITNQPAEVTEYKDVREKETAPLPYSLSALQIDAAKRFGMSAQAVLDTCQRLYETHR
           280       290       300       310       320       330   

        90       100       110       120       130       140       
Query  NGTQLAEDGGWLMAAELIAHDIDLSFAPVLDKGFDCRAIGNRAFGDDVQTVLTYSSAYMR
                                                                   
HI0444 LITYPRSDCRYLPEEHFAERHNVLNAISTHCEAYQVLPNVILTEQRNRCWNDKKVEAHHA
           340       350       360       370       380       390   

HI0936 cytochrome C-type biogenesis {Escherichia coli}
  55.6% identity in 9 aa overlap

         70        80        90       100       110       120      
Query  RVQLSRRVQQDPCAQLYARSDNGTQLAEDGGWLMAAELIAHDIDLSFAPVLDKGFDCRAI
                                     X::. ..:X                     
HI0936 FPIITAILILMVIVLSIRKGQFDRTLLIRCGWLLIPSLILAGLMIWQQLRNNSALHFHAF
         400       410       420       430       440       450     

        130       140       150       160       170       180      
Query  GNRAFGDDVQTVLTYSSAYMRGMKSVGMATTGKHFPGHGAVIADSHLETPYDERDSIADD
                                                                   
HI0936 AFVLLTLAIWLLFVTLWQNWRQIRLSQFGMILAHCGVAIVTIGAVMSGYFGSEIGVRLAP
         460       470       480       490       500       510     

HI1001 inner membrane protein, 60 kDa (yidC) {Escherichia coli}
  22.2% identity in 27 aa overlap

        70        80        90       100       110       120       
Query  VQLSRRVQQDPCAQLYARSDNGTQLAEDGGWLMAAELIAHDIDLSFAPVLDKGFDCRAIG
                                     X:... ..  . .: .  . .:X...:   
HI1001 TDPTQQKVMNFMPLVFMFFFLWFPSGLVLYWLVSNLITIAQQQLIYRGLEKKGLHSRKK 
            490       500       510       520       530       540  

       130       140       150       160       170       180       
Query  NRAFGDDVQTVLTYSSAYMRGMKSVGMATTGKHFPGHGAVIADSHLETPYDERDSIADDM

HI1377 exodeoxyribonuclease I (sbcB) {Escherichia coli}
  36.0% identity in 25 aa overlap

           280       290       300       310       320             
Query  KRESAVAVLDQLPISVVPQAQSLLKQQQFTYRELKATERWKQAYQALQRLIDAHS     
                                     X:. : ...  .  ..::::...:X     
HI1377 DKRILELLFHYRARHFYKTLTRAEQIKWKKYRQNKLEKSAVEFEASLQRLVEXHSDNSEK
       400       410       420       430       440       450       

HI1377 LSLLQQVYEYGIKLLG
       460       470   

HI0183 amino acid carrier protein, putative {Bacillus subtilis}
  22.2% identity in 18 aa overlap

             60        70        80        90       100       110  
Query  AKRPILIGVDQEGGRVQLSRRVQQDPCAQLYARSDNGTQLAEDGGWLMAAELIAHDIDLS
                                     X: ....... ..  :X.            
HI0183 NALQYHIGEFGAHFLAFILLLFAYSSIIGNYAYAESNIRFIKNKPWLVLLFRLMVLFFVY
       350       360       370       380       390       400       

            120       130       140       150       160       170  
Query  FAPVLDKGFDCRAIGNRAFGDDVQTVLTYSSAYMRGMKSVGMATTGKHFPGHGAVIADSH
                                                                   
HI0183 FGAVRSGNVVWNFADTVMAVMAIINLIAILMLSPIVWKLMKDYQRQLKEGKTPEFKIDEY
       410       420       430       440       450       460       

HI1677 conserved hypothetical protein {Escherichia coli}
  38.5% identity in 13 aa overlap

            290       300       310       320                      
Query  DQLPISVVPQAQSLLKQQQFTYRELKATERWKQAYQALQRLIDAHS              
                                     X:. : ... .:X.                
HI1677 EMEHDEAGQDVEVIKSLTNNCTPPADACFSWKALYSGINEFIDDLMHHIHLENNILFPRV
     160       170       180       190       200       210         

HI1677 LNEK
     220   

HI0859 ATP-dependent Clp protease, ATPase subunit (clpB) {Escherichia coli}
  16.7% identity in 60 aa overlap

                                             10        20        30
Query                                MGPLWLDVEGCELTAEDREILAHPTVGGVI
                                     .. . .: :.  .  . .:......:. ..
HI0859 RAGLSDPNRPIGSFLFLGPTGVGKTELCKTLAKFLFDSEDAMVRIDMSEFMEKHSVSRLV
       590       600       610       620       630       640       

               40        50        60        70        80        90
Query  LFARNYHDNQQLLALNTAIRQAAKRPILIGVDQEGGRVQLSRRVQQDPCAQLYARSDNGT
         ...: . ..   :..:.:. . . ::..                              
HI0859 GAPPGYVGYEEGGYLTEAVRRRPYSVILLDEVEKAHADVFNILLQVLDDGRLTDGQGRTV
       650       660       670       680       690       700       

HI1104 transporter protein {Acinetobacter calcoaceticus}
  20.3% identity in 64 aa overlap

              150       160       170       180       190       200
Query  YSSAYMRGMKSVGMATTGKHFPGHGAVIADSHLETPYDERDSIADDMTIFRAQIEAGILD
                                     .. .:. ....:..:..  :.  :..   .
HI1104 IGWRGMFLVGIFPAFVAWFLRSHLHEPEIFTQKQTALSTQSSFTDKLRSFQLLIKDKATS
             170       180       190       200       210       220 

              210       220       230       240       250       260
Query  AMMPAHVIYPHYDAQPASGSPYWLKQVLRQELGFQGIVFSDDLSMEGAAIMGGPAERAQQ
        .  . :. .  ..    :   X: . :...::X                          
HI1104 KISLGIVVLTSVQNFGYYGIMIWLPNFLSKQLGFSLTKSGLWTAVTVCGMMAGIWIFGQL
             230       240       250       260       270       280 

              270       280       290       300       310       320
Query  SLDAGCDMVLMCNKRESAVAVLDQLPISVVPQAQSLLKQQQFTYRELKATERWKQAYQAL
                                                                   
HI1104 ADRIGRKPSFLLFQLGAVISIVVYSQLTDPDIMLLAGAFLGMFVNGMLGGYGALMAEAYP
             290       300       310       320       330       340 

HI1248 hypothetical protein
  20.0% identity in 75 aa overlap

             140       150       160       170       180       190 
Query  GDDVQTVLTYSSAYMRGMKSVGMATTGKHFPGHGAVIADSHLETPYDERDSIADDMTIFR
                                     ::::  : .:.:.: .... . .. .....
HI1248 NLHQIQNNSIKAGTTLIFASFVYGVLHALGPGHGKFIIASYLST-HESQLKQSTILSLLS
             50        60        70        80         90       100 

             200       210       220       230       240       250 
Query  AQIEAGILDAMMPAHVIYPHYDAQPASGSPYWLKQVLRQELGFQGIVFSDDLSMEGAAIM
       . ... .  .  .  :.  . ...  . :.^::... ^  : : : v      v      
HI1248 SLMQGIVAITATTLLVVVLNLSSRYFKLSQLWLERTALLLLVFLGCYWIWQGLRAYRKKA
             110       120       130       140       150       160 

             260       270       280       290       300       310 
Query  GGPAERAQQSLDAGCDMVLMCNKRESAVAVLDQLPISVVPQAQSLLKQQQFTYRELKATE
                                                                   
HI1248 KLAIKSLNPLPLHEKSAVKNNRTFQPNTCSCGHQHLPSPTQTAQATNLKSQFLVILTIGM
             170       180       190       200       210       220 



.
BCM HGSC