FASTA-SWAP or FASTA-PAT Search Results

FASTA-SWAP and FASTA-PAT (FASTA-based pattern database search tools) are modified versions of the FASTA sequence database search tool. See the FASTA-SWAP and FASTA-PAT Help Page for a detailed program description.

Istvan Ladunga, Brent A. Wiese, and Randall F. Smith (1995)
Human Genome Sequencing Center, Baylor College of Medicine, Houston, TX 77030.


 FASTA-SWAP searches a protein pattern database
 FASTA-SWAP version 1.0, Dec. 1995, 
FASTA version 2.0u August, 1995
Please cite: I. Ladunga, B. Wiese & R.F. Smith (1996), submitted
 W.R. Pearson & D.J. Lipman PNAS (1988) 85:2444-2448

 /tmp/fastapat.seq.6927 :  345 aa

 >GI||SP|Q|DBP_HUMAND-BINDINGPROTEIN(DBP)(ALBUMINDB
 X-BINDINGPRTEINTAXREBMARPVSDRTPAxxxxxxxxxxxxxxxxxxGLRSLLQGTS
 KPKEPASCLLKEKERKxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxx
 xxxxxxERTLPFGDVEYVDLDAFxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxx
 xxxxxxxxxxxxxxxxGTASGHRAGLTSRDTPSPVDPDTVEVLMTFEPDPADLALSSIPG
 HETFDPRRHRFSEEELKPQPIMKKARKIQVPEEQKDEKYWSRRYKNNEAAKRSRDARRLK
 ENQISVRAAFLEKENALLRQEVVAVRQELSHYRAVLSRYQAQHGAL

 vs PIMA Database library
 searching /local/dot5/sl_home/beauty/seqdb/comb-db/pima 11 library


 one = represents 12 library sequences
 for inset = represents 1 library sequences

   z-opt E()
< 20   222     0 :===================
  22    38     0 :====
  24    30     0 :===
  26    27     0 :===
  28    26     2 :*==
  30    19    10 :*=
  32    29    40 :===*
  34    38   109 :====     *
  36    93   224 :========          *
  38   198   370 :=================             *
  40   294   516 :=========================                 *
  42   468   630 :=======================================             *
  44   604   695 :===================================================      *
  46   707   708 :==========================================================*
  48   687   678 :========================================================*=
  50   609   619 :===================================================*
  52   623   544 :=============================================*======
  54   507   465 :======================================*====
  56   435   388 :================================*====
  58   361   319 :==========================*====
  60   331   258 :=====================*======
  62   222   207 :=================*=
  64   235   165 :=============*======
  66   166   130 :==========*===
  68   125   102 :========*==
  70   126    80 :======*====
  72    88    63 :=====*==
  74    72    49 :====*=
  76    51    38 :===*=
  78    51    30 :==*==
  80    35    23 :=*=
  82    34    18 :=*=
  84    25    14 :=*=
  86    25    11 :*==
  88     9     8 :*
  90    17     6 :*=
  92    18     5 :*=        :====*=============
  94    10     4 :*         :===*======
  96    12     3 :*         :==*=========
  98     9     2 :*         :=*=======
 100     8     2 :*         :=*======
 102     9     1 :*         :*========
 104     6     1 :*         :*=====
 106     3     1 :*         :*==
 108     4     1 :*         :*===
 110     3     0 :=         *===
 112     2     0 :=         *==
 114     1     0 :=         *=
 116     2     0 :=         *==
 118     2     0 :=         *==
>120    11     0 :=         *===========
2169542 positions in 12664 patterns
 statistics extrapolated from 7538 to 7538 patterns
 Kolmogorov-Smirnov statistic: 0.0977 (N= 29) at 50
 results sorted and z-values calculated from opt score
 7727 scores better than 1 saved, ktup: 1, variable pamfact
 gap penalties: -100,-10
 joining threshold: 83, optimization threshold: 50, width: 32
  scan time:  0:05:43
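
The histogram above lists, for each z-score bin, a count of library patterns and the count expected from the fitted distribution (drawn as `=` bars and `*` marks, respectively). A minimal sketch for reading those rows back into numbers follows. The function name `parse_histogram` and the regular expression are illustrative assumptions, not part of FASTA-SWAP, and the reading of the second and third columns as observed and expected counts follows the usual FASTA histogram layout.

```python
import re

# Hypothetical helper (not part of FASTA-SWAP): matches rows such as
# "  46   707   708 :=====*" or "< 20   222     0 :====".
ROW = re.compile(r"^\s*([<>]?\s*\d+)\s+(\d+)\s+(\d+)\s*:")

def parse_histogram(lines):
    """Return (bin_label, observed, expected) tuples for histogram rows."""
    rows = []
    for line in lines:
        m = ROW.match(line)
        if m:
            label = m.group(1).replace(" ", "")  # "< 20" -> "<20"
            rows.append((label, int(m.group(2)), int(m.group(3))))
    return rows

sample = [
    "   z-opt E()",
    "< 20   222     0 :===================",
    "  46   707   708 :=====*",
    ">120    11     0 :=         *===========",
    "2169542 positions in 12664 patterns",
]
rows = parse_histogram(sample)
```

Header and summary lines are skipped because they do not carry three integer columns followed by a colon.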
The best scores are:                                     opt  z-sc  E(7538)
6367 d-beta-hydroxybutyrate precursor bdh dehydrogenas   883 154.1  0.00671
10569 alternatively hypothetical 77.5 spliced kd prote   872 148.4  0.01402
8390 lymphoid-restricted membrane protein                791 136.3  0.06589
7147 antigen c-terminal bbg clone 2.1 1.1                749 132.4  0.11
8199 hypothetical bblf2 protein                          745 128.9  0.17
8099 embryonic nuclear protein lin-14 form a b1          736 128.0  0.19
8276 hypothetical 128.6 kd protein zk1098.10 in chromo   764 127.5  0.20
5807.2 300 interspersed kd repeat antigen ag231          707 126.4  0.24
1653 probable repa replication-associated replication    712 125.8  0.25
8516 hypothetical surface-layer 125 80k kd protein pre   727 124.1  0.32
4861 triadin kda junction-specific back sarcoplasmic m   675 122.6  0.38
8476 beta-lactamase hypothetical regulatory protein 2    664 118.8  0.63
8410 no title                                            660 117.8  0.71
8344 len: 393 cai: mitochondrial 0.17 outer membrane 4   663 117.2  0.76
4598 major minor capsid protein 10a 10b                  657 117.0  0.78
6791 dtaf tsm1 ii protein 150 gene product               684 114.9  1.03
11380 hypothetical trwc protein                          649 113.4  1.24
6096 crtj regulatory repressor protein                   640 112.5  1.39
2681 beta-adaptin adaptin protein clathrin complex bet   635 111.0  1.69
4810 5 iif transcription chain alpha factor tfiif subu   631 109.9  1.96
3332 heat heavy shock 70 chain heat-shock binding prot   620 109.9  1.96
4703 dynactin 117 150 kd dynein-associated isoform pro   645 109.5  2.06
8289 epidermal eps8 growth protein factor receptor kin   640 109.1  2.17
8838 h probable dehydrogenase region ltdh methotrexate   597 108.3  2.39
8492 lmp1 lmp2 gene product                              612 107.7  2.58
7391 hypothetical lactococcin a in protein secretion l   601 106.2  3.14
3138 defective fc chorion-1 proteins fc106 fc125 fc177   593 105.8  3.29
8548 yopd protein                                        584 105.8  3.30
5939 69 autoantigen kd p69                               594 104.7  3.79
5176 histone-binding nuclear autoantigenic hgv2 protei   605 104.3  4.01
4573 element insertion is421 hypothetical 47 is186 41    580 104.3  4.01
9186 pes4 pab-like protein                               580 103.8  4.24
8288 macrogolgin rat gcp360                              657 103.7  4.34
8676 alpha-helical coiled coil protein tlpa              575 103.6  4.37
8248 hypothetical orf3 protein 3                         579 103.1  4.64
8306 yd9395.16 cdc1 gene len: 491 cai: 0.13              584 103.0  4.73
5970 cfxy cfxyc protein plasmid                          560 103.0  4.74
8078 von vwf pre-pro-polypeptide willebrand -22 factor   579 102.9  4.80
8325 hap4 aa transcriptional 1-554 activator             587 102.8  4.87
8291 hypothetical orf 79.4 ykr090w kd protein in prp16   595 102.6  4.95
10618 d2045.2                                            596 102.4  5.12
8683 gravity gene cdna clone expressed gsc381 in callu   543 101.5  5.70
8272 regulatory protein rim1                             584 101.5  5.70
8680 retrovirus-related gag polyprotein homolog transp   571 101.3  5.86
8621 hypothetical lipopolysaccharide core protein        555 101.2  5.94
8469 citrate beta beta-subunit lyase chain acyl subuni   553 101.1  6.05
4596 dna-directed homolog rna polymerase cds kd 30 e4l   547 100.7  6.31
8338 endoglucanase a endo-1 precursor 4-beta-glucanase   555 100.7  6.37
9529 b2 hypothetical protein                             513 100.5  6.52
3694 major paraflagellar rod protein component pfr par   569  99.8  7.17
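
The E(7538) column in the list above appears consistent with the extreme-value conversion commonly used by FASTA, applied to z-scores scaled to mean 50 and standard deviation 10. The sketch below reproduces the listed values to within the rounding of the printed z-scores. The function name `expect_from_z` is illustrative, and the assumption that this FASTA-SWAP build uses exactly this formula is not stated in the output itself.

```python
import math

def expect_from_z(z_score, n_entries):
    """Expected number of library entries scoring >= z_score by chance.

    Assumes z-scores normalized to mean 50, s.d. 10, and an
    extreme-value (Gumbel) distribution of normalized scores.
    """
    z = (z_score - 50.0) / 10.0
    e = math.exp(-math.pi / math.sqrt(6.0) * z - 0.577216)
    # For tiny e, 1 - exp(-e) is numerically equal to e.
    p = 1.0 - math.exp(-e) if e > 0.01 else e
    return n_entries * p

# z-sc values taken from the first three hits in the list above
e_6367 = expect_from_z(154.1, 7538)
e_10569 = expect_from_z(148.4, 7538)
e_8390 = expect_from_z(136.3, 7538)
```

Because the z-scores are printed to one decimal place, the recomputed values agree with the listed E() figures only to roughly one percent.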
6367 d-beta-hydroxybutyrate precursor bdh dehydrogenas (334 aa)
initn: opt:  883 z-score: 154.1 E(): 0.0067
Smith-Waterman score: 883;    12.5% identity in 288 aa overlap
 21.2% noncontradicting positions,   8.7% class identity

               10        20                   30        40         
GI||SP XBINDINGPRTEINTAXREBMAR-----------PVSDRTPAXXXXXXXXXXXXXXXXXX
               P   ++   RE  aR-----------P   RT a                  
6367       lSrLPGKaLSaCDRENGaRrpLLlgpaSFiPdgRRTYaSaAdaaggKAVLVTGCDS
           f q    t  v      t ht  fyst  s it    t q epvss          
                   10        20        30        40        50      

      50        60           70        80        90       100      
GI||SP GLRSLLQGTSKPK---EPASCLLKEKERKXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX
       G    L      K---  A CLlKek                                  
6367   GFGFSLAKHLHSKGFLVFAGCLlKdqGdaGVrELDSLnSDRLRTiQLNVcrSEEVEKaVe
                             m ek hd  k     k      v    fn      v g
         60        70        80        90       100       110      

        110       120       130       140       150       160      
GI||SP XXXXXXXXXXXXXXXXXXXERTLPFGDVEYVDLDAFXXXXXXXXXXXXXXXXXXXXXXXX
                               FG+VE+  l+ +                        
6367   dcrfeledPEKGMWGLVNNAGISTFGEVEFTSlETYKqVAEVNLWGTVRmTKSFLPLiRR
       tvpsgpkg                        m    e           t       l  
        120       130       140       150       160       170      

        170       180       190       200       210       220      
GI||SP XXXXXXXXXXXXXXXXXXXXXXXXXXXXXGTASGHRAGLTSRDTPSPVDPDTVEVLMTFE
                                      A   -- L     P  V    VE------
6367   AKGRVVNISSMLGRMANPARSPYCITKFGVEAFSD--CLRYEMhPLGVKVSVVE------
                                                  y                
        180       190       200       210         220              

        230        240       250       260       270        280    
GI||SP PDPAD-LALSSIPGHETFDPRRHRFSEEELKPQPIMKKARKIQVPEE-QKDEKYWSRRYK
       --P +-+A +S+   E       +  eE-- P+ + K   K    E - K E Y s    
6367   --PGNFIAATSLYnPErIQAIAKKMWdE--LPEVVRKDYGKKYFDEKIAKMETYCnSGST
                    s  s         e                            s    
        230       240       250         260       270       280    

          290       300       310       320       330       340    
GI||SP NNEAAKRSRDARRLKENQISVRAAFLEKENALLRQEVVAVRQELSHYRAVLSRYQAQHGA
       +   +  +           +            +                           
6367   DTSpVInAVTHALTAaTPYTRYHPMDYYWWLRMQiMTHlPGAISDkIYIr          
          s  d        t                  v   f      m   h          
          290       300       310       320       330       340    
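
Each alignment record opens with the same three summary lines (opt score, z-score, E(), Smith-Waterman score, percent identity, overlap length). A minimal sketch for extracting those fields from one record is shown here. The name `parse_hit` and the regular expressions are illustrative assumptions based only on the layout visible in this output.

```python
import re

# Patterns written against the summary lines as they appear in this output.
SUMMARY = re.compile(
    r"opt:\s*(?P<opt>\d+)\s+z-score:\s*(?P<z>[\d.]+)\s+E\(\):\s*(?P<e>[\d.eE+-]+)"
)
OVERLAP = re.compile(
    r"Smith-Waterman score:\s*(?P<sw>\d+);\s*"
    r"(?P<ident>[\d.]+)% identity in (?P<len>\d+) aa overlap"
)

def parse_hit(block):
    """Return a dict of summary fields, or None if the lines are missing."""
    s = SUMMARY.search(block)
    o = OVERLAP.search(block)
    if not (s and o):
        return None
    ident = float(o["ident"])
    length = int(o["len"])
    return {
        "opt": int(s["opt"]),
        "z_score": float(s["z"]),
        "expect": float(s["e"]),
        "sw_score": int(o["sw"]),
        "percent_identity": ident,
        "overlap": length,
        # Approximate count, since the percentage is rounded in the output.
        "identical_positions": round(ident * length / 100),
    }

record = """initn: opt:  883 z-score: 154.1 E(): 0.0067
Smith-Waterman score: 883;    12.5% identity in 288 aa overlap
 21.2% noncontradicting positions,   8.7% class identity
"""
hit = parse_hit(record)
```

For the first record (6367), 12.5% identity over a 288 aa overlap corresponds to about 36 identical positions.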

10569 alternatively hypothetical 77.5 spliced kd prote (650 aa)
initn: opt:  872 z-score: 148.4 E():  0.014
Smith-Waterman score: 872;     6.1% identity in 342 aa overlap
 20.2% noncontradicting positions,  14.0% class identity

                    10        20        30        40        50     
GI||SP      XBINDINGPRTEINTAXREBMARPVSDRTPAXXXXXXXXXXXXXXXXXXGLRSLL
                      t i  a re  +     r                      l    
10569  WaraDLqnLQRELDAdaieianrqdeSenSRKrLaeqsreFKKnePEdlrnnVakiiKqf
        kkf  tq       tvtvlkdketl lq   s itetkk   lt  eklkq npll sy
               10        20        30        40        50        60

               60        70        80        90       100       110
GI||SP QG-----TSKPKEPASCLLKEKERKXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX
       Qg-----t + Ke    ll   er                                    
10569  QrEIDaLsqRSKeaEaallnVYerLidaPDPqPALDLGQqLQlklqrLgdIdTdnqeLrE
        g   n tk   fs kvffd  kk sev   v       l  ssvek hk e eskk k 
               70        80        90       100       110       120

              120       130       140        150       160         
GI||SP XXXXXXXXXXXXXXXERTLPFGDVEYVDLD-AFXXXXXXXXXXXXXXXXXXXXXXXXXXX
                      E T+        dl+-+                            
10569  kieelndelAeyanqEVTIKaLKerirdlEQSsaKnqAeriaLaKeQeinndfaEKeRnl
       tlsyyekkf kvkdy     t  sklley   tl tl ktlt e t klqstwe  g kw
              130       140       150       160       170       180

     170       180       190       200       210       220         
GI||SP XXXXXXXXXXXXXXXXXXXXXXXXXXGTASGHRAGLTSRDTPSPVDPDTVEVLMTFEPDP
                                  t s      t  d       d +E++mT     
10569  qErqadllkqLEEAEhnVQeqnkALEakrseniDiegngnEdgdaeanqiEMimTriara
       k temsttsk     tk  slqt   ktitklf lktkyd ettqkndek  vs dleey
              190       200       210       220       230       240

     230       240       250       260       270       280         
GI||SP ADLALSSIPGHETFDPRRHRFSEEELKPQPIMKKARKIQVPEEQKDEKYWSRRYKNNE-A
          A       ET   r +      l     + KA  +----EQ  E - sr  k  E-a
10569  NQRAElaqrEaETlrariYqlanrneqLagaiaKApDV----EQAIEV-LsraelEtELa
            vvtq l  tqeql ssekhsle ssqlq  t              tessk v  h
              250       260       270           280        290     

      290       300       310       320       330       340        
GI||SP AKRSRDARRLKENQISVRAAFLEKENALLRQEVVAVRQELSHYRAVLSRYQAQHGAL   
       AK    a    enq  -- A leke       +    q+Ls   A    Y +    l   
10569  AKEreiaQLeednarL--qASleqeRensahaInqLeqQLnakVAESESYnSeLeqlrrK
          lkln  vsevql   s  ytkl kstssq se ke  ssv       k t ktvee 
         300       310         320       330       340       350   

                                                                   
GI||SP                                                             
                                                                   
10569  LnnqaDYneiKeELnaLKkiEFapnEdagdnDaaSEDKNDnplEslLLeaNrkLQaenAa
        kgys  ekv k  si  sm  gvs gdstq ir      ktf vs  sk ks  stl e
           360       370       380       390       400       410   

8390 lymphoid-restricted membrane protein          (534 aa)
initn: opt:  791 z-score: 136.3 E():  0.066
Smith-Waterman score: 791;    11.5% identity in 322 aa overlap
 18.9% noncontradicting positions,   7.5% class identity

               10        20        30        40        50          
GI||SP XBINDINGPRTEINTAXREBMARPVSDRTPAXXXXXXXXXXXXXXXXXXGLR-----SLL
                   ++         P sd +p                   G  -----Sll
8390         EpeDGALDVkRqcqCPgPTedpilGqnLldCiRMNdDqSmdENGaerfcpESll
              vk      t ghk  l  sgssp te sg t   e p te   vghvys  ps
                     10        20        30        40        50    

          60        70        80        90       100       110     
GI||SP QGTSKPKEPASCLLKEKERKXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX
       Q       P s      e                                          
8390   QlReYlsqPlprqTSSsdgTiTSSdpGldILnMAScDLDrnpLCeKEEdaRaASamIEAQ
        s g stl sseh   tes v   es se  h   g   cks  k   et s  pt    
           60        70        80        90       100       110    

         120       130       140       150       160       170     
GI||SP XXXXXXXXXXERTLPFGDVEYVDLDAFXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX
                 -----fgD   vd -A                                  
8390   GTSlAhDNaA-----fqDsTSkdV-AKaalnLEAgEElrTiEnggKehApGdseiSmlPk
          p p  i      yg y  vg    tisq   k  pe t ehk gs s etvv pp v
          120            130        140       150       160        

         180       190       200       210       220       230     
GI||SP XXXXXXXXXXXXXXXXXXXXGTASGHRAGLTSRDTPSPVDPDTVEVLMTFEPDPADLA--
                             A   R  L  +----- D  T E  +  E    DlA--
8390   asVKlVNfrQSENTSANEKEVEAEFLRLSLGlK-----CDWFTLEKRVKLEERSRDlAEE
       tt  s  vq                      f                        w   
      170       180       190       200            210       220   

                 240       250       260       270       280       
GI||SP ------LSSIPGHETFDPRRHRFSEEELKPQPIMKKARKIQVPEEQKDEKYWSRRYKNNE
       ------ +s+   E+  P----  Ee+ + Q I+KK  K  v   Q   +  SR      
8390   NLKKEITNcLKLLESLTP----LCEdDNQAQEIiKKLEKSIklLSQCaARVASRAEMLGA
               s                e       v       vf    t            
           230       240           250       260       270         

       290             300       310       320       330       340 
GI||SP AAKRSRDARRLK------ENQISVRAAFLEKENALLRQEVVAVRQELSHYRAVLSRYQAQ
         + SR +r   ------EN   + A----KE A L     a  Q    +         Q
8390   INQESRVSrAVEVMIQHVENLKRMYA----KEHAELEdLKQaLLQNdRSFNpLeDdDDCQ
               k                            e   v    e    s p e    
     280       290       300           310       320       330     

                                                                   
GI||SP HGAL                                                        
                                                                   
8390   IKKRSaSLNSKPSSLRRVTIASLPRNiGNaGlVaGMENNDRFSRRSSSWRILGsKQgEHR
            s                    l  v m s                   t  s   
         340       350       360       370       380       390     

7147 antigen c-terminal bbg clone 2.1 1.1          (323 aa)
initn: opt:  749 z-score: 132.4 E():   0.11
Smith-Waterman score: 749;    26.8% identity in 123 aa overlap
 29.3% noncontradicting positions,   2.4% class identity

              190       200       210       220       230          
GI||SP XXXXXXXXXXXXXXXGTASGHRAGLTSRDTPSPVDPDTVEVLMTFEPDPA---DLALSSI
                                     PS-VD + -E  m  E +P ---+   S I
7147   fEGEGEQSVIPEAEPsfEGEGEQSVIPEAEPS-VDGEG-EQSmIPEAEPTiEGEGEQSVI
       v              tv                         v       f         
     180       190       200       210         220       230       

       240       250         260       270       280       290     
GI||SP PGHETFDPRRHRFSEEELKPQ--PIMKKARKIQVPEEQKDEKYWSRRYKNNEAAKRSRDA
       P  E---P      EE   P+--P +  ++----PE    EK----  K   AAK +R +
7147   PEaE---PSVEPAGEEPVIPEAEPSVEPVK----PEVDDIEK----PVKVAKAAKVARSV
         v                                                         
       240          250       260           270           280      

         300        310       320       330       340     
GI||SP RRLKENQISV-RAAFLEKENALLRQEVVAVRQELSHYRAVLSRYQAQHGAL
       +  K     v-r A   KE  L +Q+     QE +                
7147   KAAKKAAKKlarKARqrKErKLKKQQEEQAQQESAEq              
                vsk   kk  k                h              
        290       300       310       320       330       

8199 hypothetical bblf2 protein                    (521 aa)
initn: opt:  745 z-score: 128.9 E():   0.17
Smith-Waterman score: 745;    18.4% identity in 136 aa overlap
 20.6% noncontradicting positions,   2.2% class identity

        100       110       120       130       140       150      
GI||SP XXXXXXXXXXXXXXXXXXXXXXXXXXXXXERTLPFGDVEYVDLDAFXXXXXXXXXXXXXX
                                     R L    VE      F              
8199   ELRRSGGLIAMLADAAEKDLFDLSFRTRDRRLLSAARVEDEQGLIFQPLFPAQVVCQSCS
          100       110       120       130       140       150    

        160       170       180       190       200       210      
GI||SP XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXGTASGHRAGLTSRDTPSPVDP
                                                    R    +  T +P DP
8199   GDDGRDQQPPPVDGFGSEMEGEQTCPHAQRHSESPGQLDVYIRTPRGDVFTYSTETPDDP
          160       170       180       190       200       210    

           220         230       240       250       260       270 
GI||SP DTV---EVL--MTFEPDPADLALSSIPGHETFDPRRHRFSEEELKPQPIMKKARKIQVPE
         V---++L--+T+E---+DL+ S        D RRHR S   L P              
8199   SPVPFRDILRPVTYE---VDLVSSDGATGRGGDARRHRVSLKILEPAGGFESWLVNSWSM
          220          230       240       250       260       270 

             280       290       300       310       320       330 
GI||SP EQKDEKYWSRRYKNNEAAKRSRDARRLKENQISVRAAFLEKENALLRQEVVAVRQELSHY
                R    +  A                     + +          +       
8199   AGGGLYAFLRSIYASCYANHRGTKPIFYLLDPELCPGGSDFQPYVPGFPFLPIHYVGRAR
             280       290       300       310       320       330 

8099 embryonic nuclear protein lin-14 form a b1    (475 aa)
initn: opt:  736 z-score: 128.0 E():   0.19
Smith-Waterman score: 879;    22.2% identity in 221 aa overlap
 23.5% noncontradicting positions,   1.4% class identity

             110       120       130         140       150         
GI||SP XXXXXXXXXXXXXXXXXXXXXXXXERTLPFGDVEYVDL--DAFXXXXXXXXXXXXXXXXX
                                 +   GD + +D --D                   
8099   QGTDDQTVKWIGPSSVDSNGQKTDSSAASAGDNQNIDVIGDGSESPTSSNHSAQEIALMT
          120       130       140       150       160       170    

     160       170       180       190       200        210        
GI||SP XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXGTASG-HRAGLTSRDTPSPVDPDT
                                           GT +G- +AG   R    PV+ D 
8099   SQQTFLNALKDSSFLFTNPVPTVETAPPLRVAPPINGTTNGTAKAGGPERKPRKPVNDDI
          180       190       200       210       220       230    

                220       230       240       250       260        
GI||SP V----------EVLMTFEPDPADLALSSIPGHETFDPRRHRFSEEELKPQPIM-KKARKI
       V----------E +  FE -P+  A++S P---TF P---- SE+++  Q I -KK   +
8099   VKIVRNQDLSEENISMFEI-PVPKAIASDP---TFRP----VSEQQIIQQIIQGKKYEEM
          240       250        260              270       280      

       270         280       290       300       310        320    
GI||SP QVPE--EQKDEKYWSRRYKNNEAAKRSRDARRLKENQISVRAAFLEKEN-ALLRQEVVAV
       +V E-- Q   K ---------A KR    R +--+Q +V--A L   N-A L    +  
8099   EVGECMIQLCKKL---------AEKRVFGPRLM--SQTTV--AGLNHSNYANLPIKGICY
        290                300       310           320       330   

          330       340                                            
GI||SP RQELSHYRAVLSRYQAQHGAL                                       
        Q   --R VL                                                 
8099   IQHVC--RKVLYDKFENEEDFWDKFREAMRKLAARCRRVRHAKKTKHNREEAQAEMLSKR
             340       350       360       370       380       390 

8276 hypothetical 128.6 kd protein zk1098.10 in chromo (1120 aa)
initn: opt:  764 z-score: 127.5 E():    0.2
Smith-Waterman score: 902;    16.9% identity in 326 aa overlap
 18.7% noncontradicting positions,   1.8% class identity

           30        40        50        60        70        80    
GI||SP VSDRTPAXXXXXXXXXXXXXXXXXXGLRSLLQGTSKPKEPASCLLKEKERKXXXXXXXXX
       V+                            LQ-T   +E A--L K+ E K         
8276   VNVLEALDLAYLERDEQTAELEMLKEDNEQLQ-TQYEREKA--LRKQTEQKYIEIEDTLI
          40        50        60         70          80        90  

           90       100       110       120       130       140    
GI||SP XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXERTLPFGDVEYVDLDAFXX
                                              --E  L-----E+  L     
8276   GQNKELDKKIESLESIMRMLELKAKNATDHASRLEEREV--EQKL-----EFDRLHERYN
            100       110       120       130              140     

          150       160       170       180       190          200 
GI||SP XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXGTA---SGH
                                                          G +---S H
8276   TLLRTHVDHMERTKYLMGSEKFELMQNMPLPNMQLRNKMGMAASVDASSIRGVSDLISAH
         150       160       170       180       190       200     

             210       220           230       240       250       
GI||SP RAGLTSRDTPSPVDPDTVEVLMTF----EPDPADLALSSIPGHETFDPRRHRFSEEELKP
           T+ D          +    F----EP P D+  SS ------D      +  E--P
8276   MTQSTTMDVNLANHITNEDWQDEFSSDIEPSPRDIPQSSA------DALTSPITTKE--P
         210       220       230       240             250         

       260       270       280                      290            
GI||SP QPIMKKARKIQVPEEQKDEKYWSRRYKNN---------------EAAKRSRDA----RRL
        P    A   Q  EE+ DE        NN---------------E A    D ---- R 
8276   TPKREAASPKQSEEEEADETTSVDPKENNDLLGADLTGNLVDPAEFASAVNDTFIGMGRE
       260       270       280       290       300       310       

      300       310                    320       330       340     
GI||SP KENQISVRAAFLEKENAL-------------LRQEVVAVRQELSHYRAVLSRYQAQHGAL
        EN I   +  L+  NAL-------------L  E +  R E      V  + Q Q    
8276   VENLIKENSELLDMKNALNIVKNDLINQVDELNSENMILRDENLSRQMVSEKMQEQITKH
       320       330       340       350       360       370       

                                                                   
GI||SP                                                             
                                                                   
8276   EEEIKTLKQKLMEKENEQEEDDVPMAMRKRFTRSEMQRVLMDRNAYKEKLMELEESIKWT
       380       390       400       410       420       430       

5807.2 300 interspersed kd repeat antigen ag231    (283 aa)
initn: opt:  707 z-score: 126.4 E():   0.24
Smith-Waterman score: 887;    26.4% identity in 129 aa overlap
 26.4% noncontradicting positions,   0.0% class identity

     180       190       200       210       220       230         
GI||SP XXXXXXXXXXXXXXXXGTASGHRAGLTSRDTPSPVDPDTVEVLMTFEPDPADLALSSIPG
                        T            T  PV-- T E   T EP  ++  +    G
5807.2 IEEPVTTQEPVTIEEPVTTQEPVTTQEPVTTQEPV--TTQEPVTTQEPVTVEEHIDEKKG
         30        40        50        60          70        80    

     240       250        260       270       280        290       
GI||SP HETFDPRRHRFSEE-ELKPQPIMKKARKIQVPEEQKDEKYWSRRYKNN-EAAKRSRDARR
        E  +      SEE-E K +   KK+  ++     K++K      K +-E++K + D  +
5807.2 SEGDNISLSSLSEETEEKSHTKKKKSSWLKFGRGNKNDKKSKNEKKPSLESVKQNADEQK
           90       100       110       120       130       140    

         300       310       320            330       340          
GI||SP LK--ENQISVRAAFLEKENALLRQEVVAVR-----QELSHYRAVLSRYQAQHGAL     
        +--++QISV A-----+++   QE  A  -----QEL+      +  +           
5807.2 EQPTDSQISVNA-----QDSVTIQEPTATQEPPTTQELTATQEPTTTQETVTEQEPTTTQ
          150            160       170       180       190         

                                                                   
GI||SP                                                             
                                                                   
5807.2 ETVTAQEPITTQEPVTAQEPVTTQELIATQEPSTTQEHADEKKASEGDNISLSRLSEETE
     200       210       220       230       240       250         

1653 probable repa replication-associated replication  (357 aa)
initn: opt:  712 z-score: 125.8 E():   0.25
Smith-Waterman score: 717;     6.8% identity in 280 aa overlap
 28.2% noncontradicting positions,  21.4% class identity

           30        40        50         60         70        80  
GI||SP VSDRTPAXXXXXXXXXXXXXXXXXXGLRSLLQGT-SKPKEPA-SCLLKEKERKXXXXXXX
                                   + +q T-sk  E A-SC+  e   k       
1653   nnqerncaindiekRK VaEHNalIqSiAKMdKTalqMFELAVSCInTdalPennaifLl
       eekkqlqqlqelss    v   dk s m   q  psk         d enp kdhivy s
       kk  vvlt              s  t v                     e     t    
       p     k                                                     
             x                                                     
        10        20         30        40        50        60      

             90       100       110       120        130       140 
GI||SP XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX-ERTLPFGDVEYVDLDA
                                                  -er lP+  Ve  d d 
1653   KrdLFaFFdVddadKhrrFKqAianMQeQAfFrIranaarGiemrrIlPiPtVeWadYnD
        ee  k  e ssns tsq  e vnl  k  y n qedqnl fkfen v y y k ns d 
        k   s  k   s   t      ek       q ksekdk y yks         t  h 
        s   t                  f       e   kvey    t               
        x                      x       x   x x                     
         70        80        90       100       110       120      

             150       160       170       180       190       200 
GI||SP FXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXGTASGH
                                                                 G 
1653   dVlIrFnraIMPYLInLnanFsqhaiSdiaeLNSKYSiILYrWlaMnYnQfEHYqaK Ga
       e k q dqd      d kne tkykl elqk      l   k fs q s y   sn   n
       k m e hee      e  q          k                s        i   g
         t   sph         e          m                         y    
               x         x                                    x    
        130       140       150       160       170       180      

             210                220          230       240         
GI||SP RAGLTSRDTPSPVDP---------DTVEVLMTF---EPDPADLALSSIPGHETFDPRRHR
       R      d  sP  p---------DTv     F---e d    al  I  h  F  ----
1653   RraaQlEaYrnPrIiirdLRdeiTDTindhrrFdrlnrriiKnaidEInanThFnV----
        tee v n kd p kmke  eil   mdeyqq qnfendvl dple  tdh s k     
         kk   d  s s pvs   vfm   vks kh ph  hw   esvk   qf         
         v    s    t s      t        p  ts  ky   i  v   e          
              x                      x      x    x      x          
         190       200       210       220       230       240     

     250       260       270         280       290       300       
GI||SP FSEEELKPQPIMKKARKIQVPEEQKDEKYW--SRRYKNNEAAKRSRDARRLKENQISVRA
       f ee  k   i      I      kDe Y -- r y   eaak   dar lk   +    
1653   elreiqaraainhIqFHIeKKaradDnnYKrnnraaqdaeaanaradarlna qalankf
       fydkkkkgrslds v   t  rnwk es  lddqdyiedkerkeqnqndvll esmdspy
       s e s   gt i      v   k        e kq  lg kq sekedq yk v vq   
       t k v                 m        g vt  kt te   t kl vt    e   
                             x           x  x  x          x        
             250       260       270       280       290        300

       310       320       330       340                     
GI||SP AFLEKENALLRQEVVAVRQELSHYRAVLSRYQAQHGAL                
         l  en Ll     a    l      l     +   l                
1653   TrlLinnmLigandianqaiiaeLarnlYPlYdelkderGenaledHldYiarK
        kk lehf lfmlelmdidlllg qesv  k hksvell idgvkk ms vrd 
        m  m ss  spt mt kktmv   k    v     kfm l    t     ss 
           s     yyy f  p                                 y  
                        x                                    
              310       320       330       340       350    

8516 hypothetical surface-layer 125 80k kd protein pre (717 aa)
initn: opt:  727 z-score: 124.1 E():   0.32
Smith-Waterman score: 755;    13.1% identity in 336 aa overlap
 21.4% noncontradicting positions,   8.3% class identity

                                    10        20        30         
GI||SP                      XBINDINGPRTEINTAXREBMARPVSDRTPAXXXXXXXX
                              +    G +t I T--------P s  t +        
8516   LqEplnriSaTNFTLDGKAYFGNVVMGAGNKsVILT--------PYssSaLSlGDHKLTV
        s svenl s                     t              tt t  v       
               10        20        30                40        50  

      40        50        60        70        80        90         
GI||SP XXXXXXXXXXGLRSLLQGTSKPKEPASCLLKEKERKXXXXXXXXXXXXXXXXXXXXXXXX
                  L S  + t      A  +                               
8516   SgaKDfAeFVSLNSTHEFkVVEDKEAPTikEATATLETVTLTFSEDiDMDTVKASNVYWK
        vv  y g          t         vt                v             
             60        70        80        90       100       110  

     100       110       120       130         140       150       
GI||SP XXXXXXXXXXXXXXXXXXXXXXXXXXERTLPFGDVE-YV-DLDAFXXXXXXXXXXXXXXX
                                 E+TLP G V+-YV-D+  +               
8516   SGDSKKEASEFERIADNKYKFVFKGaEKTLPTGKVDVYVEDiKDYSDNKIAKDTKVTVTP
                                s               v                  
            120       130       140       150       160       170  

       160       170       180       190       200       210       
GI||SP XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXGTASGHRAGLTSRDTPSPVDPD
                                              T         S D  +    D
8516   EIDQTRPEVRKVTalDEKTIKVTFSKTVDgEsAeKaGNYTikDKDdKVVSVDKVTVDSKD
                    sv              k t i t    vt   g              
            180       190       200       210       220       230  

       220       230       240       250       260            270  
GI||SP TVEVLMTFEPDPADLALSSIPGHETFDPRRHRFSEEELKPQPIM-----KKARKIQVPEE
       +  V++-------DL      G  T   +  +---+  K    M-----K  R  +--E 
8516   SKSVII-------DLYSKVSVGENTITIKNVK---DATKLNNTMLDYTGKFTRSDK--EG
                   240       250          260       270         280

            280       290       300       310       320       330  
GI||SP QKDEKYWSRRYKNNEAAKRSRDARRLKENQISVRAAFLEKENALLRQEVVAVRQELSHYR
        k E------   N  AK  + +--LK n+    A+  +  N L r--+    Q Ls   
8516   PdfE------hVINADAKAKKVV--LKFnKKMDAASLADsSNYLVr--IndTLQTLsdDV
        ky       t                 d          y     k   dg     te  
                    290         300       310         320       330

            340                                                    
GI||SP AVLSRYQAQHGAL                                               
       A LS       +                                                
8516   ATLSVSNDATVVTITFAETIKGnDVVFAsGKaISGSGKaNVnELQVlGVKDTSGNVHdKF
                             d     t  t      v  h    m          k  
              340       350       360       370       380       390

4861 triadin kda junction-specific back sarcoplasmic m (220 aa)
initn: opt:  675 z-score: 122.6 E():   0.38
Smith-Waterman score: 675;    10.8% identity in 195 aa overlap
 15.4% noncontradicting positions,   4.6% class identity

               10        20        30        40        50        60
GI||SP XBINDINGPRTEINTAXREBMARPVSDRTPAXXXXXXXXXXXXXXXXXXGLRSLLQGTSK
           +         +A  E  A+   +R -                     ++  Q T K
4861   EKHEEPAKSTKKEHAAPSEKQAKAeIERK-EEVSAASTKKAVPAKKEEKTTKTVEQETRK
                               k                                   
               10        20         30        40        50         

                70         80        90       100       110        
GI||SP PK-EPASCLLKEKE-RKXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX
        K-   S+ LK+KE- K                                           
4861   EKPGKISSVLKDKELTKEKEVKVPASLKEKGSETKKDEKTSKPEPQIKKEEKPGKEVKPK
      60        70        80        90       100       110         

      120       130       140       150       160       170        
GI||SP XXXXXXXERTLPFGDVEYVDLDAFXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX
                  P  D+   +  A                                     
4861   PPQPQIKKEEKPEQDIMKPEKTALHGKPEEKVLKQVKAVTTEKHVKPKPAKKAEHQEKEP
     120       130       140       150       160       170         

      180       190       200       210       220       230        
GI||SP XXXXXXXXXXXXXXXXXGTASGHRAGLTSRDTPSPVDPDTVEVLMTFEPDPADLALSSIP
                         T SG +    S                               
4861   PSIKTdKPKSTSKGMPEVTESGKKKIEKSEKEIKVPARRES                   
            e                                                      
     180       190       200       210       220       230         
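
Note on reading the hit headers: each hit in this listing starts with four header lines (entry number, title and length; opt, z-score and E(); Smith-Waterman score, identity and overlap; noncontradicting and class-identity percentages). The following is a minimal sketch of how those lines could be parsed, assuming the layout is exactly as printed for entry 4861 above. The function name `parse_hit_header` and the regular expressions are illustrative assumptions, not part of FASTA-SWAP. In the hits shown in this listing, the noncontradicting percentage appears to equal identity plus class identity up to rounding; that is an observation from this output, not a documented definition.

```python
import re

# Patterns are assumptions derived from the header lines printed in this listing.
HEADER_RE = re.compile(r"^(?P<entry>\d+)\s+(?P<title>.*?)\s*\((?P<length>\d+) aa\)\s*$")
SCORES_RE = re.compile(
    r"opt:\s*(?P<opt>\d+)\s+z-score:\s*(?P<z>[\d.]+)\s+E\(\):\s*(?P<e>[\d.eE+-]+)"
)
SW_RE = re.compile(
    r"Smith-Waterman score:\s*(?P<sw>\d+);\s*(?P<ident>[\d.]+)% identity "
    r"in (?P<overlap>\d+) aa overlap"
)
CLASS_RE = re.compile(
    r"(?P<noncontra>[\d.]+)% noncontradicting positions,\s*"
    r"(?P<cls>[\d.]+)% class identity"
)


def parse_hit_header(lines):
    """Parse the four header lines of one hit into a dict (hypothetical helper)."""
    text = [ln.strip() for ln in lines]
    h = HEADER_RE.match(text[0])
    s = SCORES_RE.search(text[1])
    w = SW_RE.search(text[2])
    c = CLASS_RE.search(text[3])
    if not (h and s and w and c):
        raise ValueError("unrecognized hit header")
    return {
        "entry": int(h["entry"]),
        "title": h["title"],
        "length_aa": int(h["length"]),
        "opt": int(s["opt"]),
        "z_score": float(s["z"]),
        "e_value": float(s["e"]),
        "sw_score": int(w["sw"]),
        "identity_pct": float(w["ident"]),
        "overlap_aa": int(w["overlap"]),
        "noncontradicting_pct": float(c["noncontra"]),
        "class_identity_pct": float(c["cls"]),
    }


# Header lines copied from the hit for entry 4861 in this listing.
sample = [
    "4861 triadin kda junction-specific back sarcoplasmic m (220 aa)",
    "initn: opt:  675 z-score: 122.6 E():   0.38",
    "Smith-Waterman score: 675;    10.8% identity in 195 aa overlap",
    " 15.4% noncontradicting positions,   4.6% class identity",
]
hit = parse_hit_header(sample)

# Observed relation in this listing (tolerance allows for one-decimal rounding).
relation_holds = abs(
    hit["identity_pct"] + hit["class_identity_pct"] - hit["noncontradicting_pct"]
) < 0.15
```

The `initn:` field is printed without a value in this output, so the sketch does not try to read it. A parser meant for other FASTA-SWAP runs would need to be checked against their actual header layout.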

8476 beta-lactamase hypothetical regulatory protein 2  (312 aa)
initn: opt:  664 z-score: 118.8 E():   0.63
Smith-Waterman score: 761;    16.0% identity in 318 aa overlap
 19.5% noncontradicting positions,   3.5% class identity

               10        20        30        40        50          
GI||SP XBINDINGPRTEINTAXREBMARPVSDRTPAXXXXXXXXXXXXXXXXXXGLR--SLLQGT
                         E + R   D                      G+ --+ L  +
8476                 MLNSESLLRELRDALHEGGLTGSFLVRDLYTGEELGIDPDTELPTA
                             10        20        30        40      

       60        70        80        90       100       110        
GI||SP SKPKEPASCLLKEKERKXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX
       S  K P +    E+ R                                            
8476   SLVKLPLALATLERIRLGEVDGAQQIEVAPGRITTPGPTGLSRFRHPARVAVDDLLYLST
         50        60        70        80        90       100      

      120            130       140       150       160       170   
GI||SP XXXXXXX-----ERTLPFGDVEYVDLDAFXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX
              -----E T P    + V    F                               
8476   SVSDGTASDALFEITPPAQVEQMVREWGFRDLTVRHSMRELSETPAERFESADAHLAHAL
        110       120       130       140       150       160      

           180       190                       200       210       
GI||SP XXXXXXXXXXXXXXXXXXXXXX-GTA---------------SGHRAGLTSRDTPSPVDPD
                             -GTA---------------+G R G TSR  P-P    
8476   AISAGTSGRGHRVPQLDVARANTGTARAFVDLLEALWAPVLTGPRPGRTSRALP-PePAA
                                                               k   
        170       180       190       200       210       220      

       220              230         240       250       260        
GI||SP TVEVLMT-------FEPDPADLAL--SSIPGHETFDPRRHRFSEEELKPQPIMKKA--RK
           LM+-------  PD A  A --SS  G--T    RH     E     +   A--  
8476   RLRELMAANLLRHRLAPDFASDAATWSSKTG--TLLNLRHEVGVVEHADGQVFAVAVLTE
         230       240       250         260       270       280   

        270         280       290       300       310       320    
GI||SP IQVPEEQK--DEKYWSRRYKNNEAAKRSRDARRLKENQISVRAAFLEKENALLRQEVVAV
        QVP + +-- E   +------++A+R RD--RL+E                        
8476   SQVPADSQPGAEALMA------QVARRLRD--RLREW                       
           290             300         310       320       330     

          330       340     
GI||SP RQELSHYRAVLSRYQAQHGAL
                            
8476                        
         340       350      

8410 no title                                      (330 aa)
initn: opt:  660 z-score: 117.8 E():   0.71
Smith-Waterman score: 750;    26.7% identity in 116 aa overlap
 26.7% noncontradicting positions,   0.0% class identity

        190       200       210       220       230       240      
GI||SP XXXXXXXXXGTASGHRAGLTSRDTPSPVDPDTVEVLMTFEPDPADLALSSIPGHET--FD
                                   + D VE L -----P D   S  P  E --F 
8410   LDYADFADDSEEIKDEDVDHQTSDLENNNNDKVEGLA-----PKDQTTSYEPVDEVPEFI
           180       190       200       210            220        

          250       260       270         280        290       300 
GI||SP PRRHRFSEEELKPQPIMKKARKIQVPEEQ--KDEKYWSRRYKNNE-AAKRSRDARRLKEN
             +EEE---Q + K    I   E+Q--K E   +R    +E-AA   +   +    
8410   DDADSVNEEE---QTVDKNEDAITKDEQQVVKKEVDLTRPSAPSEPAAAEHKSYTKDELT
      230          240       250       260       270       280     

             310       320       330       340      
GI||SP QISVRAAFLEKENALLRQEVVAVRQELSHYRAVLSRYQAQHGAL 
       +I  RA+ +E+   L +  + A   E                   
8410   KIMDRASKIEQIQKLAKYAISALNYEDLPTAKDELTKALDLLNSI
         290       300       310       320       330

8344 len: 393 cai: mitochondrial 0.17 outer membrane 4 (393 aa)
initn: opt:  663 z-score: 117.2 E():   0.76
Smith-Waterman score: 664;    15.1% identity in 345 aa overlap
 16.5% noncontradicting positions,   1.4% class identity

                                    10        20        30         
GI||SP                      XBINDINGPRTEINTAXREBMARPVSDRTPAXXXXXXXX
                                   G R E N +         S- +Pa        
8344   MSSRIIVGSAALAAAITASIMVREQKAKGQRREGNVSAYYNGQEYGS-SAPaQLGKLHNI
                                                          p        
               10        20        30        40         50         

      40        50        60        70        80        90         
GI||SP XXXXXXXXXXGLRSLLQGTSKPKEPASCLLKEKERKXXXXXXXXXXXXXXXXXXXXXXXX
                    +LL  + K +E A -----K  K                        
8344   KQGIKEDALSLKDALLGVSQKAREEAP-----KVTKRVISPEEDAQTRKQLGQKAKDSSS
      60        70        80             90       100       110    

     100       110       120       130       140       150         
GI||SP XXXXXXXXXXXXXXXXXXXXXXXXXXERTLPFGDVEYVDLDAFXXXXXXXXXXXXXXXXX
                                  R---F --E VD +                   
8344   QSIFNWGFSEAERRKAIAIGEFDTAKKR---FE--EAVDRNEKELLSTVMREKKAALDRA
          120       130       140            150       160         

     160       170       180       190       200       210         
GI||SP XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXGTASGHRAGLTSRDTPSPVDPDTV
                                            T ----------------D +T 
8344   SIEYERYGRARDFNELSDKLDQQERNSNPLKRLLKNNTG----------------DANTE
     170       180       190       200                       210   

     220       230       240       250       260       270         
GI||SP EVLMTFEPDPADLALSSIPGHETFDPRRHRFSEEELKPQPIMKKARKIQVPEEQKDEKYW
       E          D A   --G E  +  +   S E    Q  +   +KI+-------EK W
8344   EAAARSVQGWGDTAQEF--GREELEEAKRNASSEPSEAQKRLDELKKIK-------EKGW
           220       230         240       250       260           

     280         290       300       310                   320     
GI||SP SRRYK--NNEAAKRSRDARRLKENQISVRAAFLEK----------ENA--LLRQEVVAVR
           K-- +E     R AR L     +--AA L K----------EN+-- L + V    
8344   FGYNKGEQSEQQIAERVARGLEGWGET--AAQLSKDEMDDLRWNYENSKKQLDKNVSDAM
          270       280       290         300       310       320  

         330       340                                             
GI||SP QELSHYRAVLSRYQAQHGAL                                        
         LS  +  L  Y ++  +                                         
8344   DSLSKAKEDLKQYGSHWWSGWTSKVDNDKQALKDEAQKKYDEALKKYDEAKNKFKEWNDK
            330       340       350       360       370       380  

4598 major minor capsid protein 10a 10b            (344 aa)
initn: opt:  657 z-score: 117.0 E():   0.78
Smith-Waterman score: 657;    13.8% identity in 305 aa overlap
 21.0% noncontradicting positions,   7.2% class identity

               30        40        50        60        70        80
GI||SP MARPVSDRTPAXXXXXXXXXXXXXXXXXXGLRSLLQGTSKPKEPASCLLKEKERKXXXXX
         +                           lRS+  G S------+        k     
4598   GDKLALFLKVFGGEVLTAFARTSVTmpRHMlRSIaSGKS------AQFPViGRTqAAYLa
                                ts   v   s               l   k    k
              30        40        50        60              70     

               90       100       110       120       130       140
GI||SP XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXERTLPFGDVEYVDLD
                                                    E T   G--E   + 
4598   PGENLDDKRKDIKHTEKVIhIDGLLTADVLIYDIEDAMNHYDVRaEYTaQLG--ESLAMA
                          t                        s   s           
          80        90       100       110       120         130   

              150       160       170       180       190       200
GI||SP AFXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXGTASG
       A                                                       T +-
4598   ADGAVLAEiAGLcNledgsNENIEGLGkaTVieltqPnkaaLTDqVaLGKaIIAaLTiA-
               l   v vpsky        tp  lttvk ttgs   p e   e   q  k  
           140       150       160       170       180       190   

              210             220       230       240       250    
GI||SP HRAGLTSRDTPSP-----VDPDT-VEVLMTFEPDPADLALSSIPGHETFDPRRHRFSEEE
       -RA LT    P+ ----- dPD+-  +L +  P+ A+ a    P   t --R     E  
4598   -RAaLTKNYVPAADRtFYcdPDnYSAILAALMPNAANYaALiDPErGsI--RNVMGFEVV
          s           v  tt  s               q  l   k t            
             200       210       220       230       240           

          260         270       280       290       300       310  
GI||SP LKPQPIMKKARKIQ--VPEEQKDEKYWSRRYKNNEAAKRSRDARRLKENQISVRAAFLEK
         P+     A    -- P  QK     +    n  +Ak --++  L     +V  + L-k
4598   EVPHLTAGGAGdaREdaPadQKHaFPAnkgeGnVKVAlD--NViGLFqHRSAVGTVKL-r
                  tt  gt tg   v   tsst t    k     v   m           k
     250       260       270       280         290       300       

            320       330         340               
GI||SP ENALLRQEVVAVRQELSHYRA--VLSRYQAQHGAL          
       + AL R----A R---++y A--++++Y   HG L          
4598   DLALER----ARR---ANfQADQIIAKYAMGHGGLRPEAAGAiVl
                         y                       v f
        310              320       330       340    

6791 dtaf tsm1 ii protein 150 gene product         (1039 aa)
initn: opt:  684 z-score: 114.9 E():      1
Smith-Waterman score: 684;    20.1% identity in 134 aa overlap
 25.4% noncontradicting positions,   5.2% class identity

             190       200       210       220       230       240 
GI||SP XXXXXXXXXXXXXXGTASGHRAGLTSRDTPSPVDPDTVEVLMTFEPDPADLALSSIPGH-
                            a l      s  dp T e   t       +  s + G+-
6791   STDdepqlnnshmfnncicnSarlWfPCVDlladkcTWrLEFsVdrnmkaigcgeLiGQN
          ektwmwsvytstgeyes ssy v    sfdeps  e   t pklvtnvstsk l   
              190       200       210       220       230       240

                    250       260          270       280       290 
GI||SP ------ETFDPRRHRFSEEELKPQPIMK---KARKIQVPEEQKDEKYWSRRYKNNEAAKR
       ------E  D   H - EEE KP  ++K---K   ++  EE K+ K--S+  ++N+    
6791   GEESEKEKEDTPEHD-EEEEGKPARVIKDEDKDSNLKNDEEGKNSK--SKDAQDNDEEEE
              250        260       270       280         290       

             300       310       320            330       340      
GI||SP SRDARRLKENQISVRAAFLEKENALLRQEVV-----AVRQELSHYRAVLSRYQAQHGAL 
         ++    e     R    E  N  LR  +V-----+  +EL H      +    +    
6791   EGESDEEEeEGEEERRNIEESNNPSLRDVIVCCSEYSNIKELPHPIDLTKKKCIFQIINP
               g                                                   
       300       310       320       330       340       350       

                                                                   
GI||SP                                                             
                                                                   
6791   VAPHHIGWAIGAFNSWSLPLesimspDardeteedklrenVcannnaladddieididPh
                           ivpptv lekkvfhysvstp vdpvidtmvgqfgsyvi i
       360       370       380       390       400       410       

11380 hypothetical trwc protein                    (508 aa)
initn: opt:  649 z-score: 113.4 E():    1.2
Smith-Waterman score: 763;     4.2% identity in 262 aa overlap
 24.4% noncontradicting positions,  20.2% class identity

                     10        20        30        40        50    
GI||SP       XBINDINGPRTEINTAXREBMARPVSDRTPAXXXXXXXXXXXXXXXXXXGLRS-
                                     v d  Pa                   lR -
11380  nRdmqQlqkiaEKAKnarcaliGDkAQllaiEaGrPadIAYqlraAdiaTAhMrEiqRQK
       t lfe tlslv    gchvvym  t  tksv d k fe   lsqe gmq  t s vl   
      150       160       170       180       190       200        

                      60        70              80        90       
GI||SP ----------LLQGTSKPKEPASCLLK------EKERKXXXXXXXXXXXXXXXXXXXXXX
       ----------L+  T      As  Lk------E e                        
11380  nPELKKIaqELMMSTPaaadrAlsqLerngdViEIenhhdRraaISDSVQKiAEaYcALk
       d      tv       esvgk sts kqikw t  kssve kgp       v  h i  s
      210       220       230       240       250       260        

       100       110       120       130       140       150       
GI||SP XXXXXXXXXXXXXXXXXXXXXXXXXXXXERTLPFGDVEYVDLDAFXXXXXXXXXXXXXXX
                                   e     G  e vd                   
11380  ldERdrTlIaaaTNEaRreINqAIRiiRqGlGeaGqGeefdTllrmDsRLTdAErrHSkN
       pe  tn v vsg   n qt  e   vv e k tl k ifvt tvlv t   q  lh  p 
      270       280       290       300       310       320        

       160       170       180       190       200       210       
GI||SP XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXGTASGHRAGLTSRDTPSPVDPD
                                             g   g+r  l     P  --- 
11380  YqVGdVirlnrdYakTGLQRGELYRVseTnhdnrlliiedgDgQrKVinldlMPkr---e
        t  h vqpenq lt           vk gpgknttvlgeh k n  lqfsp  th   t
      330       340       350       360       370       380        

       220       230       240       250       260       270       
GI||SP TVEVLMTFEPDPADLALSSIPGHETFDPRRHRFSEEELKPQPIMKKARKIQVPEEQKDEK
       tvev---f p+ a+la s i      D  r   + e lk   + K  rki v   + +  
11380  kiel---fqrEraeiasgDiiriTrnDKerdLaagdrlrVtaVNKadrkiTaldGKrEHL
       tvsv   yhp tthlqvs tlkw ks  hlg vnhesmk vh   eehtv vts  s   
            390       400       410       420       430       440  

       280       290       300       310       320       330       
GI||SP YWSRRYKNNEAAKRSRDARRLKENQISVRAAFLEKENALLRQEVVAVRQELSHYRAVLSR
               n     +  +        +     +  +               +  Ra    
11380  nsdLnqdqnqHiDhaYAsTVHglQGaTadriLILLDaHaelrnTrrDllYVAiiRarhqa
       sve ptpkpl v yn  t   ss  l sqsv     s nssst mk vy   vs stfev
            450       460       470       480       490       500  

6096 crtj regulatory repressor protein             (459 aa)
initn: opt:  640 z-score: 112.5 E():    1.4
Smith-Waterman score: 640;     8.3% identity in 339 aa overlap
 23.3% noncontradicting positions,  15.0% class identity

                            10        20           30        40    
GI||SP              XBINDINGPRTEINTAXREBMARP---VSDRTPAXXXXXXXXXXXXX
                                   + rebMa P---   r  a             
6096   aLqrlaPDLlaDiiasAaDIaLlVSqerVVreVMaNPqhgSaerlaaWqGarLeqllsaE
       s psvs   vr lvtt c  s v  pgg  es  v  hfp fgqfse e rp sevftp 
               10        20        30        40        50        60

           50        60             70        80        90         
GI||SP XXXXXGLRSLLQGTSKPKEP-----ASCLLKEKERKXXXXXXXXXXXXXXXXXXXXXXXX
             lR-L  G ----EP-----a  L     r                         
6096   SaaKlrnR-LadGl----EPGRGSlalELnHaDaraFelPiRYiihRlgaDrsiLliGRD
        vq fel   se p          vqv  t i pds tf v  tlt spe gtl ml   
                70            80        90       100       110     

     100       110       120       130       140       150         
GI||SP XXXXXXXXXXXXXXXXXXXXXXXXXXERTLPFGDVEYVDLDAFXXXXXXXXXXXXXXXXX
                                 E    +  v  v  d                   
6096   lrPiAEVQQQLVaAQLAlERDYEaQREiETRYRVlLdahraPlliVSMSTGRIaDLNlAA
       mq l        k    m     t   m      v evspd mvl        v   s  
         120       130       140       150       160       170     

     160       170       180       190       200       210         
GI||SP XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXGTASGHRAGLTSRDTPSPVDPDTV
                                           gt s   + ltsR +   v---tV
6096   aaliGatRadLidAaiaQEldGRRRGEFlEnlaniAasdpaaaVEllaRRSrrrl---lV
       glml gv qe lg pvg  fe       m tmtkl gteslgp  vti   qkkv   t 
                                                      s            
         180       190       200       210       220       230     

     220       230       240       250       260       270         
GI||SP EVLMTFEPDPADLALSSIPGHETFDPRRHRFSEEELKPQPIMKKARKIQVPEEQKDEKYW
       -v   F      L L  i   e+  pr    sE--l      k    i   ----D    
6096   -tarlFRAAGdRLLLCrideAdArrprgDdlsE--nlaRLfheGiDaiVFl----DADGT
        vptv     e     qlgp e tqtvv etv   lse  ylk v gm  s         
             240       250       260         270       280         

     280       290       300       310       320         330       
GI||SP SRRYKNNEAAKRSRDARRLKENQISVRAAFLEKENALLRQEVVAVRQ--ELSHYRAVLSR
        R -- NeA     Da  l   q    A FL +  + Lr  +  Vr -- L hY   L+ 
6096   IRa--ANdAFLnlTDagSaAairGRSiADFLaRGaVDLrVLiDnVrRiGqLRhYaTRLnT
         g    e   ym  ss l lvq   f    s  s   n  l s k t h  l v   t 
           290       300       310       320       330       340   

       340                                                         
GI||SP YQAQHGAL                                                    
         a + a                                                     
6096   DFaGQiaaEiSATlldDRarPliaLViRDsnrADamRRPimaggainEgaRNVMqlVGna
         s  vtv l   wfh  et tlv  v  tsl  tt   vpptmvsd pl    em  ys
           350       360       370       380       390       400   

2681 beta-adaptin adaptin protein clathrin complex bet (519 aa)
initn: opt:  635 z-score: 111.0 E():    1.7
Smith-Waterman score: 682;     6.2% identity in 288 aa overlap
 25.3% noncontradicting positions,  19.1% class identity

       40        50        60        70        80        90        
GI||SP XXXXXXXXXXXGLRSLLQGTSKPKEPASCLLKEKERKXXXXXXXXXXXXXXXXXXXXXXX
                     ++        +     l e  Rk                       
2681   ikDaqDpNPLIRalAiRTMgcIRVDKIlEYicePLRrcLhDdnaYVRKTAaiCVAKLhdi
       vt ce s     cm v   sm      t  let   kt k edp      vv     fql
                                                                y  
      20        30        40        50        60        70         

      100       110       120       130       140       150        
GI||SP XXXXXXXXXXXXXXXXXXXXXXXXXXXERTLPFGDVEYVDLDAFXXXXXXXXXXXXXXXX
                                   rt    ++   d da                 
2681   nadlcedqGflddLrnaiaDSNPlViANatAALiEIan dqdaqnLldlnaqninqlLlA
       skqmvvel vveq kdlld    m v  rv   s  ne shpnsd semiqshvskf t 
         t         s v   s                 h   msgvp vs kpvs       
                   t                       s     s       s t       
      80        90       100       110        120       130        

      160       170       180       190       200       210        
GI||SP XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXGTASGHRAGLTSRDTPSPVDPDT
                                         ----Ta    a -------s V   t
2681   LNECTEWariiILdcLanYnaKDdrEAQdIcdRi----TarLaHaN-------aAVVLaa
              gqvf  gs ge mp  el   s ie v     ph q v        p    st
                     t s  s   s                  s          s      
      140       150       160       170           180              

      220                       230       240       250       260  
GI||SP VEVLMTF------EPD----------PADLALSSIPGHETFDPRRHRFSEEELKPQPIMK
       v Vlm f------e d----------pa  +L S p    + p r+     e  P  + k
2681   iKVimrllnllqidldfnalilKrLapalVsLlSaePElQYVaLrNIriIlqKrPdiLkq
       v  lvkfmemppkessscnmlm k sspf t m gp  m   p k  nl ve y el th
                  ss y yygt t            s   p                    k
                         s               t   v                     
       190       200       210       220       230       240       

                270         280       290       300       310      
GI||SP KAR----KIQVPEEQKDEK--YWSRRYKNNEAAKRSRDARRLKENQISVRAAFLEKENAL
         r----K   P   K EK--   R       a+       LKE  + v   F  k    
2681   EiriFfVKfNDPiYVKLEKiDIliRLanqaNiaQ   lLaELKEYAmEydpdFVrrAirA
        lkv y  y   l      l  mv  vdps lk    v s      t veve  sk vq 
        m                         s           t                    
       250       260       270       280          290       300    

        320       330       340                                    
GI||SP LRQEVVAVRQELSHYRAVLSRYQAQHGAL                               
       l q  + v  E s  r v                                          
2681   igrcaIKy  EqfaercldiLLdLiqTrqntikddacisirDilRhcPn     Kqecai
       lsqlg  v   psvskvvst  e le kvdyvvqecivvlc lf ky g      yvsiv
                  s                            k                 v 
          310         320       330       340       350            

4810 5 iif transcription chain alpha factor tfiif subu (563 aa)
initn: opt:  631 z-score: 109.9 E():      2
Smith-Waterman score: 631;     5.9% identity in 270 aa overlap
 22.6% noncontradicting positions,  16.7% class identity

                10        20        30        40        50         
GI||SP  XBINDINGPRTEINTAXREBMARPVSDRTPAXXXXXXXXXXXXXXXXXXGLRSLLQGTS
                 R   Nt  r  + r ----  a                        q   
4810   SganVqEfkiRVPrNmpKrhniMaF----NAadnVnFaqWrnarlERdnnaKeir qEEd
        sqs t yvv   k ps kyhl r       tlk d st nqvkm  elsn kmy m  e
                      tt   sv                            m  f      
               10        20            30        40        50      

      60        70        80        90       100       110         
GI||SP KPKEPASCLLKEKERKXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX
        Pk  A +    k R                                             
4810   qPefGAGSEfNRdqREEaRRKKfGIiarefrpdaQPWiLrVnGKaGrKfKGireGGVgEN
       m ks     y  kl   s    y  vlkkykved   l k g  s k y  vkk   t  
                                                   t               
          60        70        80        90       100       110     

     120       130       140       150       160       170         
GI||SP XXXXXXERTLPFGDVEYVDLDAFXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX
                -p G +E   l                                        
4810   aaffiFTqa-aDGAiEAfPlhnWYNFqPiarhrsLsAEEAEqEfeRRnKVlNhFsiMqrr
       tsyyv  hc p   f  y vse    t lqkykt t     e wg  k  m y tl lqk
                           t       v                               
         120        130       140       150       160       170    

     180       190       200       210       220       230         
GI||SP XXXXXXXXXXXXXXXXGTASGHRAGLTSRDTPSPVDPDTVEVLMTFEPDPADLALSSIPG
                       g     ra    r   s-------e  +t   d  Dl lsS   
4810   RLr DqdqdqDedEaggg  EKaa    rrKak-------dLrIhDldd  DlEdeSdae
         k  eeeee pe ekli    rg    kk ks       e k t mee   s ls tes
             vg      k       g     t  s                      m   s 
           180       190             200                210        

     240       250          260       270       280       290      
GI||SP HETFDPRRHRFSEEELKPQ---PIMKKARKIQVPEEQKDEKYWSRRYKNNEAAKRSRDAR
        e          e   Kpq---P+ K arK +      DE        + E       + 
4810   dae     dqEderrgKaqgKaPLaKGarKKKrKrdsDDEAlEdSDDGDeEGrEmDYMSD
       ens     ee ggddk pkk g  k  gd   k kgv    f e     f  q v     
        e      g   ssip     v      k                               
                     v                                             
      220            230       240       250       260       270   

        300       310       320       330       340                
GI||SP RLKENQISVRAAFLEKENALLRQEVVAVRQELSHYRAVLSRYQAQHGAL           
            q        e +     + v           +       +               
4810   eSSSddEePEgKakaaqqdedmKGlaEqda      SdEEeddEKap    eEdadeEee
       g   eq l  s dpepkeekgp  vd esd       s  see  ks    k eeeg kk
       t   se p    i p  v           e               p        k     
           280       290       300             310           320   

3332 heat heavy shock 70 chain heat-shock binding prot (416 aa)
initn: opt:  620 z-score: 109.9 E():      2
Smith-Waterman score: 620;     5.8% identity in 330 aa overlap
 33.3% noncontradicting positions,  27.6% class identity

                  10        20         30        40        50      
GI||SP    XBINDINGPRTEINTAXREBMARPVSDR-TPAXXXXXXXXXXXXXXXXXXGLRSLLQ
             n              eb+a p  dR-tPa                   l    q
3332   IGidlGnTsacia inranradiiaNdaGaRaiPaalafs drdrlhGdaAlnqaarnpq
         vhf c yssvg vfqdgdvevvp ed n tt sivsyt eeeqyi gq kqsriihve
          t  t        skh q      pq d v   y   v gg evv l    ylpls k
                      ypk k       l e            t           v    s
               10         20        30         40        50        

            60        70        80        90       100       110   
GI||SP GT---SKPKEPASCLLKEKERKXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX
        T--- k     s   ke   k                                      
3332   nTiinarriiGRlfaDkcaqkdcangacaVendndrgryqirgkn   eeqnealnpddi
       s vfdfkdll  kpd pqdvskikelkfr iekdgklkveldttg   ggeekimsveev
          vkv qf   ssg  ev  ymshsppq  v gkvpf issy      lkt lft    
                     f  v      lw yl           v            kvy    
                                   x                               
       60        70        80        90       100          110     

           120       130       140       150       160       170   
GI||SP XXXXXXXXXXXXERTLPFGDVEYVDLDAFXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX
                   e  l     e V    f                               
3332   aarhlnrlKeaAeadiGeaannaViTVPanFndeQrqAlgaaaaaaGlnilriInEPsaa
       srlifskm li hdyl hdikdv l   fd gek ks tkdsgrii fdvvql h  tsr
       vsmv t   tt  sv  skvte  v   ty s s  t   e  gk   q   f      s
                        yp  k                     v                
         120       130       140       150       160       170     

           180       190       200       210       220       230   
GI||SP XXXXXXXXXXXXXXXXXXXXXXGTASGHRAGLTSRDTPSPVDPDTVEVLMTFEPDPADLA
                                 G---G ts  +   vd    evl T--------a
3332   aiAhaedr     pgegarNiliadcG---GgrhdaaiiairnGifrilaT--------a
       ll ygiqq     tfgkdk vvvfkl     islslsvlevnd myevks         n
       s    lgk      sk es      f     ttf v  ms ds v t             
       t      f         kv              s     t                    
         180            190          200       210                 

           240         250       260       270       280       290 
GI||SP LSSIPGHETFDPR--RHRFSEEELKPQPIMKKARKIQVPEEQKDEKYWSRRYKNNEAAKR
        s   G e fd r-- h  se + k q    k             k  sr   n eaaKr
3332   gndniGgdnldrrLanhlaaeFqrlnnanirqn           araiarLraaaeaaKr
       hdlhl vedftne ldyfiei kkkhqhdlsgt           kkslrk krncsrg h
       tstsw  gh  dt vq  vs     fki p i               mm  mnei it k
          t       t   e         y k v k               vs   tss v   
     220       230       240       250                  260        

             300                     310       320       330       
GI||SP SRDARRLKENQISV--------------RAAFLEKENALLRQEVVAVRQELSHYRAVLSR
       s   --l   qisv--------------Ra f e  n l rq   avrq l         
3332   qLSn--agqanceiDSLadGqDfdaninRarfEelandlFaqcieairqaiadaeldaad
       s  s  lssvqifv   ie i yhclvs mky lvnikv rkflkfvdellrqtgfkkdq
       t  t  st  sls    f  f  ses t       csp  nssssp ekv  k k tpl 
                 t v    y     yt               k tt    s        tt 
      270         280       290       300       310       320      

       340                                                         
GI||SP YQAQHGAL                                                    
         a                                                         
3332   InallL  eGGssriPKirqniq       dilnaralnadnnanEaaaiGAAiqAaiim
        ddvv   t  vtft  lqklle       elfggqnklnsippd lips   le rlls
        he     v        vtt  k        f ppkd  k      vv w   v  g  v
         k                            y   v             y      s   
        330         340              350       360       370       

4703 dynactin 117 150 kd dynein-associated isoform pro (882 aa)
initn: opt:  645 z-score: 109.5 E():    2.1
Smith-Waterman score: 673;     9.6% identity in 333 aa overlap
 23.4% noncontradicting positions,  13.8% class identity

                 10        20        30        40        50        
GI||SP   XBINDINGPRTEINTAXREBMARPVSDRTPAXXXXXXXXXXXXXXXXXXGLRSLLQGT
          b   +  p  e      +b        T                     l  l +  
4703   srgaAPalleaqKeeanLraQlaDLeEKLETLrqrRnEDKarLrEldKhKIQlEQlQEfr
       tpfv  mvpsps tsee qd vr  t      kik s   ek k fe m   f  v  wk
             p    t    g  s             l              y           
               20        30        40        50        60        70

       60        70        80        90       100       110        
GI||SP SKPKEPASCLLKEKERKXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX
       sK  e  + L ke  r                                            
4703   sKiqeaQAdLQrrLlaaeqEaqdaieaKeahaqedarharahrdahldqedareaAdsLQ
       t mmgq  s  ke krgke skeglgg grlhegmgdlsdriemgtsgkgmgegr et  
                      e  g   g t    qymg    t  nv  i         k     
                         k                                         
               80        90       100       110       120       130

      120       130       140       150       160       170        
GI||SP XXXXXXXERTLPFGDVEYVDLDAFXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX
              Er------vey   D                                      
4703   qEldalKEr------ideLemDLEiLraEiqnK          GgDgaa SsYqlKQLEq
       l vess  k      vey tt   l kh mee            s spg  t ef    e
                           v      s                          v     
                    140       150                 160        170   

      180       190       200                     210       220    
GI||SP XXXXXXXXXXXXXXXXXGTASGH----------RAGLTS----RDTPSPVDPDTVEVLMT
                             h----------r  lts----r+  s-   d  E  + 
4703   QNaRLKdaLVRlRDLSahdKqdhqKLqKqlEkKrqEleelrrqrErLq-aeidqaEaiia
         i   et   m    sse heiv  s em m ns vtsveqtk k s eklkel ktvd
                                   l     t   v            vs   s   
           180       190       200       210       220        230  

          230       240       250       260       270       280    
GI||SP FEPDPADLALSSIPGHETFDPRRHRFSEEELKPQPIMKKARKIQVPEEQKDEKYWSRRYK
          + +D AL +    Et   r  ---e E K   +      ++  eE  de   s r  
4703   dLqEQVDAALGAEEMVEqLadrnl---nLEdKVreLrEeiadLEAlnEmndqLqEnarEl
       e k              m tekkm   d  e  kl e tvgq   me vhee v snh t
                        t         e                                
            240       250          260       270       280         

          290        300       310       320        330       340  
GI||SP NNEAAKRSRDAR-RLKENQISVRAAFLEKENALLRQEVVAVR-QELSHYRAVLSRYQAQH
         e       A -r kE q  v AA----------qE va r-Q +  yR    + q q 
4703   ELdLREqLDlAaaakrEaqrrrdAA----------qETiaDrdQTIkKfRqLtahLnDqn
         e   e  m ngrvk vekeve            i  vy yq   v y e vqk q vl
                  g      l                                         
     290       300       310                 320       330         

                                                                   
GI||SP GAL                                                         
         L                                                         
4703   rELrnrneanaerqqQdPp     EiiDfKqkFAEsKAharAIdmqLRQiElaQANrHmq
       t  mdqqsssekesl p s      tf y im   t  ytk  eve   m vq   e vs
          ts     v k                                       s       
     340       350            360       370       380       390    

8289 epidermal eps8 growth protein factor receptor kin (822 aa)
initn: opt:  640 z-score: 109.1 E():    2.2
Smith-Waterman score: 740;    14.3% identity in 307 aa overlap
 17.9% noncontradicting positions,   3.6% class identity

       30        40        50        60           70        80     
GI||SP TPAXXXXXXXXXXXXXXXXXXGLRSLLQGTSKPKEPA---SCLLKEKERKXXXXXXXXXX
       +                             S   E A---S   K+K R           
8289   SiLALVCKEPTQnKPDLHLFQCDEVKANLISEDIESAISDSKGGKQKRRPdALRMIanAD
        v          s                                     e     sk  
        150       160       170       180       190       200      

          90       100       110       120       130               
GI||SP XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXERTLPFGDVEY--------V
                                               E+   + + E --------+
8289   PgIPPPPRAPAPaPPGTVTQVDVRSRVAAWSAWAADQGDFEKPRQYHEQEETPEMMAARI
        s          v                                               
        210       220       230       240       250       260      

       140                  150       160       170       180      
GI||SP DLDA-----------FXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX
       D D+-----------F                                            
8289   DRDVQILNHILDDIEFFITKLQKAAEAFSELSKRKKnKKgKRKGPGEGVLTLRAKPPPPD
                                           s  s                    
        270       280       290       300       310       320      

        190       200        210              220       230        
GI||SP XXXXXXXXXGTASGHRAGLTSR-DTPSPVD-------PDTVEVLMTFEPDPADLALSSIP
                       A L S -  PS  D-------P  + V  T  P+ A  +LS + 
8289   EFlDCFQKFKHGFNLLAKLKSHIQNPSAaDLVHFLFTPLNMVVQATGGPELASSVLSPLL
         v                         s                               
        330       340       350       360       370       380      

      240       250       260           270              280       
GI||SP GHETFDPRRHRFSEEELKPQPIM----KKARKIQVPEEQKDEKY-------WSRRYKNNE
         +T D   +  + eE k    +---- KaR -+ P EQ    Y-------W     N  
8289   nKDTiDFLNYTanadERqLWMSLGdsWmKaRA-EWPKEQFIPPYVPRFRNGWEPPMLNFM
       t   v      vtge  k      gt v v                              
        390       400       410        420       430       440     

       290       300       310       320       330       340       
GI||SP AAKRSRDARRLKENQISVRAAFLEKENALLRQEVVAVRQELSHYRAVLSRYQAQHGAL  
        A    D   L E+  +V      +e   L  E   V           S      G    
8289   GApmEQDlYQLAESVANVAEHQRKQdiKRLSTEHSnVSdYhPADGYAfSSniYhRGpHaD
         tt   m                 es        s  e p      y  sm t  s l 
         450       460       470       480       490       500     

                                                                   
GI||SP                                                             
                                                                   
8289   qGEAAmaFKpTpNrqiDRNYdalKTQPKKYAKSKYDFVARNnSELSVlKDDiLEILDDRr
       h    vp  s s hhv    epv                  s     m   v       k
         510       520       530       540       550       560     
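
Each hit record in this listing opens with a description line ending in the library sequence length, followed by a score line and a Smith-Waterman line. The sketch below shows one way such a three-line summary could be read into a dictionary. The function name `parse_hit` and the field names are illustrative assumptions and are not part of FASTA-SWAP; the regular expressions are fitted only to the line layout visible in this output. The `initn:` field is skipped because no value is printed for it here.

```python
import re

# Patterns fitted to the summary lines as they appear in this listing (an assumption,
# not a documented FASTA-SWAP format specification).
HEADER = re.compile(r"^(\d+)\s+(.*?)\s*\((\d+) aa\)\s*$")
SCORES = re.compile(r"opt:\s*(\d+)\s+z-score:\s*([\d.]+)\s+E\(\):\s*([\d.eE+-]+)")
SW = re.compile(r"Smith-Waterman score:\s*(\d+);\s*([\d.]+)% identity in (\d+) aa overlap")


def parse_hit(lines):
    """Parse the first three summary lines of one hit record (hypothetical helper)."""
    head = HEADER.match(lines[0])
    scores = SCORES.search(lines[1])
    sw = SW.search(lines[2])
    if not (head and scores and sw):
        raise ValueError("lines do not look like a hit summary")
    return {
        "id": head.group(1),
        "description": head.group(2),
        "length_aa": int(head.group(3)),
        "opt": int(scores.group(1)),
        "z_score": float(scores.group(2)),
        "e_value": float(scores.group(3)),
        "sw_score": int(sw.group(1)),
        "identity_pct": float(sw.group(2)),
        "overlap_aa": int(sw.group(3)),
    }


record = parse_hit([
    "8289 epidermal eps8 growth protein factor receptor kin (822 aa)",
    "initn: opt:  640 z-score: 109.1 E():    2.2",
    "Smith-Waterman score: 740;    14.3% identity in 307 aa overlap",
])
print(record["id"], record["opt"], record["e_value"])
```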

8838 h probable dehydrogenase region ltdh methotrexate (287 aa)
initn: opt:  597 z-score: 108.3 E():    2.4
Smith-Waterman score: 597;     9.6% identity in 270 aa overlap
 20.7% noncontradicting positions,  11.1% class identity

               10        20        30        40        50        60
GI||SP XBINDINGPRTEINTAXREBMARPVSDRTPAXXXXXXXXXXXXXXXXXXGLRSLLQGTSK
                 +    +     A     r+ A                     s L  T  
8838            TaPTaPVALVTGAAKRLGrSIAEaLHAEGYaVCLHYHRSAAdAnaLaATLN
                 s  v             s    g      t          e st s    
                        10        20        30        40        50 

               70        80        90       100       110       120
GI||SP PKEPASCLLKEKERKXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX
        + P S +  + +                                               
8838   ARRPNSAITVQADLSNVATApfSeaDGSaPVTLFsRCaaLVaACYmHWGRCDVLVNNASS
                           sv gt   v     t  se  d   t              
              60        70        80        90       100       110 

                     130       140       150       160       170   
GI||SP XXXXX-------ERTLPFGDVEYVDLDAFXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX
            -------e     GD E  +  a                                
8838   FYPTPLLRnDadegepcVGDrEalEtAaADLFGSNAIAPYFLIKAFAqRsadpraaqRGT
               k egghgss   k sm v t                   h vrhtsqes   
             120       130       140       150       160       170 

           180       190       200       210           220         
GI||SP XXXXXXXXXXXXXXXXXXXXXXGTASGHRAGLTSRDT----PSPVDPDTVEVLMTFEPDP
                               A g   GLT   +----p  +  + V   ++  pD-
8838   nYSIiNMVDAMTnQPLLGYTiYTMAKeALEGLTRSAALELApLQIRVNGVgPGLSVLpD-
       s   v       s       m     g              s        s      v  
             180       190       200       210       220       230 

     230       240        250       260          270       280     
GI||SP ADLALSSIPGHETFDPRRHR-FSEEELKPQPIM---KKARKIQVPEEQKDEKYWSRRYKN
       -D+  s   gh    P  +R- S eE     I --- KA+ I     + D  Y   R   
8838   -DMPfaVqEdhRrKVPLYQRnSSAaEVSDVVIFLCSpKAKYITGTCiKVDGGYSLTRA  
           ps w gy s       d   e           s         v             
               240       250       260       270       280         

         290       300       310       320       330       340     
GI||SP NEAAKRSRDARRLKENQISVRAAFLEKENALLRQEVVAVRQELSHYRAVLSRYQAQHGAL
                                                                   
8838                                                               
     290       300       310       320       330       340         
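
In the hit summaries shown in this listing, the "noncontradicting positions" percentage appears to equal the sum of the "identity" and "class identity" percentages, up to rounding (for example 9.6% + 11.1% = 20.7% for entry 8838). This is an observation about this output only, not a documented definition. The sketch below checks that relation for one summary; the helper names and the 0.15 tolerance are assumptions chosen for illustration.

```python
import re

# Matches the three percentage fields as printed in the hit summaries of this listing.
PCT = re.compile(r"([\d.]+)% (identity|noncontradicting positions|class identity)")


def percentages(text):
    """Return the percentage fields found in a hit summary, keyed by label."""
    return {label: float(value) for value, label in PCT.findall(text)}


def is_consistent(text, tol=0.15):
    """Check identity + class identity against noncontradicting positions (assumed relation)."""
    p = percentages(text)
    total = p["identity"] + p["class identity"]
    return abs(total - p["noncontradicting positions"]) <= tol


summary = (
    "Smith-Waterman score: 597;     9.6% identity in 270 aa overlap\n"
    " 20.7% noncontradicting positions,  11.1% class identity"
)
print(percentages(summary), is_consistent(summary))
```

The tolerance allows for the one-decimal rounding in the printed values, as in entry 4573, where 16.3% and 2.3% are reported alongside 18.7%.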

8492 lmp1 lmp2 gene product                        (479 aa)
initn: opt:  612 z-score: 107.7 E():    2.6
Smith-Waterman score: 879;    12.4% identity in 370 aa overlap
 18.6% noncontradicting positions,   6.2% class identity

                                          10        20        30   
GI||SP                            XBINDINGPRTEINTAXREBMARPVSDRTPAXX
                                     N I   +TE+            S+ T +  
8492   ADNLAKSIKEQLNNSVSNANTLSAKLTDKDNTIQQAKTELEKEVQKAnQAIKSNNTASMQ
                                                      d            
     100       110       120       130       140       150         

            40        50        60        70        80        90   
GI||SP XXXXXXXXXXXXXXXXGLRSLLQGTSKPKEPASCLLKEKERKXXXXXXXXXXXXXXXXXX
                       -----L+  +K KE -----K  E K                  
8492   SAKSSLDAKVAEITKK-----LETFNKDKEA-----KFNELKQTRNQIQEFINTNKNNPN
     160       170            180            190       200         

           100       110       120         130       140       150 
GI||SP XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXE--RTLPFGDVEYVDLDAFXXXXXXXXX
                                       E-- +L   + + V  D---        
8492   YSELISQLTSKRDSKNSVTDSSNKSDIESANTELKQALAKANADKVQAD---NLAKSIKE
     210       220       230       240       250          260      

             160       170       180       190       200       210 
GI||SP XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXGTASGHRAGLTSRDTP
                                                    TAS + A -------
8492   QLNNSVSNANTLSAKLTDKDNTIQQAKTELEKEVQKANQAIKSNNTASMQSAK-------
        270       280       290       300       310                

             220          230           240       250       260    
GI||SP SPVDPDTVEV---LMTFEPDPA----DLALSSIPGHETFDPRRHRFSEEELKPQPIMKKA
       S  D    E+---L TF  D  ----+L  +    +E  +  ++  +  EL  Q   K+ 
8492   SSLDAKVAEITKKLETFNKDKEAKFNELKQTRNQIQEFINTNKNNPNYSELISQLTSKRD
     320       330       340       350       360       370         

          270       280                290         300             
GI||SP RKIQVPEEQKDEKYWSRRYKNNE---------AAKRSRD--ARRLKE---NQIS----VR
        K  V +        S--- N E---------A K s D--ar lKe---n is----+R
8492   SKNSVTDSSNKSDIES---ANTELKQALakAnAdKsqaDNearpiKndLnnkienanpiR
                                   nt k k vsi  llksl eq qssvsefgtl 
     380       390          400       410       420       430      

        310       320        330       340        
GI||SP AAFLEKENALLRQEVVAVRQELS-HYRAVLSRYQAQHGAL   
        a l   +  l q      +El -   A+ s   a   al   
8492   nanlsdidnkiqqaKneLaeElqKAnQAIKnNnsaSkQaaKdS
       stkftwksstlett tk ek vt  d    s pts m sl s 
        440       450       460       470         

7391 hypothetical lactococcin a in protein secretion l (458 aa)
initn: opt:  601 z-score: 106.2 E():    3.1
Smith-Waterman score: 706;    25.4% identity in 130 aa overlap
 25.4% noncontradicting positions,   0.0% class identity

           230       240       250       260       270             
GI||SP TFEPDPADLALSSIPGHETFDPRRHRFSEEELKPQPIMKKARKIQVPEEQK---DEKY--
                A         ++         EL  Q      +K+Q    Q+---+EK --
7391   IENNLKEGEAVKENSLLLKYNGTPEQTQLSELLTQKKQALDKKVQLDLLQRSLTNEKNEF
              70        80        90       100       110       120 

             280       290       300                      310      
GI||SP -------WSRRYKNNEAAKRSRDARRLKENQ---------------ISVRAAFL------
       -------  + + N EA  +S +A   K NQ---------------I   +A L------
7391   PTADSFGYEKSFENYEAQVKSLEATIQKSNQAVEDQNKSTESQKQAIQNQVATLQQAIQN
             130       140       150       160       170       180 

                320          330       340                         
GI||SP --EKENALLRQEVVAVRQE---LSHYRAVLSRYQAQHGAL                    
       --E ENA      V+--Q+---LS+Y +----YQAQ+  L                    
7391   YSEIENAVSSGGGVS--QDNPYLSQYNS----YQAQQATLEADLKNQKNPDETAKQAAKS
             190         200           210       220       230     

                                                                   
GI||SP                                                             
                                                                   
7391   QEESLKSQFLSGLASSKDSLKSQIQSFNVQESSLTGSNAYDNSQSSQILTLKSQALSASN
         240       250       260       270       280       290     

3138 defective fc chorion-1 proteins fc106 fc125 fc177 (390 aa)
initn: opt:  593 z-score: 105.8 E():    3.3
Smith-Waterman score: 593;    14.7% identity in 217 aa overlap
 20.7% noncontradicting positions,   6.0% class identity

      110       120       130       140       150       160        
GI||SP XXXXXXXXXXXXXXXXXERTLPFGDVEYVDLDAFXXXXXXXXXXXXXXXXXXXXXXXXXX
                          +      +   l+                            
3138   LQPeAAASrVVLVLADDATAKaRVaRQNPPlNPLGQLMNWPALPQDFQLPSMDLGPQVGS
          t    k            t  v     p                             
             110       120       130       140       150       160 

      170       180       190       200       210       220        
GI||SP XXXXXXXXXXXXXXXXXXXXXXXXXXXGTASGHRAGLTSRDTPSPVDPDTVEVLM-----
                                   +A+   A   + D P+   PD+ +  +-----
3138   FLAQLPaMPaiPgiLGAAAPVPAPAPAPAAaPPlAPAPAADPPAAPVPDAaQPAILGqAA
             p  tv sl                t  p                p      e  
             170       180       190       200       210       220 

                230       240         250         260       270    
GI||SP -----TFEPDPADLALSSIPGHE--TFDPRRHRFSEEELKPQ--PIMKKARKIQVPEEQK
       -----TF- +Pa+   Ss+ G+ --TF P    F-  +++ Q--P M  A --Q      
3138   LQNAFTF-lNPaNFDASgLLGQSaPTFAPPNlDF-VAQMQRQFFPGMTPA --QPAaAGT
               f  s     s     v       f                        p   
              230       240       250        260          270      

          280       290       300       310       320       330    
GI||SP DEKYWSRRYKNNEAAKRSRDARRLKENQISVRAAFLEKENALLRQEVVAVRQELSHYRAV
       D----+     +E+  R -+a   +E Q+ +++A-LE E                     
3138   D----AqASDISEVRVRP-EaPYSQEAQMKIKSA-LEMEQERQQ                
             l             d                                       
            280       290        300        310                    

          340                                                     
GI||SP LSRYQAQHGAL                                                
           QAQ                                                    
3138       QAQVKDQEQVPLLWFrMPTTQNQDATaEKTLEdLRVEAKLRAFERQVIaELRMLQ
                          h          e     h               s      
              320       330       340       350       360         

8548 yopd protein                                  (306 aa)
initn: opt:  584 z-score: 105.8 E():    3.3
Smith-Waterman score: 705;    13.5% identity in 311 aa overlap
 17.4% noncontradicting positions,   3.9% class identity

                                  10         20        30        40
GI||SP                    XBINDINGPRTEIN-TAXREBMARPVSDRTPAXXXXXXXXX
                              +  g   Ei -T      A  +     +         
8548   MTINIKTDSPIITTGSQLDAITTETVgQSGEiKKTEDTRHEAQAIKSSEASLSRSQVPEL
                                 k    v                            
               10        20        30        40        50        60

               50        60        70        80        90       100
GI||SP XXXXXXXXXGLRSLLQGTSKPKEPASCLLKEKERKXXXXXXXXXXXXXXXXXXXXXXXXX
                 L S  QG          LL E  RK                         
8548   IKPSQGINVALLSKSQGDLNGTLSILLLLLELARKAREMGLQQRDIENKATIsAQKEQVA
                                                           t       
               70        80        90       100       110       120

              110       120       130       140       150       160
GI||SP XXXXXXXXXXXXXXXXXXXXXXXXXERTLPFGDVEYVDLDAFXXXXXXXXXXXXXXXXXX
                                  -------------AF                  
8548   EMVSGAKLMIAMAVVSGIMAATSTVAS-------------AFSIAKEVKIVKQEQILNSN
              130       140                    150       160       

              170       180       190       200          210       
GI||SP XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXGTASGHRAGLTS---RDTPSPVDPD
                                            A  ++  L  ---R T S  +  
8548   IAGRdQLIDTKMQQMgNaGDKAVSREDIGRIWKPEQVADQNKLALLDKEFRMTDSKANAF
           e          s i                                          
       170       180       190       200       210       220       

       220       230       240       250       260       270       
GI||SP TVEVLMTFEPDPADLALSSIPGHETFDPRRHRFSEEELKPQPIMKKARKIQVPEEQKDEK
           ----+P-   +A S+I  H-------+ +S+ E K + +   ---I   E QK E 
8548   NAAT----QP-LGQMANSAIQVH-------QGYSQAEVKEKEVNAS---IAANEKQKAEE
       230            240              250       260          270  

       280       290       300            310       320       330  
GI||SP YWSRRYKNNEAAKRSRDARRLKENQIS-----VRAAFLEKENALLRQEVVAVRQELSHYR
         +--Y +N  ----+D+ RL E  +S-----++AAF                       
8548   AMN--YNDNFM----KDVLRLIEQYVSSHTHAMKAAFGVV                    
              280           290       300       310       320      

            340     
GI||SP AVLSRYQAQHGAL
                    
8548                
        330         

5939 69 autoantigen kd p69                         (483 aa)
initn: opt:  594 z-score: 104.7 E():    3.8
Smith-Waterman score: 594;    24.2% identity in 99 aa overlap
 33.3% noncontradicting positions,   9.1% class identity

     210       220       230       240        250        260       
GI||SP TPSPVDPDTVEVLMTFEPDPADLALSSIPGHETFDP-RRHRFSEEELKP-QPIMKKARKI
         +           T        + +    HE+F  -  + F+ --LK -Q  MKK ---
5939   RCNLLSHMLATYQTTLLHFWEKTSHTMAAIHESFKGYQPYEFTT--LKSLQDPMKKL---
            230       240       250       260         270          

       270       280       290       300       310       320       
GI||SP QVPEEQKDEKYWSRRYKNNEAAKRSRDARRLKENQISVRAAFLEKENALLRQEVVAVRQE
       ---- +K+EK  s r  n eAa--  + r L----IS-----LE EN-- r+E  + + E
5939   ----VEKEEKKKinrrEnrdAa--aQEPrQL----IS-----LEEEN--QrKESSscqkE
                   ssqq ste v  v   s                     h    tfkt 
           280       290         300                  310       320

       330       340                                               
GI||SP LSHYRAVLSRYQAQHGAL                                          
         --++vlS                                                   
5939   dG--KSilSalDKgSaddACSGPIDELLDmKpEEGACLGPmAGTPEPEgaDKDDLLLLnE
       e     vp sv  s tht           v s        v       sg        s 
                330       340       350       360       370        

5176 histone-binding nuclear autoantigenic hgv2 protei (704 aa)
initn: opt:  605 z-score: 104.3 E():      4
Smith-Waterman score: 681;     5.2% identity in 325 aa overlap
 27.7% noncontradicting positions,  22.5% class identity

                           10        20        30        40        
GI||SP             XBINDINGPRTEINTAXREBMARPVSDRTPAXXXXXXXXXXXXXXXXX
                                     Eb++r     t a                 
5176   EaaDAEeeKSVSGTDVQEEcrEq gqEKQGEVIVrI EKPkEaSEEQPgtTLeKdnTAVE
        vp   kg           hk k ve        s     t v     vv  g qg    
      140       150       160        170        180       190      

       50        60        70        80        90       100        
GI||SP XGLRSLLQGTSKPKEPASCLLKEKERKXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX
           sl-  T KP +      kEk                                    
5176   VEAEpl-DaTaKPVDVGGdEPeEqmaTSeNEaGKAVL qQLVGQdVPPaEESPeVqTEaa
           sv  p v       h  k kvv  g  p      e     e   v    m t  te
        200        210       220       230        240       250    

      110       120       130       140       150       160        
GI||SP XXXXXXXXXXXXXXXXXERTLPFGDVEYVDLDAFXXXXXXXXXXXXXXXXXXXXXXXXXX
                        e t    d e  dl a                           
5176   easadeagdeaSrdpeqdapglgndgasndaeaagdQaeidPqplaErliETKdgdeleE
       kvtdvlkissv eksgmeksvkpeppevkglstlve kpse etsi kst   eksgsk 
             v       vp   t vsk      p vv   tsvk    k  v    g  l   
          260       270       280       290       300       310    

      170       180       190       200       210       220        
GI||SP XXXXXXXXXXXXXXXXXXXXXXXXXXXGTASGHRAGLTSRDTPSPVDPDTVEVLMTFEPD
                                   t s   a  ts---p pv   tv   -t E  
5176   KtrAeeaanQ EaKLpidekeaaedgmaeeaaqgaeeek---qadkeneaandd-ddEre
        vd kltps   t  spesspgegeksdsksdekeksks   iektvqitnkee kq km
             v         v  t k s  vet vs skt t    pppv ks v pv tp  s
                            v            v            s            
          320        330       340       350          360          

      230       240       250       260       270        280       
GI||SP PADLALSSIPGHETFDPRRHRFSEEELKPQPIMKKARKIQVPEEQKD-EKYWSRRYKNNE
             ss  g e  d      seE      +  K+     pee  +-e  W        
5176   edeeeeedneenedddeenagaeeEnPNDSVLENKSeqEndeddigNlqLAWdMLdLaKi
       sqkkvgsestgepeesgtsdkssk k          lp eepeevs me   e  e c t
        emp   st  s ggtk  est   m           s t                   v
                     t          t                                  
     370       380       390       400       410       420         

       290       300       310       320       330       340       
GI||SP AAKRSRDARRLKENQISVRAAFLEKENALLRQEVVAVRQELSHYRAVLSRYQAQHGAL  
         Kr   +---Kenq  v  a l    --l  Ev    Q +  + a Ls- Q qh     
5176   ifKrq qs---KeaqqkaAQaHqKLGE--lciEsqNhiQAiedFqaCLn-iQedhLeahD
       ly kh et    tnklmv  c l      vgl ve yp  vge le  s l kql pet 
             k         y             sv     s       s  v    ey     
                                            v                      
     430           440       450         460       470        480  

                                                                   
GI||SP                                                             
                                                                   
5176   RlLAEThYnLGLAYqfnkrhdnAiaqfqqaidViEaRmamLneqieaaeGnle     de
        k    y q     gyesqyee lehysksle l n vdv tkllkenv ekt     is
               h     s s k    vs  ts  g   k        m ss  ssv       
                                                   v               
            490       500       510       520       530            

4573 element insertion is421 hypothetical 47 is186 41  (354 aa)
initn: opt:  580 z-score: 104.3 E():      4
Smith-Waterman score: 875;    16.3% identity in 343 aa overlap
 18.7% noncontradicting positions,   2.3% class identity

                       10        20          30        40        50
GI||SP         XBINDINGPRTEINTAXREBMA--RPVSDRTPAXXXXXXXXXXXXXXXXXXG
                 I    G   E++T+ R   A--R    R  A                   
4573   MNYSHDNWSAILAHIGKPEELDTSARNAGALTRRREIRDAATLLRLGLAYGPGGM SLRE
               10        20        30        40        50          

               60        70          80        90       100        
GI||SP LRSLLQGTSKPKEPASCLLKE--KERKXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX
         +  Q           LLK --                                     
4573   VTAWAQLHDVATLSDVALLKRLRNAADWFGILAAQTLAVRAAVTGCTSGKRLRLVDGTAI
      60        70        80        90       100       110         

      110       120       130             140       150       160  
GI||SP XXXXXXXXXXXXXXXXXERTLPFGDVEYVD------LDAFXXXXXXXXXXXXXXXXXXXX
                          T  F D E  D------LD F                    
4573   SaPGGGsAEWRLHMGYDPHTCQFTDFELTDSRDAERLDRFAQTADEIRIADRGFGSRPEC
        g    t                                                     
     120       130       140       150       160       170         

            170       180       190       200       210       220  
GI||SP XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXGTASGHRAGLTSRDTPSPVDPDTVEVL
                                        G   G   G     T---V        
4573   IRSLAFGEADYIVRVHWRGLRWLTAEGMRFDMMGFLRGLDCGKNGETT---VMIGNSGNK
     180       190       200       210       220          230      

            230       240       250          260            270    
GI||SP MTFEPDPADLALSSIPGHETFDPRRHRFSEEELK---PQPIMKKAR-----KIQVPE---
        +  P PA L   S+P   +   +    SE   K--- Q     A -----    PE---
4573   KAGAPFPARLIAVSLPPEKALISKTRLLSENRRKGRVVQAETLEAAGHVLLLTSLPEDEY
        240       250       260       270       280       290      

               280       290            300              310       
GI||SP --EQKDEKYWSRRYKNNEAAKRSR-----DARRLKENQIS-------VRAAFLEKENALL
       --EQ  + Y  R- +   A KR +-----DA R KE +++-------  AAFL  +    
4573   SAEQVADCYRLR-WQIELAFKRLKSLLHLDALRAKEPELAKAWIFANLLAAFLIDDII  
        300        310       320       330       340       350     

       320       330       340     
GI||SP RQEVVAVRQELSHYRAVLSRYQAQHGAL
                                   
4573                               
         360       370       380   

9186 pes4 pab-like protein                         (381 aa)
initn: opt:  580 z-score: 103.8 E():    4.2
Smith-Waterman score: 587;     7.4% identity in 285 aa overlap
 21.1% noncontradicting positions,  13.7% class identity

               10        20        30        40        50        60
GI||SP XBINDINGPRTEINTAXREBMARPVSDRTPAXXXXXXXXXXXXXXXXXXGLRSLLQGTSK
                   +     E m r +  r p+                  G  + -----k
9186          LFIGnLheTVTEEmLrgIFKrYqSFeSAKVCrDflTKKSLGhGYLNF-----e
                  d ks     t kk   k p  v     i sv      y          k
                      10        20        30        40             

               70                80        90       100       110  
GI||SP PKEPASCLLKE--------KERKXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX
        Ke A    kE--------kE k                                     
9186   DKndAEkAreElNYTkfnGqEirIMPSlrNTlFRKNiGTNVFFSNLPLnNPqLTTRsFYd
         ee  s mk f   vvf k vk    mk  t    f           e  l    v  l
       50        60        70        80        90       100        

            120       130       140         150       160       170
GI||SP XXXXXXXXXXXXXERTLPFGDVEYVDLD--AFXXXXXXXXXXXXXXXXXXXXXXXXXXXX
                    er    G V-Y d d--A                             
9186   imirYGniLSClLdrRKnIGFV-YFdndisARNVIKkYNNqeFFGnKIiCGiHFDKEVRs
       tfse  kv   k es  d       edekt      m   ts   k  l  l       t
      110       120       130        140       150       160       

              180       190       200       210       220       230
GI||SP XXXXXXXXXXXXXXXXXXXXXXXXXGTASGHRAGLTSRDTPSPVDPDTVEVLMTFEPDPA
                                  + G  a   +  + S ---+t+ v---------
9186   rPnFekrKkridadiiIEdEqlannnhSKGnnarSKNIYSSSQ---NsIli---------
       v e ttq smlgsetv  k lslsekl   ddke             t fv         
       170       180       190       200       210                 

              240       250       260       270       280          
GI||SP DLALSSIPGHETFDPRRHRFSEEELKPQPIMKKARKIQVPEEQKDEKYWS-RRYKNNEAA
       ---- ++P   T d     FSe  ----PI----+ i + e qk    w+-  YKN e +
9186   ----KNLPsdTTrddiLnfFSeiG----PI----KSifiSnaqankphkAFVTYKNeedS
               ti  qeev dy  tv             vyl ektkvtylw       sse 
             220       230               240       250       260   

     290       300       310       320       330       340         
GI||SP KRSRDARRLKENQISVRAAFLEKENALLRQEVVAVRQELSHYRAVLSRYQAQHGAL    
       k++               +   k+     q + + +               +        
9186   eKAqKrlNnfiFrnhkilVgraqDKeeraqFIesnKisklfLeNLSanCNKEFikqLChQ
       k  i cy kty kgktlw tpgk  pvhnk  gtq kttvy k   fv     lsy  l 
           270       280       290       300       310       320   

8288 macrogolgin rat gcp360                        (3267 aa)
initn: opt:  657 z-score: 103.7 E():    4.3
Smith-Waterman score: 1078;    13.3% identity in 347 aa overlap
 20.5% noncontradicting positions,   7.2% class identity

                                   10        20        30        40
GI||SP                     XBINDINGPRTEINTAXREBMARPVSDRTPAXXXXXXXXX
                                     Te  TA  E-------+ t a         
8288   dLrrSlnALQEEnQdLSKEIeSlKVSISQLTrQlTALqE-------EGaLalYHAQLrVr
       s qm fd     k g     k f        e v   h          t gv     k k
             2600      2610      2620             2630      2640   

               50        60        70        80        90       100
GI||SP XXXXXXXXXGLRSLLQGTSKPKEPASCLLKEKERKXXXXXXXXXXXXXXXXXXXXXXXXX
                 l S  + t    E   C  KE  +K                         
8288   EEEVqrLsAalSSSQKRiadLqEELVCVQKEAaKKVgEIEDKLKrELKHLHHnAGIMRNE
           hk t lf      tve e          s   s       k       d       
          2650      2660      2670      2680      2690      2700   

              110       120       130       140       150       160
GI||SP XXXXXXXXXXXXXXXXXXXXXXXXXERTLPFGDVEYVDLDAFXXXXXXXXXXXXXXXXXX
                        ------- E  L     E  dL A                   
8288   TETAEERVAELARDLVE-------MEQKLLmVTKENKdLTAQIQaFGrSMSSLQnSRDHA
                                     t      g      s  k      d     
          2710      2720             2730      2740      2750      

              170       180       190       200       210       220
GI||SP XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXGTASGHRAGLTSRDTPSPVDPDTVE
                                            a+-     Ts ++ S  +    +
8288   nEELddLKrKYDASLKELAQLKerqdLnRErDAlLSqaA-FplNsTeENilSrLEKLNQQ
       t   se  k             gqgl g  s  v  et   sm t s  ss h       
       2760      2770      2780      2790       2800      2810     

              230       240       250       260              270   
GI||SP VLMTFEPDPADLALSSIPGHETFDPRRHRFSEEELKPQPI-------MKKARKIQVPEEQ
        l  ---D   L LSS--  E      + Fs      Q  -------+ K RK--- EE 
8288   LiSK---DEQLLHLSS--qLEdShNQVQSFsKAMaSLQNERDHLWNELEKFRK---SEEG
        l                e  s y      t   t                         
           2820        2830      2840      2850      2860          

           280       290       300         310       320       330 
GI||SP KDEKYWSRRYKNNEAAKRSRDARRLKENQISVRAAF--LEKENALLRQEVVAVRQELSHY
       K------ R     aa    ++  LK    S     --L KE   L Q+   + QE++  
8288   K------QRSAAqSaasSPAEVQSLKKAMSSLQNDRDRLLKELKNLQQQYLQiNQEITEL
                   p pst                                   m       
            2870      2880      2890      2900      2910      2920 

                340                                                
GI||SP R---AVLSRYQAQHGAL                                           
       r---A L  yQ q  Al                                           
8288   rPLKAQLQEsQDqTKAlQiMqEELRQENLSWQHELdQLRmEKnSWEiHERRMKEQYLMAI
       h        y  k   f m k              h   v  s   l             
            2930      2940      2950      2960      2970      2980 

8676 alpha-helical coiled coil protein tlpa        (345 aa)
initn: opt:  575 z-score: 103.6 E():    4.4
Smith-Waterman score: 915;    20.2% identity in 218 aa overlap
 24.8% noncontradicting positions,   4.6% class identity

          110       120       130       140       150       160    
GI||SP XXXXXXXXXXXXXXXXXXXXXERTLPFGDVEYVDLDAFXXXXXXXXXXXXXXXXXXXXXX
                               L     EY--- A                       
8676   LQAEGRNITGFALRNQVGGGNPTRLRQIcDEY---QASQSTVVTEPVAELPVEVAEEVKA
                                   w                               
        20        30        40           50        60        70    

          170       180       190       200          210       220 
GI||SP XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXGTASGHRAGLTSR---DTPSPVDPDTVEV
                                        A+G     + R---D+   VD-D  E 
8676   VSAALSERITQLATELNDKAVRAAERRVAEVTRAAGEQTAQAERELADAAQTVD-DLEEK
           80        90       100       110       120        130   

               230       240       250       260       270         
GI||SP LMTFEP--DPADLALSSIPGHETFDPRRHRFSEEELKPQPIMKKARKIQVPEEQKDEKYW
       L   + --D   LAL S----E      H     +LK           Q  E  +++K  
8676   LDELQDRYDSLTLALES----ERSLRQQHDVEMAQLKERLAAAEENTRQREERYQEQKTV
           140       150           160       170       180         

     280         290       300       310       320        330      
GI||SP SRRYKNNEAA--KRSRDARRLKENQISVRAAFLEKENALLRQEVVAVRQEL-SHYRAVLS
            N E A--K +R+    +  QIS  A     E    R  V      L-S+  A  S
8676   LQDALNAEQAQHKNTREDLQKRLEQISAEANARTEELKSERDKVNTLLTRLESQENALAS
     190       200       210       220       230       240         

        340                                                        
GI||SP RYQAQHGAL                                                   
         Q-QH A                                                    
8676   ERQ-QHLATRETLQQRLEQAIADTQARAGEIALERDRVSSLTARLESQEKASSEQLVRMG
     250        260       270       280       290       300        

8248 hypothetical orf3 protein 3                   (417 aa)
initn: opt:  579 z-score: 103.1 E():    4.6
Smith-Waterman score: 824;    16.4% identity in 323 aa overlap
 22.6% noncontradicting positions,   6.2% class identity

               10        20        30        40        50        60
GI||SP XBINDINGPRTEINTAXREBMARPVSDRTPAXXXXXXXXXXXXXXXXXXGLRSLLQGTSK
                             RP SD T                    G R    G + 
8248                    mLEAPRPLSDPTCLARRSDVSIVAERRVVAQVGVRPVVAGLAE
                        v                                          
                                10        20        30        40   

                             70        80        90       100      
GI||SP PKE--------------PASCLLKEKERKXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX
         E--------------P     +  ER+                               
8248   QVEEWEREDDAQGPTDGPEGGVRRWSERQRELGARRAVAVPGQRRVTPSYTAPDRVDLRT
            50        60        70        80        90       100   

        110       120       130         140       150       160    
GI||SP XXXXXXXXXXXXXXXXXXXERTLPFGD--VEYVDLDAFXXXXXXXXXXXXXXXXXXXXXX
                          E ----GD--V YV --A