Blast performed on February-4-2012
BLASTP 2.2.24+


Reference:
Stephen F. Altschul, Thomas L. Madden, Alejandro A. Schäffer,
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997),
"Gapped BLAST and PSI-BLAST: a new generation of protein database
search programs", Nucleic Acids Res. 25:3389-3402.



Reference for composition-based statistics:
Alejandro A. Schäffer, L. Aravind, Thomas L. Madden, Sergei
Shavirin, John L. Spouge, Yuri I. Wolf, Eugene V. Koonin, and
Stephen F. Altschul (2001), "Improving the accuracy of PSI-BLAST
protein database searches with composition-based statistics and
other refinements", Nucleic Acids Res. 29:2994-3005.



Database: nr_env
           20,512,688 sequences; 6,167,035,527 total letters



Query=  EG13730 gfcC 
Length=248
                                                                      Score     E
Sequences producing significant alignments:                          (Bits)  Value

ref|NP_415505.1|  conserved protein [Escherichia coli str. K-12 s...   503    1e-140
ref|YP_309961.1|  hypothetical protein SSON_0992 [Shigella sonnei...   501    3e-140
ref|ZP_03070105.1|  group 4 capsule (G4C) polysaccharide, YmcB [E...   501    4e-140
gb|EFZ60522.1|  hypothetical protein ECLT68_0462 [Escherichia col...   501    5e-140
ref|NP_706908.1|  hypothetical protein SF0987 [Shigella flexneri ...   501    5e-140
ref|YP_002402185.1|  hypothetical protein EC55989_1093 [Escherich...   500    6e-140
ref|YP_408638.1|  hypothetical protein SBO_2245 [Shigella boydii ...   500    8e-140
ref|YP_001880817.1|  group 4 capsule (G4C) polysaccharide, YmcB [...   500    8e-140
gb|EFZ70668.1|  hypothetical protein ECOK1357_1256 [Escherichia c...   500    9e-140
ref|ZP_03064769.1|  group 4 capsule (G4C) polysaccharide, YmcB [S...   500    1e-139
gb|EGB58446.1|  SLBB-domain-containing protein [Escherichia coli ...   499    1e-139
ref|ZP_06656911.1|  YmcB [Escherichia coli B185] >gb|EFF07293.1| ...   499    1e-139
ref|ZP_08353029.1|  conserved hypothetical protein [Escherichia c...   499    1e-139
ref|ZP_02999848.1|  group 4 capsule (G4C) polysaccharide, YmcB [E...   499    2e-139
ref|ZP_06652940.1|  YmcB protein [Escherichia coli B354] >gb|EFF1...   499    2e-139
ref|ZP_03047374.1|  group 4 capsule (G4C) polysaccharide, YmcB [E...   499    2e-139
ref|YP_003036816.1|  hypothetical protein ECBD_2609 [Escherichia ...   498    3e-139
ref|ZP_08373309.1|  conserved hypothetical protein [Escherichia c...   498    5e-139
ref|YP_402624.1|  hypothetical protein SDY_0961 [Shigella dysente...   497    6e-139
ref|ZP_08383110.1|  conserved hypothetical protein [Escherichia c...   497    7e-139
ref|NP_286922.1|  hypothetical protein Z1402 [Escherichia coli O1...   496    1e-138
ref|YP_002328533.1|  hypothetical protein E2348C_0970 [Escherichi...   495    3e-138
gb|AEE55762.1|  conserved hypothetical protein [Escherichia coli ...   495    3e-138
ref|YP_003498802.1|  Group 4 capsule (G4C) polysaccharide, YmcB [...   494    3e-138
gb|EGB62657.1|  SLBB-domain-containing protein [Escherichia coli ...   494    6e-138
gb|EGB71723.1|  SLBB-domain-containing protein [Escherichia coli ...   493    9e-138
ref|ZP_07137361.1|  conserved hypothetical protein [Escherichia c...   493    1e-137
gb|EFZ75912.1|  hypothetical protein ECRN5871_1089 [Escherichia c...   491    5e-137
gb|EGJ00865.1|  hypothetical protein SD15574_1331 [Shigella dysen...   469    2e-130
ref|ZP_07783163.1|  uncharacterized protein gfcC [Escherichia col...   466    2e-129
ref|ZP_07681343.1|  conserved hypothetical protein [Shigella dyse...   464    7e-129
ref|ZP_02904453.1|  group 4 capsule (G4C) polysaccharide, YmcB [E...   449    1e-124
gb|EGI97539.1|  hypothetical protein SB521682_1222 [Shigella boyd...   448    4e-124
gb|EFW50291.1|  YjbG polysaccharide synthesis-related protein [Sh...   447    5e-124
gb|EFW53537.1|  YjbG polysaccharide synthesis-related protein [Sh...   447    5e-124
pdb|3P42|A  Chain A, Structure Of Gfcc (Ymcb), Protein Encoded By...   444    5e-123
gb|EFW71149.1|  YjbG polysaccharide synthesis-related protein [Es...   440    8e-122
gb|EGK28119.1|  hypothetical protein SFK272_1492 [Shigella flexne...   319    2e-85 
emb|CBX82236.1|  Uncharacterized protein gfcC Group 4 capsule pro...   219    3e-55 
ref|YP_001909022.1|  Conserved hypothetical protein YmcB [Erwinia...   218    7e-55 
ref|YP_002650310.1|  capsular polysaccharide protein [Erwinia pyr...   216    4e-54 
ref|YP_003537358.1|  exported protein [Erwinia amylovora ATCC 499...   215    4e-54 
gb|ADP10499.1|  Putative capsular polysaccharide protein [Erwinia...   211    8e-53 
ref|YP_003532687.1|  hypothetical protein EAMY_3334 [Erwinia amyl...   209    4e-52 
ref|YP_003739692.1|  conserved uncharacterized protein YmcB [Erwi...   205    6e-51 
ref|YP_001174974.1|  hypothetical protein Ent638_0233 [Enterobact...   203    2e-50 
ref|YP_003518538.1|  YmcB [Pantoea ananatis LMG 20103] >gb|ADD754...   200    1e-49 
dbj|BAK13483.1|  hypothetical protein YmcB precursor YmcB [Pantoe...   200    2e-49 
ref|YP_001436216.1|  hypothetical protein ESA_00075 [Cronobacter ...   200    2e-49 
gb|EGL72010.1|  hypothetical protein CSE899_14477 [Cronobacter sa...   197    1e-48 
ref|YP_003610796.1|  hypothetical protein ECL_00280 [Enterobacter...   197    1e-48 
ref|NP_709891.1|  hypothetical protein SF4177 [Shigella flexneri ...   196    3e-48 
ref|YP_312940.1|  hypothetical protein SSON_4206 [Shigella sonnei...   196    4e-48 
ref|YP_003232032.1|  hypothetical protein ECO26_5143 [Escherichia...   195    4e-48 
ref|YP_003212148.1|  hypothetical protein CTU_37850 [Cronobacter ...   195    6e-48 
gb|EGC97468.1|  hypothetical protein ECD227_3706 [Escherichia fer...   195    6e-48 
ref|ZP_05969412.1|  conserved hypothetical protein [Enterobacter ...   192    3e-47 
ref|YP_001455401.1|  hypothetical protein CKO_03889 [Citrobacter ...   192    3e-47 
ref|YP_003367162.1|  hypothetical protein ROD_37221 [Citrobacter ...   192    3e-47 
gb|EGB70445.1|  SLBB-domain-containing protein [Escherichia coli ...   192    6e-47 
ref|ZP_08500193.1|  group 4 capsule (G4C) polysaccharide, YmcB [E...   191    1e-46 
ref|YP_001465525.1|  hypothetical protein EcE24377A_4576 [Escheri...   190    1e-46 
ref|ZP_07140265.1|  hypothetical protein HMPREF9548_02441 [Escher...   190    2e-46 
emb|CBK84486.1|  SLBB-domain like (DUF1017) [Enterobacter cloacae...   189    3e-46 
ref|ZP_08366591.1|  conserved hypothetical protein [Escherichia c...   188    6e-46 
ref|ZP_08497921.1|  hypothetical protein HMPREF9086_2183 [Enterob...   188    8e-46 
ref|ZP_03048870.1|  conserved hypothetical protein [Escherichia c...   187    1e-45 
ref|YP_001726928.1|  hypothetical protein EcolC_4002 [Escherichia...   187    1e-45 
ref|NP_418452.1|  conserved protein [Escherichia coli str. K-12 s...   187    1e-45 
ref|YP_001882714.1|  hypothetical protein SbBS512_E4533 [Shigella...   187    2e-45 
ref|YP_002295592.1|  hypothetical protein ECSE_4317 [Escherichia ...   187    2e-45 
gb|EFW53980.1|  YjbG polysaccharide synthesis-related protein [Sh...   186    2e-45 
ref|YP_003933062.1|  hypothetical protein Pvag_3494 [Pantoea vaga...   186    2e-45 
emb|CBG37221.1|  conserved hypothetical protein [Escherichia coli...   186    3e-45 
emb|CAP78488.1|  Uncharacterized protein yjbG [Escherichia coli L...   186    3e-45 
ref|ZP_06939706.1|  hypothetical protein EcolOP_27029 [Escherichi...   186    3e-45 
ref|ZP_07152932.1|  conserved hypothetical protein [Escherichia c...   186    3e-45 
ref|YP_543535.1|  hypothetical protein UTI89_C4596 [Escherichia c...   186    4e-45 
ref|ZP_06660155.1|  YjbG polysaccharide synthesis protein [Escher...   186    4e-45 
ref|ZP_08350965.1|  conserved hypothetical protein [Escherichia c...   186    4e-45 
ref|YP_001746417.1|  hypothetical protein EcSMS35_4489 [Escherich...   185    4e-45 
ref|ZP_08356834.1|  conserved hypothetical protein [Escherichia c...   185    6e-45 
ref|YP_002415168.1|  hypothetical protein ECUMN_4561 [Escherichia...   185    6e-45 
ref|YP_004214318.1|  hypothetical protein Rahaq_3601 [Rahnella sp...   184    8e-45 
gb|EFX17993.1|  hypothetical protein ECO2687_20949 [Escherichia c...   184    9e-45 
gb|EGC12490.1|  SLBB-domain-containing protein [Escherichia coli ...   184    9e-45 
ref|ZP_08386356.1|  conserved hypothetical protein [Escherichia c...   184    9e-45 
ref|YP_002389495.1|  hypothetical protein ECIAI1_4253 [Escherichi...   184    1e-44 
ref|ZP_03029269.1|  conserved hypothetical protein [Escherichia c...   184    1e-44 
gb|EFZ75386.1|  hypothetical protein ECRN5871_1899 [Escherichia c...   184    1e-44 
gb|EGC06312.1|  SLBB-domain-containing protein [Escherichia fergu...   184    1e-44 
ref|ZP_02805425.1|  conserved hypothetical protein [Escherichia c...   184    1e-44 
ref|NP_756848.1|  hypothetical protein c4996 [Escherichia coli CF...   184    1e-44 
ref|ZP_05433479.1|  hypothetical protein ShiD9_11915 [Shigella sp...   184    1e-44 
ref|NP_290662.1|  hypothetical protein Z5626 [Escherichia coli O1...   184    1e-44 
ref|YP_405613.1|  hypothetical protein SDY_4220 [Shigella dysente...   183    2e-44 
ref|YP_002385133.1|  hypothetical protein EFER_4120 [Escherichia ...   183    2e-44 
ref|YP_219090.1|  hypothetical protein SC4103 [Salmonella enteric...   183    2e-44 
ref|YP_003943655.1|  hypothetical protein Entcl_4138 [Enterobacte...   182    3e-44 
gb|EFZ53016.1|  hypothetical protein SS53G_2453 [Shigella sonnei ...   182    4e-44 
gb|EFS13220.1|  uncharacterized protein gfcC [Shigella flexneri 2...   182    4e-44 
ref|ZP_02685432.1|  conserved hypothetical protein [Salmonella en...   182    4e-44 
ref|ZP_03059326.1|  conserved hypothetical protein [Escherichia c...   182    5e-44 
ref|ZP_02809628.1|  conserved hypothetical protein [Escherichia c...   182    6e-44 
ref|YP_002410323.1|  hypothetical protein ECIAI39_4450 [Escherich...   181    1e-43 
gb|EGB61141.1|  SLBB-domain-containing protein [Escherichia coli ...   181    1e-43 
ref|ZP_07185547.1|  conserved hypothetical protein [Escherichia c...   181    1e-43 
ref|ZP_03217856.1|  conserved hypothetical protein [Salmonella en...   180    2e-43 
ref|ZP_03044141.1|  conserved hypothetical protein [Escherichia c...   180    2e-43 
ref|ZP_06651602.1|  predicted protein [Escherichia coli B354] >gb...   180    2e-43 
ref|ZP_06356425.1|  conserved hypothetical protein [Citrobacter y...   180    2e-43 
ref|YP_001572429.1|  hypothetical protein SARI_03458 [Salmonella ...   180    2e-43 
ref|NP_458522.1|  hypothetical protein STY4420 [Salmonella enteri...   179    4e-43 
ref|ZP_07380293.1|  protein of unknown function DUF1017 [Pantoea ...   179    5e-43 
ref|ZP_03357288.1|  hypothetical protein SentesTyphi_01726 [Salmo...   177    1e-42 
ref|YP_004114116.1|  hypothetical protein Pat9b_0234 [Pantoea sp....   177    1e-42 
ref|ZP_08376264.1|  conserved hypothetical protein [Escherichia c...   177    2e-42 
ref|ZP_04558593.1|  conserved hypothetical protein [Citrobacter s...   176    4e-42 
ref|ZP_02346309.1|  conserved hypothetical protein [Salmonella en...   175    5e-42 
ref|ZP_07183334.1|  conserved hypothetical protein [Escherichia c...   173    2e-41 
ref|YP_002218119.1|  hypothetical protein SeD_A4618 [Salmonella e...   172    5e-41 
ref|ZP_03213760.1|  conserved hypothetical protein [Salmonella en...   172    6e-41 
ref|YP_001591314.1|  hypothetical protein SPAB_05204 [Salmonella ...   172    6e-41 
ref|ZP_02664389.1|  conserved hypothetical protein [Salmonella en...   171    1e-40 
gb|EFY12299.1|  hypothetical protein SEEM315_09434 [Salmonella en...   171    1e-40 
ref|ZP_03069392.1|  conserved hypothetical protein [Escherichia c...   171    1e-40 
ref|YP_153099.1|  hypothetical protein SPA4041 [Salmonella enteri...   170    1e-40 
ref|ZP_02799370.2|  conserved hypothetical protein [Escherichia c...   170    2e-40 
gb|EFW51577.1|  YjbG polysaccharide synthesis-related protein [Sh...   170    2e-40 
ref|YP_002043474.1|  hypothetical protein SNSL254_A4566 [Salmonel...   170    2e-40 
ref|ZP_03064544.1|  conserved hypothetical protein [Shigella dyse...   170    2e-40 
ref|YP_002149138.1|  hypothetical protein SeAg_B4481 [Salmonella ...   170    2e-40 
ref|ZP_02785878.2|  conserved hypothetical protein [Escherichia c...   170    2e-40 
ref|ZP_02833337.1|  conserved hypothetical protein [Salmonella en...   169    3e-40 
ref|NP_463089.1|  periplasmic protein [Salmonella enterica subsp....   169    4e-40 
ref|ZP_06537804.1|  hypothetical protein Salmonellaentericaenteri...   168    6e-40 
ref|ZP_03373074.1|  hypothetical protein SentesTyp_23395 [Salmone...   168    6e-40 
ref|YP_002228797.1|  hypothetical protein SG4066 [Salmonella ente...   168    8e-40 
ref|ZP_04656864.1|  hypothetical protein SentesTe_18130 [Salmonel...   168    8e-40 
gb|EGI88734.1|  hypothetical protein SB521682_4838 [Shigella boyd...   168    9e-40 
ref|ZP_03364691.1|  hypothetical protein SentesTyph_17318 [Salmon...   149    3e-34 
gb|EFW61835.1|  YjbG polysaccharide synthesis-related protein [Sh...   149    5e-34 
ref|YP_410318.1|  hypothetical protein SBO_4056 [Shigella boydii ...   141    9e-32 
ref|ZP_03338398.1|  hypothetical protein Salmonelentericaenterica...   125    5e-27 
gb|EDA50530.1|  hypothetical protein GOS_1989170 [marine metagenome]   125    8e-27 
gb|EBB46452.1|  hypothetical protein GOS_229183 [marine metagenome]    124    2e-26 
ref|ZP_03830735.1|  hypothetical protein PcarcW_05039 [Pectobacte...   121    1e-25 
ref|YP_003260364.1|  hypothetical protein Pecwa_3011 [Pectobacter...   119    3e-25 
ref|YP_003016906.1|  hypothetical protein PC1_1323 [Pectobacteriu...   119    5e-25 
gb|EFU99994.1|  conserved hypothetical protein [Escherichia coli ...   118    1e-24 
ref|ZP_03826818.1|  hypothetical protein PcarbP_09375 [Pectobacte...   117    1e-24 
ref|YP_049553.1|  hypothetical protein ECA1447 [Pectobacterium at...   117    1e-24 
ref|YP_003005145.1|  hypothetical protein Dd1591_2844 [Dickeya ze...   117    2e-24 
ref|YP_003882187.1|  hypothetical protein Dda3937_03274 [Dickeya ...   115    6e-24 
ref|YP_691475.1|  hypothetical protein SFV_4186 [Shigella flexner...   114    2e-23 
ref|ZP_06715180.1|  conserved hypothetical protein [Edwardsiella ...   114    2e-23 
ref|YP_962878.1|  hypothetical protein Sputw3181_1486 [Shewanella...   112    5e-23 
gb|ADV54996.1|  protein of unknown function DUF1017 [Shewanella p...   111    1e-22 
ref|YP_001184042.1|  hypothetical protein Sputcn32_2522 [Shewanel...   110    3e-22 
ref|YP_737462.1|  hypothetical protein Shewmr7_1406 [Shewanella s...   108    6e-22 
ref|YP_733476.1|  hypothetical protein Shewmr4_1341 [Shewanella s...   106    3e-21 
ref|YP_001051215.1|  hypothetical protein Sbal_2862 [Shewanella b...   105    9e-21 
ref|YP_003332847.1|  hypothetical protein Dd586_1258 [Dickeya dad...   105    9e-21 
gb|EDA46671.1|  hypothetical protein GOS_1996150 [marine metagenome]   104    1e-20 
ref|YP_869034.1|  hypothetical protein Shewana3_1394 [Shewanella ...   104    1e-20 
gb|EDA79313.1|  hypothetical protein GOS_1936285 [marine metagenome]   103    2e-20 
ref|YP_001555433.1|  hypothetical protein Sbal195_3008 [Shewanell...   103    3e-20 
ref|ZP_07391524.1|  protein of unknown function DUF1017 [Shewanel...   103    3e-20 
ref|ZP_08567183.1|  YjbG polysaccharide synthesis protein [Shewan...   103    3e-20 
gb|EDA62109.1|  hypothetical protein GOS_1968443 [marine metagenome]   102    5e-20 
ref|YP_857379.1|  putative periplasmic protein [Aeromonas hydroph...   101    1e-19 
ref|ZP_08570204.1|  SLBB-domain like (DUF1017) [Rheinheimera sp. ...  98.2    1e-18 
ref|NP_718714.1|  polysaccharide synthesis-related protein [Shewa...  97.8    1e-18 
ref|YP_001367076.1|  hypothetical protein Shew185_2879 [Shewanell...  94.0    2e-17 
gb|EGK28123.1|  hypothetical protein SFK272_1496 [Shigella flexne...  93.6    3e-17 
ref|YP_002357425.1|  hypothetical protein Sbal223_1497 [Shewanell...  93.6    3e-17 
ref|YP_455825.1|  hypothetical protein SG2145 [Sodalis glossinidi...  91.7    1e-16 
ref|YP_004433358.1|  hypothetical protein Glaag_1129 [Glaciecola ...  90.5    2e-16 
ref|YP_203541.1|  hypothetical protein VF_0158 [Vibrio fischeri E...  88.6    7e-16 
ref|ZP_01218708.1|  hypothetical polysaccharide synthesis-related...  88.6    8e-16 
ref|YP_130871.1|  polysaccharide synthesis-like protein [Photobac...  87.0    2e-15 
ref|YP_002154921.1|  hypothetical protein VFMJ11_0150 [Vibrio fis...  86.7    3e-15 
ref|YP_662763.1|  hypothetical protein Patl_3203 [Pseudoalteromon...  81.3    1e-13 
ref|ZP_06054113.1|  hypothetical polysaccharide synthesis-related...  80.9    2e-13 
gb|EBT31304.1|  hypothetical protein GOS_7294385 [marine metagenome]  80.1    3e-13 
ref|YP_002261809.1|  exported protein [Aliivibrio salmonicida LFI...  80.1    3e-13 
gb|ECC92702.1|  hypothetical protein GOS_5462754 [marine metagenome]  76.3    5e-12 
gb|EBT37533.1|  hypothetical protein GOS_7284195 [marine metagenome]  73.9    2e-11 
ref|YP_004565064.1|  hypothetical protein VAA_02509 [Vibrio angui...  73.9    2e-11 
ref|ZP_05879679.1|  hypothetical protein VFA_003816 [Vibrio furni...  72.4    6e-11 
ref|YP_154958.1|  hypothetical protein IL0568 [Idiomarina loihien...  72.4    6e-11 
gb|ADT85366.1|  hypothetical periplasmic protein [Vibrio furnissi...  72.4    6e-11 
ref|ZP_05883360.1|  polysaccharide synthesis-related protein [Vib...  70.1    3e-10 
ref|YP_002415894.1|  hypothetical protein VS_0210 [Vibrio splendi...  68.9    8e-10 
ref|ZP_01982747.1|  conserved hypothetical protein [Vibrio choler...  68.6    1e-09 
ref|ZP_06053682.1|  hypothetical polysaccharide synthesis-related...  68.6    1e-09 
ref|ZP_01950873.1|  WbfC protein [Vibrio cholerae 1587] >gb|EAY32...  68.2    1e-09 
ref|ZP_03348517.1|  hypothetical protein Salmoneentericaenterica_...  67.0    2e-09 
ref|ZP_06154782.1|  protein of unknown function DUF1017 [Photobac...  67.0    3e-09 
ref|ZP_05118573.1|  conserved hypothetical protein [Vibrio paraha...  66.6    4e-09 
ref|ZP_05888474.1|  hypothetical protein VIC_004993 [Vibrio coral...  66.6    4e-09 
ref|ZP_00989941.1|  hypothetical protein V12B01_23145 [Vibrio spl...  65.5    8e-09 
ref|ZP_04961240.1|  Periplasmic protein involved in polysaccharid...  65.1    1e-08 
ref|ZP_04919294.1|  WbfC protein [Vibrio cholerae V51] >gb|EAZ501...  64.7    1e-08 
dbj|BAA33618.1|  unknown [Vibrio cholerae]                            64.7    1e-08 
ref|NP_933122.1|  hypothetical protein VV0329 [Vibrio vulnificus ...  64.3    2e-08 
gb|ABI85351.1|  hypothetical protein [Vibrio cholerae]                64.3    2e-08 
ref|YP_004190015.1|  YjbG polysaccharide synthesis-related protei...  64.3    2e-08 
ref|ZP_06943634.1|  periplasmic protein [Vibrio cholerae RC385] >...  64.3    2e-08 
ref|ZP_01978991.1|  conserved hypothetical protein [Vibrio choler...  63.5    3e-08 
ref|ZP_05715654.1|  hypothetical protein VMD_07000 [Vibrio mimicu...  63.5    3e-08 
ref|NP_759771.1|  hypothetical protein VV1_0794 [Vibrio vulnificu...  63.2    3e-08 
ref|ZP_08103789.1|  putative periplasmic protein [Vibrio sinaloen...  63.2    4e-08 
ref|ZP_05240723.1|  conserved hypothetical protein [Vibrio choler...  63.2    4e-08 
ref|ZP_04402220.1|  polysaccharide synthesis-related protein [Vib...  62.8    4e-08 
ref|ZP_04717494.1|  hypothetical protein AmacA2_21203 [Alteromona...  62.4    6e-08 
gb|ECV65415.1|  hypothetical protein GOS_2858926 [marine metagenome]  62.0    7e-08 
ref|YP_002873221.1|  hypothetical protein PFLU3660 [Pseudomonas f...  60.8    2e-07 
ref|ZP_06040321.1|  polysaccharide synthesis-related protein [Vib...  60.5    2e-07 
ref|ZP_07774980.1|  hypothetical protein PFWH6_2379 [Pseudomonas ...  58.5    8e-07 
ref|YP_004469043.1|  hypothetical protein ambt_18740 [Alteromonas...  58.2    1e-06 
ref|YP_001339659.1|  hypothetical protein Mmwyl1_0790 [Marinomona...  57.0    3e-06 
gb|EDJ38382.1|  hypothetical protein GOS_1705098 [marine metagenome]  56.2    4e-06 
ref|YP_002890759.1|  hypothetical protein Tmz1t_3793 [Thauera sp....  55.5    9e-06 
gb|EBF39314.1|  hypothetical protein GOS_9605035 [marine metagenome]  55.1    9e-06 
gb|ECO71570.1|  hypothetical protein GOS_4328030 [marine metagenome]  54.7    1e-05 
ref|ZP_06173722.1|  conserved hypothetical protein [Vibrio harvey...  52.8    6e-05 
gb|ECG95961.1|  hypothetical protein GOS_3683526 [marine metagenome]  52.0    1e-04 
ref|ZP_07742696.1|  putative periplasmic protein [Vibrio caribben...  51.2    2e-04 
ref|YP_349560.1|  hypothetical protein Pfl01_3831 [Pseudomonas fl...  51.2    2e-04 
ref|ZP_01074260.1|  hypothetical protein MED121_15079 [Marinomona...  50.8    2e-04 
gb|EBP46974.1|  hypothetical protein GOS_7906243 [marine metagenome]  47.0    0.003 
ref|ZP_06180024.1|  hypothetical protein VMC_14540 [Vibrio algino...  46.6    0.003 
gb|EBH71459.1|  hypothetical protein GOS_9215222 [marine metagenome]  46.6    0.003 
gb|EBW35987.1|  hypothetical protein GOS_6758981 [marine metagenome]  46.2    0.005 
ref|ZP_01261311.1|  hypothetical protein V12G01_11523 [Vibrio alg...  45.1    0.009 
gb|ECD07193.1|  hypothetical protein GOS_4892991 [marine metagenome]  45.1    0.009 
gb|EBK16152.1|  hypothetical protein GOS_8778301 [marine metagenome]  45.1    0.009 
gb|EBF16285.1|  hypothetical protein GOS_9642686 [marine metagenome]  44.7    0.012 
ref|YP_004627399.1|  polysaccharide export protein [Thermodesulfo...  44.7    0.013 
gb|ECT46535.1|  hypothetical protein GOS_5865945 [marine metagenome]  44.3    0.017 
ref|YP_002943855.1|  hypothetical protein Vapar_1942 [Variovorax ...  44.3    0.019 
ref|YP_003448508.1|  polysaccharide export outer membrane protein...  44.3    0.020 
ref|YP_002296906.1|  polysaccharide biosynthesis [Rhodospirillum ...  43.5    0.030 
ref|ZP_01987515.1|  conserved hypothetical protein [Vibrio harvey...  43.1    0.035 


>ref|NP_415505.1| conserved protein [Escherichia coli str. K-12 substr. MG1655]
 ref|AP_001614.1| hypothetical protein [Escherichia coli str. K-12 substr. W3110]
 ref|YP_001457822.1| group 4 capsule (G4C) polysaccharide, YmcB [Escherichia coli 
HS]
 53 more sequence titles
 Length=248

 Score =  503 bits (1294),  Expect = 1e-140, Method: Compositional matrix adjust.
 Identities = 248/248 (100%), Positives = 248/248 (100%), Gaps = 0/248 (0%)

Query  1    MNKLQSYFIASVLYVMTPHAFAQGTVTIYLPGEQQTLSVGPVENVVQLVTQPQLRDRLWW  60
            MNKLQSYFIASVLYVMTPHAFAQGTVTIYLPGEQQTLSVGPVENVVQLVTQPQLRDRLWW
Sbjct  1    MNKLQSYFIASVLYVMTPHAFAQGTVTIYLPGEQQTLSVGPVENVVQLVTQPQLRDRLWW  60

Query  61   PGALLTDSAAKAKALKDYQHVMAQLASWEAEADDDVAATIKSVRQQLLNLNITGRLPVKL  120
            PGALLTDSAAKAKALKDYQHVMAQLASWEAEADDDVAATIKSVRQQLLNLNITGRLPVKL
Sbjct  61   PGALLTDSAAKAKALKDYQHVMAQLASWEAEADDDVAATIKSVRQQLLNLNITGRLPVKL  120

Query  121  DPDFVRVDENSNPPLVGDYTLYTVQRPVTITLLGAVSGAGQLPWQAGRSVTDYLQDHPRL  180
            DPDFVRVDENSNPPLVGDYTLYTVQRPVTITLLGAVSGAGQLPWQAGRSVTDYLQDHPRL
Sbjct  121  DPDFVRVDENSNPPLVGDYTLYTVQRPVTITLLGAVSGAGQLPWQAGRSVTDYLQDHPRL  180

Query  181  AGADKNNVMVITPEGETVVAPVALWNKRHVEPPPGSQLWLGFSAHVLPEKYADLNDQIVS  240
            AGADKNNVMVITPEGETVVAPVALWNKRHVEPPPGSQLWLGFSAHVLPEKYADLNDQIVS
Sbjct  181  AGADKNNVMVITPEGETVVAPVALWNKRHVEPPPGSQLWLGFSAHVLPEKYADLNDQIVS  240

Query  241  VLTQRVPD  248
            VLTQRVPD
Sbjct  241  VLTQRVPD  248


>ref|YP_309961.1| hypothetical protein SSON_0992 [Shigella sonnei Ss046]
 ref|YP_001462217.1| hypothetical protein EcE24377A_1101 [Escherichia coli E24377A]
 ref|YP_001725569.1| hypothetical protein EcolC_2611 [Escherichia coli ATCC 8739]
 26 more sequence titles
 Length=248

 Score =  501 bits (1291),  Expect = 3e-140, Method: Compositional matrix adjust.
 Identities = 247/248 (99%), Positives = 248/248 (100%), Gaps = 0/248 (0%)

Query  1    MNKLQSYFIASVLYVMTPHAFAQGTVTIYLPGEQQTLSVGPVENVVQLVTQPQLRDRLWW  60
            MNKLQSYFIASVLYVMTPHAFAQGTVTIYLPGEQQTLSVGP+ENVVQLVTQPQLRDRLWW
Sbjct  1    MNKLQSYFIASVLYVMTPHAFAQGTVTIYLPGEQQTLSVGPMENVVQLVTQPQLRDRLWW  60

Query  61   PGALLTDSAAKAKALKDYQHVMAQLASWEAEADDDVAATIKSVRQQLLNLNITGRLPVKL  120
            PGALLTDSAAKAKALKDYQHVMAQLASWEAEADDDVAATIKSVRQQLLNLNITGRLPVKL
Sbjct  61   PGALLTDSAAKAKALKDYQHVMAQLASWEAEADDDVAATIKSVRQQLLNLNITGRLPVKL  120

Query  121  DPDFVRVDENSNPPLVGDYTLYTVQRPVTITLLGAVSGAGQLPWQAGRSVTDYLQDHPRL  180
            DPDFVRVDENSNPPLVGDYTLYTVQRPVTITLLGAVSGAGQLPWQAGRSVTDYLQDHPRL
Sbjct  121  DPDFVRVDENSNPPLVGDYTLYTVQRPVTITLLGAVSGAGQLPWQAGRSVTDYLQDHPRL  180

Query  181  AGADKNNVMVITPEGETVVAPVALWNKRHVEPPPGSQLWLGFSAHVLPEKYADLNDQIVS  240
            AGADKNNVMVITPEGETVVAPVALWNKRHVEPPPGSQLWLGFSAHVLPEKYADLNDQIVS
Sbjct  181  AGADKNNVMVITPEGETVVAPVALWNKRHVEPPPGSQLWLGFSAHVLPEKYADLNDQIVS  240

Query  241  VLTQRVPD  248
            VLTQRVPD
Sbjct  241  VLTQRVPD  248


>ref|ZP_03070105.1| group 4 capsule (G4C) polysaccharide, YmcB [Escherichia coli 
101-1]
 ref|ZP_07145145.1| conserved hypothetical protein [Escherichia coli MS 187-1]
 gb|EDX38942.1| group 4 capsule (G4C) polysaccharide, YmcB [Escherichia coli 
101-1]
 gb|EFK25881.1| conserved hypothetical protein [Escherichia coli MS 187-1]
 gb|EGB68331.1| SLBB-domain-containing protein [Escherichia coli TA007]
Length=248

 Score =  501 bits (1291),  Expect = 4e-140, Method: Compositional matrix adjust.
 Identities = 247/248 (99%), Positives = 248/248 (100%), Gaps = 0/248 (0%)

Query  1    MNKLQSYFIASVLYVMTPHAFAQGTVTIYLPGEQQTLSVGPVENVVQLVTQPQLRDRLWW  60
            MNKLQSYFIASVLYVMTPHAFAQGTVTIYLPGE+QTLSVGPVENVVQLVTQPQLRDRLWW
Sbjct  1    MNKLQSYFIASVLYVMTPHAFAQGTVTIYLPGEEQTLSVGPVENVVQLVTQPQLRDRLWW  60

Query  61   PGALLTDSAAKAKALKDYQHVMAQLASWEAEADDDVAATIKSVRQQLLNLNITGRLPVKL  120
            PGALLTDSAAKAKALKDYQHVMAQLASWEAEADDDVAATIKSVRQQLLNLNITGRLPVKL
Sbjct  61   PGALLTDSAAKAKALKDYQHVMAQLASWEAEADDDVAATIKSVRQQLLNLNITGRLPVKL  120

Query  121  DPDFVRVDENSNPPLVGDYTLYTVQRPVTITLLGAVSGAGQLPWQAGRSVTDYLQDHPRL  180
            DPDFVRVDENSNPPLVGDYTLYTVQRPVTITLLGAVSGAGQLPWQAGRSVTDYLQDHPRL
Sbjct  121  DPDFVRVDENSNPPLVGDYTLYTVQRPVTITLLGAVSGAGQLPWQAGRSVTDYLQDHPRL  180

Query  181  AGADKNNVMVITPEGETVVAPVALWNKRHVEPPPGSQLWLGFSAHVLPEKYADLNDQIVS  240
            AGADKNNVMVITPEGETVVAPVALWNKRHVEPPPGSQLWLGFSAHVLPEKYADLNDQIVS
Sbjct  181  AGADKNNVMVITPEGETVVAPVALWNKRHVEPPPGSQLWLGFSAHVLPEKYADLNDQIVS  240

Query  241  VLTQRVPD  248
            VLTQRVPD
Sbjct  241  VLTQRVPD  248


>gb|EFZ60522.1| hypothetical protein ECLT68_0462 [Escherichia coli LT-68]
Length=248

 Score =  501 bits (1289),  Expect = 5e-140, Method: Compositional matrix adjust.
 Identities = 247/248 (99%), Positives = 247/248 (99%), Gaps = 0/248 (0%)

Query  1    MNKLQSYFIASVLYVMTPHAFAQGTVTIYLPGEQQTLSVGPVENVVQLVTQPQLRDRLWW  60
            MNKLQSYFIASVLYVMTPHAFAQGTVTIYLPGEQQTLSVGPVENVVQLVTQPQLRDRLWW
Sbjct  1    MNKLQSYFIASVLYVMTPHAFAQGTVTIYLPGEQQTLSVGPVENVVQLVTQPQLRDRLWW  60

Query  61   PGALLTDSAAKAKALKDYQHVMAQLASWEAEADDDVAATIKSVRQQLLNLNITGRLPVKL  120
            P ALLTDSAAKAKALKDYQHVMAQLASWEAEADDDVAATIKSVRQQLLNLNITGRLPVKL
Sbjct  61   PSALLTDSAAKAKALKDYQHVMAQLASWEAEADDDVAATIKSVRQQLLNLNITGRLPVKL  120

Query  121  DPDFVRVDENSNPPLVGDYTLYTVQRPVTITLLGAVSGAGQLPWQAGRSVTDYLQDHPRL  180
            DPDFVRVDENSNPPLVGDYTLYTVQRPVTITLLGAVSGAGQLPWQAGRSVTDYLQDHPRL
Sbjct  121  DPDFVRVDENSNPPLVGDYTLYTVQRPVTITLLGAVSGAGQLPWQAGRSVTDYLQDHPRL  180

Query  181  AGADKNNVMVITPEGETVVAPVALWNKRHVEPPPGSQLWLGFSAHVLPEKYADLNDQIVS  240
            AGADKNNVMVITPEGETVVAPVALWNKRHVEPPPGSQLWLGFSAHVLPEKYADLNDQIVS
Sbjct  181  AGADKNNVMVITPEGETVVAPVALWNKRHVEPPPGSQLWLGFSAHVLPEKYADLNDQIVS  240

Query  241  VLTQRVPD  248
            VLTQRVPD
Sbjct  241  VLTQRVPD  248


>ref|NP_706908.1| hypothetical protein SF0987 [Shigella flexneri 2a str. 301]
 ref|NP_836693.1| hypothetical protein S1054 [Shigella flexneri 2a str. 2457T]
 ref|YP_688519.1| hypothetical protein SFV_0994 [Shigella flexneri 5 str. 8401]
 10 more sequence titles
 Length=248

 Score =  501 bits (1289),  Expect = 5e-140, Method: Compositional matrix adjust.
 Identities = 247/248 (99%), Positives = 247/248 (99%), Gaps = 0/248 (0%)

Query  1    MNKLQSYFIASVLYVMTPHAFAQGTVTIYLPGEQQTLSVGPVENVVQLVTQPQLRDRLWW  60
            MNKLQSYFIASVLYVMTPHAFAQGTVTIYLP EQQTLSVGPVENVVQLVTQPQLRDRLWW
Sbjct  1    MNKLQSYFIASVLYVMTPHAFAQGTVTIYLPSEQQTLSVGPVENVVQLVTQPQLRDRLWW  60

Query  61   PGALLTDSAAKAKALKDYQHVMAQLASWEAEADDDVAATIKSVRQQLLNLNITGRLPVKL  120
            PGALLTDSAAKAKALKDYQHVMAQLASWEAEADDDVAATIKSVRQQLLNLNITGRLPVKL
Sbjct  61   PGALLTDSAAKAKALKDYQHVMAQLASWEAEADDDVAATIKSVRQQLLNLNITGRLPVKL  120

Query  121  DPDFVRVDENSNPPLVGDYTLYTVQRPVTITLLGAVSGAGQLPWQAGRSVTDYLQDHPRL  180
            DPDFVRVDENSNPPLVGDYTLYTVQRPVTITLLGAVSGAGQLPWQAGRSVTDYLQDHPRL
Sbjct  121  DPDFVRVDENSNPPLVGDYTLYTVQRPVTITLLGAVSGAGQLPWQAGRSVTDYLQDHPRL  180

Query  181  AGADKNNVMVITPEGETVVAPVALWNKRHVEPPPGSQLWLGFSAHVLPEKYADLNDQIVS  240
            AGADKNNVMVITPEGETVVAPVALWNKRHVEPPPGSQLWLGFSAHVLPEKYADLNDQIVS
Sbjct  181  AGADKNNVMVITPEGETVVAPVALWNKRHVEPPPGSQLWLGFSAHVLPEKYADLNDQIVS  240

Query  241  VLTQRVPD  248
            VLTQRVPD
Sbjct  241  VLTQRVPD  248


>ref|YP_002402185.1| hypothetical protein EC55989_1093 [Escherichia coli 55989]
 emb|CAU96954.1| conserved hypothetical protein [Escherichia coli 55989]
Length=248

 Score =  500 bits (1288),  Expect = 6e-140, Method: Compositional matrix adjust.
 Identities = 246/248 (99%), Positives = 248/248 (100%), Gaps = 0/248 (0%)

Query  1    MNKLQSYFIASVLYVMTPHAFAQGTVTIYLPGEQQTLSVGPVENVVQLVTQPQLRDRLWW  60
            MNKLQSYFIASVLYVMTPHAFAQGTVTIYLPGEQQTLSVGP+ENVVQLVTQPQLRDRLWW
Sbjct  1    MNKLQSYFIASVLYVMTPHAFAQGTVTIYLPGEQQTLSVGPMENVVQLVTQPQLRDRLWW  60

Query  61   PGALLTDSAAKAKALKDYQHVMAQLASWEAEADDDVAATIKSVRQQLLNLNITGRLPVKL  120
            PGALLTDSAAKAKALKDYQHVMAQLASWEAEADDDVAATIKSVRQQLLNLNITGRLPVKL
Sbjct  61   PGALLTDSAAKAKALKDYQHVMAQLASWEAEADDDVAATIKSVRQQLLNLNITGRLPVKL  120

Query  121  DPDFVRVDENSNPPLVGDYTLYTVQRPVTITLLGAVSGAGQLPWQAGRSVTDYLQDHPRL  180
            DPDFVR+DENSNPPLVGDYTLYTVQRPVTITLLGAVSGAGQLPWQAGRSVTDYLQDHPRL
Sbjct  121  DPDFVRLDENSNPPLVGDYTLYTVQRPVTITLLGAVSGAGQLPWQAGRSVTDYLQDHPRL  180

Query  181  AGADKNNVMVITPEGETVVAPVALWNKRHVEPPPGSQLWLGFSAHVLPEKYADLNDQIVS  240
            AGADKNNVMVITPEGETVVAPVALWNKRHVEPPPGSQLWLGFSAHVLPEKYADLNDQIVS
Sbjct  181  AGADKNNVMVITPEGETVVAPVALWNKRHVEPPPGSQLWLGFSAHVLPEKYADLNDQIVS  240

Query  241  VLTQRVPD  248
            VLTQRVPD
Sbjct  241  VLTQRVPD  248


>ref|YP_408638.1| hypothetical protein SBO_2245 [Shigella boydii Sb227]
 gb|ABB66810.1| conserved hypothetical protein [Shigella boydii Sb227]
 gb|EGI98404.1| hypothetical protein SB359474_2603 [Shigella boydii 3594-74]
Length=248

 Score =  500 bits (1287),  Expect = 8e-140, Method: Compositional matrix adjust.
 Identities = 246/248 (99%), Positives = 248/248 (100%), Gaps = 0/248 (0%)

Query  1    MNKLQSYFIASVLYVMTPHAFAQGTVTIYLPGEQQTLSVGPVENVVQLVTQPQLRDRLWW  60
            MNKLQSYFIASVLYVMTPHAFAQGTVTIYLPGEQQTLSVGP+ENVVQLVTQPQLRDRLWW
Sbjct  1    MNKLQSYFIASVLYVMTPHAFAQGTVTIYLPGEQQTLSVGPMENVVQLVTQPQLRDRLWW  60

Query  61   PGALLTDSAAKAKALKDYQHVMAQLASWEAEADDDVAATIKSVRQQLLNLNITGRLPVKL  120
            PGALLTDSAAKAKALKDYQHVMAQLASWEAEADDDVAATIKSVRQQLLNLNITGRLPVKL
Sbjct  61   PGALLTDSAAKAKALKDYQHVMAQLASWEAEADDDVAATIKSVRQQLLNLNITGRLPVKL  120

Query  121  DPDFVRVDENSNPPLVGDYTLYTVQRPVTITLLGAVSGAGQLPWQAGRSVTDYLQDHPRL  180
            DPDFVRVDENSNPPLVGDYTLYTVQRPVTITLLGAVSGAGQLPWQAGRSVTDYLQDHPRL
Sbjct  121  DPDFVRVDENSNPPLVGDYTLYTVQRPVTITLLGAVSGAGQLPWQAGRSVTDYLQDHPRL  180

Query  181  AGADKNNVMVITPEGETVVAPVALWNKRHVEPPPGSQLWLGFSAHVLPEKYADLNDQIVS  240
            AGADKNNVMVITPEGETVVAPVALWNKRHV+PPPGSQLWLGFSAHVLPEKYADLNDQIVS
Sbjct  181  AGADKNNVMVITPEGETVVAPVALWNKRHVDPPPGSQLWLGFSAHVLPEKYADLNDQIVS  240

Query  241  VLTQRVPD  248
            VLTQRVPD
Sbjct  241  VLTQRVPD  248


>ref|YP_001880817.1| group 4 capsule (G4C) polysaccharide, YmcB [Shigella boydii CDC 
3083-94]
 gb|ACD07514.1| group 4 capsule (G4C) polysaccharide, YmcB [Shigella boydii CDC 
3083-94]
Length=248

 Score =  500 bits (1287),  Expect = 8e-140, Method: Compositional matrix adjust.
 Identities = 246/248 (99%), Positives = 248/248 (100%), Gaps = 0/248 (0%)

Query  1    MNKLQSYFIASVLYVMTPHAFAQGTVTIYLPGEQQTLSVGPVENVVQLVTQPQLRDRLWW  60
            MNKLQSYFIASVLYVMTPHAFAQGTVTIYLPGEQQTLSVGP+ENVVQLVTQPQLRDRLWW
Sbjct  1    MNKLQSYFIASVLYVMTPHAFAQGTVTIYLPGEQQTLSVGPMENVVQLVTQPQLRDRLWW  60

Query  61   PGALLTDSAAKAKALKDYQHVMAQLASWEAEADDDVAATIKSVRQQLLNLNITGRLPVKL  120
            PGALLTDSAAKAKALKDYQHVMAQLASWEAEA+DDVAATIKSVRQQLLNLNITGRLPVKL
Sbjct  61   PGALLTDSAAKAKALKDYQHVMAQLASWEAEANDDVAATIKSVRQQLLNLNITGRLPVKL  120

Query  121  DPDFVRVDENSNPPLVGDYTLYTVQRPVTITLLGAVSGAGQLPWQAGRSVTDYLQDHPRL  180
            DPDFVRVDENSNPPLVGDYTLYTVQRPVTITLLGAVSGAGQLPWQAGRSVTDYLQDHPRL
Sbjct  121  DPDFVRVDENSNPPLVGDYTLYTVQRPVTITLLGAVSGAGQLPWQAGRSVTDYLQDHPRL  180

Query  181  AGADKNNVMVITPEGETVVAPVALWNKRHVEPPPGSQLWLGFSAHVLPEKYADLNDQIVS  240
            AGADKNNVMVITPEGETVVAPVALWNKRHVEPPPGSQLWLGFSAHVLPEKYADLNDQIVS
Sbjct  181  AGADKNNVMVITPEGETVVAPVALWNKRHVEPPPGSQLWLGFSAHVLPEKYADLNDQIVS  240

Query  241  VLTQRVPD  248
            VLTQRVPD
Sbjct  241  VLTQRVPD  248


>gb|EFZ70668.1| hypothetical protein ECOK1357_1256 [Escherichia coli 1357]
Length=248

 Score =  500 bits (1287),  Expect = 9e-140, Method: Compositional matrix adjust.
 Identities = 246/248 (99%), Positives = 247/248 (99%), Gaps = 0/248 (0%)

Query  1    MNKLQSYFIASVLYVMTPHAFAQGTVTIYLPGEQQTLSVGPVENVVQLVTQPQLRDRLWW  60
            MNKLQSYFIASVLYVMTPHAFAQGTVTIYLPGEQQTLSVGP+ENVVQLVTQPQLRDRLWW
Sbjct  1    MNKLQSYFIASVLYVMTPHAFAQGTVTIYLPGEQQTLSVGPMENVVQLVTQPQLRDRLWW  60

Query  61   PGALLTDSAAKAKALKDYQHVMAQLASWEAEADDDVAATIKSVRQQLLNLNITGRLPVKL  120
            P ALLTDSAAKAKALKDYQHVMAQLASWEAEADDDVAATIKSVRQQLLNLNITGRLPVKL
Sbjct  61   PSALLTDSAAKAKALKDYQHVMAQLASWEAEADDDVAATIKSVRQQLLNLNITGRLPVKL  120

Query  121  DPDFVRVDENSNPPLVGDYTLYTVQRPVTITLLGAVSGAGQLPWQAGRSVTDYLQDHPRL  180
            DPDFVRVDENSNPPLVGDYTLYTVQRPVTITLLGAVSGAGQLPWQAGRSVTDYLQDHPRL
Sbjct  121  DPDFVRVDENSNPPLVGDYTLYTVQRPVTITLLGAVSGAGQLPWQAGRSVTDYLQDHPRL  180

Query  181  AGADKNNVMVITPEGETVVAPVALWNKRHVEPPPGSQLWLGFSAHVLPEKYADLNDQIVS  240
            AGADKNNVMVITPEGETVVAPVALWNKRHVEPPPGSQLWLGFSAHVLPEKYADLNDQIVS
Sbjct  181  AGADKNNVMVITPEGETVVAPVALWNKRHVEPPPGSQLWLGFSAHVLPEKYADLNDQIVS  240

Query  241  VLTQRVPD  248
            VLTQRVPD
Sbjct  241  VLTQRVPD  248


>ref|ZP_03064769.1| group 4 capsule (G4C) polysaccharide, YmcB [Shigella dysenteriae 
1012]
 gb|EDX35451.1| group 4 capsule (G4C) polysaccharide, YmcB [Shigella dysenteriae 
1012]
Length=248

 Score =  500 bits (1287),  Expect = 1e-139, Method: Compositional matrix adjust.
 Identities = 247/248 (99%), Positives = 247/248 (99%), Gaps = 0/248 (0%)

Query  1    MNKLQSYFIASVLYVMTPHAFAQGTVTIYLPGEQQTLSVGPVENVVQLVTQPQLRDRLWW  60
            MNKLQSYFIASVLYVMTPHAFAQGTVTIYLPGEQQTLSVGPVENVVQLVTQPQLRDRLWW
Sbjct  1    MNKLQSYFIASVLYVMTPHAFAQGTVTIYLPGEQQTLSVGPVENVVQLVTQPQLRDRLWW  60

Query  61   PGALLTDSAAKAKALKDYQHVMAQLASWEAEADDDVAATIKSVRQQLLNLNITGRLPVKL  120
            PGALLTDSAAKAKALKDYQHVMAQLASWEAEADDDVAATIKSVRQQLLNLNITGRLPVKL
Sbjct  61   PGALLTDSAAKAKALKDYQHVMAQLASWEAEADDDVAATIKSVRQQLLNLNITGRLPVKL  120

Query  121  DPDFVRVDENSNPPLVGDYTLYTVQRPVTITLLGAVSGAGQLPWQAGRSVTDYLQDHPRL  180
             PDFVRVDENSNPPLVGDYTLYTVQRPVTITLLGAVSGAGQLPWQAGRSVTDYLQDHPRL
Sbjct  121  APDFVRVDENSNPPLVGDYTLYTVQRPVTITLLGAVSGAGQLPWQAGRSVTDYLQDHPRL  180

Query  181  AGADKNNVMVITPEGETVVAPVALWNKRHVEPPPGSQLWLGFSAHVLPEKYADLNDQIVS  240
            AGADKNNVMVITPEGETVVAPVALWNKRHVEPPPGSQLWLGFSAHVLPEKYADLNDQIVS
Sbjct  181  AGADKNNVMVITPEGETVVAPVALWNKRHVEPPPGSQLWLGFSAHVLPEKYADLNDQIVS  240

Query  241  VLTQRVPD  248
            VLTQRVPD
Sbjct  241  VLTQRVPD  248


>gb|EGB58446.1| SLBB-domain-containing protein [Escherichia coli H489]
Length=248

 Score =  499 bits (1286),  Expect = 1e-139, Method: Compositional matrix adjust.
 Identities = 246/248 (99%), Positives = 247/248 (99%), Gaps = 0/248 (0%)

Query  1    MNKLQSYFIASVLYVMTPHAFAQGTVTIYLPGEQQTLSVGPVENVVQLVTQPQLRDRLWW  60
            MNKLQSYFIASVLYVMTPHAFAQGTVTIYLPGE+QTLSVGPVENVVQLVTQPQLRDRLWW
Sbjct  1    MNKLQSYFIASVLYVMTPHAFAQGTVTIYLPGEEQTLSVGPVENVVQLVTQPQLRDRLWW  60

Query  61   PGALLTDSAAKAKALKDYQHVMAQLASWEAEADDDVAATIKSVRQQLLNLNITGRLPVKL  120
            P ALLTDSAAKAKALKDYQHVMAQLASWEAEADDDVAATIKSVRQQLLNLNITGRLPVKL
Sbjct  61   PSALLTDSAAKAKALKDYQHVMAQLASWEAEADDDVAATIKSVRQQLLNLNITGRLPVKL  120

Query  121  DPDFVRVDENSNPPLVGDYTLYTVQRPVTITLLGAVSGAGQLPWQAGRSVTDYLQDHPRL  180
            DPDFVRVDENSNPPLVGDYTLYTVQRPVTITLLGAVSGAGQLPWQAGRSVTDYLQDHPRL
Sbjct  121  DPDFVRVDENSNPPLVGDYTLYTVQRPVTITLLGAVSGAGQLPWQAGRSVTDYLQDHPRL  180

Query  181  AGADKNNVMVITPEGETVVAPVALWNKRHVEPPPGSQLWLGFSAHVLPEKYADLNDQIVS  240
            AGADKNNVMVITPEGETVVAPVALWNKRHVEPPPGSQLWLGFSAHVLPEKYADLNDQIVS
Sbjct  181  AGADKNNVMVITPEGETVVAPVALWNKRHVEPPPGSQLWLGFSAHVLPEKYADLNDQIVS  240

Query  241  VLTQRVPD  248
            VLTQRVPD
Sbjct  241  VLTQRVPD  248


>ref|ZP_06656911.1| YmcB [Escherichia coli B185]
 gb|EFF07293.1| YmcB [Escherichia coli B185]
Length=248

 Score =  499 bits (1286),  Expect = 1e-139, Method: Compositional matrix adjust.
 Identities = 246/248 (99%), Positives = 247/248 (99%), Gaps = 0/248 (0%)

Query  1    MNKLQSYFIASVLYVMTPHAFAQGTVTIYLPGEQQTLSVGPVENVVQLVTQPQLRDRLWW  60
            MNKLQSYFIASVLYVMTPHAFAQGTVTIYLPGEQQTLSVGPVENV QLVTQPQLRDRLWW
Sbjct  1    MNKLQSYFIASVLYVMTPHAFAQGTVTIYLPGEQQTLSVGPVENVAQLVTQPQLRDRLWW  60

Query  61   PGALLTDSAAKAKALKDYQHVMAQLASWEAEADDDVAATIKSVRQQLLNLNITGRLPVKL  120
            PGALLTDSAAKAKALKDYQHVMAQLASWEAEADDDVAATIKSVRQQLLNLNITGRLPVKL
Sbjct  61   PGALLTDSAAKAKALKDYQHVMAQLASWEAEADDDVAATIKSVRQQLLNLNITGRLPVKL  120

Query  121  DPDFVRVDENSNPPLVGDYTLYTVQRPVTITLLGAVSGAGQLPWQAGRSVTDYLQDHPRL  180
            DPDFVRVDENSNPPLVGDYTLYTVQRPVTITLLGAVSGAGQLPWQAGRSVTDYLQDHPRL
Sbjct  121  DPDFVRVDENSNPPLVGDYTLYTVQRPVTITLLGAVSGAGQLPWQAGRSVTDYLQDHPRL  180

Query  181  AGADKNNVMVITPEGETVVAPVALWNKRHVEPPPGSQLWLGFSAHVLPEKYADLNDQIVS  240
            AGADKNNV+VITPEGETVVAPVALWNKRHVEPPPGSQLWLGFSAHVLPEKYADLNDQIVS
Sbjct  181  AGADKNNVIVITPEGETVVAPVALWNKRHVEPPPGSQLWLGFSAHVLPEKYADLNDQIVS  240

Query  241  VLTQRVPD  248
            VLTQRVPD
Sbjct  241  VLTQRVPD  248


>ref|ZP_08353029.1| conserved hypothetical protein [Escherichia coli M718]
 gb|EGI22346.1| conserved hypothetical protein [Escherichia coli M718]
Length=248

 Score =  499 bits (1286),  Expect = 1e-139, Method: Compositional matrix adjust.
 Identities = 246/248 (99%), Positives = 246/248 (99%), Gaps = 0/248 (0%)

Query  1    MNKLQSYFIASVLYVMTPHAFAQGTVTIYLPGEQQTLSVGPVENVVQLVTQPQLRDRLWW  60
            MNKLQSYFIASVLYVMTPHAFAQGTVTIYLPGEQQTLSVGPVENV QLVTQPQLRDRLWW
Sbjct  1    MNKLQSYFIASVLYVMTPHAFAQGTVTIYLPGEQQTLSVGPVENVAQLVTQPQLRDRLWW  60

Query  61   PGALLTDSAAKAKALKDYQHVMAQLASWEAEADDDVAATIKSVRQQLLNLNITGRLPVKL  120
            PGALLTDSA KAKALKDYQHVMAQLASWEAEADDDVAATIKSVRQQLLNLNITGRLPVKL
Sbjct  61   PGALLTDSAVKAKALKDYQHVMAQLASWEAEADDDVAATIKSVRQQLLNLNITGRLPVKL  120

Query  121  DPDFVRVDENSNPPLVGDYTLYTVQRPVTITLLGAVSGAGQLPWQAGRSVTDYLQDHPRL  180
            DPDFVRVDENSNPPLVGDYTLYTVQRPVTITLLGAVSGAGQLPWQAGRSVTDYLQDHPRL
Sbjct  121  DPDFVRVDENSNPPLVGDYTLYTVQRPVTITLLGAVSGAGQLPWQAGRSVTDYLQDHPRL  180

Query  181  AGADKNNVMVITPEGETVVAPVALWNKRHVEPPPGSQLWLGFSAHVLPEKYADLNDQIVS  240
            AGADKNNVMVITPEGETVVAPVALWNKRHVEPPPGSQLWLGFSAHVLPEKYADLNDQIVS
Sbjct  181  AGADKNNVMVITPEGETVVAPVALWNKRHVEPPPGSQLWLGFSAHVLPEKYADLNDQIVS  240

Query  241  VLTQRVPD  248
            VLTQRVPD
Sbjct  241  VLTQRVPD  248


>ref|ZP_02999848.1| group 4 capsule (G4C) polysaccharide, YmcB [Escherichia coli 
53638]
 gb|EDU62880.1| group 4 capsule (G4C) polysaccharide, YmcB [Escherichia coli 
53638]
Length=248

 Score =  499 bits (1285),  Expect = 2e-139, Method: Compositional matrix adjust.
 Identities = 247/248 (99%), Positives = 247/248 (99%), Gaps = 0/248 (0%)

Query  1    MNKLQSYFIASVLYVMTPHAFAQGTVTIYLPGEQQTLSVGPVENVVQLVTQPQLRDRLWW  60
            MNKLQSYFIASVLYVMTPHAFAQGTVTIYLPGEQQTLSVGPVENVVQLVTQ QLRDRLWW
Sbjct  1    MNKLQSYFIASVLYVMTPHAFAQGTVTIYLPGEQQTLSVGPVENVVQLVTQLQLRDRLWW  60

Query  61   PGALLTDSAAKAKALKDYQHVMAQLASWEAEADDDVAATIKSVRQQLLNLNITGRLPVKL  120
            PGALLTDSAAKAKALKDYQHVMAQLASWEAEADDDVAATIKSVRQQLLNLNITGRLPVKL
Sbjct  61   PGALLTDSAAKAKALKDYQHVMAQLASWEAEADDDVAATIKSVRQQLLNLNITGRLPVKL  120

Query  121  DPDFVRVDENSNPPLVGDYTLYTVQRPVTITLLGAVSGAGQLPWQAGRSVTDYLQDHPRL  180
            DPDFVRVDENSNPPLVGDYTLYTVQRPVTITLLGAVSGAGQLPWQAGRSVTDYLQDHPRL
Sbjct  121  DPDFVRVDENSNPPLVGDYTLYTVQRPVTITLLGAVSGAGQLPWQAGRSVTDYLQDHPRL  180

Query  181  AGADKNNVMVITPEGETVVAPVALWNKRHVEPPPGSQLWLGFSAHVLPEKYADLNDQIVS  240
            AGADKNNVMVITPEGETVVAPVALWNKRHVEPPPGSQLWLGFSAHVLPEKYADLNDQIVS
Sbjct  181  AGADKNNVMVITPEGETVVAPVALWNKRHVEPPPGSQLWLGFSAHVLPEKYADLNDQIVS  240

Query  241  VLTQRVPD  248
            VLTQRVPD
Sbjct  241  VLTQRVPD  248


>ref|ZP_06652940.1| YmcB protein [Escherichia coli B354]
 gb|EFF12316.1| YmcB protein [Escherichia coli B354]
Length=248

 Score =  499 bits (1285),  Expect = 2e-139, Method: Compositional matrix adjust.
 Identities = 246/248 (99%), Positives = 246/248 (99%), Gaps = 0/248 (0%)

Query  1    MNKLQSYFIASVLYVMTPHAFAQGTVTIYLPGEQQTLSVGPVENVVQLVTQPQLRDRLWW  60
            MNKLQSYFIASVLYVMTPHAFAQGTVTIYLPGEQQTLSVGPVENVVQLVTQPQLRDRLWW
Sbjct  1    MNKLQSYFIASVLYVMTPHAFAQGTVTIYLPGEQQTLSVGPVENVVQLVTQPQLRDRLWW  60

Query  61   PGALLTDSAAKAKALKDYQHVMAQLASWEAEADDDVAATIKSVRQQLLNLNITGRLPVKL  120
            PGALLTDSAAKAKALKDYQHVMAQLASWEAEADDDV ATIKSVRQQLLNLNITGRLPVKL
Sbjct  61   PGALLTDSAAKAKALKDYQHVMAQLASWEAEADDDVVATIKSVRQQLLNLNITGRLPVKL  120

Query  121  DPDFVRVDENSNPPLVGDYTLYTVQRPVTITLLGAVSGAGQLPWQAGRSVTDYLQDHPRL  180
            DPDFVRVDENSNPPLVGDYTLYTVQRPVTITLLGAVSGAGQLPWQAGR VTDYLQDHPRL
Sbjct  121  DPDFVRVDENSNPPLVGDYTLYTVQRPVTITLLGAVSGAGQLPWQAGRRVTDYLQDHPRL  180

Query  181  AGADKNNVMVITPEGETVVAPVALWNKRHVEPPPGSQLWLGFSAHVLPEKYADLNDQIVS  240
            AGADKNNVMVITPEGETVVAPVALWNKRHVEPPPGSQLWLGFSAHVLPEKYADLNDQIVS
Sbjct  181  AGADKNNVMVITPEGETVVAPVALWNKRHVEPPPGSQLWLGFSAHVLPEKYADLNDQIVS  240

Query  241  VLTQRVPD  248
            VLTQRVPD
Sbjct  241  VLTQRVPD  248


>ref|ZP_03047374.1| group 4 capsule (G4C) polysaccharide, YmcB [Escherichia coli 
E22]
 ref|ZP_03062339.1| group 4 capsule (G4C) polysaccharide, YmcB [Escherichia coli 
B171]
 ref|YP_003221011.1| hypothetical protein ECO103_1030 [Escherichia coli O103:H2 str. 
12009]
 gb|EDV80675.1| group 4 capsule (G4C) polysaccharide, YmcB [Escherichia coli 
E22]
 gb|EDX28417.1| group 4 capsule (G4C) polysaccharide, YmcB [Escherichia coli 
B171]
 dbj|BAI29877.1| conserved predicted protein [Escherichia coli O103:H2 str. 12009]
 gb|EFZ44274.1| hypothetical protein ECE128010_5482 [Escherichia coli E128010]
Length=248

 Score =  499 bits (1284),  Expect = 2e-139, Method: Compositional matrix adjust.
 Identities = 246/248 (99%), Positives = 247/248 (99%), Gaps = 0/248 (0%)

Query  1    MNKLQSYFIASVLYVMTPHAFAQGTVTIYLPGEQQTLSVGPVENVVQLVTQPQLRDRLWW  60
            MNKLQSYFIASVLYVMTPHAFAQGTVTIYLPGEQQTLSVGPVENVVQLVTQPQLRDRLWW
Sbjct  1    MNKLQSYFIASVLYVMTPHAFAQGTVTIYLPGEQQTLSVGPVENVVQLVTQPQLRDRLWW  60

Query  61   PGALLTDSAAKAKALKDYQHVMAQLASWEAEADDDVAATIKSVRQQLLNLNITGRLPVKL  120
            PGALLTDSAAKAKALKDYQHVMAQLASWEAEADDDVAATIKSVRQQLLNL+ITGRLPVKL
Sbjct  61   PGALLTDSAAKAKALKDYQHVMAQLASWEAEADDDVAATIKSVRQQLLNLHITGRLPVKL  120

Query  121  DPDFVRVDENSNPPLVGDYTLYTVQRPVTITLLGAVSGAGQLPWQAGRSVTDYLQDHPRL  180
            DPDFVRVDENSNPPLVGDYTLYTVQRPVTITLLGAVSG GQLPWQAGRSVTDYLQDHPRL
Sbjct  121  DPDFVRVDENSNPPLVGDYTLYTVQRPVTITLLGAVSGTGQLPWQAGRSVTDYLQDHPRL  180

Query  181  AGADKNNVMVITPEGETVVAPVALWNKRHVEPPPGSQLWLGFSAHVLPEKYADLNDQIVS  240
            AGADKNNVMVITPEGETVVAPVALWNKRHVEPPPGSQLWLGFSAHVLPEKYADLNDQIVS
Sbjct  181  AGADKNNVMVITPEGETVVAPVALWNKRHVEPPPGSQLWLGFSAHVLPEKYADLNDQIVS  240

Query  241  VLTQRVPD  248
            VLTQRVPD
Sbjct  241  VLTQRVPD  248


>ref|YP_003036816.1| hypothetical protein ECBD_2609 [Escherichia coli 'BL21-Gold(DE3)pLysS 
AG']
 ref|YP_003044206.1| hypothetical protein ECB_00988 [Escherichia coli B str. REL606]
 ref|ZP_06939304.1| hypothetical protein EcolOP_25017 [Escherichia coli OP50]
 emb|CAQ31512.1| conserved protein [Escherichia coli BL21(DE3)]
 gb|ACT29631.1| protein of unknown function DUF1017 [Escherichia coli 'BL21-Gold(DE3)pLysS 
AG']
 gb|ACT38670.1| hypothetical protein ECB_00988 [Escherichia coli B str. REL606]
 gb|ACT42883.1| hypothetical protein ECD_00988 [Escherichia coli BL21(DE3)]
Length=248

 Score =  498 bits (1283),  Expect = 3e-139, Method: Compositional matrix adjust.
 Identities = 246/248 (99%), Positives = 247/248 (99%), Gaps = 0/248 (0%)

Query  1    MNKLQSYFIASVLYVMTPHAFAQGTVTIYLPGEQQTLSVGPVENVVQLVTQPQLRDRLWW  60
            MNKLQSYFIASVLYVMTPHAFAQGTVTIYLPGE+QTLSVGPVENVVQLVTQPQLRDRLWW
Sbjct  1    MNKLQSYFIASVLYVMTPHAFAQGTVTIYLPGEEQTLSVGPVENVVQLVTQPQLRDRLWW  60

Query  61   PGALLTDSAAKAKALKDYQHVMAQLASWEAEADDDVAATIKSVRQQLLNLNITGRLPVKL  120
            P ALLTDSAAKAKALKDYQHVMAQLASWEAEADDDVAATIKSVRQQLLNLNITGRLPVKL
Sbjct  61   PVALLTDSAAKAKALKDYQHVMAQLASWEAEADDDVAATIKSVRQQLLNLNITGRLPVKL  120

Query  121  DPDFVRVDENSNPPLVGDYTLYTVQRPVTITLLGAVSGAGQLPWQAGRSVTDYLQDHPRL  180
            DPDFVRVDENSNPPLVGDYTLYTVQRPVTITLLGAVSGAGQLPWQAGRSVTDYLQDHPRL
Sbjct  121  DPDFVRVDENSNPPLVGDYTLYTVQRPVTITLLGAVSGAGQLPWQAGRSVTDYLQDHPRL  180

Query  181  AGADKNNVMVITPEGETVVAPVALWNKRHVEPPPGSQLWLGFSAHVLPEKYADLNDQIVS  240
            AGADKNNVMVITPEGETVVAPVALWNKRHVEPPPGSQLWLGFSAHVLPEKYADLNDQIVS
Sbjct  181  AGADKNNVMVITPEGETVVAPVALWNKRHVEPPPGSQLWLGFSAHVLPEKYADLNDQIVS  240

Query  241  VLTQRVPD  248
            VLTQRVPD
Sbjct  241  VLTQRVPD  248


>ref|ZP_08373309.1| conserved hypothetical protein [Escherichia coli TA280]
 gb|EGI41529.1| conserved hypothetical protein [Escherichia coli TA280]
Length=248

 Score =  498 bits (1281),  Expect = 5e-139, Method: Compositional matrix adjust.
 Identities = 244/248 (98%), Positives = 246/248 (99%), Gaps = 0/248 (0%)

Query  1    MNKLQSYFIASVLYVMTPHAFAQGTVTIYLPGEQQTLSVGPVENVVQLVTQPQLRDRLWW  60
            MNKLQSYFIASVLYVMTPHAFAQGTVTIYLPGEQQTLSVGPVENVVQLVTQPQLRDRLWW
Sbjct  1    MNKLQSYFIASVLYVMTPHAFAQGTVTIYLPGEQQTLSVGPVENVVQLVTQPQLRDRLWW  60

Query  61   PGALLTDSAAKAKALKDYQHVMAQLASWEAEADDDVAATIKSVRQQLLNLNITGRLPVKL  120
            PGALLTDSAAKAKALKDYQHVMAQLASWEAEADDDV ATIKSVRQQLLNLNITGRLPVKL
Sbjct  61   PGALLTDSAAKAKALKDYQHVMAQLASWEAEADDDVVATIKSVRQQLLNLNITGRLPVKL  120

Query  121  DPDFVRVDENSNPPLVGDYTLYTVQRPVTITLLGAVSGAGQLPWQAGRSVTDYLQDHPRL  180
            DPDFVRVDENSNPPLVG+YTLYTVQRPVTITLLGAVSG GQLPWQAGRSVTDYLQDHPRL
Sbjct  121  DPDFVRVDENSNPPLVGNYTLYTVQRPVTITLLGAVSGTGQLPWQAGRSVTDYLQDHPRL  180

Query  181  AGADKNNVMVITPEGETVVAPVALWNKRHVEPPPGSQLWLGFSAHVLPEKYADLNDQIVS  240
            AGADKNNVMVITPEGETVVAPVALWNKRHVEPPPGSQLWLGFSAHVLPEKYADLNDQIVS
Sbjct  181  AGADKNNVMVITPEGETVVAPVALWNKRHVEPPPGSQLWLGFSAHVLPEKYADLNDQIVS  240

Query  241  VLTQRVPD  248
            +LTQRVPD
Sbjct  241  ILTQRVPD  248


>ref|YP_402624.1| hypothetical protein SDY_0961 [Shigella dysenteriae Sd197]
 gb|ABB61133.1| conserved hypothetical protein [Shigella dysenteriae Sd197]
Length=248

 Score =  497 bits (1280),  Expect = 6e-139, Method: Compositional matrix adjust.
 Identities = 244/248 (98%), Positives = 246/248 (99%), Gaps = 0/248 (0%)

Query  1    MNKLQSYFIASVLYVMTPHAFAQGTVTIYLPGEQQTLSVGPVENVVQLVTQPQLRDRLWW  60
            MNKLQSYFIASVLYVMTPHAFAQGTVTIYLPGEQQTLSVGPVENV QLVTQPQLRDRLWW
Sbjct  1    MNKLQSYFIASVLYVMTPHAFAQGTVTIYLPGEQQTLSVGPVENVAQLVTQPQLRDRLWW  60

Query  61   PGALLTDSAAKAKALKDYQHVMAQLASWEAEADDDVAATIKSVRQQLLNLNITGRLPVKL  120
            PGALLTDSA KAKALKDYQHVMAQLASWEAEADDDVAATIKSVRQQLLNLNITGRLPVKL
Sbjct  61   PGALLTDSAVKAKALKDYQHVMAQLASWEAEADDDVAATIKSVRQQLLNLNITGRLPVKL  120

Query  121  DPDFVRVDENSNPPLVGDYTLYTVQRPVTITLLGAVSGAGQLPWQAGRSVTDYLQDHPRL  180
            DPDFVRVDENSNPPLVGDYTLYTVQRPVTITLLGAVSGAGQLPWQAGRSVTDYLQDHPRL
Sbjct  121  DPDFVRVDENSNPPLVGDYTLYTVQRPVTITLLGAVSGAGQLPWQAGRSVTDYLQDHPRL  180

Query  181  AGADKNNVMVITPEGETVVAPVALWNKRHVEPPPGSQLWLGFSAHVLPEKYADLNDQIVS  240
            AGADKNNV+VITPEGETV+APVALWNKRHVEPPPGSQLWLGFSAHVLPEKYADLNDQIVS
Sbjct  181  AGADKNNVIVITPEGETVIAPVALWNKRHVEPPPGSQLWLGFSAHVLPEKYADLNDQIVS  240

Query  241  VLTQRVPD  248
            VLTQRVPD
Sbjct  241  VLTQRVPD  248


>ref|ZP_08383110.1| conserved hypothetical protein [Escherichia coli H299]
 gb|EGI51301.1| conserved hypothetical protein [Escherichia coli H299]
Length=248

 Score =  497 bits (1279),  Expect = 7e-139, Method: Compositional matrix adjust.
 Identities = 245/248 (98%), Positives = 245/248 (98%), Gaps = 0/248 (0%)

Query  1    MNKLQSYFIASVLYVMTPHAFAQGTVTIYLPGEQQTLSVGPVENVVQLVTQPQLRDRLWW  60
            MNKLQSYFIASVLYVMTPHAFAQGTVTIYLPGEQQTLSVGPVENVVQLVTQPQLRDRLWW
Sbjct  1    MNKLQSYFIASVLYVMTPHAFAQGTVTIYLPGEQQTLSVGPVENVVQLVTQPQLRDRLWW  60

Query  61   PGALLTDSAAKAKALKDYQHVMAQLASWEAEADDDVAATIKSVRQQLLNLNITGRLPVKL  120
            PGALLTDSAAKAKALKDYQHVMAQLASWEAEADDDV ATIKSVRQQLLNL ITGRLPVKL
Sbjct  61   PGALLTDSAAKAKALKDYQHVMAQLASWEAEADDDVVATIKSVRQQLLNLKITGRLPVKL  120

Query  121  DPDFVRVDENSNPPLVGDYTLYTVQRPVTITLLGAVSGAGQLPWQAGRSVTDYLQDHPRL  180
            DPDFVRVDENSNPPLVGDYTLYTVQRPVTITLLGAVSG GQLPWQAGRSVTDYLQDHPRL
Sbjct  121  DPDFVRVDENSNPPLVGDYTLYTVQRPVTITLLGAVSGTGQLPWQAGRSVTDYLQDHPRL  180

Query  181  AGADKNNVMVITPEGETVVAPVALWNKRHVEPPPGSQLWLGFSAHVLPEKYADLNDQIVS  240
            AGADKNNVMVITPEGETVVAPVALWNKRHVEPPPGSQLWLGFSAHVLPEKYADLNDQIVS
Sbjct  181  AGADKNNVMVITPEGETVVAPVALWNKRHVEPPPGSQLWLGFSAHVLPEKYADLNDQIVS  240

Query  241  VLTQRVPD  248
            VLTQRVPD
Sbjct  241  VLTQRVPD  248


>ref|NP_286922.1| hypothetical protein Z1402 [Escherichia coli O157:H7 EDL933]
 ref|NP_309168.1| hypothetical protein ECs1141 [Escherichia coli O157:H7 str. Sakai]
 ref|ZP_02772422.1| group 4 capsule (G4C) polysaccharide, YmcB [Escherichia coli 
O157:H7 str. EC4113]
 44 more sequence titles
 Length=248

 Score =  496 bits (1277),  Expect = 1e-138, Method: Compositional matrix adjust.
 Identities = 244/248 (98%), Positives = 246/248 (99%), Gaps = 0/248 (0%)

Query  1    MNKLQSYFIASVLYVMTPHAFAQGTVTIYLPGEQQTLSVGPVENVVQLVTQPQLRDRLWW  60
            MNKLQSYFIASVLYVMTPHAFAQGTVTIYLPGEQQTLSVGPVENV QLVTQPQLRDRLWW
Sbjct  1    MNKLQSYFIASVLYVMTPHAFAQGTVTIYLPGEQQTLSVGPVENVAQLVTQPQLRDRLWW  60

Query  61   PGALLTDSAAKAKALKDYQHVMAQLASWEAEADDDVAATIKSVRQQLLNLNITGRLPVKL  120
            PGALLTDSA KAKALKDYQHVMAQLASWEAEADDDVAATIKSVRQQLLNLNITGRLPVKL
Sbjct  61   PGALLTDSAVKAKALKDYQHVMAQLASWEAEADDDVAATIKSVRQQLLNLNITGRLPVKL  120

Query  121  DPDFVRVDENSNPPLVGDYTLYTVQRPVTITLLGAVSGAGQLPWQAGRSVTDYLQDHPRL  180
            DPDFVRVDENSNPPLVG+YTLYTVQRPVTITLLGAVSGAGQLPWQAGRSVTDYLQDHPRL
Sbjct  121  DPDFVRVDENSNPPLVGNYTLYTVQRPVTITLLGAVSGAGQLPWQAGRSVTDYLQDHPRL  180

Query  181  AGADKNNVMVITPEGETVVAPVALWNKRHVEPPPGSQLWLGFSAHVLPEKYADLNDQIVS  240
            AGADKNNV+VITPEGETVVAPVALWNKRHVEPPPGSQLWLGFSAHVLPEKYADLNDQIVS
Sbjct  181  AGADKNNVIVITPEGETVVAPVALWNKRHVEPPPGSQLWLGFSAHVLPEKYADLNDQIVS  240

Query  241  VLTQRVPD  248
            VLTQRVPD
Sbjct  241  VLTQRVPD  248


>ref|YP_002328533.1| hypothetical protein E2348C_0970 [Escherichia coli O127:H6 str. 
E2348/69]
 emb|CAS08518.1| predicted protein [Escherichia coli O127:H6 str. E2348/69]
Length=248

 Score =  495 bits (1274),  Expect = 3e-138, Method: Compositional matrix adjust.
 Identities = 244/248 (98%), Positives = 245/248 (98%), Gaps = 0/248 (0%)

Query  1    MNKLQSYFIASVLYVMTPHAFAQGTVTIYLPGEQQTLSVGPVENVVQLVTQPQLRDRLWW  60
            MNKLQSYFIASVLYVMTPHAFAQG VTIYLPGEQQTLSVGPVENV QLVTQPQLRDRLWW
Sbjct  1    MNKLQSYFIASVLYVMTPHAFAQGMVTIYLPGEQQTLSVGPVENVAQLVTQPQLRDRLWW  60

Query  61   PGALLTDSAAKAKALKDYQHVMAQLASWEAEADDDVAATIKSVRQQLLNLNITGRLPVKL  120
            PGALLTDSAAKAKALKDYQHVMAQLASWEAEADDDVAATIKSVRQQLLNLNITGRLPVKL
Sbjct  61   PGALLTDSAAKAKALKDYQHVMAQLASWEAEADDDVAATIKSVRQQLLNLNITGRLPVKL  120

Query  121  DPDFVRVDENSNPPLVGDYTLYTVQRPVTITLLGAVSGAGQLPWQAGRSVTDYLQDHPRL  180
            DPDFVRVDENSNPPLVGDYTLYTVQRPVTITLLGAVSGAGQLPW AGRSVTDYLQDHPRL
Sbjct  121  DPDFVRVDENSNPPLVGDYTLYTVQRPVTITLLGAVSGAGQLPWLAGRSVTDYLQDHPRL  180

Query  181  AGADKNNVMVITPEGETVVAPVALWNKRHVEPPPGSQLWLGFSAHVLPEKYADLNDQIVS  240
            AGADKNNVMVITPEGETVVAPVALWNKRHVEPPPGSQLWLGFSAHVLPEKYADLNDQIVS
Sbjct  181  AGADKNNVMVITPEGETVVAPVALWNKRHVEPPPGSQLWLGFSAHVLPEKYADLNDQIVS  240

Query  241  VLTQRVPD  248
            VLTQRVP+
Sbjct  241  VLTQRVPE  248


>gb|AEE55762.1| conserved hypothetical protein [Escherichia coli UMNK88]
Length=248

 Score =  495 bits (1274),  Expect = 3e-138, Method: Compositional matrix adjust.
 Identities = 245/248 (98%), Positives = 245/248 (98%), Gaps = 0/248 (0%)

Query  1    MNKLQSYFIASVLYVMTPHAFAQGTVTIYLPGEQQTLSVGPVENVVQLVTQPQLRDRLWW  60
            MNKLQSYFIASVLYVMT HAFAQGTVTIYLPGEQQTLSVGPVENVVQLVTQPQLRDRLWW
Sbjct  1    MNKLQSYFIASVLYVMTHHAFAQGTVTIYLPGEQQTLSVGPVENVVQLVTQPQLRDRLWW  60

Query  61   PGALLTDSAAKAKALKDYQHVMAQLASWEAEADDDVAATIKSVRQQLLNLNITGRLPVKL  120
            PGALLTDSAAKAKALKDYQHVMAQLASWEAEADDDV ATIKSVRQQLLNLNITGRLPVKL
Sbjct  61   PGALLTDSAAKAKALKDYQHVMAQLASWEAEADDDVVATIKSVRQQLLNLNITGRLPVKL  120

Query  121  DPDFVRVDENSNPPLVGDYTLYTVQRPVTITLLGAVSGAGQLPWQAGRSVTDYLQDHPRL  180
            DPDFVRVDENSNPPLVGDYTLYTVQRPVTITLLGAVSGAGQLPWQAGRSVTDYLQD PRL
Sbjct  121  DPDFVRVDENSNPPLVGDYTLYTVQRPVTITLLGAVSGAGQLPWQAGRSVTDYLQDRPRL  180

Query  181  AGADKNNVMVITPEGETVVAPVALWNKRHVEPPPGSQLWLGFSAHVLPEKYADLNDQIVS  240
            AGADKNNVMVITPEGETVVAPVALWNKRHVEPPPGSQLWLGFSAHVLPEKYADLNDQIVS
Sbjct  181  AGADKNNVMVITPEGETVVAPVALWNKRHVEPPPGSQLWLGFSAHVLPEKYADLNDQIVS  240

Query  241  VLTQRVPD  248
            VLTQRVPD
Sbjct  241  VLTQRVPD  248


>ref|YP_003498802.1| Group 4 capsule (G4C) polysaccharide, YmcB [Escherichia coli 
O55:H7 str. CB9615]
 gb|ACI85399.1| hypothetical protein ECs1141 [Escherichia coli]
 gb|ADD55818.1| Group 4 capsule (G4C) polysaccharide, YmcB [Escherichia coli 
O55:H7 str. CB9615]
 gb|EFX27157.1| Group 4 capsule (G4C) polysaccharide, YmcB [Escherichia coli 
O55:H7 str. USDA 5905]
Length=248

 Score =  494 bits (1273),  Expect = 3e-138, Method: Compositional matrix adjust.
 Identities = 243/248 (97%), Positives = 245/248 (98%), Gaps = 0/248 (0%)

Query  1    MNKLQSYFIASVLYVMTPHAFAQGTVTIYLPGEQQTLSVGPVENVVQLVTQPQLRDRLWW  60
            MNKLQSYFIASVLYVMTPHAFAQGTVTIYLPGEQQTLSVGPVENV QLVTQPQLRDRLWW
Sbjct  1    MNKLQSYFIASVLYVMTPHAFAQGTVTIYLPGEQQTLSVGPVENVAQLVTQPQLRDRLWW  60

Query  61   PGALLTDSAAKAKALKDYQHVMAQLASWEAEADDDVAATIKSVRQQLLNLNITGRLPVKL  120
            PG LLTDSA KAKALKDYQHVMAQLASWEAEADDDVAATIKSVRQQLLNLNITGRLPVKL
Sbjct  61   PGTLLTDSAVKAKALKDYQHVMAQLASWEAEADDDVAATIKSVRQQLLNLNITGRLPVKL  120

Query  121  DPDFVRVDENSNPPLVGDYTLYTVQRPVTITLLGAVSGAGQLPWQAGRSVTDYLQDHPRL  180
            DPDFVRVDENSNPPLVG+YTLYTVQRPVTITLLGAVSGAGQLPWQAGRSVTDYLQDHPRL
Sbjct  121  DPDFVRVDENSNPPLVGNYTLYTVQRPVTITLLGAVSGAGQLPWQAGRSVTDYLQDHPRL  180

Query  181  AGADKNNVMVITPEGETVVAPVALWNKRHVEPPPGSQLWLGFSAHVLPEKYADLNDQIVS  240
            AGADKNNV+VITPEGETVVAPVALWNKRHVEPPPGSQLWLGFSAHVLPEKYADLNDQIVS
Sbjct  181  AGADKNNVIVITPEGETVVAPVALWNKRHVEPPPGSQLWLGFSAHVLPEKYADLNDQIVS  240

Query  241  VLTQRVPD  248
            VLTQRVPD
Sbjct  241  VLTQRVPD  248


>gb|EGB62657.1| SLBB-domain-containing protein [Escherichia coli M863]
 gb|EGE65021.1| hypothetical protein ECSTEC7V_1672 [Escherichia coli STEC_7v]
Length=248

 Score =  494 bits (1271),  Expect = 6e-138, Method: Compositional matrix adjust.
 Identities = 244/248 (98%), Positives = 245/248 (98%), Gaps = 0/248 (0%)

Query  1    MNKLQSYFIASVLYVMTPHAFAQGTVTIYLPGEQQTLSVGPVENVVQLVTQPQLRDRLWW  60
            MNKLQSYFIASVLYVMTPHAFAQGTVTIYLPGEQQ LSVGPVENVVQLVTQPQLRDRLWW
Sbjct  1    MNKLQSYFIASVLYVMTPHAFAQGTVTIYLPGEQQPLSVGPVENVVQLVTQPQLRDRLWW  60

Query  61   PGALLTDSAAKAKALKDYQHVMAQLASWEAEADDDVAATIKSVRQQLLNLNITGRLPVKL  120
            PG LLTDSAAKAKALKDYQHVMAQLASWEAEADDDVAATIKSVRQQLLNLNITGRLPVKL
Sbjct  61   PGVLLTDSAAKAKALKDYQHVMAQLASWEAEADDDVAATIKSVRQQLLNLNITGRLPVKL  120

Query  121  DPDFVRVDENSNPPLVGDYTLYTVQRPVTITLLGAVSGAGQLPWQAGRSVTDYLQDHPRL  180
            DPDFVRVDENSNPPLVGDYTLYTVQRPVTITLLGAV+GAGQLPWQAGRSVTDYLQD PRL
Sbjct  121  DPDFVRVDENSNPPLVGDYTLYTVQRPVTITLLGAVTGAGQLPWQAGRSVTDYLQDTPRL  180

Query  181  AGADKNNVMVITPEGETVVAPVALWNKRHVEPPPGSQLWLGFSAHVLPEKYADLNDQIVS  240
            AGADKNNVMVITPEGETVVAPVALWNKRHVEPPPGSQLWLGFSAHVLPEKYADLNDQIVS
Sbjct  181  AGADKNNVMVITPEGETVVAPVALWNKRHVEPPPGSQLWLGFSAHVLPEKYADLNDQIVS  240

Query  241  VLTQRVPD  248
            VLTQRVPD
Sbjct  241  VLTQRVPD  248


>gb|EGB71723.1| SLBB-domain-containing protein [Escherichia coli TW10509]
Length=248

 Score =  493 bits (1270),  Expect = 9e-138, Method: Compositional matrix adjust.
 Identities = 244/248 (98%), Positives = 245/248 (98%), Gaps = 0/248 (0%)

Query  1    MNKLQSYFIASVLYVMTPHAFAQGTVTIYLPGEQQTLSVGPVENVVQLVTQPQLRDRLWW  60
            MNKLQSYFIASVLYVMTPHAFAQGTVTIYLPGEQQ LSVGPVENVVQLVTQPQLRDRLWW
Sbjct  1    MNKLQSYFIASVLYVMTPHAFAQGTVTIYLPGEQQPLSVGPVENVVQLVTQPQLRDRLWW  60

Query  61   PGALLTDSAAKAKALKDYQHVMAQLASWEAEADDDVAATIKSVRQQLLNLNITGRLPVKL  120
            PG LLTDSAAKAKALKDYQHVMAQLASWEAEADDDVAATIKSVRQQLLNLNITGRLPVKL
Sbjct  61   PGVLLTDSAAKAKALKDYQHVMAQLASWEAEADDDVAATIKSVRQQLLNLNITGRLPVKL  120

Query  121  DPDFVRVDENSNPPLVGDYTLYTVQRPVTITLLGAVSGAGQLPWQAGRSVTDYLQDHPRL  180
            DPDFVR+DENSNPPLVGDYTLYTVQRPVTITLLGAVSGAGQLPWQAGRSVTDYLQD PRL
Sbjct  121  DPDFVRMDENSNPPLVGDYTLYTVQRPVTITLLGAVSGAGQLPWQAGRSVTDYLQDTPRL  180

Query  181  AGADKNNVMVITPEGETVVAPVALWNKRHVEPPPGSQLWLGFSAHVLPEKYADLNDQIVS  240
            AGADKNNVMVITPEGETVVAPVALWNKRHVEPPPGSQLWLGFSAHVLPEKYADLNDQIVS
Sbjct  181  AGADKNNVMVITPEGETVVAPVALWNKRHVEPPPGSQLWLGFSAHVLPEKYADLNDQIVS  240

Query  241  VLTQRVPD  248
            VLTQRVPD
Sbjct  241  VLTQRVPD  248


>ref|ZP_07137361.1| conserved hypothetical protein [Escherichia coli MS 115-1]
 gb|EFJ95361.1| conserved hypothetical protein [Escherichia coli MS 115-1]
Length=248

 Score =  493 bits (1269),  Expect = 1e-137, Method: Compositional matrix adjust.
 Identities = 244/248 (98%), Positives = 246/248 (99%), Gaps = 0/248 (0%)

Query  1    MNKLQSYFIASVLYVMTPHAFAQGTVTIYLPGEQQTLSVGPVENVVQLVTQPQLRDRLWW  60
            MNKLQSYFIASVLYVMTPHAFAQGTVTIYLPGEQQTLSVGPVENVVQLVTQPQLRDRLWW
Sbjct  1    MNKLQSYFIASVLYVMTPHAFAQGTVTIYLPGEQQTLSVGPVENVVQLVTQPQLRDRLWW  60

Query  61   PGALLTDSAAKAKALKDYQHVMAQLASWEAEADDDVAATIKSVRQQLLNLNITGRLPVKL  120
            PGALLTDSAAKAKALKDYQHV+AQLASWEAEADDDVAATIKSVRQQLLNLNITGRLPVKL
Sbjct  61   PGALLTDSAAKAKALKDYQHVIAQLASWEAEADDDVAATIKSVRQQLLNLNITGRLPVKL  120

Query  121  DPDFVRVDENSNPPLVGDYTLYTVQRPVTITLLGAVSGAGQLPWQAGRSVTDYLQDHPRL  180
            DPDFVRVDENSNPPLVGDYTLYTVQRPVTITLLGAVSGA QLPWQAGRSVTDYLQDHPRL
Sbjct  121  DPDFVRVDENSNPPLVGDYTLYTVQRPVTITLLGAVSGAEQLPWQAGRSVTDYLQDHPRL  180

Query  181  AGADKNNVMVITPEGETVVAPVALWNKRHVEPPPGSQLWLGFSAHVLPEKYADLNDQIVS  240
            AGADKNNV+VITPEGETVVAPVALWNKRHVEPP GSQLWLGFSAHVLPEKYADLNDQIVS
Sbjct  181  AGADKNNVIVITPEGETVVAPVALWNKRHVEPPLGSQLWLGFSAHVLPEKYADLNDQIVS  240

Query  241  VLTQRVPD  248
            VLTQRVPD
Sbjct  241  VLTQRVPD  248


>gb|EFZ75912.1| hypothetical protein ECRN5871_1089 [Escherichia coli RN587/1]
Length=248

 Score =  491 bits (1263),  Expect = 5e-137, Method: Compositional matrix adjust.
 Identities = 243/248 (97%), Positives = 243/248 (97%), Gaps = 0/248 (0%)

Query  1    MNKLQSYFIASVLYVMTPHAFAQGTVTIYLPGEQQTLSVGPVENVVQLVTQPQLRDRLWW  60
            MNKLQSYFIASVLYVMTPHAFAQG VTIYLPGEQQTLSVGPVENV QLVTQPQLRDRLWW
Sbjct  1    MNKLQSYFIASVLYVMTPHAFAQGMVTIYLPGEQQTLSVGPVENVAQLVTQPQLRDRLWW  60

Query  61   PGALLTDSAAKAKALKDYQHVMAQLASWEAEADDDVAATIKSVRQQLLNLNITGRLPVKL  120
            PGALLTDSAAKAKALKDYQHVMAQLASWEAEADDDVAATIKSVRQQLLNLNITGRLPVKL
Sbjct  61   PGALLTDSAAKAKALKDYQHVMAQLASWEAEADDDVAATIKSVRQQLLNLNITGRLPVKL  120

Query  121  DPDFVRVDENSNPPLVGDYTLYTVQRPVTITLLGAVSGAGQLPWQAGRSVTDYLQDHPRL  180
             PDFVRVDENSNPPLVGDYTLYTVQRPVTITLLGAVSGAGQLPW AGRSVTDYLQDH RL
Sbjct  121  APDFVRVDENSNPPLVGDYTLYTVQRPVTITLLGAVSGAGQLPWLAGRSVTDYLQDHTRL  180

Query  181  AGADKNNVMVITPEGETVVAPVALWNKRHVEPPPGSQLWLGFSAHVLPEKYADLNDQIVS  240
            AGADKNNVMVITPEGETVVAPVALWNKRHVEPPPGSQLWLGFSAHVLPEKYADLNDQIVS
Sbjct  181  AGADKNNVMVITPEGETVVAPVALWNKRHVEPPPGSQLWLGFSAHVLPEKYADLNDQIVS  240

Query  241  VLTQRVPD  248
            VLTQRVPD
Sbjct  241  VLTQRVPD  248


>gb|EGJ00865.1| hypothetical protein SD15574_1331 [Shigella dysenteriae 155-74]
Length=233

 Score =  469 bits (1207),  Expect = 2e-130, Method: Compositional matrix adjust.
 Identities = 232/233 (99%), Positives = 232/233 (99%), Gaps = 0/233 (0%)

Query  16   MTPHAFAQGTVTIYLPGEQQTLSVGPVENVVQLVTQPQLRDRLWWPGALLTDSAAKAKAL  75
            MTPHAFAQGTVTIYLPGEQQTLSVGPVENVVQLVTQPQLRDRLWWPGALLTDSAAKAKAL
Sbjct  1    MTPHAFAQGTVTIYLPGEQQTLSVGPVENVVQLVTQPQLRDRLWWPGALLTDSAAKAKAL  60

Query  76   KDYQHVMAQLASWEAEADDDVAATIKSVRQQLLNLNITGRLPVKLDPDFVRVDENSNPPL  135
            KDYQHVMAQLASWEAEADDDVAATIKSVRQQLLNLNITGRLPVKL PDFVRVDENSNPPL
Sbjct  61   KDYQHVMAQLASWEAEADDDVAATIKSVRQQLLNLNITGRLPVKLAPDFVRVDENSNPPL  120

Query  136  VGDYTLYTVQRPVTITLLGAVSGAGQLPWQAGRSVTDYLQDHPRLAGADKNNVMVITPEG  195
            VGDYTLYTVQRPVTITLLGAVSGAGQLPWQAGRSVTDYLQDHPRLAGADKNNVMVITPEG
Sbjct  121  VGDYTLYTVQRPVTITLLGAVSGAGQLPWQAGRSVTDYLQDHPRLAGADKNNVMVITPEG  180

Query  196  ETVVAPVALWNKRHVEPPPGSQLWLGFSAHVLPEKYADLNDQIVSVLTQRVPD  248
            ETVVAPVALWNKRHVEPPPGSQLWLGFSAHVLPEKYADLNDQIVSVLTQRVPD
Sbjct  181  ETVVAPVALWNKRHVEPPPGSQLWLGFSAHVLPEKYADLNDQIVSVLTQRVPD  233


>ref|ZP_07783163.1| uncharacterized protein gfcC [Escherichia coli 2362-75]
 gb|EFR14271.1| uncharacterized protein gfcC [Escherichia coli 2362-75]
Length=233

 Score =  466 bits (1198),  Expect = 2e-129, Method: Compositional matrix adjust.
 Identities = 230/233 (98%), Positives = 230/233 (98%), Gaps = 0/233 (0%)

Query  16   MTPHAFAQGTVTIYLPGEQQTLSVGPVENVVQLVTQPQLRDRLWWPGALLTDSAAKAKAL  75
            MTPHAFAQG VTIYLPGEQQTLSVGPVENV QLVTQPQLRDRLWWPGALLTDSAAKAKAL
Sbjct  1    MTPHAFAQGMVTIYLPGEQQTLSVGPVENVAQLVTQPQLRDRLWWPGALLTDSAAKAKAL  60

Query  76   KDYQHVMAQLASWEAEADDDVAATIKSVRQQLLNLNITGRLPVKLDPDFVRVDENSNPPL  135
            KDYQHVMAQLASWEAEADDDVAATIKSVRQQLLNLNITGRLPVKLDPDFVRVDENSNPPL
Sbjct  61   KDYQHVMAQLASWEAEADDDVAATIKSVRQQLLNLNITGRLPVKLDPDFVRVDENSNPPL  120

Query  136  VGDYTLYTVQRPVTITLLGAVSGAGQLPWQAGRSVTDYLQDHPRLAGADKNNVMVITPEG  195
            VGDYTLYTVQRPVTITLLGAVSGAGQLPW AGRSVTDYLQDHPRLAGADKNNVMVITPEG
Sbjct  121  VGDYTLYTVQRPVTITLLGAVSGAGQLPWLAGRSVTDYLQDHPRLAGADKNNVMVITPEG  180

Query  196  ETVVAPVALWNKRHVEPPPGSQLWLGFSAHVLPEKYADLNDQIVSVLTQRVPD  248
            ETVVAPVALWNKRHVEPPPGSQLWLGFSAHVLPEKYADLNDQIVSVLTQRVPD
Sbjct  181  ETVVAPVALWNKRHVEPPPGSQLWLGFSAHVLPEKYADLNDQIVSVLTQRVPD  233


>ref|ZP_07681343.1| conserved hypothetical protein [Shigella dysenteriae 1617]
 gb|EFP70903.1| conserved hypothetical protein [Shigella dysenteriae 1617]
Length=233

 Score =  464 bits (1193),  Expect = 7e-129, Method: Compositional matrix adjust.
 Identities = 228/233 (97%), Positives = 230/233 (98%), Gaps = 0/233 (0%)

Query  16   MTPHAFAQGTVTIYLPGEQQTLSVGPVENVVQLVTQPQLRDRLWWPGALLTDSAAKAKAL  75
            MTPHAFAQGTVTIYLPGEQQTLSVGPVENV QLVTQPQLRDRLWWPGALLTDSA KAKAL
Sbjct  1    MTPHAFAQGTVTIYLPGEQQTLSVGPVENVAQLVTQPQLRDRLWWPGALLTDSAVKAKAL  60

Query  76   KDYQHVMAQLASWEAEADDDVAATIKSVRQQLLNLNITGRLPVKLDPDFVRVDENSNPPL  135
            KDYQHVMAQLASWEAEADDDVAATIKSVRQQLLNLNITGRLPVKL PDFVRVDENSNPPL
Sbjct  61   KDYQHVMAQLASWEAEADDDVAATIKSVRQQLLNLNITGRLPVKLAPDFVRVDENSNPPL  120

Query  136  VGDYTLYTVQRPVTITLLGAVSGAGQLPWQAGRSVTDYLQDHPRLAGADKNNVMVITPEG  195
            VGDYTLYTVQRPVTITLLGAVSGAGQLPWQAGRSVTDYLQDHPRLAGADKNNV+VITPEG
Sbjct  121  VGDYTLYTVQRPVTITLLGAVSGAGQLPWQAGRSVTDYLQDHPRLAGADKNNVIVITPEG  180

Query  196  ETVVAPVALWNKRHVEPPPGSQLWLGFSAHVLPEKYADLNDQIVSVLTQRVPD  248
            ETV+APVALWNKRHVEPPPGSQLWLGFSAHVLPEKYADLNDQIVSVLTQRVPD
Sbjct  181  ETVIAPVALWNKRHVEPPPGSQLWLGFSAHVLPEKYADLNDQIVSVLTQRVPD  233


>ref|ZP_02904453.1| group 4 capsule (G4C) polysaccharide, YmcB [Escherichia albertii 
TW07627]
 gb|EDS90115.1| group 4 capsule (G4C) polysaccharide, YmcB [Escherichia albertii 
TW07627]
Length=248

 Score =  449 bits (1156),  Expect = 1e-124, Method: Compositional matrix adjust.
 Identities = 221/248 (89%), Positives = 230/248 (92%), Gaps = 0/248 (0%)

Query  1    MNKLQSYFIASVLYVMTPHAFAQGTVTIYLPGEQQTLSVGPVENVVQLVTQPQLRDRLWW  60
            MNKL SYFIASVLYV+TPHAFAQG+VT+YLPGE+Q LSV  VENV QLVTQPQLRDRLWW
Sbjct  1    MNKLPSYFIASVLYVITPHAFAQGSVTVYLPGEKQALSVESVENVAQLVTQPQLRDRLWW  60

Query  61   PGALLTDSAAKAKALKDYQHVMAQLASWEAEADDDVAATIKSVRQQLLNLNITGRLPVKL  120
            PGALLTDSAAKAKA KDYQHVMAQLASWEAEADDDVAATIK VRQQL NLNITGRL V+L
Sbjct  61   PGALLTDSAAKAKADKDYQHVMAQLASWEAEADDDVAATIKFVRQQLTNLNITGRLSVEL  120

Query  121  DPDFVRVDENSNPPLVGDYTLYTVQRPVTITLLGAVSGAGQLPWQAGRSVTDYLQDHPRL  180
            DPDFVRVDE+SN PLVGDY LY VQRP TITLLGAVSGAGQLPW+AGRSV DYLQ HPRL
Sbjct  121  DPDFVRVDEDSNRPLVGDYALYAVQRPSTITLLGAVSGAGQLPWRAGRSVADYLQHHPRL  180

Query  181  AGADKNNVMVITPEGETVVAPVALWNKRHVEPPPGSQLWLGFSAHVLPEKYADLNDQIVS  240
            AGAD NNV VITPEG+TVVAPVALWNKRHVEPPPGSQLWLGFS HVLPEKYADLN+QIVS
Sbjct  181  AGADSNNVFVITPEGKTVVAPVALWNKRHVEPPPGSQLWLGFSTHVLPEKYADLNNQIVS  240

Query  241  VLTQRVPD  248
            VLTQRVPD
Sbjct  241  VLTQRVPD  248


>gb|EGI97539.1| hypothetical protein SB521682_1222 [Shigella boydii 5216-82]
Length=223

 Score =  448 bits (1152),  Expect = 4e-124, Method: Compositional matrix adjust.
 Identities = 221/223 (99%), Positives = 223/223 (100%), Gaps = 0/223 (0%)

Query  26   VTIYLPGEQQTLSVGPVENVVQLVTQPQLRDRLWWPGALLTDSAAKAKALKDYQHVMAQL  85
            +TIYLPGEQQTLSVGPVENVVQLVTQPQLRDRLWWPGALLTDSAAKAKALKDYQHVMAQL
Sbjct  1    MTIYLPGEQQTLSVGPVENVVQLVTQPQLRDRLWWPGALLTDSAAKAKALKDYQHVMAQL  60

Query  86   ASWEAEADDDVAATIKSVRQQLLNLNITGRLPVKLDPDFVRVDENSNPPLVGDYTLYTVQ  145
            ASWEAEADDDVAATIKSVRQQLLNLNITGRLPVKLDPDFVRVDENSNPPLVGDYTLYTVQ
Sbjct  61   ASWEAEADDDVAATIKSVRQQLLNLNITGRLPVKLDPDFVRVDENSNPPLVGDYTLYTVQ  120

Query  146  RPVTITLLGAVSGAGQLPWQAGRSVTDYLQDHPRLAGADKNNVMVITPEGETVVAPVALW  205
            RPVTITLLG+VSGAGQLPWQAGRSVTDYLQDHPRLAGADKNNVMVITPEGETVVAPVALW
Sbjct  121  RPVTITLLGSVSGAGQLPWQAGRSVTDYLQDHPRLAGADKNNVMVITPEGETVVAPVALW  180

Query  206  NKRHVEPPPGSQLWLGFSAHVLPEKYADLNDQIVSVLTQRVPD  248
            NKRHVEPPPGSQLWLGFSAHVLPEKYADLNDQIVSVLTQRVPD
Sbjct  181  NKRHVEPPPGSQLWLGFSAHVLPEKYADLNDQIVSVLTQRVPD  223


>gb|EFW50291.1| YjbG polysaccharide synthesis-related protein [Shigella dysenteriae 
CDC 74-1112]
Length=223

 Score =  447 bits (1151),  Expect = 5e-124, Method: Compositional matrix adjust.
 Identities = 221/223 (99%), Positives = 223/223 (100%), Gaps = 0/223 (0%)

Query  26   VTIYLPGEQQTLSVGPVENVVQLVTQPQLRDRLWWPGALLTDSAAKAKALKDYQHVMAQL  85
            +TIYLPGEQQTLSVGP+ENVVQLVTQPQLRDRLWWPGALLTDSAAKAKALKDYQHVMAQL
Sbjct  1    MTIYLPGEQQTLSVGPMENVVQLVTQPQLRDRLWWPGALLTDSAAKAKALKDYQHVMAQL  60

Query  86   ASWEAEADDDVAATIKSVRQQLLNLNITGRLPVKLDPDFVRVDENSNPPLVGDYTLYTVQ  145
            ASWEAEADDDVAATIKSVRQQLLNLNITGRLPVKLDPDFVRVDENSNPPLVGDYTLYTVQ
Sbjct  61   ASWEAEADDDVAATIKSVRQQLLNLNITGRLPVKLDPDFVRVDENSNPPLVGDYTLYTVQ  120

Query  146  RPVTITLLGAVSGAGQLPWQAGRSVTDYLQDHPRLAGADKNNVMVITPEGETVVAPVALW  205
            RPVTITLLGAVSGAGQLPWQAGRSVTDYLQDHPRLAGADKNNVMVITPEGETVVAPVALW
Sbjct  121  RPVTITLLGAVSGAGQLPWQAGRSVTDYLQDHPRLAGADKNNVMVITPEGETVVAPVALW  180

Query  206  NKRHVEPPPGSQLWLGFSAHVLPEKYADLNDQIVSVLTQRVPD  248
            NKRHVEPPPGSQLWLGFSAHVLPEKYADLNDQIVSVLTQRVPD
Sbjct  181  NKRHVEPPPGSQLWLGFSAHVLPEKYADLNDQIVSVLTQRVPD  223


>gb|EFW53537.1| YjbG polysaccharide synthesis-related protein [Shigella boydii 
ATCC 9905]
Length=223

 Score =  447 bits (1151),  Expect = 5e-124, Method: Compositional matrix adjust.
 Identities = 221/223 (99%), Positives = 223/223 (100%), Gaps = 0/223 (0%)

Query  26   VTIYLPGEQQTLSVGPVENVVQLVTQPQLRDRLWWPGALLTDSAAKAKALKDYQHVMAQL  85
            +TIYLPGEQQTLSVGPVENVV+LVTQPQLRDRLWWPGALLTDSAAKAKALKDYQHVMAQL
Sbjct  1    MTIYLPGEQQTLSVGPVENVVKLVTQPQLRDRLWWPGALLTDSAAKAKALKDYQHVMAQL  60

Query  86   ASWEAEADDDVAATIKSVRQQLLNLNITGRLPVKLDPDFVRVDENSNPPLVGDYTLYTVQ  145
            ASWEAEADDDVAATIKSVRQQLLNLNITGRLPVKLDPDFVRVDENSNPPLVGDYTLYTVQ
Sbjct  61   ASWEAEADDDVAATIKSVRQQLLNLNITGRLPVKLDPDFVRVDENSNPPLVGDYTLYTVQ  120

Query  146  RPVTITLLGAVSGAGQLPWQAGRSVTDYLQDHPRLAGADKNNVMVITPEGETVVAPVALW  205
            RPVTITLLGAVSGAGQLPWQAGRSVTDYLQDHPRLAGADKNNVMVITPEGETVVAPVALW
Sbjct  121  RPVTITLLGAVSGAGQLPWQAGRSVTDYLQDHPRLAGADKNNVMVITPEGETVVAPVALW  180

Query  206  NKRHVEPPPGSQLWLGFSAHVLPEKYADLNDQIVSVLTQRVPD  248
            NKRHVEPPPGSQLWLGFSAHVLPEKYADLNDQIVSVLTQRVPD
Sbjct  181  NKRHVEPPPGSQLWLGFSAHVLPEKYADLNDQIVSVLTQRVPD  223


>pdb|3P42|A Chain A, Structure Of Gfcc (Ymcb), Protein Encoded By The E. 
Coli Group 4 Capsule Operon
 pdb|3P42|B Chain B, Structure Of Gfcc (Ymcb), Protein Encoded By The E. 
Coli Group 4 Capsule Operon
 pdb|3P42|C Chain C, Structure Of Gfcc (Ymcb), Protein Encoded By The E. 
Coli Group 4 Capsule Operon
 pdb|3P42|D Chain D, Structure Of Gfcc (Ymcb), Protein Encoded By The E. 
Coli Group 4 Capsule Operon
Length=236

 Score =  444 bits (1142),  Expect = 5e-123, Method: Compositional matrix adjust.
 Identities = 221/227 (97%), Positives = 222/227 (97%), Gaps = 0/227 (0%)

Query  22   AQGTVTIYLPGEQQTLSVGPVENVVQLVTQPQLRDRLWWPGALLTDSAAKAKALKDYQHV  81
            AQG VTIYLPGEQQTLSVGPVENV QLVTQPQLRDRLWWPGALLTDSAAKAKALKDYQHV
Sbjct  2    AQGXVTIYLPGEQQTLSVGPVENVAQLVTQPQLRDRLWWPGALLTDSAAKAKALKDYQHV  61

Query  82   MAQLASWEAEADDDVAATIKSVRQQLLNLNITGRLPVKLDPDFVRVDENSNPPLVGDYTL  141
             AQLASWEAEADDDVAATIKSVRQQLLNLNITGRLPVKLDPDFVRVDENSNPPLVGDYTL
Sbjct  62   XAQLASWEAEADDDVAATIKSVRQQLLNLNITGRLPVKLDPDFVRVDENSNPPLVGDYTL  121

Query  142  YTVQRPVTITLLGAVSGAGQLPWQAGRSVTDYLQDHPRLAGADKNNVMVITPEGETVVAP  201
            YTVQRPVTITLLGAVSGAGQLPW AGRSVTDYLQDHPRLAGADKNNV VITPEGETVVAP
Sbjct  122  YTVQRPVTITLLGAVSGAGQLPWLAGRSVTDYLQDHPRLAGADKNNVXVITPEGETVVAP  181

Query  202  VALWNKRHVEPPPGSQLWLGFSAHVLPEKYADLNDQIVSVLTQRVPD  248
            VALWNKRHVEPPPGSQLWLGFSAHVLPEKYADLNDQIVSVLTQRVP+
Sbjct  182  VALWNKRHVEPPPGSQLWLGFSAHVLPEKYADLNDQIVSVLTQRVPE  228


>gb|EFW71149.1| YjbG polysaccharide synthesis-related protein [Escherichia coli 
WV_060327]
Length=223

 Score =  440 bits (1132),  Expect = 8e-122, Method: Compositional matrix adjust.
 Identities = 218/223 (97%), Positives = 219/223 (98%), Gaps = 0/223 (0%)

Query  26   VTIYLPGEQQTLSVGPVENVVQLVTQPQLRDRLWWPGALLTDSAAKAKALKDYQHVMAQL  85
            +TIYLPGEQQTLSVGPVENV QLVTQPQLRDRLWWPGALLTDSAAKAKALKDYQHVMAQL
Sbjct  1    MTIYLPGEQQTLSVGPVENVAQLVTQPQLRDRLWWPGALLTDSAAKAKALKDYQHVMAQL  60

Query  86   ASWEAEADDDVAATIKSVRQQLLNLNITGRLPVKLDPDFVRVDENSNPPLVGDYTLYTVQ  145
            ASWEAEADDDVAATIKSVRQQLLNLNITGRLPVKLDPDFVRVDENSNPPLVGDYTLYTVQ
Sbjct  61   ASWEAEADDDVAATIKSVRQQLLNLNITGRLPVKLDPDFVRVDENSNPPLVGDYTLYTVQ  120

Query  146  RPVTITLLGAVSGAGQLPWQAGRSVTDYLQDHPRLAGADKNNVMVITPEGETVVAPVALW  205
            RPVTITLLGAVSGAGQLPW AGRSVTDYLQDH RLAGADKNNVMVITPEGE VVAPVALW
Sbjct  121  RPVTITLLGAVSGAGQLPWLAGRSVTDYLQDHTRLAGADKNNVMVITPEGEAVVAPVALW  180

Query  206  NKRHVEPPPGSQLWLGFSAHVLPEKYADLNDQIVSVLTQRVPD  248
            NKRHVEPPPGSQLWLGFSAHVLPEKYADLNDQIVSVLTQRVPD
Sbjct  181  NKRHVEPPPGSQLWLGFSAHVLPEKYADLNDQIVSVLTQRVPD  223


>gb|EGK28119.1| hypothetical protein SFK272_1492 [Shigella flexneri K-272]
 gb|EGK39944.1| hypothetical protein SFK227_0667 [Shigella flexneri K-227]
Length=161

 Score =  319 bits (818),  Expect = 2e-85, Method: Compositional matrix adjust.
 Identities = 157/158 (99%), Positives = 158/158 (100%), Gaps = 0/158 (0%)

Query  91   EADDDVAATIKSVRQQLLNLNITGRLPVKLDPDFVRVDENSNPPLVGDYTLYTVQRPVTI  150
            +ADDDVAATIKSVRQQLLNLNITGRLPVKLDPDFVRVDENSNPPLVGDYTLYTVQRPVTI
Sbjct  4    QADDDVAATIKSVRQQLLNLNITGRLPVKLDPDFVRVDENSNPPLVGDYTLYTVQRPVTI  63

Query  151  TLLGAVSGAGQLPWQAGRSVTDYLQDHPRLAGADKNNVMVITPEGETVVAPVALWNKRHV  210
            TLLGAVSGAGQLPWQAGRSVTDYLQDHPRLAGADKNNVMVITPEGETVVAPVALWNKRHV
Sbjct  64   TLLGAVSGAGQLPWQAGRSVTDYLQDHPRLAGADKNNVMVITPEGETVVAPVALWNKRHV  123

Query  211  EPPPGSQLWLGFSAHVLPEKYADLNDQIVSVLTQRVPD  248
            EPPPGSQLWLGFSAHVLPEKYADLNDQIVSVLTQRVPD
Sbjct  124  EPPPGSQLWLGFSAHVLPEKYADLNDQIVSVLTQRVPD  161


>emb|CBX82236.1| Uncharacterized protein gfcC Group 4 capsule protein C homolog; 
Flags: Precursor [Erwinia amylovora ATCC BAA-2158]
Length=251

 Score =  219 bits (558),  Expect = 3e-55, Method: Compositional matrix adjust.
 Identities = 114/252 (45%), Positives = 166/252 (65%), Gaps = 5/252 (1%)

Query  1    MNKLQSYFIASVLYVMTPHAFAQGTVTIYLPGEQQTLSVGPVENVVQLVTQPQLRDRLWW  60
            M K+ +  +A +L V++  A A G V I+ PG+ Q L V  + ++ QLVT P L  + WW
Sbjct  1    MKKI-TILLAGILAVLSLQARADGKVNIFYPGQNQPLVVNHMADLEQLVTNPALAQKTWW  59

Query  61   PGALLTDSAAKAKALKDYQHVMAQLASW--EAEADDD--VAATIKSVRQQLLNLNITGRL  116
            PG  + +  A A  ++  Q ++A+L +W  +   DDD  +AA +++VRQQ+  L +TGR 
Sbjct  60   PGTAIGEKQATAGVIQQQQQLLARLQTWRDQLRNDDDGALAAAVENVRQQIAALKVTGRQ  119

Query  117  PVKLDPDFVRVDENSNPPLVGDYTLYTVQRPVTITLLGAVSGAGQLPWQAGRSVTDYLQD  176
             V LDPD+VR+   +N  L G+Y++YT+++P +ITL G +  +G+ PW AGRS + YL +
Sbjct  120  FVNLDPDWVRLRPGANRRLEGEYSVYTLKKPTSITLAGVIENSGRTPWVAGRSASGYLSE  179

Query  177  HPRLAGADKNNVMVITPEGETVVAPVALWNKRHVEPPPGSQLWLGFSAHVLPEKYADLND  236
            HPR++GA++N  ++I+P GE    PVA WN RH EP  GS L++GFSA  LP  YADLN 
Sbjct  180  HPRMSGAERNIALLISPGGEVSEVPVAYWNHRHTEPQAGSTLFVGFSAWTLPRAYADLNI  239

Query  237  QIVSVLTQRVPD  248
            QIVSVLT R+PD
Sbjct  240  QIVSVLTHRIPD  251


>ref|YP_001909022.1| Conserved hypothetical protein YmcB [Erwinia tasmaniensis Et1/99]
 emb|CAO98154.1| Conserved hypothetical protein YmcB [Erwinia tasmaniensis Et1/99]
Length=251

 Score =  218 bits (555),  Expect = 7e-55, Method: Compositional matrix adjust.
 Identities = 113/245 (46%), Positives = 158/245 (64%), Gaps = 4/245 (1%)

Query  8    FIASVLYVMTPHAFAQGTVTIYLPGEQQTLSVGPVENVVQLVTQPQLRDRLWWPGALLTD  67
            F+  +   ++  A A G V I+ PG+ Q L V PV ++ QLVT P L  + WWPG  + +
Sbjct  7    FLTVIAVALSQLALADGRVNIFYPGQSQPLVVNPVADLEQLVTDPALAQKTWWPGTAIGE  66

Query  68   SAAKAKALKDYQHVMAQLASWE----AEADDDVAATIKSVRQQLLNLNITGRLPVKLDPD  123
              A   AL+  Q ++A+L +W      E DD  AAT+ +VR+Q+  L +TGR  V LDPD
Sbjct  67   KLATVGALQQQQQLLARLQAWRDRLHNEGDDSQAATVDNVRRQIAVLKVTGRQFVNLDPD  126

Query  124  FVRVDENSNPPLVGDYTLYTVQRPVTITLLGAVSGAGQLPWQAGRSVTDYLQDHPRLAGA  183
            +VR+   +N  L G+Y++YT+  P +ITL GA+   G++PW AGRS  +YL  HPR++GA
Sbjct  127  WVRLRPQANRRLQGEYSVYTLNEPTSITLAGAIESTGKVPWAAGRSAVEYLAAHPRMSGA  186

Query  184  DKNNVMVITPEGETVVAPVALWNKRHVEPPPGSQLWLGFSAHVLPEKYADLNDQIVSVLT  243
            +++  ++I+P GE    PVA WN+RHVEP  GS L++GFS   LP  YADLN QIVSVLT
Sbjct  187  ERSTALLISPGGEVTEIPVAYWNRRHVEPQAGSTLFIGFSTWTLPRAYADLNLQIVSVLT  246

Query  244  QRVPD  248
             R+PD
Sbjct  247  HRIPD  251


>ref|YP_002650310.1| capsular polysaccharide protein [Erwinia pyrifoliae Ep1/96]
 emb|CAX57108.1| Putative capsular polysaccharide protein [Erwinia pyrifoliae 
Ep1/96]
 emb|CAY75967.1| Uncharacterized protein ymcB precursor [Erwinia pyrifoliae DSM 
12163]
Length=251

 Score =  216 bits (549),  Expect = 4e-54, Method: Compositional matrix adjust.
 Identities = 110/245 (44%), Positives = 155/245 (63%), Gaps = 4/245 (1%)

Query  8    FIASVLYVMTPHAFAQGTVTIYLPGEQQTLSVGPVENVVQLVTQPQLRDRLWWPGALLTD  67
             +A ++  +T  A A   V I+ PG+ Q L V  + ++ QLVT P L ++ WWPG  + +
Sbjct  7    LLAGIVAALTLQARADSQVNIFYPGQNQPLVVNHMADLQQLVTNPALAEKTWWPGTTIAE  66

Query  68   SAAKAKALKDYQHVMAQLASWE----AEADDDVAATIKSVRQQLLNLNITGRLPVKLDPD  123
              A A A++  Q ++A+L +W      E DD  A  + +VRQQ+  L +TGR  V LDPD
Sbjct  67   KRATAVAIQQQQQLLARLQTWRDRLRNEGDDTQAVAVDNVRQQIAVLKVTGRQIVNLDPD  126

Query  124  FVRVDENSNPPLVGDYTLYTVQRPVTITLLGAVSGAGQLPWQAGRSVTDYLQDHPRLAGA  183
            +VR+    N  L G+Y++YT+  P +ITL G +   G+ PW AGRS  +YL  HPR++GA
Sbjct  127  WVRLRPQDNRWLQGEYSVYTLNEPTSITLAGVIEKTGKTPWAAGRSAVEYLDAHPRMSGA  186

Query  184  DKNNVMVITPEGETVVAPVALWNKRHVEPPPGSQLWLGFSAHVLPEKYADLNDQIVSVLT  243
            +++  ++I+P GE    PVA WN+RHVEP  GS L++GFSA  LP  YADLN QIVSVLT
Sbjct  187  ERSTALLISPGGEVTEIPVAYWNRRHVEPQAGSTLFIGFSAWTLPRAYADLNSQIVSVLT  246

Query  244  QRVPD  248
             R+PD
Sbjct  247  HRIPD  251


>ref|YP_003537358.1| exported protein [Erwinia amylovora ATCC 49946]
 emb|CBJ44932.1| putative exported protein [Erwinia amylovora ATCC 49946]
Length=251

 Score =  215 bits (548),  Expect = 4e-54, Method: Compositional matrix adjust.
 Identities = 113/252 (44%), Positives = 165/252 (65%), Gaps = 5/252 (1%)

Query  1    MNKLQSYFIASVLYVMTPHAFAQGTVTIYLPGEQQTLSVGPVENVVQLVTQPQLRDRLWW  60
            M K+ +  +A +L V++  A A G V I+ PG+ Q L V  + ++ QLVT P L  + WW
Sbjct  1    MKKI-TILLAGILAVLSLQARADGKVNIFYPGQNQPLVVNHMADLEQLVTNPALAQKTWW  59

Query  61   PGALLTDSAAKAKALKDYQHVMAQLASW--EAEADDD--VAATIKSVRQQLLNLNITGRL  116
            PG  + +  A A  ++  Q ++A+L +W  +   DDD  +AA +++VRQQ+  L +TGR 
Sbjct  60   PGTAIGEKQATAGVIQQQQQLLARLQTWRDQLRNDDDGALAAAVENVRQQIAALKVTGRQ  119

Query  117  PVKLDPDFVRVDENSNPPLVGDYTLYTVQRPVTITLLGAVSGAGQLPWQAGRSVTDYLQD  176
             V LDPD+VR+   +N  L G+Y++YT+++P +ITL G +  +G+ PW AGRS + YL +
Sbjct  120  FVNLDPDWVRLRPGANRRLEGEYSVYTLKKPTSITLAGVIENSGRTPWVAGRSASGYLSE  179

Query  177  HPRLAGADKNNVMVITPEGETVVAPVALWNKRHVEPPPGSQLWLGFSAHVLPEKYADLND  236
            HPR++GA++N  ++I+P GE    PVA WN RH EP  GS L++GFSA  LP  YADLN 
Sbjct  180  HPRMSGAERNIALLISPGGEVSEVPVAYWNHRHTEPQAGSTLFVGFSAWTLPRAYADLNI  239

Query  237  QIVSVLTQRVPD  248
            QIVSVLT  +PD
Sbjct  240  QIVSVLTHWIPD  251


>gb|ADP10499.1| Putative capsular polysaccharide protein [Erwinia sp. Ejp617]
Length=252

 Score =  211 bits (537),  Expect = 8e-53, Method: Compositional matrix adjust.
 Identities = 109/245 (44%), Positives = 152/245 (62%), Gaps = 4/245 (1%)

Query  8    FIASVLYVMTPHAFAQGTVTIYLPGEQQTLSVGPVENVVQLVTQPQLRDRLWWPGALLTD  67
             +A ++      A A   V I+ PG+ Q L V  + ++ QLVT P L  + WWPG  + +
Sbjct  8    LLAGIVAAFALQARADSQVNIFYPGQNQPLVVNHMADLQQLVTNPALAQKTWWPGTTIGE  67

Query  68   SAAKAKALKDYQHVMAQLASWE----AEADDDVAATIKSVRQQLLNLNITGRLPVKLDPD  123
              A A A++  Q ++A+L +W      E DD  AA + +VRQQ+  L +TGR  V LDPD
Sbjct  68   KRATAVAIQQQQQLLARLQTWRDRLRNEGDDTQAAAVDNVRQQIAVLKVTGRQIVNLDPD  127

Query  124  FVRVDENSNPPLVGDYTLYTVQRPVTITLLGAVSGAGQLPWQAGRSVTDYLQDHPRLAGA  183
            +VR+    N  L G+Y++YT+  P +ITL G +   G+ PW AGRS  +YL  HPR++GA
Sbjct  128  WVRLRPQDNRRLQGEYSVYTLNEPTSITLAGVIEKTGKTPWAAGRSAVEYLDAHPRMSGA  187

Query  184  DKNNVMVITPEGETVVAPVALWNKRHVEPPPGSQLWLGFSAHVLPEKYADLNDQIVSVLT  243
            +++  ++I+P GE    PVA WN+R VEP  GS L++GFSA  LP  YADLN QIVSVLT
Sbjct  188  ERSTALLISPGGEVTEIPVAYWNRRQVEPQAGSTLFIGFSAWTLPRAYADLNSQIVSVLT  247

Query  244  QRVPD  248
             R+PD
Sbjct  248  HRIPD  252


>ref|YP_003532687.1| hypothetical protein EAMY_3334 [Erwinia amylovora CFBP1430]
 emb|CBA23497.1| Uncharacterized protein ymcB precursor [Erwinia amylovora CFBP1430]
Length=238

 Score =  209 bits (532),  Expect = 4e-52, Method: Compositional matrix adjust.
 Identities = 108/238 (45%), Positives = 157/238 (65%), Gaps = 4/238 (1%)

Query  15   VMTPHAFAQGTVTIYLPGEQQTLSVGPVENVVQLVTQPQLRDRLWWPGALLTDSAAKAKA  74
            +++  A A G V I+ PG+ Q L V  + ++ QLVT P L  + WWPG  + +  A A  
Sbjct  1    MLSLQARADGKVNIFYPGQNQPLVVNHMADLEQLVTNPALAQKTWWPGTAIGEKQATAGV  60

Query  75   LKDYQHVMAQLASW--EAEADDD--VAATIKSVRQQLLNLNITGRLPVKLDPDFVRVDEN  130
            ++  Q ++A+L +W  +   DDD  +AA +++VRQQ+  L +TGR  V LDPD+VR+   
Sbjct  61   IQQQQQLLARLQTWRDQLRNDDDGALAAAVENVRQQIAALKVTGRQFVNLDPDWVRLRPG  120

Query  131  SNPPLVGDYTLYTVQRPVTITLLGAVSGAGQLPWQAGRSVTDYLQDHPRLAGADKNNVMV  190
            +N  L G+Y++YT+++P +ITL G +  +G+ PW AGRS + YL +HPR++GA++N  ++
Sbjct  121  ANRRLEGEYSVYTLKKPTSITLAGVIENSGRTPWVAGRSASGYLSEHPRMSGAERNIALL  180

Query  191  ITPEGETVVAPVALWNKRHVEPPPGSQLWLGFSAHVLPEKYADLNDQIVSVLTQRVPD  248
            I+P GE    PVA WN RH EP  GS L++GFSA  LP  YADLN QIVSVLT  +PD
Sbjct  181  ISPGGEVSEVPVAYWNHRHTEPQAGSTLFVGFSAWTLPRAYADLNIQIVSVLTHWIPD  238


>ref|YP_003739692.1| conserved uncharacterized protein YmcB [Erwinia billingiae Eb661]
 emb|CAX57832.1| conserved uncharacterized protein YmcB [Erwinia billingiae Eb661]
Length=251

 Score =  205 bits (521),  Expect = 6e-51, Method: Compositional matrix adjust.
 Identities = 111/252 (44%), Positives = 161/252 (63%), Gaps = 5/252 (1%)

Query  1    MNKLQSYFIASVLYVMTPHAFAQGTVTIYLPGEQQTLSVGPVENVVQLVTQPQLRDRLWW  60
            M K+ +  +A +   ++ +  A+  VT+Y PG+ QT  V   +N+ QLV+ P L D+ WW
Sbjct  1    MKKI-TLLLAGISACVSLNVSAESQVTVYSPGQTQTSIVSHAQNLAQLVSSPALMDKTWW  59

Query  61   PGALLTDSAAKAKALKDYQHVMAQLASW----EAEADDDVAATIKSVRQQLLNLNITGRL  116
            PG ++ +  A A A++  Q V+A+L +W     A+ D + AA + +V QQ+  + +TGR 
Sbjct  60   PGTVIAEKLATAAAIQQQQQVLARLKAWSNQLHADGDSEQAAVVDNVWQQVSAVKVTGRQ  119

Query  117  PVKLDPDFVRVDENSNPPLVGDYTLYTVQRPVTITLLGAVSGAGQLPWQAGRSVTDYLQD  176
               LDPD+VR+    N  L GDY++YT+ +P +ITL G ++ +G+ PW  GRSV DYLQD
Sbjct  120  LANLDPDWVRMRPAQNRRLEGDYSVYTLLKPTSITLAGVLANSGKTPWAPGRSVVDYLQD  179

Query  177  HPRLAGADKNNVMVITPEGETVVAPVALWNKRHVEPPPGSQLWLGFSAHVLPEKYADLND  236
            H RL+G ++N V++I P GE    PVA WN+RHVEP  GS ++ GFS+  LP    DLN 
Sbjct  180  HDRLSGGERNFVVLIAPNGEVSDVPVAYWNRRHVEPEVGSIVYRGFSSWTLPGDDEDLNQ  239

Query  237  QIVSVLTQRVPD  248
            QIVSVLT R+PD
Sbjct  240  QIVSVLTHRIPD  251


>ref|YP_001174974.1| hypothetical protein Ent638_0233 [Enterobacter sp. 638]
 gb|ABP58923.1| protein of unknown function DUF1017 [Enterobacter sp. 638]
Length=245

 Score =  203 bits (517),  Expect = 2e-50, Method: Compositional matrix adjust.
 Identities = 104/240 (43%), Positives = 151/240 (62%), Gaps = 1/240 (0%)

Query  9    IASVLYVMTPHAFAQGTVTIYLPGEQQTLSVGPVENVVQLVTQPQLRDRLWWPGALLTDS  68
            +A ++ +  P A++ GTV +Y P  ++  ++   E+++ LV QP+L +  WWPGA++++ 
Sbjct  7    VALIVTLAAPLAWSAGTVKVYTPDNKEPKTLSNAEHLIDLVGQPRLANS-WWPGAIISER  65

Query  69   AAKAKALKDYQHVMAQLASWEAEADDDVAATIKSVRQQLLNLNITGRLPVKLDPDFVRVD  128
             A A A + +Q ++A+L     + D D AA I +VRQQL  + +TGR  V LDPD VRV 
Sbjct  66   QATAIAEQKHQALLARLTGLAEQEDGDTAAAINAVRQQLQAITVTGRQRVNLDPDEVRVT  125

Query  129  ENSNPPLVGDYTLYTVQRPVTITLLGAVSGAGQLPWQAGRSVTDYLQDHPRLAGADKNNV  188
            EN NP L GDYTL+ V +P T+T+ G VS  GQ P+  GR V  YL +   L+GA+ +  
Sbjct  126  ENGNPTLEGDYTLWIVAKPSTVTVAGLVSSPGQKPFTPGRDVASYLDEQHLLSGAENSYA  185

Query  189  MVITPEGETVVAPVALWNKRHVEPPPGSQLWLGFSAHVLPEKYADLNDQIVSVLTQRVPD  248
             VI P+G     PVA WNKRH+EP PGS +++GF+ H   + Y  LN  I+  LTQR+PD
Sbjct  186  WVIYPDGRRQNVPVAYWNKRHIEPMPGSVIFVGFADHFWTKAYDGLNTDILRSLTQRIPD  245


>ref|YP_003518538.1| YmcB [Pantoea ananatis LMG 20103]
 gb|ADD75410.1| YmcB [Pantoea ananatis LMG 20103]
Length=247

 Score =  200 bits (509),  Expect = 1e-49, Method: Compositional matrix adjust.
 Identities = 110/243 (45%), Positives = 153/243 (62%), Gaps = 0/243 (0%)

Query  6    SYFIASVLYVMTPHAFAQGTVTIYLPGEQQTLSVGPVENVVQLVTQPQLRDRLWWPGALL  65
            +  +A +  + T  A A G VT++ P + Q++ V  VEN+ QLVTQP L  +  W  A++
Sbjct  5    TRLLAGMSLLTTLAAQAAGQVTVHAPHDTQSVQVNQVENLAQLVTQPALMTKTDWRRAVI  64

Query  66   TDSAAKAKALKDYQHVMAQLASWEAEADDDVAATIKSVRQQLLNLNITGRLPVKLDPDFV  125
             +  A A A + YQ  +AQL +W A++  + AA I +V  QL  + +TGR    LDPD++
Sbjct  65   AERGATAVAQQQYQQTLAQLRAWRADSSGEQAAAIDAVIHQLSGIRVTGRQFTSLDPDWI  124

Query  126  RVDENSNPPLVGDYTLYTVQRPVTITLLGAVSGAGQLPWQAGRSVTDYLQDHPRLAGADK  185
            R+    N  L G Y LY  Q   ++ L G ++GAG + WQ G+SV DYL +HPRLAGA++
Sbjct  125  RLHTMDNRRLEGSYDLYLTQPSTSVLLFGPIAGAGAVNWQPGKSVRDYLSEHPRLAGAER  184

Query  186  NNVMVITPEGETVVAPVALWNKRHVEPPPGSQLWLGFSAHVLPEKYADLNDQIVSVLTQR  245
            N  +VI P+G T  APVA WN RHVEP PGS +  GFS+  LP  + DLND++VSVLT R
Sbjct  185  NIAIVIAPDGTTREAPVAYWNHRHVEPEPGSIIMTGFSSWSLPGAFKDLNDRLVSVLTHR  244

Query  246  VPD  248
            +PD
Sbjct  245  IPD  247


>dbj|BAK13483.1| hypothetical protein YmcB precursor YmcB [Pantoea ananatis AJ13355]
Length=247

 Score =  200 bits (508),  Expect = 2e-49, Method: Compositional matrix adjust.
 Identities = 110/243 (45%), Positives = 154/243 (63%), Gaps = 0/243 (0%)

Query  6    SYFIASVLYVMTPHAFAQGTVTIYLPGEQQTLSVGPVENVVQLVTQPQLRDRLWWPGALL  65
            +  +A +  + T  A A G VT++ P + Q++ V  VEN+ QLVTQP L  +  W  A++
Sbjct  5    TRLLAGMSLLTTLAAQAAGQVTVHAPHDTQSVQVNQVENLAQLVTQPALMTQTDWRRAVI  64

Query  66   TDSAAKAKALKDYQHVMAQLASWEAEADDDVAATIKSVRQQLLNLNITGRLPVKLDPDFV  125
             +  A A A + YQ  +AQL +W A++  + AA I +V +QL  + +TGR    LDPD++
Sbjct  65   AERGATAVAQQQYQQTLAQLRAWRADSSGEQAAAIDAVIRQLSGIRVTGRQFTSLDPDWI  124

Query  126  RVDENSNPPLVGDYTLYTVQRPVTITLLGAVSGAGQLPWQAGRSVTDYLQDHPRLAGADK  185
            R+    N  L G Y LY  Q   ++ L G ++GAG + WQ G+SV DYL +HPRLAGA++
Sbjct  125  RLHTMDNRRLEGSYDLYLTQPSTSVLLFGPIAGAGAVNWQPGKSVRDYLSEHPRLAGAER  184

Query  186  NNVMVITPEGETVVAPVALWNKRHVEPPPGSQLWLGFSAHVLPEKYADLNDQIVSVLTQR  245
            N  +VI P+G T  APVA WN RHVEP PGS +  GFS+  LP  + DLND++VSVLT R
Sbjct  185  NIAIVIAPDGTTREAPVAYWNHRHVEPEPGSIIMTGFSSWSLPGAFKDLNDRLVSVLTHR  244

Query  246  VPD  248
            +PD
Sbjct  245  IPD  247


>ref|YP_001436216.1| hypothetical protein ESA_00075 [Cronobacter sakazakii ATCC BAA-894]
 gb|ABU75380.1| hypothetical protein ESA_00075 [Cronobacter sakazakii ATCC BAA-894]
Length=244

 Score =  200 bits (508),  Expect = 2e-49, Method: Compositional matrix adjust.
 Identities = 110/246 (44%), Positives = 153/246 (62%), Gaps = 3/246 (1%)

Query  4    LQSYFIASVLYVMTPH-AFAQGTVTIYLPGEQQTLSVGPVENVVQLVTQPQLRDRLWWPG  62
            +++  IAS+++ +    AFA GT+T+Y     Q L V   +++  LV+QPQL    WW G
Sbjct  1    MKTTLIASLIFSLGSFCAFADGTITVYR-DHAQPLKVSGAKHLADLVSQPQLAGS-WWLG  58

Query  63   ALLTDSAAKAKALKDYQHVMAQLASWEAEADDDVAATIKSVRQQLLNLNITGRLPVKLDP  122
            A++++  A  +A   +Q ++ +LAS  AE   D  A I  VRQQL  + +TGR  V LDP
Sbjct  59   AVISERQASVEAQAQHQVLLNRLASLAAEEGGDDGAAINRVRQQLQAIKVTGRQRVILDP  118

Query  123  DFVRVDENSNPPLVGDYTLYTVQRPVTITLLGAVSGAGQLPWQAGRSVTDYLQDHPRLAG  182
            D VRV  ++NPPL GDY L+   +P TITL+G VS  G+  +  G+ VTDYL D  RL+G
Sbjct  119  DRVRVRPHNNPPLEGDYELWVGPQPSTITLVGLVSAPGKKTFTPGKPVTDYLDDVSRLSG  178

Query  183  ADKNNVMVITPEGETVVAPVALWNKRHVEPPPGSQLWLGFSAHVLPEKYADLNDQIVSVL  242
            A+++   +I P G     PVA WNKRHVEP PGS L++GF+ H     Y  LN+QI+S L
Sbjct  179  AERSYAWLIQPTGRVEKVPVAYWNKRHVEPMPGSILYVGFADHTFTSAYDGLNEQIISSL  238

Query  243  TQRVPD  248
            T R+PD
Sbjct  239  THRIPD  244


>gb|EGL72010.1| hypothetical protein CSE899_14477 [Cronobacter sakazakii E899]
Length=244

 Score =  197 bits (502),  Expect = 1e-48, Method: Compositional matrix adjust.
 Identities = 109/246 (44%), Positives = 152/246 (61%), Gaps = 3/246 (1%)

Query  4    LQSYFIASVLYVMTPH-AFAQGTVTIYLPGEQQTLSVGPVENVVQLVTQPQLRDRLWWPG  62
            +++  IAS+++ +    AFA GT+T+Y     Q L V   +++  LV+QPQL    WW G
Sbjct  1    MKTTLIASLIFSLGSFCAFADGTITVYR-DHAQPLKVSGAKHLADLVSQPQLAGS-WWLG  58

Query  63   ALLTDSAAKAKALKDYQHVMAQLASWEAEADDDVAATIKSVRQQLLNLNITGRLPVKLDP  122
            A++++  A  +A   +Q ++ +LAS  AE   D  A I  V QQL  + +TGR  V LDP
Sbjct  59   AVISERQASVEAQAQHQVLLNRLASLAAEEGGDDGAAINRVHQQLQAIKVTGRQRVILDP  118

Query  123  DFVRVDENSNPPLVGDYTLYTVQRPVTITLLGAVSGAGQLPWQAGRSVTDYLQDHPRLAG  182
            D VRV  ++NPPL GDY L+   +P TITL+G VS  G+  +  G+ VTDYL D  RL+G
Sbjct  119  DRVRVRPHNNPPLEGDYELWVGPQPSTITLVGLVSAPGKKTFTPGKPVTDYLDDVSRLSG  178

Query  183  ADKNNVMVITPEGETVVAPVALWNKRHVEPPPGSQLWLGFSAHVLPEKYADLNDQIVSVL  242
            A+++   +I P G     PVA WNKRHVEP PGS L++GF+ H     Y  LN+QI+S L
Sbjct  179  AERSYAWLIQPTGRVEKVPVAYWNKRHVEPMPGSILYVGFADHTFTSAYDGLNEQIISSL  238

Query  243  TQRVPD  248
            T R+PD
Sbjct  239  THRIPD  244


>ref|YP_003610796.1| hypothetical protein ECL_00280 [Enterobacter cloacae subsp. cloacae 
ATCC 13047]
 gb|ADF59847.1| hypothetical protein ECL_00280 [Enterobacter cloacae subsp. cloacae 
ATCC 13047]
Length=245

 Score =  197 bits (502),  Expect = 1e-48, Method: Compositional matrix adjust.
 Identities = 104/240 (43%), Positives = 147/240 (61%), Gaps = 1/240 (0%)

Query  9    IASVLYVMTPHAFAQGTVTIYLPGEQQTLSVGPVENVVQLVTQPQLRDRLWWPGALLTDS  68
            +A +    TP A++ GTV +Y P   Q  ++    N++ LV QP+L +  WW GA++ + 
Sbjct  7    VALIASFATPLAWSAGTVKVYTPDSTQPKTLTNAGNLIDLVGQPRLANS-WWTGAVIAER  65

Query  69   AAKAKALKDYQHVMAQLASWEAEADDDVAATIKSVRQQLLNLNITGRLPVKLDPDFVRVD  128
             A   A + ++ ++A+L     + D D AA I S+RQQLL L +TGR  + LDPD VRV 
Sbjct  66   QATVAAEQKHKALLARLTGLAEQEDGDDAAAINSLRQQLLALKVTGRQNINLDPDEVRVT  125

Query  129  ENSNPPLVGDYTLYTVQRPVTITLLGAVSGAGQLPWQAGRSVTDYLQDHPRLAGADKNNV  188
            E  NP L GDYTL+   +P TIT++G +S  G+ P+  GR V  YL +   L+GAD +  
Sbjct  126  EKGNPALEGDYTLWLPAQPSTITVMGLISSPGKKPFTPGRDVASYLDEQSLLSGADNSYA  185

Query  189  MVITPEGETVVAPVALWNKRHVEPPPGSQLWLGFSAHVLPEKYADLNDQIVSVLTQRVPD  248
             VI P+G T  APVA WNKRHVEP PGS +++GF+ H   + Y  LN  I+  LTQR+P+
Sbjct  186  WVIYPDGNTQKAPVAYWNKRHVEPMPGSIIFVGFADHFWTKAYDGLNADILHSLTQRIPE  245


>ref|NP_709891.1| hypothetical protein SF4177 [Shigella flexneri 2a str. 301]
 ref|NP_838790.1| hypothetical protein S3554 [Shigella flexneri 2a str. 2457T]
 gb|AAN45598.1| orf, conserved hypothetical protein [Shigella flexneri 2a str. 
301]
 10 more sequence titles
 Length=245

 Score =  196 bits (497),  Expect = 3e-48, Method: Compositional matrix adjust.
 Identities = 103/244 (42%), Positives = 148/244 (60%), Gaps = 1/244 (0%)

Query  4    LQSYFIASVLYVMTPHAFAQGTVTIYLPGEQQTLSVGPVENVVQLVTQPQLRDRLWWPGA  63
            ++   +A +L V     FA GTV ++  G  +  ++   E+++ LV QP+L +  WWPGA
Sbjct  2    IKQTIVALLLSVGASSVFAAGTVKVFSNGSSEAKTLTGAEHLIDLVGQPRLANS-WWPGA  60

Query  64   LLTDSAAKAKALKDYQHVMAQLASWEAEADDDVAATIKSVRQQLLNLNITGRLPVKLDPD  123
            ++++  A A AL+  Q ++ +LA   A++  D AA I ++RQQ+  L +TGR  + LDPD
Sbjct  61   VISEELATAAALRQQQALLTRLAEQGADSSTDDAAAINALRQQIQALKVTGRQKINLDPD  120

Query  124  FVRVDENSNPPLVGDYTLYTVQRPVTITLLGAVSGAGQLPWQAGRSVTDYLQDHPRLAGA  183
             VRV E  NPPL G+YTL+    P T+TL G +S  G  P+  GR V  YL D   L+GA
Sbjct  121  IVRVAERGNPPLQGNYTLWVGPPPSTVTLFGLISHPGNQPFTPGRDVASYLSDQSLLSGA  180

Query  184  DKNNVMVITPEGETVVAPVALWNKRHVEPPPGSQLWLGFSAHVLPEKYADLNDQIVSVLT  243
            D++   V+ P+G T  APVA WNKRHVEP PGS +++G +  V  E    LN  I+  LT
Sbjct  181  DRSYAWVVYPDGRTQKAPVAYWNKRHVEPMPGSIIYVGLADSVWSETPDALNADILQTLT  240

Query  244  QRVP  247
            QR+P
Sbjct  241  QRIP  244


>ref|YP_312940.1| hypothetical protein SSON_4206 [Shigella sonnei Ss046]
 gb|AAZ90705.1| conserved hypothetical protein [Shigella sonnei Ss046]
Length=245

 Score =  196 bits (497),  Expect = 4e-48, Method: Compositional matrix adjust.
 Identities = 103/244 (42%), Positives = 148/244 (60%), Gaps = 1/244 (0%)

Query  4    LQSYFIASVLYVMTPHAFAQGTVTIYLPGEQQTLSVGPVENVVQLVTQPQLRDRLWWPGA  63
            ++   +A +L V     FA GTV ++  G  +  ++   E+++ LV QP+L +  WWPGA
Sbjct  2    IKQTIVALLLSVGASSVFAAGTVKVFSNGSSEAKTLTGAEHLIDLVGQPRLANS-WWPGA  60

Query  64   LLTDSAAKAKALKDYQHVMAQLASWEAEADDDVAATIKSVRQQLLNLNITGRLPVKLDPD  123
            ++++  A A AL+  Q ++ +LA   A++  D AA I ++RQQ+  L +TGR  + LDPD
Sbjct  61   VISEELATAAALRQQQALLTRLAEQGADSSTDDAAAINALRQQIQALEVTGRQKINLDPD  120

Query  124  FVRVDENSNPPLVGDYTLYTVQRPVTITLLGAVSGAGQLPWQAGRSVTDYLQDHPRLAGA  183
             VRV E  NPPL G+YTL+    P T+TL G +S  G  P+  GR V  YL D   L+GA
Sbjct  121  IVRVAERGNPPLQGNYTLWVGPPPSTVTLFGLISHPGNQPFTPGRDVASYLSDQSLLSGA  180

Query  184  DKNNVMVITPEGETVVAPVALWNKRHVEPPPGSQLWLGFSAHVLPEKYADLNDQIVSVLT  243
            D++   V+ P+G T  APVA WNKRHVEP PGS +++G +  V  E    LN  I+  LT
Sbjct  181  DRSYAWVVYPDGRTQKAPVAYWNKRHVEPMPGSIIYVGLADSVWSETPDALNADILQTLT  240

Query  244  QRVP  247
            QR+P
Sbjct  241  QRIP  244


>ref|YP_003232032.1| hypothetical protein ECO26_5143 [Escherichia coli O26:H11 str. 
11368]
 ref|YP_003237146.1| hypothetical protein ECO111_4850 [Escherichia coli O111:H- str. 
11128]
 dbj|BAI28292.1| conserved predicted protein [Escherichia coli O26:H11 str. 11368]
 dbj|BAI38595.1| conserved predicted protein [Escherichia coli O111:H- str. 11128]
 gb|EFZ41755.1| hypothetical protein ECEPECA14_2520 [Escherichia coli EPECa14]
 gb|EFZ63178.1| hypothetical protein ECOK1180_3669 [Escherichia coli 1180]
Length=245

 Score =  195 bits (496),  Expect = 4e-48, Method: Compositional matrix adjust.
 Identities = 103/244 (42%), Positives = 148/244 (60%), Gaps = 1/244 (0%)

Query  4    LQSYFIASVLYVMTPHAFAQGTVTIYLPGEQQTLSVGPVENVVQLVTQPQLRDRLWWPGA  63
            ++  F+A +L V     FA GTV ++  G  +  ++   E+++ LV QP+L +  WWPGA
Sbjct  2    IKQTFVALLLSVGASSVFAAGTVKVFSNGSSEAKTLTGAEHLIDLVGQPRLANS-WWPGA  60

Query  64   LLTDSAAKAKALKDYQHVMAQLASWEAEADDDVAATIKSVRQQLLNLNITGRLPVKLDPD  123
            ++++  A A AL+  Q ++ +LA   A++  D AA I ++RQQ+  L +TGR  + LDPD
Sbjct  61   VISEELATAAALRQQQALLTRLAEQGADSSTDDAAAINALRQQIQALKVTGRQKINLDPD  120

Query  124  FVRVDENSNPPLVGDYTLYTVQRPVTITLLGAVSGAGQLPWQAGRSVTDYLQDHPRLAGA  183
             VRV E  NPPL G+YTL+    P T+TL G +S  G  P+  GR V  YL     L+GA
Sbjct  121  IVRVAERGNPPLQGNYTLWVGPPPSTVTLFGLISHPGNQPFTPGRDVASYLSGQNLLSGA  180

Query  184  DKNNVMVITPEGETVVAPVALWNKRHVEPPPGSQLWLGFSAHVLPEKYADLNDQIVSVLT  243
            D++   V+ P+G T  APVA WNKRHVEP PGS +++G +  V  E    LN  I+  LT
Sbjct  181  DRSYAWVVYPDGRTQKAPVAYWNKRHVEPMPGSIIYVGLADSVWSETPDALNADILQTLT  240

Query  244  QRVP  247
            QR+P
Sbjct  241  QRIP  244


>ref|YP_003212148.1| hypothetical protein CTU_37850 [Cronobacter turicensis z3032]
 emb|CBA34102.1| Uncharacterized protein yjbG [Cronobacter turicensis z3032]
Length=235

 Score =  195 bits (495),  Expect = 6e-48, Method: Compositional matrix adjust.
 Identities = 107/229 (46%), Positives = 143/229 (62%), Gaps = 2/229 (0%)

Query  20   AFAQGTVTIYLPGEQQTLSVGPVENVVQLVTQPQLRDRLWWPGALLTDSAAKAKALKDYQ  79
            AFA GT+T+Y     Q L V   +++  LV+QPQL    WW GA++++  A  +A   +Q
Sbjct  9    AFADGTITVYR-DHAQPLKVSGAKHLADLVSQPQLTGS-WWLGAVISERQASVEAQAQHQ  66

Query  80   HVMAQLASWEAEADDDVAATIKSVRQQLLNLNITGRLPVKLDPDFVRVDENSNPPLVGDY  139
             ++ +LAS  AE   D  A I  VRQQL  + +TGR  V LDPD VRV  ++NPPL GDY
Sbjct  67   VLLNRLASLAAEEGGDDGAAINRVRQQLQAIKVTGRQRVILDPDRVRVRPHNNPPLEGDY  126

Query  140  TLYTVQRPVTITLLGAVSGAGQLPWQAGRSVTDYLQDHPRLAGADKNNVMVITPEGETVV  199
             L+   +P TITL+G VS  G+  +  G+ VTDYL +  RL+GA+++   +I P G    
Sbjct  127  ELWVGPQPSTITLVGLVSAPGKKTFTPGKPVTDYLDEVSRLSGAERSYAWLIQPTGRVEK  186

Query  200  APVALWNKRHVEPPPGSQLWLGFSAHVLPEKYADLNDQIVSVLTQRVPD  248
             PVA WNKRHVEP PGS L++GF+ H     Y  LN+QI+S LT RVPD
Sbjct  187  VPVAYWNKRHVEPMPGSILYVGFADHTFTSAYDGLNEQIISSLTHRVPD  235


>gb|EGC97468.1| hypothetical protein ECD227_3706 [Escherichia fergusonii ECD227]
Length=245

 Score =  195 bits (495),  Expect = 6e-48, Method: Compositional matrix adjust.
 Identities = 102/244 (41%), Positives = 147/244 (60%), Gaps = 1/244 (0%)

Query  4    LQSYFIASVLYVMTPHAFAQGTVTIYLPGEQQTLSVGPVENVVQLVTQPQLRDRLWWPGA  63
            ++    A ++ V     FA GTV ++  G  +  ++   E+++ LV QP+L +  WWPGA
Sbjct  2    IKQTIAALIMSVGASSVFAAGTVKVFSNGSNEAKTLTGAEHLIDLVGQPRLANS-WWPGA  60

Query  64   LLTDSAAKAKALKDYQHVMAQLASWEAEADDDVAATIKSVRQQLLNLNITGRLPVKLDPD  123
            ++++  A A AL+  Q ++ +LA   +E+  D AA I ++RQQ+  L +TGR  + LDPD
Sbjct  61   VISEELATAAALRQQQALLTRLAELASESSTDDAAAINALRQQIQALKVTGRQKINLDPD  120

Query  124  FVRVDENSNPPLVGDYTLYTVQRPVTITLLGAVSGAGQLPWQAGRSVTDYLQDHPRLAGA  183
             VRV E SNPPL G+YTL+    P T+TL G +S  G  P+  GR V  YL     L+GA
Sbjct  121  IVRVAERSNPPLQGNYTLWVGPPPSTVTLFGLISRPGNQPFTPGRDVASYLSGQSLLSGA  180

Query  184  DKNNVMVITPEGETVVAPVALWNKRHVEPPPGSQLWLGFSAHVLPEKYADLNDQIVSVLT  243
            D++   V+ P+G T  APVA WNKRHVEP PGS +++G +  V  E    LN  I+  LT
Sbjct  181  DRSYAWVVYPDGRTQKAPVAYWNKRHVEPMPGSIIYVGLADSVWSETPDALNADILQTLT  240

Query  244  QRVP  247
            QR+P
Sbjct  241  QRIP  244


>ref|ZP_05969412.1| conserved hypothetical protein [Enterobacter cancerogenus ATCC 
35316]
 gb|EFC55292.1| conserved hypothetical protein [Enterobacter cancerogenus ATCC 
35316]
Length=246

 Score =  192 bits (489),  Expect = 3e-47, Method: Compositional matrix adjust.
 Identities = 102/247 (41%), Positives = 149/247 (60%), Gaps = 3/247 (1%)

Query  4    LQSYFIASVLYV--MTPHAFAQGTVTIYLPGEQQTLSVGPVENVVQLVTQPQLRDRLWWP  61
            ++   IA VL      P A++ GTV +Y P   Q  ++    +++ LV QP+L +  WWP
Sbjct  1    MKKTVIAIVLLAGFAAPLAWSAGTVKVYTPENAQPKTLTNAGHLLDLVGQPRLANS-WWP  59

Query  62   GALLTDSAAKAKALKDYQHVMAQLASWEAEADDDVAATIKSVRQQLLNLNITGRLPVKLD  121
            GA++ +  A  +A + +  ++A+L     + D D AA I SVRQQL  L +TGR  + LD
Sbjct  60   GAVIGERQASVEAEQKHNALLARLTGLAGQEDGDDAAAINSVRQQLQALKVTGRQTINLD  119

Query  122  PDFVRVDENSNPPLVGDYTLYTVQRPVTITLLGAVSGAGQLPWQAGRSVTDYLQDHPRLA  181
            PD VRV E  NP L G+YTL+   +P T+T++G +S  G+ P+  GR V  YL +   L+
Sbjct  120  PDVVRVAEKGNPALEGEYTLWLPTQPSTVTVMGLISSPGKKPFTPGRDVASYLDEQSLLS  179

Query  182  GADKNNVMVITPEGETVVAPVALWNKRHVEPPPGSQLWLGFSAHVLPEKYADLNDQIVSV  241
            GAD +   ++ P+G T  APVA WNKRH+EP PGS +++GF+ H   + Y  LN  I+  
Sbjct  180  GADNSYAWIVYPDGHTQKAPVAYWNKRHIEPMPGSVIFVGFADHFWTKAYDGLNADILHS  239

Query  242  LTQRVPD  248
            LTQR+PD
Sbjct  240  LTQRIPD  246


>ref|YP_001455401.1| hypothetical protein CKO_03889 [Citrobacter koseri ATCC BAA-895]
 gb|ABV14965.1| hypothetical protein CKO_03889 [Citrobacter koseri ATCC BAA-895]
Length=245

 Score =  192 bits (489),  Expect = 3e-47, Method: Compositional matrix adjust.
 Identities = 101/245 (41%), Positives = 147/245 (60%), Gaps = 1/245 (0%)

Query  4    LQSYFIASVLYVMTPHAFAQGTVTIYLPGEQQTLSVGPVENVVQLVTQPQLRDRLWWPGA  63
            ++    A  L +    +FA GTV ++  G  +  ++   E+++ LV QP+L +  WWPGA
Sbjct  2    MKQTIAALALSLCASSSFAAGTVKVFAAGSTEPKTLTGAEHLIDLVGQPKLANS-WWPGA  60

Query  64   LLTDSAAKAKALKDYQHVMAQLASWEAEADDDVAATIKSVRQQLLNLNITGRLPVKLDPD  123
            ++++  A A A +  Q ++ +LA++ AE   D AA I ++RQQ+  L +TGR  V LDPD
Sbjct  61   VISEERATATAQRQQQELLGRLAAFGAEKSGDDAAAINTLRQQVQTLKVTGRQLVNLDPD  120

Query  124  FVRVDENSNPPLVGDYTLYTVQRPVTITLLGAVSGAGQLPWQAGRSVTDYLQDHPRLAGA  183
             VRV E  NPPL G YTL+   +P  ITL G VS  G+ P+  GR V  YL +   L+GA
Sbjct  121  TVRVSERGNPPLQGHYTLWVGGQPTDITLFGLVSQPGKRPFSPGRDVASYLDEQSLLSGA  180

Query  184  DKNNVMVITPEGETVVAPVALWNKRHVEPPPGSQLWLGFSAHVLPEKYADLNDQIVSVLT  243
            D++   V+ P+G T  APVA WNKRHVEP PGS +++G +  +       LN  I+  LT
Sbjct  181  DRSYAWVVYPDGRTQKAPVAYWNKRHVEPMPGSIIFVGLTDSLWSNTPDALNADILRTLT  240

Query  244  QRVPD  248
            QR+P+
Sbjct  241  QRIPE  245


>ref|YP_003367162.1| hypothetical protein ROD_37221 [Citrobacter rodentium ICC168]
 emb|CBG90427.1| putative exported protein [Citrobacter rodentium ICC168]
Length=245

 Score =  192 bits (489),  Expect = 3e-47, Method: Compositional matrix adjust.
 Identities = 101/245 (41%), Positives = 149/245 (60%), Gaps = 1/245 (0%)

Query  4    LQSYFIASVLYVMTPHAFAQGTVTIYLPGEQQTLSVGPVENVVQLVTQPQLRDRLWWPGA  63
            ++    A ++ +     FA GTV +++ G  Q  ++   E ++ LV QP+L +  WWPGA
Sbjct  2    MKRTLFALLISLNAASVFAAGTVNVFIAGTPQAKTLTGAERLIDLVGQPRLANS-WWPGA  60

Query  64   LLTDSAAKAKALKDYQHVMAQLASWEAEADDDVAATIKSVRQQLLNLNITGRLPVKLDPD  123
            ++++  A A AL+  Q ++A+LA+  A    D AA I ++RQQ+  L ITGR  V LDPD
Sbjct  61   VISEEQATAAALRQQQELVARLAALSAGESGDDAAAINALRQQVQALRITGRQRVNLDPD  120

Query  124  FVRVDENSNPPLVGDYTLYTVQRPVTITLLGAVSGAGQLPWQAGRSVTDYLQDHPRLAGA  183
             VRV E +NPPL G+YTL+   +P  +TL G +S  G+LP+  GR V  YL+    L+GA
Sbjct  121  VVRVSERANPPLQGNYTLWVGPQPAEVTLFGLMSRPGKLPFMPGRDVVSYLEGQSLLSGA  180

Query  184  DKNNVMVITPEGETVVAPVALWNKRHVEPPPGSQLWLGFSAHVLPEKYADLNDQIVSVLT  243
            D++   VI P+G +   PVA WN+RHVEP PGS +++G    V   +   LN  I+  LT
Sbjct  181  DRSYAWVIYPDGRSQKVPVAYWNRRHVEPMPGSIIFVGLDDAVWSSEPDALNADILHTLT  240

Query  244  QRVPD  248
            QR+P+
Sbjct  241  QRIPE  245


>gb|EGB70445.1| SLBB-domain-containing protein [Escherichia coli TW10509]
Length=245

 Score =  192 bits (487),  Expect = 6e-47, Method: Compositional matrix adjust.
 Identities = 101/244 (41%), Positives = 146/244 (59%), Gaps = 1/244 (0%)

Query  4    LQSYFIASVLYVMTPHAFAQGTVTIYLPGEQQTLSVGPVENVVQLVTQPQLRDRLWWPGA  63
            ++    A ++ V     FA GTV ++  G  +  ++   E+++ LV QP+L +  WWPGA
Sbjct  2    IKQTIAALIMSVGASSVFAAGTVKVFSNGSNEAKTLTGAEHLIDLVGQPRLANS-WWPGA  60

Query  64   LLTDSAAKAKALKDYQHVMAQLASWEAEADDDVAATIKSVRQQLLNLNITGRLPVKLDPD  123
            ++++  A A AL+  Q ++ +LA   +E+  D AA I ++RQQ+  L +TGR  + LDPD
Sbjct  61   VISEELATAAALRQQQALLTRLAELASESSTDDAAAINALRQQIQALKVTGRQKINLDPD  120

Query  124  FVRVDENSNPPLVGDYTLYTVQRPVTITLLGAVSGAGQLPWQAGRSVTDYLQDHPRLAGA  183
             VRV E  NPPL G+YTL+    P T+TL G +S  G  P+  GR V  YL D   L+GA
Sbjct  121  IVRVAERGNPPLQGNYTLWVGPPPSTVTLFGLISRPGNQPFTPGRDVASYLSDQSLLSGA  180

Query  184  DKNNVMVITPEGETVVAPVALWNKRHVEPPPGSQLWLGFSAHVLPEKYADLNDQIVSVLT  243
            D++   V+ P+G T  APVA WNKRHVE  PGS +++G +  V  E    LN  I+  LT
Sbjct  181  DRSYAWVVYPDGRTQKAPVAYWNKRHVELMPGSIIYVGLADSVWSETPDALNADILQTLT  240

Query  244  QRVP  247
            QR+P
Sbjct  241  QRIP  244


>ref|ZP_08500193.1| group 4 capsule (G4C) polysaccharide, YmcB [Enterobacter hormaechei 
ATCC 49162]
 gb|EGK57100.1| group 4 capsule (G4C) polysaccharide, YmcB [Enterobacter hormaechei 
ATCC 49162]
Length=245

 Score =  191 bits (485),  Expect = 1e-46, Method: Compositional matrix adjust.
 Identities = 107/248 (43%), Positives = 155/248 (62%), Gaps = 3/248 (1%)

Query  1    MNKLQSYFIASVLYVMTPHAFAQGTVTIYLPGEQQTLSVGPVENVVQLVTQPQLRDRLWW  60
            MN  + +     L + +  A     VT++ PG  +T S    + + +LVTQPQ  + +WW
Sbjct  1    MNGHKKWLPGVGLSLFSLSALGASVVTVHQPG--KTWSAEQADTLSRLVTQPQFNN-VWW  57

Query  61   PGALLTDSAAKAKALKDYQHVMAQLASWEAEADDDVAATIKSVRQQLLNLNITGRLPVKL  120
             GA +   +A  +A +  Q V+A L +W+  A+D+ AAT+++V  Q+ +L I GR  V L
Sbjct  58   QGAAIATPSATRRAQQTQQQVLALLTAWQNRANDERAATVRAVAAQIRSLRIVGRQFVNL  117

Query  121  DPDFVRVDENSNPPLVGDYTLYTVQRPVTITLLGAVSGAGQLPWQAGRSVTDYLQDHPRL  180
            DPD VR D + + PL G Y L+    P T+TL+GAV+  G+  W+ G S+ DYLQ  PRL
Sbjct  118  DPDAVRTDAHGDRPLEGRYDLWLSPAPRTVTLMGAVATPGKRAWRPGASIRDYLQGQPRL  177

Query  181  AGADKNNVMVITPEGETVVAPVALWNKRHVEPPPGSQLWLGFSAHVLPEKYADLNDQIVS  240
            AGAD+NNV VI P+G TVVA V  WN RH+E  PG+ LW+GF    +P+ +  LN+QIV+
Sbjct  178  AGADRNNVTVIDPDGSTVVAQVGYWNARHIEAEPGAVLWVGFDPRAVPDDFTGLNEQIVA  237

Query  241  VLTQRVPD  248
            +LT+R+PD
Sbjct  238  LLTRRIPD  245


>ref|YP_001465525.1| hypothetical protein EcE24377A_4576 [Escherichia coli E24377A]
 ref|YP_001460814.1| hypothetical protein EcHS_A4267 [Escherichia coli HS]
 ref|ZP_07591769.1| protein of unknown function DUF1017 [Escherichia coli W]
 gb|ABV08431.1| conserved hypothetical protein [Escherichia coli HS]
 gb|ABV16853.1| conserved hypothetical protein [Escherichia coli E24377A]
 gb|EFN38440.1| protein of unknown function DUF1017 [Escherichia coli W]
 gb|ADT77679.1| conserved protein [Escherichia coli W]
 gb|ADX52853.1| protein of unknown function DUF1017 [Escherichia coli KO11]
Length=245

 Score =  190 bits (483),  Expect = 1e-46, Method: Compositional matrix adjust.
 Identities = 104/244 (42%), Positives = 150/244 (61%), Gaps = 1/244 (0%)

Query  4    LQSYFIASVLYVMTPHAFAQGTVTIYLPGEQQTLSVGPVENVVQLVTQPQLRDRLWWPGA  63
            ++  F+A +L V     FA GTV ++  G  +  ++   E+++ LV QP+L +  WWPGA
Sbjct  2    IKQTFVALLLSVGASSVFAAGTVKVFSNGSSEAKTLTGAEHLIDLVGQPRLANS-WWPGA  60

Query  64   LLTDSAAKAKALKDYQHVMAQLASWEAEADDDVAATIKSVRQQLLNLNITGRLPVKLDPD  123
            ++++  A A AL+  Q ++ +LA   A++  D AA I ++RQQ+  L +TGR  + LDPD
Sbjct  61   VISEELATAAALRQQQALLTRLAELAADSSADDAAAINALRQQIQALKVTGRQKINLDPD  120

Query  124  FVRVDENSNPPLVGDYTLYTVQRPVTITLLGAVSGAGQLPWQAGRSVTDYLQDHPRLAGA  183
             VRV E  NPPL G+YTL+    P T+TL G +S  G+ P+  GR V  YL D   L+GA
Sbjct  121  IVRVAERGNPPLQGNYTLWVGPPPSTVTLFGLISRPGKQPFTPGRDVASYLSDQSLLSGA  180

Query  184  DKNNVMVITPEGETVVAPVALWNKRHVEPPPGSQLWLGFSAHVLPEKYADLNDQIVSVLT  243
            D++   V+ P+G T  APVA WNKRHVEP PGS +++G +  V  E    LN  I+  LT
Sbjct  181  DRSYAWVVYPDGRTQKAPVAYWNKRHVEPMPGSIIYVGLADSVWSETPDALNADILQTLT  240

Query  244  QRVP  247
            QR+P
Sbjct  241  QRIP  244


>ref|ZP_07140265.1| hypothetical protein HMPREF9548_02441 [Escherichia coli MS 182-1]
 ref|ZP_07223211.1| conserved hypothetical protein [Escherichia coli MS 78-1]
 gb|EFK02824.1| hypothetical protein HMPREF9548_02441 [Escherichia coli MS 182-1]
 gb|EFK71207.1| conserved hypothetical protein [Escherichia coli MS 78-1]
 gb|EFW75431.1| YjbG polysaccharide synthesis-related protein [Escherichia coli 
EC4100B]
Length=245

 Score =  190 bits (482),  Expect = 2e-46, Method: Compositional matrix adjust.
 Identities = 104/244 (42%), Positives = 150/244 (61%), Gaps = 1/244 (0%)

Query  4    LQSYFIASVLYVMTPHAFAQGTVTIYLPGEQQTLSVGPVENVVQLVTQPQLRDRLWWPGA  63
            ++  F+A +L V     FA GTV ++  G  +  ++   E+++ LV QP+L +  WWPGA
Sbjct  2    IKQTFVALLLSVGASSVFAAGTVKVFSNGSSEAKTLTGAEHLIDLVGQPRLANS-WWPGA  60

Query  64   LLTDSAAKAKALKDYQHVMAQLASWEAEADDDVAATIKSVRQQLLNLNITGRLPVKLDPD  123
            ++++  A A AL+  Q ++ +LA   A++  D AA I ++RQQ+  L +TGR  + LDPD
Sbjct  61   VISEELATAAALRQQQALLTRLAEQGADSSADDAAAINALRQQIQALKVTGRQKINLDPD  120

Query  124  FVRVDENSNPPLVGDYTLYTVQRPVTITLLGAVSGAGQLPWQAGRSVTDYLQDHPRLAGA  183
             VRV E  NPPL G+YTL+    P T+TL G +S  G+ P+  GR V  YL D   L+GA
Sbjct  121  IVRVAERGNPPLQGNYTLWVGPPPSTVTLFGLISRPGKQPFTPGRDVASYLSDQSLLSGA  180

Query  184  DKNNVMVITPEGETVVAPVALWNKRHVEPPPGSQLWLGFSAHVLPEKYADLNDQIVSVLT  243
            D++   V+ P+G T  APVA WNKRHVEP PGS +++G +  V  E    LN  I+  LT
Sbjct  181  DRSYAWVVYPDGRTQKAPVAYWNKRHVEPMPGSIIYVGLADSVWSETPDALNADILQTLT  240

Query  244  QRVP  247
            QR+P
Sbjct  241  QRIP  244


>emb|CBK84486.1| SLBB-domain like (DUF1017) [Enterobacter cloacae subsp. cloacae 
NCTC 9394]
Length=245

 Score =  189 bits (480),  Expect = 3e-46, Method: Compositional matrix adjust.
 Identities = 99/240 (41%), Positives = 150/240 (62%), Gaps = 1/240 (0%)

Query  9    IASVLYVMTPHAFAQGTVTIYLPGEQQTLSVGPVENVVQLVTQPQLRDRLWWPGALLTDS  68
            IA +  + +P A++ GTV +Y P  ++  ++    +++ LV QP+L  + WW GA++++ 
Sbjct  7    IALLASLTSPLAWSAGTVQVYTPDSEKPKTLTNAGHLLDLVGQPRLA-KSWWTGAVISER  65

Query  69   AAKAKALKDYQHVMAQLASWEAEADDDVAATIKSVRQQLLNLNITGRLPVKLDPDFVRVD  128
             A   A + +Q ++A+L+    + D D AA I S+RQQL  + +TGR  V LDPD VRV 
Sbjct  66   QATIVAEQKHQALLARLSGLAQQEDTDDAAAITSLRQQLQAVKVTGRQKVNLDPDEVRVA  125

Query  129  ENSNPPLVGDYTLYTVQRPVTITLLGAVSGAGQLPWQAGRSVTDYLQDHPRLAGADKNNV  188
            EN NP L GDYTL+   +P T+T++G +S  G+ P+  GR V  YL +   L+GAD +  
Sbjct  126  ENGNPSLEGDYTLWLPAQPSTVTVMGLLSSPGKKPFTPGRDVASYLDEQSLLSGADNSYA  185

Query  189  MVITPEGETVVAPVALWNKRHVEPPPGSQLWLGFSAHVLPEKYADLNDQIVSVLTQRVPD  248
             V+ P+G T  APVA WNKRH+EP PGS +++GF+ H   + Y  LN  I+  L QR+P+
Sbjct  186  WVVYPDGHTQKAPVAYWNKRHIEPMPGSIIFVGFADHFWTKAYDGLNADILRSLIQRIPE  245


>ref|ZP_08366591.1| conserved hypothetical protein [Escherichia coli TA143]
 gb|EGI28742.1| conserved hypothetical protein [Escherichia coli TA143]
Length=245

 Score =  188 bits (478),  Expect = 6e-46, Method: Compositional matrix adjust.
 Identities = 103/244 (42%), Positives = 149/244 (61%), Gaps = 1/244 (0%)

Query  4    LQSYFIASVLYVMTPHAFAQGTVTIYLPGEQQTLSVGPVENVVQLVTQPQLRDRLWWPGA  63
            ++   +A +L V     FA GTV ++  G  +  ++   E+++ LV QP+L +  WWPGA
Sbjct  2    IKQTIVALILSVGASSVFAAGTVKVFSNGSSEAKTLTGAEHLIDLVGQPRLANS-WWPGA  60

Query  64   LLTDSAAKAKALKDYQHVMAQLASWEAEADDDVAATIKSVRQQLLNLNITGRLPVKLDPD  123
            ++++  A A AL+  Q ++ +LA   A++  D AA I ++RQQ+  L +TGR  + LDPD
Sbjct  61   VISEELATAAALRQQQALLTRLAELAADSSADDAAAINALRQQIQALKVTGRQKINLDPD  120

Query  124  FVRVDENSNPPLVGDYTLYTVQRPVTITLLGAVSGAGQLPWQAGRSVTDYLQDHPRLAGA  183
             VRV E  NPPL G+YTL+    P T+TL G +S  G+ P+  GR V  YL D   L+GA
Sbjct  121  IVRVAERGNPPLQGNYTLWVGPPPSTVTLFGLISRPGKQPFTPGRDVASYLSDQSLLSGA  180

Query  184  DKNNVMVITPEGETVVAPVALWNKRHVEPPPGSQLWLGFSAHVLPEKYADLNDQIVSVLT  243
            D++   V+ P+G T  APVA WNKRHVEP PGS +++G +  V  E    LN  I+  LT
Sbjct  181  DRSYAWVVYPDGRTQKAPVAYWNKRHVEPMPGSIIYVGLADSVWSETPDALNADILQTLT  240

Query  244  QRVP  247
            QR+P
Sbjct  241  QRIP  244


>ref|ZP_08497921.1| hypothetical protein HMPREF9086_2183 [Enterobacter hormaechei 
ATCC 49162]
 gb|EGK61041.1| hypothetical protein HMPREF9086_2183 [Enterobacter hormaechei 
ATCC 49162]
Length=245

 Score =  188 bits (477),  Expect = 8e-46, Method: Compositional matrix adjust.
 Identities = 98/240 (40%), Positives = 150/240 (62%), Gaps = 1/240 (0%)

Query  9    IASVLYVMTPHAFAQGTVTIYLPGEQQTLSVGPVENVVQLVTQPQLRDRLWWPGALLTDS  68
            +A +  + +P A++ GTV +Y P  ++  ++    +++ LV QP+L  + WW GA++++ 
Sbjct  7    VALLASLASPLAWSAGTVQVYTPDSEKPKTLTNAGHLLDLVGQPRLA-KSWWTGAVISER  65

Query  69   AAKAKALKDYQHVMAQLASWEAEADDDVAATIKSVRQQLLNLNITGRLPVKLDPDFVRVD  128
             A   A + +Q ++A+L+    + D D AA I S+RQQL  + +TGR  V LDPD VRV 
Sbjct  66   QATVVAEQKHQALLARLSGLAQQEDADDAAGINSLRQQLQAVKVTGRQKVNLDPDEVRVA  125

Query  129  ENSNPPLVGDYTLYTVQRPVTITLLGAVSGAGQLPWQAGRSVTDYLQDHPRLAGADKNNV  188
            EN NP L GDYTL+   +P T+T++G +S  G+ P+  GR V  YL +   L+GAD +  
Sbjct  126  ENGNPSLEGDYTLWLPAQPSTVTVMGLLSSPGKKPFTPGRDVASYLDEQSLLSGADNSYA  185

Query  189  MVITPEGETVVAPVALWNKRHVEPPPGSQLWLGFSAHVLPEKYADLNDQIVSVLTQRVPD  248
             V+ P+G T  APVA WNKRH+EP PGS +++GF+ H   + Y  LN  I+  L QR+P+
Sbjct  186  WVVYPDGHTQKAPVAYWNKRHIEPMPGSIIFVGFADHFWTKAYDGLNADILRSLIQRIPE  245


>ref|ZP_03048870.1| conserved hypothetical protein [Escherichia coli E110019]
 ref|ZP_07124032.1| hypothetical protein HMPREF9536_04298 [Escherichia coli MS 84-1]
 ref|ZP_07208830.1| hypothetical protein HMPREF9347_01280 [Escherichia coli MS 124-1]
 gb|EDV89316.1| conserved hypothetical protein [Escherichia coli E110019]
 gb|EFJ85444.1| hypothetical protein HMPREF9536_04298 [Escherichia coli MS 84-1]
 gb|EFK69681.1| hypothetical protein HMPREF9347_01280 [Escherichia coli MS 124-1]
 gb|EFU34662.1| conserved hypothetical protein [Escherichia coli MS 85-1]
Length=245

 Score =  187 bits (475),  Expect = 1e-45, Method: Compositional matrix adjust.
 Identities = 103/244 (42%), Positives = 148/244 (60%), Gaps = 1/244 (0%)

Query  4    LQSYFIASVLYVMTPHAFAQGTVTIYLPGEQQTLSVGPVENVVQLVTQPQLRDRLWWPGA  63
            ++   +A +L V     FA GTV ++  G  +  ++   E ++ LV QP+L +  WWPGA
Sbjct  2    IKQTIVALILSVGASSVFAAGTVKVFSNGSSEAKTLTGAEYLIDLVGQPRLANS-WWPGA  60

Query  64   LLTDSAAKAKALKDYQHVMAQLASWEAEADDDVAATIKSVRQQLLNLNITGRLPVKLDPD  123
            ++++  A A AL+  Q ++ +LA   A++  D AA I ++RQQ+  L +TGR  + LDPD
Sbjct  61   VISEELATAAALRQQQALLTRLAELAADSSADDAAAINALRQQIQALKVTGRQKINLDPD  120

Query  124  FVRVDENSNPPLVGDYTLYTVQRPVTITLLGAVSGAGQLPWQAGRSVTDYLQDHPRLAGA  183
             VRV E  NPPL G+YTL+    P T+TL G +S  G+ P+  GR V  YL D   L+GA
Sbjct  121  IVRVAERGNPPLQGNYTLWVGPPPSTVTLFGLISRPGKQPFTPGRDVASYLSDQSLLSGA  180

Query  184  DKNNVMVITPEGETVVAPVALWNKRHVEPPPGSQLWLGFSAHVLPEKYADLNDQIVSVLT  243
            D++   V+ P+G T  APVA WNKRHVEP PGS +++G +  V  E    LN  I+  LT
Sbjct  181  DRSYAWVVYPDGRTQKAPVAYWNKRHVEPMPGSIIYVGLADSVWSETPDALNADILQTLT  240

Query  244  QRVP  247
            QR+P
Sbjct  241  QRIP  244


>ref|YP_001726928.1| hypothetical protein EcolC_4002 [Escherichia coli ATCC 8739]
 ref|YP_003038179.1| hypothetical protein ECBD_4009 [Escherichia coli 'BL21-Gold(DE3)pLysS 
AG']
 ref|YP_003047072.1| hypothetical protein ECB_03900 [Escherichia coli B str. REL606]
 9 more sequence titles
 Length=245

 Score =  187 bits (475),  Expect = 1e-45, Method: Compositional matrix adjust.
 Identities = 103/244 (42%), Positives = 149/244 (61%), Gaps = 1/244 (0%)

Query  4    LQSYFIASVLYVMTPHAFAQGTVTIYLPGEQQTLSVGPVENVVQLVTQPQLRDRLWWPGA  63
            ++  F+A +L V     FA GTV ++  G  +  ++   E+++ LV QP+L +  WWPGA
Sbjct  2    IKQTFVALLLSVGASSVFAAGTVKVFSNGSSEAKTLTGAEHLIDLVGQPRLANS-WWPGA  60

Query  64   LLTDSAAKAKALKDYQHVMAQLASWEAEADDDVAATIKSVRQQLLNLNITGRLPVKLDPD  123
            ++++  A A AL+  Q ++ +LA   A++  D AA I ++RQQ+  L +TGR  + LDPD
Sbjct  61   VISEELATAAALRQQQALLTRLAELAADSSADDAAAINALRQQIQALKVTGRQKINLDPD  120

Query  124  FVRVDENSNPPLVGDYTLYTVQRPVTITLLGAVSGAGQLPWQAGRSVTDYLQDHPRLAGA  183
             VRV E  NPPL G+YTL+    P T+TL G +S  G+ P+  GR V  YL     L+GA
Sbjct  121  IVRVAERGNPPLQGNYTLWVGPPPSTVTLFGLISRPGKQPFTPGRDVASYLSGQNLLSGA  180

Query  184  DKNNVMVITPEGETVVAPVALWNKRHVEPPPGSQLWLGFSAHVLPEKYADLNDQIVSVLT  243
            D++   V+ P+G T  APVA WNKRHVEP PGS +++G +  V  E    LN  I+  LT
Sbjct  181  DRSYAWVVYPDGRTQKAPVAYWNKRHVEPMPGSIIYVGLADSVWSETPDALNADILQTLT  240

Query  244  QRVP  247
            QR+P
Sbjct  241  QRIP  244


>ref|NP_418452.1| conserved protein [Escherichia coli str. K-12 substr. MG1655]
 ref|AP_004529.1| hypothetical protein [Escherichia coli str. K-12 substr. W3110]
 ref|YP_001732805.1| hypothetical protein ECDH10B_4217 [Escherichia coli str. K-12 
substr. DH10B]
 29 more sequence titles
 Length=245

 Score =  187 bits (475),  Expect = 1e-45, Method: Compositional matrix adjust.
 Identities = 103/244 (42%), Positives = 149/244 (61%), Gaps = 1/244 (0%)

Query  4    LQSYFIASVLYVMTPHAFAQGTVTIYLPGEQQTLSVGPVENVVQLVTQPQLRDRLWWPGA  63
            ++   +A +L V     FA GTV ++  G  +  ++   E+++ LV QP+L +  WWPGA
Sbjct  2    IKQTIVALLLSVGASSVFAAGTVKVFSNGSSEAKTLTGAEHLIDLVGQPRLANS-WWPGA  60

Query  64   LLTDSAAKAKALKDYQHVMAQLASWEAEADDDVAATIKSVRQQLLNLNITGRLPVKLDPD  123
            ++++  A A AL+  Q ++ +LA   A++  D AA I ++RQQ+  L +TGR  + LDPD
Sbjct  61   VISEELATAAALRQQQALLTRLAEQGADSSADDAAAINALRQQIQALKVTGRQKINLDPD  120

Query  124  FVRVDENSNPPLVGDYTLYTVQRPVTITLLGAVSGAGQLPWQAGRSVTDYLQDHPRLAGA  183
             VRV E  NPPL G+YTL+    P T+TL G +S  G+ P+  GR V  YL D   L+GA
Sbjct  121  IVRVAERGNPPLQGNYTLWVGPPPSTVTLFGLISRPGKQPFTPGRDVASYLSDQSLLSGA  180

Query  184  DKNNVMVITPEGETVVAPVALWNKRHVEPPPGSQLWLGFSAHVLPEKYADLNDQIVSVLT  243
            D++   V+ P+G T  APVA WNKRHVEP PGS +++G +  V  E    LN  I+  LT
Sbjct  181  DRSYAWVVYPDGRTQKAPVAYWNKRHVEPMPGSIIYVGLADSVWSETPDALNADILQTLT  240

Query  244  QRVP  247
            QR+P
Sbjct  241  QRIP  244


>ref|YP_001882714.1| hypothetical protein SbBS512_E4533 [Shigella boydii CDC 3083-94]
 gb|ACD09338.1| conserved hypothetical protein [Shigella boydii CDC 3083-94]
Length=245

 Score =  187 bits (474),  Expect = 2e-45, Method: Compositional matrix adjust.
 Identities = 103/244 (42%), Positives = 149/244 (61%), Gaps = 1/244 (0%)

Query  4    LQSYFIASVLYVMTPHAFAQGTVTIYLPGEQQTLSVGPVENVVQLVTQPQLRDRLWWPGA  63
            ++  F+A +L V     FA GTV ++  G  +  ++   E+++ LV QP+L +  WWPGA
Sbjct  2    IKQTFVALLLSVGASSVFAAGTVKVFSNGSSEAKTLTGAEHLIDLVGQPRLANS-WWPGA  60

Query  64   LLTDSAAKAKALKDYQHVMAQLASWEAEADDDVAATIKSVRQQLLNLNITGRLPVKLDPD  123
            ++++  A A AL+  Q ++ +LA   A++  D AA I ++RQQ+  L +TGR  + LDPD
Sbjct  61   VISEELATAAALRQQQALLTRLAEQAADSSADDAAAINALRQQIQALKVTGRQKINLDPD  120

Query  124  FVRVDENSNPPLVGDYTLYTVQRPVTITLLGAVSGAGQLPWQAGRSVTDYLQDHPRLAGA  183
             VRV E  NPPL G+YTL+    P T+TL G +S  G+ P+  GR V  YL     L+GA
Sbjct  121  IVRVAERGNPPLQGNYTLWVGPPPSTVTLFGLISRPGKQPFTPGRDVASYLSGQNLLSGA  180

Query  184  DKNNVMVITPEGETVVAPVALWNKRHVEPPPGSQLWLGFSAHVLPEKYADLNDQIVSVLT  243
            D++   V+ P+G T  APVA WNKRHVEP PGS +++G +  V  E    LN  I+  LT
Sbjct  181  DRSYAWVVYPDGRTQKAPVAYWNKRHVEPMPGSIIYVGLADSVWSETPDALNADILQTLT  240

Query  244  QRVP  247
            QR+P
Sbjct  241  QRIP  244


>ref|YP_002295592.1| hypothetical protein ECSE_4317 [Escherichia coli SE11]
 dbj|BAG79841.1| conserved hypothetical protein [Escherichia coli SE11]
 gb|EGB86303.1| hypothetical protein HMPREF9542_04277 [Escherichia coli MS 117-3]
Length=245

 Score =  187 bits (474),  Expect = 2e-45, Method: Compositional matrix adjust.
 Identities = 103/244 (42%), Positives = 148/244 (60%), Gaps = 1/244 (0%)

Query  4    LQSYFIASVLYVMTPHAFAQGTVTIYLPGEQQTLSVGPVENVVQLVTQPQLRDRLWWPGA  63
            ++   +A +L V     FA GTV ++  G  +  ++   E ++ LV QP+L +  WWPGA
Sbjct  2    IKQTIVALILSVGASSVFAAGTVKVFSNGSSEAKTLTGAEYLIDLVGQPRLANS-WWPGA  60

Query  64   LLTDSAAKAKALKDYQHVMAQLASWEAEADDDVAATIKSVRQQLLNLNITGRLPVKLDPD  123
            ++++  A A AL+  Q ++ +LA   A++  D AA I ++RQQ+  L +TGR  + LDPD
Sbjct  61   VISEELATAAALRQQQALLTRLAEQGADSSADDAAAINALRQQIQALKVTGRQKINLDPD  120

Query  124  FVRVDENSNPPLVGDYTLYTVQRPVTITLLGAVSGAGQLPWQAGRSVTDYLQDHPRLAGA  183
             VRV E  NPPL G+YTL+    P T+TL G +S  G+ P+  GR V  YL D   L+GA
Sbjct  121  IVRVAERGNPPLQGNYTLWVGPPPSTVTLFGLISRPGKQPFTPGRDVASYLSDQSLLSGA  180

Query  184  DKNNVMVITPEGETVVAPVALWNKRHVEPPPGSQLWLGFSAHVLPEKYADLNDQIVSVLT  243
            D++   V+ P+G T  APVA WNKRHVEP PGS +++G +  V  E    LN  I+  LT
Sbjct  181  DRSYAWVVYPDGRTQKAPVAYWNKRHVEPMPGSIIYVGLADSVWSETPDALNADILQTLT  240

Query  244  QRVP  247
            QR+P
Sbjct  241  QRIP  244


>gb|EFW53980.1| YjbG polysaccharide synthesis-related protein [Shigella boydii 
ATCC 9905]
 gb|EGI89476.1| hypothetical protein SD15574_5023 [Shigella dysenteriae 155-74]
Length=245

 Score =  186 bits (473),  Expect = 2e-45, Method: Compositional matrix adjust.
 Identities = 103/244 (42%), Positives = 149/244 (61%), Gaps = 1/244 (0%)

Query  4    LQSYFIASVLYVMTPHAFAQGTVTIYLPGEQQTLSVGPVENVVQLVTQPQLRDRLWWPGA  63
            ++  F+A +L V     FA GTV ++  G  +  ++   E+++ LV QP+L +  WWPGA
Sbjct  2    IKQTFVALLLSVGASSVFAAGTVKVFSNGSSEAKTLTGAEHLIDLVGQPRLANS-WWPGA  60

Query  64   LLTDSAAKAKALKDYQHVMAQLASWEAEADDDVAATIKSVRQQLLNLNITGRLPVKLDPD  123
            ++++  A A AL+  Q ++ +LA   A++  D AA I ++RQQ+  L +TGR  + LDPD
Sbjct  61   VISEELATAAALRQQQALLTRLAEQGADSSADDAAAINALRQQIQALKVTGRQKINLDPD  120

Query  124  FVRVDENSNPPLVGDYTLYTVQRPVTITLLGAVSGAGQLPWQAGRSVTDYLQDHPRLAGA  183
             VRV E  NPPL G+YTL+    P T+TL G +S  G+ P+  GR V  YL     L+GA
Sbjct  121  IVRVAERGNPPLQGNYTLWVGPPPSTVTLFGLISRPGKQPFTPGRDVASYLSGQNLLSGA  180

Query  184  DKNNVMVITPEGETVVAPVALWNKRHVEPPPGSQLWLGFSAHVLPEKYADLNDQIVSVLT  243
            D++   V+ P+G T  APVA WNKRHVEP PGS +++G +  V  E    LN  I+  LT
Sbjct  181  DRSYAWVVYPDGRTQKAPVAYWNKRHVEPMPGSIIYVGLADSVWSETPDALNADILQTLT  240

Query  244  QRVP  247
            QR+P
Sbjct  241  QRIP  244


>ref|YP_003933062.1| hypothetical protein Pvag_3494 [Pantoea vagans C9-1]
 gb|ADO11613.1| Uncharacterized protein ymcB precursor [Pantoea vagans C9-1]
Length=247

 Score =  186 bits (473),  Expect = 2e-45, Method: Compositional matrix adjust.
 Identities = 98/229 (42%), Positives = 136/229 (59%), Gaps = 0/229 (0%)

Query  20   AFAQGTVTIYLPGEQQTLSVGPVENVVQLVTQPQLRDRLWWPGALLTDSAAKAKALKDYQ  79
            A A   V ++ P       +  + ++ QL T P L+    W   ++ +  A A A + YQ
Sbjct  19   AQATAQVIVHAPHNGGQAELSQIADLSQLATLPPLQANTDWRRTVIAERGASAVAQQQYQ  78

Query  80   HVMAQLASWEAEADDDVAATIKSVRQQLLNLNITGRLPVKLDPDFVRVDENSNPPLVGDY  139
              +  L +W A++  D AA I  V +QL  +N+TGR    LDPD++R+    N  L G Y
Sbjct  79   QTLGALRAWRADSSGDRAAAIDEVIRQLSAINVTGRQFTPLDPDWIRLHPADNRRLEGSY  138

Query  140  TLYTVQRPVTITLLGAVSGAGQLPWQAGRSVTDYLQDHPRLAGADKNNVMVITPEGETVV  199
             L+      ++ LLGA+SGAG++ WQ G+SV DYL+ HP L+GA++N V VI+P G T  
Sbjct  139  DLWLQTPSDSVLLLGALSGAGKVSWQPGKSVRDYLEGHPSLSGAERNFVTVISPSGATQQ  198

Query  200  APVALWNKRHVEPPPGSQLWLGFSAHVLPEKYADLNDQIVSVLTQRVPD  248
             PVA WN RH E  PGS +WLGFS+  LP  Y DLND+I+SVLT R+PD
Sbjct  199  VPVAYWNHRHAEVEPGSVIWLGFSSWSLPGSYEDLNDRILSVLTHRIPD  247


>emb|CBG37221.1| conserved hypothetical protein [Escherichia coli 042]
Length=244

 Score =  186 bits (472),  Expect = 3e-45, Method: Compositional matrix adjust.
 Identities = 102/244 (41%), Positives = 149/244 (61%), Gaps = 1/244 (0%)

Query  4    LQSYFIASVLYVMTPHAFAQGTVTIYLPGEQQTLSVGPVENVVQLVTQPQLRDRLWWPGA  63
            ++   +A +L V     FA GTV ++  G  +  ++   E+++ LV QP+L +  WWPGA
Sbjct  2    IKQTIVALILSVGASSVFAAGTVKVFSNGSSEAKTLTGAEHLIDLVGQPRLANS-WWPGA  60

Query  64   LLTDSAAKAKALKDYQHVMAQLASWEAEADDDVAATIKSVRQQLLNLNITGRLPVKLDPD  123
            ++++  A A AL+  Q ++ +LA   A++  D AA I ++RQQ+  L +TGR  + LDPD
Sbjct  61   VISEELATAAALRQQQALLTRLAEQGADSSADDAAAINALRQQIQALKVTGRQKINLDPD  120

Query  124  FVRVDENSNPPLVGDYTLYTVQRPVTITLLGAVSGAGQLPWQAGRSVTDYLQDHPRLAGA  183
             VRV E  NPPL G+YTL+    P T+TL G +S  G+ P+  GR V  YL D   L+GA
Sbjct  121  IVRVAERGNPPLQGNYTLWVGPPPSTVTLFGLISRPGKQPFTPGRDVASYLSDQSLLSGA  180

Query  184  DKNNVMVITPEGETVVAPVALWNKRHVEPPPGSQLWLGFSAHVLPEKYADLNDQIVSVLT  243
            D++   V+ P+G T  APVA WNKRHVEP PGS +++G +  V  +    LN  I+  LT
Sbjct  181  DRSYAWVVYPDGRTQKAPVAYWNKRHVEPMPGSIIYVGLADSVWSKTPDALNADILQTLT  240

Query  244  QRVP  247
            QR+P
Sbjct  241  QRIP  244


>emb|CAP78488.1| Uncharacterized protein yjbG [Escherichia coli LF82]
 gb|ADR29433.1| hypothetical protein NRG857_20125 [Escherichia coli O83:H1 str. 
NRG 857C]
 gb|EFU56817.1| conserved hypothetical protein [Escherichia coli MS 16-3]
 gb|EFW68025.1| YjbG polysaccharide synthesis-related protein [Escherichia coli 
WV_060327]
Length=245

 Score =  186 bits (472),  Expect = 3e-45, Method: Compositional matrix adjust.
 Identities = 102/244 (41%), Positives = 148/244 (60%), Gaps = 1/244 (0%)

Query  4    LQSYFIASVLYVMTPHAFAQGTVTIYLPGEQQTLSVGPVENVVQLVTQPQLRDRLWWPGA  63
            ++   +A +L V     FA GTV ++  G  +  ++   E+++ LV QP+L +  WWPGA
Sbjct  2    IKQTIVALILSVGASSVFAAGTVKVFSNGSSEAKTLTGAEHLIDLVGQPRLANS-WWPGA  60

Query  64   LLTDSAAKAKALKDYQHVMAQLASWEAEADDDVAATIKSVRQQLLNLNITGRLPVKLDPD  123
            ++++  A A AL+  Q ++ +LA   A++  D AA I ++RQQ+  L +TGR  + LDPD
Sbjct  61   VISEELATAAALRQQQALLTRLAELAADSSADDAAAINALRQQIQALKVTGRQKINLDPD  120

Query  124  FVRVDENSNPPLVGDYTLYTVQRPVTITLLGAVSGAGQLPWQAGRSVTDYLQDHPRLAGA  183
             VRV E  NPPL G+YTL+    P T+TL G +S  G+  +  GR V  YL D   L+GA
Sbjct  121  IVRVAERGNPPLQGNYTLWVGTPPSTVTLFGLISRPGKQSFTPGRDVASYLSDQSLLSGA  180

Query  184  DKNNVMVITPEGETVVAPVALWNKRHVEPPPGSQLWLGFSAHVLPEKYADLNDQIVSVLT  243
            D++   V+ P+G T  APVA WNKRHVEP PGS +++G +  V  E    LN  I+  LT
Sbjct  181  DRSYAWVVYPDGRTQKAPVAYWNKRHVEPMPGSIIYVGLADSVWSETPDALNADILQTLT  240

Query  244  QRVP  247
            QR+P
Sbjct  241  QRIP  244


>ref|ZP_06939706.1| hypothetical protein EcolOP_27029 [Escherichia coli OP50]
Length=245

 Score =  186 bits (472),  Expect = 3e-45, Method: Compositional matrix adjust.
 Identities = 103/244 (42%), Positives = 148/244 (60%), Gaps = 1/244 (0%)

Query  4    LQSYFIASVLYVMTPHAFAQGTVTIYLPGEQQTLSVGPVENVVQLVTQPQLRDRLWWPGA  63
            ++  F+A +L V     FA GTV ++  G  +  ++   E ++ LV QP+L +  WWPGA
Sbjct  2    IKQTFVALLLSVGASSVFAAGTVKVFSNGSSEAKTLTGAEYLIDLVGQPRLANS-WWPGA  60

Query  64   LLTDSAAKAKALKDYQHVMAQLASWEAEADDDVAATIKSVRQQLLNLNITGRLPVKLDPD  123
            ++++  A A AL+  Q ++ +LA   A++  D AA I ++RQQ+  L +TGR  + LDPD
Sbjct  61   VISEELATAAALRQQQALLTRLAELAADSSADDAAAINALRQQIQALKVTGRQKINLDPD  120

Query  124  FVRVDENSNPPLVGDYTLYTVQRPVTITLLGAVSGAGQLPWQAGRSVTDYLQDHPRLAGA  183
             VRV E  NPPL G+YTL+    P T+TL G +S  G+ P+  GR V  YL     L+GA
Sbjct  121  IVRVAERGNPPLQGNYTLWVGPPPSTVTLFGLISRPGKQPFTPGRDVASYLSGQNLLSGA  180

Query  184  DKNNVMVITPEGETVVAPVALWNKRHVEPPPGSQLWLGFSAHVLPEKYADLNDQIVSVLT  243
            D++   V+ P+G T  APVA WNKRHVEP PGS +++G +  V  E    LN  I+  LT
Sbjct  181  DRSYAWVVYPDGRTQKAPVAYWNKRHVEPMPGSIIYVGLADSVWSETPDALNADILQTLT  240

Query  244  QRVP  247
            QR+P
Sbjct  241  QRIP  244


>ref|ZP_07152932.1| conserved hypothetical protein [Escherichia coli MS 21-1]
 gb|EFK20313.1| conserved hypothetical protein [Escherichia coli MS 21-1]
Length=245

 Score =  186 bits (471),  Expect = 3e-45, Method: Compositional matrix adjust.
 Identities = 102/244 (41%), Positives = 147/244 (60%), Gaps = 1/244 (0%)

Query  4    LQSYFIASVLYVMTPHAFAQGTVTIYLPGEQQTLSVGPVENVVQLVTQPQLRDRLWWPGA  63
            ++   +A +L       FA GTV ++  G  +  ++   E+++ LV QPQL +  WWPGA
Sbjct  2    IKQTIVALLLSAGASSVFAAGTVKVFSNGSSEAKTLTGAEHLIDLVGQPQLANS-WWPGA  60

Query  64   LLTDSAAKAKALKDYQHVMAQLASWEAEADDDVAATIKSVRQQLLNLNITGRLPVKLDPD  123
            ++++  A A AL+  Q ++ +LA   A++  D AA I ++RQQ+  L +TGR  + LDPD
Sbjct  61   VISEELATAAALRQQQALLTRLAELAADSSADDAAAINALRQQIQALKVTGRQKINLDPD  120

Query  124  FVRVDENSNPPLVGDYTLYTVQRPVTITLLGAVSGAGQLPWQAGRSVTDYLQDHPRLAGA  183
             VRV E  NPPL G+YTL+    P T+TL G +S  G+  +  GR V  YL D   L+GA
Sbjct  121  IVRVAERGNPPLQGNYTLWVGPPPSTVTLFGLISRPGKQSFTPGRDVASYLSDQSLLSGA  180

Query  184  DKNNVMVITPEGETVVAPVALWNKRHVEPPPGSQLWLGFSAHVLPEKYADLNDQIVSVLT  243
            D++   V+ P+G T  APVA WNKRHVEP PGS +++G +  V  E    LN  I+  LT
Sbjct  181  DRSYAWVVYPDGRTQKAPVAYWNKRHVEPMPGSIIYVGLADSVWSETSDALNADILQTLT  240

Query  244  QRVP  247
            QR+P
Sbjct  241  QRIP  244


>ref|YP_543535.1| hypothetical protein UTI89_C4596 [Escherichia coli UTI89]
 ref|YP_859620.1| hypothetical protein APECO1_2440 [Escherichia coli APEC O1]
 ref|YP_002394011.1| hypothetical protein ECS88_4501 [Escherichia coli S88]
 10 more sequence titles
 Length=245

 Score =  186 bits (471),  Expect = 4e-45, Method: Compositional matrix adjust.
 Identities = 102/244 (41%), Positives = 147/244 (60%), Gaps = 1/244 (0%)

Query  4    LQSYFIASVLYVMTPHAFAQGTVTIYLPGEQQTLSVGPVENVVQLVTQPQLRDRLWWPGA  63
            ++   +A +L V     FA GTV ++  G  +  ++   E+++ LV QP+L +  WWPGA
Sbjct  2    IKQTIVALILSVGASSVFAAGTVKVFSNGSSEAKTLTGAEHLIDLVGQPRLANS-WWPGA  60

Query  64   LLTDSAAKAKALKDYQHVMAQLASWEAEADDDVAATIKSVRQQLLNLNITGRLPVKLDPD  123
            ++++  A A AL+  Q ++ +LA   A++  D AA I ++RQQ+  L +TGR  + LDPD
Sbjct  61   VISEELATAAALRQQQALLTRLAELAADSSADDAAAINALRQQIQALKVTGRQKINLDPD  120

Query  124  FVRVDENSNPPLVGDYTLYTVQRPVTITLLGAVSGAGQLPWQAGRSVTDYLQDHPRLAGA  183
             VRV E  NPPL G+YTL+    P T+TL G +S  G+  +  GR V  YL D   L+GA
Sbjct  121  IVRVAERGNPPLQGNYTLWVGTPPSTVTLFGLISRPGKQSFTPGRDVASYLSDQSLLSGA  180

Query  184  DKNNVMVITPEGETVVAPVALWNKRHVEPPPGSQLWLGFSAHVLPEKYADLNDQIVSVLT  243
            D++   V+ P G T  APVA WNKRHVEP PGS +++G +  V  E    LN  I+  LT
Sbjct  181  DRSYAWVVYPSGRTQKAPVAYWNKRHVEPMPGSIIYVGLADSVWSETPDALNADILQTLT  240

Query  244  QRVP  247
            QR+P
Sbjct  241  QRIP  244


>ref|ZP_06660155.1| YjbG polysaccharide synthesis protein [Escherichia coli B185]
 gb|EFF03249.1| YjbG polysaccharide synthesis protein [Escherichia coli B185]
Length=245

 Score =  186 bits (471),  Expect = 4e-45, Method: Compositional matrix adjust.
 Identities = 102/244 (41%), Positives = 148/244 (60%), Gaps = 1/244 (0%)

Query  4    LQSYFIASVLYVMTPHAFAQGTVTIYLPGEQQTLSVGPVENVVQLVTQPQLRDRLWWPGA  63
            ++  F+A +L V     FA GTV ++  G  +  ++   E+++ LV QP+L +  WWPGA
Sbjct  2    IKQTFVALLLSVGASSVFAAGTVKVFSNGSSEAKTLTGAEHLIDLVGQPRLANS-WWPGA  60

Query  64   LLTDSAAKAKALKDYQHVMAQLASWEAEADDDVAATIKSVRQQLLNLNITGRLPVKLDPD  123
            ++++  A A AL+  Q ++ +LA   A++  D AA I ++RQQ+  L +TGR  + LDPD
Sbjct  61   VISEELATAAALRQQQALLTRLAELAADSSADDAAAINALRQQIQALKVTGRQKINLDPD  120

Query  124  FVRVDENSNPPLVGDYTLYTVQRPVTITLLGAVSGAGQLPWQAGRSVTDYLQDHPRLAGA  183
             VRV E +NPPL G+YTL+    P T+TL G +S  G  P+  GR V  YL     L+GA
Sbjct  121  IVRVAERANPPLQGNYTLWVGPPPSTVTLFGLISRPGNQPFTPGRDVASYLSGQSLLSGA  180

Query  184  DKNNVMVITPEGETVVAPVALWNKRHVEPPPGSQLWLGFSAHVLPEKYADLNDQIVSVLT  243
            D++   V+ P+G T  APVA WNKRHVEP PGS +++G +  V  E    LN  I+  L 
Sbjct  181  DRSYAWVVYPDGRTQKAPVAYWNKRHVEPMPGSIIYVGLADSVWSETPDALNADILQTLA  240

Query  244  QRVP  247
            QR+P
Sbjct  241  QRIP  244


>ref|ZP_08350965.1| conserved hypothetical protein [Escherichia coli M605]
 dbj|BAI57425.1| conserved hypothetical protein [Escherichia coli SE15]
 gb|EGH36867.1| YjbG polysaccharide synthesis-related protein [Escherichia coli 
AA86]
 gb|EGI13421.1| conserved hypothetical protein [Escherichia coli M605]
 gb|AEG39028.1| Hypothetical protein ECNA114_4184 [Escherichia coli NA114]
Length=245

 Score =  186 bits (471),  Expect = 4e-45, Method: Compositional matrix adjust.
 Identities = 102/244 (41%), Positives = 147/244 (60%), Gaps = 1/244 (0%)

Query  4    LQSYFIASVLYVMTPHAFAQGTVTIYLPGEQQTLSVGPVENVVQLVTQPQLRDRLWWPGA  63
            ++   +A +L V     FA GTV ++  G     ++   E+++ LV QP+L +  WWPGA
Sbjct  2    IKQTIVALILSVGASSVFAAGTVKVFSNGSSDAKTLTGAEHLIDLVGQPRLANS-WWPGA  60

Query  64   LLTDSAAKAKALKDYQHVMAQLASWEAEADDDVAATIKSVRQQLLNLNITGRLPVKLDPD  123
            ++++  A A AL+  Q +M ++A   A++  D AA I ++RQQ+  L +TGR  + LDPD
Sbjct  61   VISEELATAAALRQQQALMTRMAELAADSSADDAAAINALRQQIQALKVTGRQKINLDPD  120

Query  124  FVRVDENSNPPLVGDYTLYTVQRPVTITLLGAVSGAGQLPWQAGRSVTDYLQDHPRLAGA  183
             VRV E  NPPL G+YTL+    P T+TL G +S  G+  +  GR V  YL D   L+GA
Sbjct  121  IVRVAERGNPPLQGNYTLWVGPPPSTVTLFGLISRPGKQSFTPGRDVASYLSDQSLLSGA  180

Query  184  DKNNVMVITPEGETVVAPVALWNKRHVEPPPGSQLWLGFSAHVLPEKYADLNDQIVSVLT  243
            D++   V+ P+G T  APVA WNKRHVEP PGS +++G +  V  E    LN  I+  LT
Sbjct  181  DRSYAWVVYPDGRTQKAPVAYWNKRHVEPMPGSIIYVGLADSVWSETPDALNADILQTLT  240

Query  244  QRVP  247
            QR+P
Sbjct  241  QRIP  244


>ref|YP_001746417.1| hypothetical protein EcSMS35_4489 [Escherichia coli SMS-3-5]
 gb|ACB18208.1| conserved hypothetical protein [Escherichia coli SMS-3-5]
Length=245

 Score =  185 bits (470),  Expect = 4e-45, Method: Compositional matrix adjust.
 Identities = 102/244 (41%), Positives = 147/244 (60%), Gaps = 1/244 (0%)

Query  4    LQSYFIASVLYVMTPHAFAQGTVTIYLPGEQQTLSVGPVENVVQLVTQPQLRDRLWWPGA  63
            ++   +A +L       FA GTV ++  G  +  ++   E+++ LV QPQL +  WWPGA
Sbjct  2    IKQTIVALLLSAGASSVFAAGTVKVFSNGSSEAKTLTGAEHLIDLVGQPQLANS-WWPGA  60

Query  64   LLTDSAAKAKALKDYQHVMAQLASWEAEADDDVAATIKSVRQQLLNLNITGRLPVKLDPD  123
            ++++  A A AL+  Q ++ +LA   A++  D AA I ++RQQ+  L +TGR  + LDPD
Sbjct  61   VISEELATAAALRQQQALLTRLAELAADSSADDAAAINALRQQIQALKVTGRQKINLDPD  120

Query  124  FVRVDENSNPPLVGDYTLYTVQRPVTITLLGAVSGAGQLPWQAGRSVTDYLQDHPRLAGA  183
             VRV E  NPPL G+YTL+    P T+TL G +S  G+ P+  GR V  YL     L+GA
Sbjct  121  IVRVAERGNPPLQGNYTLWVGPPPSTVTLFGLISRPGKQPFTPGRDVASYLSGQNLLSGA  180

Query  184  DKNNVMVITPEGETVVAPVALWNKRHVEPPPGSQLWLGFSAHVLPEKYADLNDQIVSVLT  243
            D++   V+ P+G T  APVA WNKRHVEP PGS +++G +  V  E    LN  I+  LT
Sbjct  181  DRSYAWVVYPDGRTQKAPVAYWNKRHVEPMPGSIIYVGLADSVWSETSDALNADILQTLT  240

Query  244  QRVP  247
            QR+P
Sbjct  241  QRIP  244


>ref|ZP_08356834.1| conserved hypothetical protein [Escherichia coli M718]
 gb|EGI18290.1| conserved hypothetical protein [Escherichia coli M718]
Length=245

 Score =  185 bits (469),  Expect = 6e-45, Method: Compositional matrix adjust.
 Identities = 103/244 (42%), Positives = 148/244 (60%), Gaps = 1/244 (0%)

Query  4    LQSYFIASVLYVMTPHAFAQGTVTIYLPGEQQTLSVGPVENVVQLVTQPQLRDRLWWPGA  63
            ++   +A +L V T   FA GTV ++  G  +  ++   E+++ LV QP+L +  WWPGA
Sbjct  2    IKQTIVALLLSVGTSSVFAAGTVKVFSNGSGEAKTLTGAEHLIDLVGQPRLANS-WWPGA  60

Query  64   LLTDSAAKAKALKDYQHVMAQLASWEAEADDDVAATIKSVRQQLLNLNITGRLPVKLDPD  123
            ++++  A A AL+  Q ++ +LA   A++  D AA I ++RQQ+  L +TGR  + LDPD
Sbjct  61   VISEELATAAALRQQQALLTRLAELAADSSADDAAAINALRQQIQALKVTGRQKINLDPD  120

Query  124  FVRVDENSNPPLVGDYTLYTVQRPVTITLLGAVSGAGQLPWQAGRSVTDYLQDHPRLAGA  183
             VRV E  NPPL G+YTL+    P T+TL G +S  G+ P+  GR V  YL     L+GA
Sbjct  121  IVRVAERGNPPLQGNYTLWVGPPPSTVTLFGLISRPGKQPFTPGRDVASYLSGQNLLSGA  180

Query  184  DKNNVMVITPEGETVVAPVALWNKRHVEPPPGSQLWLGFSAHVLPEKYADLNDQIVSVLT  243
            D++   V+ P G T  APVA WNKRHVEP PGS +++G +  V  E    LN  I+  LT
Sbjct  181  DRSYAWVVYPHGRTQKAPVAYWNKRHVEPMPGSIIYVGLADSVWSEMPDALNADILQTLT  240

Query  244  QRVP  247
            QR+P
Sbjct  241  QRIP  244


>ref|YP_002415168.1| hypothetical protein ECUMN_4561 [Escherichia coli UMN026]
 ref|ZP_06646764.1| YjbG polysaccharide synthesis protein [Escherichia coli FVEC1412]
 ref|ZP_06988080.1| YjbG polysaccharide synthesis protein [Escherichia coli FVEC1302]
 ref|ZP_07118071.1| conserved hypothetical protein [Escherichia coli MS 198-1]
 emb|CAR15678.1| conserved hypothetical protein [Escherichia coli UMN026]
 gb|EFF02596.1| YjbG polysaccharide synthesis protein [Escherichia coli FVEC1412]
 gb|EFI22031.1| YjbG polysaccharide synthesis protein [Escherichia coli FVEC1302]
 gb|EFJ72462.1| conserved hypothetical protein [Escherichia coli MS 198-1]
Length=245

 Score =  185 bits (469),  Expect = 6e-45, Method: Compositional matrix adjust.
 Identities = 102/244 (41%), Positives = 148/244 (60%), Gaps = 1/244 (0%)

Query  4    LQSYFIASVLYVMTPHAFAQGTVTIYLPGEQQTLSVGPVENVVQLVTQPQLRDRLWWPGA  63
            ++   +A +L V     FA GTV ++  G  +  ++   E+++ LV QP+L +  WWPGA
Sbjct  2    IKQTIVALILSVGASSVFAAGTVKVFSNGSSEAKTLTGAEHLIDLVGQPRLANS-WWPGA  60

Query  64   LLTDSAAKAKALKDYQHVMAQLASWEAEADDDVAATIKSVRQQLLNLNITGRLPVKLDPD  123
            ++++  A A AL+  Q ++ +LA   A++  D AA I ++RQQ+  L +TGR  + LDPD
Sbjct  61   VISEELATAAALRQQQALLTRLAEQGADSSADDAAAINALRQQIQALKVTGRQKINLDPD  120

Query  124  FVRVDENSNPPLVGDYTLYTVQRPVTITLLGAVSGAGQLPWQAGRSVTDYLQDHPRLAGA  183
             VRV E  NPPL G+YTL+    P T+TL G +S  G+ P+  GR V  YL     L+GA
Sbjct  121  IVRVAERGNPPLQGNYTLWVGPPPSTVTLFGLISRPGKQPFTPGRDVASYLSGQNLLSGA  180

Query  184  DKNNVMVITPEGETVVAPVALWNKRHVEPPPGSQLWLGFSAHVLPEKYADLNDQIVSVLT  243
            D++   V+ P+G T  APVA WNKRHVEP PGS +++G +  V  E    LN  I+  LT
Sbjct  181  DRSYAWVVYPDGRTQKAPVAYWNKRHVEPMPGSIIYVGLADSVWSETPDALNADILQTLT  240

Query  244  QRVP  247
            QR+P
Sbjct  241  QRIP  244


>ref|YP_004214318.1| hypothetical protein Rahaq_3601 [Rahnella sp. Y9602]
 gb|ADW75191.1| protein of unknown function DUF1017 [Rahnella sp. Y9602]
Length=247

 Score =  184 bits (468),  Expect = 8e-45, Method: Compositional matrix adjust.
 Identities = 102/245 (41%), Positives = 141/245 (57%), Gaps = 9/245 (3%)

Query  8    FIASVLYVMTPHAFAQGTVTIYLPGEQ----QTLSVGPVENVVQLVTQPQLRDRLWWPGA  63
            FI S L  M   AF++G V +Y    Q    Q LS   ++++ Q++ +  L  + W PG 
Sbjct  8    FIFSALPGM---AFSEGNVAVYTSASQGQPAQVLS--HIKDMRQMMAESDLIRQSWSPGT  62

Query  64   LLTDSAAKAKALKDYQHVMAQLASWEAEADDDVAATIKSVRQQLLNLNITGRLPVKLDPD  123
            ++   AA  +A + Y  +  QL +W A    D   TI  V QQL  L +TGR    +D D
Sbjct  63   VIAVPAATPEAQQQYLSMQNQLKAWRATESGDTQQTINRVIQQLQGLQVTGRQFTPMDAD  122

Query  124  FVRVDENSNPPLVGDYTLYTVQRPVTITLLGAVSGAGQLPWQAGRSVTDYLQDHPRLAGA  183
             +  +  +N  L GDY +YT  RP ++ LLGAVSGAG+ PW AGR++ +YL DH  L+GA
Sbjct  123  LILNNNAANRQLQGDYRVYTATRPNSVLLLGAVSGAGKQPWVAGRTIREYLADHQFLSGA  182

Query  184  DKNNVMVITPEGETVVAPVALWNKRHVEPPPGSQLWLGFSAHVLPEKYADLNDQIVSVLT  243
            + N  +VI P G   V PVA WN RH E  PGS +W+GFS   LP ++ +LN  IVSVL 
Sbjct  183  NLNEAVVIDPNGTIRVVPVAYWNYRHAEAQPGSIIWVGFSDWTLPRQFKNLNQHIVSVLA  242

Query  244  QRVPD  248
             R+P+
Sbjct  243  HRIPE  247


>gb|EFX17993.1| hypothetical protein ECO2687_20949 [Escherichia coli O157:H- 
str. H 2687]
Length=245

 Score =  184 bits (468),  Expect = 9e-45, Method: Compositional matrix adjust.
 Identities = 102/244 (41%), Positives = 148/244 (60%), Gaps = 1/244 (0%)

Query  4    LQSYFIASVLYVMTPHAFAQGTVTIYLPGEQQTLSVGPVENVVQLVTQPQLRDRLWWPGA  63
            ++   +A +L V     FA GTV ++  G  +  ++   E+++ LV QP+L +  WWPGA
Sbjct  2    IKQTIVALLLSVRASSVFAAGTVKVFSNGSSEAKTLTGAEHLIDLVGQPRLANS-WWPGA  60

Query  64   LLTDSAAKAKALKDYQHVMAQLASWEAEADDDVAATIKSVRQQLLNLNITGRLPVKLDPD  123
            ++++  A A AL+  Q ++ +LA   A++  D AA I ++RQQ+  L +TGR  + LDPD
Sbjct  61   VISEELATAAALRQQQALLTRLAEQGADSSADDAAAINALRQQIQALKVTGRQKINLDPD  120

Query  124  FVRVDENSNPPLVGDYTLYTVQRPVTITLLGAVSGAGQLPWQAGRSVTDYLQDHPRLAGA  183
             VRV E +NPPL G+YTL+    P T+TL G +S  G  P+  GR V  YL     L+GA
Sbjct  121  IVRVAERANPPLQGNYTLWVGPPPSTVTLFGLISRPGNQPFTPGRDVASYLSGQSLLSGA  180

Query  184  DKNNVMVITPEGETVVAPVALWNKRHVEPPPGSQLWLGFSAHVLPEKYADLNDQIVSVLT  243
            D++   V+ P+G T  APVA WNKRHVEP PGS +++G +  V  E    LN  I+  LT
Sbjct  181  DRSYAWVVYPDGRTQKAPVAYWNKRHVEPMPGSIIYVGLADSVWSEMPDALNADILQTLT  240

Query  244  QRVP  247
            QR+P
Sbjct  241  QRIP  244


>gb|EGC12490.1| SLBB-domain-containing protein [Escherichia coli E1167]
Length=245

 Score =  184 bits (468),  Expect = 9e-45, Method: Compositional matrix adjust.
 Identities = 102/244 (41%), Positives = 147/244 (60%), Gaps = 1/244 (0%)

Query  4    LQSYFIASVLYVMTPHAFAQGTVTIYLPGEQQTLSVGPVENVVQLVTQPQLRDRLWWPGA  63
            ++   +A +L V     FA GTV ++  G  +  ++   E ++ LV QP+L +  WWPGA
Sbjct  2    IKQTIVALILSVGASSVFAAGTVKVFSNGSSEAKTLTGAEYLIDLVGQPRLANS-WWPGA  60

Query  64   LLTDSAAKAKALKDYQHVMAQLASWEAEADDDVAATIKSVRQQLLNLNITGRLPVKLDPD  123
            ++++  A A AL+  Q ++ +LA   A++  D AA I ++RQQ+  L +TGR  + LDPD
Sbjct  61   VISEELATAAALRQQQALLTRLAELAADSSADDAAAINALRQQIQALKVTGRQKINLDPD  120

Query  124  FVRVDENSNPPLVGDYTLYTVQRPVTITLLGAVSGAGQLPWQAGRSVTDYLQDHPRLAGA  183
             VRV E  NPPL G+YTL+    P T+TL G +S  G+ P+  GR V  YL     L+GA
Sbjct  121  IVRVAERGNPPLQGNYTLWVGPPPSTVTLFGLISRLGKQPFTPGRDVASYLSGQNLLSGA  180

Query  184  DKNNVMVITPEGETVVAPVALWNKRHVEPPPGSQLWLGFSAHVLPEKYADLNDQIVSVLT  243
            D++   V+ P+G T  APVA WNKRHVEP PGS +++G +  V  E    LN  I+  LT
Sbjct  181  DRSYAWVVYPDGRTQKAPVAYWNKRHVEPMPGSIIYVGLADSVWSETPDALNADILQTLT  240

Query  244  QRVP  247
            QR+P
Sbjct  241  QRIP  244


>ref|ZP_08386356.1| conserved hypothetical protein [Escherichia coli H299]
 gb|EGI48189.1| conserved hypothetical protein [Escherichia coli H299]
Length=245

 Score =  184 bits (468),  Expect = 9e-45, Method: Compositional matrix adjust.
 Identities = 102/244 (41%), Positives = 148/244 (60%), Gaps = 1/244 (0%)

Query  4    LQSYFIASVLYVMTPHAFAQGTVTIYLPGEQQTLSVGPVENVVQLVTQPQLRDRLWWPGA  63
            ++   +A +L V     FA GTV ++  G  +  ++   E+++ LV QP+L +  WWPGA
Sbjct  2    IKQTIVALILSVGASSVFAAGTVKVFSNGSSEAKTLTGAEHLIDLVGQPRLANS-WWPGA  60

Query  64   LLTDSAAKAKALKDYQHVMAQLASWEAEADDDVAATIKSVRQQLLNLNITGRLPVKLDPD  123
            ++++  A A AL+  Q ++ +LA   A++  D AA I ++RQQ+  L +TGR  + LDPD
Sbjct  61   VISEELATAAALRQQQALLTRLAELAADSSADDAAAINALRQQIQALKVTGRQKINLDPD  120

Query  124  FVRVDENSNPPLVGDYTLYTVQRPVTITLLGAVSGAGQLPWQAGRSVTDYLQDHPRLAGA  183
             VRV E  NPPL G+YTL+    P T+TL G +S  G+ P+  GR V  YL     L+GA
Sbjct  121  IVRVAERGNPPLQGNYTLWVGPPPSTVTLFGLISRPGKQPFIPGRDVASYLSGQNLLSGA  180

Query  184  DKNNVMVITPEGETVVAPVALWNKRHVEPPPGSQLWLGFSAHVLPEKYADLNDQIVSVLT  243
            D++   V+ P+G T  APVA WNKRHVEP PGS +++G +  V  E    LN  I+  LT
Sbjct  181  DRSYAWVVYPDGRTQKAPVAYWNKRHVEPMPGSIIYVGLADSVWSETPDALNADILQTLT  240

Query  244  QRVP  247
            QR+P
Sbjct  241  QRIP  244


>ref|YP_002389495.1| hypothetical protein ECIAI1_4253 [Escherichia coli IAI1]
 emb|CAR01003.1| conserved hypothetical protein [Escherichia coli IAI1]
Length=245

 Score =  184 bits (467),  Expect = 1e-44, Method: Compositional matrix adjust.
 Identities = 102/244 (41%), Positives = 148/244 (60%), Gaps = 1/244 (0%)

Query  4    LQSYFIASVLYVMTPHAFAQGTVTIYLPGEQQTLSVGPVENVVQLVTQPQLRDRLWWPGA  63
            ++   +A +L V     FA GTV ++  G  +  ++   E+++ LV QP+L +  WWPGA
Sbjct  2    IKQTIVALLLSVGASSVFAAGTVKVFSNGSSEAKTLTGAEHLIDLVGQPRLANS-WWPGA  60

Query  64   LLTDSAAKAKALKDYQHVMAQLASWEAEADDDVAATIKSVRQQLLNLNITGRLPVKLDPD  123
            ++++  A A AL+  Q ++ +LA   A++  D AA I ++RQQ+  L +TGR  + LDPD
Sbjct  61   VISEELATAAALRQQQALLTRLAEQAADSSADDAAAINALRQQIQALKVTGRQKINLDPD  120

Query  124  FVRVDENSNPPLVGDYTLYTVQRPVTITLLGAVSGAGQLPWQAGRSVTDYLQDHPRLAGA  183
             VRV E  NPPL G+YTL+    P T+TL G +S  G+ P+  GR V  YL     L+GA
Sbjct  121  IVRVAERGNPPLQGNYTLWVGPPPSTVTLFGLISRPGKQPFTPGRDVASYLSGQNLLSGA  180

Query  184  DKNNVMVITPEGETVVAPVALWNKRHVEPPPGSQLWLGFSAHVLPEKYADLNDQIVSVLT  243
            D++   V+ P+G T  APVA WNKRHVEP PGS +++G +  V  E    LN  I+  LT
Sbjct  181  DRSYAWVVYPDGRTQKAPVAYWNKRHVEPMPGSIIYVGLADSVWSETPDALNADILQTLT  240

Query  244  QRVP  247
            QR+P
Sbjct  241  QRIP  244


>ref|ZP_03029269.1| conserved hypothetical protein [Escherichia coli B7A]
 ref|YP_002405400.1| hypothetical protein EC55989_4516 [Escherichia coli 55989]
 ref|ZP_06664741.1| hypothetical protein ECCG_02649 [Escherichia coli B088]
 16 more sequence titles
 Length=245

 Score =  184 bits (467),  Expect = 1e-44, Method: Compositional matrix adjust.
 Identities = 102/244 (41%), Positives = 147/244 (60%), Gaps = 1/244 (0%)

Query  4    LQSYFIASVLYVMTPHAFAQGTVTIYLPGEQQTLSVGPVENVVQLVTQPQLRDRLWWPGA  63
            ++   +A +L V     FA GTV ++  G  +  ++   E ++ LV QP+L +  WWPGA
Sbjct  2    IKQTIVALILSVGASSVFAAGTVKVFSNGSSEAKTLTGAEYLIDLVGQPRLANS-WWPGA  60

Query  64   LLTDSAAKAKALKDYQHVMAQLASWEAEADDDVAATIKSVRQQLLNLNITGRLPVKLDPD  123
            ++++  A A AL+  Q ++ +LA   A++  D AA I ++RQQ+  L +TGR  + LDPD
Sbjct  61   VISEELATAAALRQQQALLTRLAELAADSSADDAAAINALRQQIQALKVTGRQKINLDPD  120

Query  124  FVRVDENSNPPLVGDYTLYTVQRPVTITLLGAVSGAGQLPWQAGRSVTDYLQDHPRLAGA  183
             VRV E  NPPL G+YTL+    P T+TL G +S  G+ P+  GR V  YL     L+GA
Sbjct  121  IVRVAERGNPPLQGNYTLWVGPPPSTVTLFGLISRPGKQPFTPGRDVASYLSGQNLLSGA  180

Query  184  DKNNVMVITPEGETVVAPVALWNKRHVEPPPGSQLWLGFSAHVLPEKYADLNDQIVSVLT  243
            D++   V+ P+G T  APVA WNKRHVEP PGS +++G +  V  E    LN  I+  LT
Sbjct  181  DRSYAWVVYPDGRTQKAPVAYWNKRHVEPMPGSIIYVGLADSVWSETPDALNADILQTLT  240

Query  244  QRVP  247
            QR+P
Sbjct  241  QRIP  244


>gb|EFZ75386.1| hypothetical protein ECRN5871_1899 [Escherichia coli RN587/1]
Length=245

 Score =  184 bits (467),  Expect = 1e-44, Method: Compositional matrix adjust.
 Identities = 102/244 (41%), Positives = 146/244 (59%), Gaps = 1/244 (0%)

Query  4    LQSYFIASVLYVMTPHAFAQGTVTIYLPGEQQTLSVGPVENVVQLVTQPQLRDRLWWPGA  63
            ++   +A +L       FA GTV ++  G  +  ++   E+++ LV QPQL +  WWPGA
Sbjct  2    IKQTIVALLLSAGASSVFAAGTVKVFSNGSSEAKTLTGAEHLIDLVGQPQLANS-WWPGA  60

Query  64   LLTDSAAKAKALKDYQHVMAQLASWEAEADDDVAATIKSVRQQLLNLNITGRLPVKLDPD  123
            ++++  A A AL   Q ++ +LA   A++  D AA I ++RQQ+  L +TGR  + LDPD
Sbjct  61   VISEELATAAALHQQQALLTRLAELAADSSADDAAAINALRQQIQALKVTGRQKINLDPD  120

Query  124  FVRVDENSNPPLVGDYTLYTVQRPVTITLLGAVSGAGQLPWQAGRSVTDYLQDHPRLAGA  183
             VRV E  NPPL G+YTL+    P T+TL G +S  G+  +  GR V  YL D   L+GA
Sbjct  121  IVRVAERGNPPLQGNYTLWVGTPPSTVTLFGLISRPGKQSFTPGRDVASYLSDQSLLSGA  180

Query  184  DKNNVMVITPEGETVVAPVALWNKRHVEPPPGSQLWLGFSAHVLPEKYADLNDQIVSVLT  243
            D++   V+ P+G T  APVA WNKRHVEP PGS +++G +  V  E    LN  I+  LT
Sbjct  181  DRSYAWVVYPDGRTQKAPVAYWNKRHVEPMPGSIIYVGLADSVWSETPDALNADILQTLT  240

Query  244  QRVP  247
            QR+P
Sbjct  241  QRIP  244


>gb|EGC06312.1| SLBB-domain-containing protein [Escherichia fergusonii B253]
Length=245

 Score =  184 bits (467),  Expect = 1e-44, Method: Compositional matrix adjust.
 Identities = 101/244 (41%), Positives = 147/244 (60%), Gaps = 1/244 (0%)

Query  4    LQSYFIASVLYVMTPHAFAQGTVTIYLPGEQQTLSVGPVENVVQLVTQPQLRDRLWWPGA  63
            ++    A ++ V     FA GTV ++  G  +  ++   E+++ LV QP+L +  WWPGA
Sbjct  2    IKQTIAAMIMSVGASSVFAAGTVKVFSNGSNEAKTLTGAEHLIDLVGQPRLANS-WWPGA  60

Query  64   LLTDSAAKAKALKDYQHVMAQLASWEAEADDDVAATIKSVRQQLLNLNITGRLPVKLDPD  123
            ++++  A A AL+  Q ++ +LA   A++  D AA I ++RQQ+  L +TGR  + LDPD
Sbjct  61   VISEELATAAALRQQQALLTRLAELAADSSADDAAAINALRQQIQALKVTGRQKINLDPD  120

Query  124  FVRVDENSNPPLVGDYTLYTVQRPVTITLLGAVSGAGQLPWQAGRSVTDYLQDHPRLAGA  183
             VRV E  NPPL G+YTL+    P T+TL G +S  G+ P+  GR V  YL     L+GA
Sbjct  121  IVRVAERGNPPLQGNYTLWVGPPPSTVTLFGLISRPGKQPFTPGRDVASYLSGQSLLSGA  180

Query  184  DKNNVMVITPEGETVVAPVALWNKRHVEPPPGSQLWLGFSAHVLPEKYADLNDQIVSVLT  243
            D++   V+ P+G T  APVA WNKRHVEP PGS +++G +  V  E    LN  I+  LT
Sbjct  181  DRSYAWVVYPDGRTQKAPVAYWNKRHVEPMPGSIIYVGLADSVWSETPDALNADILQTLT  240

Query  244  QRVP  247
            QR+P
Sbjct  241  QRIP  244


>ref|ZP_02805425.1| conserved hypothetical protein [Escherichia coli O157:H7 str. 
EC4076]
 ref|ZP_03081176.1| hypothetical protein EscherichcoliO157_04915 [Escherichia coli 
O157:H7 str. EC4024]
 ref|ZP_03248902.1| conserved hypothetical protein [Escherichia coli O157:H7 str. 
EC4206]
 17 more sequence titles
 Length=245

 Score =  184 bits (467),  Expect = 1e-44, Method: Compositional matrix adjust.
 Identities = 102/244 (41%), Positives = 148/244 (60%), Gaps = 1/244 (0%)

Query  4    LQSYFIASVLYVMTPHAFAQGTVTIYLPGEQQTLSVGPVENVVQLVTQPQLRDRLWWPGA  63
            ++   +A +L V     FA GTV ++  G  +  ++   E+++ LV QP+L +  WWPGA
Sbjct  2    IKQTIVALLLSVGASSVFAAGTVKVFSNGSSEAKTLTGAEHLIDLVGQPRLANS-WWPGA  60

Query  64   LLTDSAAKAKALKDYQHVMAQLASWEAEADDDVAATIKSVRQQLLNLNITGRLPVKLDPD  123
            ++++  A A AL+  Q ++ +LA   A++  D AA I ++RQQ+  L +TGR  + LDPD
Sbjct  61   VISEELATAAALRQQQALLTRLAEQGADSSADDAAAINALRQQIQALKVTGRQKINLDPD  120

Query  124  FVRVDENSNPPLVGDYTLYTVQRPVTITLLGAVSGAGQLPWQAGRSVTDYLQDHPRLAGA  183
             VRV E +NPPL G+YTL+    P T+TL G +S  G  P+  GR V  YL     L+GA
Sbjct  121  IVRVAERANPPLQGNYTLWVGPPPSTVTLFGLISRPGNQPFTPGRDVASYLSGQSLLSGA  180

Query  184  DKNNVMVITPEGETVVAPVALWNKRHVEPPPGSQLWLGFSAHVLPEKYADLNDQIVSVLT  243
            D++   V+ P+G T  APVA WNKRHVEP PGS +++G +  V  E    LN  I+  LT
Sbjct  181  DRSYAWVVYPDGRTQKAPVAYWNKRHVEPMPGSIIYVGLADSVWSEMPDALNADILQTLT  240

Query  244  QRVP  247
            QR+P
Sbjct  241  QRIP  244


>ref|NP_756848.1| hypothetical protein c4996 [Escherichia coli CFT073]
 ref|YP_672097.1| hypothetical protein ECP_4246 [Escherichia coli 536]
 ref|ZP_03033573.1| conserved hypothetical protein [Escherichia coli F11]
 25 more sequence titles
 Length=245

 Score =  184 bits (467),  Expect = 1e-44, Method: Compositional matrix adjust.
 Identities = 101/244 (41%), Positives = 147/244 (60%), Gaps = 1/244 (0%)

Query  4    LQSYFIASVLYVMTPHAFAQGTVTIYLPGEQQTLSVGPVENVVQLVTQPQLRDRLWWPGA  63
            ++   +A +L       FA GTV ++  G  +  ++   E+++ LV QP+L +  WWPGA
Sbjct  2    IKQTIVALLLSAGASSVFAAGTVKVFSNGSSEAKTLTGAEHLIDLVGQPRLANS-WWPGA  60

Query  64   LLTDSAAKAKALKDYQHVMAQLASWEAEADDDVAATIKSVRQQLLNLNITGRLPVKLDPD  123
            ++++  A A AL+  Q ++ +LA   A++  D AA I ++RQQ+  L +TGR  + LDPD
Sbjct  61   VISEELATAAALRQQQALLTRLAELAADSSADDAAAINALRQQIQALKVTGRQKINLDPD  120

Query  124  FVRVDENSNPPLVGDYTLYTVQRPVTITLLGAVSGAGQLPWQAGRSVTDYLQDHPRLAGA  183
             VRV E  NPPL G+YTL+    P T+TL G +S  G+  +  GR V  YL D   L+GA
Sbjct  121  IVRVAERGNPPLQGNYTLWVGTPPSTVTLFGLISRPGKQSFTPGRDVASYLSDQSLLSGA  180

Query  184  DKNNVMVITPEGETVVAPVALWNKRHVEPPPGSQLWLGFSAHVLPEKYADLNDQIVSVLT  243
            D++   V+ P+G T  APVA WNKRHVEP PGS +++G +  V  E    LN  I+  LT
Sbjct  181  DRSYAWVVYPDGRTQKAPVAYWNKRHVEPMPGSIIYVGLADSVWSETPDALNADILQTLT  240

Query  244  QRVP  247
            QR+P
Sbjct  241  QRIP  244


>ref|ZP_05433479.1| hypothetical protein ShiD9_11915 [Shigella sp. D9]
 ref|ZP_08393156.1| conserved hypothetical protein [Shigella sp. D9]
 gb|EGJ06441.1| conserved hypothetical protein [Shigella sp. D9]
Length=245

 Score =  184 bits (466),  Expect = 1e-44, Method: Compositional matrix adjust.
 Identities = 102/244 (41%), Positives = 147/244 (60%), Gaps = 1/244 (0%)

Query  4    LQSYFIASVLYVMTPHAFAQGTVTIYLPGEQQTLSVGPVENVVQLVTQPQLRDRLWWPGA  63
            ++   +A +L V     FA GTV ++  G  +  ++   E ++ LV QP+L +  WWPGA
Sbjct  2    IKQTIVALILSVGASSVFAAGTVKVFSNGSSEAKTLTGAEYLIDLVGQPRLANS-WWPGA  60

Query  64   LLTDSAAKAKALKDYQHVMAQLASWEAEADDDVAATIKSVRQQLLNLNITGRLPVKLDPD  123
            ++++  A A AL+  Q ++ +LA   A++  D AA I ++RQQ+  L +TGR  + LDPD
Sbjct  61   VISEELATAAALRQQQALLTRLAEQGADSSADDAAAINALRQQIQALKVTGRQKINLDPD  120

Query  124  FVRVDENSNPPLVGDYTLYTVQRPVTITLLGAVSGAGQLPWQAGRSVTDYLQDHPRLAGA  183
             VRV E  NPPL G+YTL+    P T+TL G +S  G+ P+  GR V  YL     L+GA
Sbjct  121  IVRVAERGNPPLQGNYTLWVGPPPSTVTLFGLISRPGKQPFTPGRDVASYLSGQNLLSGA  180

Query  184  DKNNVMVITPEGETVVAPVALWNKRHVEPPPGSQLWLGFSAHVLPEKYADLNDQIVSVLT  243
            D++   V+ P+G T  APVA WNKRHVEP PGS +++G +  V  E    LN  I+  LT
Sbjct  181  DRSYAWVVYPDGRTQKAPVAYWNKRHVEPMPGSIIYVGLADSVWSETPDALNADILQTLT  240

Query  244  QRVP  247
            QR+P
Sbjct  241  QRIP  244


>ref|NP_290662.1| hypothetical protein Z5626 [Escherichia coli O157:H7 EDL933]
 ref|NP_313038.1| hypothetical protein ECs5011 [Escherichia coli O157:H7 str. Sakai]
 ref|ZP_03440396.1| conserved hypothetical protein [Escherichia coli O157:H7 str. 
TW14588]
 gb|AAG59227.1|AE005635_7 orf, hypothetical protein [Escherichia coli O157:H7 str. EDL933]
 dbj|BAB38434.1| hypothetical protein [Escherichia coli O157:H7 str. Sakai]
 gb|ACI74042.1| hypothetical protein ECs5011 [Escherichia coli]
 gb|EEC28957.1| conserved hypothetical protein [Escherichia coli O157:H7 str. 
TW14588]
 gb|EGD70232.1| YjbG polysaccharide synthesis-related protein [Escherichia coli 
O157:H7 str. 1044]
Length=245

 Score =  184 bits (466),  Expect = 1e-44, Method: Compositional matrix adjust.
 Identities = 102/244 (41%), Positives = 147/244 (60%), Gaps = 1/244 (0%)

Query  4    LQSYFIASVLYVMTPHAFAQGTVTIYLPGEQQTLSVGPVENVVQLVTQPQLRDRLWWPGA  63
            ++   +A +L V     FA GTV ++  G  +  ++   E+++ LV QP+L +  WWPGA
Sbjct  2    IKQTIVALLLSVGASSVFAAGTVKVFSNGSSEAKTLTGAEHLIDLVGQPRLANS-WWPGA  60

Query  64   LLTDSAAKAKALKDYQHVMAQLASWEAEADDDVAATIKSVRQQLLNLNITGRLPVKLDPD  123
            ++++  A A AL+  Q ++ +LA   A++  D AA I ++RQQ+  L +TGR  + LDPD
Sbjct  61   VISEELATAAALRQQQALLTRLAEQGADSSADDAAAINALRQQIQALKVTGRQKINLDPD  120

Query  124  FVRVDENSNPPLVGDYTLYTVQRPVTITLLGAVSGAGQLPWQAGRSVTDYLQDHPRLAGA  183
             VRV E +NPPL G+YTL+    P T+TL G +S  G  P+  GR V  YL     L GA
Sbjct  121  IVRVAERANPPLQGNYTLWVGPPPSTVTLFGLISRPGNQPFTPGRDVASYLSGQSLLGGA  180

Query  184  DKNNVMVITPEGETVVAPVALWNKRHVEPPPGSQLWLGFSAHVLPEKYADLNDQIVSVLT  243
            D++   V+ P+G T  APVA WNKRHVEP PGS +++G +  V  E    LN  I+  LT
Sbjct  181  DRSYAWVVYPDGRTQKAPVAYWNKRHVEPMPGSIIYVGLADSVWSEMPDALNADILQTLT  240

Query  244  QRVP  247
            QR+P
Sbjct  241  QRIP  244


>ref|YP_405613.1| hypothetical protein SDY_4220 [Shigella dysenteriae Sd197]
 ref|ZP_07678832.1| conserved hypothetical protein [Shigella dysenteriae 1617]
 gb|ABB64122.1| conserved hypothetical protein [Shigella dysenteriae Sd197]
 gb|EFP73168.1| conserved hypothetical protein [Shigella dysenteriae 1617]
Length=245

 Score =  183 bits (465),  Expect = 2e-44, Method: Compositional matrix adjust.
 Identities = 102/244 (41%), Positives = 148/244 (60%), Gaps = 1/244 (0%)

Query  4    LQSYFIASVLYVMTPHAFAQGTVTIYLPGEQQTLSVGPVENVVQLVTQPQLRDRLWWPGA  63
            ++   +A +L V     FA GTV ++  G  +  ++   E+++ LV QP+L +  WWPGA
Sbjct  2    IKQTIVALLLSVGASSVFAAGTVKVFSNGSSEAKTLTGAEHLIDLVGQPRLANS-WWPGA  60

Query  64   LLTDSAAKAKALKDYQHVMAQLASWEAEADDDVAATIKSVRQQLLNLNITGRLPVKLDPD  123
            ++++  A A AL+  Q ++ +LA   A++  D AA I ++RQQ+  L +TGR  + LDPD
Sbjct  61   VISEELATAAALRQQQALLTRLAEQGADSSADDAAAINALRQQIQVLKVTGRQKINLDPD  120

Query  124  FVRVDENSNPPLVGDYTLYTVQRPVTITLLGAVSGAGQLPWQAGRSVTDYLQDHPRLAGA  183
             VRV E +NPPL G+YTL+    P T+TL G +S  G  P+  GR V  YL     L+GA
Sbjct  121  IVRVAERANPPLQGNYTLWVGPPPSTVTLFGLISRPGNQPFTPGRDVASYLSGQSLLSGA  180

Query  184  DKNNVMVITPEGETVVAPVALWNKRHVEPPPGSQLWLGFSAHVLPEKYADLNDQIVSVLT  243
            D++   V+ P+G T  APVA WNKRHVEP PGS +++G +  V  E    LN  I+  LT
Sbjct  181  DRSYAWVVYPDGRTQKAPVAYWNKRHVEPMPGSIIYVGLADSVWSETPDALNADILQTLT  240

Query  244  QRVP  247
            QR+P
Sbjct  241  QRIP  244


>ref|YP_002385133.1| hypothetical protein EFER_4120 [Escherichia fergusonii ATCC 35469]
 emb|CAQ91542.1| conserved hypothetical protein [Escherichia fergusonii ATCC 35469]
Length=245

 Score =  183 bits (465),  Expect = 2e-44, Method: Compositional matrix adjust.
 Identities = 101/244 (41%), Positives = 147/244 (60%), Gaps = 1/244 (0%)

Query  4    LQSYFIASVLYVMTPHAFAQGTVTIYLPGEQQTLSVGPVENVVQLVTQPQLRDRLWWPGA  63
            ++    A ++ V     FA GTV ++  G  +  ++   E+++ LV QP+L +  WWPGA
Sbjct  2    IKQTIAALIMSVGASSVFAAGTVKVFSNGSNEAKTLTGAEHLIDLVGQPRLANS-WWPGA  60

Query  64   LLTDSAAKAKALKDYQHVMAQLASWEAEADDDVAATIKSVRQQLLNLNITGRLPVKLDPD  123
            ++++  A A AL+  Q ++ +LA   A++  D AA I ++RQQ+  L +TGR  + LDPD
Sbjct  61   VISEELATAAALRQQQALLTRLAELAADSSADDAAAINALRQQIQALKVTGRQKINLDPD  120

Query  124  FVRVDENSNPPLVGDYTLYTVQRPVTITLLGAVSGAGQLPWQAGRSVTDYLQDHPRLAGA  183
             VRV E  NPPL G+YTL+    P T+TL G +S  G+ P+  GR V  YL     L+GA
Sbjct  121  IVRVAERGNPPLQGNYTLWVGPPPSTVTLFGLISRPGKQPFTPGRDVASYLSGQSLLSGA  180

Query  184  DKNNVMVITPEGETVVAPVALWNKRHVEPPPGSQLWLGFSAHVLPEKYADLNDQIVSVLT  243
            D++   V+ P+G T  APVA WNKRHVEP PGS +++G +  V  E    LN  I+  LT
Sbjct  181  DRSYAWVVYPDGRTQKAPVAYWNKRHVEPMPGSIIYVGLADSVWSETPDALNADILQTLT  240

Query  244  QRVP  247
            QR+P
Sbjct  241  QRIP  244


>ref|YP_219090.1| hypothetical protein SC4103 [Salmonella enterica subsp. enterica 
serovar Choleraesuis str. SC-B67]
 ref|YP_002639789.1| hypothetical protein SPC_4285 [Salmonella enterica subsp. enterica 
serovar Paratyphi C strain RKS4594]
 gb|AAX68009.1| putative periplasmic protein [Salmonella enterica subsp. enterica 
serovar Choleraesuis str. SC-B67]
 gb|ACN48348.1| hypothetical protein SPC_4285 [Salmonella enterica subsp. enterica 
serovar Paratyphi C strain RKS4594]
 gb|EFZ08743.1| Uncharacterized protein yjbG [Salmonella enterica subsp. enterica 
serovar Choleraesuis str. SCSA50]
Length=245

 Score =  183 bits (465),  Expect = 2e-44, Method: Compositional matrix adjust.
 Identities = 95/239 (39%), Positives = 140/239 (58%), Gaps = 1/239 (0%)

Query  10   ASVLYVMTPHAFAQGTVTIYLPGEQQTLSVGPVENVVQLVTQPQLRDRLWWPGALLTDSA  69
            A  L  +   AFA GTVT++  G  +  ++   E ++ LV QP+L    WWP A++ +  
Sbjct  8    ALALAFIASSAFASGTVTVFTQGNSEPKTLTDAERLLDLVGQPRLATS-WWPAAVIGEEQ  66

Query  70   AKAKALKDYQHVMAQLASWEAEADDDVAATIKSVRQQLLNLNITGRLPVKLDPDFVRVDE  129
                A +  Q ++ +LA+  AE D D A  I ++R+Q+  + +TGR  V LDPD VRV E
Sbjct  67   TTVTARQQQQELLGRLAALSAEEDGDAAGAINTLRRQIQAVKVTGRQFVNLDPDVVRVSE  126

Query  130  NSNPPLVGDYTLYTVQRPVTITLLGAVSGAGQLPWQAGRSVTDYLQDHPRLAGADKNNVM  189
              NPPL G YTL+   +P  +TL G +S  G  P+  GR V  YL+D   L+GAD++   
Sbjct  127  RGNPPLQGHYTLWVGPQPTQVTLFGLISQPGSQPFVPGRDVASYLEDQRLLSGADRSYAW  186

Query  190  VITPEGETVVAPVALWNKRHVEPPPGSQLWLGFSAHVLPEKYADLNDQIVSVLTQRVPD  248
            V+ P+G +  APVA WNKRH+EP PGS +++GF+  +       +N  I+  LTQR+P+
Sbjct  187  VVYPDGRSQKAPVAYWNKRHIEPMPGSIIFVGFADSLWRGTPEAMNADILHTLTQRIPE  245


>ref|YP_003943655.1| hypothetical protein Entcl_4138 [Enterobacter cloacae SCF1]
 gb|ADO50371.1| protein of unknown function DUF1017 [Enterobacter cloacae SCF1]
Length=246

 Score =  182 bits (463),  Expect = 3e-44, Method: Compositional matrix adjust.
 Identities = 97/248 (39%), Positives = 151/248 (60%), Gaps = 3/248 (1%)

Query  1    MNKLQSYFIASVLYVMTPHAFAQGTVTIYLPGEQQTLSVGPVENVVQLVTQPQLRDRLWW  60
            MNKL + F+   L +    ++A GTV +Y+ G  Q  ++     ++ LV QP+L    WW
Sbjct  1    MNKLPALFL--TLGMAAAPSWASGTVDVYMNGATQPKTLADAARLIDLVEQPRLAGS-WW  57

Query  61   PGALLTDSAAKAKALKDYQHVMAQLASWEAEADDDVAATIKSVRQQLLNLNITGRLPVKL  120
            PGA++      A AL+  Q ++++LA+  A ++ D AA I ++RQQL  + + GR  + L
Sbjct  58   PGAVIAAQPQTAVALQQKQALLSRLATLAARSNGDDAAAINALRQQLQAVRVVGRQFISL  117

Query  121  DPDFVRVDENSNPPLVGDYTLYTVQRPVTITLLGAVSGAGQLPWQAGRSVTDYLQDHPRL  180
            DPD VR  + +NPPL G Y+L+   +P T+TL G +S  G++ +  GR +  YL D   L
Sbjct  118  DPDQVRAGQLNNPPLEGKYSLWVGPQPGTVTLFGLISRPGKVAFTPGRDIASYLDDVSLL  177

Query  181  AGADKNNVMVITPEGETVVAPVALWNKRHVEPPPGSQLWLGFSAHVLPEKYADLNDQIVS  240
            +GA +++  VI P+G T  APVA WNKRHVEP PGS +++GF+  +   +Y +LN  ++ 
Sbjct  178  SGAGRSDAWVIYPDGRTEKAPVAYWNKRHVEPMPGSTIFVGFADALWTTQYDELNADVLR  237

Query  241  VLTQRVPD  248
             L QR+P+
Sbjct  238  ALAQRIPE  245


>gb|EFZ53016.1| hypothetical protein SS53G_2453 [Shigella sonnei 53G]
Length=220

 Score =  182 bits (462),  Expect = 4e-44, Method: Compositional matrix adjust.
 Identities = 95/220 (43%), Positives = 136/220 (61%), Gaps = 1/220 (0%)

Query  28   IYLPGEQQTLSVGPVENVVQLVTQPQLRDRLWWPGALLTDSAAKAKALKDYQHVMAQLAS  87
            ++  G  +  ++   E+++ LV QP+L +  WWPGA++++  A A AL+  Q ++ +LA 
Sbjct  1    MFSNGSSEAKTLTGAEHLIDLVGQPRLANS-WWPGAVISEELATAAALRQQQALLTRLAE  59

Query  88   WEAEADDDVAATIKSVRQQLLNLNITGRLPVKLDPDFVRVDENSNPPLVGDYTLYTVQRP  147
              A++  D AA I ++RQQ+  L +TGR  + LDPD VRV E  NPPL G+YTL+    P
Sbjct  60   QGADSSTDDAAAINALRQQIQALEVTGRQKINLDPDIVRVAERGNPPLQGNYTLWVGPPP  119

Query  148  VTITLLGAVSGAGQLPWQAGRSVTDYLQDHPRLAGADKNNVMVITPEGETVVAPVALWNK  207
             T+TL G +S  G  P+  GR V  YL D   L+GAD++   V+ P+G T  APVA WNK
Sbjct  120  STVTLFGLISHPGNQPFTPGRDVASYLSDQSLLSGADRSYAWVVYPDGRTQKAPVAYWNK  179

Query  208  RHVEPPPGSQLWLGFSAHVLPEKYADLNDQIVSVLTQRVP  247
            RHVEP PGS +++G +  V  E    LN  I+  LTQR+P
Sbjct  180  RHVEPMPGSIIYVGLADSVWSETPDALNADILQTLTQRIP  219


>gb|EFS13220.1| uncharacterized protein gfcC [Shigella flexneri 2a str. 2457T]
 gb|EGK33796.1| hypothetical protein SFK227_4042 [Shigella flexneri K-227]
Length=220

 Score =  182 bits (462),  Expect = 4e-44, Method: Compositional matrix adjust.
 Identities = 95/220 (43%), Positives = 136/220 (61%), Gaps = 1/220 (0%)

Query  28   IYLPGEQQTLSVGPVENVVQLVTQPQLRDRLWWPGALLTDSAAKAKALKDYQHVMAQLAS  87
            ++  G  +  ++   E+++ LV QP+L +  WWPGA++++  A A AL+  Q ++ +LA 
Sbjct  1    MFSNGSSEAKTLTGAEHLIDLVGQPRLANS-WWPGAVISEELATAAALRQQQALLTRLAE  59

Query  88   WEAEADDDVAATIKSVRQQLLNLNITGRLPVKLDPDFVRVDENSNPPLVGDYTLYTVQRP  147
              A++  D AA I ++RQQ+  L +TGR  + LDPD VRV E  NPPL G+YTL+    P
Sbjct  60   QGADSSTDDAAAINALRQQIQALKVTGRQKINLDPDIVRVAERGNPPLQGNYTLWVGPPP  119

Query  148  VTITLLGAVSGAGQLPWQAGRSVTDYLQDHPRLAGADKNNVMVITPEGETVVAPVALWNK  207
             T+TL G +S  G  P+  GR V  YL D   L+GAD++   V+ P+G T  APVA WNK
Sbjct  120  STVTLFGLISHPGNQPFTPGRDVASYLSDQSLLSGADRSYAWVVYPDGRTQKAPVAYWNK  179

Query  208  RHVEPPPGSQLWLGFSAHVLPEKYADLNDQIVSVLTQRVP  247
            RHVEP PGS +++G +  V  E    LN  I+  LTQR+P
Sbjct  180  RHVEPMPGSIIYVGLADSVWSETPDALNADILQTLTQRIP  219


>ref|ZP_02685432.1| conserved hypothetical protein [Salmonella enterica subsp. enterica 
serovar Hadar str. RI_05P066]
 gb|EDZ34654.1| conserved hypothetical protein [Salmonella enterica subsp. enterica 
serovar Hadar str. RI_05P066]
Length=245

 Score =  182 bits (462),  Expect = 4e-44, Method: Compositional matrix adjust.
 Identities = 95/239 (39%), Positives = 141/239 (58%), Gaps = 1/239 (0%)

Query  10   ASVLYVMTPHAFAQGTVTIYLPGEQQTLSVGPVENVVQLVTQPQLRDRLWWPGALLTDSA  69
            A  L  +   AFA GTVT++  G  +  ++   E ++ LV QP+L    WWP A++ +  
Sbjct  8    ALALAFIASSAFASGTVTVFTQGNSEPKTLTDAERLLDLVGQPRLATS-WWPAAVIGEEQ  66

Query  70   AKAKALKDYQHVMAQLASWEAEADDDVAATIKSVRQQLLNLNITGRLPVKLDPDFVRVDE  129
            A   A +  Q ++ +LA+  AE + D AA I ++R+Q+  + +TGR  V LDPD VRV E
Sbjct  67   ATVTARQQQQELLGRLAALSAEENGDAAAAINTLRRQIQAVKVTGRQFVNLDPDVVRVSE  126

Query  130  NSNPPLVGDYTLYTVQRPVTITLLGAVSGAGQLPWQAGRSVTDYLQDHPRLAGADKNNVM  189
              NPPL G YTL+   +P  +TL G +S  G  P+  GR V  YL+    L+GAD++   
Sbjct  127  RGNPPLQGHYTLWVGPQPTQVTLFGLISQPGSQPFVPGRDVASYLEGQRLLSGADRSYAW  186

Query  190  VITPEGETVVAPVALWNKRHVEPPPGSQLWLGFSAHVLPEKYADLNDQIVSVLTQRVPD  248
            V+ P+G +  APVA WNKRH+EP PGS +++GF+  +       +N  I+  LTQR+P+
Sbjct  187  VVYPDGRSQKAPVAYWNKRHIEPMPGSIIFVGFADSLWRGTPEAMNADILHTLTQRIPE  245


>ref|ZP_03059326.1| conserved hypothetical protein [Escherichia coli B171]
 ref|YP_003224603.1| hypothetical protein ECO103_4776 [Escherichia coli O103:H2 str. 
12009]
 gb|EDX31378.1| conserved hypothetical protein [Escherichia coli B171]
 dbj|BAI33469.1| conserved predicted protein [Escherichia coli O103:H2 str. 12009]
 gb|EFZ47328.1| hypothetical protein ECE128010_2426 [Escherichia coli E128010]
Length=245

 Score =  182 bits (461),  Expect = 5e-44, Method: Compositional matrix adjust.
 Identities = 101/244 (41%), Positives = 146/244 (59%), Gaps = 1/244 (0%)

Query  4    LQSYFIASVLYVMTPHAFAQGTVTIYLPGEQQTLSVGPVENVVQLVTQPQLRDRLWWPGA  63
            ++   +A +L V     FA GTV ++  G  +  ++   E ++ LV QP+L +  WWPGA
Sbjct  2    IKQTIVALILSVGASSVFAAGTVKVFSNGSSEAKTLTGAEYLIDLVGQPRLANS-WWPGA  60

Query  64   LLTDSAAKAKALKDYQHVMAQLASWEAEADDDVAATIKSVRQQLLNLNITGRLPVKLDPD  123
            ++++  A A A +  Q ++ +LA   A++  D AA I ++RQQ+  L +TGR  + LDPD
Sbjct  61   VISEELATAAASRQQQALLTRLAELAADSSADDAAAINALRQQIQALKVTGRQKINLDPD  120

Query  124  FVRVDENSNPPLVGDYTLYTVQRPVTITLLGAVSGAGQLPWQAGRSVTDYLQDHPRLAGA  183
             VRV E  NPPL G+YTL+    P T+TL G +S  G+ P+  GR V  YL     L+GA
Sbjct  121  IVRVAERGNPPLQGNYTLWVGPPPSTVTLFGLISRPGKQPFTPGRDVASYLSGQNLLSGA  180

Query  184  DKNNVMVITPEGETVVAPVALWNKRHVEPPPGSQLWLGFSAHVLPEKYADLNDQIVSVLT  243
            D++   V+ P+G T  APVA WNKRHVEP PGS +++G +  V  E    LN  I+  LT
Sbjct  181  DRSYAWVVYPDGRTQKAPVAYWNKRHVEPMPGSIIYVGLADSVWSETPDALNADILQTLT  240

Query  244  QRVP  247
            QR+P
Sbjct  241  QRIP  244


>ref|ZP_02809628.1| conserved hypothetical protein [Escherichia coli O157:H7 str. 
EC869]
 ref|ZP_05937737.1| hypothetical protein EscherichiacoliO157_02451 [Escherichia coli 
O157:H7 str. FRIK2000]
 ref|ZP_05949524.1| hypothetical protein EscherichiacoliO157EcO_14370 [Escherichia 
coli O157:H7 str. FRIK966]
 gb|EDU93445.1| conserved hypothetical protein [Escherichia coli O157:H7 str. 
EC869]
Length=245

 Score =  182 bits (461),  Expect = 6e-44, Method: Compositional matrix adjust.
 Identities = 101/244 (41%), Positives = 147/244 (60%), Gaps = 1/244 (0%)

Query  4    LQSYFIASVLYVMTPHAFAQGTVTIYLPGEQQTLSVGPVENVVQLVTQPQLRDRLWWPGA  63
            ++   +A +L V     FA GTV ++  G  +  ++   E+++ LV QP+L +  WWPGA
Sbjct  2    IKQTIVALLLSVGASSVFAAGTVKVFSNGSSEAKTLTGAEHLIDLVGQPRLANS-WWPGA  60

Query  64   LLTDSAAKAKALKDYQHVMAQLASWEAEADDDVAATIKSVRQQLLNLNITGRLPVKLDPD  123
            ++++  A A AL+  Q ++ +LA   A++  D AA I ++ QQ+  L +TGR  + LDPD
Sbjct  61   VISEELATAAALRQQQALLTRLAEQGADSSADDAAAINALHQQIQALKVTGRQKINLDPD  120

Query  124  FVRVDENSNPPLVGDYTLYTVQRPVTITLLGAVSGAGQLPWQAGRSVTDYLQDHPRLAGA  183
             VRV E +NPPL G+YTL+    P T+TL G +S  G  P+  GR V  YL     L+GA
Sbjct  121  IVRVAERANPPLQGNYTLWVGPPPSTVTLFGLISRPGNQPFTPGRDVASYLSGQSLLSGA  180

Query  184  DKNNVMVITPEGETVVAPVALWNKRHVEPPPGSQLWLGFSAHVLPEKYADLNDQIVSVLT  243
            D++   V+ P+G T  APVA WNKRHVEP PGS +++G +  V  E    LN  I+  LT
Sbjct  181  DRSYAWVVYPDGRTQKAPVAYWNKRHVEPMPGSIIYVGLADSVWSEMPDALNADILQTLT  240

Query  244  QRVP  247
            QR+P
Sbjct  241  QRIP  244


>ref|YP_002410323.1| hypothetical protein ECIAI39_4450 [Escherichia coli IAI39]
 emb|CAR20556.1| conserved hypothetical protein [Escherichia coli IAI39]
Length=245

 Score =  181 bits (459),  Expect = 1e-43, Method: Compositional matrix adjust.
 Identities = 99/244 (40%), Positives = 145/244 (59%), Gaps = 1/244 (0%)

Query  4    LQSYFIASVLYVMTPHAFAQGTVTIYLPGEQQTLSVGPVENVVQLVTQPQLRDRLWWPGA  63
            ++   +A +L       FA GTV ++  G  +  ++   E+++ LV QP+L +  WWPGA
Sbjct  2    IKQTIVALLLSAGVSSVFAAGTVKVFSNGSSEAKTLTGAEHLIDLVGQPRLANS-WWPGA  60

Query  64   LLTDSAAKAKALKDYQHVMAQLASWEAEADDDVAATIKSVRQQLLNLNITGRLPVKLDPD  123
            ++++  A A AL+  Q ++ +LA   A++  D A  I ++RQQ+  L +TGR  + LDPD
Sbjct  61   VISEELATAAALRQQQALLTRLAELTADSSADDADAINALRQQIQALKVTGRQKINLDPD  120

Query  124  FVRVDENSNPPLVGDYTLYTVQRPVTITLLGAVSGAGQLPWQAGRSVTDYLQDHPRLAGA  183
             VRV E  NPPL G+YTL+    P T+ L G +S  G+ P+  GR V  YL     L+GA
Sbjct  121  IVRVAERGNPPLQGNYTLWVGPPPSTVMLFGLISRPGKQPFTPGRDVASYLSGQNLLSGA  180

Query  184  DKNNVMVITPEGETVVAPVALWNKRHVEPPPGSQLWLGFSAHVLPEKYADLNDQIVSVLT  243
            D++   V+ P+G T  APVA WNKRHVEP PGS +++G +  V  E    LN  I+  LT
Sbjct  181  DRSYAWVVYPDGRTQKAPVAYWNKRHVEPMPGSIIYVGLADSVWSETPDALNADILQTLT  240

Query  244  QRVP  247
            QR+P
Sbjct  241  QRIP  244


>gb|EGB61141.1| SLBB-domain-containing protein [Escherichia coli M863]
 gb|EGE62083.1| hypothetical protein ECSTEC7V_4766 [Escherichia coli STEC_7v]
Length=245

 Score =  181 bits (458),  Expect = 1e-43, Method: Compositional matrix adjust.
 Identities = 100/244 (40%), Positives = 146/244 (59%), Gaps = 1/244 (0%)

Query  4    LQSYFIASVLYVMTPHAFAQGTVTIYLPGEQQTLSVGPVENVVQLVTQPQLRDRLWWPGA  63
            ++    A ++ V     FA GTV ++  G  +  ++   E+++ LV QP+L +  WWPGA
Sbjct  2    IKQTIAALIMSVGASSVFAAGTVKVFSNGSNEAKTLTGAEHLIDLVGQPRLANS-WWPGA  60

Query  64   LLTDSAAKAKALKDYQHVMAQLASWEAEADDDVAATIKSVRQQLLNLNITGRLPVKLDPD  123
            ++++  A A AL+  Q ++ +LA   A++  D AA I ++RQQ+  L +TGR  + LD D
Sbjct  61   VISEELATAAALRQQQALLTRLAELAADSSADDAAAINALRQQIQALKVTGRQKINLDSD  120

Query  124  FVRVDENSNPPLVGDYTLYTVQRPVTITLLGAVSGAGQLPWQAGRSVTDYLQDHPRLAGA  183
             VRV E  NPPL G+YTL+    P T+TL G +S  G+  +  GR V  YL D   L+GA
Sbjct  121  IVRVAERGNPPLQGNYTLWVGPPPSTVTLFGLISRPGKQSFTPGRDVASYLSDQSLLSGA  180

Query  184  DKNNVMVITPEGETVVAPVALWNKRHVEPPPGSQLWLGFSAHVLPEKYADLNDQIVSVLT  243
            D++   V+ P+G T  APVA WNKRHVEP PGS +++G +  V  E    LN  I+  LT
Sbjct  181  DRSYAWVVYPDGRTQKAPVAYWNKRHVEPMPGSIIYVGLADSVWSETPDALNADILQTLT  240

Query  244  QRVP  247
            QR+P
Sbjct  241  QRIP  244


>ref|ZP_07185547.1| conserved hypothetical protein [Escherichia coli MS 69-1]
 gb|EFJ81563.1| conserved hypothetical protein [Escherichia coli MS 69-1]
Length=245

 Score =  181 bits (458),  Expect = 1e-43, Method: Compositional matrix adjust.
 Identities = 100/240 (41%), Positives = 144/240 (60%), Gaps = 1/240 (0%)

Query  8    FIASVLYVMTPHAFAQGTVTIYLPGEQQTLSVGPVENVVQLVTQPQLRDRLWWPGALLTD  67
             +A +L V     FA GTV ++  G  +  ++   E+++ LV QP+L +  WWPGA++++
Sbjct  6    IVALILSVGASSVFAAGTVKVFSNGSSEAKTLTGAEHLIDLVGQPRLANS-WWPGAVISE  64

Query  68   SAAKAKALKDYQHVMAQLASWEAEADDDVAATIKSVRQQLLNLNITGRLPVKLDPDFVRV  127
              A A AL+  Q ++ +LA   A++  D AA I ++RQQ+  L +TGR  + LDPD VRV
Sbjct  65   ELATAAALRQQQALLTRLAELAADSSADDAAAINALRQQIQALKVTGRQKINLDPDIVRV  124

Query  128  DENSNPPLVGDYTLYTVQRPVTITLLGAVSGAGQLPWQAGRSVTDYLQDHPRLAGADKNN  187
             E  NPPL G+YTL+    P T+ L G +S  G+ P+   R V  YL     L+GAD++ 
Sbjct  125  AERGNPPLQGNYTLWVGPPPSTVMLFGLISRPGKQPFTPSRDVASYLSGQNLLSGADRSY  184

Query  188  VMVITPEGETVVAPVALWNKRHVEPPPGSQLWLGFSAHVLPEKYADLNDQIVSVLTQRVP  247
              V+ P+G T  APVA WNKRHVEP PGS +++G +  V  E    LN  I+  LTQR+P
Sbjct  185  AWVVYPDGRTQKAPVAYWNKRHVEPMPGSIIYVGLADSVWSETPDALNADILQTLTQRIP  244


>ref|ZP_03217856.1| conserved hypothetical protein [Salmonella enterica subsp. enterica 
serovar Javiana str. GA_MM04042433]
 gb|EDZ08514.1| conserved hypothetical protein [Salmonella enterica subsp. enterica 
serovar Javiana str. GA_MM04042433]
Length=244

 Score =  180 bits (457),  Expect = 2e-43, Method: Compositional matrix adjust.
 Identities = 97/239 (40%), Positives = 141/239 (58%), Gaps = 2/239 (0%)

Query  10   ASVLYVMTPHAFAQGTVTIYLPGEQQTLSVGPVENVVQLVTQPQLRDRLWWPGALLTDSA  69
            A  L  +   AFA GTVT++  G  +  ++   E ++ LV QP+L    WWP A++ +  
Sbjct  8    ALALAFVASSAFASGTVTVFTQGNSEPKTLTDAERLLDLVGQPRLATS-WWPAAVIGEEQ  66

Query  70   AKAKALKDYQHVMAQLASWEAEADDDVAATIKSVRQQLLNLNITGRLPVKLDPDFVRVDE  129
            A   A +  Q ++ +LA+  AE D D AA I ++R+Q+  L +TGR  V LDPD VRV E
Sbjct  67   ATVTARQQQQELLGRLAALSAEEDGDAAA-INTLRRQIQALKVTGRQFVNLDPDVVRVSE  125

Query  130  NSNPPLVGDYTLYTVQRPVTITLLGAVSGAGQLPWQAGRSVTDYLQDHPRLAGADKNNVM  189
              NPPL G YTL+   +P  +TL G +S  G  P+  GR V  YL+    L+GAD++   
Sbjct  126  RGNPPLQGHYTLWVGPQPTQVTLFGLISQPGSQPFVPGRDVASYLEGQRLLSGADRSYAW  185

Query  190  VITPEGETVVAPVALWNKRHVEPPPGSQLWLGFSAHVLPEKYADLNDQIVSVLTQRVPD  248
            V+ P+G +  APVA WNKRH+EP PGS +++GF+  +       +N  I+  LTQR+P+
Sbjct  186  VVYPDGRSQKAPVAYWNKRHIEPMPGSIIFVGFADSLWRGTPEAINADILHTLTQRIPE  244


>ref|ZP_03044141.1| conserved hypothetical protein [Escherichia coli E22]
 gb|EDV83910.1| conserved hypothetical protein [Escherichia coli E22]
Length=245

 Score =  180 bits (457),  Expect = 2e-43, Method: Compositional matrix adjust.
 Identities = 101/244 (41%), Positives = 145/244 (59%), Gaps = 1/244 (0%)

Query  4    LQSYFIASVLYVMTPHAFAQGTVTIYLPGEQQTLSVGPVENVVQLVTQPQLRDRLWWPGA  63
            ++   +A +L V     FA GTV ++  G  +  ++   E ++ LV QP L +  WWPGA
Sbjct  2    IKQTIVALILSVGASSVFAAGTVKVFSNGSSEAKTLTGAEYLIDLVGQPWLANS-WWPGA  60

Query  64   LLTDSAAKAKALKDYQHVMAQLASWEAEADDDVAATIKSVRQQLLNLNITGRLPVKLDPD  123
            ++++  A A A +  Q ++ +LA   A++  D AA I ++RQQ+  L +TGR  + LDPD
Sbjct  61   VISEELATAAASRQQQALLTRLAELAADSSADDAAAINALRQQIQALKVTGRQKINLDPD  120

Query  124  FVRVDENSNPPLVGDYTLYTVQRPVTITLLGAVSGAGQLPWQAGRSVTDYLQDHPRLAGA  183
             VRV E  NPPL G+YTL+    P T+TL G +S  G+ P+  GR V  YL     L+GA
Sbjct  121  IVRVAERGNPPLQGNYTLWVGPPPSTVTLFGLISRPGKQPFTPGRDVASYLSGQNLLSGA  180

Query  184  DKNNVMVITPEGETVVAPVALWNKRHVEPPPGSQLWLGFSAHVLPEKYADLNDQIVSVLT  243
            D++   V+ P+G T  APVA WNKRHVEP PGS +++G +  V  E    LN  I+  LT
Sbjct  181  DRSYAWVVYPDGRTQKAPVAYWNKRHVEPMPGSIIYVGLADSVWSETPDALNADILQTLT  240

Query  244  QRVP  247
            QR+P
Sbjct  241  QRIP  244


>ref|ZP_06651602.1| predicted protein [Escherichia coli B354]
 gb|EFF14495.1| predicted protein [Escherichia coli B354]
Length=245

 Score =  180 bits (456),  Expect = 2e-43, Method: Compositional matrix adjust.
 Identities = 100/244 (40%), Positives = 146/244 (59%), Gaps = 1/244 (0%)

Query  4    LQSYFIASVLYVMTPHAFAQGTVTIYLPGEQQTLSVGPVENVVQLVTQPQLRDRLWWPGA  63
            ++   +A +L V     FA GTV ++  G  +  ++   E+++ LV QP+L +  WWPGA
Sbjct  2    IKQTIVALILSVGASSVFAAGTVKVFSNGSSEAKTLTGAEHLIDLVGQPRLANS-WWPGA  60

Query  64   LLTDSAAKAKALKDYQHVMAQLASWEAEADDDVAATIKSVRQQLLNLNITGRLPVKLDPD  123
            ++++  A A AL+  Q ++ +LA   A++  D AA I ++RQQ+  L +TGR  + LD D
Sbjct  61   VISEELATAAALRQQQALLTRLAELTADSSADDAAAINALRQQIQALKVTGRQKINLDSD  120

Query  124  FVRVDENSNPPLVGDYTLYTVQRPVTITLLGAVSGAGQLPWQAGRSVTDYLQDHPRLAGA  183
             VRV E  NPPL G+YTL+    P T+TL G +S  G+ P+  GR V  YL     L+ A
Sbjct  121  IVRVAERGNPPLQGNYTLWVGPPPSTVTLFGLISRPGKQPFTPGRDVASYLSGQNLLSSA  180

Query  184  DKNNVMVITPEGETVVAPVALWNKRHVEPPPGSQLWLGFSAHVLPEKYADLNDQIVSVLT  243
            D++   V+ P+G T  APVA WNKRHVEP PGS +++G +  V  E    LN  I+  LT
Sbjct  181  DRSYAWVVYPDGRTQKAPVAYWNKRHVEPMPGSIIYVGLADSVWSETPDALNADILQTLT  240

Query  244  QRVP  247
            QR+P
Sbjct  241  QRIP  244


>ref|ZP_06356425.1| conserved hypothetical protein [Citrobacter youngae ATCC 29220]
 gb|EFE05535.1| conserved hypothetical protein [Citrobacter youngae ATCC 29220]
Length=245

 Score =  180 bits (456),  Expect = 2e-43, Method: Compositional matrix adjust.
 Identities = 100/245 (40%), Positives = 147/245 (60%), Gaps = 1/245 (0%)

Query  4    LQSYFIASVLYVMTPHAFAQGTVTIYLPGEQQTLSVGPVENVVQLVTQPQLRDRLWWPGA  63
            ++   +A V  +  P  FA G V + + G  +   +   E+++ LV QP+L +  WWPGA
Sbjct  2    IKRAVMALVFSLSVPSVFAAGDVKVMIAGSAEPKILTGAEHLIDLVGQPRLSNS-WWPGA  60

Query  64   LLTDSAAKAKALKDYQHVMAQLASWEAEADDDVAATIKSVRQQLLNLNITGRLPVKLDPD  123
            ++++  A  +A +  Q ++A+LA+  AE   D AA I ++RQQ+  L +TGR  + LDPD
Sbjct  61   VISEERATVEAERQQQALLARLAALSAEESGDDAAAINALRQQIQALKVTGRQKINLDPD  120

Query  124  FVRVDENSNPPLVGDYTLYTVQRPVTITLLGAVSGAGQLPWQAGRSVTDYLQDHPRLAGA  183
             VRV E  NPPL G+YTL+   +P  ITL G +S  G+ P+  GR V  YL    RL+GA
Sbjct  121  VVRVSERGNPPLQGNYTLWVGPQPTDITLFGLLSRPGKQPFMPGRDVASYLDGQSRLSGA  180

Query  184  DKNNVMVITPEGETVVAPVALWNKRHVEPPPGSQLWLGFSAHVLPEKYADLNDQIVSVLT  243
            D++   VI P+G T   P+A WNKRHVEP PGS +++G S  V      ++N  I+  LT
Sbjct  181  DRSYAWVIYPDGRTQKVPIAYWNKRHVEPMPGSIIFVGLSDAVWSSTPDEINADILRTLT  240

Query  244  QRVPD  248
            QR+P+
Sbjct  241  QRIPE  245


>ref|YP_001572429.1| hypothetical protein SARI_03458 [Salmonella enterica subsp. arizonae 
serovar 62:z4,z23:-- str. RSK2980]
 gb|ABX23287.1| hypothetical protein SARI_03458 [Salmonella enterica subsp. arizonae 
serovar 62:z4,z23:--]
Length=245

 Score =  180 bits (456),  Expect = 2e-43, Method: Compositional matrix adjust.
 Identities = 94/239 (39%), Positives = 140/239 (58%), Gaps = 1/239 (0%)

Query  10   ASVLYVMTPHAFAQGTVTIYLPGEQQTLSVGPVENVVQLVTQPQLRDRLWWPGALLTDSA  69
            A  L  +   AFA GTVT+++ G  +  ++     ++ LV QP+L    WWP A++ +  
Sbjct  8    ALALAFVASSAFASGTVTVFIKGHNEPKTLTDAGRLLDLVGQPRLATS-WWPAAVIGEEK  66

Query  70   AKAKALKDYQHVMAQLASWEAEADDDVAATIKSVRQQLLNLNITGRLPVKLDPDFVRVDE  129
            A A A +  Q ++ +LA+   E + D AA I ++R+Q+  + +TGR  V LDPD VRV E
Sbjct  67   ATATARQQQQELLGRLAALSTEENGDAAAAINALRRQIQAVKVTGRQFVNLDPDVVRVSE  126

Query  130  NSNPPLVGDYTLYTVQRPVTITLLGAVSGAGQLPWQAGRSVTDYLQDHPRLAGADKNNVM  189
              NPPL G+YTL+   +P  ITL G +S  G  P+  GR V  YL     L+GAD++   
Sbjct  127  RGNPPLQGNYTLWVGPQPTQITLFGLISRPGSQPFIPGRDVASYLDGQRLLSGADRSYAW  186

Query  190  VITPEGETVVAPVALWNKRHVEPPPGSQLWLGFSAHVLPEKYADLNDQIVSVLTQRVPD  248
            V+ P+G +  APVA WNKRH+EP PGS +++GF++         +N   +  LTQR+P+
Sbjct  187  VVYPDGRSQKAPVAYWNKRHIEPMPGSIIFVGFASSRWRGTPEAINADTLHTLTQRIPE  245


>ref|NP_458522.1| hypothetical protein STY4420 [Salmonella enterica subsp. enterica 
serovar Typhi str. CT18]
 ref|NP_807734.1| hypothetical protein t4130 [Salmonella enterica subsp. enterica 
serovar Typhi str. Ty2]
 ref|ZP_02656274.1| conserved hypothetical protein [Salmonella enterica subsp. enterica 
serovar Kentucky str. CDC 191]
 11 more sequence titles
 Length=244

 Score =  179 bits (453),  Expect = 4e-43, Method: Compositional matrix adjust.
 Identities = 96/239 (40%), Positives = 141/239 (58%), Gaps = 2/239 (0%)

Query  10   ASVLYVMTPHAFAQGTVTIYLPGEQQTLSVGPVENVVQLVTQPQLRDRLWWPGALLTDSA  69
            A  L  +   AFA GTVT++  G  +  ++   E ++ LV QP+L    WWP A++ +  
Sbjct  8    ALALAFVASSAFASGTVTVFTQGNSEPKTLTDAERLLDLVGQPRLATS-WWPAAVIGEEQ  66

Query  70   AKAKALKDYQHVMAQLASWEAEADDDVAATIKSVRQQLLNLNITGRLPVKLDPDFVRVDE  129
            A   A +  Q ++ +LA+  AE D D AA I ++R+Q+  + +TGR  V LDPD VRV E
Sbjct  67   ATVTARQQQQELLGRLAALSAEEDGDAAA-INTLRRQIQAVKVTGRQFVNLDPDVVRVSE  125

Query  130  NSNPPLVGDYTLYTVQRPVTITLLGAVSGAGQLPWQAGRSVTDYLQDHPRLAGADKNNVM  189
              NPPL G YTL+   +P  +TL G +S  G  P+  GR V  YL+    L+GAD++   
Sbjct  126  RGNPPLQGHYTLWVGPQPTQVTLFGLISQPGSQPFVPGRDVASYLEGQRLLSGADRSYAW  185

Query  190  VITPEGETVVAPVALWNKRHVEPPPGSQLWLGFSAHVLPEKYADLNDQIVSVLTQRVPD  248
            V+ P+G +  APVA WNKRH+EP PGS +++GF+  +       +N  I+  LTQR+P+
Sbjct  186  VVYPDGRSQKAPVAYWNKRHIEPMPGSIIFVGFADSLWRGTPEAINADILHTLTQRIPE  244


>ref|ZP_07380293.1| protein of unknown function DUF1017 [Pantoea sp. aB]
 gb|EFM18299.1| protein of unknown function DUF1017 [Pantoea sp. aB]
Length=247

 Score =  179 bits (453),  Expect = 5e-43, Method: Compositional matrix adjust.
 Identities = 94/222 (42%), Positives = 129/222 (58%), Gaps = 0/222 (0%)

Query  27   TIYLPGEQQTLSVGPVENVVQLVTQPQLRDRLWWPGALLTDSAAKAKALKDYQHVMAQLA  86
            T++ P       +  + ++ QLVT P L+    W    + +  A A A + YQ  +  L 
Sbjct  26   TVHTPHNGGQAELSQITDLSQLVTLPPLQVNTDWRSTFIAERGATAVARQQYQQTLGALR  85

Query  87   SWEAEADDDVAATIKSVRQQLLNLNITGRLPVKLDPDFVRVDENSNPPLVGDYTLYTVQR  146
            +W A++  D AA I  V +QL  + + GR    LDPD++R+    N  L G Y LY    
Sbjct  86   AWRADSSGDRAAAIDEVIRQLSAIKVAGRQFTSLDPDWIRLHPADNRRLEGSYDLYLQAP  145

Query  147  PVTITLLGAVSGAGQLPWQAGRSVTDYLQDHPRLAGADKNNVMVITPEGETVVAPVALWN  206
              ++ LLGA+SGAG++ WQ G+SV DYL  H  L+GA++N V VI P G T   PVA WN
Sbjct  146  TDSVLLLGALSGAGKVSWQPGKSVRDYLDGHDALSGAERNFVTVIAPSGATQQVPVAYWN  205

Query  207  KRHVEPPPGSQLWLGFSAHVLPEKYADLNDQIVSVLTQRVPD  248
            +RH E  PGS +WLGFS+  LP    DLND+I+SVLT R+PD
Sbjct  206  RRHAEVEPGSVIWLGFSSWSLPGSDEDLNDRILSVLTHRIPD  247


>ref|ZP_03357288.1| hypothetical protein SentesTyphi_01726 [Salmonella enterica subsp. 
enterica serovar Typhi str. E02-1180]
Length=244

 Score =  177 bits (450),  Expect = 1e-42, Method: Compositional matrix adjust.
 Identities = 96/239 (40%), Positives = 140/239 (58%), Gaps = 2/239 (0%)

Query  10   ASVLYVMTPHAFAQGTVTIYLPGEQQTLSVGPVENVVQLVTQPQLRDRLWWPGALLTDSA  69
            A  L  +   AFA GTVT++  G  +  ++   E ++ LV QP+L    WWP A++ +  
Sbjct  8    ALALAFVASSAFASGTVTVFTQGNSEPKTLTDAERLLDLVGQPRLATS-WWPAAVIGEEQ  66

Query  70   AKAKALKDYQHVMAQLASWEAEADDDVAATIKSVRQQLLNLNITGRLPVKLDPDFVRVDE  129
            A   A +  Q ++ +LA+  AE D D AA I ++R+Q+  + +TGR  V LDPD VRV E
Sbjct  67   ATVTARQQQQELLGRLAALSAEEDGDAAA-INTLRRQIQAVKVTGRQFVNLDPDVVRVSE  125

Query  130  NSNPPLVGDYTLYTVQRPVTITLLGAVSGAGQLPWQAGRSVTDYLQDHPRLAGADKNNVM  189
              NPPL G YTL+   +P  +TL G +S  G  P+  GR V  YL+    L+GAD +   
Sbjct  126  RGNPPLQGHYTLWVGPQPTQVTLFGLISQPGSQPFVPGRDVASYLEGQRLLSGADCSYAW  185

Query  190  VITPEGETVVAPVALWNKRHVEPPPGSQLWLGFSAHVLPEKYADLNDQIVSVLTQRVPD  248
            V+ P+G +  APVA WNKRH+EP PGS +++GF+  +       +N  I+  LTQR+P+
Sbjct  186  VVYPDGRSQKAPVAYWNKRHIEPMPGSIIFVGFADSLWRGTPEAINADILHTLTQRIPE  244


>ref|YP_004114116.1| hypothetical protein Pat9b_0234 [Pantoea sp. At-9b]
 gb|ADU67560.1| protein of unknown function DUF1017 [Pantoea sp. At-9b]
Length=245

 Score =  177 bits (449),  Expect = 1e-42, Method: Compositional matrix adjust.
 Identities = 102/248 (41%), Positives = 147/248 (59%), Gaps = 3/248 (1%)

Query  1    MNKLQSYFIASVLYVMTPHAFAQGTVTIYLPGEQQTLSVGPVENVVQLVTQPQLRDRLWW  60
            M K+ S      L+    +A A   V ++    + TL V   +N+ QL++ P +    WW
Sbjct  1    MKKITSLLAGLALFTAG-NALADSQVIVHDGPHRATLQVDHAQNLSQLLSNPAIHT--WW  57

Query  61   PGALLTDSAAKAKALKDYQHVMAQLASWEAEADDDVAATIKSVRQQLLNLNITGRLPVKL  120
            PG ++ + AA A A +  Q ++A L +W+A+   + AA I +V QQL    +TGR    L
Sbjct  58   PGTVIAEHAATAVAKQQQQQLLADLRAWQADNSGERAAAIGAVIQQLAATPVTGRQFTSL  117

Query  121  DPDFVRVDENSNPPLVGDYTLYTVQRPVTITLLGAVSGAGQLPWQAGRSVTDYLQDHPRL  180
            DPD+VR+   +N  L G Y LYT+  P  + +LGA+   G++ WQ GR+V  YL DH RL
Sbjct  118  DPDWVRLRPEANRILQGSYDLYTLAAPTQVLVLGALEHPGKVSWQPGRTVRSYLADHDRL  177

Query  181  AGADKNNVMVITPEGETVVAPVALWNKRHVEPPPGSQLWLGFSAHVLPEKYADLNDQIVS  240
            +G +++   VI P GET   P+A WN RHVE  PGS +WLGFS+  LP   +DLND+++S
Sbjct  178  SGGERSFATVIAPSGETQQVPIAYWNHRHVEVEPGSIIWLGFSSWSLPWGQSDLNDRMIS  237

Query  241  VLTQRVPD  248
            VLT R+PD
Sbjct  238  VLTHRIPD  245


>ref|ZP_08376264.1| conserved hypothetical protein [Escherichia coli TA280]
 gb|EGI38697.1| conserved hypothetical protein [Escherichia coli TA280]
Length=245

 Score =  177 bits (448),  Expect = 2e-42, Method: Compositional matrix adjust.
 Identities = 99/244 (40%), Positives = 145/244 (59%), Gaps = 1/244 (0%)

Query  4    LQSYFIASVLYVMTPHAFAQGTVTIYLPGEQQTLSVGPVENVVQLVTQPQLRDRLWWPGA  63
            ++   +A +L V     FA GTV ++  G  +  ++   E+++ LV QP+L +  WW GA
Sbjct  2    IKQTIVALLLSVGASSVFAAGTVKVFSNGSSEAKTLTGAEHLIDLVGQPRLANS-WWSGA  60

Query  64   LLTDSAAKAKALKDYQHVMAQLASWEAEADDDVAATIKSVRQQLLNLNITGRLPVKLDPD  123
            ++++  A A AL+  Q ++ +LA   A++  D AA I ++RQQ+  L +TGR  + LDPD
Sbjct  61   VISEELATAAALRQQQALLTRLAELTADSSADDAAAINALRQQIQALKVTGRQKINLDPD  120

Query  124  FVRVDENSNPPLVGDYTLYTVQRPVTITLLGAVSGAGQLPWQAGRSVTDYLQDHPRLAGA  183
             VRV E  NPPL G+YTL+    P T+ L G +S  G+  +  GR V  YL     L+GA
Sbjct  121  IVRVAERGNPPLQGNYTLWVGPPPSTVMLFGLISRPGKQSFTPGRDVASYLSGQNLLSGA  180

Query  184  DKNNVMVITPEGETVVAPVALWNKRHVEPPPGSQLWLGFSAHVLPEKYADLNDQIVSVLT  243
            D++   V+ P+G T  APVA WNKRHVEP PGS +++G +  V  E    LN  I+  LT
Sbjct  181  DRSYAWVVYPDGRTQKAPVAYWNKRHVEPMPGSIIYVGLADSVWSETPDALNADILQTLT  240

Query  244  QRVP  247
            QR+P
Sbjct  241  QRIP  244


>ref|ZP_04558593.1| conserved hypothetical protein [Citrobacter sp. 30_2]
 gb|EEH96041.1| conserved hypothetical protein [Citrobacter sp. 30_2]
Length=245

 Score =  176 bits (445),  Expect = 4e-42, Method: Compositional matrix adjust.
 Identities = 96/228 (42%), Positives = 141/228 (61%), Gaps = 1/228 (0%)

Query  21   FAQGTVTIYLPGEQQTLSVGPVENVVQLVTQPQLRDRLWWPGALLTDSAAKAKALKDYQH  80
            FA G+V ++    ++  ++   E+++ LV QP+L +  WWPGA++++  A  +A +  Q 
Sbjct  19   FAAGSVKVFTSASEEPKTLTGAEHLLDLVGQPRLSNS-WWPGAVISEERATMEAGRQQQA  77

Query  81   VMAQLASWEAEADDDVAATIKSVRQQLLNLNITGRLPVKLDPDFVRVDENSNPPLVGDYT  140
            ++A+LA+  AE   D AA I ++RQQ+  L +TGR  + LDPD VRV E  NPPL G+YT
Sbjct  78   LLARLAALSAEESGDDAAAINTLRQQIQALKVTGRQKINLDPDVVRVSERGNPPLQGNYT  137

Query  141  LYTVQRPVTITLLGAVSGAGQLPWQAGRSVTDYLQDHPRLAGADKNNVMVITPEGETVVA  200
            L+   +P  ITL G +S  G+ P+  GR V  YL    RL+GAD++   VI P+G T   
Sbjct  138  LWVGAQPTHITLFGLLSHPGKQPFMPGRDVASYLDGQSRLSGADRSFAWVIYPDGRTQKV  197

Query  201  PVALWNKRHVEPPPGSQLWLGFSAHVLPEKYADLNDQIVSVLTQRVPD  248
             VA WNKRHVEP PGS +++G S  V      ++N  I+  LTQR+P+
Sbjct  198  SVAYWNKRHVEPMPGSIIYVGLSDAVWSSTSDEINADILRTLTQRIPE  245


>ref|ZP_02346309.1| conserved hypothetical protein [Salmonella enterica subsp. enterica 
serovar Saintpaul str. SARA29]
 gb|EDZ10637.1| conserved hypothetical protein [Salmonella enterica subsp. enterica 
serovar Saintpaul str. SARA29]
Length=244

 Score =  175 bits (444),  Expect = 5e-42, Method: Compositional matrix adjust.
 Identities = 94/239 (39%), Positives = 139/239 (58%), Gaps = 2/239 (0%)

Query  10   ASVLYVMTPHAFAQGTVTIYLPGEQQTLSVGPVENVVQLVTQPQLRDRLWWPGALLTDSA  69
            A  L  +   AFA GTVT++  G  +  ++   E ++ LV QP+L    WWP A++ +  
Sbjct  8    ALALAFIASSAFASGTVTVFTQGNSEPKTLTDAERLLDLVGQPRLATS-WWPAAVIGEEQ  66

Query  70   AKAKALKDYQHVMAQLASWEAEADDDVAATIKSVRQQLLNLNITGRLPVKLDPDFVRVDE  129
            A   A +  Q ++ +LA+  AE + D AA I ++R+Q+  + +TGR  V LDPD VRV E
Sbjct  67   ATVTARQQQQELLGRLAALSAEEEGDAAA-INTLRRQIQAVKVTGRQFVNLDPDVVRVSE  125

Query  130  NSNPPLVGDYTLYTVQRPVTITLLGAVSGAGQLPWQAGRSVTDYLQDHPRLAGADKNNVM  189
              NPPL G YTL+    P  +TL G +S  G  P+  GR V  YL+    L+GAD++   
Sbjct  126  RGNPPLQGHYTLWVGPEPTQVTLFGLISQPGSQPFVPGRDVASYLEGQRLLSGADRSYAW  185

Query  190  VITPEGETVVAPVALWNKRHVEPPPGSQLWLGFSAHVLPEKYADLNDQIVSVLTQRVPD  248
            V+ P+G +  APVA WNKRH+EP PGS +++GF+  +       +N  I+  L QR+P+
Sbjct  186  VVYPDGRSQKAPVAYWNKRHIEPMPGSIIFVGFADSLWRGTPEAINADILHTLKQRIPE  244


>ref|ZP_07183334.1| conserved hypothetical protein [Escherichia coli MS 196-1]
 gb|EFI90597.1| conserved hypothetical protein [Escherichia coli MS 196-1]
 gb|AEE59357.1| conserved hypothetical protein [Escherichia coli UMNK88]
Length=220

 Score =  173 bits (438),  Expect = 2e-41, Method: Compositional matrix adjust.
 Identities = 95/220 (43%), Positives = 137/220 (62%), Gaps = 1/220 (0%)

Query  28   IYLPGEQQTLSVGPVENVVQLVTQPQLRDRLWWPGALLTDSAAKAKALKDYQHVMAQLAS  87
            ++  G  +  ++   E+++ LV QP+L +  WWPGA++++  A A AL+  Q ++ +LA 
Sbjct  1    MFSNGSSEAKTLTGAEHLIDLVGQPRLANS-WWPGAVISEELATAAALRQQQALLTRLAE  59

Query  88   WEAEADDDVAATIKSVRQQLLNLNITGRLPVKLDPDFVRVDENSNPPLVGDYTLYTVQRP  147
              A++  D AA I ++RQQ+  L +TGR  + LDPD VRV E  NPPL G+YTL+    P
Sbjct  60   QGADSSADDAAAINALRQQIQALKVTGRQKINLDPDIVRVAERGNPPLQGNYTLWVGPPP  119

Query  148  VTITLLGAVSGAGQLPWQAGRSVTDYLQDHPRLAGADKNNVMVITPEGETVVAPVALWNK  207
             T+TL G +S  G+ P+  GR V  YL D   L+GAD++   V+ P+G T  APVA WNK
Sbjct  120  STVTLFGLISRPGKQPFTPGRDVASYLSDQSLLSGADRSYAWVVYPDGRTQKAPVAYWNK  179

Query  208  RHVEPPPGSQLWLGFSAHVLPEKYADLNDQIVSVLTQRVP  247
            RHVEP PGS +++G +  V  E    LN  I+  LTQR+P
Sbjct  180  RHVEPMPGSIIYVGLADSVWSETPDALNADILQTLTQRIP  219


>ref|YP_002218119.1| hypothetical protein SeD_A4618 [Salmonella enterica subsp. enterica 
serovar Dublin str. CT_02021853]
 gb|ACH75379.1| conserved hypothetical protein [Salmonella enterica subsp. enterica 
serovar Dublin str. CT_02021853]
 gb|EGE32261.1| Putative periplasmic protein [Salmonella enterica subsp. enterica 
serovar Dublin str. SD3246]
Length=245

 Score =  172 bits (435),  Expect = 5e-41, Method: Compositional matrix adjust.
 Identities = 97/239 (40%), Positives = 142/239 (59%), Gaps = 1/239 (0%)

Query  10   ASVLYVMTPHAFAQGTVTIYLPGEQQTLSVGPVENVVQLVTQPQLRDRLWWPGALLTDSA  69
            A  L  +   AFA GTVT++  G  +  ++   E ++ LV QP+L    WWP A++ +  
Sbjct  8    ALALAFVASSAFASGTVTVFTQGNSEPKTLTDAERLLDLVGQPRLATS-WWPAAVIGEEQ  66

Query  70   AKAKALKDYQHVMAQLASWEAEADDDVAATIKSVRQQLLNLNITGRLPVKLDPDFVRVDE  129
            A   A +  Q ++ +LA+  AE D D AA I ++R+Q+  + +TGR  V LDPD VRV E
Sbjct  67   ATVTARQQQQELLGRLAALGAEEDGDAAAAINTLRRQIQAVKVTGRQFVNLDPDVVRVSE  126

Query  130  NSNPPLVGDYTLYTVQRPVTITLLGAVSGAGQLPWQAGRSVTDYLQDHPRLAGADKNNVM  189
              NPPL G YTL+   +P  +TL G +S  G  P+  GR V  YL+D   L+GAD++   
Sbjct  127  RGNPPLQGHYTLWVGPQPTQVTLFGLISQPGSQPFVPGRDVASYLEDQRLLSGADRSYAW  186

Query  190  VITPEGETVVAPVALWNKRHVEPPPGSQLWLGFSAHVLPEKYADLNDQIVSVLTQRVPD  248
            V+ P+G +  APVA WNKRH+EP PGS +++GF+  +       +N  I+  LTQR+P+
Sbjct  187  VVYPDGRSQKAPVAYWNKRHIEPMPGSIIFVGFADSLWRGTPEAMNADILHTLTQRIPE  245


>ref|ZP_03213760.1| conserved hypothetical protein [Salmonella enterica subsp. enterica 
serovar Virchow str. SL491]
 gb|EDZ02791.1| conserved hypothetical protein [Salmonella enterica subsp. enterica 
serovar Virchow str. SL491]
Length=245

 Score =  172 bits (435),  Expect = 6e-41, Method: Compositional matrix adjust.
 Identities = 97/239 (40%), Positives = 142/239 (59%), Gaps = 1/239 (0%)

Query  10   ASVLYVMTPHAFAQGTVTIYLPGEQQTLSVGPVENVVQLVTQPQLRDRLWWPGALLTDSA  69
            A  L  +   AFA GTVT++  G  +  ++   E ++ LV QP+L    WWP A++ +  
Sbjct  8    ALALAFVASSAFASGTVTVFTQGNSEPKTLTDAERLLDLVGQPRLATS-WWPAAVIGEEQ  66

Query  70   AKAKALKDYQHVMAQLASWEAEADDDVAATIKSVRQQLLNLNITGRLPVKLDPDFVRVDE  129
            A   A +  Q ++ +LA+  AE D D AA I ++R+Q+  + +TGR  V LDPD VRV E
Sbjct  67   ATVTARQQQQELLGRLAALGAEEDGDAAAAINTLRRQIQAVKVTGRQFVNLDPDVVRVSE  126

Query  130  NSNPPLVGDYTLYTVQRPVTITLLGAVSGAGQLPWQAGRSVTDYLQDHPRLAGADKNNVM  189
              NPPL G YTL+   +P  +TL G +S  G  P+  GR V  YL+D   L+GAD++   
Sbjct  127  RGNPPLQGHYTLWVGPQPTQVTLFGLISQPGSQPFVPGRDVASYLEDQRLLSGADRSYAW  186

Query  190  VITPEGETVVAPVALWNKRHVEPPPGSQLWLGFSAHVLPEKYADLNDQIVSVLTQRVPD  248
            V+ P+G +  APVA WNKRH+EP PGS +++GF+  +       +N  I+  LTQR+P+
Sbjct  187  VVYPDGRSQKAPVAYWNKRHIEPMPGSIIFVGFADSLWRGTPKAINADILHTLTQRIPE  245


>ref|YP_001591314.1| hypothetical protein SPAB_05204 [Salmonella enterica subsp. enterica 
serovar Paratyphi B str. SPB7]
 ref|ZP_02700750.1| conserved hypothetical protein [Salmonella enterica subsp. enterica 
serovar Newport str. SL317]
 gb|ABX70481.1| hypothetical protein SPAB_05204 [Salmonella enterica subsp. enterica 
serovar Paratyphi B str. SPB7]
 gb|EDX49107.1| conserved hypothetical protein [Salmonella enterica subsp. enterica 
serovar Newport str. SL317]
Length=245

 Score =  172 bits (435),  Expect = 6e-41, Method: Compositional matrix adjust.
 Identities = 97/239 (40%), Positives = 142/239 (59%), Gaps = 1/239 (0%)

Query  10   ASVLYVMTPHAFAQGTVTIYLPGEQQTLSVGPVENVVQLVTQPQLRDRLWWPGALLTDSA  69
            A  L  +   AFA GTVT++  G  +  ++   E ++ LV QP+L    WWP A++ +  
Sbjct  8    ALALAFVASSAFASGTVTVFTQGNSEPKTLTDAERLLDLVGQPRLATS-WWPAAVIGEEQ  66

Query  70   AKAKALKDYQHVMAQLASWEAEADDDVAATIKSVRQQLLNLNITGRLPVKLDPDFVRVDE  129
            A   A +  Q ++ +LA+  AE D D AA I ++R+Q+  + +TGR  V LDPD VRV E
Sbjct  67   ATVTARQQQQELLGRLAALGAEEDGDAAAAINTLRRQIQAVKVTGRQFVNLDPDVVRVSE  126

Query  130  NSNPPLVGDYTLYTVQRPVTITLLGAVSGAGQLPWQAGRSVTDYLQDHPRLAGADKNNVM  189
              NPPL G YTL+   +P  +TL G +S  G  P+  GR V  YL+D   L+GAD++   
Sbjct  127  RGNPPLQGHYTLWVGPQPTQVTLFGLISQPGSQPFVPGRDVASYLEDQRLLSGADRSYAW  186

Query  190  VITPEGETVVAPVALWNKRHVEPPPGSQLWLGFSAHVLPEKYADLNDQIVSVLTQRVPD  248
            V+ P+G +  APVA WNKRH+EP PGS +++GF+  +       +N  I+  LTQR+P+
Sbjct  187  VVYPDGRSQKAPVAYWNKRHIEPMPGSIIFVGFADSLWRGTPEAINADILHTLTQRIPE  245


>ref|ZP_02664389.1| conserved hypothetical protein [Salmonella enterica subsp. enterica 
serovar Schwarzengrund str. SL480]
 ref|YP_002117102.1| hypothetical protein SeSA_A4415 [Salmonella enterica subsp. enterica 
serovar Schwarzengrund str. CVM19633]
 gb|ACF92324.1| conserved hypothetical protein [Salmonella enterica subsp. enterica 
serovar Schwarzengrund str. CVM19633]
 gb|EDY27341.1| conserved hypothetical protein [Salmonella enterica subsp. enterica 
serovar Schwarzengrund str. SL480]
Length=245

 Score =  171 bits (433),  Expect = 1e-40, Method: Compositional matrix adjust.
 Identities = 97/239 (40%), Positives = 141/239 (58%), Gaps = 1/239 (0%)

Query  10   ASVLYVMTPHAFAQGTVTIYLPGEQQTLSVGPVENVVQLVTQPQLRDRLWWPGALLTDSA  69
            A  L  +   AFA GTVT++  G  +  ++   E ++ LV QP+L    WWP A++ +  
Sbjct  8    ALALAFVASSAFASGTVTVFTQGNSEPKTLTDAERLLDLVGQPRLATS-WWPAAVIGEEQ  66

Query  70   AKAKALKDYQHVMAQLASWEAEADDDVAATIKSVRQQLLNLNITGRLPVKLDPDFVRVDE  129
            A   A +  Q ++ +LA+  AE D D AA I ++R+Q+  L +TGR  V LDPD VRV E
Sbjct  67   ATVTARQQQQELLGRLAALSAEEDGDAAAAINTLRRQIQALKVTGRQFVNLDPDVVRVSE  126

Query  130  NSNPPLVGDYTLYTVQRPVTITLLGAVSGAGQLPWQAGRSVTDYLQDHPRLAGADKNNVM  189
              NPPL G YTL+   +P  +TL G +S  G  P+  GR V  YL+    L+GAD++   
Sbjct  127  RGNPPLQGHYTLWVGPQPTQVTLFGLISQPGSQPFVPGRDVASYLEGQRLLSGADRSYAW  186

Query  190  VITPEGETVVAPVALWNKRHVEPPPGSQLWLGFSAHVLPEKYADLNDQIVSVLTQRVPD  248
            V+ P+G +  APVA WNKRH+EP PGS +++GF+  +       +N  I+  LTQR+P+
Sbjct  187  VVYPDGRSQKAPVAYWNKRHIEPMPGSIIFVGFADSLWRGTPEAMNADILHTLTQRIPE  245


>gb|EFY12299.1| hypothetical protein SEEM315_09434 [Salmonella enterica subsp. 
enterica serovar Montevideo str. 315996572]
 gb|EFY15329.1| hypothetical protein SEEM971_09698 [Salmonella enterica subsp. 
enterica serovar Montevideo str. 495297-1]
 gb|EFY18999.1| hypothetical protein SEEM973_18592 [Salmonella enterica subsp. 
enterica serovar Montevideo str. 495297-3]
 32 more sequence titles
 Length=245

 Score =  171 bits (432),  Expect = 1e-40, Method: Compositional matrix adjust.
 Identities = 97/239 (40%), Positives = 141/239 (58%), Gaps = 1/239 (0%)

Query  10   ASVLYVMTPHAFAQGTVTIYLPGEQQTLSVGPVENVVQLVTQPQLRDRLWWPGALLTDSA  69
            A  L  +   AFA GTVT++  G  +  ++   E ++ LV QP+L    WWP A++ +  
Sbjct  8    ALALAFVASSAFASGTVTVFTQGNSEPKTLTDAERLLDLVGQPRLATS-WWPAAVIGEEQ  66

Query  70   AKAKALKDYQHVMAQLASWEAEADDDVAATIKSVRQQLLNLNITGRLPVKLDPDFVRVDE  129
            A   A +  Q ++ +LA+  AE D D AA I ++R+Q+  L +TGR  V LDPD VRV E
Sbjct  67   ATVTARQQQQELLGRLAALSAEEDGDAAAAINTLRRQIQALKVTGRQFVNLDPDVVRVSE  126

Query  130  NSNPPLVGDYTLYTVQRPVTITLLGAVSGAGQLPWQAGRSVTDYLQDHPRLAGADKNNVM  189
              NPPL G YTL+   +P  +TL G +S  G  P+  GR V  YL+    L+GAD++   
Sbjct  127  RGNPPLQGHYTLWVGPQPTQVTLFGLISQPGSQPFVPGRDVASYLEGQRLLSGADRSYAW  186

Query  190  VITPEGETVVAPVALWNKRHVEPPPGSQLWLGFSAHVLPEKYADLNDQIVSVLTQRVPD  248
            V+ P+G +  APVA WNKRH+EP PGS +++GF+  +       +N  I+  LTQR+P+
Sbjct  187  VVYPDGRSQKAPVAYWNKRHIEPMPGSIIFVGFADSLWRGTPEAINADILHTLTQRIPE  245


>ref|ZP_03069392.1| conserved hypothetical protein [Escherichia coli 101-1]
 ref|ZP_07788357.1| uncharacterized protein gfcC [Escherichia coli 1827-70]
 gb|EDX39839.1| conserved hypothetical protein [Escherichia coli 101-1]
 gb|EFP98976.1| uncharacterized protein gfcC [Escherichia coli 1827-70]
Length=220

 Score =  171 bits (432),  Expect = 1e-40, Method: Compositional matrix adjust.
 Identities = 94/220 (42%), Positives = 136/220 (61%), Gaps = 1/220 (0%)

Query  28   IYLPGEQQTLSVGPVENVVQLVTQPQLRDRLWWPGALLTDSAAKAKALKDYQHVMAQLAS  87
            ++  G  +  ++   E+++ LV QP+L +  WWPGA++++  A A AL+  Q ++ +LA 
Sbjct  1    MFSNGSSEAKTLTGAEHLIDLVGQPRLANS-WWPGAVISEELATAAALRQQQALLTRLAE  59

Query  88   WEAEADDDVAATIKSVRQQLLNLNITGRLPVKLDPDFVRVDENSNPPLVGDYTLYTVQRP  147
              A++  D AA I ++RQQ+  L +TGR  + LDPD VRV E  NPPL G+YTL+    P
Sbjct  60   LAADSSADDAAAINALRQQIQALKVTGRQKINLDPDIVRVAERGNPPLQGNYTLWVGPPP  119

Query  148  VTITLLGAVSGAGQLPWQAGRSVTDYLQDHPRLAGADKNNVMVITPEGETVVAPVALWNK  207
             T+TL G +S  G+ P+  GR V  YL     L+GAD++   V+ P+G T  APVA WNK
Sbjct  120  STVTLFGLISRPGKQPFTPGRDVASYLSGQNLLSGADRSYAWVVYPDGRTQKAPVAYWNK  179

Query  208  RHVEPPPGSQLWLGFSAHVLPEKYADLNDQIVSVLTQRVP  247
            RHVEP PGS +++G +  V  E    LN  I+  LTQR+P
Sbjct  180  RHVEPMPGSIIYVGLADSVWSETPDALNADILQTLTQRIP  219


>ref|YP_153099.1| hypothetical protein SPA4041 [Salmonella enterica subsp. enterica 
serovar Paratyphi A str. ATCC 9150]
 ref|YP_002144588.1| hypothetical protein SSPA3750 [Salmonella enterica subsp. enterica 
serovar Paratyphi A str. AKU_12601]
 gb|AAV79787.1| putative exported protein [Salmonella enterica subsp. enterica 
serovar Paratyphi A str. ATCC 9150]
 emb|CAR62034.1| putative exported protein [Salmonella enterica subsp. enterica 
serovar Paratyphi A str. AKU_12601]
Length=245

 Score =  170 bits (431),  Expect = 1e-40, Method: Compositional matrix adjust.
 Identities = 96/239 (40%), Positives = 141/239 (58%), Gaps = 1/239 (0%)

Query  10   ASVLYVMTPHAFAQGTVTIYLPGEQQTLSVGPVENVVQLVTQPQLRDRLWWPGALLTDSA  69
            A  L  +   AFA GTVT++  G  +  ++   E ++ LV QP+L    WWP A++ +  
Sbjct  8    ALALAFIASSAFASGTVTVFTQGNSEPKTLTDAERLLDLVGQPRLATS-WWPAAVIGEEQ  66

Query  70   AKAKALKDYQHVMAQLASWEAEADDDVAATIKSVRQQLLNLNITGRLPVKLDPDFVRVDE  129
            A   A +  Q ++ +LA+  AE D D AA I ++R+Q+  + +TGR  V LDPD VRV E
Sbjct  67   ATVTARQQQQELLGRLAALSAEEDGDAAAAINTLRRQIQAVKVTGRQFVNLDPDVVRVSE  126

Query  130  NSNPPLVGDYTLYTVQRPVTITLLGAVSGAGQLPWQAGRSVTDYLQDHPRLAGADKNNVM  189
              NPPL G Y L+   +P  +TL G +S  G  P+  GR V  YL+D   L+GAD++   
Sbjct  127  RGNPPLQGHYMLWVGPQPTQVTLFGLISQPGSQPFVPGRDVASYLEDQRLLSGADRSYAW  186

Query  190  VITPEGETVVAPVALWNKRHVEPPPGSQLWLGFSAHVLPEKYADLNDQIVSVLTQRVPD  248
            V+ P+G +  APVA WNKRH+EP PGS +++GF+  +       +N  I+  LTQR+P+
Sbjct  187  VVYPDGRSQKAPVAYWNKRHIEPMPGSIIFVGFADSLWRGTPEAINADILHTLTQRIPE  245


>ref|ZP_02799370.2| conserved hypothetical protein [Escherichia coli O157:H7 str. 
EC4196]
 ref|ZP_02775866.2| conserved hypothetical protein [Escherichia coli O157:H7 str. 
EC4113]
 ref|ZP_02780183.2| conserved hypothetical protein [Escherichia coli O157:H7 str. 
EC4401]
 12 more sequence titles
 Length=220

 Score =  170 bits (431),  Expect = 2e-40, Method: Compositional matrix adjust.
 Identities = 94/220 (42%), Positives = 136/220 (61%), Gaps = 1/220 (0%)

Query  28   IYLPGEQQTLSVGPVENVVQLVTQPQLRDRLWWPGALLTDSAAKAKALKDYQHVMAQLAS  87
            ++  G  +  ++   E+++ LV QP+L +  WWPGA++++  A A AL+  Q ++ +LA 
Sbjct  1    MFSNGSSEAKTLTGAEHLIDLVGQPRLANS-WWPGAVISEELATAAALRQQQALLTRLAE  59

Query  88   WEAEADDDVAATIKSVRQQLLNLNITGRLPVKLDPDFVRVDENSNPPLVGDYTLYTVQRP  147
              A++  D AA I ++RQQ+  L +TGR  + LDPD VRV E +NPPL G+YTL+    P
Sbjct  60   QGADSSADDAAAINALRQQIQALKVTGRQKINLDPDIVRVAERANPPLQGNYTLWVGPPP  119

Query  148  VTITLLGAVSGAGQLPWQAGRSVTDYLQDHPRLAGADKNNVMVITPEGETVVAPVALWNK  207
             T+TL G +S  G  P+  GR V  YL     L+GAD++   V+ P+G T  APVA WNK
Sbjct  120  STVTLFGLISRPGNQPFTPGRDVASYLSGQSLLSGADRSYAWVVYPDGRTQKAPVAYWNK  179

Query  208  RHVEPPPGSQLWLGFSAHVLPEKYADLNDQIVSVLTQRVP  247
            RHVEP PGS +++G +  V  E    LN  I+  LTQR+P
Sbjct  180  RHVEPMPGSIIYVGLADSVWSEMPDALNADILQTLTQRIP  219


>gb|EFW51577.1| YjbG polysaccharide synthesis-related protein [Shigella dysenteriae 
CDC 74-1112]
Length=220

 Score =  170 bits (431),  Expect = 2e-40, Method: Compositional matrix adjust.
 Identities = 94/220 (42%), Positives = 136/220 (61%), Gaps = 1/220 (0%)

Query  28   IYLPGEQQTLSVGPVENVVQLVTQPQLRDRLWWPGALLTDSAAKAKALKDYQHVMAQLAS  87
            ++  G  +  ++   E+++ LV QP+L +  WWPGA++++  A A AL+  Q ++ +LA 
Sbjct  1    MFSNGSSEAKTLTGAEHLIDLVGQPRLANS-WWPGAVISEELATAAALRQQQALLTRLAE  59

Query  88   WEAEADDDVAATIKSVRQQLLNLNITGRLPVKLDPDFVRVDENSNPPLVGDYTLYTVQRP  147
              A++  D AA I ++RQQ+  L +TGR  + LDPD VRV E  NPPL G+YTL+    P
Sbjct  60   QAADSSADDAAAINALRQQIQALKVTGRQKINLDPDIVRVAERGNPPLQGNYTLWVGPPP  119

Query  148  VTITLLGAVSGAGQLPWQAGRSVTDYLQDHPRLAGADKNNVMVITPEGETVVAPVALWNK  207
             T+TL G +S  G+ P+  GR V  YL     L+GAD++   V+ P+G T  APVA WNK
Sbjct  120  STVTLFGLISRPGKQPFTPGRDVASYLSGQNLLSGADRSYAWVVYPDGRTQKAPVAYWNK  179

Query  208  RHVEPPPGSQLWLGFSAHVLPEKYADLNDQIVSVLTQRVP  247
            RHVEP PGS +++G +  V  E    LN  I+  LTQR+P
Sbjct  180  RHVEPMPGSIIYVGLADSVWSETPDALNADILQTLTQRIP  219


>ref|YP_002043474.1| hypothetical protein SNSL254_A4566 [Salmonella enterica subsp. 
enterica serovar Newport str. SL254]
 gb|ACF62021.1| conserved hypothetical protein [Salmonella enterica subsp. enterica 
serovar Newport str. SL254]
Length=245

 Score =  170 bits (431),  Expect = 2e-40, Method: Compositional matrix adjust.
 Identities = 96/239 (40%), Positives = 141/239 (58%), Gaps = 1/239 (0%)

Query  10   ASVLYVMTPHAFAQGTVTIYLPGEQQTLSVGPVENVVQLVTQPQLRDRLWWPGALLTDSA  69
            A  L  +   AFA GTVT++  G  +  ++   E ++ LV QP+L    WWP A++ +  
Sbjct  8    ALALAFIASSAFASGTVTVFTQGNSEPKTLTDAERLLDLVGQPRLATS-WWPAAVIGEEQ  66

Query  70   AKAKALKDYQHVMAQLASWEAEADDDVAATIKSVRQQLLNLNITGRLPVKLDPDFVRVDE  129
            A   A +  Q ++ +LA+  AE D D AA I ++R+Q+  + +TGR  V LDPD VRV E
Sbjct  67   ATVTARQQQQELLGRLAALSAEEDGDAAAAINTLRRQIQAVKVTGRQLVNLDPDVVRVSE  126

Query  130  NSNPPLVGDYTLYTVQRPVTITLLGAVSGAGQLPWQAGRSVTDYLQDHPRLAGADKNNVM  189
              NPPL G YTL+   +P  +TL G +S  G  P+  GR V  YL+    L+GAD++   
Sbjct  127  RGNPPLQGHYTLWVGPQPTQVTLFGLISQPGSQPFVPGRDVASYLEGQRLLSGADRSYAW  186

Query  190  VITPEGETVVAPVALWNKRHVEPPPGSQLWLGFSAHVLPEKYADLNDQIVSVLTQRVPD  248
            V+ P+G +  APVA WNKRH+EP PGS +++GF+  +       +N  I+  LTQR+P+
Sbjct  187  VVYPDGRSQKAPVAYWNKRHIEPMPGSIIFVGFADSLWRGTPEAMNADILHTLTQRIPE  245


>ref|ZP_03064544.1| conserved hypothetical protein [Shigella dysenteriae 1012]
 gb|EDX35540.1| conserved hypothetical protein [Shigella dysenteriae 1012]
Length=220

 Score =  170 bits (431),  Expect = 2e-40, Method: Compositional matrix adjust.
 Identities = 94/220 (42%), Positives = 136/220 (61%), Gaps = 1/220 (0%)

Query  28   IYLPGEQQTLSVGPVENVVQLVTQPQLRDRLWWPGALLTDSAAKAKALKDYQHVMAQLAS  87
            ++  G  +  ++   E+++ LV QP+L +  WWPGA++++  A A AL+  Q ++ +LA 
Sbjct  1    MFSNGSSEAKTLTGAEHLIDLVGQPRLANS-WWPGAVISEELATAAALRQQQALLTRLAE  59

Query  88   WEAEADDDVAATIKSVRQQLLNLNITGRLPVKLDPDFVRVDENSNPPLVGDYTLYTVQRP  147
              A++  D AA I ++RQQ+  L +TGR  + LDPD VRV E  NPPL G+YTL+    P
Sbjct  60   QGADSSADDAAAINALRQQIQALKVTGRQKINLDPDIVRVAERGNPPLQGNYTLWVGPPP  119

Query  148  VTITLLGAVSGAGQLPWQAGRSVTDYLQDHPRLAGADKNNVMVITPEGETVVAPVALWNK  207
             T+TL G +S  G+ P+  GR V  YL     L+GAD++   V+ P+G T  APVA WNK
Sbjct  120  STVTLFGLISRPGKQPFTPGRDVASYLSGQNLLSGADRSYAWVVYPDGRTQKAPVAYWNK  179

Query  208  RHVEPPPGSQLWLGFSAHVLPEKYADLNDQIVSVLTQRVP  247
            RHVEP PGS +++G +  V  E    LN  I+  LTQR+P
Sbjct  180  RHVEPMPGSIIYVGLADSVWSETPDALNADILQTLTQRIP  219


>ref|YP_002149138.1| hypothetical protein SeAg_B4481 [Salmonella enterica subsp. enterica 
serovar Agona str. SL483]
 gb|ACH50713.1| conserved hypothetical protein [Salmonella enterica subsp. enterica 
serovar Agona str. SL483]
Length=245

 Score =  170 bits (431),  Expect = 2e-40, Method: Compositional matrix adjust.
 Identities = 96/245 (39%), Positives = 143/245 (58%), Gaps = 1/245 (0%)

Query  4    LQSYFIASVLYVMTPHAFAQGTVTIYLPGEQQTLSVGPVENVVQLVTQPQLRDRLWWPGA  63
            ++    A  L  +   AFA GTVT++  G  +  ++   E ++ LV QP+L    WWP A
Sbjct  2    MKRMISALALAFIASSAFASGTVTVFTQGNSEPKTLTDAERLLDLVGQPRLATS-WWPAA  60

Query  64   LLTDSAAKAKALKDYQHVMAQLASWEAEADDDVAATIKSVRQQLLNLNITGRLPVKLDPD  123
            ++ +  A   A +  Q ++ +LA+  AE D D AA I ++R+Q+  + +TGR  V LDPD
Sbjct  61   VIGEEQATVTARQQQQELLGRLAALSAEEDGDAAAAINTLRRQIQAVKVTGRQFVNLDPD  120

Query  124  FVRVDENSNPPLVGDYTLYTVQRPVTITLLGAVSGAGQLPWQAGRSVTDYLQDHPRLAGA  183
             VRV E  NPPL G YTL+   +P  +TL G +S  G  P+  GR V  YL+    L+GA
Sbjct  121  VVRVSERGNPPLQGHYTLWVGPQPTQVTLFGLISQPGSQPFVPGRDVASYLEGQRLLSGA  180

Query  184  DKNNVMVITPEGETVVAPVALWNKRHVEPPPGSQLWLGFSAHVLPEKYADLNDQIVSVLT  243
            D++   V+ P+G +  APVA WNKRH+EP PGS +++GF+  +       +N  I+  LT
Sbjct  181  DRSYAWVVYPDGRSQKAPVAYWNKRHIEPMPGSIIFVGFADSLWRGTPEAINADILHTLT  240

Query  244  QRVPD  248
            QR+P+
Sbjct  241  QRIPE  245


>ref|ZP_02785878.2| conserved hypothetical protein [Escherichia coli O157:H7 str. 
EC4501]
 gb|EDU86949.1| conserved hypothetical protein [Escherichia coli O157:H7 str. 
EC4501]
 gb|EFW65521.1| YjbG polysaccharide synthesis-related protein [Escherichia coli 
O157:H7 str. EC1212]
Length=220

 Score =  170 bits (430),  Expect = 2e-40, Method: Compositional matrix adjust.
 Identities = 94/220 (42%), Positives = 135/220 (61%), Gaps = 1/220 (0%)

Query  28   IYLPGEQQTLSVGPVENVVQLVTQPQLRDRLWWPGALLTDSAAKAKALKDYQHVMAQLAS  87
            ++  G  +  ++   E+++ LV QP+L +  WWPGA++++  A A AL+  Q ++ +LA 
Sbjct  1    MFSNGSSEAKTLTGAEHLIDLVGQPRLANS-WWPGAVISEELATAAALRQQQALLTRLAE  59

Query  88   WEAEADDDVAATIKSVRQQLLNLNITGRLPVKLDPDFVRVDENSNPPLVGDYTLYTVQRP  147
              A++  D AA I ++RQQ+  L +TGR  + LDPD VRV E +NPPL G+YTL+    P
Sbjct  60   QGADSSADDAAAINALRQQIQALKVTGRQKINLDPDIVRVAERANPPLQGNYTLWVGPPP  119

Query  148  VTITLLGAVSGAGQLPWQAGRSVTDYLQDHPRLAGADKNNVMVITPEGETVVAPVALWNK  207
             T+TL G +S  G  P+  GR V  YL     L GAD++   V+ P+G T  APVA WNK
Sbjct  120  STVTLFGLISRPGNQPFTPGRDVASYLSGQSLLGGADRSYAWVVYPDGRTQKAPVAYWNK  179

Query  208  RHVEPPPGSQLWLGFSAHVLPEKYADLNDQIVSVLTQRVP  247
            RHVEP PGS +++G +  V  E    LN  I+  LTQR+P
Sbjct  180  RHVEPMPGSIIYVGLADSVWSEMPDALNADILQTLTQRIP  219


>ref|ZP_02833337.1| conserved hypothetical protein [Salmonella enterica subsp. enterica 
serovar Weltevreden str. HI_N05-537]
 gb|EDZ28933.1| conserved hypothetical protein [Salmonella enterica subsp. enterica 
serovar Weltevreden str. HI_N05-537]
 emb|CBY98390.1| Uncharacterized protein yjbG Flags: Precursor [Salmonella enterica 
subsp. enterica serovar Weltevreden str. 2007-60-3289-1]
Length=245

 Score =  169 bits (429),  Expect = 3e-40, Method: Compositional matrix adjust.
 Identities = 96/239 (40%), Positives = 141/239 (58%), Gaps = 1/239 (0%)

Query  10   ASVLYVMTPHAFAQGTVTIYLPGEQQTLSVGPVENVVQLVTQPQLRDRLWWPGALLTDSA  69
            A  L  +   AFA GTVT++  G  +  ++   E ++ LV QP+L    WWP A++ +  
Sbjct  8    ALALAFVASSAFASGTVTVFTQGNSEPKTLTDAERLLDLVGQPRLATS-WWPAAVIGEEQ  66

Query  70   AKAKALKDYQHVMAQLASWEAEADDDVAATIKSVRQQLLNLNITGRLPVKLDPDFVRVDE  129
            A   A +  Q ++ +LA+  AE D D AA I ++R+Q+  + +TGR  V LDPD VRV E
Sbjct  67   ATVTARQQQQELLGRLAALGAEEDGDAAAAINTLRRQIQAVKVTGRQFVNLDPDVVRVSE  126

Query  130  NSNPPLVGDYTLYTVQRPVTITLLGAVSGAGQLPWQAGRSVTDYLQDHPRLAGADKNNVM  189
              NPPL G YTL+   +P  +TL G +S  G  P+  GR V  YL+    L+GAD++   
Sbjct  127  RGNPPLQGHYTLWVGPQPTQVTLFGLISQLGSQPFVPGRDVASYLEGQRLLSGADRSYAW  186

Query  190  VITPEGETVVAPVALWNKRHVEPPPGSQLWLGFSAHVLPEKYADLNDQIVSVLTQRVPD  248
            V+ P+G +  APVA WNKRH+EP PGS +++GF+  +       +N  I+  LTQR+P+
Sbjct  187  VVYPDGRSQKAPVAYWNKRHIEPMPGSIIFVGFADSLWRGTPEAINADILHTLTQRIPE  245


>ref|NP_463089.1| periplasmic protein [Salmonella enterica subsp. enterica serovar 
Typhimurium str. LT2]
 ref|ZP_02572509.1| conserved hypothetical protein [Salmonella enterica subsp. enterica 
serovar 4,[5],12:i:- str. CVM23701]
 ref|ZP_02668549.1| conserved hypothetical protein [Salmonella enterica subsp. enterica 
serovar Heidelberg str. SL486]
 14 more sequence titles
 Length=245

 Score =  169 bits (427),  Expect = 4e-40, Method: Compositional matrix adjust.
 Identities = 96/239 (40%), Positives = 140/239 (58%), Gaps = 1/239 (0%)

Query  10   ASVLYVMTPHAFAQGTVTIYLPGEQQTLSVGPVENVVQLVTQPQLRDRLWWPGALLTDSA  69
            A  L  +   AFA GTVT++  G  +  ++   E ++ LV QP+L    WWP A++ +  
Sbjct  8    ALALAFVASSAFASGTVTVFTQGNSEPKTLTDAERLLDLVGQPRLATS-WWPAAVIGEEQ  66

Query  70   AKAKALKDYQHVMAQLASWEAEADDDVAATIKSVRQQLLNLNITGRLPVKLDPDFVRVDE  129
            A   A +  Q ++ +LA   AE D D AA I ++R+Q+  + +TGR  V LDPD VRV E
Sbjct  67   ATVTARQQQQELLGRLAVLSAEEDGDAAAAINTLRRQIQAVKVTGRQFVNLDPDVVRVSE  126

Query  130  NSNPPLVGDYTLYTVQRPVTITLLGAVSGAGQLPWQAGRSVTDYLQDHPRLAGADKNNVM  189
              NPPL G Y L+   +P  +TL G +S  G  P+  GR V  YL+D   L+GAD++   
Sbjct  127  RGNPPLQGHYMLWVGPQPTQVTLFGLISQPGSQPFVPGRDVASYLEDQRLLSGADRSYAW  186

Query  190  VITPEGETVVAPVALWNKRHVEPPPGSQLWLGFSAHVLPEKYADLNDQIVSVLTQRVPD  248
            V+ P+G +  APVA WNKRH+EP PGS +++GF+  +       +N  I+  LTQR+P+
Sbjct  187  VVYPDGRSQKAPVAYWNKRHIEPMPGSIIFVGFADSLWRGTPEAINADILHTLTQRIPE  245


>ref|ZP_06537804.1| hypothetical protein Salmonellaentericaenterica_23277 [Salmonella 
enterica subsp. enterica serovar Typhi str. AG3]
Length=229

 Score =  168 bits (426),  Expect = 6e-40, Method: Compositional matrix adjust.
 Identities = 89/214 (41%), Positives = 129/214 (60%), Gaps = 2/214 (0%)

Query  10   ASVLYVMTPHAFAQGTVTIYLPGEQQTLSVGPVENVVQLVTQPQLRDRLWWPGALLTDSA  69
            A  L  +   AFA GTVT++  G  +  ++   E ++ LV QP+L    WWP A++ +  
Sbjct  8    ALALAFVASSAFASGTVTVFTQGNSEPKTLTDAERLLDLVGQPRLATS-WWPAAVIGEEQ  66

Query  70   AKAKALKDYQHVMAQLASWEAEADDDVAATIKSVRQQLLNLNITGRLPVKLDPDFVRVDE  129
            A   A +  Q ++ +LA+  AE D D AA I ++R+Q+  + +TGR  V LDPD VRV E
Sbjct  67   ATVTARQQQQELLGRLAALSAEEDGDAAA-INTLRRQIQAVKVTGRQFVNLDPDVVRVSE  125

Query  130  NSNPPLVGDYTLYTVQRPVTITLLGAVSGAGQLPWQAGRSVTDYLQDHPRLAGADKNNVM  189
              NPPL G YTL+   +P  +TL G +S  G  P+  GR V  YL+    L+GAD++   
Sbjct  126  RGNPPLQGHYTLWVGPQPTQVTLFGLISQPGSQPFVPGRDVASYLEGQRLLSGADRSYAW  185

Query  190  VITPEGETVVAPVALWNKRHVEPPPGSQLWLGFS  223
            V+ P+G +  APVA WNKRH+EP PGS +++GF+
Sbjct  186  VVYPDGRSQKAPVAYWNKRHIEPMPGSIIFVGFA  219


>ref|ZP_03373074.1| hypothetical protein SentesTyp_23395 [Salmonella enterica subsp. 
enterica serovar Typhi str. E98-2068]
Length=230

 Score =  168 bits (426),  Expect = 6e-40, Method: Compositional matrix adjust.
 Identities = 89/214 (41%), Positives = 129/214 (60%), Gaps = 2/214 (0%)

Query  10   ASVLYVMTPHAFAQGTVTIYLPGEQQTLSVGPVENVVQLVTQPQLRDRLWWPGALLTDSA  69
            A  L  +   AFA GTVT++  G  +  ++   E ++ LV QP+L    WWP A++ +  
Sbjct  8    ALALAFVASSAFASGTVTVFTQGNSEPKTLTDAERLLDLVGQPRLATS-WWPAAVIGEEQ  66

Query  70   AKAKALKDYQHVMAQLASWEAEADDDVAATIKSVRQQLLNLNITGRLPVKLDPDFVRVDE  129
            A   A +  Q ++ +LA+  AE D D AA I ++R+Q+  + +TGR  V LDPD VRV E
Sbjct  67   ATVTARQQQQELLGRLAALSAEEDGDAAA-INTLRRQIQAVKVTGRQFVNLDPDVVRVSE  125

Query  130  NSNPPLVGDYTLYTVQRPVTITLLGAVSGAGQLPWQAGRSVTDYLQDHPRLAGADKNNVM  189
              NPPL G YTL+   +P  +TL G +S  G  P+  GR V  YL+    L+GAD++   
Sbjct  126  RGNPPLQGHYTLWVGPQPTQVTLFGLISQPGSQPFVPGRDVASYLEGQRLLSGADRSYAW  185

Query  190  VITPEGETVVAPVALWNKRHVEPPPGSQLWLGFS  223
            V+ P+G +  APVA WNKRH+EP PGS +++GF+
Sbjct  186  VVYPDGRSQKAPVAYWNKRHIEPMPGSIIFVGFA  219


>ref|YP_002228797.1| hypothetical protein SG4066 [Salmonella enterica subsp. enterica 
serovar Gallinarum str. 287/91]
 ref|YP_002246029.1| hypothetical protein SEN3991 [Salmonella enterica subsp. enterica 
serovar Enteritidis str. P125109]
 emb|CAR39836.1| putative exported protein [Salmonella enterica subsp. enterica 
serovar Gallinarum str. 287/91]
 emb|CAR35557.1| putative exported protein [Salmonella enterica subsp. enterica 
serovar Enteritidis str. P125109]
 gb|EGE36491.1| Putative exported protein [Salmonella enterica subsp. enterica 
serovar Gallinarum str. SG9]
Length=245

 Score =  168 bits (425),  Expect = 8e-40, Method: Compositional matrix adjust.
 Identities = 95/245 (38%), Positives = 142/245 (57%), Gaps = 1/245 (0%)

Query  4    LQSYFIASVLYVMTPHAFAQGTVTIYLPGEQQTLSVGPVENVVQLVTQPQLRDRLWWPGA  63
            ++    A  L  +    FA GTVT++  G  +  ++   E ++ LV QP+L    WWP A
Sbjct  2    MKRMISALALAFIASSVFASGTVTVFTQGNSEPKTLTDAERLLDLVGQPRLATS-WWPAA  60

Query  64   LLTDSAAKAKALKDYQHVMAQLASWEAEADDDVAATIKSVRQQLLNLNITGRLPVKLDPD  123
            ++ +  A   A +  Q ++ +LA+  AE D D AA I ++R+Q+  + +TGR  V LDPD
Sbjct  61   VIGEEQATVTARQQQQELLGRLAALGAEEDGDAAAAINTLRRQIQAVKVTGRQLVNLDPD  120

Query  124  FVRVDENSNPPLVGDYTLYTVQRPVTITLLGAVSGAGQLPWQAGRSVTDYLQDHPRLAGA  183
             VRV E  NPPL G YTL+   +P  +TL G +S  G  P+  GR V  YL+    L+GA
Sbjct  121  VVRVSERGNPPLQGHYTLWVGPQPTQVTLFGLISQPGSQPFVPGRDVASYLEGQRLLSGA  180

Query  184  DKNNVMVITPEGETVVAPVALWNKRHVEPPPGSQLWLGFSAHVLPEKYADLNDQIVSVLT  243
            D++   V+ P+G +  APVA WNKRH+EP PGS +++GF+  +       +N  I+  LT
Sbjct  181  DRSYAWVVYPDGRSQKAPVAYWNKRHIEPMPGSIIFVGFADSLWRGTPKAINADILHTLT  240

Query  244  QRVPD  248
            QR+P+
Sbjct  241  QRIPE  245


>ref|ZP_04656864.1| hypothetical protein SentesTe_18130 [Salmonella enterica subsp. 
enterica serovar Tennessee str. CDC07-0191]
Length=245

 Score =  168 bits (425),  Expect = 8e-40, Method: Compositional matrix adjust.
 Identities = 95/239 (39%), Positives = 140/239 (58%), Gaps = 1/239 (0%)

Query  10   ASVLYVMTPHAFAQGTVTIYLPGEQQTLSVGPVENVVQLVTQPQLRDRLWWPGALLTDSA  69
            A  L  +    FA GTVT++  G  +  ++   E ++ LV QP+L    WWP A++ +  
Sbjct  8    ALALAFIASSVFASGTVTVFTQGNSEPKTLTDAERLLDLVGQPRLATS-WWPAAVIGEEQ  66

Query  70   AKAKALKDYQHVMAQLASWEAEADDDVAATIKSVRQQLLNLNITGRLPVKLDPDFVRVDE  129
            A   A +  Q ++ +LA+  AE D D AA I ++R+Q+  + +TGR  V LDPD VRV E
Sbjct  67   ATVTARQQQQELLGRLAALSAEEDGDAAAAINTLRRQIQAVKVTGRQFVNLDPDVVRVSE  126

Query  130  NSNPPLVGDYTLYTVQRPVTITLLGAVSGAGQLPWQAGRSVTDYLQDHPRLAGADKNNVM  189
              NPPL G YTL+   +P  +TL G +S  G  P+  GR V  YL+    L+GAD++   
Sbjct  127  RGNPPLQGHYTLWVGPQPTQVTLFGLISQPGSQPFVPGRDVASYLEGQRLLSGADRSYAW  186

Query  190  VITPEGETVVAPVALWNKRHVEPPPGSQLWLGFSAHVLPEKYADLNDQIVSVLTQRVPD  248
            V+ P+G +  APVA WNKRH+EP PGS +++GF+  +       +N  I+  LTQR+P+
Sbjct  187  VVYPDGRSQKAPVAYWNKRHIEPMPGSIIFVGFADSLWRGTPEAINADILHTLTQRIPE  245


>gb|EGI88734.1| hypothetical protein SB521682_4838 [Shigella boydii 5216-82]
Length=220

 Score =  168 bits (425),  Expect = 9e-40, Method: Compositional matrix adjust.
 Identities = 93/220 (42%), Positives = 135/220 (61%), Gaps = 1/220 (0%)

Query  28   IYLPGEQQTLSVGPVENVVQLVTQPQLRDRLWWPGALLTDSAAKAKALKDYQHVMAQLAS  87
            ++  G  +  ++   E+++ LV QP+L +  WWPGA++++  A A AL+  Q ++ +LA 
Sbjct  1    MFSNGSSEAKTLTGAEHLIDLVGQPRLANS-WWPGAVISEELATAAALRQQQALLTRLAE  59

Query  88   WEAEADDDVAATIKSVRQQLLNLNITGRLPVKLDPDFVRVDENSNPPLVGDYTLYTVQRP  147
              A++  D AA I ++RQQ+  L +T R  + LDPD VRV E  NPPL G+YTL+    P
Sbjct  60   QGADSSADDAAAINALRQQIQALKVTSRQKINLDPDIVRVAERGNPPLQGNYTLWVGPPP  119

Query  148  VTITLLGAVSGAGQLPWQAGRSVTDYLQDHPRLAGADKNNVMVITPEGETVVAPVALWNK  207
             T+TL G +S  G+ P+  GR V  YL     L+GAD++   V+ P+G T  APVA WNK
Sbjct  120  STVTLFGLISRPGKQPFTPGRDVASYLSGQNLLSGADRSYAWVVYPDGRTQKAPVAYWNK  179

Query  208  RHVEPPPGSQLWLGFSAHVLPEKYADLNDQIVSVLTQRVP  247
            RHVEP PGS +++G +  V  E    LN  I+  LTQR+P
Sbjct  180  RHVEPMPGSIIYVGLADSVWSETPDALNADILQTLTQRIP  219


>ref|ZP_03364691.1| hypothetical protein SentesTyph_17318 [Salmonella enterica subsp. 
enterica serovar Typhi str. E98-0664]
Length=187

 Score =  149 bits (377),  Expect = 3e-34, Method: Compositional matrix adjust.
 Identities = 78/188 (41%), Positives = 114/188 (60%), Gaps = 1/188 (0%)

Query  61   PGALLTDSAAKAKALKDYQHVMAQLASWEAEADDDVAATIKSVRQQLLNLNITGRLPVKL  120
            P A++ +  A   A +  Q ++ +LA+  AE D D AA I ++R+Q+  + +TGR  V L
Sbjct  1    PAAVIGEEQATVTARQQQQELLGRLAALSAEEDGDAAA-INTLRRQIQAVKVTGRQFVNL  59

Query  121  DPDFVRVDENSNPPLVGDYTLYTVQRPVTITLLGAVSGAGQLPWQAGRSVTDYLQDHPRL  180
            DPD VRV E  NPPL G YTL+   +P  +TL G +S  G  P+  GR V  YL+    L
Sbjct  60   DPDVVRVSERGNPPLQGHYTLWVGPQPTQVTLFGLISQPGSQPFVPGRDVASYLEGQRLL  119

Query  181  AGADKNNVMVITPEGETVVAPVALWNKRHVEPPPGSQLWLGFSAHVLPEKYADLNDQIVS  240
            +GAD++   V+ P+G +  APVA WNKRH+EP PGS +++GF+  +       +N  I+ 
Sbjct  120  SGADRSYAWVVYPDGRSQKAPVAYWNKRHIEPMPGSIIFVGFADSLWRGTPEAINADILH  179

Query  241  VLTQRVPD  248
             LTQR+P+
Sbjct  180  TLTQRIPE  187


>gb|EFW61835.1| YjbG polysaccharide synthesis-related protein [Shigella flexneri 
CDC 796-83]
 gb|EGI92902.1| hypothetical protein SB359474_4677 [Shigella boydii 3594-74]
Length=185

 Score =  149 bits (375),  Expect = 5e-34, Method: Compositional matrix adjust.
 Identities = 82/184 (44%), Positives = 114/184 (61%), Gaps = 0/184 (0%)

Query  64   LLTDSAAKAKALKDYQHVMAQLASWEAEADDDVAATIKSVRQQLLNLNITGRLPVKLDPD  123
            ++++  A A AL+  Q ++ +LA   A++  D AA I ++RQQ+  L +TGR  + LDPD
Sbjct  1    MISEELATAAALRQQQALLTRLAEQAADSSADDAAAINALRQQIQALKVTGRQKINLDPD  60

Query  124  FVRVDENSNPPLVGDYTLYTVQRPVTITLLGAVSGAGQLPWQAGRSVTDYLQDHPRLAGA  183
             VRV E  NPPL G+YTL+    P T+TL G +S  G+ P+  GR V  YL     L+GA
Sbjct  61   IVRVAERGNPPLQGNYTLWVGPPPSTVTLFGLISRPGKQPFTPGRDVASYLSGQNLLSGA  120

Query  184  DKNNVMVITPEGETVVAPVALWNKRHVEPPPGSQLWLGFSAHVLPEKYADLNDQIVSVLT  243
            D++   V+ P+G T  APVA WNKRHVEP PGS +++G +  V  E    LN  I+  LT
Sbjct  121  DRSYAWVVYPDGRTQKAPVAYWNKRHVEPMPGSIIYVGLADSVWSETPDALNADILQTLT  180

Query  244  QRVP  247
            QR+P
Sbjct  181  QRIP  184


>ref|YP_410318.1| hypothetical protein SBO_4056 [Shigella boydii Sb227]
 gb|ABB68490.1| conserved hypothetical protein [Shigella boydii Sb227]
Length=174

 Score =  141 bits (356),  Expect = 9e-32, Method: Compositional matrix adjust.
 Identities = 78/173 (45%), Positives = 107/173 (61%), Gaps = 0/173 (0%)

Query  75   LKDYQHVMAQLASWEAEADDDVAATIKSVRQQLLNLNITGRLPVKLDPDFVRVDENSNPP  134
            ++  Q ++ +LA   A++  D AA I ++RQQ+  L +TGR  + LDPD VRV E  NPP
Sbjct  1    MRQQQALLTRLAEQAADSSADDAAAINALRQQIQALKVTGRQKINLDPDIVRVAERGNPP  60

Query  135  LVGDYTLYTVQRPVTITLLGAVSGAGQLPWQAGRSVTDYLQDHPRLAGADKNNVMVITPE  194
            L G+YTL+    P T+TL G +S  G+ P+  GR V  YL     L+GAD++   V+ P+
Sbjct  61   LQGNYTLWVGPPPSTVTLFGLISRPGKQPFTPGRDVASYLSGQNLLSGADRSYAWVVYPD  120

Query  195  GETVVAPVALWNKRHVEPPPGSQLWLGFSAHVLPEKYADLNDQIVSVLTQRVP  247
            G T  APVA WNKRHVEP PGS +++G +  V  E    LN  I+  LTQR+P
Sbjct  121  GRTQKAPVAYWNKRHVEPMPGSIIYVGLADSVWSETPDALNADILQTLTQRIP  173


>ref|ZP_03338398.1| hypothetical protein Salmonelentericaenterica_16032 [Salmonella 
enterica subsp. enterica serovar Typhi str. 404ty]
Length=144

 Score =  125 bits (314),  Expect = 5e-27, Method: Compositional matrix adjust.
 Identities = 63/144 (43%), Positives = 89/144 (61%), Gaps = 0/144 (0%)

Query  105  QQLLNLNITGRLPVKLDPDFVRVDENSNPPLVGDYTLYTVQRPVTITLLGAVSGAGQLPW  164
            +Q+  + +TGR  V LDPD VRV E  NPPL G YTL+   +P  +TL G +S  G  P+
Sbjct  1    RQIQAVKVTGRQFVNLDPDVVRVSERGNPPLQGHYTLWVGPQPTQVTLFGLISQPGSQPF  60

Query  165  QAGRSVTDYLQDHPRLAGADKNNVMVITPEGETVVAPVALWNKRHVEPPPGSQLWLGFSA  224
              GR V  YL+    L+GAD++   V+ P+G +  APVA WNKRH+EP PGS +++GF+ 
Sbjct  61   VPGRDVASYLEGQRLLSGADRSYAWVVYPDGRSQKAPVAYWNKRHIEPMPGSIIFVGFAD  120

Query  225  HVLPEKYADLNDQIVSVLTQRVPD  248
             +       +N  I+  LTQR+P+
Sbjct  121  SLWRGTPEAINADILHTLTQRIPE  144


>gb|EDA50530.1| hypothetical protein GOS_1989170 [marine metagenome]
Length=179

 Score =  125 bits (313),  Expect = 8e-27, Method: Compositional matrix adjust.
 Identities = 66/179 (36%), Positives = 105/179 (58%), Gaps = 5/179 (2%)

Query  75   LKDYQHVMAQLASWEAE--ADDDVAATIKSVRQQLLN---LNITGRLPVKLDPDFVRVDE  129
            + D  ++++ L + + +   D ++ A ++S +Q L+    LN+TGRLP+ +DP   R  E
Sbjct  1    IADKMNLLSDLKALQVQWMRDGNMGAWVQSSQQLLIEIDRLNVTGRLPIAIDPVINRAHE  60

Query  130  NSNPPLVGDYTLYTVQRPVTITLLGAVSGAGQLPWQAGRSVTDYLQDHPRLAGADKNNVM  189
            + NP L GDYTL+   R   I   G ++GA +   + G  + DY Q +  LAGAD     
Sbjct  61   DKNPLLSGDYTLFISPRSQFIYFTGLINGASRQLLREGAGLADYWQAYSLLAGADLAQAY  120

Query  190  VITPEGETVVAPVALWNKRHVEPPPGSQLWLGFSAHVLPEKYADLNDQIVSVLTQRVPD  248
            +I P GE  + PVA+WN+ H EP  G+ L++GF   +LP KY +LN +I ++L  R+P+
Sbjct  121  LIQPTGEVSLVPVAVWNQLHREPMAGATLFVGFDTDLLPAKYKNLNLRIANLLANRIPE  179


>gb|EBB46452.1| hypothetical protein GOS_229183 [marine metagenome]
Length=180

 Score =  124 bits (310),  Expect = 2e-26, Method: Compositional matrix adjust.
 Identities = 66/180 (36%), Positives = 102/180 (56%), Gaps = 1/180 (0%)

Query  4    LQSYFIASVLYVMTPHAFAQGTVTIYLPGEQQTLSVGPVENVVQLVTQPQLRDRLWWPGA  63
            ++    A    +     FA G+V +   G  +  ++   E+++ LV QP+L +  WWPGA
Sbjct  2    MKQMITALTFSLCAASVFAAGSVKVITTGSTEAKTLTGAEHLLDLVGQPRLSNS-WWPGA  60

Query  64   LLTDSAAKAKALKDYQHVMAQLASWEAEADDDVAATIKSVRQQLLNLNITGRLPVKLDPD  123
            ++++  A  +A +  Q ++A+LA+  AEA  + A  I ++RQQ+  L + GR  + LDPD
Sbjct  61   VISEERATTEAQRQQQALLARLATLSAEASGEDAGAINALRQQIQALKVAGRQTINLDPD  120

Query  124  FVRVDENSNPPLVGDYTLYTVQRPVTITLLGAVSGAGQLPWQAGRSVTDYLQDHPRLAGA  183
             VR  E  NPPL G+YTL+   +P  ITLLG +S  G+  +  GR V  YL    RL+GA
Sbjct  121  VVRTSEPGNPPLQGNYTLWVGPQPTDITLLGLLSHTGKQLFIPGRDVASYLDGQHRLSGA  180


>ref|ZP_03830735.1| hypothetical protein PcarcW_05039 [Pectobacterium carotovorum 
subsp. carotovorum WPP14]
Length=260

 Score =  121 bits (303),  Expect = 1e-25, Method: Compositional matrix adjust.
 Identities = 84/258 (32%), Positives = 135/258 (52%), Gaps = 25/258 (9%)

Query  9    IASVLYVMTPHAFAQGTVTIYLPGEQQTLSVGPVENVVQL------VTQPQLRDRLWWPG  62
            IA++L +++  A     +T+  P  Q+T++V  +++  +L      V+ PQ    + W  
Sbjct  7    IATLLLLVSGVA-TSAQLTVKSP--QETIAVVKLDDGTRLEKFYEQVSWPQ---NINWQT  60

Query  63   ALLTDSAAKAKALKDYQHVMAQLAS----WEAEADDDVAATIKSVRQQLLNLNITGRLPV  118
            A ++D A   K       ++ +LA     W    D D+A +   +R+ +  +N+ GR+  
Sbjct  61   AFISDFATTQKVRAQGDVLLQKLAELETRWRNSGDGDLAISAWLLRKTINPINVAGRIRT  120

Query  119  KLDPDFVRVDENSNPPLVGDYTLYTVQRPVTITLLGAVSGA---------GQLPWQAGRS  169
             LDPD VRV   +N PLVG+Y LY       ++L+G V+ A         G++  +AG S
Sbjct  121  DLDPDRVRVYIENNRPLVGEYALYVAPHDDKLSLIGLVNTAADVGELETSGKVALRAGWS  180

Query  170  VTDYLQDHPRLAGADKNNVMVITPEGETVVAPVALWNKRHVEPPPGSQLWLGFSAHVLPE  229
            V +YL     LAGAD +   +I   G+    P+ALWN++HVEP  G  L++GF+  VLP+
Sbjct  181  VDNYLSGRRLLAGADNSYGYLIAGNGKWRKVPLALWNRQHVEPAAGETLFIGFNPSVLPQ  240

Query  230  KYADLNDQIVSVLTQRVP  247
              + LNDQ+   L  R P
Sbjct  241  DMSSLNDQLADYLANRTP  258


>ref|YP_003260364.1| hypothetical protein Pecwa_3011 [Pectobacterium wasabiae WPP163]
 gb|ACX88757.1| protein of unknown function DUF1017 [Pectobacterium wasabiae 
WPP163]
Length=260

 Score =  119 bits (299),  Expect = 3e-25, Method: Compositional matrix adjust.
 Identities = 68/201 (33%), Positives = 105/201 (52%), Gaps = 13/201 (6%)

Query  60   WPGALLTDSAAKAKALKDYQHVMAQLAS----WEAEADDDVAATIKSVRQQLLNLNITGR  115
            W  A ++D A           ++ +LA     W    D D+A +   +R+ +  +N+ GR
Sbjct  58   WQTAFISDFATTQNVRAQGDTLLQKLAELETRWRNSGDGDLAISAWLLRKAISPINVAGR  117

Query  116  LPVKLDPDFVRVDENSNPPLVGDYTLYTVQRPVTITLLGAVSGA---------GQLPWQA  166
            +   LDPD VRV   +N PLVG+Y LY       ++L+G V+ +         G++  +A
Sbjct  118  IHTNLDPDRVRVYIENNRPLVGEYALYVAPHDDKLSLIGLVNTSADVGELETSGKVALRA  177

Query  167  GRSVTDYLQDHPRLAGADKNNVMVITPEGETVVAPVALWNKRHVEPPPGSQLWLGFSAHV  226
            G SV DYL     LAGAD +   +I   G+    P+ALWN++H EP  G  +++GF+  V
Sbjct  178  GSSVEDYLAGRRLLAGADNSYGYLIGGNGQWRKVPLALWNRQHTEPAAGETIFIGFNPSV  237

Query  227  LPEKYADLNDQIVSVLTQRVP  247
            LP+  + LNDQ+   L  R+P
Sbjct  238  LPQDMSSLNDQLADYLANRIP  258


>ref|YP_003016906.1| hypothetical protein PC1_1323 [Pectobacterium carotovorum subsp. 
carotovorum PC1]
 gb|ACT12370.1| protein of unknown function DUF1017 [Pectobacterium carotovorum 
subsp. carotovorum PC1]
Length=260

 Score =  119 bits (298),  Expect = 5e-25, Method: Compositional matrix adjust.
 Identities = 82/258 (31%), Positives = 134/258 (51%), Gaps = 25/258 (9%)

Query  9    IASVLYVMTPHAFAQGTVTIYLPGEQQTLSVGPVENVVQL------VTQPQLRDRLWWPG  62
            IA++L +++  A     +T+  P  QQT++V  +++  +L      V  PQ    + W  
Sbjct  7    IATLLLLVSGVA-TSAQLTVKSP--QQTIAVVKLDDGTRLEKFYEQVPWPQ---NINWQT  60

Query  63   ALLTDSAAKAKALKDYQHVMAQLAS----WEAEADDDVAATIKSVRQQLLNLNITGRLPV  118
            A ++D A   K       ++ +LA     W    D D+A +   +R+ +  +N+ GR+  
Sbjct  61   AFISDFATTQKVRAQGDVLLQKLAELETRWRNSGDGDLAISAWLLRKTINPINVAGRIST  120

Query  119  KLDPDFVRVDENSNPPLVGDYTLYTVQRPVTITLLGAVSGA---------GQLPWQAGRS  169
             LDPD VRV   +N PLVG+Y LY       ++L+G V+ +         G++  +AG S
Sbjct  121  DLDPDRVRVYAENNRPLVGEYALYVAPHDDKLSLIGLVNTSADVGELETSGKVALRAGWS  180

Query  170  VTDYLQDHPRLAGADKNNVMVITPEGETVVAPVALWNKRHVEPPPGSQLWLGFSAHVLPE  229
            V +YL     +AGAD +   +I   G+    P+ALWN++H+EP  G  L++GF+  VLP+
Sbjct  181  VENYLSGRRLIAGADNSYGYLIGGNGQWRKVPLALWNRQHIEPAAGETLFIGFNPAVLPQ  240

Query  230  KYADLNDQIVSVLTQRVP  247
              + LNDQ+   L  R P
Sbjct  241  DMSSLNDQLADYLANRTP  258


>gb|EFU99994.1| conserved hypothetical protein [Escherichia coli 3431]
Length=248

 Score =  118 bits (295),  Expect = 1e-24, Method: Compositional matrix adjust.
 Identities = 70/241 (29%), Positives = 114/241 (47%), Gaps = 5/241 (2%)

Query  13   LYVMTPHAFAQGTVTIYLPG-EQQTLSVGPVENVVQLVTQPQLRDRLWWPGALLTDSAAK  71
            L + +    A   V++Y  G +QQ L +     +  L+    + + ++W  A +      
Sbjct  8    LLLFSSVCMADANVSLYFNGNQQQNLILSSDARLDTLLQSSHIPENVYWRSAQIATPEQH  67

Query  72   AKALKDYQHVMAQLAS----WEAEADDDVAATIKSVRQQLLNLNITGRLPVKLDPDFVRV  127
               ++    ++++L +    W  E     A +   + QQ+  L+++GRLP+ +DP  V  
Sbjct  68   KVIMQRQSALLSELQTIETLWRNEGKQKNADSTAKLYQQIAKLHLSGRLPITIDPYQVLR  127

Query  128  DENSNPPLVGDYTLYTVQRPVTITLLGAVSGAGQLPWQAGRSVTDYLQDHPRLAGADKNN  187
             +  NP L G Y LY   R   I L G +S    LP + G  V  Y Q      GAD  +
Sbjct  128  SKADNPRLDGQYQLYLASRASKIALFGLISALPNLPLEPGFGVDQYWQRSALQPGADTAH  187

Query  188  VMVITPEGETVVAPVALWNKRHVEPPPGSQLWLGFSAHVLPEKYADLNDQIVSVLTQRVP  247
            V +I P G     PVA+WNK H EP PG+ L +GF    LP ++  +N +I  ++  +VP
Sbjct  188  VWLIQPTGNIEKVPVAVWNKLHREPMPGATLLVGFDESQLPTRFEGINRRIAEIIANKVP  247

Query  248  D  248
            +
Sbjct  248  E  248


>ref|ZP_03826818.1| hypothetical protein PcarbP_09375 [Pectobacterium carotovorum 
subsp. brasiliensis PBR1692]
Length=260

 Score =  117 bits (294),  Expect = 1e-24, Method: Compositional matrix adjust.
 Identities = 81/258 (31%), Positives = 134/258 (51%), Gaps = 25/258 (9%)

Query  9    IASVLYVMTPHAFAQGTVTIYLPGEQQTLSVGPVENVVQL------VTQPQLRDRLWWPG  62
            IA++L +++  A     +T+  P  QQT++V  +++  +L      V  PQ    + W  
Sbjct  7    IATLLLLVSGVA-TSAQLTVKSP--QQTIAVVKLDDGTRLEKFYEQVPWPQ---NINWQT  60

Query  63   ALLTDSAAKAKALKDYQHVMAQLAS----WEAEADDDVAATIKSVRQQLLNLNITGRLPV  118
            A ++D A   K       ++ +LA     W    D D+A +   +R+ +  +N+ GR+  
Sbjct  61   AFISDFATTQKVRAQGDVLLQKLAELETRWRNSGDGDLAISAWLLRKTINPINVAGRIRT  120

Query  119  KLDPDFVRVDENSNPPLVGDYTLYTVQRPVTITLLGAVSGA---------GQLPWQAGRS  169
             LDPD VRV   +N PLVG+Y LY       ++L+G V+ +         G++  +AG S
Sbjct  121  DLDPDRVRVYAENNRPLVGEYALYVAPHDDKLSLIGLVNTSADVGELETSGKVALRAGWS  180

Query  170  VTDYLQDHPRLAGADKNNVMVITPEGETVVAPVALWNKRHVEPPPGSQLWLGFSAHVLPE  229
              +YL     LAGAD +   +I   G+    P+ALWN++H+EP  G  L++GF+  VLP+
Sbjct  181  AENYLSGRRLLAGADNSYGYLIAGNGQWRKVPLALWNRQHIEPAAGETLFIGFNPAVLPQ  240

Query  230  KYADLNDQIVSVLTQRVP  247
            + + LN+Q+   L  R P
Sbjct  241  EMSSLNEQLADYLANRTP  258


>ref|YP_049553.1| hypothetical protein ECA1447 [Pectobacterium atrosepticum SCRI1043]
 emb|CAG74357.1| putative exported protein [Pectobacterium atrosepticum SCRI1043]
Length=260

 Score =  117 bits (293),  Expect = 1e-24, Method: Compositional matrix adjust.
 Identities = 80/258 (31%), Positives = 135/258 (52%), Gaps = 25/258 (9%)

Query  9    IASVLYVMTPHAFAQGTVTIYLPGEQQTLSVGPVENVVQL------VTQPQLRDRLWWPG  62
            IA++L +++  A +   +T+  P  Q+T++V  +++  +L      V+ PQ    + W  
Sbjct  7    IATLLLLISGVAMS-AQLTVKSP--QETIAVVKLDDGTRLEKFYEQVSWPQ---NINWQT  60

Query  63   ALLTDSAAKAKALKDYQHVMAQLAS----WEAEADDDVAATIKSVRQQLLNLNITGRLPV  118
            A ++D A   K       ++ +LA     W    D D+A +   +R+ +  +N+ GR+  
Sbjct  61   AFISDFATTQKVRVQGDVLLQKLAELETRWRNSGDGDLAISAWLLRKAINPINVAGRIRT  120

Query  119  KLDPDFVRVDENSNPPLVGDYTLYTVQRPVTITLLGAVSGA---------GQLPWQAGRS  169
             LDPD VRV   +N PLVG+Y LY       + L+G V+ +         G +  +AG S
Sbjct  121  NLDPDRVRVYSENNRPLVGEYALYVAPHDDKLALIGLVNTSADVGELETSGNVVLRAGWS  180

Query  170  VTDYLQDHPRLAGADKNNVMVITPEGETVVAPVALWNKRHVEPPPGSQLWLGFSAHVLPE  229
            V +YL     LAGAD +   +I   G+    P+ALWN++H+EP  G  +++GF+  VLP+
Sbjct  181  VENYLVGRRLLAGADNSYGYLIGGNGQWRKVPLALWNRQHIEPAAGETIFIGFNPSVLPQ  240

Query  230  KYADLNDQIVSVLTQRVP  247
              + LN+Q+   L  R+P
Sbjct  241  DMSSLNEQLADYLANRIP  258


>ref|YP_003005145.1| hypothetical protein Dd1591_2844 [Dickeya zeae Ech1591]
 gb|ACT07666.1| protein of unknown function DUF1017 [Dickeya zeae Ech1591]
Length=259

 Score =  117 bits (292),  Expect = 2e-24, Method: Compositional matrix adjust.
 Identities = 70/214 (32%), Positives = 100/214 (46%), Gaps = 13/214 (6%)

Query  47   QLVTQPQLRDRLWWPGALLTDSAAKAKALKDYQHVMAQLAS----WEAEADDDVAATIKS  102
            Q   Q    D++ W  ALLT+     K  +  + + A+L      W+ E D D A     
Sbjct  45   QFYGQMAFPDKVNWQTALLTNERVTEKVREKGEKLQARLYQLQLVWQVEGDGDWAIAAWY  104

Query  103  VRQQLLNLNITGRLPVKLDPDFVRVDENSNPPLVGDYTLYTVQRPVTITLLGAVSGA---  159
            + Q L  +N  GR+   +DP+ VR+ +  N PLVG+YTLY         L G +S     
Sbjct  105  MAQALKQVNTVGRIRASIDPEIVRLSQRDNRPLVGNYTLYLSPYRQQFFLFGLISTGIDI  164

Query  160  ------GQLPWQAGRSVTDYLQDHPRLAGADKNNVMVITPEGETVVAPVALWNKRHVEPP  213
                    +  Q G SV  Y+     L G D  +  +I+ EG     P+A+WN RH EP 
Sbjct  165  GTPHVFKDIDLQPGWSVEQYIGRRRFLPGGDSRDGYLISGEGHWRKVPLAVWNSRHHEPA  224

Query  214  PGSQLWLGFSAHVLPEKYADLNDQIVSVLTQRVP  247
             G  L++GF   +LPE +  LN+QI   L  R+P
Sbjct  225  AGETLFIGFDPSILPEGFTSLNEQIADYLANRIP  258


>ref|YP_003882187.1| hypothetical protein Dda3937_03274 [Dickeya dadantii 3937]
 gb|ADM97630.1| hypothetical protein Dda3937_03274 [Dickeya dadantii 3937]
Length=259

 Score =  115 bits (288),  Expect = 6e-24, Method: Compositional matrix adjust.
 Identities = 70/214 (32%), Positives = 99/214 (46%), Gaps = 13/214 (6%)

Query  47   QLVTQPQLRDRLWWPGALLTDSAAKAKALKDYQHVMAQLAS----WEAEADDDVAATIKS  102
            Q   Q    D+  W  ALLT+     K  +  + + A+L      W+AE D D A     
Sbjct  45   QFYGQTAFPDKANWQTALLTNERVTEKVREKGEKLQARLYQLQLVWQAEGDGDWAIAAWY  104

Query  103  VRQQLLNLNITGRLPVKLDPDFVRVDENSNPPLVGDYTLYTVQRPVTITLLGAVSGA---  159
            + Q L  +N  GR+   +DPD VR+    N PLVG+YTLY         L G +S     
Sbjct  105  MAQALKQVNTVGRIRASIDPDIVRLSPRDNRPLVGNYTLYLSPYRDQFFLFGLISTGVDI  164

Query  160  ------GQLPWQAGRSVTDYLQDHPRLAGADKNNVMVITPEGETVVAPVALWNKRHVEPP  213
                    +  Q+G SV  Y+     L G D  +  +I   G     P+A+WN++H EP 
Sbjct  165  GTPNVFKDIDLQSGWSVEQYIGRRRFLPGGDNRDGYLIAGNGHWRKVPLAVWNRQHNEPA  224

Query  214  PGSQLWLGFSAHVLPEKYADLNDQIVSVLTQRVP  247
             G  L++GF   +LPE +  LN+QI   L  R+P
Sbjct  225  AGETLFIGFDPSILPEGFTSLNEQIADYLANRIP  258


>ref|YP_691475.1| hypothetical protein SFV_4186 [Shigella flexneri 5 str. 8401]
 gb|ABF06170.1| conserved hypothetical protein [Shigella flexneri 5 str. 8401]
Length=213

 Score =  114 bits (284),  Expect = 2e-23, Method: Compositional matrix adjust.
 Identities = 65/164 (39%), Positives = 98/164 (59%), Gaps = 6/164 (3%)

Query  4    LQSYFIASVLYVMTPHAFAQGTVTIYLPGEQQTLSVGPVENVVQLVTQPQLRDRLWWPGA  63
            ++   +A +L V     FA GTV ++  G  +  ++   E+++ LV QP+L +  WWPGA
Sbjct  2    IKQTIVALLLSVGASSVFAAGTVKVFSNGSSEAKTLTGAEHLIDLVGQPRLANS-WWPGA  60

Query  64   LLTDSAAKAKALKDYQHVMAQLASWEAEADDDVAATIKSVRQQLLNLNITGRLPVKLDPD  123
            ++++  A A AL+  Q ++ +LA   A++  D AA I ++RQQ+  L +TGR  + LDPD
Sbjct  61   VISEELATAAALRQQQALLTRLAEQGADSSTDDAAAINALRQQIQALKVTGRQKINLDPD  120

Query  124  FVRVDENSNPPLVGDYTLYTVQRPVTITLLGAVSGAGQLP-WQA  166
             VRV E  NPPL G+YTL+ V  P   TL G +S  G  P WQ+
Sbjct  121  IVRVAERGNPPLQGNYTLW-VGPP---TLFGLISHPGNQPSWQS  160


>ref|ZP_06715180.1| conserved hypothetical protein [Edwardsiella tarda ATCC 23685]
 gb|EFE22503.1| conserved hypothetical protein [Edwardsiella tarda ATCC 23685]
Length=248

 Score =  114 bits (284),  Expect = 2e-23, Method: Compositional matrix adjust.
 Identities = 61/205 (29%), Positives = 109/205 (53%), Gaps = 4/205 (1%)

Query  48   LVTQPQLRDRLWWPGALLTDSAAKAKALKDYQHVMAQLAS----WEAEADDDVAATIKSV  103
            L+   +L   ++W  A +T  A +    +  + ++++L +    W  + ++  A + + +
Sbjct  44   LLNDSRLPSDIYWRSAQITTPAHQVAIKQQRRALLSELGALETLWRQQGEEAWAESTRHL  103

Query  104  RQQLLNLNITGRLPVKLDPDFVRVDENSNPPLVGDYTLYTVQRPVTITLLGAVSGAGQLP  163
             +QL  L +TGRLP+ LDP   +  +  NP LVGDY L+   R   + +LG +     LP
Sbjct  104  IRQLSALRLTGRLPIVLDPRQAQRSQADNPRLVGDYQLFIAPRRAQVMMLGLIHALPTLP  163

Query  164  WQAGRSVTDYLQDHPRLAGADKNNVMVITPEGETVVAPVALWNKRHVEPPPGSQLWLGFS  223
                + V++Y +    L  AD  +V +I P G+    PVA+WNKR  EP  G+ + +GF 
Sbjct  164  LIPAQGVSEYWRRDALLPAADSAHVWLIQPTGDISQVPVAVWNKRLREPMAGASILIGFD  223

Query  224  AHVLPEKYADLNDQIVSVLTQRVPD  248
             + LP ++  +N +I  +++ RVP+
Sbjct  224  PNTLPSRFQGINQRIAEIISNRVPE  248


>ref|YP_962878.1| hypothetical protein Sputw3181_1486 [Shewanella sp. W3-18-1]
 gb|ABM24324.1| protein of unknown function DUF1017 [Shewanella sp. W3-18-1]
Length=261

 Score =  112 bits (280),  Expect = 5e-23, Method: Compositional matrix adjust.
 Identities = 68/189 (35%), Positives = 101/189 (53%), Gaps = 3/189 (1%)

Query  58   LWWPGALLTDSAAKAKALKDYQHVMAQLASW-EAEADDDVAATIKSVRQQLLNLNITGRL  116
            ++W GA L D    A      Q V++QLA   EA  D      +  + Q L  + +  R+
Sbjct  73   IYWLGAALVDLENTAVLETKRQQVLSQLAQMGEATNDSRYITKLAQLAQFLRGIKLGQRV  132

Query  117  PVKLDPDFVRVDENSNPPLVGDYTLYTVQRPVTITLLGAVSGAGQLPWQAGRSVTDYLQD  176
               LD D +R+ E+ N  + G Y L    RP TITL+GAVS  G +PWQ+  S  DYLQ 
Sbjct  133  MQPLDIDAIRITESYNAIIEGKYQLVLPPRPSTITLVGAVSQTGNMPWQSQASSKDYLQQ  192

Query  177  HPRLAGADKNNVMVITPEGETVVAPVALWNKRHVEPPPGSQLWLGFSAHVLPEKYADLND  236
               L  A+ + V +I P+G  +  P+A WN +  +  PG+ L++ FS+  L + YA+LN+
Sbjct  193  AGLLENAETSFVWIIQPDGNAIRQPIAYWNYQAQDIAPGATLFVEFSS--LFDGYANLNE  250

Query  237  QIVSVLTQR  245
             I+ +L  R
Sbjct  251  NIIELLKNR  259


>gb|ADV54996.1| protein of unknown function DUF1017 [Shewanella putrefaciens 
200]
Length=261

 Score =  111 bits (278),  Expect = 1e-22, Method: Compositional matrix adjust.
 Identities = 67/189 (35%), Positives = 101/189 (53%), Gaps = 3/189 (1%)

Query  58   LWWPGALLTDSAAKAKALKDYQHVMAQLASW-EAEADDDVAATIKSVRQQLLNLNITGRL  116
            ++W GA L D    A      Q V++QLA   EA  D      +  + Q L  + +  R+
Sbjct  73   IYWLGAALVDLENTAVLETKRQQVLSQLAQMGEATNDSRYITKLAQLAQFLRGIKLGQRV  132

Query  117  PVKLDPDFVRVDENSNPPLVGDYTLYTVQRPVTITLLGAVSGAGQLPWQAGRSVTDYLQD  176
               LD D +R+ E+ N  + G Y L    RP TITL+GAVS  G +PWQ+  S  DYL+ 
Sbjct  133  MQPLDIDAIRITESYNAIIEGKYQLVLPPRPSTITLVGAVSQTGNMPWQSQASSKDYLKQ  192

Query  177  HPRLAGADKNNVMVITPEGETVVAPVALWNKRHVEPPPGSQLWLGFSAHVLPEKYADLND  236
               L  A+ + V +I P+G  +  P+A WN +  +  PG+ L++ FS+  L + YA+LN+
Sbjct  193  AGLLENAETSFVWIIQPDGNAIRQPIAYWNHQAQDIAPGATLFVEFSS--LFDGYANLNE  250

Query  237  QIVSVLTQR  245
             I+ +L  R
Sbjct  251  NIIELLKNR  259


>ref|YP_001184042.1| hypothetical protein Sputcn32_2522 [Shewanella putrefaciens CN-32]
 gb|ABP76243.1| protein of unknown function DUF1017 [Shewanella putrefaciens 
CN-32]
Length=261

 Score =  110 bits (274),  Expect = 3e-22, Method: Compositional matrix adjust.
 Identities = 66/189 (34%), Positives = 101/189 (53%), Gaps = 3/189 (1%)

Query  58   LWWPGALLTDSAAKAKALKDYQHVMAQLASW-EAEADDDVAATIKSVRQQLLNLNITGRL  116
            ++W GA L D    A      Q V++QLA   EA  D      +  + Q L  + +  R+
Sbjct  73   IYWLGAALVDLENTAVLETKRQQVLSQLAQMGEATNDSRYITKLAQLAQFLRGIKLGQRV  132

Query  117  PVKLDPDFVRVDENSNPPLVGDYTLYTVQRPVTITLLGAVSGAGQLPWQAGRSVTDYLQD  176
               LD D +R+ E+ N  + G Y L    RP TITL+GAV+  G +PWQ+  S  DYL+ 
Sbjct  133  MQPLDIDAIRITESYNAIIEGKYQLVLPPRPSTITLVGAVTQTGNMPWQSQTSSKDYLKQ  192

Query  177  HPRLAGADKNNVMVITPEGETVVAPVALWNKRHVEPPPGSQLWLGFSAHVLPEKYADLND  236
               L  A+ + V +I P+G  +  P+A WN +  +  PG+ L++ FS+  L + YA+LN+
Sbjct  193  AGLLENAETSFVWIIQPDGNAIRQPIAYWNYQAQDIAPGATLFVEFSS--LFDGYANLNE  250

Query  237  QIVSVLTQR  245
             I+ +L  R
Sbjct  251  NIIELLKNR  259


>ref|YP_737462.1| hypothetical protein Shewmr7_1406 [Shewanella sp. MR-7]
 gb|ABI42405.1| protein of unknown function DUF1017 [Shewanella sp. MR-7]
Length=261

 Score =  108 bits (271),  Expect = 6e-22, Method: Compositional matrix adjust.
 Identities = 64/189 (33%), Positives = 103/189 (54%), Gaps = 3/189 (1%)

Query  58   LWWPGALLTDSAAKAKALKDYQHVMAQLASWEAEADDD-VAATIKSVRQQLLNLNITGRL  116
            ++W GA L D    AK     QHV+ QLA+   +AD+    A +    Q L N+ +  R+
Sbjct  73   IYWLGATLLDLQNTAKLEATRQHVLQQLANMGQQADNSQYIAKLSKFAQFLRNIKLGQRV  132

Query  117  PVKLDPDFVRVDENSNPPLVGDYTLYTVQRPVTITLLGAVSGAGQLPWQAGRSVTDYLQD  176
               LD D +R+ +  NP + G + L    RP T+T++GAV+  G+  WQ+  S  +YL+ 
Sbjct  133  NQPLDLDLIRITDAYNPIIDGQFLLVLPPRPTTVTVVGAVAQTGEQEWQSRASSKNYLKQ  192

Query  177  HPRLAGADKNNVMVITPEGETVVAPVALWNKRHVEPPPGSQLWLGFSAHVLPEKYADLND  236
               L  A+ + V +I P+G  +  P+A WN +  +  PG+ L++ FSA  L + Y+ LN+
Sbjct  193  AGLLDNAENSFVWIIQPDGNAIRQPIAYWNHQAQDIAPGAVLFVEFSA--LFDGYSTLNN  250

Query  237  QIVSVLTQR  245
             I+ +L  R
Sbjct  251  NIIELLKNR  259


>ref|YP_733476.1| hypothetical protein Shewmr4_1341 [Shewanella sp. MR-4]
 gb|ABI38419.1| protein of unknown function DUF1017 [Shewanella sp. MR-4]
Length=261

 Score =  106 bits (264),  Expect = 3e-21, Method: Compositional matrix adjust.
 Identities = 63/189 (33%), Positives = 102/189 (53%), Gaps = 3/189 (1%)

Query  58   LWWPGALLTDSAAKAKALKDYQHVMAQLASWEAEADDD-VAATIKSVRQQLLNLNITGRL  116
            ++W GA L D    AK     QHV+ QLA+   +AD+    A +    Q L N+ +  R+
Sbjct  73   IYWLGATLLDLQNTAKLEATRQHVLQQLANMGQQADNSQYIAKLSKFAQFLRNIKLGQRV  132

Query  117  PVKLDPDFVRVDENSNPPLVGDYTLYTVQRPVTITLLGAVSGAGQLPWQAGRSVTDYLQD  176
               LD D +R+ +  NP + G + L    RP T+T++GAV+   +  WQ+  S  +YL+ 
Sbjct  133  NQPLDLDLIRITDAYNPIIDGQFLLVLPPRPTTVTVVGAVAQTSEQEWQSRASSKNYLKQ  192

Query  177  HPRLAGADKNNVMVITPEGETVVAPVALWNKRHVEPPPGSQLWLGFSAHVLPEKYADLND  236
               L  A+ + V +I P+G  +  P+A WN +  +  PG+ L++ FSA  L + Y+ LN+
Sbjct  193  AGLLDNAENSFVWIIQPDGNAIRQPIAYWNHQAQDIAPGAVLFVEFSA--LFDGYSTLNN  250

Query  237  QIVSVLTQR  245
             I+ +L  R
Sbjct  251  NIIELLKNR  259


>ref|YP_001051215.1| hypothetical protein Sbal_2862 [Shewanella baltica OS155]
 gb|ABN62346.1| protein of unknown function DUF1017 [Shewanella baltica OS155]
 gb|AEH14691.1| protein of unknown function DUF1017 [Shewanella baltica OS117]
Length=261

 Score =  105 bits (261),  Expect = 9e-21, Method: Compositional matrix adjust.
 Identities = 62/189 (32%), Positives = 101/189 (53%), Gaps = 3/189 (1%)

Query  58   LWWPGALLTDSAAKAKALKDYQHVMAQLASWEAEADDDV-AATIKSVRQQLLNLNITGRL  116
            ++W GA L D    A      Q V++QLA      DD    A +  + Q + +L +  R+
Sbjct  73   IYWLGAALLDIQNTAALETKRQQVLSQLAKMGQVKDDSAYIAKLAKLAQLIRSLQLGQRV  132

Query  117  PVKLDPDFVRVDENSNPPLVGDYTLYTVQRPVTITLLGAVSGAGQLPWQAGRSVTDYLQD  176
               LD D +R++++ N  + G + L    RP T+T+LGAV+  G L WQ  ++  DYL+ 
Sbjct  133  MQPLDIDLIRINDSYNSLIDGRFLLVLPPRPSTVTVLGAVAQTGDLAWQGQKTSKDYLKQ  192

Query  177  HPRLAGADKNNVMVITPEGETVVAPVALWNKRHVEPPPGSQLWLGFSAHVLPEKYADLND  236
               L  A+ + V +I P+G+ +  P+A WN +  +  PG+ L++ FS+  L + Y  LN+
Sbjct  193  AGLLDNAETSFVWIIQPDGKAIKQPIAYWNHQEQDIAPGASLYVEFSS--LFDDYTQLNE  250

Query  237  QIVSVLTQR  245
             IV +L  R
Sbjct  251  NIVELLRNR  259


>ref|YP_003332847.1| hypothetical protein Dd586_1258 [Dickeya dadantii Ech586]
 gb|ACZ76142.1| protein of unknown function DUF1017 [Dickeya dadantii Ech586]
Length=259

 Score =  105 bits (261),  Expect = 9e-21, Method: Compositional matrix adjust.
 Identities = 63/214 (29%), Positives = 101/214 (47%), Gaps = 13/214 (6%)

Query  47   QLVTQPQLRDRLWWPGALLTDSAAKAKALKDYQHVMAQLA----SWEAEADDDVAATIKS  102
            Q  ++    D + W  ALLT+ +   +A +  + + A+L     +W+ + D D A     
Sbjct  45   QFYSRITFADNVNWQTALLTNESVTQQAKEAGEKLQARLYQLQLAWQIDGDGDWAIAAWY  104

Query  103  VRQQLLNLNITGRLPVKLDPDFVRVDENSNPPLVGDYTLYTVQRPVTITLLGAVSGA---  159
            + Q L  +N  GR+   + PD +R+    N PLVG+YTLY         L G +S     
Sbjct  105  MAQALKQVNAVGRIRASIAPDIIRLSPRKNRPLVGNYTLYLSPYRQQFFLFGLISTGIDI  164

Query  160  ------GQLPWQAGRSVTDYLQDHPRLAGADKNNVMVITPEGETVVAPVALWNKRHVEPP  213
                    +  + G SV  Y+     L G D     +I+ +G     P+A+WN++H EP 
Sbjct  165  GTPNVFKDIDLKPGWSVEQYIGRRRFLPGGDNREGYLISGDGHWRKVPLAIWNRQHHEPA  224

Query  214  PGSQLWLGFSAHVLPEKYADLNDQIVSVLTQRVP  247
             G  L++GF   +LP+ ++ LN QI   L  R+P
Sbjct  225  AGETLFIGFDPSILPDGFSSLNAQIADYLANRIP  258


>gb|EDA46671.1| hypothetical protein GOS_1996150 [marine metagenome]
Length=292

 Score =  104 bits (260),  Expect = 1e-20, Method: Compositional matrix adjust.
 Identities = 61/189 (32%), Positives = 102/189 (53%), Gaps = 3/189 (1%)

Query  58   LWWPGALLTDSAAKAKALKDYQHVMAQLASWEAEADDD-VAATIKSVRQQLLNLNITGRL  116
            ++W GA L D+   A      Q ++ QLA+   + D+    A +    Q L N+ +  R+
Sbjct  104  VYWLGAALLDTKNTATLELIRQQILQQLANMGQQTDNSQYIAKLSKFAQFLRNIKLGQRV  163

Query  117  PVKLDPDFVRVDENSNPPLVGDYTLYTVQRPVTITLLGAVSGAGQLPWQAGRSVTDYLQD  176
               LD D +R+ +  NP + G + L    RP TIT++GAV+  G+  W +  S  DYL+ 
Sbjct  164  NQPLDLDLIRITDAYNPIMDGQFLLVLPPRPTTITVVGAVAQTGEQKWVSRTSSKDYLKQ  223

Query  177  HPRLAGADKNNVMVITPEGETVVAPVALWNKRHVEPPPGSQLWLGFSAHVLPEKYADLND  236
               L  A+ + V +I P+G+T+  P+A WN + ++  PG+ L++ FS   L + Y+ LN+
Sbjct  224  AGLLENAENSFVWIIQPDGKTIRQPIAYWNHQSMDIAPGAILFVEFSG--LFDDYSTLNN  281

Query  237  QIVSVLTQR  245
             I+ +L  R
Sbjct  282  NIIELLKNR  290


>ref|YP_869034.1| hypothetical protein Shewana3_1394 [Shewanella sp. ANA-3]
 gb|ABK47628.1| protein of unknown function DUF1017 [Shewanella sp. ANA-3]
Length=262

 Score =  104 bits (259),  Expect = 1e-20, Method: Compositional matrix adjust.
 Identities = 62/190 (32%), Positives = 101/190 (53%), Gaps = 4/190 (2%)

Query  58   LWWPGALLTDSAAKAKALKDYQHVMAQLASWEAEADDD--VAATIKSVRQQLLNLNITGR  115
            ++W GA L D    A      Q V+ +LA+   +AD++    A +    Q L N+ +  R
Sbjct  73   IYWLGAALLDIHNTAALETTRQQVLQKLANMGQQADNNSQYIAKLSKFAQFLRNIKLGQR  132

Query  116  LPVKLDPDFVRVDENSNPPLVGDYTLYTVQRPVTITLLGAVSGAGQLPWQAGRSVTDYLQ  175
            +   LD D +R+ +  NP + G + L    RP T+T++GAVS  G+  WQ+  S  +YLQ
Sbjct  133  VNQPLDLDLIRITDAYNPIIDGQFLLVLPPRPSTVTVVGAVSQTGEQAWQSQTSSREYLQ  192

Query  176  DHPRLAGADKNNVMVITPEGETVVAPVALWNKRHVEPPPGSQLWLGFSAHVLPEKYADLN  235
                L  A+ + V +I P+G  +  P+A WN +  +  PG+ L++ FS   L + Y+ LN
Sbjct  193  QAGLLENAENSFVWIIQPDGNAIRQPIAYWNHQAQDIAPGATLFVEFSG--LFDDYSTLN  250

Query  236  DQIVSVLTQR  245
            + I+ +L  R
Sbjct  251  NNIIELLKNR  260


>gb|EDA79313.1| hypothetical protein GOS_1936285 [marine metagenome]
Length=293

 Score =  103 bits (257),  Expect = 2e-20, Method: Compositional matrix adjust.
 Identities = 61/189 (32%), Positives = 102/189 (53%), Gaps = 3/189 (1%)

Query  58   LWWPGALLTDSAAKAKALKDYQHVMAQLASWEAEADDD-VAATIKSVRQQLLNLNITGRL  116
            ++W GA L D+   A      Q ++ +LAS   +AD+    A +    Q L N+ +  R+
Sbjct  105  VYWLGAALLDTKNTATLELIRQQILQRLASLGQQADNSQYIAKLSKFAQFLRNIKLGQRV  164

Query  117  PVKLDPDFVRVDENSNPPLVGDYTLYTVQRPVTITLLGAVSGAGQLPWQAGRSVTDYLQD  176
               LD D +R+ +  NP + G + L    RP TIT++GAV+  G+  W +  S  DYL+ 
Sbjct  165  NQPLDLDLIRITDAYNPIIDGQFLLVLPPRPTTITVVGAVAQTGEQKWVSRTSSKDYLKQ  224

Query  177  HPRLAGADKNNVMVITPEGETVVAPVALWNKRHVEPPPGSQLWLGFSAHVLPEKYADLND  236
               L  A+ + V +I P+G+ +  P+A WN + ++  PG+ L++ FS   L + Y+ LN+
Sbjct  225  AGLLENAENSFVWIIQPDGKAIRQPIAYWNHQSMDIAPGAILFVEFSG--LFDDYSTLNN  282

Query  237  QIVSVLTQR  245
             I+ +L  R
Sbjct  283  NIIELLKNR  291


>ref|YP_001555433.1| hypothetical protein Sbal195_3008 [Shewanella baltica OS195]
 gb|ABX50173.1| protein of unknown function DUF1017 [Shewanella baltica OS195]
 gb|ADT95167.1| protein of unknown function DUF1017 [Shewanella baltica OS678]
Length=261

 Score =  103 bits (257),  Expect = 3e-20, Method: Compositional matrix adjust.
 Identities = 62/189 (32%), Positives = 101/189 (53%), Gaps = 3/189 (1%)

Query  58   LWWPGALLTDSAAKAKALKDYQHVMAQLASWEAEADDDV-AATIKSVRQQLLNLNITGRL  116
            ++W GA L D   KA      Q V++QLA      DD    A +  + Q + +L +  R+
Sbjct  73   IYWLGAALLDIQNKAALETKRQQVLSQLAKMGQVKDDSAYIAKLAKLAQLIRSLQLGQRV  132

Query  117  PVKLDPDFVRVDENSNPPLVGDYTLYTVQRPVTITLLGAVSGAGQLPWQAGRSVTDYLQD  176
               LD D +R++++ N  + G + L    RP T+T+LGAV  +  L WQ  ++  DYL+ 
Sbjct  133  MQPLDIDLIRINDSYNSLIDGRFLLVLPPRPSTVTVLGAVEQSRDLAWQGQKTSKDYLKQ  192

Query  177  HPRLAGADKNNVMVITPEGETVVAPVALWNKRHVEPPPGSQLWLGFSAHVLPEKYADLND  236
               L  A+ + V +I P+G+ +  P+A WN +  +  PG+ L++ FS+  L + Y  LN+
Sbjct  193  AGLLDNAETSFVWIIQPDGKAIKQPIAYWNHQTKDIAPGATLYVEFSS--LFDDYTKLNE  250

Query  237  QIVSVLTQR  245
             IV +L  R
Sbjct  251  NIVELLRNR  259


>ref|ZP_07391524.1| protein of unknown function DUF1017 [Shewanella baltica OS183]
 gb|EFM15713.1| protein of unknown function DUF1017 [Shewanella baltica OS183]
 gb|AEG10767.1| protein of unknown function DUF1017 [Shewanella baltica BA175]
Length=261

 Score =  103 bits (256),  Expect = 3e-20, Method: Compositional matrix adjust.
 Identities = 61/189 (32%), Positives = 99/189 (52%), Gaps = 3/189 (1%)

Query  58   LWWPGALLTDSAAKAKALKDYQHVMAQLASWEAEADDDV-AATIKSVRQQLLNLNITGRL  116
            ++W GA L D    A      + +++QLA     ADD +  A +  + Q L  + +  R+
Sbjct  73   IYWLGAALLDIQNTAALEAKRKEILSQLAQMGQAADDSIYTAKLAKLAQFLRQIKLGQRV  132

Query  117  PVKLDPDFVRVDENSNPPLVGDYTLYTVQRPVTITLLGAVSGAGQLPWQAGRSVTDYLQD  176
               L  D +R++++ NP + G + L   QRP T+ ++GAV+ AG   W+   S  DYL  
Sbjct  133  MQPLHLDLIRINDSYNPLVDGRFELILPQRPTTVLVMGAVAKAGSFEWKVNASSKDYLAK  192

Query  177  HPRLAGADKNNVMVITPEGETVVAPVALWNKRHVEPPPGSQLWLGFSAHVLPEKYADLND  236
               L  AD + V +I P+G+ +  P+A WN +  +  PG+ L++ FS   L E Y+ LN 
Sbjct  193  AMPLENADNSFVWIIQPDGKALKQPIAYWNAQVQDIAPGAVLYVEFSD--LVEDYSTLNA  250

Query  237  QIVSVLTQR  245
             I+ +L  R
Sbjct  251  NIIELLRNR  259


>ref|ZP_08567183.1| YjbG polysaccharide synthesis protein [Shewanella sp. HN-41]
 gb|EGM69329.1| YjbG polysaccharide synthesis protein [Shewanella sp. HN-41]
Length=261

 Score =  103 bits (256),  Expect = 3e-20, Method: Compositional matrix adjust.
 Identities = 61/189 (32%), Positives = 96/189 (50%), Gaps = 3/189 (1%)

Query  58   LWWPGALLTDSAAKAKALKDYQHVMAQLASW-EAEADDDVAATIKSVRQQLLNLNITGRL  116
            ++W GA L D    A      Q V++QL    EA  D    A +  + Q +  + +  R+
Sbjct  73   IYWLGAALLDIKNTASLETKRQQVLSQLIQMGEATDDSHYIAQLAKLAQFIRKIKLGQRV  132

Query  117  PVKLDPDFVRVDENSNPPLVGDYTLYTVQRPVTITLLGAVSGAGQLPWQAGRSVTDYLQD  176
               LD D +R++++ N  L G + L    RP T+TLLGAV   G   WQA     +YL+ 
Sbjct  133  IQPLDIDLIRINQSFNAVLDGRFLLVLPPRPTTVTLLGAVEQMGSYEWQANIDSKEYLKQ  192

Query  177  HPRLAGADKNNVMVITPEGETVVAPVALWNKRHVEPPPGSQLWLGFSAHVLPEKYADLND  236
               L  A+ + + +I P+G+ +  P+A WN +  +  PG+ L++ FS+  L + Y  LN 
Sbjct  193  AGLLGNAETSTIWIIQPDGKAIKQPIAYWNHQEQDIAPGASLYVEFSS--LFDDYTQLNK  250

Query  237  QIVSVLTQR  245
             IV +L  R
Sbjct  251  NIVELLRNR  259


>gb|EDA62109.1| hypothetical protein GOS_1968443 [marine metagenome]
Length=268

 Score =  102 bits (254),  Expect = 5e-20, Method: Compositional matrix adjust.
 Identities = 61/189 (32%), Positives = 100/189 (52%), Gaps = 3/189 (1%)

Query  58   LWWPGALLTDSAAKAKALKDYQHVMAQLASWEAEADD-DVAATIKSVRQQLLNLNITGRL  116
            ++W GA L D    A      Q V+ QLA+   + D+ +  A +  + Q L  + +  R+
Sbjct  80   IYWLGASLLDIKNTAALEATRQQVLQQLATMGEQIDNGNYIAKLSKLAQFLRTIKLGQRI  139

Query  117  PVKLDPDFVRVDENSNPPLVGDYTLYTVQRPVTITLLGAVSGAGQLPWQAGRSVTDYLQD  176
               LD D +R+ +  NP + GD+ L    RP T+T++GAV+  G+ PWQ+  S  DY+  
Sbjct  140  NQPLDLDLIRITDAYNPVIDGDFLLVLPPRPTTVTVVGAVAQTGEQPWQSRASSKDYINQ  199

Query  177  HPRLAGADKNNVMVITPEGETVVAPVALWNKRHVEPPPGSQLWLGFSAHVLPEKYADLND  236
               L  A+ + V +I P+G  +  P+A WN +  +  PG+ L++ FS   L   +A LN+
Sbjct  200  AGLLDNAENSFVWIIQPDGNAIKQPIAYWNYQAQDIAPGAILFVEFSE--LFADHAKLNN  257

Query  237  QIVSVLTQR  245
             I+ +L  R
Sbjct  258  NIIELLKNR  266


>ref|YP_857379.1| putative periplasmic protein [Aeromonas hydrophila subsp. hydrophila 
ATCC 7966]
 gb|ABK37777.1| putative periplasmic protein [Aeromonas hydrophila subsp. hydrophila 
ATCC 7966]
Length=252

 Score =  101 bits (251),  Expect = 1e-19, Method: Compositional matrix adjust.
 Identities = 76/259 (29%), Positives = 130/259 (50%), Gaps = 23/259 (8%)

Query  3    KLQSYFIASVLYVMTPHAFAQGTVTIYLPGEQQTLSVGPVENVV---QLVTQPQLRD---  56
            K+ + F+ S+L  +T  A+A  T++I+  G+       PV+++    Q+     L D   
Sbjct  2    KIINIFLISIL--VTNPAWADATISIWWKGK-------PVKDLYYAKQITLSSALSDPAI  52

Query  57   ---RLWWPGALLTDSAAKAKALKDYQHVMAQLA----SWEAEADDDVAATIKSVRQQLLN  109
                 +WP   ++  A + +     Q ++A L      W    +  +A+    + QQL  
Sbjct  53   LSYDSYWPVGQISTPARQQELEHQRQVLLADLTVLSKMWSDMGEPSLASATLQLLQQLQQ  112

Query  110  LNITGRLPVKLDPDFVRVDENSNPPLV-GDYTLYTVQRPVTITLLGAVSGAGQLPWQAGR  168
            L +TGR  V +DPD  +     + P++ G Y LY   R   + + G V   G  P   G 
Sbjct  113  LELTGRFDVSVDPDVNQARAGVDAPILKGHYQLYLAPRHPEVQIAGLVKQIGGAPLLPGA  172

Query  169  SVTDYLQDHPRLAGADKNNVMVITPEGETVVAPVALWNKRHVEPPPGSQLWLGFSAHVLP  228
             + +Y +    L G D   V +++P G+    PVA+WN+RHVE  PG+ L++GFS  VLP
Sbjct  173  GLREYWKRKDILEGGDPAGVYLVSPSGKYDWFPVAIWNERHVEAMPGATLFVGFSPDVLP  232

Query  229  EKYADLNDQIVSVLTQRVP  247
            ++Y +LN++I+++   R+P
Sbjct  233  KQYQNLNERILTLFANRMP  251


>ref|ZP_08570204.1| SLBB-domain like (DUF1017) [Rheinheimera sp. A13L]
 gb|EGM78158.1| SLBB-domain like (DUF1017) [Rheinheimera sp. A13L]
Length=253

 Score = 98.2 bits (243),  Expect = 1e-18, Method: Compositional matrix adjust.
 Identities = 74/255 (29%), Positives = 119/255 (46%), Gaps = 14/255 (5%)

Query  1    MNKLQSYFIASVLYVMTPHAFAQGTVTIYLPGEQQTLSVGPVENVV--QLVTQPQLRDRL  58
            MN+L  +  A  +      + A   V +   G  Q     P   +V  QL T P L    
Sbjct  1    MNRLSCFLFALAIVFANTVSSATPVVLVEHQGNYQGFFDRPRLGLVVSQLNTSPSL----  56

Query  59   WWPGALL--TDSAAKAKALKDYQHVMAQLA----SWEAEADDDVAATIKSVRQQLLNLNI  112
            +WP A L   D   K K  +  + ++ QLA     ++ ++D  +AA+++ + + + +  +
Sbjct  57   YWPAAKLFKVDVETKLKLEQQRKELLNQLALLKHEFQQDSDSGMAASVEKLEKDISSWEL  116

Query  113  TGRLPVKLDPDFVRVDENSNPPL-VGDYTLYTVQRPVTITLLGAVSGAGQLPWQAGRSVT  171
             G + + LDPD VR  ++ NP L  G Y L    RP  + + G V      P +   SV 
Sbjct  117  AGNMNLALDPDRVRAKKSLNPLLSAGQYKLVVGARPTELQIEGLVD-EQMTPLRNAVSVD  175

Query  172  DYLQDHPRLAGADKNNVMVITPEGETVVAPVALWNKRHVEPPPGSQLWLGFSAHVLPEKY  231
             YL     L G   + V +I   G+  +A   LWNK H E  PG+ L++ F    LP+ +
Sbjct  176  SYLDTISILDGGSSSFVYIIPASGKISIAKTGLWNKHHQEVLPGTVLFIPFEQRHLPDVF  235

Query  232  ADLNDQIVSVLTQRV  246
            + +N+QIV +L  +V
Sbjct  236  SHINEQIVELLLHKV  250


>ref|NP_718714.1| polysaccharide synthesis-related protein [Shewanella oneidensis 
MR-1]
 gb|AAN56158.1|AE015753_4 polysaccharide synthesis-related protein [Shewanella oneidensis 
MR-1]
Length=261

 Score = 97.8 bits (242),  Expect = 1e-18, Method: Compositional matrix adjust.
 Identities = 58/189 (30%), Positives = 99/189 (52%), Gaps = 3/189 (1%)

Query  58   LWWPGALLTDSAAKAKALKDYQHVMAQLASWEAEADDD-VAATIKSVRQQLLNLNITGRL  116
            ++W GA L D+   A      Q V+ QLA    + D+    A +    Q L N+ +  R+
Sbjct  73   IYWLGAALLDTKNTAVLEVTRQQVLQQLADMGEQVDNSQYIAKLAKFAQFLRNIKLGQRV  132

Query  117  PVKLDPDFVRVDENSNPPLVGDYTLYTVQRPVTITLLGAVSGAGQLPWQAGRSVTDYLQD  176
               LD D +R+ +  NP + GD+ L    RP T++++GAV+  G+  W +  S  DY+  
Sbjct  133  NQPLDLDLIRITDAYNPVIDGDFLLVLPPRPTTVSVVGAVAQTGEQTWLSQASSKDYINQ  192

Query  177  HPRLAGADKNNVMVITPEGETVVAPVALWNKRHVEPPPGSQLWLGFSAHVLPEKYADLND  236
               L  A+ + V +I P+G  +  P+A WN +  +  PG+ L++ F+A  L + + +LN+
Sbjct  193  AGLLDNAENSFVWIIQPDGNAIKQPIAYWNHQAQDIAPGAILFVEFTA--LFDDHTELNN  250

Query  237  QIVSVLTQR  245
             I+ +L  R
Sbjct  251  NIIELLKNR  259


>ref|YP_001367076.1| hypothetical protein Shew185_2879 [Shewanella baltica OS185]
 gb|ABS09013.1| protein of unknown function DUF1017 [Shewanella baltica OS185]
Length=261

 Score = 94.0 bits (232),  Expect = 2e-17, Method: Compositional matrix adjust.
 Identities = 60/189 (31%), Positives = 93/189 (49%), Gaps = 3/189 (1%)

Query  58   LWWPGALLTDSAAKAKALKDYQHVMAQLAS-WEAEADDDVAATIKSVRQQLLNLNITGRL  116
            ++W GA L D    A      +  +++LA   EA  D    A +  + Q L  + +  R+
Sbjct  73   IYWLGAALLDIQNTAVLENKRKEALSELAKVGEASNDSIYIAKLAKLAQFLRQIKLGKRV  132

Query  117  PVKLDPDFVRVDENSNPPLVGDYTLYTVQRPVTITLLGAVSGAGQLPWQAGRSVTDYLQD  176
               LD D +R+ ++ NP L G + L    RP TI + GAV+  G   W+   S  DYL  
Sbjct  133  MQPLDLDLIRITDSYNPLLDGRFELVLPPRPTTIVVAGAVARTGSYEWKLNTSSKDYLAK  192

Query  177  HPRLAGADKNNVMVITPEGETVVAPVALWNKRHVEPPPGSQLWLGFSAHVLPEKYADLND  236
               L  +D + V +I P+G  +  P+A WN +  +  PG+ ++L FS   L E Y+ LN 
Sbjct  193  AQPLENSDGDFVWIIQPDGNAIKQPIAYWNAQAQDIAPGAVIYLEFSD--LLEDYSTLNA  250

Query  237  QIVSVLTQR  245
             I+ +L  R
Sbjct  251  NIIELLRNR  259


>gb|EGK28123.1| hypothetical protein SFK272_1496 [Shigella flexneri K-272]
 gb|EGK39941.1| hypothetical protein SFK227_0664 [Shigella flexneri K-227]
Length=55

 Score = 93.6 bits (231),  Expect = 3e-17, Method: Compositional matrix adjust.
 Identities = 44/46 (95%), Positives = 46/46 (100%), Gaps = 0/46 (0%)

Query  48  LVTQPQLRDRLWWPGALLTDSAAKAKALKDYQHVMAQLASWEAEAD  93
           +VTQPQLRDRLWWPGALLTDSAAKAKALKDYQHVMAQLASWEAEA+
Sbjct  1   MVTQPQLRDRLWWPGALLTDSAAKAKALKDYQHVMAQLASWEAEAE  46


>ref|YP_002357425.1| hypothetical protein Sbal223_1497 [Shewanella baltica OS223]
 gb|ACK46002.1| protein of unknown function DUF1017 [Shewanella baltica OS223]
Length=261

 Score = 93.6 bits (231),  Expect = 3e-17, Method: Compositional matrix adjust.
 Identities = 60/189 (31%), Positives = 93/189 (49%), Gaps = 3/189 (1%)

Query  58   LWWPGALLTDSAAKAKALKDYQHVMAQLAS-WEAEADDDVAATIKSVRQQLLNLNITGRL  116
            ++W GA L D    A      +  +++LA   EA  D    A +  + Q L  + +  R+
Sbjct  73   IYWLGAALLDIQNTAVLENKRKEALSELAKVGEASNDSIYIAKLAKLVQFLRQIKLGKRV  132

Query  117  PVKLDPDFVRVDENSNPPLVGDYTLYTVQRPVTITLLGAVSGAGQLPWQAGRSVTDYLQD  176
               LD D +R+ ++ NP L G + L    RP TI + GAV+  G   W+   S  DYL  
Sbjct  133  MQPLDLDLIRITDSYNPLLDGRFELVLPPRPTTIVVAGAVARTGSYEWKLNTSSKDYLAK  192

Query  177  HPRLAGADKNNVMVITPEGETVVAPVALWNKRHVEPPPGSQLWLGFSAHVLPEKYADLND  236
               L  +D + V +I P+G  +  P+A WN +  +  PG+ ++L FS   L E Y+ LN 
Sbjct  193  AQPLENSDGDFVWIIQPDGNAIKQPIAYWNAQAQDIAPGAVIYLEFSD--LLEDYSTLNA  250

Query  237  QIVSVLTQR  245
             I+ +L  R
Sbjct  251  NIIELLRNR  259


>ref|YP_455825.1| hypothetical protein SG2145 [Sodalis glossinidius str. 'morsitans']
 dbj|BAE75420.1| hypothetical protein [Sodalis glossinidius str. 'morsitans']
Length=164

 Score = 91.7 bits (226),  Expect = 1e-16, Method: Compositional matrix adjust.
 Identities = 44/103 (42%), Positives = 65/103 (63%), Gaps = 0/103 (0%)

Query  86   ASWEAEADDDVAATIKSVRQQLLNLNITGRLPVKLDPDFVRVDENSNPPLVGDYTLYTVQ  145
            A    + D ++A  +  VR QL  + ITGR  V LDPD++R+   +N  L G+Y++YT+ 
Sbjct  62   AELRGDNDGELADLVDRVRAQLAAMRITGRQFVPLDPDWIRLRSEANRRLSGEYSVYTLS  121

Query  146  RPVTITLLGAVSGAGQLPWQAGRSVTDYLQDHPRLAGADKNNV  188
            RP +I ++G  + AG  P+Q GR V +YLQ H R +GA+KN V
Sbjct  122  RPTSIQVVGVTTPAGPQPYQPGRDVAEYLQTHQRFSGAEKNVV  164


>ref|YP_004433358.1| hypothetical protein Glaag_1129 [Glaciecola agarilytica 4H-3-7+YE-5]
 gb|AEE22090.1| protein of unknown function DUF1017 [Glaciecola sp. 4H-3-7+YE-5]
Length=265

 Score = 90.5 bits (223),  Expect = 2e-16, Method: Compositional matrix adjust.
 Identities = 61/195 (31%), Positives = 101/195 (51%), Gaps = 6/195 (3%)

Query  58   LWWPGALL--TDSAAKAKALKDYQHVMAQLASWEAEADDDVAATIKSVRQ--QLLNL-NI  112
            ++WP A L  T  +  A      Q ++ +L +      +D  A + ++ Q  Q++N   +
Sbjct  70   IYWPAAALYETQESKVAPLHAQRQRLIEKLTTLHQRFANDDRALLSAIDQLTQVVNSWQL  129

Query  113  TGRLPVKLDPDFVRVDENSNPPLV-GDYTLYTVQRPVTITLLGAVSGAGQLPWQAGRSVT  171
              R P+K+D D  R+    NP L  GDYTL    R   I + GAV+    +  QA + V+
Sbjct  130  GKRSPIKIDLDLARIQPPKNPLLTEGDYTLSAKPRSNKIFITGAVNQTQVVAHQAYQDVS  189

Query  172  DYLQDHPRLAGADKNNVMVITPEGETVVAPVALWNKRHVEPPPGSQLWLGFSAHVLPEKY  231
             Y+    R+  A+++ V VI  +G  + AP A WNK+H E  PGS L++ F+  +   + 
Sbjct  190  HYVPASARIDKANQDYVYVIQADGRVIFAPTAYWNKQHQEVMPGSLLFVPFNTSLFHPEL  249

Query  232  ADLNDQIVSVLTQRV  246
            A++ND +VS+   R+
Sbjct  250  AEVNDLVVSLAKNRL  264


>ref|YP_203541.1| hypothetical protein VF_0158 [Vibrio fischeri ES114]
 gb|AAW84653.1| hypothetical protein VF_0158 [Vibrio fischeri ES114]
Length=253

 Score = 88.6 bits (218),  Expect = 7e-16, Method: Compositional matrix adjust.
 Identities = 61/249 (24%), Positives = 113/249 (45%), Gaps = 11/249 (4%)

Query  6    SYFIASVLYVMTPHAFAQGT--VTIYLPGEQQTLSVGPVENVVQLVTQPQLRDRLWWPGA  63
            S F+ S+L   +   +A  T  V++ LP +   L+      + Q++     +   +  GA
Sbjct  5    SSFVFSLLLSASTVTYASSTQAVSVTLPNQNLVLNYSQPVRLEQVILDANAQVNFYSLGA  64

Query  64   LLTDSAAKAKALKDYQHVMAQLASWEAE-----ADDDVAATIKSVRQQLLNLNITGRLPV  118
             L+D+  +        + + QL+S   E     A+++   +   +  QL +    GR+  
Sbjct  65   ALSDNQLQKDIDNLRNNSIEQLSSLSRETSLFSANNEFKRSATQIISQLEHHTFVGRIFS  124

Query  119  KLDPDFVRVDENSNPPLVGDYTLYTVQRPVTITLLGAVSGAGQL--PWQAGRSVTDYLQD  176
             LD D +R++E  NP L  DY L+   RP +++  GA+     +  P+    S+ DYL  
Sbjct  125  SLDLDLIRINEKLNPILNADYQLFVHPRPTSVSFFGAIDSESSISVPFIEHASIDDYLDS  184

Query  177  HPRLAGADKNNVMVITPEGETVVAPVALWNKRHVEPPPGSQLWLGFSAHVLPEKYADLND  236
             P  + AD + + VI P+G       ++W  +     PG+ +++      LP  ++ LN 
Sbjct  185  LPLSSTADTSTIYVIQPDGVVQTTEFSVWQNKPAYLAPGASVYIPLGG--LPSDFSSLNT  242

Query  237  QIVSVLTQR  245
             IV +L  +
Sbjct  243  SIVQLLRNK  251


>ref|ZP_01218708.1| hypothetical polysaccharide synthesis-related protein [Photobacterium 
profundum 3TCK]
 gb|EAS44622.1| hypothetical polysaccharide synthesis-related protein [Photobacterium 
profundum 3TCK]
Length=289

 Score = 88.6 bits (218),  Expect = 8e-16, Method: Compositional matrix adjust.
 Identities = 63/194 (32%), Positives = 99/194 (51%), Gaps = 16/194 (8%)

Query  57   RLWWPGALLTDSAAKAKALKDYQHV----MAQLAS-WEAEADDDVAATIKSVRQQLLNLN  111
            R++W GA L  +    +     QHV    + QLA+ W++E    V     ++ QQL  L 
Sbjct  105  RIYWTGAALFHAFPHPQ-----QHVVVAQLNQLATHWQSEQQQAVL----NLSQQLAQLM  155

Query  112  ITGRLPVKLDPDFVRVDENSNPPLVGDYTLYTVQRPVTITLLGAVSGAGQLPWQAGRSVT  171
               R+   LD D VR+++ SN  +  + TL    RP  I +LGA+       WQ      
Sbjct  156  TGERIFTSLDYDNVRLNKQSNTLITQNLTLILPPRPERILVLGALEKPVWTKWQTRLDAE  215

Query  172  DYLQDHPRLAGADKNNVMVITPEGETVVAPVALWNKRHVEPPPGSQLWLGFSAHVLPEKY  231
             YL+    L+ A+K++  VI P+G     P A WN+ H +  PG+ ++LGFS+  LP+ +
Sbjct  216  AYLKQSKPLSNANKSDAWVIQPDGTVEQHPTAYWNRDHHDIAPGAIVYLGFSS--LPDGF  273

Query  232  ADLNDQIVSVLTQR  245
              LN+ I+++L  R
Sbjct  274  ETLNEDIINLLRNR  287


>ref|YP_130871.1| polysaccharide synthesis-like protein [Photobacterium profundum 
SS9]
 emb|CAG21069.1| hypothetical polysaccharide synthesis-related protein [Photobacterium 
profundum SS9]
Length=293

 Score = 87.0 bits (214),  Expect = 2e-15, Method: Compositional matrix adjust.
 Identities = 54/189 (28%), Positives = 89/189 (47%), Gaps = 2/189 (1%)

Query  57   RLWWPGALLTDSAAKAKALKDYQHVMAQLASWEAEADDDVAATIKSVRQQLLNLNITGRL  116
            R++W GA L  +    +       +      W  E          ++ QQ+  L    R+
Sbjct  105  RIYWTGAALFQAFPHPQQQTVVDQINQLATYWHNEQQQPQQQAALNLSQQIEQLTTGERI  164

Query  117  PVKLDPDFVRVDENSNPPLVGDYTLYTVQRPVTITLLGAVSGAGQLPWQAGRSVTDYLQD  176
               LD D +R+++ +N  +  D TL    RP  I +LGA+       WQ       YL+ 
Sbjct  165  FTSLDYDDIRLNKQANTLITDDLTLILPPRPERILVLGALDKPIWAEWQTRLDAEAYLKQ  224

Query  177  HPRLAGADKNNVMVITPEGETVVAPVALWNKRHVEPPPGSQLWLGFSAHVLPEKYADLND  236
               L+ A+ +N  VI P+G     P+A WN+ H +  PG+ ++LGFS+  LP+ +  LN+
Sbjct  225  AKSLSNANNSNAWVIQPDGTVEQHPIAYWNRDHHDIAPGAIVYLGFSS--LPKGFETLNE  282

Query  237  QIVSVLTQR  245
             I+++L  R
Sbjct  283  DIINLLRNR  291


>ref|YP_002154921.1| hypothetical protein VFMJ11_0150 [Vibrio fischeri MJ11]
 gb|ACH64877.1| conserved hypothetical protein [Vibrio fischeri MJ11]
Length=253

 Score = 86.7 bits (213),  Expect = 3e-15, Method: Compositional matrix adjust.
 Identities = 60/249 (24%), Positives = 113/249 (45%), Gaps = 11/249 (4%)

Query  6    SYFIASVLYVMTPHAFAQGT--VTIYLPGEQQTLSVGPVENVVQLVTQPQLRDRLWWPGA  63
            S F+ S+L   +   +A  T  V++ LP +   L+      + Q++     +   +  GA
Sbjct  5    SSFVFSLLLSASTVTYASSTQAVSVTLPNQNLVLNYSQPVRLEQVILDANAQMNFYSLGA  64

Query  64   LLTDSAAKAKALKDYQHVMAQLASWEAE-----ADDDVAATIKSVRQQLLNLNITGRLPV  118
            +L+D+  +        + + QL+S   E     A+++   +   +  QL +    GR+  
Sbjct  65   VLSDNQLQKDIDNLRNNSIEQLSSLSRETSLFSANNEFKRSATQIISQLEHHTFVGRIFS  124

Query  119  KLDPDFVRVDENSNPPLVGDYTLYTVQRPVTITLLGAVSGAGQL--PWQAGRSVTDYLQD  176
             LD D +R++E  NP L  DY L+   RP +++  GA+     +  P+     + DYL  
Sbjct  125  PLDLDLIRINEKLNPILNADYQLFVHPRPTSVSFFGAIDSESSISVPFIEHAGIDDYLGS  184

Query  177  HPRLAGADKNNVMVITPEGETVVAPVALWNKRHVEPPPGSQLWLGFSAHVLPEKYADLND  236
             P  + AD + + VI P+G       ++W  +     PG+ +++      LP  ++ LN 
Sbjct  185  LPLSSTADTSTIYVIQPDGVVQTTEFSVWQNKPAYLAPGASVYIPLGG--LPSDFSSLNT  242

Query  237  QIVSVLTQR  245
             IV +L  +
Sbjct  243  SIVQLLRNK  251


>ref|YP_662763.1| hypothetical protein Patl_3203 [Pseudoalteromonas atlantica T6c]
 gb|ABG41709.1| protein of unknown function DUF1017 [Pseudoalteromonas atlantica 
T6c]
Length=247

 Score = 81.3 bits (199),  Expect = 1e-13, Method: Compositional matrix adjust.
 Identities = 55/210 (26%), Positives = 102/210 (48%), Gaps = 19/210 (9%)

Query  48   LVTQPQLRDRLWWPGALL------TDSAAKAKALKDYQHVMAQLASWEAEADDDVAATIK  101
            ++++  L   ++WP A L      T   A+  +L +   + + L S   E         K
Sbjct  44   VLSKLDLNTNIYWPSAALFIPNDVTLERARRSSLSNLNILASHLPSDTHEQ--------K  95

Query  102  SVRQQLLN----LNITGRLPVKLDPDFVRVDENSNPPLVGDYTLYTV-QRPVTITLLGAV  156
            SV   L+N     ++  RL VK+D D  R+ E  NP     Y + ++ +R  ++ ++GAV
Sbjct  96   SVFTNLINELEHWHLANRLSVKIDYDLARISEAHNPQFDNGYYVVSLHERQESVEIIGAV  155

Query  157  SGAGQLPWQAGRSVTDYLQDHPRLAGADKNNVMVITPEGETVVAPVALWNKRHVEPPPGS  216
            +   +        V++YL      + A+++ V++I  +G  + AP A WN++H E  PGS
Sbjct  156  TNTVETKHLPHTDVSEYLNSANIASFANRDTVIIIQADGRIIEAPTAYWNRQHQEVMPGS  215

Query  217  QLWLGFSAHVLPEKYADLNDQIVSVLTQRV  246
             +++ F   +   +Y +LN  IV++   R+
Sbjct  216  IIYVPFKESLFTPQYKELNQLIVTLAKNRL  245


>ref|ZP_06054113.1| hypothetical polysaccharide synthesis-related protein [Grimontia 
hollisae CIP 101886]
 gb|EEY71428.1| hypothetical polysaccharide synthesis-related protein [Grimontia 
hollisae CIP 101886]
Length=271

 Score = 80.9 bits (198),  Expect = 2e-13, Method: Compositional matrix adjust.
 Identities = 57/189 (30%), Positives = 87/189 (46%), Gaps = 4/189 (2%)

Query  60   WPGALLTDSAAKAKALKDYQHVMAQLASWEAEADDD--VAATIKSVRQQLLNLNITGRLP  117
            W GA L +     +  +    V   L +  AE  DD   A  + ++   + N     RLP
Sbjct  84   WQGAALFNHVLSPETAQLLDSVKQDLKTLRAEWADDPTYANAVDALIDYVENATFRERLP  143

Query  118  VKLDPDFVRVDENSNPPLVGDYTLYTVQRPVTITLLGAVSGAGQLPWQAGRSVTDYLQDH  177
            + LD D      ++NP + G  TL    R   ++++GAV+    LP+ A  S  +YL   
Sbjct  144  LPLDEDHYLAGSHTNPLITGKVTLILPSRAKQVSVIGAVTQPHTLPFTALTSAREYLTQS  203

Query  178  PRLAGADKNNVMVITPEGETVVAPVALWNKRHVEPPPGSQLWLGFSAHVLPEKYADLNDQ  237
            P L     ++V VI+P GE  +   A WN +H    PG+ +++ F    LP   A LND 
Sbjct  204  PTLNRFGISDVAVISPNGELAIHHTAYWNAQHQNVAPGAVIFVPFQR--LPFGLASLNDT  261

Query  238  IVSVLTQRV  246
            +  +L  RV
Sbjct  262  LPRLLQHRV  270


>gb|EBT31304.1| hypothetical protein GOS_7294385 [marine metagenome]
Length=115

 Score = 80.1 bits (196),  Expect = 3e-13, Method: Compositional matrix adjust.
 Identities = 42/114 (36%), Positives = 67/114 (58%), Gaps = 2/114 (1%)

Query  132  NPPLVGDYTLYTVQRPVTITLLGAVSGAGQLPWQAGRSVTDYLQDHPRLAGADKNNVMVI  191
            NP + G + L    RP TIT++GAV+  G+  W +G S  DYLQ    L  A+ + V +I
Sbjct  2    NPIINGQFLLVLPPRPTTITVVGAVAQTGEQSWVSGVSSKDYLQQAGLLENAENSFVWII  61

Query  192  TPEGETVVAPVALWNKRHVEPPPGSQLWLGFSAHVLPEKYADLNDQIVSVLTQR  245
             P+G+ +  P+A WN + ++  PG+ L++ FS   L + Y+ LN+ I+ +L  R
Sbjct  62   QPDGKAIRQPIAYWNHQSMDIAPGAILFVEFSG--LFDDYSTLNNNIIELLKNR  113


>ref|YP_002261809.1| exported protein [Aliivibrio salmonicida LFI1238]
 emb|CAQ77958.1| exported protein [Aliivibrio salmonicida LFI1238]
Length=254

 Score = 80.1 bits (196),  Expect = 3e-13, Method: Compositional matrix adjust.
 Identities = 57/196 (29%), Positives = 101/196 (51%), Gaps = 16/196 (8%)

Query  62   GALLTDSAAKAKALKDYQHVMAQLA---------SWEAEADDDVAATIKSVRQQLLNLNI  112
            G +L+D   K   ++  + +  QL+         SW  E+   + +++  +  QL  L+ 
Sbjct  63   GLILSDDEKKNTVIQQQEKLSQQLSKLGEYSPFLSW-TESTYQLNSSL--LINQLAQLSF  119

Query  113  TGRLPVKLDPDFVRVDENSNPPLVGDYTLYTVQRPVTITLLG-AVSGAGQ-LPWQAGRSV  170
              RL + LD D +R+ + +NP L G ++L+  +RP TIT+LG  +S   Q L +    SV
Sbjct  120  VSRLFIPLDIDEIRIKKENNPLLSGQFSLFVPERPTTITVLGLTLSPKPQTLSYIENGSV  179

Query  171  TDYLQDHPRLAGADKNNVMVITPEGETVVAPVALWNKRHVEPPPGSQLWLGFSAHVLPEK  230
             DYL +    + A+ + V VI P+G   +A    W K  V   PG+ +++GF+   LP+ 
Sbjct  180  KDYLHNVDVSSQANTSQVYVIQPDGVVQIASNNQWQKNTVSIAPGATIFIGFNE--LPDS  237

Query  231  YADLNDQIVSVLTQRV  246
             + ++  I+ +L  +V
Sbjct  238  LSSIHQDIIQLLRNKV  253


>gb|ECC92702.1| hypothetical protein GOS_5462754 [marine metagenome]
Length=116

 Score = 76.3 bits (186),  Expect = 5e-12, Method: Compositional matrix adjust.
 Identities = 40/114 (35%), Positives = 66/114 (57%), Gaps = 2/114 (1%)

Query  132  NPPLVGDYTLYTVQRPVTITLLGAVSGAGQLPWQAGRSVTDYLQDHPRLAGADKNNVMVI  191
            NP + G + L    RP TIT++GAV+  G+  W +  S  DYL+    L  A+ + V +I
Sbjct  3    NPIIDGQFLLVLPPRPTTITVVGAVAQTGEQKWVSRTSSKDYLKQAGLLENAENSFVWII  62

Query  192  TPEGETVVAPVALWNKRHVEPPPGSQLWLGFSAHVLPEKYADLNDQIVSVLTQR  245
             P+G+ +  P+A WN + ++  PG+ L++ FS   L + Y+ LN+ I+ +L  R
Sbjct  63   QPDGKAIRQPIAYWNHQSMDIAPGAILFVEFSG--LFDDYSTLNNNIIELLKNR  114


>gb|EBT37533.1| hypothetical protein GOS_7284195 [marine metagenome]
Length=127

 Score = 73.9 bits (180),  Expect = 2e-11, Method: Compositional matrix adjust.
 Identities = 43/125 (34%), Positives = 63/125 (50%), Gaps = 2/125 (1%)

Query  123  DFVRVDENSNPPL-VGDYTLYTVQRPVTITLLGAVSGAGQLPWQAGRSVTDYLQDHPRLA  181
            D VR  ++ NP L  G Y L    RP  + L G V+    +      SV  YL     L 
Sbjct  1    DIVRAKKSLNPLLSAGQYKLLVNVRPTVVQLEGLVA-EKNIALADAVSVVHYLDSVSVLD  59

Query  182  GADKNNVMVITPEGETVVAPVALWNKRHVEPPPGSQLWLGFSAHVLPEKYADLNDQIVSV  241
            G   + + +IT  G+ ++A   LWN  + E  PGS L++ F    LPE ++D+N+QIV +
Sbjct  60   GGSSSYLYIITAAGKVLIAKTGLWNNTYQEVSPGSLLFVPFEQRFLPEAFSDINEQIVEL  119

Query  242  LTQRV  246
            L  +V
Sbjct  120  LLHKV  124


>ref|YP_004565064.1| hypothetical protein VAA_02509 [Vibrio anguillarum 775]
 gb|ABI93956.1| WbfC [Listonella anguillarum]
 gb|AEH32022.1| Hypothetical exported protein [Vibrio anguillarum 775]
Length=289

 Score = 73.9 bits (180),  Expect = 2e-11, Method: Compositional matrix adjust.
 Identities = 64/256 (25%), Positives = 114/256 (44%), Gaps = 16/256 (6%)

Query  4    LQSYFIASVLYVMTPHAFAQGTVTIYLPGEQQ---TLSVGPVENVVQLV-----TQPQLR  55
            L  +F+  VL  ++P A++     + +        +LS   V  V +LV        QL 
Sbjct  38   LMKHFLLGVLCALSPAAYSAPASVVEVRSADTPALSLSFASVPRVDELVINALNASAQLP  97

Query  56   DRLWWPGALLTDSAAKAKALKDYQHVMAQLASWEAEADDDVAATIKSVRQQLLNLNITGR  115
              + W  + L D ++     +  Q V+  +   E+ A        + +++QL       R
Sbjct  98   ADIDWLSSALFDVSSP---YQKKQQVLLAITHQESLAASAHKKRWRQLKEQLRARTFAQR  154

Query  116  LPVKLDPDFVRVDENSNPPLVGDYTLYTVQRPVTITLLGAVSGAGQLPWQAGRSVTDYLQ  175
            +   LDPD  R+  + NP L G + L     P T+++LG V  AG + W+       Y++
Sbjct  155  IFTPLDPDITRITASQNPKLQGQWLLSLNALPTTVSVLGNVKQAGDMAWKPRTDAGHYVR  214

Query  176  DHPRLAGADKNNVMVITPEGETVVAPVALWNKRHVEPPPGSQLWLGF---SAHVLPE-KY  231
                LA  D + V VI P+G   +  +A WN    +  PG+ L++ F   +  + P+   
Sbjct  215  S-AGLAETDISQVWVIQPDGHASLHDIAYWNHDFQDIAPGATLYVPFPIETTSLYPQYSL  273

Query  232  ADLNDQIVSVLTQRVP  247
             ++ND +V +L  ++P
Sbjct  274  HNVNDIVVELLRNQLP  289


>ref|ZP_05879679.1| hypothetical protein VFA_003816 [Vibrio furnissii CIP 102972]
 gb|EEX39519.1| hypothetical protein VFA_003816 [Vibrio furnissii CIP 102972]
Length=220

 Score = 72.4 bits (176),  Expect = 6e-11, Method: Compositional matrix adjust.
 Identities = 55/166 (33%), Positives = 75/166 (45%), Gaps = 3/166 (1%)

Query  59   WWPGALLTDSAAKAKALKDYQHVMAQLASWEAEADDDVAATIKSVRQQLLNLNITGRLPV  118
            +  G  L   A   KA + Y  V A+L++        VAA  K +  QL       R  +
Sbjct  34   YHEGYRLFSGAKADKAQELYAGVKARLSALLNNDTYRVAA--KQLLTQLSGYQYGYREKL  91

Query  119  KLDPDFVRVDENSNPPLVGDYTLYTVQRPVTITLLGAVSGAGQLPWQAGRSVTDYLQDHP  178
             LD D VR+    NP L G + L   QRP +I L G +S   Q+ + A  +V DY+    
Sbjct  92   NLDVDAVRLKPEFNPLLPGQFQLELAQRPNSIALFG-LSEQQQMTFNANFTVADYIARSR  150

Query  179  RLAGADKNNVMVITPEGETVVAPVALWNKRHVEPPPGSQLWLGFSA  224
                   ++  VI+PEGE      A WN  H    PGS L +GF+A
Sbjct  151  SPHKKHHSDAWVISPEGEITHVGYAYWNNAHTHLKPGSALLVGFNA  196


>ref|YP_154958.1| hypothetical protein IL0568 [Idiomarina loihiensis L2TR]
 gb|AAV81409.1| Fusion of WbfC- and WbfB-like uncharacterized domains involved 
in polysaccharide synthesis [Idiomarina loihiensis L2TR]
Length=955

 Score = 72.4 bits (176),  Expect = 6e-11, Method: Compositional matrix adjust.
 Identities = 53/198 (26%), Positives = 93/198 (46%), Gaps = 8/198 (4%)

Query  59   WWPGALLTDSAAKAKALKDYQHVMAQLAS----WEAEADDDVAATIKSVRQQLLNLNITG  114
            +WP   L  +  KA+  +    ++AQL      W++  + + A +   ++ Q+ +  +  
Sbjct  51   YWPLVRLVKTDDKAEIEQQRNQILAQLTELEQYWQSRRETEKAQSAALLKSQVKSWQLGK  110

Query  115  RLPVKLDPDFVRVDENSNPPL-VGDYTLYTVQRPVTITLLGAVSGAGQLPWQAGRSVTDY  173
            +L  ++  +  R +  SNP L  G+Y LY  +RP ++ + G VS  G   + +G++V D+
Sbjct  111  QLWGQISIENARTELASNPLLPAGEYKLYVPERPDSVHVYGVVSTPGDYRYASGKTVADW  170

Query  174  LQ---DHPRLAGADKNNVMVITPEGETVVAPVALWNKRHVEPPPGSQLWLGFSAHVLPEK  230
            L    D   L  A   +  V   + +   A  A + +   E  PG  LWLGF  + LP K
Sbjct  171  LNSIDDSEGLGAAYNKSQAVRIRQNQQQTADWAYYKQSDAELLPGDILWLGFEPNQLPPK  230

Query  231  YADLNDQIVSVLTQRVPD  248
            +  LN  I  +L   V +
Sbjct  231  FESLNADIRDLLMHFVAN  248


>gb|ADT85366.1| hypothetical periplasmic protein [Vibrio furnissii NCTC 11218]
Length=252

 Score = 72.4 bits (176),  Expect = 6e-11, Method: Compositional matrix adjust.
 Identities = 67/225 (29%), Positives = 98/225 (43%), Gaps = 9/225 (4%)

Query  6    SYFIASVLYVMTPHAFAQGTVTIYLPGEQQTLSVG-PVE-NVVQLVTQPQLRDR----LW  59
            ++ +  +  V +  A+A    T+ LP +Q  L    PV  + V L  Q Q   +     +
Sbjct  7    TWLLLCLTSVSSTSAWANTPTTVTLPMQQVILQYNQPVRLDRVLLDAQQQANQKNHLNTY  66

Query  60   WPGALLTDSAAKAKALKDYQHVMAQLASWEAEADDDVAATIKSVRQQLLNLNITGRLPVK  119
              G  L   A   KA + Y  V  +L++        +AA  K +  QL       R  + 
Sbjct  67   HEGYRLFSGAKADKAQELYASVKERLSTLLNNETYRIAA--KQLLTQLSGYQYGYREKLN  124

Query  120  LDPDFVRVDENSNPPLVGDYTLYTVQRPVTITLLGAVSGAGQLPWQAGRSVTDYLQDHPR  179
            LD D VR+    NP L G + L   QRP +I L G +S   Q+ + A  +V DY+     
Sbjct  125  LDVDAVRLKPEFNPLLPGQFQLELAQRPNSIALFG-LSEQQQMTFNANFTVADYIARSRS  183

Query  180  LAGADKNNVMVITPEGETVVAPVALWNKRHVEPPPGSQLWLGFSA  224
                  ++  VI+PEGE      A WN  H    PGS L +GF+A
Sbjct  184  PHKKHHSDAWVISPEGEITHVGYAYWNNAHTHLKPGSALLVGFNA  228


>ref|ZP_05883360.1| polysaccharide synthesis-related protein [Vibrio metschnikovii 
CIP 69.14]
 gb|EEX35778.1| polysaccharide synthesis-related protein [Vibrio metschnikovii 
CIP 69.14]
Length=274

 Score = 70.1 bits (170),  Expect = 3e-10, Method: Compositional matrix adjust.
 Identities = 52/178 (29%), Positives = 82/178 (46%), Gaps = 15/178 (8%)

Query  80   HVMAQLASWEAEADDDVAATIKSVRQQLLNLNITGRLPVKLDPDFVRVDENSNPPLVGDY  139
             V+ QLA  +     D A   + +R QL       R  + LDPD +RV +  NP L G +
Sbjct  100  QVLKQLALQQDNISADQARAWEQLRNQLRRSEFAQREFIPLDPDVIRVVDRHNPLLKGHF  159

Query  140  TLY--TVQRPVTITLLGAVSGAGQLPWQAGRSVTDYLQD----HPRLAGADKNNVMVITP  193
             L+   +     +++ GAV+   +  WQA  S  DY Q     +PRL+      VMVI P
Sbjct  160  ALHLPLLSDQTNVSVWGAVNEPTRFEWQANFSAKDYAQQAQWINPRLSA-----VMVIQP  214

Query  194  EGETVVAPVALWNKRHVEPPPGSQLWLGFSAHVLP----EKYADLNDQIVSVLTQRVP  247
             G+    PV  W  + +   PG+ +++ F+  +L      +    N Q+V +L   +P
Sbjct  215  NGDVQSHPVGYWQSQPLPVQPGAIIYVPFTRSLLASLSRSELDQTNQQVVELLRHLLP  272


>ref|YP_002415894.1| hypothetical protein VS_0210 [Vibrio splendidus LGP32]
 emb|CAV17242.1| Conserved hypothetical protein [Vibrio splendidus LGP32]
Length=313

 Score = 68.9 bits (167),  Expect = 8e-10, Method: Compositional matrix adjust.
 Identities = 71/277 (25%), Positives = 121/277 (43%), Gaps = 45/277 (16%)

Query  8    FIASVLYVMTPHAFAQGTVTIYLPGEQQTLSVGPVENVVQLVTQPQLRDRL-------WW  60
            ++++   V+TP   +    TI LP +  TL       ++Q++        +       ++
Sbjct  42   YVSAASNVVTPDGSSLMKTTIELPLQGVTLEYKANVRLLQVLDDANASSNVNDNSSIGYF  101

Query  61   P-GALLTD--SAAKAKALKDYQ----HVMAQLASWEAEADDDVAATIKSVRQQLLNLNIT  113
            P  A L D  +A   KA KD +    +V  QL ++  E  +      K V+QQL +    
Sbjct  102  PLSAQLFDKTNAESNKANKDIEAKKRNVFNQLDAFSVEEPE-----AKLVKQQLASFQYL  156

Query  114  GRLPVKLDPDFVRVDENSNPPLVGD---------------------YTLYTVQRPVTITL  152
             R+ ++LD + V    + NP LV                       ++LY  QRP +I L
Sbjct  157  NRVFIELDRNAVISQSDKNPLLVSSSHTNKPSSAAIKRASTSQTQAFSLYLPQRPTSIQL  216

Query  153  LGAVSGAGQLPWQAGRSVTDYLQDHPR-LAG--ADKNNVMVITPEGETVVAPVALWNKRH  209
            +GA+  +  +      ++ DYL   P    G  ADK+   V+ P+G       A WN++ 
Sbjct  217  MGAMKASVTMNLIEHGTLNDYLDALPNGFIGESADKSVAYVVQPDGVVRTIQYAYWNEQS  276

Query  210  VEPPPGSQLWLGFSAHVLPEKYADLNDQIVSVLTQRV  246
                PG+ +++ F  + LP +Y+ LN  IV +L  +V
Sbjct  277  AYFAPGAIVFMAF--YSLPSEYSTLNQDIVDLLRHKV  311


>ref|ZP_01982747.1| conserved hypothetical protein [Vibrio cholerae 623-39]
 gb|EDL72570.1| conserved hypothetical protein [Vibrio cholerae 623-39]
Length=256

 Score = 68.6 bits (166),  Expect = 1e-09, Method: Compositional matrix adjust.
 Identities = 50/173 (28%), Positives = 80/173 (46%), Gaps = 5/173 (2%)

Query  79   QHVMAQLASWEAEADDDVAATIKSVRQQLLNLNITGRLPVKLDPDFVRVDENSNPPLVGD  138
            QHV+  LA  ++ A     A  +++R QL       R+   +DPD+ R+    NP L G 
Sbjct  84   QHVLLVLAHQQSAAPMAQKALWETLRDQLRLSAFAKRIFTPVDPDWTRLAAQDNPRLNGQ  143

Query  139  YTLYTVQRPVTITLLGAVSGAGQLPWQAGRSVTDYLQDHPRLAGADKNNVMVITPEGETV  198
            + L    R   +++ GAV   G + W+  +S  DY Q    L     + ++VI P+GE  
Sbjct  144  WLLTLNPRVAEVSVYGAVKKPGDVTWRGRQSAKDYAQA-AGLVDEKISEIVVIQPDGEVQ  202

Query  199  VAPVALWNKRHVEPPPGSQLWLGFSAH---VLPEK-YADLNDQIVSVLTQRVP  247
               VA WN    E  PG+ +++    H     P+    D N  ++ +L  R+P
Sbjct  203  KHSVAYWNMTFAEVAPGAIVYVPMPLHDSSFYPDTPSTDANQLVIELLRNRLP  255


>ref|ZP_06053682.1| hypothetical polysaccharide synthesis-related protein [Grimontia 
hollisae CIP 101886]
 gb|EEY70997.1| hypothetical polysaccharide synthesis-related protein [Grimontia 
hollisae CIP 101886]
Length=277

 Score = 68.6 bits (166),  Expect = 1e-09, Method: Compositional matrix adjust.
 Identities = 70/263 (26%), Positives = 111/263 (42%), Gaps = 31/263 (11%)

Query  9    IASVLYVMTPHAFAQG-----TVTIYLPGEQQ-TLSVGPVENVVQLV-------------  49
            I    +V+TPHA         TVT  +  ++  +LS      + QLV             
Sbjct  20   IVLFCFVLTPHAAIATEETAVTVTSSVDADKHFSLSFNNAPRISQLVSEGASVIRAHISE  79

Query  50   TQPQLRDRLWWPGA-LLTDSA-----AKAKALKDYQHVMAQLASWEAEADDDVAATIKSV  103
            T+    D ++W GA L +D       A + ++ +  + +A L  W +++    A  + S+
Sbjct  80   TKAHSTDSIYWTGAGLFSDDEDTSLNASSSSVINKLNKLADL--WYSDSKKRDA--VLSL  135

Query  104  RQQLLNLNITGRLPVKLDPDFVRVDENSNPPLVGDYTLYTVQRPVTITLLGAVSGAGQLP  163
            R  +       RL V LD D        NP + G +      RP  I + GAV+   ++P
Sbjct  136  RDFIAASTFKPRLTVTLDEDVYLAGTQQNPLMKGRFYFQLPSRPTHIWMTGAVAKTQKIP  195

Query  164  WQAGRSVTDYLQDHPRLAGADKNNVMVITPEGETVVAPVALWNKRHVEPPPGSQLWLGFS  223
            + A    +DYL +   L     +NV VI P GE     V+ WN +     PG+ +++ F 
Sbjct  196  FSAPFFASDYLNNTTTLNTFGISNVSVIQPNGELETHHVSYWNHQPAGLAPGAIIFVPFQ  255

Query  224  AHVLPEKYADLNDQIVSVLTQRV  246
               LP     LN +I  +L  RV
Sbjct  256  H--LPTDLDVLNQEIPRLLQHRV  276


>ref|ZP_01950873.1| WbfC protein [Vibrio cholerae 1587]
 gb|EAY32675.1| WbfC protein [Vibrio cholerae 1587]
Length=256

 Score = 68.2 bits (165),  Expect = 1e-09, Method: Compositional matrix adjust.
 Identities = 50/173 (28%), Positives = 80/173 (46%), Gaps = 5/173 (2%)

Query  79   QHVMAQLASWEAEADDDVAATIKSVRQQLLNLNITGRLPVKLDPDFVRVDENSNPPLVGD  138
            QHV+  LA  ++ A     A  +++R QL       R+   +DPD+ R+    NP L G 
Sbjct  84   QHVLLVLAHQQSAAPMAQKALWETLRDQLRLSAFAKRIFTPVDPDWTRLAAQDNPRLNGQ  143

Query  139  YTLYTVQRPVTITLLGAVSGAGQLPWQAGRSVTDYLQDHPRLAGADKNNVMVITPEGETV  198
            + L    R   +++ GAV   G + W+  +S  DY Q    L     + ++VI P+GE  
Sbjct  144  WLLTLNPRVAEVSVYGAVKKPGDVTWRGRQSAKDYAQA-AGLVDEQISEIVVIQPDGEVQ  202

Query  199  VAPVALWNKRHVEPPPGSQLWLGFSAH---VLPEK-YADLNDQIVSVLTQRVP  247
               VA WN    E  PG+ +++    H     P+    D N  ++ +L  R+P
Sbjct  203  KHSVAYWNMTFAEVAPGAIVYVPMPLHDSSFYPDTPSTDANQLVIELLRNRLP  255


>ref|ZP_03348517.1| hypothetical protein Salmoneentericaenterica_23372 [Salmonella 
enterica subsp. enterica serovar Typhi str. E00-7866]
Length=33

 Score = 67.0 bits (162),  Expect = 2e-09, Method: Compositional matrix adjust.
 Identities = 33/33 (100%), Positives = 33/33 (100%), Gaps = 0/33 (0%)

Query  70   AKAKALKDYQHVMAQLASWEAEADDDVAATIKS  102
            AKAKALKDYQHVMAQLASWEAEADDDVAATIKS
Sbjct  1    AKAKALKDYQHVMAQLASWEAEADDDVAATIKS  33


>ref|ZP_06154782.1| protein of unknown function DUF1017 [Photobacterium damselae 
subsp. damselae CIP 102761]
 gb|EEZ40479.1| protein of unknown function DUF1017 [Photobacterium damselae 
subsp. damselae CIP 102761]
Length=272

 Score = 67.0 bits (162),  Expect = 3e-09, Method: Compositional matrix adjust.
 Identities = 42/139 (30%), Positives = 71/139 (51%), Gaps = 3/139 (2%)

Query  108  LNLNITGR-LPVKLDPDFVRVDENSNPPLVGDYTLYTVQRPVTITLLGAVSGAGQLPWQA  166
            LN N+  R L   LD DFVR++  +NP L G Y L     P  + +LGAV+   ++ WQ 
Sbjct  134  LNQNVVHRRLWQNLDYDFVRLNIANNPQLQGSYQLVLPTTPNKVLVLGAVATPTEVMWQP  193

Query  167  GRSVTDYLQDHPRLAGADKNNVMVITPEGETVVAPVALWNKRHVEPPPGSQLWLGFSAHV  226
              S  + L     +   +++ + VI P+G+     +A WN+ H +  PG+ L++ +    
Sbjct  194  RISAAELLTQVNVIDEHNRSQISVIQPDGQIETHSIAYWNQNHKDIAPGATLYVHYEQAF  253

Query  227  LPEKYADLNDQIVSVLTQR  245
              + + DLN  ++ +L  R
Sbjct  254  --DLHRDLNQFVIQLLQNR  270


>ref|ZP_05118573.1| conserved hypothetical protein [Vibrio parahaemolyticus 16]
 gb|EED27588.1| conserved hypothetical protein [Vibrio parahaemolyticus 16]
Length=242

 Score = 66.6 bits (161),  Expect = 4e-09, Method: Compositional matrix adjust.
 Identities = 45/152 (29%), Positives = 74/152 (48%), Gaps = 5/152 (3%)

Query  94   DDVAATIKSVRQQLLNLNITGRLPVKLDPDFVRVDENSNPPLVGDYTLYTVQRPVTITLL  153
            ++VA   K   QQ+ +  +  R  + LD D +R D   NP L G+Y L T +R  T++  
Sbjct  92   EEVAGVFK---QQIQSWTVAYREKIDLDFDQIRTDAADNPMLQGNYELITPKRTRTLSFE  148

Query  154  GAVSGAGQLPWQAGRSVTDYLQDHPRLAGADKNNVMVITPEGETVVAPVALWNKRHVEPP  213
            GA+     + +     ++ YL     L  A  +   VI P+G  V    A WN ++    
Sbjct  149  GALYTPQDVEFDESFPLSGYLSRLNLLKSAHPSYAWVIYPDGNVVRRGYAYWNSQNTSLT  208

Query  214  PGSQLWLGFSAHVLPEKYADLNDQIVSVLTQR  245
            PGS +++GF++    ++   L  QIV ++T R
Sbjct  209  PGSVIFIGFNSS--NKEVQQLEQQIVQLITMR  238


>ref|ZP_05888474.1| hypothetical protein VIC_004993 [Vibrio coralliilyticus ATCC 
BAA-450]
 gb|EEX30697.1| hypothetical protein VIC_004993 [Vibrio coralliilyticus ATCC 
BAA-450]
Length=246

 Score = 66.6 bits (161),  Expect = 4e-09, Method: Compositional matrix adjust.
 Identities = 47/144 (32%), Positives = 67/144 (46%), Gaps = 4/144 (2%)

Query  103  VRQQLLNLNITGRLPVKLDPDFVRVDENSNPPLVGDYTLYTVQRPVTITLLGAVSGAGQL  162
            +R+Q+   ++  R  V LD DFVR+  ++NP L G Y      RP  I L G        
Sbjct  103  LREQVKTWSVGYRENVSLDLDFVRLTPSANPMLSGHYQFEYPDRPTNIHLEGLFFSTTMP  162

Query  163  PWQAGRSVTDYLQDHPRLAGADKNNVMVITPEGETVVAPVALWNKRHVEPPPGSQLWLGF  222
              +   +V DYL     L+ A  +   VI P+G       A WN  +   PPGS ++LGF
Sbjct  163  EARPDWTVKDYLSTSRVLSSASNSYAWVIYPDGNYKQVGFAYWNDENTPLPPGSSIFLGF  222

Query  223  SAHVLPEK-YADLNDQIVSVLTQR  245
            +    P K  + L   IVS++  R
Sbjct  223  NN---PSKELSQLEQDIVSLIAWR  243


>ref|ZP_00989941.1| hypothetical protein V12B01_23145 [Vibrio splendidus 12B01]
 gb|EAP95066.1| hypothetical protein V12B01_23145 [Vibrio splendidus 12B01]
Length=326

 Score = 65.5 bits (158),  Expect = 8e-09, Method: Compositional matrix adjust.
 Identities = 49/168 (29%), Positives = 78/168 (46%), Gaps = 26/168 (15%)

Query  103  VRQQLLNLNITGRLPVKLDPDFVRVDENSNPPLVGD---------------------YTL  141
            VRQQL +     R+ ++LD + V    + NP LV                       ++L
Sbjct  159  VRQQLASFQYLNRVFIELDRNAVISQSDKNPLLVSSSHTNKPSSTAIKRASTSQTQAFSL  218

Query  142  YTVQRPVTITLLGAVSGAGQLPWQAGRSVTDYLQDHPR-LAG--ADKNNVMVITPEGETV  198
            Y  QRP +I L+GA+  +  +      ++ DYL   P    G  ADK+   V+ P+G   
Sbjct  219  YLPQRPTSIQLMGAMKESVTMNLIEHGTLNDYLDALPNGFIGESADKSVAYVVQPDGLVQ  278

Query  199  VAPVALWNKRHVEPPPGSQLWLGFSAHVLPEKYADLNDQIVSVLTQRV  246
                A WN++ V   PG+ +++ F  + LP +Y+ LN  IV +L  +V
Sbjct  279  TIQYAYWNEQPVYLAPGAIVFMAF--YSLPSEYSTLNQDIVDLLRHKV  324


>ref|ZP_04961240.1| Periplasmic protein involved in polysaccharide export [Vibrio 
cholerae AM-19226]
 gb|EDN15638.1| Periplasmic protein involved in polysaccharide export [Vibrio 
cholerae AM-19226]
Length=256

 Score = 65.1 bits (157),  Expect = 1e-08, Method: Compositional matrix adjust.
 Identities = 48/173 (27%), Positives = 78/173 (45%), Gaps = 5/173 (2%)

Query  79   QHVMAQLASWEAEADDDVAATIKSVRQQLLNLNITGRLPVKLDPDFVRVDENSNPPLVGD  138
            QHV+  LA  ++ A     A  +++R QL       R+   +DPD+ R+    NP L G 
Sbjct  84   QHVLLVLAHQQSAAPMAQKAIWETLRDQLRLSAFAKRIFTPIDPDWTRLAAQDNPRLNGQ  143

Query  139  YTLYTVQRPVTITLLGAVSGAGQLPWQAGRSVTDYLQDHPRLAGADKNNVMVITPEGETV  198
            + L    R   +++ GAV   G + W+  +S  DY Q    L     + ++VI P+G   
Sbjct  144  WLLTLNPRVAEVSVYGAVKKPGDVTWRGRQSAKDYAQA-AGLVDEQISEIVVIQPDGVVQ  202

Query  199  VAPVALWNKRHVEPPPGSQLWLGFSAHVLPEKY----ADLNDQIVSVLTQRVP  247
               VA WN    E  PG+ +++    H     +     D N  ++ +L  R+P
Sbjct  203  KHSVAYWNMTFAEVAPGAIVYVPMPLHDSSFYHDTPSTDANQLVIELLRNRLP  255


>ref|ZP_04919294.1| WbfC protein [Vibrio cholerae V51]
 gb|EAZ50134.1| WbfC protein [Vibrio cholerae V51]
Length=288

 Score = 64.7 bits (156),  Expect = 1e-08, Method: Compositional matrix adjust.
 Identities = 56/197 (28%), Positives = 87/197 (44%), Gaps = 18/197 (9%)

Query  60   WPGALLTDSAAKAKALKDYQHVMAQLASWEAEADDDVAATIKSVRQQLLNLNITGRLPVK  119
            WP A L D    + A    + V+ +LA  ++ A     A   S+  QL       RL + 
Sbjct  100  WPSAGLFD---LSHAFLFKRDVLLKLADQQSSAPPTQQALWASLIAQLRQAEFAKRLFIS  156

Query  120  LDPDFVRVDENSNPPLVGDYTLYTVQRPVTITLLGAVSGAGQLPWQAGRSVTDYLQDHPR  179
            +DPD+ R+    NP L G + L    +   +++ GAV+  G + W    S  DY Q    
Sbjct  157  VDPDWTRIAPQHNPRLNGSWLLTLNSKSTQVSVYGAVNQPGDVIWHNRLSAKDYAQ-AAG  215

Query  180  LAGADKNNVMVITPEGETVVAPVALWNKRHVEPPPGSQLWLGFSAHVLPEKY--------  231
            L     + ++VI P+G      VA WN+   E  PG+ +++      LP K         
Sbjct  216  LIDEQISEIVVIQPDGIAQKHAVAYWNQDFNEVAPGAIVYVP-----LPLKRAFFDPTVT  270

Query  232  -ADLNDQIVSVLTQRVP  247
             ADLN  ++ +L  R+P
Sbjct  271  DADLNQLVIELLRNRLP  287


>dbj|BAA33618.1| unknown [Vibrio cholerae]
Length=288

 Score = 64.7 bits (156),  Expect = 1e-08, Method: Compositional matrix adjust.
 Identities = 56/197 (28%), Positives = 87/197 (44%), Gaps = 18/197 (9%)

Query  60   WPGALLTDSAAKAKALKDYQHVMAQLASWEAEADDDVAATIKSVRQQLLNLNITGRLPVK  119
            WP A L D    + A    + V+ +LA  ++ A     A   S+  QL       RL + 
Sbjct  100  WPSAGLFD---LSHAFLFKRDVLLKLADQQSSAPPTQQALWASLIAQLRQAEFAKRLFIS  156

Query  120  LDPDFVRVDENSNPPLVGDYTLYTVQRPVTITLLGAVSGAGQLPWQAGRSVTDYLQDHPR  179
            +DPD+ R+    NP L G + L    +   +++ GAV+  G + W    S  DY Q    
Sbjct  157  VDPDWTRIAPQHNPRLNGSWLLTLNSKSTQVSVYGAVNQPGDVIWHNRLSAKDYAQ-AAG  215

Query  180  LAGADKNNVMVITPEGETVVAPVALWNKRHVEPPPGSQLWLGFSAHVLPEKY--------  231
            L     + ++VI P+G      VA WN+   E  PG+ +++      LP K         
Sbjct  216  LIDEQISEIVVIQPDGIAQKHAVAYWNQDFNEVAPGAIVYVP-----LPLKRAFFDPTVT  270

Query  232  -ADLNDQIVSVLTQRVP  247
             ADLN  ++ +L  R+P
Sbjct  271  DADLNQLVIELLRNRLP  287


>ref|NP_933122.1| hypothetical protein VV0329 [Vibrio vulnificus YJ016]
 dbj|BAC93093.1| putative exported protein [Vibrio vulnificus YJ016]
Length=265

 Score = 64.3 bits (155),  Expect = 2e-08, Method: Compositional matrix adjust.
 Identities = 47/166 (28%), Positives = 73/166 (43%), Gaps = 3/166 (1%)

Query  59   WWPGALLTDSAAKAKALKDYQHVMAQLASWEAEADDDVAATIKSVRQQLLNLNITGRLPV  118
            +  G  L +   +A+A    QHV  QL   E   D+D     + +   L       R  +
Sbjct  82   FHEGFQLFNLDKQAEADAQLQHVRQQLI--ELAKDEDYRQASQLLLTLLEKHQYGYRENI  139

Query  119  KLDPDFVRVDENSNPPLVGDYTLYTVQRPVTITLLGAVSGAGQLPWQAGRSVTDYLQDHP  178
             LD D VR+  + NP L G Y L    R   + LLG +     +P+ A   V DY+    
Sbjct  140  NLDIDAVRLKADLNPALPGHYALKQASRENKVLLLGLID-QKTVPFSADLDVADYIATST  198

Query  179  RLAGADKNNVMVITPEGETVVAPVALWNKRHVEPPPGSQLWLGFSA  224
                 +K+   VI P G +     + WN +H+   PGS +++GF++
Sbjct  199  LNNNGNKSEAWVIAPNGNSSKVGYSYWNNQHMSVLPGSTIFIGFNS  244


>gb|ABI85351.1| hypothetical protein [Vibrio cholerae]
Length=256

 Score = 64.3 bits (155),  Expect = 2e-08, Method: Compositional matrix adjust.
 Identities = 48/173 (27%), Positives = 78/173 (45%), Gaps = 5/173 (2%)

Query  79   QHVMAQLASWEAEADDDVAATIKSVRQQLLNLNITGRLPVKLDPDFVRVDENSNPPLVGD  138
            QHV+  LA  ++ A     A  +++R QL       R+   +DPD+ R+    NP L G 
Sbjct  84   QHVLLVLAHQQSAAPMAQKALWETLRDQLRLSAFAKRIFTPVDPDWTRLAAQDNPRLNGQ  143

Query  139  YTLYTVQRPVTITLLGAVSGAGQLPWQAGRSVTDYLQDHPRLAGADKNNVMVITPEGETV  198
            + L    R   +++ GAV   G + W+  +S  DY Q    L     + ++VI P+G   
Sbjct  144  WLLTLNPRVAEVSVYGAVKKPGDVTWRGRQSAKDYAQA-AGLVDEQISEIVVIQPDGVVQ  202

Query  199  VAPVALWNKRHVEPPPGSQLWLGFSAHVLPEKY----ADLNDQIVSVLTQRVP  247
               VA WN    E  PG+ +++    H     +     D N  ++ +L  R+P
Sbjct  203  KHSVAYWNMTFAEVAPGAIVYVPMPLHDSSFYHDTPSTDANQLVIELLRNRLP  255


>ref|YP_004190015.1| YjbG polysaccharide synthesis-related protein [Vibrio vulnificus 
MO6-24/O]
 gb|ADV87812.1| YjbG polysaccharide synthesis-related protein [Vibrio vulnificus 
MO6-24/O]
Length=265

 Score = 64.3 bits (155),  Expect = 2e-08, Method: Compositional matrix adjust.
 Identities = 47/166 (28%), Positives = 73/166 (43%), Gaps = 3/166 (1%)

Query  59   WWPGALLTDSAAKAKALKDYQHVMAQLASWEAEADDDVAATIKSVRQQLLNLNITGRLPV  118
            +  G  L +   +A+A    QHV  QL   E   D+D     + +   L       R  +
Sbjct  82   FHEGFQLFNLDKQAEADAQLQHVRQQLI--ELAKDEDYRQASQLLLTLLEKHQYGYRENI  139

Query  119  KLDPDFVRVDENSNPPLVGDYTLYTVQRPVTITLLGAVSGAGQLPWQAGRSVTDYLQDHP  178
             LD D VR+  + NP L G Y L    R   + LLG +     +P+ A   V DY+    
Sbjct  140  NLDIDAVRLKADLNPALPGHYALKQASRENKVLLLGLID-QKTVPFSADLDVADYIATST  198

Query  179  RLAGADKNNVMVITPEGETVVAPVALWNKRHVEPPPGSQLWLGFSA  224
                 +K+   VI P G +     + WN +H+   PGS +++GF++
Sbjct  199  LNNNGNKSEAWVIAPNGNSSKVGYSYWNNQHMSVLPGSTIFIGFNS  244


>ref|ZP_06943634.1| periplasmic protein [Vibrio cholerae RC385]
 gb|EFH72958.1| periplasmic protein [Vibrio cholerae RC385]
Length=256

 Score = 64.3 bits (155),  Expect = 2e-08, Method: Compositional matrix adjust.
 Identities = 48/173 (27%), Positives = 78/173 (45%), Gaps = 5/173 (2%)

Query  79   QHVMAQLASWEAEADDDVAATIKSVRQQLLNLNITGRLPVKLDPDFVRVDENSNPPLVGD  138
            QHV+  LA  ++ A        +++R QL       R+   +DPD+ R+    NP L G 
Sbjct  84   QHVLLVLAHQQSAAPMTQKVLWETLRDQLRLSAFAKRIFTPIDPDWTRLAAQDNPRLNGQ  143

Query  139  YTLYTVQRPVTITLLGAVSGAGQLPWQAGRSVTDYLQDHPRLAGADKNNVMVITPEGETV  198
            + L    R   +++ GAV   G + W+  +S  DY Q    L     + ++VI P+G   
Sbjct  144  WLLTLNPRVAEVSVYGAVKKPGDVTWRGRQSAKDYAQA-AGLVDEQISEIVVIQPDGVVQ  202

Query  199  VAPVALWNKRHVEPPPGSQLWLGFSAH---VLPEK-YADLNDQIVSVLTQRVP  247
               VA WN    E  PG+ +++    H     P+    D N  ++ +L  R+P
Sbjct  203  KHSVAYWNMTFAEVAPGAIVYVPMPLHDSSFYPDTPSTDANQLVIELLRNRLP  255


>ref|ZP_01978991.1| conserved hypothetical protein [Vibrio cholerae MZO-2]
 gb|EDM54080.1| conserved hypothetical protein [Vibrio cholerae MZO-2]
Length=256

 Score = 63.5 bits (153),  Expect = 3e-08, Method: Compositional matrix adjust.
 Identities = 48/173 (27%), Positives = 78/173 (45%), Gaps = 5/173 (2%)

Query  79   QHVMAQLASWEAEADDDVAATIKSVRQQLLNLNITGRLPVKLDPDFVRVDENSNPPLVGD  138
            QHV+  LA  ++ A        +++R QL       R+   +DPD+ R+    NP L G 
Sbjct  84   QHVLLVLAHQQSAAPMAQKVLWETLRDQLRLSAFAKRIFTPVDPDWTRLAAQDNPRLNGQ  143

Query  139  YTLYTVQRPVTITLLGAVSGAGQLPWQAGRSVTDYLQDHPRLAGADKNNVMVITPEGETV  198
            + L    R   +++ GAV   G + W+  +S  DY Q    L     + ++VI P+G   
Sbjct  144  WLLTLNPRVAEVSVYGAVKKPGDVTWRGRQSAKDYAQA-AGLVDEQISEIVVIQPDGVVQ  202

Query  199  VAPVALWNKRHVEPPPGSQLWLGFSAH---VLPEK-YADLNDQIVSVLTQRVP  247
               VA WN    E  PG+ +++    H     P+    D N  ++ +L  R+P
Sbjct  203  KHSVAYWNMTFAEVAPGAIVYVPMPLHDSSFYPDTPSTDANQLVIELLRNRLP  255


>ref|ZP_05715654.1| hypothetical protein VMD_07000 [Vibrio mimicus VM573]
 gb|EEW11844.1| hypothetical protein VMD_07000 [Vibrio mimicus VM573]
Length=256

 Score = 63.5 bits (153),  Expect = 3e-08, Method: Compositional matrix adjust.
 Identities = 48/173 (27%), Positives = 78/173 (45%), Gaps = 5/173 (2%)

Query  79   QHVMAQLASWEAEADDDVAATIKSVRQQLLNLNITGRLPVKLDPDFVRVDENSNPPLVGD  138
            QHV+  +A  ++ A     A  + +R QL       R+   +DPD+ R+    NP L G 
Sbjct  84   QHVLLVMAHQQSSAPIQQKALWEKMRSQLRLSAFAKRVFTPIDPDWTRIAPPDNPRLNGQ  143

Query  139  YTLYTVQRPVTITLLGAVSGAGQLPWQAGRSVTDYLQDHPRLAGADKNNVMVITPEGETV  198
            + L    R   +++ GAV   G + W++ ++  DY Q    L       ++VI P+G   
Sbjct  144  WLLTLNPRVGEVSVYGAVHKPGDVTWRSRQTAKDYAQA-AGLIDEKIAEIVVIQPDGVVQ  202

Query  199  VAPVALWNKRHVEPPPGSQLWLGFSAH---VLPEK-YADLNDQIVSVLTQRVP  247
            V  VA WN    E  PG+ +++    H     P     D N  ++ +L  R+P
Sbjct  203  VHSVAYWNMHFAEVAPGAIVYVPLPLHDSSFYPNTPNTDANQLVIELLRNRLP  255


>ref|NP_759771.1| hypothetical protein VV1_0794 [Vibrio vulnificus CMCP6]
 gb|AAO09298.1| hypothetical protein VV1_0794 [Vibrio vulnificus CMCP6]
Length=265

 Score = 63.2 bits (152),  Expect = 3e-08, Method: Compositional matrix adjust.
 Identities = 49/178 (27%), Positives = 78/178 (43%), Gaps = 25/178 (14%)

Query  67   DSAAKAKALKDYQHVMAQLASWEAEADDDVAATIKSVRQQLLNL----------------  110
            + ++ A+AL    H   QL +   +A+ D  A ++ VRQQL+ L                
Sbjct  72   NGSSNAQALSF--HEGFQLFNLNKQAETD--ALLQHVRQQLIELAKDEDYRQASQLLLTL  127

Query  111  ----NITGRLPVKLDPDFVRVDENSNPPLVGDYTLYTVQRPVTITLLGAVSGAGQLPWQA  166
                    R  + LD D VR+  + NP L G Y L    R   + LLG +     +P+ A
Sbjct  128  LEKHQYGYRENINLDIDAVRLKADLNPALPGHYALKQASRENKVLLLGLID-QKTVPFSA  186

Query  167  GRSVTDYLQDHPRLAGADKNNVMVITPEGETVVAPVALWNKRHVEPPPGSQLWLGFSA  224
               V DY+         +K+   VI P G +     + WN +H+   PGS +++GF++
Sbjct  187  DLDVADYIATSTLNNNGNKSEAWVIAPNGNSSKVGYSYWNNQHMSVLPGSTIFIGFNS  244


>ref|ZP_08103789.1| putative periplasmic protein [Vibrio sinaloensis DSM 21326]
 gb|EGA69190.1| putative periplasmic protein [Vibrio sinaloensis DSM 21326]
Length=257

 Score = 63.2 bits (152),  Expect = 4e-08, Method: Compositional matrix adjust.
 Identities = 36/143 (25%), Positives = 77/143 (53%), Gaps = 2/143 (1%)

Query  103  VRQQLLNLNITGRLPVKLDPDFVRVDENSNPPLVGDYTLYTVQRPVTITLLGAVSGAGQL  162
            V +Q+ N ++  R  ++LD D +R   ++NP L G+    + +R   ++  G +    ++
Sbjct  113  VIEQVENWDVVYRELIELDFDTIRTQPSANPMLQGNLEFISPKRSQELSFEGLLFPPQKV  172

Query  163  PWQAGRSVTDYLQDHPRLAGADKNNVMVITPEGETVVAPVALWNKRHVEPPPGSQLWLGF  222
            P+ A + +++Y +    L+ A  +   +I P G  V A  A WN++  +  PGS +++GF
Sbjct  173  PFDASQPLSEYFRKLNLLSNAHPSYAWIIYPNGHFVRAGYAYWNEQKTQLTPGSAVFIGF  232

Query  223  SAHVLPEKYADLNDQIVSVLTQR  245
            ++  L  +   + ++IV +++ R
Sbjct  233  NSEDL--EIQKIEERIVQLISMR  253


>ref|ZP_05240723.1| conserved hypothetical protein [Vibrio cholerae MO10]
 emb|CAA62137.1| WbfC protein [Vibrio cholerae]
 dbj|BAA33588.1| unknown [Vibrio cholerae]
 gb|EET25492.1| conserved hypothetical protein [Vibrio cholerae MO10]
Length=288

 Score = 63.2 bits (152),  Expect = 4e-08, Method: Compositional matrix adjust.
 Identities = 55/197 (27%), Positives = 86/197 (43%), Gaps = 18/197 (9%)

Query  60   WPGALLTDSAAKAKALKDYQHVMAQLASWEAEADDDVAATIKSVRQQLLNLNITGRLPVK  119
            WP A L D    + A    + V+ +LA  ++ A     A   S+  QL       RL + 
Sbjct  100  WPSAGLFD---LSHAFLFKRDVLLKLADQQSSAPPTQQALWASLIAQLRQAEFAKRLFIS  156

Query  120  LDPDFVRVDENSNPPLVGDYTLYTVQRPVTITLLGAVSGAGQLPWQAGRSVTDYLQDHPR  179
            +DPD+ R+    NP L G + L    +   +++ GAV+  G + W    S  DY      
Sbjct  157  VDPDWTRIAPQHNPRLNGSWLLTLNSKSTQVSVYGAVNQPGDVIWHNRLSAKDYAHA-AG  215

Query  180  LAGADKNNVMVITPEGETVVAPVALWNKRHVEPPPGSQLWLGFSAHVLPEKY--------  231
            L     + ++VI P+G      VA WN+   E  PG+ +++      LP K         
Sbjct  216  LIDEQISEIVVIQPDGIAQKHAVAYWNQDFNEVAPGAIVYVP-----LPLKRAFFDPTVT  270

Query  232  -ADLNDQIVSVLTQRVP  247
             ADLN  ++ +L  R+P
Sbjct  271  DADLNQLVIELLRNRLP  287


>ref|ZP_04402220.1| polysaccharide synthesis-related protein [Vibrio cholerae TMA 
21]
 gb|EEO15214.1| polysaccharide synthesis-related protein [Vibrio cholerae TMA 
21]
Length=256

 Score = 62.8 bits (151),  Expect = 4e-08, Method: Compositional matrix adjust.
 Identities = 48/173 (27%), Positives = 78/173 (45%), Gaps = 5/173 (2%)

Query  79   QHVMAQLASWEAEADDDVAATIKSVRQQLLNLNITGRLPVKLDPDFVRVDENSNPPLVGD  138
            QHV+  LA  ++ A        +++R QL       R+   +DPD+ R+    NP L G 
Sbjct  84   QHVLLVLAHQQSAAPMVQKVLWETLRDQLRLSAFAKRIFTPVDPDWTRLAAQDNPRLNGQ  143

Query  139  YTLYTVQRPVTITLLGAVSGAGQLPWQAGRSVTDYLQDHPRLAGADKNNVMVITPEGETV  198
            + L    R   +++ GAV   G + W+  +S  DY Q    L     + ++VI P+G   
Sbjct  144  WLLTLNPRVAEVSVYGAVKKPGDVTWRGRQSAKDYAQ-AAGLVDEQISEIVVIQPDGVVQ  202

Query  199  VAPVALWNKRHVEPPPGSQLWLGFSAH---VLPEK-YADLNDQIVSVLTQRVP  247
               VA WN    E  PG+ +++    H     P+    D N  ++ +L  R+P
Sbjct  203  KHSVAYWNMTFAEVAPGAIVYVPMPLHDSSFYPDTPSTDANQLVIELLRNRLP  255


>ref|ZP_04717494.1| hypothetical protein AmacA2_21203 [Alteromonas macleodii ATCC 
27126]
Length=249

 Score = 62.4 bits (150),  Expect = 6e-08, Method: Compositional matrix adjust.
 Identities = 46/191 (24%), Positives = 90/191 (47%), Gaps = 4/191 (2%)

Query  59   WWPGALLTDSAAKAKALKDYQHVMAQLASW--EAEADDDVAATIKSVRQQLLNLNITGRL  116
            +WP A   D  A  KA ++ +  ++Q++    + +AD +    ++++  Q+ +  ++ R+
Sbjct  57   YWPSASAFD-LANPKAEEEKEIALSQISGLLNQFDADSETHKALQNLYDQVSSWTVSTRI  115

Query  117  PVKLDPDFVRVDENSNPPLV-GDYTLYTVQRPVTITLLGAVSGAGQLPWQAGRSVTDYLQ  175
             + +  +  R+    NP    G Y +    RP  +   GAV   G    Q+  SV   + 
Sbjct  116  DMPISYNRARLFFEDNPMFQPGKYWIRLNGRPDVVHFSGAVVKPGAYKHQSDTSVYTAVH  175

Query  176  DHPRLAGADKNNVMVITPEGETVVAPVALWNKRHVEPPPGSQLWLGFSAHVLPEKYADLN  235
               +   AD+++V VI P G      +A WN    +  PGSQ+++  S+ +   K   LN
Sbjct  176  TVKKAVDADRSHVYVIDPMGNIEEKGIAYWNLDFGQLMPGSQVYVPISSELFSNKLKQLN  235

Query  236  DQIVSVLTQRV  246
            +++ ++   RV
Sbjct  236  ERVAALAVHRV  246


>gb|ECV65415.1| hypothetical protein GOS_2858926 [marine metagenome]
Length=237

 Score = 62.0 bits (149),  Expect = 7e-08, Method: Compositional matrix adjust.
 Identities = 46/191 (24%), Positives = 90/191 (47%), Gaps = 4/191 (2%)

Query  59   WWPGALLTDSAAKAKALKDYQHVMAQLASW--EAEADDDVAATIKSVRQQLLNLNITGRL  116
            +WP A   D  A  KA ++ +  ++Q++    + +AD +    ++++  Q+ +  ++ R+
Sbjct  45   YWPSASAFD-LANPKAEEEKKIALSQISGLLNQFDADSETHKALQNLYDQVSSWTVSTRI  103

Query  117  PVKLDPDFVRVDENSNPPLV-GDYTLYTVQRPVTITLLGAVSGAGQLPWQAGRSVTDYLQ  175
             + +  +  R+    NP    G Y +    RP  +   GAV   G    Q+  SV   + 
Sbjct  104  DMPISYNRARLFFEDNPMFQPGKYWIRLNGRPDVVHFSGAVVKPGAYKHQSDTSVYTAVH  163

Query  176  DHPRLAGADKNNVMVITPEGETVVAPVALWNKRHVEPPPGSQLWLGFSAHVLPEKYADLN  235
               +   AD+++V VI P G      +A WN    +  PGSQ+++  S+ +   K   LN
Sbjct  164  TVKKAVDADRSHVYVIDPMGNIEEKGIAYWNLDFGQLMPGSQVYVPISSELFSNKLKQLN  223

Query  236  DQIVSVLTQRV  246
            +++ ++   RV
Sbjct  224  ERVAALAVHRV  234


>ref|YP_002873221.1| hypothetical protein PFLU3660 [Pseudomonas fluorescens SBW25]
 emb|CAY49907.1| putative membrane protein [Pseudomonas fluorescens SBW25]
Length=256

 Score = 60.8 bits (146),  Expect = 2e-07, Method: Compositional matrix adjust.
 Identities = 55/176 (31%), Positives = 77/176 (43%), Gaps = 8/176 (4%)

Query  71   KAKALKDYQ--HVMAQLASWEAEADDDVAATIKSVRQQLLNLNITGRLPVKLDPDFVRVD  128
            KA  L D Q  H  A +A  E     D A     + QQ+  L +TGR    LDP  V V 
Sbjct  77   KAGVLFDLQTLHQAALVAGRE-----DRARAAAQLYQQVQALPVTGRQAAVLDPVAVEVG  131

Query  129  ENSNPPLVGDYTLYTVQRPVTITLLGAVSGAGQLPWQAGRSVTDYLQDHPRLAGADKNNV  188
               N P+     L    RP ++ +LGAV+ A  L +   +  + YL   P    AD + +
Sbjct  132  FAPNLPVSSGDRLVYPPRPDSVRVLGAVARACTLAFAPLQQGSAYLDACPASKAADTDYL  191

Query  189  MVITPEGETVVAPVALWNKRHVEPP-PGSQLWLGFSAHVLPEKYADLNDQIVSVLT  243
             +I P+G      +A WN+    PP PGS L +   +  L     +LN Q+   L 
Sbjct  192  WLIQPDGHVTRLGIAPWNREEGAPPAPGSTLLVPIRSDDLDPPTPELNQQLAEFLA  247


>ref|ZP_06040321.1| polysaccharide synthesis-related protein [Vibrio mimicus MB-451]
 gb|EEY39705.1| polysaccharide synthesis-related protein [Vibrio mimicus MB-451]
Length=256

 Score = 60.5 bits (145),  Expect = 2e-07, Method: Compositional matrix adjust.
 Identities = 47/173 (27%), Positives = 77/173 (44%), Gaps = 5/173 (2%)

Query  79   QHVMAQLASWEAEADDDVAATIKSVRQQLLNLNITGRLPVKLDPDFVRVDENSNPPLVGD  138
            QHV+  +A  ++ A     A  + +R QL       R+   +DPD+ R+    N  L G 
Sbjct  84   QHVLLVMAHQQSSAPIQQKALWEKMRSQLRLSAFAKRVFTPIDPDWTRIAPQDNHRLNGQ  143

Query  139  YTLYTVQRPVTITLLGAVSGAGQLPWQAGRSVTDYLQDHPRLAGADKNNVMVITPEGETV  198
            + L    R   +++ GAV   G + W++ ++  DY Q    L       ++VI P+G   
Sbjct  144  WLLTLNPRVGEVSVYGAVHKPGDVTWRSRQTAKDYAQA-AGLIDEKIAEIVVIQPDGVVQ  202

Query  199  VAPVALWNKRHVEPPPGSQLWLGFSAH---VLPEK-YADLNDQIVSVLTQRVP  247
            V  VA WN    E  PG+ +++    H     P     D N  ++ +L  R+P
Sbjct  203  VHSVAYWNMNFAEVAPGAIVYVPIPLHDSSFYPNTPNTDANQLVIELLRNRLP  255


>ref|ZP_07774980.1| hypothetical protein PFWH6_2379 [Pseudomonas fluorescens WH6]
 gb|EFQ63928.1| hypothetical protein PFWH6_2379 [Pseudomonas fluorescens WH6]
Length=255

 Score = 58.5 bits (140),  Expect = 8e-07, Method: Compositional matrix adjust.
 Identities = 48/155 (30%), Positives = 72/155 (46%), Gaps = 3/155 (1%)

Query  91   EADDDVAATIKSVRQQLLNLNITGRLPVKLDPDFVRVDENSNPPL-VGDYTLYTVQRPVT  149
            E  D  AA    + +Q+  L +TGR    LDP  + V    + PL  GD  +Y   RP T
Sbjct  93   EGRDTRAALSARLYKQVERLPVTGRQVAVLDPIALEVGFALDSPLDEGDRLIYPA-RPST  151

Query  150  ITLLGAVSGAGQLPWQAGRSVTDYLQDHPRLAGADKNNVMVITPEGETVVAPVALWNKRH  209
            + + GAV    Q+P+ A +    Y +    L+ A  + V +I P+G      VA WN   
Sbjct  152  VEVWGAVEQTCQVPYAAAQEAWVYARRCAILSDAQSDYVWLIQPDGHVRRLGVAPWNHEE  211

Query  210  -VEPPPGSQLWLGFSAHVLPEKYADLNDQIVSVLT  243
             V P PGS++ +   +  L     +LN+Q+   L 
Sbjct  212  GVMPAPGSRILVPIRSDDLQSPTPELNEQLAEFLA  246


>ref|YP_004469043.1| hypothetical protein ambt_18740 [Alteromonas sp. SN2]
 gb|AEF05241.1| hypothetical protein ambt_18740 [Alteromonas sp. SN2]
Length=247

 Score = 58.2 bits (139),  Expect = 1e-06, Method: Compositional matrix adjust.
 Identities = 44/190 (23%), Positives = 85/190 (44%), Gaps = 2/190 (1%)

Query  59   WWPGALLTD-SAAKAKALKDYQHVMAQLASWEAEADDDVAATIKSVRQQLLNLNITGRLP  117
            +WP + + + + A A+  KD      ++   +  A+ D    ++++ QQ+ +  ++ R+ 
Sbjct  55   YWPTSGIYNLNDAYAEREKDAVLSEIRMVMKDYNANSDTYRALENLYQQVSSWTVSTRVI  114

Query  118  VKLDPDFVRVDENSNPPLV-GDYTLYTVQRPVTITLLGAVSGAGQLPWQAGRSVTDYLQD  176
              +  +  R+    NP    G Y L    RP  +   G V   G        S+    + 
Sbjct  115  TPISYNRARLIAEENPMFQPGRYLLRISPRPSVVHFSGLVIKPGAYRHGNDLSIFSTAKS  174

Query  177  HPRLAGADKNNVMVITPEGETVVAPVALWNKRHVEPPPGSQLWLGFSAHVLPEKYADLND  236
              + + ADK++V VITP GE     +A WN  + +  PGSQ+++  +  +       LN 
Sbjct  175  VTKASDADKSHVFVITPMGEIEKRGIAYWNIDYSQLMPGSQIYVPITGQIFSSTLDALNT  234

Query  237  QIVSVLTQRV  246
            +I ++   R+
Sbjct  235  RIANLAVHRI  244


>ref|YP_001339659.1| hypothetical protein Mmwyl1_0790 [Marinomonas sp. MWYL1]
 gb|ABR69724.1| conserved hypothetical protein [Marinomonas sp. MWYL1]
Length=263

 Score = 57.0 bits (136),  Expect = 3e-06, Method: Compositional matrix adjust.
 Identities = 56/225 (24%), Positives = 96/225 (42%), Gaps = 12/225 (5%)

Query  21   FAQGTVTIYLPGEQQTLSVGPVENVVQLVTQP--QLRDRLWWPGALLTDSAAKAKALKDY  78
             +Q  + +Y P +  TLS      + Q++T    QL  + +  G  L D + + +  +  
Sbjct  42   LSQEQIALYYPQQPVTLSYPQEVRLSQVLTDAYAQLNYQPYSLGTALIDLSKQQQIDEKK  101

Query  79   QHVMAQLASWEAEADDDVAATIKSVRQQLLNLNITGRLPVKLDPDFVRVDENSNPPLVGD  138
              ++ QL      A + +A  + S       L+   R  ++ DP  VRV+   +P L G 
Sbjct  102  HAILKQLQDINTPASNYIAKKLNS-------LDFVYRERIETDPSKVRVNPKYDPMLKGV  154

Query  139  YTLYTVQRPVTITLLGAVS-GAGQLPWQAGRSVTDYLQDHPRLAGADKNNVMVITPEGET  197
            Y L+  +RP  I L+ A       L   A  ++ DYL +         ++  +I    + 
Sbjct  155  YQLFLPKRPQHIYLINADDHNYLTLKLTANSNLKDYLAEQFEANRYTYDSAWIIQANQDV  214

Query  198  VVAPVALWNKRHVEPPPGSQLWLGFSAHVLPEKYADLNDQIVSVL  242
              A    W  +     PG+ +++G +   LPEKY DLN  I  +L
Sbjct  215  YRATDIQWKGKLYFLSPGAIVFIGLTD--LPEKYRDLNADIAHLL  257


>gb|EDJ38382.1| hypothetical protein GOS_1705098 [marine metagenome]
Length=838

 Score = 56.2 bits (134),  Expect = 4e-06, Method: Compositional matrix adjust.
 Identities = 42/173 (24%), Positives = 83/173 (47%), Gaps = 8/173 (4%)

Query  54   LRDRLWWPGALLTDSAAKAK-------ALKDYQHVMAQLASWEAEADDDVAATIKSVRQQ  106
            L ++ +  GA+ T ++ + +       A ++    MA++ S E + D ++    + +  +
Sbjct  627  LTEQAFADGAIFTRASERRQEANRNRQAAREVDQAMARVLSREDDVDSNLVVMSERLANE  686

Query  107  LLNLNITGRLPVKLDPDFVRVDENSNPPLVGDYTLYTVQRPVTITLLGAVSGAGQLPWQA  166
            L      GR+ V+ DP  +    + +  L G   +Y  ++ +T+ + G V     L +++
Sbjct  687  LREAETLGRITVEADPQILSQRPDLDVLLQGGDHIYYPEQTLTVRVSGEVQSPSALMFES  746

Query  167  GRSVTDYLQDHPRL-AGADKNNVMVITPEGETVVAPVALWNKRHVEPPPGSQL  218
            G++ +DYL+D     A ADK+   V+ P+G      V+ WN   V   PGS +
Sbjct  747  GKTASDYLRDAGGFTALADKSRSFVVHPDGSARPLRVSSWNYDPVTILPGSTI  799


>ref|YP_002890759.1| hypothetical protein Tmz1t_3793 [Thauera sp. MZ1T]
 gb|ACR02382.1| protein of unknown function DUF940 membrane lipoprotein putative 
[Thauera sp. MZ1T]
Length=1261

 Score = 55.5 bits (132),  Expect = 9e-06, Method: Compositional matrix adjust.
 Identities = 50/183 (27%), Positives = 87/183 (47%), Gaps = 10/183 (5%)

Query  68   SAAKAKA-LKDYQHVMAQLASWEAEADDDVAATIKSVRQQLLNLNITGRLPVKL-DPDFV  125
            SA +A+A LK  Q ++AQL  W A        T + +   L +L +TGR+P+ + D  ++
Sbjct  87   SARQAQAELK--QALLAQL--WSARDLQADETTRQRLADWLRSLPVTGRVPLAIVDARWL  142

Query  126  RVDENSNPPLVGDYTLYTVQRPVTITLLGAVSGAGQLPWQAGRSVTDYLQDHPRLAGADK  185
            + + + +P L   + L   QRP T+T+L        +  + G     YLQ     A ++ 
Sbjct  143  QANPDQDPILAPGHALVLPQRPGTVTVLADDGRPCAVVHRPGTQARGYLQACIGGAASEA  202

Query  186  NNVMVITPEGETVVAPVALWNKR-HVEPPPGSQLWLGFSAHVLPEKYADLNDQIVSVLTQ  244
            +   ++ P+G      +A WN     EP PG+ +W    +   PE   D +D++ + L  
Sbjct  203  DMAWIVQPDGRVQRFGIATWNAEPQSEPAPGAWIWAPRRSAAWPE---DFSDRLATFLAT  259

Query  245  RVP  247
            + P
Sbjct  260  QSP  262


>gb|EBF39314.1| hypothetical protein GOS_9605035 [marine metagenome]
Length=354

 Score = 55.1 bits (131),  Expect = 9e-06, Method: Compositional matrix adjust.
 Identities = 38/147 (25%), Positives = 72/147 (48%), Gaps = 1/147 (0%)

Query  73   KALKDYQHVMAQLASWEAEADDDVAATIKSVRQQLLNLNITGRLPVKLDPDFVRVDENSN  132
            +A ++    MA++ S E + D ++    + +  +L      GR+ V+ DP  +    + +
Sbjct  169  QAAREVDQAMARVLSREDDVDSNLVVMSERLANELREAETLGRITVEADPQILSQRPDLD  228

Query  133  PPLVGDYTLYTVQRPVTITLLGAVSGAGQLPWQAGRSVTDYLQDHPRL-AGADKNNVMVI  191
              L G   +Y  ++ +T+ + G V     L +++G++ +DYL+D     A ADK+   V+
Sbjct  229  VLLQGGDHIYYPEQTLTVRVSGEVQSPSALMFESGKTASDYLRDAGGFTALADKSRSFVV  288

Query  192  TPEGETVVAPVALWNKRHVEPPPGSQL  218
             P+G      V+ WN   V   PGS +
Sbjct  289  HPDGSARPLRVSSWNYDPVTILPGSTI  315


>gb|ECO71570.1| hypothetical protein GOS_4328030 [marine metagenome]
Length=158

 Score = 54.7 bits (130),  Expect = 1e-05, Method: Compositional matrix adjust.
 Identities = 35/155 (22%), Positives = 73/155 (47%), Gaps = 1/155 (0%)

Query  93   DDDVAATIKSVRQQLLNLNITGRLPVKLDPDFVRVDENSNPPLV-GDYTLYTVQRPVTIT  151
            D +    + ++ QQ+ +  ++ R+ + +  +  R+    NP    G Y +    RP  + 
Sbjct  1    DSETHKALDNLFQQVASWTVSTRINMPISYNRARLLIEENPMFQPGKYWIRLNGRPNVVH  60

Query  152  LLGAVSGAGQLPWQAGRSVTDYLQDHPRLAGADKNNVMVITPEGETVVAPVALWNKRHVE  211
              GA+   G     +  S+   +    + + AD+++V +I+P G+     +A WN    +
Sbjct  61   FSGAILKPGAYQHLSDTSIYTAVNTVKKASDADRSHVYLISPTGQVEEKGIAYWNLDFSQ  120

Query  212  PPPGSQLWLGFSAHVLPEKYADLNDQIVSVLTQRV  246
              PGSQ+++  S  +   K   LN+++V++   RV
Sbjct  121  IMPGSQVYIPISNELFSNKLKRLNERVVALAVHRV  155


>ref|ZP_06173722.1| conserved hypothetical protein [Vibrio harveyi 1DA3]
 gb|EEZ89852.1| conserved hypothetical protein [Vibrio harveyi 1DA3]
Length=243

 Score = 52.8 bits (125),  Expect = 6e-05, Method: Compositional matrix adjust.
 Identities = 33/141 (23%), Positives = 64/141 (45%), Gaps = 4/141 (2%)

Query  86   ASWEAEADDDVAATIKSVRQQLLNL----NITGRLPVKLDPDFVRVDENSNPPLVGDYTL  141
            A    E   D+    K  + +L N       + R+   +D D VR+++ +NP L G Y L
Sbjct  80   ARLSKERVLDLMKEYKLNKTELFNFIQRSGTSKRVISNIDLDTVRLNKKNNPLLKGKYIL  139

Query  142  YTVQRPVTITLLGAVSGAGQLPWQAGRSVTDYLQDHPRLAGADKNNVMVITPEGETVVAP  201
               +R + + +LG       +P +   S++  L D   L    ++  ++I P+G+   + 
Sbjct  140  SVGEREIPLLVLGNTPKVATIPTEENMSLSRLLNDETSLFKNLEHAPVLIYPDGKLTQSH  199

Query  202  VALWNKRHVEPPPGSQLWLGF  222
            +  W  +    PPG+ +++ F
Sbjct  200  IGNWKTKSYSLPPGTIIYIPF  220


>gb|ECG95961.1| hypothetical protein GOS_3683526 [marine metagenome]
Length=231

 Score = 52.0 bits (123),  Expect = 1e-04, Method: Compositional matrix adjust.
 Identities = 42/165 (25%), Positives = 82/165 (49%), Gaps = 9/165 (5%)

Query  64   LLTDSAAKAKALKDYQHVMAQLASWEAEADD----DVAATIKSVRQQLLNLNITGRLPVK  119
            L T+   +  A + Y+  +  L++   +A      + AA+ + + ++L N  ++GR+  +
Sbjct  32   LNTEKINRMAADELYKSFLDNLSNLSGKATSGSALEGAASTRLIMEELKNSPVSGRVSAE  91

Query  120  LDPDFVRVDENSNPPLVGDYTLYTVQRPVT-ITLLGAVSGAGQLPWQAGRSVTDYLQDHP  178
             D + +  D  S   ++ D    T+   V  + + G VS  G + ++ G+ V+ Y++   
Sbjct  92   FDINVLEEDP-SKDVVLQDGDKITIPEFVNQVYIFGEVSSEGTVRFEKGQPVSFYIEKKG  150

Query  179  RLAG-ADKNNVMVITPEGET--VVAPVALWNKRHVEPPPGSQLWL  220
              +G AD+ NV V+ P GET  V   V +   R +E  PGS +++
Sbjct  151  GFSGFADERNVFVLNPNGETFKVSKNVFMRQGRDIEIYPGSVIFV  195


>ref|ZP_07742696.1| putative periplasmic protein [Vibrio caribbenthicus ATCC BAA-2122]
 gb|EFP97003.1| putative periplasmic protein [Vibrio caribbenthicus ATCC BAA-2122]
Length=243

 Score = 51.2 bits (121),  Expect = 2e-04, Method: Compositional matrix adjust.
 Identities = 43/181 (23%), Positives = 76/181 (41%), Gaps = 6/181 (3%)

Query  65   LTDSAAKAKALKDYQHVMAQLASWEAEADDDVAATIKSVRQQLLNLNITGRLPVKLDPDF  124
            L D+  + +A      V+A+L   E   D  V   I  ++Q     N   R+   LD D 
Sbjct  66   LFDNQKQKEAQMLQNSVLARLQQIETTTDYPVEQLIADIKQ----WNTGYRIKTSLDYDA  121

Query  125  VRVDENSNPPLVGDYTLYTVQRPVTITLLGAVSGAGQLPWQAGRSVTDYLQDHPRLAGAD  184
            +R++   +P L G +     QR   + L+G ++   Q+      S+   ++D   L  AD
Sbjct  122  IRINPELDPLLSGHFEFTFPQRDHKVELIGLITQPTQVSITDYSSIAALMRDIVPLPHAD  181

Query  185  KNNVMVITPEGETVVAPVALWNKRHVEPPPGSQLWLGFSAHVLPEKYADLNDQIVSVLTQ  244
             + V V+ P G       A WN         S +++GF +    ++   L   I+ +++ 
Sbjct  182  PSFVWVVHPNGYAERVGYAYWNNAATRLTSNSTIYVGFDSD--SDQLTSLEKDIIKLISM  239

Query  245  R  245
            R
Sbjct  240  R  240


>ref|YP_349560.1| hypothetical protein Pfl01_3831 [Pseudomonas fluorescens Pf0-1]
 gb|ABA75569.1| putative membrane protein [Pseudomonas fluorescens Pf0-1]
Length=243

 Score = 51.2 bits (121),  Expect = 2e-04, Method: Compositional matrix adjust.
 Identities = 45/150 (30%), Positives = 74/150 (49%), Gaps = 4/150 (2%)

Query  97   AATIKSVRQQLLNLNITGRLPVKLDPDFVRVDENSNPPL-VGDYTLYTVQRPVTITLLGA  155
            AA  + + +Q+  + +TGR    LDP  V V    N  L  GD  +Y  +R   + +LGA
Sbjct  87   AALAQRLAEQVRQMPVTGRQIADLDPVAVEVGFARNIRLDDGDQLIYP-KRVDEVEVLGA  145

Query  156  VSGAGQLPWQAGRSVTDYLQDHPRL-AGADKNNVMVITPEGETVVAPVALWNKRHVEPP-  213
            V+   +LP+Q  +   +YLQ    L A AD + + +I P GE+    +A WN+   + P 
Sbjct  146  VAEPCRLPYQPLQEAREYLQGCTLLEADADADYLWLIQPNGESRRVGIAHWNRESGQMPV  205

Query  214  PGSQLWLGFSAHVLPEKYADLNDQIVSVLT  243
             GS++ +      L     +LN Q+  ++ 
Sbjct  206  AGSKILVPVKNDDLDPPVPELNQQLAELIA  235


>ref|ZP_01074260.1| hypothetical protein MED121_15079 [Marinomonas sp. MED121]
 gb|EAQ67261.1| hypothetical protein MED121_15079 [Marinomonas sp. MED121]
Length=256

 Score = 50.8 bits (120),  Expect = 2e-04, Method: Compositional matrix adjust.
 Identities = 32/137 (23%), Positives = 66/137 (48%), Gaps = 6/137 (4%)

Query  94   DDVAATIKSVRQQLLNLNITGRLPVKLDPDFVRVDENSNPPLVGDYTLYTVQRPVTITLL  153
            +D   T K++   L   N   R  + LD + +++D+NSNP + G Y L   + P  I ++
Sbjct  102  NDKVETKKNLLSILEKHNFKKREEITLDLEKIQLDKNSNPIIQGKYELMLPRTPNYIIVI  161

Query  154  GAVSGAG--QLPWQAGRSVTDYL---QDHPRLAGADKN-NVMVITPEGETVVAPVALWNK  207
               +     +LP +      DY+   +D   +    K+  + ++  + ET++  V+ WN 
Sbjct  162  DPSNSKDLIKLPLKYNYDFEDYINFYKDGFNIKNKIKSEKITIVQADKETIIPKVSYWND  221

Query  208  RHVEPPPGSQLWLGFSA  224
            ++    PG+ +++G  +
Sbjct  222  KNYYLSPGAFIYIGIES  238


>gb|EBP46974.1| hypothetical protein GOS_7906243 [marine metagenome]
Length=231

 Score = 47.0 bits (110),  Expect = 0.003, Method: Compositional matrix adjust.
 Identities = 31/106 (29%), Positives = 52/106 (49%), Gaps = 2/106 (1%)

Query  114  GRLPVKLDPDFVRVDENSNPPLVGDYTLYTVQRPVTITLLGAVSGAGQLPWQAGRSVTDY  173
            GR  +  D   ++ D   N PL+G   LY  +RP +I ++G V  A  L +    ++ DY
Sbjct  90   GRQTISADILTLKTDPYKNIPLMGGDELYIPKRPNSINVVGEVLNATTLNFHPDYALEDY  149

Query  174  LQDHPRLAG-ADKNNVMVITPEGETVVAPVALWNKRHVEPPPGSQL  218
            ++    L   AD+NN+ ++ P+G        L+ K+     PGS +
Sbjct  150  IEMSGGLTNYADQNNIYIVKPDGSAYTHKKTLF-KKDRNLLPGSMI  194


>ref|ZP_06180024.1| hypothetical protein VMC_14540 [Vibrio alginolyticus 40B]
 gb|EEZ83725.1|