BLAST performed on February 4, 2012
BLASTP 2.2.24+
Reference:
Stephen F. Altschul, Thomas L. Madden, Alejandro A. Schäffer,
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997),
"Gapped BLAST and PSI-BLAST: a new generation of protein database
search programs", Nucleic Acids Res. 25:3389-3402.
Reference for composition-based statistics:
Alejandro A. Schäffer, L. Aravind, Thomas L. Madden, Sergei
Shavirin, John L. Spouge, Yuri I. Wolf, Eugene V. Koonin, and
Stephen F. Altschul (2001), "Improving the accuracy of PSI-BLAST
protein database searches with composition-based statistics and
other refinements", Nucleic Acids Res. 29:2994-3005.
Database: nr_env
20,512,688 sequences; 6,167,035,527 total letters
Query= EG13730 gfcC
Length=248
Sequences producing significant alignments: Score (Bits) E Value
ref|NP_415505.1| conserved protein [Escherichia coli str. K-12 s... 503 1e-140
ref|YP_309961.1| hypothetical protein SSON_0992 [Shigella sonnei... 501 3e-140
ref|ZP_03070105.1| group 4 capsule (G4C) polysaccharide, YmcB [E... 501 4e-140
gb|EFZ60522.1| hypothetical protein ECLT68_0462 [Escherichia col... 501 5e-140
ref|NP_706908.1| hypothetical protein SF0987 [Shigella flexneri ... 501 5e-140
ref|YP_002402185.1| hypothetical protein EC55989_1093 [Escherich... 500 6e-140
ref|YP_408638.1| hypothetical protein SBO_2245 [Shigella boydii ... 500 8e-140
ref|YP_001880817.1| group 4 capsule (G4C) polysaccharide, YmcB [... 500 8e-140
gb|EFZ70668.1| hypothetical protein ECOK1357_1256 [Escherichia c... 500 9e-140
ref|ZP_03064769.1| group 4 capsule (G4C) polysaccharide, YmcB [S... 500 1e-139
gb|EGB58446.1| SLBB-domain-containing protein [Escherichia coli ... 499 1e-139
ref|ZP_06656911.1| YmcB [Escherichia coli B185] >gb|EFF07293.1| ... 499 1e-139
ref|ZP_08353029.1| conserved hypothetical protein [Escherichia c... 499 1e-139
ref|ZP_02999848.1| group 4 capsule (G4C) polysaccharide, YmcB [E... 499 2e-139
ref|ZP_06652940.1| YmcB protein [Escherichia coli B354] >gb|EFF1... 499 2e-139
ref|ZP_03047374.1| group 4 capsule (G4C) polysaccharide, YmcB [E... 499 2e-139
ref|YP_003036816.1| hypothetical protein ECBD_2609 [Escherichia ... 498 3e-139
ref|ZP_08373309.1| conserved hypothetical protein [Escherichia c... 498 5e-139
ref|YP_402624.1| hypothetical protein SDY_0961 [Shigella dysente... 497 6e-139
ref|ZP_08383110.1| conserved hypothetical protein [Escherichia c... 497 7e-139
ref|NP_286922.1| hypothetical protein Z1402 [Escherichia coli O1... 496 1e-138
ref|YP_002328533.1| hypothetical protein E2348C_0970 [Escherichi... 495 3e-138
gb|AEE55762.1| conserved hypothetical protein [Escherichia coli ... 495 3e-138
ref|YP_003498802.1| Group 4 capsule (G4C) polysaccharide, YmcB [... 494 3e-138
gb|EGB62657.1| SLBB-domain-containing protein [Escherichia coli ... 494 6e-138
gb|EGB71723.1| SLBB-domain-containing protein [Escherichia coli ... 493 9e-138
ref|ZP_07137361.1| conserved hypothetical protein [Escherichia c... 493 1e-137
gb|EFZ75912.1| hypothetical protein ECRN5871_1089 [Escherichia c... 491 5e-137
gb|EGJ00865.1| hypothetical protein SD15574_1331 [Shigella dysen... 469 2e-130
ref|ZP_07783163.1| uncharacterized protein gfcC [Escherichia col... 466 2e-129
ref|ZP_07681343.1| conserved hypothetical protein [Shigella dyse... 464 7e-129
ref|ZP_02904453.1| group 4 capsule (G4C) polysaccharide, YmcB [E... 449 1e-124
gb|EGI97539.1| hypothetical protein SB521682_1222 [Shigella boyd... 448 4e-124
gb|EFW50291.1| YjbG polysaccharide synthesis-related protein [Sh... 447 5e-124
gb|EFW53537.1| YjbG polysaccharide synthesis-related protein [Sh... 447 5e-124
pdb|3P42|A Chain A, Structure Of Gfcc (Ymcb), Protein Encoded By... 444 5e-123
gb|EFW71149.1| YjbG polysaccharide synthesis-related protein [Es... 440 8e-122
gb|EGK28119.1| hypothetical protein SFK272_1492 [Shigella flexne... 319 2e-85
emb|CBX82236.1| Uncharacterized protein gfcC Group 4 capsule pro... 219 3e-55
ref|YP_001909022.1| Conserved hypothetical protein YmcB [Erwinia... 218 7e-55
ref|YP_002650310.1| capsular polysaccharide protein [Erwinia pyr... 216 4e-54
ref|YP_003537358.1| exported protein [Erwinia amylovora ATCC 499... 215 4e-54
gb|ADP10499.1| Putative capsular polysaccharide protein [Erwinia... 211 8e-53
ref|YP_003532687.1| hypothetical protein EAMY_3334 [Erwinia amyl... 209 4e-52
ref|YP_003739692.1| conserved uncharacterized protein YmcB [Erwi... 205 6e-51
ref|YP_001174974.1| hypothetical protein Ent638_0233 [Enterobact... 203 2e-50
ref|YP_003518538.1| YmcB [Pantoea ananatis LMG 20103] >gb|ADD754... 200 1e-49
dbj|BAK13483.1| hypothetical protein YmcB precursor YmcB [Pantoe... 200 2e-49
ref|YP_001436216.1| hypothetical protein ESA_00075 [Cronobacter ... 200 2e-49
gb|EGL72010.1| hypothetical protein CSE899_14477 [Cronobacter sa... 197 1e-48
ref|YP_003610796.1| hypothetical protein ECL_00280 [Enterobacter... 197 1e-48
ref|NP_709891.1| hypothetical protein SF4177 [Shigella flexneri ... 196 3e-48
ref|YP_312940.1| hypothetical protein SSON_4206 [Shigella sonnei... 196 4e-48
ref|YP_003232032.1| hypothetical protein ECO26_5143 [Escherichia... 195 4e-48
ref|YP_003212148.1| hypothetical protein CTU_37850 [Cronobacter ... 195 6e-48
gb|EGC97468.1| hypothetical protein ECD227_3706 [Escherichia fer... 195 6e-48
ref|ZP_05969412.1| conserved hypothetical protein [Enterobacter ... 192 3e-47
ref|YP_001455401.1| hypothetical protein CKO_03889 [Citrobacter ... 192 3e-47
ref|YP_003367162.1| hypothetical protein ROD_37221 [Citrobacter ... 192 3e-47
gb|EGB70445.1| SLBB-domain-containing protein [Escherichia coli ... 192 6e-47
ref|ZP_08500193.1| group 4 capsule (G4C) polysaccharide, YmcB [E... 191 1e-46
ref|YP_001465525.1| hypothetical protein EcE24377A_4576 [Escheri... 190 1e-46
ref|ZP_07140265.1| hypothetical protein HMPREF9548_02441 [Escher... 190 2e-46
emb|CBK84486.1| SLBB-domain like (DUF1017) [Enterobacter cloacae... 189 3e-46
ref|ZP_08366591.1| conserved hypothetical protein [Escherichia c... 188 6e-46
ref|ZP_08497921.1| hypothetical protein HMPREF9086_2183 [Enterob... 188 8e-46
ref|ZP_03048870.1| conserved hypothetical protein [Escherichia c... 187 1e-45
ref|YP_001726928.1| hypothetical protein EcolC_4002 [Escherichia... 187 1e-45
ref|NP_418452.1| conserved protein [Escherichia coli str. K-12 s... 187 1e-45
ref|YP_001882714.1| hypothetical protein SbBS512_E4533 [Shigella... 187 2e-45
ref|YP_002295592.1| hypothetical protein ECSE_4317 [Escherichia ... 187 2e-45
gb|EFW53980.1| YjbG polysaccharide synthesis-related protein [Sh... 186 2e-45
ref|YP_003933062.1| hypothetical protein Pvag_3494 [Pantoea vaga... 186 2e-45
emb|CBG37221.1| conserved hypothetical protein [Escherichia coli... 186 3e-45
emb|CAP78488.1| Uncharacterized protein yjbG [Escherichia coli L... 186 3e-45
ref|ZP_06939706.1| hypothetical protein EcolOP_27029 [Escherichi... 186 3e-45
ref|ZP_07152932.1| conserved hypothetical protein [Escherichia c... 186 3e-45
ref|YP_543535.1| hypothetical protein UTI89_C4596 [Escherichia c... 186 4e-45
ref|ZP_06660155.1| YjbG polysaccharide synthesis protein [Escher... 186 4e-45
ref|ZP_08350965.1| conserved hypothetical protein [Escherichia c... 186 4e-45
ref|YP_001746417.1| hypothetical protein EcSMS35_4489 [Escherich... 185 4e-45
ref|ZP_08356834.1| conserved hypothetical protein [Escherichia c... 185 6e-45
ref|YP_002415168.1| hypothetical protein ECUMN_4561 [Escherichia... 185 6e-45
ref|YP_004214318.1| hypothetical protein Rahaq_3601 [Rahnella sp... 184 8e-45
gb|EFX17993.1| hypothetical protein ECO2687_20949 [Escherichia c... 184 9e-45
gb|EGC12490.1| SLBB-domain-containing protein [Escherichia coli ... 184 9e-45
ref|ZP_08386356.1| conserved hypothetical protein [Escherichia c... 184 9e-45
ref|YP_002389495.1| hypothetical protein ECIAI1_4253 [Escherichi... 184 1e-44
ref|ZP_03029269.1| conserved hypothetical protein [Escherichia c... 184 1e-44
gb|EFZ75386.1| hypothetical protein ECRN5871_1899 [Escherichia c... 184 1e-44
gb|EGC06312.1| SLBB-domain-containing protein [Escherichia fergu... 184 1e-44
ref|ZP_02805425.1| conserved hypothetical protein [Escherichia c... 184 1e-44
ref|NP_756848.1| hypothetical protein c4996 [Escherichia coli CF... 184 1e-44
ref|ZP_05433479.1| hypothetical protein ShiD9_11915 [Shigella sp... 184 1e-44
ref|NP_290662.1| hypothetical protein Z5626 [Escherichia coli O1... 184 1e-44
ref|YP_405613.1| hypothetical protein SDY_4220 [Shigella dysente... 183 2e-44
ref|YP_002385133.1| hypothetical protein EFER_4120 [Escherichia ... 183 2e-44
ref|YP_219090.1| hypothetical protein SC4103 [Salmonella enteric... 183 2e-44
ref|YP_003943655.1| hypothetical protein Entcl_4138 [Enterobacte... 182 3e-44
gb|EFZ53016.1| hypothetical protein SS53G_2453 [Shigella sonnei ... 182 4e-44
gb|EFS13220.1| uncharacterized protein gfcC [Shigella flexneri 2... 182 4e-44
ref|ZP_02685432.1| conserved hypothetical protein [Salmonella en... 182 4e-44
ref|ZP_03059326.1| conserved hypothetical protein [Escherichia c... 182 5e-44
ref|ZP_02809628.1| conserved hypothetical protein [Escherichia c... 182 6e-44
ref|YP_002410323.1| hypothetical protein ECIAI39_4450 [Escherich... 181 1e-43
gb|EGB61141.1| SLBB-domain-containing protein [Escherichia coli ... 181 1e-43
ref|ZP_07185547.1| conserved hypothetical protein [Escherichia c... 181 1e-43
ref|ZP_03217856.1| conserved hypothetical protein [Salmonella en... 180 2e-43
ref|ZP_03044141.1| conserved hypothetical protein [Escherichia c... 180 2e-43
ref|ZP_06651602.1| predicted protein [Escherichia coli B354] >gb... 180 2e-43
ref|ZP_06356425.1| conserved hypothetical protein [Citrobacter y... 180 2e-43
ref|YP_001572429.1| hypothetical protein SARI_03458 [Salmonella ... 180 2e-43
ref|NP_458522.1| hypothetical protein STY4420 [Salmonella enteri... 179 4e-43
ref|ZP_07380293.1| protein of unknown function DUF1017 [Pantoea ... 179 5e-43
ref|ZP_03357288.1| hypothetical protein SentesTyphi_01726 [Salmo... 177 1e-42
ref|YP_004114116.1| hypothetical protein Pat9b_0234 [Pantoea sp.... 177 1e-42
ref|ZP_08376264.1| conserved hypothetical protein [Escherichia c... 177 2e-42
ref|ZP_04558593.1| conserved hypothetical protein [Citrobacter s... 176 4e-42
ref|ZP_02346309.1| conserved hypothetical protein [Salmonella en... 175 5e-42
ref|ZP_07183334.1| conserved hypothetical protein [Escherichia c... 173 2e-41
ref|YP_002218119.1| hypothetical protein SeD_A4618 [Salmonella e... 172 5e-41
ref|ZP_03213760.1| conserved hypothetical protein [Salmonella en... 172 6e-41
ref|YP_001591314.1| hypothetical protein SPAB_05204 [Salmonella ... 172 6e-41
ref|ZP_02664389.1| conserved hypothetical protein [Salmonella en... 171 1e-40
gb|EFY12299.1| hypothetical protein SEEM315_09434 [Salmonella en... 171 1e-40
ref|ZP_03069392.1| conserved hypothetical protein [Escherichia c... 171 1e-40
ref|YP_153099.1| hypothetical protein SPA4041 [Salmonella enteri... 170 1e-40
ref|ZP_02799370.2| conserved hypothetical protein [Escherichia c... 170 2e-40
gb|EFW51577.1| YjbG polysaccharide synthesis-related protein [Sh... 170 2e-40
ref|YP_002043474.1| hypothetical protein SNSL254_A4566 [Salmonel... 170 2e-40
ref|ZP_03064544.1| conserved hypothetical protein [Shigella dyse... 170 2e-40
ref|YP_002149138.1| hypothetical protein SeAg_B4481 [Salmonella ... 170 2e-40
ref|ZP_02785878.2| conserved hypothetical protein [Escherichia c... 170 2e-40
ref|ZP_02833337.1| conserved hypothetical protein [Salmonella en... 169 3e-40
ref|NP_463089.1| periplasmic protein [Salmonella enterica subsp.... 169 4e-40
ref|ZP_06537804.1| hypothetical protein Salmonellaentericaenteri... 168 6e-40
ref|ZP_03373074.1| hypothetical protein SentesTyp_23395 [Salmone... 168 6e-40
ref|YP_002228797.1| hypothetical protein SG4066 [Salmonella ente... 168 8e-40
ref|ZP_04656864.1| hypothetical protein SentesTe_18130 [Salmonel... 168 8e-40
gb|EGI88734.1| hypothetical protein SB521682_4838 [Shigella boyd... 168 9e-40
ref|ZP_03364691.1| hypothetical protein SentesTyph_17318 [Salmon... 149 3e-34
gb|EFW61835.1| YjbG polysaccharide synthesis-related protein [Sh... 149 5e-34
ref|YP_410318.1| hypothetical protein SBO_4056 [Shigella boydii ... 141 9e-32
ref|ZP_03338398.1| hypothetical protein Salmonelentericaenterica... 125 5e-27
gb|EDA50530.1| hypothetical protein GOS_1989170 [marine metagenome] 125 8e-27
gb|EBB46452.1| hypothetical protein GOS_229183 [marine metagenome] 124 2e-26
ref|ZP_03830735.1| hypothetical protein PcarcW_05039 [Pectobacte... 121 1e-25
ref|YP_003260364.1| hypothetical protein Pecwa_3011 [Pectobacter... 119 3e-25
ref|YP_003016906.1| hypothetical protein PC1_1323 [Pectobacteriu... 119 5e-25
gb|EFU99994.1| conserved hypothetical protein [Escherichia coli ... 118 1e-24
ref|ZP_03826818.1| hypothetical protein PcarbP_09375 [Pectobacte... 117 1e-24
ref|YP_049553.1| hypothetical protein ECA1447 [Pectobacterium at... 117 1e-24
ref|YP_003005145.1| hypothetical protein Dd1591_2844 [Dickeya ze... 117 2e-24
ref|YP_003882187.1| hypothetical protein Dda3937_03274 [Dickeya ... 115 6e-24
ref|YP_691475.1| hypothetical protein SFV_4186 [Shigella flexner... 114 2e-23
ref|ZP_06715180.1| conserved hypothetical protein [Edwardsiella ... 114 2e-23
ref|YP_962878.1| hypothetical protein Sputw3181_1486 [Shewanella... 112 5e-23
gb|ADV54996.1| protein of unknown function DUF1017 [Shewanella p... 111 1e-22
ref|YP_001184042.1| hypothetical protein Sputcn32_2522 [Shewanel... 110 3e-22
ref|YP_737462.1| hypothetical protein Shewmr7_1406 [Shewanella s... 108 6e-22
ref|YP_733476.1| hypothetical protein Shewmr4_1341 [Shewanella s... 106 3e-21
ref|YP_001051215.1| hypothetical protein Sbal_2862 [Shewanella b... 105 9e-21
ref|YP_003332847.1| hypothetical protein Dd586_1258 [Dickeya dad... 105 9e-21
gb|EDA46671.1| hypothetical protein GOS_1996150 [marine metagenome] 104 1e-20
ref|YP_869034.1| hypothetical protein Shewana3_1394 [Shewanella ... 104 1e-20
gb|EDA79313.1| hypothetical protein GOS_1936285 [marine metagenome] 103 2e-20
ref|YP_001555433.1| hypothetical protein Sbal195_3008 [Shewanell... 103 3e-20
ref|ZP_07391524.1| protein of unknown function DUF1017 [Shewanel... 103 3e-20
ref|ZP_08567183.1| YjbG polysaccharide synthesis protein [Shewan... 103 3e-20
gb|EDA62109.1| hypothetical protein GOS_1968443 [marine metagenome] 102 5e-20
ref|YP_857379.1| putative periplasmic protein [Aeromonas hydroph... 101 1e-19
ref|ZP_08570204.1| SLBB-domain like (DUF1017) [Rheinheimera sp. ... 98.2 1e-18
ref|NP_718714.1| polysaccharide synthesis-related protein [Shewa... 97.8 1e-18
ref|YP_001367076.1| hypothetical protein Shew185_2879 [Shewanell... 94.0 2e-17
gb|EGK28123.1| hypothetical protein SFK272_1496 [Shigella flexne... 93.6 3e-17
ref|YP_002357425.1| hypothetical protein Sbal223_1497 [Shewanell... 93.6 3e-17
ref|YP_455825.1| hypothetical protein SG2145 [Sodalis glossinidi... 91.7 1e-16
ref|YP_004433358.1| hypothetical protein Glaag_1129 [Glaciecola ... 90.5 2e-16
ref|YP_203541.1| hypothetical protein VF_0158 [Vibrio fischeri E... 88.6 7e-16
ref|ZP_01218708.1| hypothetical polysaccharide synthesis-related... 88.6 8e-16
ref|YP_130871.1| polysaccharide synthesis-like protein [Photobac... 87.0 2e-15
ref|YP_002154921.1| hypothetical protein VFMJ11_0150 [Vibrio fis... 86.7 3e-15
ref|YP_662763.1| hypothetical protein Patl_3203 [Pseudoalteromon... 81.3 1e-13
ref|ZP_06054113.1| hypothetical polysaccharide synthesis-related... 80.9 2e-13
gb|EBT31304.1| hypothetical protein GOS_7294385 [marine metagenome] 80.1 3e-13
ref|YP_002261809.1| exported protein [Aliivibrio salmonicida LFI... 80.1 3e-13
gb|ECC92702.1| hypothetical protein GOS_5462754 [marine metagenome] 76.3 5e-12
gb|EBT37533.1| hypothetical protein GOS_7284195 [marine metagenome] 73.9 2e-11
ref|YP_004565064.1| hypothetical protein VAA_02509 [Vibrio angui... 73.9 2e-11
ref|ZP_05879679.1| hypothetical protein VFA_003816 [Vibrio furni... 72.4 6e-11
ref|YP_154958.1| hypothetical protein IL0568 [Idiomarina loihien... 72.4 6e-11
gb|ADT85366.1| hypothetical periplasmic protein [Vibrio furnissi... 72.4 6e-11
ref|ZP_05883360.1| polysaccharide synthesis-related protein [Vib... 70.1 3e-10
ref|YP_002415894.1| hypothetical protein VS_0210 [Vibrio splendi... 68.9 8e-10
ref|ZP_01982747.1| conserved hypothetical protein [Vibrio choler... 68.6 1e-09
ref|ZP_06053682.1| hypothetical polysaccharide synthesis-related... 68.6 1e-09
ref|ZP_01950873.1| WbfC protein [Vibrio cholerae 1587] >gb|EAY32... 68.2 1e-09
ref|ZP_03348517.1| hypothetical protein Salmoneentericaenterica_... 67.0 2e-09
ref|ZP_06154782.1| protein of unknown function DUF1017 [Photobac... 67.0 3e-09
ref|ZP_05118573.1| conserved hypothetical protein [Vibrio paraha... 66.6 4e-09
ref|ZP_05888474.1| hypothetical protein VIC_004993 [Vibrio coral... 66.6 4e-09
ref|ZP_00989941.1| hypothetical protein V12B01_23145 [Vibrio spl... 65.5 8e-09
ref|ZP_04961240.1| Periplasmic protein involved in polysaccharid... 65.1 1e-08
ref|ZP_04919294.1| WbfC protein [Vibrio cholerae V51] >gb|EAZ501... 64.7 1e-08
dbj|BAA33618.1| unknown [Vibrio cholerae] 64.7 1e-08
ref|NP_933122.1| hypothetical protein VV0329 [Vibrio vulnificus ... 64.3 2e-08
gb|ABI85351.1| hypothetical protein [Vibrio cholerae] 64.3 2e-08
ref|YP_004190015.1| YjbG polysaccharide synthesis-related protei... 64.3 2e-08
ref|ZP_06943634.1| periplasmic protein [Vibrio cholerae RC385] >... 64.3 2e-08
ref|ZP_01978991.1| conserved hypothetical protein [Vibrio choler... 63.5 3e-08
ref|ZP_05715654.1| hypothetical protein VMD_07000 [Vibrio mimicu... 63.5 3e-08
ref|NP_759771.1| hypothetical protein VV1_0794 [Vibrio vulnificu... 63.2 3e-08
ref|ZP_08103789.1| putative periplasmic protein [Vibrio sinaloen... 63.2 4e-08
ref|ZP_05240723.1| conserved hypothetical protein [Vibrio choler... 63.2 4e-08
ref|ZP_04402220.1| polysaccharide synthesis-related protein [Vib... 62.8 4e-08
ref|ZP_04717494.1| hypothetical protein AmacA2_21203 [Alteromona... 62.4 6e-08
gb|ECV65415.1| hypothetical protein GOS_2858926 [marine metagenome] 62.0 7e-08
ref|YP_002873221.1| hypothetical protein PFLU3660 [Pseudomonas f... 60.8 2e-07
ref|ZP_06040321.1| polysaccharide synthesis-related protein [Vib... 60.5 2e-07
ref|ZP_07774980.1| hypothetical protein PFWH6_2379 [Pseudomonas ... 58.5 8e-07
ref|YP_004469043.1| hypothetical protein ambt_18740 [Alteromonas... 58.2 1e-06
ref|YP_001339659.1| hypothetical protein Mmwyl1_0790 [Marinomona... 57.0 3e-06
gb|EDJ38382.1| hypothetical protein GOS_1705098 [marine metagenome] 56.2 4e-06
ref|YP_002890759.1| hypothetical protein Tmz1t_3793 [Thauera sp.... 55.5 9e-06
gb|EBF39314.1| hypothetical protein GOS_9605035 [marine metagenome] 55.1 9e-06
gb|ECO71570.1| hypothetical protein GOS_4328030 [marine metagenome] 54.7 1e-05
ref|ZP_06173722.1| conserved hypothetical protein [Vibrio harvey... 52.8 6e-05
gb|ECG95961.1| hypothetical protein GOS_3683526 [marine metagenome] 52.0 1e-04
ref|ZP_07742696.1| putative periplasmic protein [Vibrio caribben... 51.2 2e-04
ref|YP_349560.1| hypothetical protein Pfl01_3831 [Pseudomonas fl... 51.2 2e-04
ref|ZP_01074260.1| hypothetical protein MED121_15079 [Marinomona... 50.8 2e-04
gb|EBP46974.1| hypothetical protein GOS_7906243 [marine metagenome] 47.0 0.003
ref|ZP_06180024.1| hypothetical protein VMC_14540 [Vibrio algino... 46.6 0.003
gb|EBH71459.1| hypothetical protein GOS_9215222 [marine metagenome] 46.6 0.003
gb|EBW35987.1| hypothetical protein GOS_6758981 [marine metagenome] 46.2 0.005
ref|ZP_01261311.1| hypothetical protein V12G01_11523 [Vibrio alg... 45.1 0.009
gb|ECD07193.1| hypothetical protein GOS_4892991 [marine metagenome] 45.1 0.009
gb|EBK16152.1| hypothetical protein GOS_8778301 [marine metagenome] 45.1 0.009
gb|EBF16285.1| hypothetical protein GOS_9642686 [marine metagenome] 44.7 0.012
ref|YP_004627399.1| polysaccharide export protein [Thermodesulfo... 44.7 0.013
gb|ECT46535.1| hypothetical protein GOS_5865945 [marine metagenome] 44.3 0.017
ref|YP_002943855.1| hypothetical protein Vapar_1942 [Variovorax ... 44.3 0.019
ref|YP_003448508.1| polysaccharide export outer membrane protein... 44.3 0.020
ref|YP_002296906.1| polysaccharide biosynthesis [Rhodospirillum ... 43.5 0.030
ref|ZP_01987515.1| conserved hypothetical protein [Vibrio harvey... 43.1 0.035
>ref|NP_415505.1| conserved protein [Escherichia coli str. K-12 substr. MG1655]
ref|AP_001614.1| hypothetical protein [Escherichia coli str. K-12 substr. W3110]
ref|YP_001457822.1| group 4 capsule (G4C) polysaccharide, YmcB [Escherichia coli
HS]
53 more sequence titles
Length=248
Score = 503 bits (1294), Expect = 1e-140, Method: Compositional matrix adjust.
Identities = 248/248 (100%), Positives = 248/248 (100%), Gaps = 0/248 (0%)
Query 1 MNKLQSYFIASVLYVMTPHAFAQGTVTIYLPGEQQTLSVGPVENVVQLVTQPQLRDRLWW 60
MNKLQSYFIASVLYVMTPHAFAQGTVTIYLPGEQQTLSVGPVENVVQLVTQPQLRDRLWW
Sbjct 1 MNKLQSYFIASVLYVMTPHAFAQGTVTIYLPGEQQTLSVGPVENVVQLVTQPQLRDRLWW 60
Query 61 PGALLTDSAAKAKALKDYQHVMAQLASWEAEADDDVAATIKSVRQQLLNLNITGRLPVKL 120
PGALLTDSAAKAKALKDYQHVMAQLASWEAEADDDVAATIKSVRQQLLNLNITGRLPVKL
Sbjct 61 PGALLTDSAAKAKALKDYQHVMAQLASWEAEADDDVAATIKSVRQQLLNLNITGRLPVKL 120
Query 121 DPDFVRVDENSNPPLVGDYTLYTVQRPVTITLLGAVSGAGQLPWQAGRSVTDYLQDHPRL 180
DPDFVRVDENSNPPLVGDYTLYTVQRPVTITLLGAVSGAGQLPWQAGRSVTDYLQDHPRL
Sbjct 121 DPDFVRVDENSNPPLVGDYTLYTVQRPVTITLLGAVSGAGQLPWQAGRSVTDYLQDHPRL 180
Query 181 AGADKNNVMVITPEGETVVAPVALWNKRHVEPPPGSQLWLGFSAHVLPEKYADLNDQIVS 240
AGADKNNVMVITPEGETVVAPVALWNKRHVEPPPGSQLWLGFSAHVLPEKYADLNDQIVS
Sbjct 181 AGADKNNVMVITPEGETVVAPVALWNKRHVEPPPGSQLWLGFSAHVLPEKYADLNDQIVS 240
Query 241 VLTQRVPD 248
VLTQRVPD
Sbjct 241 VLTQRVPD 248
>ref|YP_309961.1| hypothetical protein SSON_0992 [Shigella sonnei Ss046]
ref|YP_001462217.1| hypothetical protein EcE24377A_1101 [Escherichia coli E24377A]
ref|YP_001725569.1| hypothetical protein EcolC_2611 [Escherichia coli ATCC 8739]
ref|ZP_03028398.1| group 4 capsule (G4C) polysaccharide, YmcB [Escherichia coli
B7A]
ref|ZP_03051741.1| group 4 capsule (G4C) polysaccharide, YmcB [Escherichia coli
E110019]
ref|YP_002292322.1| hypothetical protein ECSE_1047 [Escherichia coli SE11]
ref|YP_003228580.1| hypothetical protein ECO26_1540 [Escherichia coli O26:H11 str.
11368]
ref|YP_003233596.1| hypothetical protein ECO111_1096 [Escherichia coli O111:H- str.
11128]
ref|ZP_07591268.1| protein of unknown function DUF1017 [Escherichia coli W]
ref|ZP_07689445.1| conserved hypothetical protein [Escherichia coli MS 145-7]
ref|ZP_07785295.1| uncharacterized protein gfcC [Escherichia coli 1827-70]
gb|AAZ87726.1| conserved hypothetical protein [Shigella sonnei Ss046]
gb|ABV19908.1| conserved hypothetical protein [Escherichia coli E24377A]
gb|ACA78242.1| protein of unknown function DUF1017 [Escherichia coli ATCC 8739]
gb|EDV63094.1| group 4 capsule (G4C) polysaccharide, YmcB [Escherichia coli
B7A]
gb|EDV86387.1| group 4 capsule (G4C) polysaccharide, YmcB [Escherichia coli
E110019]
dbj|BAG76571.1| conserved hypothetical protein [Escherichia coli SE11]
dbj|BAI24840.1| conserved predicted protein [Escherichia coli O26:H11 str. 11368]
dbj|BAI35045.1| conserved predicted protein [Escherichia coli O111:H- str. 11128]
gb|EFN38682.1| protein of unknown function DUF1017 [Escherichia coli W]
gb|EFO58591.1| conserved hypothetical protein [Escherichia coli MS 145-7]
gb|EFQ02044.1| uncharacterized protein gfcC [Escherichia coli 1827-70]
gb|ADT74597.1| conserved hypothetical protein [Escherichia coli W]
gb|EFW61897.1| hypothetical protein SGF_00626 [Shigella flexneri CDC 796-83]
gb|EFZ43595.1| hypothetical protein ECEPECA14_0667 [Escherichia coli EPECa14]
gb|EFZ51259.1| hypothetical protein SS53G_4157 [Shigella sonnei 53G]
gb|EFZ61475.1| hypothetical protein ECOK1180_5623 [Escherichia coli 1180]
gb|ADX51438.1| protein of unknown function DUF1017 [Escherichia coli KO11]
gb|EGB88343.1| hypothetical protein HMPREF9542_02196 [Escherichia coli MS 117-3]
Length=248
Score = 501 bits (1291), Expect = 3e-140, Method: Compositional matrix adjust.
Identities = 247/248 (99%), Positives = 248/248 (100%), Gaps = 0/248 (0%)
Query 1 MNKLQSYFIASVLYVMTPHAFAQGTVTIYLPGEQQTLSVGPVENVVQLVTQPQLRDRLWW 60
MNKLQSYFIASVLYVMTPHAFAQGTVTIYLPGEQQTLSVGP+ENVVQLVTQPQLRDRLWW
Sbjct 1 MNKLQSYFIASVLYVMTPHAFAQGTVTIYLPGEQQTLSVGPMENVVQLVTQPQLRDRLWW 60
Query 61 PGALLTDSAAKAKALKDYQHVMAQLASWEAEADDDVAATIKSVRQQLLNLNITGRLPVKL 120
PGALLTDSAAKAKALKDYQHVMAQLASWEAEADDDVAATIKSVRQQLLNLNITGRLPVKL
Sbjct 61 PGALLTDSAAKAKALKDYQHVMAQLASWEAEADDDVAATIKSVRQQLLNLNITGRLPVKL 120
Query 121 DPDFVRVDENSNPPLVGDYTLYTVQRPVTITLLGAVSGAGQLPWQAGRSVTDYLQDHPRL 180
DPDFVRVDENSNPPLVGDYTLYTVQRPVTITLLGAVSGAGQLPWQAGRSVTDYLQDHPRL
Sbjct 121 DPDFVRVDENSNPPLVGDYTLYTVQRPVTITLLGAVSGAGQLPWQAGRSVTDYLQDHPRL 180
Query 181 AGADKNNVMVITPEGETVVAPVALWNKRHVEPPPGSQLWLGFSAHVLPEKYADLNDQIVS 240
AGADKNNVMVITPEGETVVAPVALWNKRHVEPPPGSQLWLGFSAHVLPEKYADLNDQIVS
Sbjct 181 AGADKNNVMVITPEGETVVAPVALWNKRHVEPPPGSQLWLGFSAHVLPEKYADLNDQIVS 240
Query 241 VLTQRVPD 248
VLTQRVPD
Sbjct 241 VLTQRVPD 248
>ref|ZP_03070105.1| group 4 capsule (G4C) polysaccharide, YmcB [Escherichia coli
101-1]
ref|ZP_07145145.1| conserved hypothetical protein [Escherichia coli MS 187-1]
gb|EDX38942.1| group 4 capsule (G4C) polysaccharide, YmcB [Escherichia coli
101-1]
gb|EFK25881.1| conserved hypothetical protein [Escherichia coli MS 187-1]
gb|EGB68331.1| SLBB-domain-containing protein [Escherichia coli TA007]
Length=248
Score = 501 bits (1291), Expect = 4e-140, Method: Compositional matrix adjust.
Identities = 247/248 (99%), Positives = 248/248 (100%), Gaps = 0/248 (0%)
Query 1 MNKLQSYFIASVLYVMTPHAFAQGTVTIYLPGEQQTLSVGPVENVVQLVTQPQLRDRLWW 60
MNKLQSYFIASVLYVMTPHAFAQGTVTIYLPGE+QTLSVGPVENVVQLVTQPQLRDRLWW
Sbjct 1 MNKLQSYFIASVLYVMTPHAFAQGTVTIYLPGEEQTLSVGPVENVVQLVTQPQLRDRLWW 60
Query 61 PGALLTDSAAKAKALKDYQHVMAQLASWEAEADDDVAATIKSVRQQLLNLNITGRLPVKL 120
PGALLTDSAAKAKALKDYQHVMAQLASWEAEADDDVAATIKSVRQQLLNLNITGRLPVKL
Sbjct 61 PGALLTDSAAKAKALKDYQHVMAQLASWEAEADDDVAATIKSVRQQLLNLNITGRLPVKL 120
Query 121 DPDFVRVDENSNPPLVGDYTLYTVQRPVTITLLGAVSGAGQLPWQAGRSVTDYLQDHPRL 180
DPDFVRVDENSNPPLVGDYTLYTVQRPVTITLLGAVSGAGQLPWQAGRSVTDYLQDHPRL
Sbjct 121 DPDFVRVDENSNPPLVGDYTLYTVQRPVTITLLGAVSGAGQLPWQAGRSVTDYLQDHPRL 180
Query 181 AGADKNNVMVITPEGETVVAPVALWNKRHVEPPPGSQLWLGFSAHVLPEKYADLNDQIVS 240
AGADKNNVMVITPEGETVVAPVALWNKRHVEPPPGSQLWLGFSAHVLPEKYADLNDQIVS
Sbjct 181 AGADKNNVMVITPEGETVVAPVALWNKRHVEPPPGSQLWLGFSAHVLPEKYADLNDQIVS 240
Query 241 VLTQRVPD 248
VLTQRVPD
Sbjct 241 VLTQRVPD 248
>gb|EFZ60522.1| hypothetical protein ECLT68_0462 [Escherichia coli LT-68]
Length=248
Score = 501 bits (1289), Expect = 5e-140, Method: Compositional matrix adjust.
Identities = 247/248 (99%), Positives = 247/248 (99%), Gaps = 0/248 (0%)
Query 1 MNKLQSYFIASVLYVMTPHAFAQGTVTIYLPGEQQTLSVGPVENVVQLVTQPQLRDRLWW 60
MNKLQSYFIASVLYVMTPHAFAQGTVTIYLPGEQQTLSVGPVENVVQLVTQPQLRDRLWW
Sbjct 1 MNKLQSYFIASVLYVMTPHAFAQGTVTIYLPGEQQTLSVGPVENVVQLVTQPQLRDRLWW 60
Query 61 PGALLTDSAAKAKALKDYQHVMAQLASWEAEADDDVAATIKSVRQQLLNLNITGRLPVKL 120
P ALLTDSAAKAKALKDYQHVMAQLASWEAEADDDVAATIKSVRQQLLNLNITGRLPVKL
Sbjct 61 PSALLTDSAAKAKALKDYQHVMAQLASWEAEADDDVAATIKSVRQQLLNLNITGRLPVKL 120
Query 121 DPDFVRVDENSNPPLVGDYTLYTVQRPVTITLLGAVSGAGQLPWQAGRSVTDYLQDHPRL 180
DPDFVRVDENSNPPLVGDYTLYTVQRPVTITLLGAVSGAGQLPWQAGRSVTDYLQDHPRL
Sbjct 121 DPDFVRVDENSNPPLVGDYTLYTVQRPVTITLLGAVSGAGQLPWQAGRSVTDYLQDHPRL 180
Query 181 AGADKNNVMVITPEGETVVAPVALWNKRHVEPPPGSQLWLGFSAHVLPEKYADLNDQIVS 240
AGADKNNVMVITPEGETVVAPVALWNKRHVEPPPGSQLWLGFSAHVLPEKYADLNDQIVS
Sbjct 181 AGADKNNVMVITPEGETVVAPVALWNKRHVEPPPGSQLWLGFSAHVLPEKYADLNDQIVS 240
Query 241 VLTQRVPD 248
VLTQRVPD
Sbjct 241 VLTQRVPD 248
>ref|NP_706908.1| hypothetical protein SF0987 [Shigella flexneri 2a str. 301]
ref|NP_836693.1| hypothetical protein S1054 [Shigella flexneri 2a str. 2457T]
ref|YP_688519.1| hypothetical protein SFV_0994 [Shigella flexneri 5 str. 8401]
gb|AAN42615.1| orf, conserved hypothetical protein [Shigella flexneri 2a str.
301]
gb|AAP16499.1| hypothetical protein S1054 [Shigella flexneri 2a str. 2457T]
gb|ABF03214.1| conserved hypothetical protein [Shigella flexneri 5 str. 8401]
gb|ADA73333.1| hypothetical protein SFxv_1069 [Shigella flexneri 2002017]
gb|EFS15164.1| uncharacterized protein gfcC [Shigella flexneri 2a str. 2457T]
gb|EGJ90241.1| hypothetical protein SF274771_1276 [Shigella flexneri 2747-71]
gb|EGJ93034.1| hypothetical protein SFK671_1167 [Shigella flexneri K-671]
gb|EGJ97971.1| conserved protein [Shigella flexneri 2930-71]
gb|EGK39344.1| hypothetical protein SFK304_1330 [Shigella flexneri K-304]
gb|EGM62783.1| conserved protein [Shigella flexneri J1713]
Length=248
Score = 501 bits (1289), Expect = 5e-140, Method: Compositional matrix adjust.
Identities = 247/248 (99%), Positives = 247/248 (99%), Gaps = 0/248 (0%)
Query 1 MNKLQSYFIASVLYVMTPHAFAQGTVTIYLPGEQQTLSVGPVENVVQLVTQPQLRDRLWW 60
MNKLQSYFIASVLYVMTPHAFAQGTVTIYLP EQQTLSVGPVENVVQLVTQPQLRDRLWW
Sbjct 1 MNKLQSYFIASVLYVMTPHAFAQGTVTIYLPSEQQTLSVGPVENVVQLVTQPQLRDRLWW 60
Query 61 PGALLTDSAAKAKALKDYQHVMAQLASWEAEADDDVAATIKSVRQQLLNLNITGRLPVKL 120
PGALLTDSAAKAKALKDYQHVMAQLASWEAEADDDVAATIKSVRQQLLNLNITGRLPVKL
Sbjct 61 PGALLTDSAAKAKALKDYQHVMAQLASWEAEADDDVAATIKSVRQQLLNLNITGRLPVKL 120
Query 121 DPDFVRVDENSNPPLVGDYTLYTVQRPVTITLLGAVSGAGQLPWQAGRSVTDYLQDHPRL 180
DPDFVRVDENSNPPLVGDYTLYTVQRPVTITLLGAVSGAGQLPWQAGRSVTDYLQDHPRL
Sbjct 121 DPDFVRVDENSNPPLVGDYTLYTVQRPVTITLLGAVSGAGQLPWQAGRSVTDYLQDHPRL 180
Query 181 AGADKNNVMVITPEGETVVAPVALWNKRHVEPPPGSQLWLGFSAHVLPEKYADLNDQIVS 240
AGADKNNVMVITPEGETVVAPVALWNKRHVEPPPGSQLWLGFSAHVLPEKYADLNDQIVS
Sbjct 181 AGADKNNVMVITPEGETVVAPVALWNKRHVEPPPGSQLWLGFSAHVLPEKYADLNDQIVS 240
Query 241 VLTQRVPD 248
VLTQRVPD
Sbjct 241 VLTQRVPD 248
>ref|YP_002402185.1| hypothetical protein EC55989_1093 [Escherichia coli 55989]
emb|CAU96954.1| conserved hypothetical protein [Escherichia coli 55989]
Length=248
Score = 500 bits (1288), Expect = 6e-140, Method: Compositional matrix adjust.
Identities = 246/248 (99%), Positives = 248/248 (100%), Gaps = 0/248 (0%)
Query 1 MNKLQSYFIASVLYVMTPHAFAQGTVTIYLPGEQQTLSVGPVENVVQLVTQPQLRDRLWW 60
MNKLQSYFIASVLYVMTPHAFAQGTVTIYLPGEQQTLSVGP+ENVVQLVTQPQLRDRLWW
Sbjct 1 MNKLQSYFIASVLYVMTPHAFAQGTVTIYLPGEQQTLSVGPMENVVQLVTQPQLRDRLWW 60
Query 61 PGALLTDSAAKAKALKDYQHVMAQLASWEAEADDDVAATIKSVRQQLLNLNITGRLPVKL 120
PGALLTDSAAKAKALKDYQHVMAQLASWEAEADDDVAATIKSVRQQLLNLNITGRLPVKL
Sbjct 61 PGALLTDSAAKAKALKDYQHVMAQLASWEAEADDDVAATIKSVRQQLLNLNITGRLPVKL 120
Query 121 DPDFVRVDENSNPPLVGDYTLYTVQRPVTITLLGAVSGAGQLPWQAGRSVTDYLQDHPRL 180
DPDFVR+DENSNPPLVGDYTLYTVQRPVTITLLGAVSGAGQLPWQAGRSVTDYLQDHPRL
Sbjct 121 DPDFVRLDENSNPPLVGDYTLYTVQRPVTITLLGAVSGAGQLPWQAGRSVTDYLQDHPRL 180
Query 181 AGADKNNVMVITPEGETVVAPVALWNKRHVEPPPGSQLWLGFSAHVLPEKYADLNDQIVS 240
AGADKNNVMVITPEGETVVAPVALWNKRHVEPPPGSQLWLGFSAHVLPEKYADLNDQIVS
Sbjct 181 AGADKNNVMVITPEGETVVAPVALWNKRHVEPPPGSQLWLGFSAHVLPEKYADLNDQIVS 240
Query 241 VLTQRVPD 248
VLTQRVPD
Sbjct 241 VLTQRVPD 248
>ref|YP_408638.1| hypothetical protein SBO_2245 [Shigella boydii Sb227]
gb|ABB66810.1| conserved hypothetical protein [Shigella boydii Sb227]
gb|EGI98404.1| hypothetical protein SB359474_2603 [Shigella boydii 3594-74]
Length=248
Score = 500 bits (1287), Expect = 8e-140, Method: Compositional matrix adjust.
Identities = 246/248 (99%), Positives = 248/248 (100%), Gaps = 0/248 (0%)
Query 1 MNKLQSYFIASVLYVMTPHAFAQGTVTIYLPGEQQTLSVGPVENVVQLVTQPQLRDRLWW 60
MNKLQSYFIASVLYVMTPHAFAQGTVTIYLPGEQQTLSVGP+ENVVQLVTQPQLRDRLWW
Sbjct 1 MNKLQSYFIASVLYVMTPHAFAQGTVTIYLPGEQQTLSVGPMENVVQLVTQPQLRDRLWW 60
Query 61 PGALLTDSAAKAKALKDYQHVMAQLASWEAEADDDVAATIKSVRQQLLNLNITGRLPVKL 120
PGALLTDSAAKAKALKDYQHVMAQLASWEAEADDDVAATIKSVRQQLLNLNITGRLPVKL
Sbjct 61 PGALLTDSAAKAKALKDYQHVMAQLASWEAEADDDVAATIKSVRQQLLNLNITGRLPVKL 120
Query 121 DPDFVRVDENSNPPLVGDYTLYTVQRPVTITLLGAVSGAGQLPWQAGRSVTDYLQDHPRL 180
DPDFVRVDENSNPPLVGDYTLYTVQRPVTITLLGAVSGAGQLPWQAGRSVTDYLQDHPRL
Sbjct 121 DPDFVRVDENSNPPLVGDYTLYTVQRPVTITLLGAVSGAGQLPWQAGRSVTDYLQDHPRL 180
Query 181 AGADKNNVMVITPEGETVVAPVALWNKRHVEPPPGSQLWLGFSAHVLPEKYADLNDQIVS 240
AGADKNNVMVITPEGETVVAPVALWNKRHV+PPPGSQLWLGFSAHVLPEKYADLNDQIVS
Sbjct 181 AGADKNNVMVITPEGETVVAPVALWNKRHVDPPPGSQLWLGFSAHVLPEKYADLNDQIVS 240
Query 241 VLTQRVPD 248
VLTQRVPD
Sbjct 241 VLTQRVPD 248
>ref|YP_001880817.1| group 4 capsule (G4C) polysaccharide, YmcB [Shigella boydii CDC
3083-94]
gb|ACD07514.1| group 4 capsule (G4C) polysaccharide, YmcB [Shigella boydii CDC
3083-94]
Length=248
Score = 500 bits (1287), Expect = 8e-140, Method: Compositional matrix adjust.
Identities = 246/248 (99%), Positives = 248/248 (100%), Gaps = 0/248 (0%)
Query 1 MNKLQSYFIASVLYVMTPHAFAQGTVTIYLPGEQQTLSVGPVENVVQLVTQPQLRDRLWW 60
MNKLQSYFIASVLYVMTPHAFAQGTVTIYLPGEQQTLSVGP+ENVVQLVTQPQLRDRLWW
Sbjct 1 MNKLQSYFIASVLYVMTPHAFAQGTVTIYLPGEQQTLSVGPMENVVQLVTQPQLRDRLWW 60
Query 61 PGALLTDSAAKAKALKDYQHVMAQLASWEAEADDDVAATIKSVRQQLLNLNITGRLPVKL 120
PGALLTDSAAKAKALKDYQHVMAQLASWEAEA+DDVAATIKSVRQQLLNLNITGRLPVKL
Sbjct 61 PGALLTDSAAKAKALKDYQHVMAQLASWEAEANDDVAATIKSVRQQLLNLNITGRLPVKL 120
Query 121 DPDFVRVDENSNPPLVGDYTLYTVQRPVTITLLGAVSGAGQLPWQAGRSVTDYLQDHPRL 180
DPDFVRVDENSNPPLVGDYTLYTVQRPVTITLLGAVSGAGQLPWQAGRSVTDYLQDHPRL
Sbjct 121 DPDFVRVDENSNPPLVGDYTLYTVQRPVTITLLGAVSGAGQLPWQAGRSVTDYLQDHPRL 180
Query 181 AGADKNNVMVITPEGETVVAPVALWNKRHVEPPPGSQLWLGFSAHVLPEKYADLNDQIVS 240
AGADKNNVMVITPEGETVVAPVALWNKRHVEPPPGSQLWLGFSAHVLPEKYADLNDQIVS
Sbjct 181 AGADKNNVMVITPEGETVVAPVALWNKRHVEPPPGSQLWLGFSAHVLPEKYADLNDQIVS 240
Query 241 VLTQRVPD 248
VLTQRVPD
Sbjct 241 VLTQRVPD 248
>gb|EFZ70668.1| hypothetical protein ECOK1357_1256 [Escherichia coli 1357]
Length=248
Score = 500 bits (1287), Expect = 9e-140, Method: Compositional matrix adjust.
Identities = 246/248 (99%), Positives = 247/248 (99%), Gaps = 0/248 (0%)
Query 1 MNKLQSYFIASVLYVMTPHAFAQGTVTIYLPGEQQTLSVGPVENVVQLVTQPQLRDRLWW 60
MNKLQSYFIASVLYVMTPHAFAQGTVTIYLPGEQQTLSVGP+ENVVQLVTQPQLRDRLWW
Sbjct 1 MNKLQSYFIASVLYVMTPHAFAQGTVTIYLPGEQQTLSVGPMENVVQLVTQPQLRDRLWW 60
Query 61 PGALLTDSAAKAKALKDYQHVMAQLASWEAEADDDVAATIKSVRQQLLNLNITGRLPVKL 120
P ALLTDSAAKAKALKDYQHVMAQLASWEAEADDDVAATIKSVRQQLLNLNITGRLPVKL
Sbjct 61 PSALLTDSAAKAKALKDYQHVMAQLASWEAEADDDVAATIKSVRQQLLNLNITGRLPVKL 120
Query 121 DPDFVRVDENSNPPLVGDYTLYTVQRPVTITLLGAVSGAGQLPWQAGRSVTDYLQDHPRL 180
DPDFVRVDENSNPPLVGDYTLYTVQRPVTITLLGAVSGAGQLPWQAGRSVTDYLQDHPRL
Sbjct 121 DPDFVRVDENSNPPLVGDYTLYTVQRPVTITLLGAVSGAGQLPWQAGRSVTDYLQDHPRL 180
Query 181 AGADKNNVMVITPEGETVVAPVALWNKRHVEPPPGSQLWLGFSAHVLPEKYADLNDQIVS 240
AGADKNNVMVITPEGETVVAPVALWNKRHVEPPPGSQLWLGFSAHVLPEKYADLNDQIVS
Sbjct 181 AGADKNNVMVITPEGETVVAPVALWNKRHVEPPPGSQLWLGFSAHVLPEKYADLNDQIVS 240
Query 241 VLTQRVPD 248
VLTQRVPD
Sbjct 241 VLTQRVPD 248
>ref|ZP_03064769.1| group 4 capsule (G4C) polysaccharide, YmcB [Shigella dysenteriae
1012]
gb|EDX35451.1| group 4 capsule (G4C) polysaccharide, YmcB [Shigella dysenteriae
1012]
Length=248
Score = 500 bits (1287), Expect = 1e-139, Method: Compositional matrix adjust.
Identities = 247/248 (99%), Positives = 247/248 (99%), Gaps = 0/248 (0%)
Query 1 MNKLQSYFIASVLYVMTPHAFAQGTVTIYLPGEQQTLSVGPVENVVQLVTQPQLRDRLWW 60
MNKLQSYFIASVLYVMTPHAFAQGTVTIYLPGEQQTLSVGPVENVVQLVTQPQLRDRLWW
Sbjct 1 MNKLQSYFIASVLYVMTPHAFAQGTVTIYLPGEQQTLSVGPVENVVQLVTQPQLRDRLWW 60
Query 61 PGALLTDSAAKAKALKDYQHVMAQLASWEAEADDDVAATIKSVRQQLLNLNITGRLPVKL 120
PGALLTDSAAKAKALKDYQHVMAQLASWEAEADDDVAATIKSVRQQLLNLNITGRLPVKL
Sbjct 61 PGALLTDSAAKAKALKDYQHVMAQLASWEAEADDDVAATIKSVRQQLLNLNITGRLPVKL 120
Query 121 DPDFVRVDENSNPPLVGDYTLYTVQRPVTITLLGAVSGAGQLPWQAGRSVTDYLQDHPRL 180
PDFVRVDENSNPPLVGDYTLYTVQRPVTITLLGAVSGAGQLPWQAGRSVTDYLQDHPRL
Sbjct 121 APDFVRVDENSNPPLVGDYTLYTVQRPVTITLLGAVSGAGQLPWQAGRSVTDYLQDHPRL 180
Query 181 AGADKNNVMVITPEGETVVAPVALWNKRHVEPPPGSQLWLGFSAHVLPEKYADLNDQIVS 240
AGADKNNVMVITPEGETVVAPVALWNKRHVEPPPGSQLWLGFSAHVLPEKYADLNDQIVS
Sbjct 181 AGADKNNVMVITPEGETVVAPVALWNKRHVEPPPGSQLWLGFSAHVLPEKYADLNDQIVS 240
Query 241 VLTQRVPD 248
VLTQRVPD
Sbjct 241 VLTQRVPD 248
>gb|EGB58446.1| SLBB-domain-containing protein [Escherichia coli H489]
Length=248
Score = 499 bits (1286), Expect = 1e-139, Method: Compositional matrix adjust.
Identities = 246/248 (99%), Positives = 247/248 (99%), Gaps = 0/248 (0%)
Query 1 MNKLQSYFIASVLYVMTPHAFAQGTVTIYLPGEQQTLSVGPVENVVQLVTQPQLRDRLWW 60
MNKLQSYFIASVLYVMTPHAFAQGTVTIYLPGE+QTLSVGPVENVVQLVTQPQLRDRLWW
Sbjct 1 MNKLQSYFIASVLYVMTPHAFAQGTVTIYLPGEEQTLSVGPVENVVQLVTQPQLRDRLWW 60
Query 61 PGALLTDSAAKAKALKDYQHVMAQLASWEAEADDDVAATIKSVRQQLLNLNITGRLPVKL 120
P ALLTDSAAKAKALKDYQHVMAQLASWEAEADDDVAATIKSVRQQLLNLNITGRLPVKL
Sbjct 61 PSALLTDSAAKAKALKDYQHVMAQLASWEAEADDDVAATIKSVRQQLLNLNITGRLPVKL 120
Query 121 DPDFVRVDENSNPPLVGDYTLYTVQRPVTITLLGAVSGAGQLPWQAGRSVTDYLQDHPRL 180
DPDFVRVDENSNPPLVGDYTLYTVQRPVTITLLGAVSGAGQLPWQAGRSVTDYLQDHPRL
Sbjct 121 DPDFVRVDENSNPPLVGDYTLYTVQRPVTITLLGAVSGAGQLPWQAGRSVTDYLQDHPRL 180
Query 181 AGADKNNVMVITPEGETVVAPVALWNKRHVEPPPGSQLWLGFSAHVLPEKYADLNDQIVS 240
AGADKNNVMVITPEGETVVAPVALWNKRHVEPPPGSQLWLGFSAHVLPEKYADLNDQIVS
Sbjct 181 AGADKNNVMVITPEGETVVAPVALWNKRHVEPPPGSQLWLGFSAHVLPEKYADLNDQIVS 240
Query 241 VLTQRVPD 248
VLTQRVPD
Sbjct 241 VLTQRVPD 248
>ref|ZP_06656911.1| YmcB [Escherichia coli B185]
gb|EFF07293.1| YmcB [Escherichia coli B185]
Length=248
Score = 499 bits (1286), Expect = 1e-139, Method: Compositional matrix adjust.
Identities = 246/248 (99%), Positives = 247/248 (99%), Gaps = 0/248 (0%)
Query 1 MNKLQSYFIASVLYVMTPHAFAQGTVTIYLPGEQQTLSVGPVENVVQLVTQPQLRDRLWW 60
MNKLQSYFIASVLYVMTPHAFAQGTVTIYLPGEQQTLSVGPVENV QLVTQPQLRDRLWW
Sbjct 1 MNKLQSYFIASVLYVMTPHAFAQGTVTIYLPGEQQTLSVGPVENVAQLVTQPQLRDRLWW 60
Query 61 PGALLTDSAAKAKALKDYQHVMAQLASWEAEADDDVAATIKSVRQQLLNLNITGRLPVKL 120
PGALLTDSAAKAKALKDYQHVMAQLASWEAEADDDVAATIKSVRQQLLNLNITGRLPVKL
Sbjct 61 PGALLTDSAAKAKALKDYQHVMAQLASWEAEADDDVAATIKSVRQQLLNLNITGRLPVKL 120
Query 121 DPDFVRVDENSNPPLVGDYTLYTVQRPVTITLLGAVSGAGQLPWQAGRSVTDYLQDHPRL 180
DPDFVRVDENSNPPLVGDYTLYTVQRPVTITLLGAVSGAGQLPWQAGRSVTDYLQDHPRL
Sbjct 121 DPDFVRVDENSNPPLVGDYTLYTVQRPVTITLLGAVSGAGQLPWQAGRSVTDYLQDHPRL 180
Query 181 AGADKNNVMVITPEGETVVAPVALWNKRHVEPPPGSQLWLGFSAHVLPEKYADLNDQIVS 240
AGADKNNV+VITPEGETVVAPVALWNKRHVEPPPGSQLWLGFSAHVLPEKYADLNDQIVS
Sbjct 181 AGADKNNVIVITPEGETVVAPVALWNKRHVEPPPGSQLWLGFSAHVLPEKYADLNDQIVS 240
Query 241 VLTQRVPD 248
VLTQRVPD
Sbjct 241 VLTQRVPD 248
>ref|ZP_08353029.1| conserved hypothetical protein [Escherichia coli M718]
gb|EGI22346.1| conserved hypothetical protein [Escherichia coli M718]
Length=248
Score = 499 bits (1286), Expect = 1e-139, Method: Compositional matrix adjust.
Identities = 246/248 (99%), Positives = 246/248 (99%), Gaps = 0/248 (0%)
Query 1 MNKLQSYFIASVLYVMTPHAFAQGTVTIYLPGEQQTLSVGPVENVVQLVTQPQLRDRLWW 60
MNKLQSYFIASVLYVMTPHAFAQGTVTIYLPGEQQTLSVGPVENV QLVTQPQLRDRLWW
Sbjct 1 MNKLQSYFIASVLYVMTPHAFAQGTVTIYLPGEQQTLSVGPVENVAQLVTQPQLRDRLWW 60
Query 61 PGALLTDSAAKAKALKDYQHVMAQLASWEAEADDDVAATIKSVRQQLLNLNITGRLPVKL 120
PGALLTDSA KAKALKDYQHVMAQLASWEAEADDDVAATIKSVRQQLLNLNITGRLPVKL
Sbjct 61 PGALLTDSAVKAKALKDYQHVMAQLASWEAEADDDVAATIKSVRQQLLNLNITGRLPVKL 120
Query 121 DPDFVRVDENSNPPLVGDYTLYTVQRPVTITLLGAVSGAGQLPWQAGRSVTDYLQDHPRL 180
DPDFVRVDENSNPPLVGDYTLYTVQRPVTITLLGAVSGAGQLPWQAGRSVTDYLQDHPRL
Sbjct 121 DPDFVRVDENSNPPLVGDYTLYTVQRPVTITLLGAVSGAGQLPWQAGRSVTDYLQDHPRL 180
Query 181 AGADKNNVMVITPEGETVVAPVALWNKRHVEPPPGSQLWLGFSAHVLPEKYADLNDQIVS 240
AGADKNNVMVITPEGETVVAPVALWNKRHVEPPPGSQLWLGFSAHVLPEKYADLNDQIVS
Sbjct 181 AGADKNNVMVITPEGETVVAPVALWNKRHVEPPPGSQLWLGFSAHVLPEKYADLNDQIVS 240
Query 241 VLTQRVPD 248
VLTQRVPD
Sbjct 241 VLTQRVPD 248
>ref|ZP_02999848.1| group 4 capsule (G4C) polysaccharide, YmcB [Escherichia coli
53638]
gb|EDU62880.1| group 4 capsule (G4C) polysaccharide, YmcB [Escherichia coli
53638]
Length=248
Score = 499 bits (1285), Expect = 2e-139, Method: Compositional matrix adjust.
Identities = 247/248 (99%), Positives = 247/248 (99%), Gaps = 0/248 (0%)
Query 1 MNKLQSYFIASVLYVMTPHAFAQGTVTIYLPGEQQTLSVGPVENVVQLVTQPQLRDRLWW 60
MNKLQSYFIASVLYVMTPHAFAQGTVTIYLPGEQQTLSVGPVENVVQLVTQ QLRDRLWW
Sbjct 1 MNKLQSYFIASVLYVMTPHAFAQGTVTIYLPGEQQTLSVGPVENVVQLVTQLQLRDRLWW 60
Query 61 PGALLTDSAAKAKALKDYQHVMAQLASWEAEADDDVAATIKSVRQQLLNLNITGRLPVKL 120
PGALLTDSAAKAKALKDYQHVMAQLASWEAEADDDVAATIKSVRQQLLNLNITGRLPVKL
Sbjct 61 PGALLTDSAAKAKALKDYQHVMAQLASWEAEADDDVAATIKSVRQQLLNLNITGRLPVKL 120
Query 121 DPDFVRVDENSNPPLVGDYTLYTVQRPVTITLLGAVSGAGQLPWQAGRSVTDYLQDHPRL 180
DPDFVRVDENSNPPLVGDYTLYTVQRPVTITLLGAVSGAGQLPWQAGRSVTDYLQDHPRL
Sbjct 121 DPDFVRVDENSNPPLVGDYTLYTVQRPVTITLLGAVSGAGQLPWQAGRSVTDYLQDHPRL 180
Query 181 AGADKNNVMVITPEGETVVAPVALWNKRHVEPPPGSQLWLGFSAHVLPEKYADLNDQIVS 240
AGADKNNVMVITPEGETVVAPVALWNKRHVEPPPGSQLWLGFSAHVLPEKYADLNDQIVS
Sbjct 181 AGADKNNVMVITPEGETVVAPVALWNKRHVEPPPGSQLWLGFSAHVLPEKYADLNDQIVS 240
Query 241 VLTQRVPD 248
VLTQRVPD
Sbjct 241 VLTQRVPD 248
>ref|ZP_06652940.1| YmcB protein [Escherichia coli B354]
gb|EFF12316.1| YmcB protein [Escherichia coli B354]
Length=248
Score = 499 bits (1285), Expect = 2e-139, Method: Compositional matrix adjust.
Identities = 246/248 (99%), Positives = 246/248 (99%), Gaps = 0/248 (0%)
Query 1 MNKLQSYFIASVLYVMTPHAFAQGTVTIYLPGEQQTLSVGPVENVVQLVTQPQLRDRLWW 60
MNKLQSYFIASVLYVMTPHAFAQGTVTIYLPGEQQTLSVGPVENVVQLVTQPQLRDRLWW
Sbjct 1 MNKLQSYFIASVLYVMTPHAFAQGTVTIYLPGEQQTLSVGPVENVVQLVTQPQLRDRLWW 60
Query 61 PGALLTDSAAKAKALKDYQHVMAQLASWEAEADDDVAATIKSVRQQLLNLNITGRLPVKL 120
PGALLTDSAAKAKALKDYQHVMAQLASWEAEADDDV ATIKSVRQQLLNLNITGRLPVKL
Sbjct 61 PGALLTDSAAKAKALKDYQHVMAQLASWEAEADDDVVATIKSVRQQLLNLNITGRLPVKL 120
Query 121 DPDFVRVDENSNPPLVGDYTLYTVQRPVTITLLGAVSGAGQLPWQAGRSVTDYLQDHPRL 180
DPDFVRVDENSNPPLVGDYTLYTVQRPVTITLLGAVSGAGQLPWQAGR VTDYLQDHPRL
Sbjct 121 DPDFVRVDENSNPPLVGDYTLYTVQRPVTITLLGAVSGAGQLPWQAGRRVTDYLQDHPRL 180
Query 181 AGADKNNVMVITPEGETVVAPVALWNKRHVEPPPGSQLWLGFSAHVLPEKYADLNDQIVS 240
AGADKNNVMVITPEGETVVAPVALWNKRHVEPPPGSQLWLGFSAHVLPEKYADLNDQIVS
Sbjct 181 AGADKNNVMVITPEGETVVAPVALWNKRHVEPPPGSQLWLGFSAHVLPEKYADLNDQIVS 240
Query 241 VLTQRVPD 248
VLTQRVPD
Sbjct 241 VLTQRVPD 248
>ref|ZP_03047374.1| group 4 capsule (G4C) polysaccharide, YmcB [Escherichia coli
E22]
ref|ZP_03062339.1| group 4 capsule (G4C) polysaccharide, YmcB [Escherichia coli
B171]
ref|YP_003221011.1| hypothetical protein ECO103_1030 [Escherichia coli O103:H2 str.
12009]
gb|EDV80675.1| group 4 capsule (G4C) polysaccharide, YmcB [Escherichia coli
E22]
gb|EDX28417.1| group 4 capsule (G4C) polysaccharide, YmcB [Escherichia coli
B171]
dbj|BAI29877.1| conserved predicted protein [Escherichia coli O103:H2 str. 12009]
gb|EFZ44274.1| hypothetical protein ECE128010_5482 [Escherichia coli E128010]
Length=248
Score = 499 bits (1284), Expect = 2e-139, Method: Compositional matrix adjust.
Identities = 246/248 (99%), Positives = 247/248 (99%), Gaps = 0/248 (0%)
Query 1 MNKLQSYFIASVLYVMTPHAFAQGTVTIYLPGEQQTLSVGPVENVVQLVTQPQLRDRLWW 60
MNKLQSYFIASVLYVMTPHAFAQGTVTIYLPGEQQTLSVGPVENVVQLVTQPQLRDRLWW
Sbjct 1 MNKLQSYFIASVLYVMTPHAFAQGTVTIYLPGEQQTLSVGPVENVVQLVTQPQLRDRLWW 60
Query 61 PGALLTDSAAKAKALKDYQHVMAQLASWEAEADDDVAATIKSVRQQLLNLNITGRLPVKL 120
PGALLTDSAAKAKALKDYQHVMAQLASWEAEADDDVAATIKSVRQQLLNL+ITGRLPVKL
Sbjct 61 PGALLTDSAAKAKALKDYQHVMAQLASWEAEADDDVAATIKSVRQQLLNLHITGRLPVKL 120
Query 121 DPDFVRVDENSNPPLVGDYTLYTVQRPVTITLLGAVSGAGQLPWQAGRSVTDYLQDHPRL 180
DPDFVRVDENSNPPLVGDYTLYTVQRPVTITLLGAVSG GQLPWQAGRSVTDYLQDHPRL
Sbjct 121 DPDFVRVDENSNPPLVGDYTLYTVQRPVTITLLGAVSGTGQLPWQAGRSVTDYLQDHPRL 180
Query 181 AGADKNNVMVITPEGETVVAPVALWNKRHVEPPPGSQLWLGFSAHVLPEKYADLNDQIVS 240
AGADKNNVMVITPEGETVVAPVALWNKRHVEPPPGSQLWLGFSAHVLPEKYADLNDQIVS
Sbjct 181 AGADKNNVMVITPEGETVVAPVALWNKRHVEPPPGSQLWLGFSAHVLPEKYADLNDQIVS 240
Query 241 VLTQRVPD 248
VLTQRVPD
Sbjct 241 VLTQRVPD 248
>ref|YP_003036816.1| hypothetical protein ECBD_2609 [Escherichia coli 'BL21-Gold(DE3)pLysS
AG']
ref|YP_003044206.1| hypothetical protein ECB_00988 [Escherichia coli B str. REL606]
ref|ZP_06939304.1| hypothetical protein EcolOP_25017 [Escherichia coli OP50]
emb|CAQ31512.1| conserved protein [Escherichia coli BL21(DE3)]
gb|ACT29631.1| protein of unknown function DUF1017 [Escherichia coli 'BL21-Gold(DE3)pLysS
AG']
gb|ACT38670.1| hypothetical protein ECB_00988 [Escherichia coli B str. REL606]
gb|ACT42883.1| hypothetical protein ECD_00988 [Escherichia coli BL21(DE3)]
Length=248
Score = 498 bits (1283), Expect = 3e-139, Method: Compositional matrix adjust.
Identities = 246/248 (99%), Positives = 247/248 (99%), Gaps = 0/248 (0%)
Query 1 MNKLQSYFIASVLYVMTPHAFAQGTVTIYLPGEQQTLSVGPVENVVQLVTQPQLRDRLWW 60
MNKLQSYFIASVLYVMTPHAFAQGTVTIYLPGE+QTLSVGPVENVVQLVTQPQLRDRLWW
Sbjct 1 MNKLQSYFIASVLYVMTPHAFAQGTVTIYLPGEEQTLSVGPVENVVQLVTQPQLRDRLWW 60
Query 61 PGALLTDSAAKAKALKDYQHVMAQLASWEAEADDDVAATIKSVRQQLLNLNITGRLPVKL 120
P ALLTDSAAKAKALKDYQHVMAQLASWEAEADDDVAATIKSVRQQLLNLNITGRLPVKL
Sbjct 61 PVALLTDSAAKAKALKDYQHVMAQLASWEAEADDDVAATIKSVRQQLLNLNITGRLPVKL 120
Query 121 DPDFVRVDENSNPPLVGDYTLYTVQRPVTITLLGAVSGAGQLPWQAGRSVTDYLQDHPRL 180
DPDFVRVDENSNPPLVGDYTLYTVQRPVTITLLGAVSGAGQLPWQAGRSVTDYLQDHPRL
Sbjct 121 DPDFVRVDENSNPPLVGDYTLYTVQRPVTITLLGAVSGAGQLPWQAGRSVTDYLQDHPRL 180
Query 181 AGADKNNVMVITPEGETVVAPVALWNKRHVEPPPGSQLWLGFSAHVLPEKYADLNDQIVS 240
AGADKNNVMVITPEGETVVAPVALWNKRHVEPPPGSQLWLGFSAHVLPEKYADLNDQIVS
Sbjct 181 AGADKNNVMVITPEGETVVAPVALWNKRHVEPPPGSQLWLGFSAHVLPEKYADLNDQIVS 240
Query 241 VLTQRVPD 248
VLTQRVPD
Sbjct 241 VLTQRVPD 248
>ref|ZP_08373309.1| conserved hypothetical protein [Escherichia coli TA280]
gb|EGI41529.1| conserved hypothetical protein [Escherichia coli TA280]
Length=248
Score = 498 bits (1281), Expect = 5e-139, Method: Compositional matrix adjust.
Identities = 244/248 (98%), Positives = 246/248 (99%), Gaps = 0/248 (0%)
Query 1 MNKLQSYFIASVLYVMTPHAFAQGTVTIYLPGEQQTLSVGPVENVVQLVTQPQLRDRLWW 60
MNKLQSYFIASVLYVMTPHAFAQGTVTIYLPGEQQTLSVGPVENVVQLVTQPQLRDRLWW
Sbjct 1 MNKLQSYFIASVLYVMTPHAFAQGTVTIYLPGEQQTLSVGPVENVVQLVTQPQLRDRLWW 60
Query 61 PGALLTDSAAKAKALKDYQHVMAQLASWEAEADDDVAATIKSVRQQLLNLNITGRLPVKL 120
PGALLTDSAAKAKALKDYQHVMAQLASWEAEADDDV ATIKSVRQQLLNLNITGRLPVKL
Sbjct 61 PGALLTDSAAKAKALKDYQHVMAQLASWEAEADDDVVATIKSVRQQLLNLNITGRLPVKL 120
Query 121 DPDFVRVDENSNPPLVGDYTLYTVQRPVTITLLGAVSGAGQLPWQAGRSVTDYLQDHPRL 180
DPDFVRVDENSNPPLVG+YTLYTVQRPVTITLLGAVSG GQLPWQAGRSVTDYLQDHPRL
Sbjct 121 DPDFVRVDENSNPPLVGNYTLYTVQRPVTITLLGAVSGTGQLPWQAGRSVTDYLQDHPRL 180
Query 181 AGADKNNVMVITPEGETVVAPVALWNKRHVEPPPGSQLWLGFSAHVLPEKYADLNDQIVS 240
AGADKNNVMVITPEGETVVAPVALWNKRHVEPPPGSQLWLGFSAHVLPEKYADLNDQIVS
Sbjct 181 AGADKNNVMVITPEGETVVAPVALWNKRHVEPPPGSQLWLGFSAHVLPEKYADLNDQIVS 240
Query 241 VLTQRVPD 248
+LTQRVPD
Sbjct 241 ILTQRVPD 248
>ref|YP_402624.1| hypothetical protein SDY_0961 [Shigella dysenteriae Sd197]
gb|ABB61133.1| conserved hypothetical protein [Shigella dysenteriae Sd197]
Length=248
Score = 497 bits (1280), Expect = 6e-139, Method: Compositional matrix adjust.
Identities = 244/248 (98%), Positives = 246/248 (99%), Gaps = 0/248 (0%)
Query 1 MNKLQSYFIASVLYVMTPHAFAQGTVTIYLPGEQQTLSVGPVENVVQLVTQPQLRDRLWW 60
MNKLQSYFIASVLYVMTPHAFAQGTVTIYLPGEQQTLSVGPVENV QLVTQPQLRDRLWW
Sbjct 1 MNKLQSYFIASVLYVMTPHAFAQGTVTIYLPGEQQTLSVGPVENVAQLVTQPQLRDRLWW 60
Query 61 PGALLTDSAAKAKALKDYQHVMAQLASWEAEADDDVAATIKSVRQQLLNLNITGRLPVKL 120
PGALLTDSA KAKALKDYQHVMAQLASWEAEADDDVAATIKSVRQQLLNLNITGRLPVKL
Sbjct 61 PGALLTDSAVKAKALKDYQHVMAQLASWEAEADDDVAATIKSVRQQLLNLNITGRLPVKL 120
Query 121 DPDFVRVDENSNPPLVGDYTLYTVQRPVTITLLGAVSGAGQLPWQAGRSVTDYLQDHPRL 180
DPDFVRVDENSNPPLVGDYTLYTVQRPVTITLLGAVSGAGQLPWQAGRSVTDYLQDHPRL
Sbjct 121 DPDFVRVDENSNPPLVGDYTLYTVQRPVTITLLGAVSGAGQLPWQAGRSVTDYLQDHPRL 180
Query 181 AGADKNNVMVITPEGETVVAPVALWNKRHVEPPPGSQLWLGFSAHVLPEKYADLNDQIVS 240
AGADKNNV+VITPEGETV+APVALWNKRHVEPPPGSQLWLGFSAHVLPEKYADLNDQIVS
Sbjct 181 AGADKNNVIVITPEGETVIAPVALWNKRHVEPPPGSQLWLGFSAHVLPEKYADLNDQIVS 240
Query 241 VLTQRVPD 248
VLTQRVPD
Sbjct 241 VLTQRVPD 248
>ref|ZP_08383110.1| conserved hypothetical protein [Escherichia coli H299]
gb|EGI51301.1| conserved hypothetical protein [Escherichia coli H299]
Length=248
Score = 497 bits (1279), Expect = 7e-139, Method: Compositional matrix adjust.
Identities = 245/248 (98%), Positives = 245/248 (98%), Gaps = 0/248 (0%)
Query 1 MNKLQSYFIASVLYVMTPHAFAQGTVTIYLPGEQQTLSVGPVENVVQLVTQPQLRDRLWW 60
MNKLQSYFIASVLYVMTPHAFAQGTVTIYLPGEQQTLSVGPVENVVQLVTQPQLRDRLWW
Sbjct 1 MNKLQSYFIASVLYVMTPHAFAQGTVTIYLPGEQQTLSVGPVENVVQLVTQPQLRDRLWW 60
Query 61 PGALLTDSAAKAKALKDYQHVMAQLASWEAEADDDVAATIKSVRQQLLNLNITGRLPVKL 120
PGALLTDSAAKAKALKDYQHVMAQLASWEAEADDDV ATIKSVRQQLLNL ITGRLPVKL
Sbjct 61 PGALLTDSAAKAKALKDYQHVMAQLASWEAEADDDVVATIKSVRQQLLNLKITGRLPVKL 120
Query 121 DPDFVRVDENSNPPLVGDYTLYTVQRPVTITLLGAVSGAGQLPWQAGRSVTDYLQDHPRL 180
DPDFVRVDENSNPPLVGDYTLYTVQRPVTITLLGAVSG GQLPWQAGRSVTDYLQDHPRL
Sbjct 121 DPDFVRVDENSNPPLVGDYTLYTVQRPVTITLLGAVSGTGQLPWQAGRSVTDYLQDHPRL 180
Query 181 AGADKNNVMVITPEGETVVAPVALWNKRHVEPPPGSQLWLGFSAHVLPEKYADLNDQIVS 240
AGADKNNVMVITPEGETVVAPVALWNKRHVEPPPGSQLWLGFSAHVLPEKYADLNDQIVS
Sbjct 181 AGADKNNVMVITPEGETVVAPVALWNKRHVEPPPGSQLWLGFSAHVLPEKYADLNDQIVS 240
Query 241 VLTQRVPD 248
VLTQRVPD
Sbjct 241 VLTQRVPD 248
>ref|NP_286922.1| hypothetical protein Z1402 [Escherichia coli O157:H7 EDL933]
ref|NP_309168.1| hypothetical protein ECs1141 [Escherichia coli O157:H7 str. Sakai]
ref|ZP_02772422.1| group 4 capsule (G4C) polysaccharide, YmcB [Escherichia coli
O157:H7 str. EC4113]
ref|ZP_02779313.1| group 4 capsule (G4C) polysaccharide, YmcB [Escherichia coli
O157:H7 str. EC4401]
ref|ZP_02788229.1| group 4 capsule (G4C) polysaccharide, YmcB [Escherichia coli
O157:H7 str. EC4501]
ref|ZP_02792490.1| group 4 capsule (G4C) polysaccharide, YmcB [Escherichia coli
O157:H7 str. EC4486]
ref|ZP_02802092.1| group 4 capsule (G4C) polysaccharide, YmcB [Escherichia coli
O157:H7 str. EC4196]
ref|ZP_02806009.1| group 4 capsule (G4C) polysaccharide, YmcB [Escherichia coli
O157:H7 str. EC4076]
ref|ZP_02812498.1| group 4 capsule (G4C) polysaccharide, YmcB [Escherichia coli
O157:H7 str. EC869]
ref|ZP_02825622.1| group 4 capsule (G4C) polysaccharide, YmcB [Escherichia coli
O157:H7 str. EC508]
ref|ZP_03080625.1| hypothetical protein EscherichcoliO157_02102 [Escherichia coli
O157:H7 str. EC4024]
ref|ZP_03252007.1| group 4 capsule (G4C) polysaccharide, YmcB [Escherichia coli
O157:H7 str. EC4206]
ref|ZP_03256547.1| group 4 capsule (G4C) polysaccharide, YmcB [Escherichia coli
O157:H7 str. EC4045]
ref|ZP_03262548.1| group 4 capsule (G4C) polysaccharide, YmcB [Escherichia coli
O157:H7 str. EC4042]
ref|YP_002269711.1| group 4 capsule (G4C) polysaccharide, YmcB [Escherichia coli
O157:H7 str. EC4115]
ref|ZP_03441081.1| group 4 capsule (G4C) polysaccharide, YmcB [Escherichia coli
O157:H7 str. TW14588]
ref|YP_003077079.1| hypothetical protein ECSP_1154 [Escherichia coli O157:H7 str.
TW14359]
ref|ZP_05941444.1| hypothetical protein EscherichiacoliO157_21586 [Escherichia coli
O157:H7 str. FRIK2000]
ref|ZP_05948256.1| hypothetical protein EscherichiacoliO157EcO_07798 [Escherichia
coli O157:H7 str. FRIK966]
gb|AAG55533.1|AE005292_6 orf, hypothetical protein [Escherichia coli O157:H7 str. EDL933]
dbj|BAB34564.1| hypothetical protein [Escherichia coli O157:H7 str. Sakai]
gb|EDU31456.1| group 4 capsule (G4C) polysaccharide, YmcB [Escherichia coli
O157:H7 str. EC4196]
gb|EDU56334.1| group 4 capsule (G4C) polysaccharide, YmcB [Escherichia coli
O157:H7 str. EC4113]
gb|EDU70182.1| group 4 capsule (G4C) polysaccharide, YmcB [Escherichia coli
O157:H7 str. EC4076]
gb|EDU76602.1| group 4 capsule (G4C) polysaccharide, YmcB [Escherichia coli
O157:H7 str. EC4401]
gb|EDU81708.1| group 4 capsule (G4C) polysaccharide, YmcB [Escherichia coli
O157:H7 str. EC4486]
gb|EDU85013.1| group 4 capsule (G4C) polysaccharide, YmcB [Escherichia coli
O157:H7 str. EC4501]
gb|EDU91158.1| group 4 capsule (G4C) polysaccharide, YmcB [Escherichia coli
O157:H7 str. EC869]
gb|EDU95538.1| group 4 capsule (G4C) polysaccharide, YmcB [Escherichia coli
O157:H7 str. EC508]
gb|EDZ79072.1| group 4 capsule (G4C) polysaccharide, YmcB [Escherichia coli
O157:H7 str. EC4206]
gb|EDZ80704.1| group 4 capsule (G4C) polysaccharide, YmcB [Escherichia coli
O157:H7 str. EC4045]
gb|EDZ85397.1| group 4 capsule (G4C) polysaccharide, YmcB [Escherichia coli
O157:H7 str. EC4042]
gb|ACI35308.1| group 4 capsule (G4C) polysaccharide, YmcB [Escherichia coli
O157:H7 str. EC4115]
gb|ACI85396.1| hypothetical protein ECs1141 [Escherichia coli]
gb|ACI85397.1| hypothetical protein ECs1141 [Escherichia coli]
gb|ACI85398.1| hypothetical protein ECs1141 [Escherichia coli]
gb|ACI85400.1| hypothetical protein ECs1141 [Escherichia coli]
gb|EEC29642.1| group 4 capsule (G4C) polysaccharide, YmcB [Escherichia coli
O157:H7 str. TW14588]
gb|ACT71003.1| conserved protein [Escherichia coli O157:H7 str. TW14359]
gb|EFW67159.1| hypothetical protein ECoD_00774 [Escherichia coli O157:H7 str.
EC1212]
gb|EFX07645.1| hypothetical protein ECO5101_23660 [Escherichia coli O157:H7
str. G5101]
gb|EFX12179.1| hypothetical protein ECO9389_03411 [Escherichia coli O157:H-
str. 493-89]
gb|EFX17090.1| hypothetical protein ECO2687_19521 [Escherichia coli O157:H-
str. H 2687]
gb|EFX21826.1| hypothetical protein ECO7815_15853 [Escherichia coli O55:H7 str.
3256-97 TW 07815]
gb|EFX31516.1| hypothetical protein ECOSU61_02013 [Escherichia coli O157:H7
str. LSU-61]
gb|EGD62399.1| hypothetical protein ECF_05075 [Escherichia coli O157:H7 str.
1125]
gb|EGD71503.1| hypothetical protein ECoA_00302 [Escherichia coli O157:H7 str.
1044]
Length=248
Score = 496 bits (1277), Expect = 1e-138, Method: Compositional matrix adjust.
Identities = 244/248 (98%), Positives = 246/248 (99%), Gaps = 0/248 (0%)
Query 1 MNKLQSYFIASVLYVMTPHAFAQGTVTIYLPGEQQTLSVGPVENVVQLVTQPQLRDRLWW 60
MNKLQSYFIASVLYVMTPHAFAQGTVTIYLPGEQQTLSVGPVENV QLVTQPQLRDRLWW
Sbjct 1 MNKLQSYFIASVLYVMTPHAFAQGTVTIYLPGEQQTLSVGPVENVAQLVTQPQLRDRLWW 60
Query 61 PGALLTDSAAKAKALKDYQHVMAQLASWEAEADDDVAATIKSVRQQLLNLNITGRLPVKL 120
PGALLTDSA KAKALKDYQHVMAQLASWEAEADDDVAATIKSVRQQLLNLNITGRLPVKL
Sbjct 61 PGALLTDSAVKAKALKDYQHVMAQLASWEAEADDDVAATIKSVRQQLLNLNITGRLPVKL 120
Query 121 DPDFVRVDENSNPPLVGDYTLYTVQRPVTITLLGAVSGAGQLPWQAGRSVTDYLQDHPRL 180
DPDFVRVDENSNPPLVG+YTLYTVQRPVTITLLGAVSGAGQLPWQAGRSVTDYLQDHPRL
Sbjct 121 DPDFVRVDENSNPPLVGNYTLYTVQRPVTITLLGAVSGAGQLPWQAGRSVTDYLQDHPRL 180
Query 181 AGADKNNVMVITPEGETVVAPVALWNKRHVEPPPGSQLWLGFSAHVLPEKYADLNDQIVS 240
AGADKNNV+VITPEGETVVAPVALWNKRHVEPPPGSQLWLGFSAHVLPEKYADLNDQIVS
Sbjct 181 AGADKNNVIVITPEGETVVAPVALWNKRHVEPPPGSQLWLGFSAHVLPEKYADLNDQIVS 240
Query 241 VLTQRVPD 248
VLTQRVPD
Sbjct 241 VLTQRVPD 248
>ref|YP_002328533.1| hypothetical protein E2348C_0970 [Escherichia coli O127:H6 str.
E2348/69]
emb|CAS08518.1| predicted protein [Escherichia coli O127:H6 str. E2348/69]
Length=248
Score = 495 bits (1274), Expect = 3e-138, Method: Compositional matrix adjust.
Identities = 244/248 (98%), Positives = 245/248 (98%), Gaps = 0/248 (0%)
Query 1 MNKLQSYFIASVLYVMTPHAFAQGTVTIYLPGEQQTLSVGPVENVVQLVTQPQLRDRLWW 60
MNKLQSYFIASVLYVMTPHAFAQG VTIYLPGEQQTLSVGPVENV QLVTQPQLRDRLWW
Sbjct 1 MNKLQSYFIASVLYVMTPHAFAQGMVTIYLPGEQQTLSVGPVENVAQLVTQPQLRDRLWW 60
Query 61 PGALLTDSAAKAKALKDYQHVMAQLASWEAEADDDVAATIKSVRQQLLNLNITGRLPVKL 120
PGALLTDSAAKAKALKDYQHVMAQLASWEAEADDDVAATIKSVRQQLLNLNITGRLPVKL
Sbjct 61 PGALLTDSAAKAKALKDYQHVMAQLASWEAEADDDVAATIKSVRQQLLNLNITGRLPVKL 120
Query 121 DPDFVRVDENSNPPLVGDYTLYTVQRPVTITLLGAVSGAGQLPWQAGRSVTDYLQDHPRL 180
DPDFVRVDENSNPPLVGDYTLYTVQRPVTITLLGAVSGAGQLPW AGRSVTDYLQDHPRL
Sbjct 121 DPDFVRVDENSNPPLVGDYTLYTVQRPVTITLLGAVSGAGQLPWLAGRSVTDYLQDHPRL 180
Query 181 AGADKNNVMVITPEGETVVAPVALWNKRHVEPPPGSQLWLGFSAHVLPEKYADLNDQIVS 240
AGADKNNVMVITPEGETVVAPVALWNKRHVEPPPGSQLWLGFSAHVLPEKYADLNDQIVS
Sbjct 181 AGADKNNVMVITPEGETVVAPVALWNKRHVEPPPGSQLWLGFSAHVLPEKYADLNDQIVS 240
Query 241 VLTQRVPD 248
VLTQRVP+
Sbjct 241 VLTQRVPE 248
>gb|AEE55762.1| conserved hypothetical protein [Escherichia coli UMNK88]
Length=248
Score = 495 bits (1274), Expect = 3e-138, Method: Compositional matrix adjust.
Identities = 245/248 (98%), Positives = 245/248 (98%), Gaps = 0/248 (0%)
Query 1 MNKLQSYFIASVLYVMTPHAFAQGTVTIYLPGEQQTLSVGPVENVVQLVTQPQLRDRLWW 60
MNKLQSYFIASVLYVMT HAFAQGTVTIYLPGEQQTLSVGPVENVVQLVTQPQLRDRLWW
Sbjct 1 MNKLQSYFIASVLYVMTHHAFAQGTVTIYLPGEQQTLSVGPVENVVQLVTQPQLRDRLWW 60
Query 61 PGALLTDSAAKAKALKDYQHVMAQLASWEAEADDDVAATIKSVRQQLLNLNITGRLPVKL 120
PGALLTDSAAKAKALKDYQHVMAQLASWEAEADDDV ATIKSVRQQLLNLNITGRLPVKL
Sbjct 61 PGALLTDSAAKAKALKDYQHVMAQLASWEAEADDDVVATIKSVRQQLLNLNITGRLPVKL 120
Query 121 DPDFVRVDENSNPPLVGDYTLYTVQRPVTITLLGAVSGAGQLPWQAGRSVTDYLQDHPRL 180
DPDFVRVDENSNPPLVGDYTLYTVQRPVTITLLGAVSGAGQLPWQAGRSVTDYLQD PRL
Sbjct 121 DPDFVRVDENSNPPLVGDYTLYTVQRPVTITLLGAVSGAGQLPWQAGRSVTDYLQDRPRL 180
Query 181 AGADKNNVMVITPEGETVVAPVALWNKRHVEPPPGSQLWLGFSAHVLPEKYADLNDQIVS 240
AGADKNNVMVITPEGETVVAPVALWNKRHVEPPPGSQLWLGFSAHVLPEKYADLNDQIVS
Sbjct 181 AGADKNNVMVITPEGETVVAPVALWNKRHVEPPPGSQLWLGFSAHVLPEKYADLNDQIVS 240
Query 241 VLTQRVPD 248
VLTQRVPD
Sbjct 241 VLTQRVPD 248
>ref|YP_003498802.1| Group 4 capsule (G4C) polysaccharide, YmcB [Escherichia coli
O55:H7 str. CB9615]
gb|ACI85399.1| hypothetical protein ECs1141 [Escherichia coli]
gb|ADD55818.1| Group 4 capsule (G4C) polysaccharide, YmcB [Escherichia coli
O55:H7 str. CB9615]
gb|EFX27157.1| Group 4 capsule (G4C) polysaccharide, YmcB [Escherichia coli
O55:H7 str. USDA 5905]
Length=248
Score = 494 bits (1273), Expect = 3e-138, Method: Compositional matrix adjust.
Identities = 243/248 (97%), Positives = 245/248 (98%), Gaps = 0/248 (0%)
Query 1 MNKLQSYFIASVLYVMTPHAFAQGTVTIYLPGEQQTLSVGPVENVVQLVTQPQLRDRLWW 60
MNKLQSYFIASVLYVMTPHAFAQGTVTIYLPGEQQTLSVGPVENV QLVTQPQLRDRLWW
Sbjct 1 MNKLQSYFIASVLYVMTPHAFAQGTVTIYLPGEQQTLSVGPVENVAQLVTQPQLRDRLWW 60
Query 61 PGALLTDSAAKAKALKDYQHVMAQLASWEAEADDDVAATIKSVRQQLLNLNITGRLPVKL 120
PG LLTDSA KAKALKDYQHVMAQLASWEAEADDDVAATIKSVRQQLLNLNITGRLPVKL
Sbjct 61 PGTLLTDSAVKAKALKDYQHVMAQLASWEAEADDDVAATIKSVRQQLLNLNITGRLPVKL 120
Query 121 DPDFVRVDENSNPPLVGDYTLYTVQRPVTITLLGAVSGAGQLPWQAGRSVTDYLQDHPRL 180
DPDFVRVDENSNPPLVG+YTLYTVQRPVTITLLGAVSGAGQLPWQAGRSVTDYLQDHPRL
Sbjct 121 DPDFVRVDENSNPPLVGNYTLYTVQRPVTITLLGAVSGAGQLPWQAGRSVTDYLQDHPRL 180
Query 181 AGADKNNVMVITPEGETVVAPVALWNKRHVEPPPGSQLWLGFSAHVLPEKYADLNDQIVS 240
AGADKNNV+VITPEGETVVAPVALWNKRHVEPPPGSQLWLGFSAHVLPEKYADLNDQIVS
Sbjct 181 AGADKNNVIVITPEGETVVAPVALWNKRHVEPPPGSQLWLGFSAHVLPEKYADLNDQIVS 240
Query 241 VLTQRVPD 248
VLTQRVPD
Sbjct 241 VLTQRVPD 248
>gb|EGB62657.1| SLBB-domain-containing protein [Escherichia coli M863]
gb|EGE65021.1| hypothetical protein ECSTEC7V_1672 [Escherichia coli STEC_7v]
Length=248
Score = 494 bits (1271), Expect = 6e-138, Method: Compositional matrix adjust.
Identities = 244/248 (98%), Positives = 245/248 (98%), Gaps = 0/248 (0%)
Query 1 MNKLQSYFIASVLYVMTPHAFAQGTVTIYLPGEQQTLSVGPVENVVQLVTQPQLRDRLWW 60
MNKLQSYFIASVLYVMTPHAFAQGTVTIYLPGEQQ LSVGPVENVVQLVTQPQLRDRLWW
Sbjct 1 MNKLQSYFIASVLYVMTPHAFAQGTVTIYLPGEQQPLSVGPVENVVQLVTQPQLRDRLWW 60
Query 61 PGALLTDSAAKAKALKDYQHVMAQLASWEAEADDDVAATIKSVRQQLLNLNITGRLPVKL 120
PG LLTDSAAKAKALKDYQHVMAQLASWEAEADDDVAATIKSVRQQLLNLNITGRLPVKL
Sbjct 61 PGVLLTDSAAKAKALKDYQHVMAQLASWEAEADDDVAATIKSVRQQLLNLNITGRLPVKL 120
Query 121 DPDFVRVDENSNPPLVGDYTLYTVQRPVTITLLGAVSGAGQLPWQAGRSVTDYLQDHPRL 180
DPDFVRVDENSNPPLVGDYTLYTVQRPVTITLLGAV+GAGQLPWQAGRSVTDYLQD PRL
Sbjct 121 DPDFVRVDENSNPPLVGDYTLYTVQRPVTITLLGAVTGAGQLPWQAGRSVTDYLQDTPRL 180
Query 181 AGADKNNVMVITPEGETVVAPVALWNKRHVEPPPGSQLWLGFSAHVLPEKYADLNDQIVS 240
AGADKNNVMVITPEGETVVAPVALWNKRHVEPPPGSQLWLGFSAHVLPEKYADLNDQIVS
Sbjct 181 AGADKNNVMVITPEGETVVAPVALWNKRHVEPPPGSQLWLGFSAHVLPEKYADLNDQIVS 240
Query 241 VLTQRVPD 248
VLTQRVPD
Sbjct 241 VLTQRVPD 248
>gb|EGB71723.1| SLBB-domain-containing protein [Escherichia coli TW10509]
Length=248
Score = 493 bits (1270), Expect = 9e-138, Method: Compositional matrix adjust.
Identities = 244/248 (98%), Positives = 245/248 (98%), Gaps = 0/248 (0%)
Query 1 MNKLQSYFIASVLYVMTPHAFAQGTVTIYLPGEQQTLSVGPVENVVQLVTQPQLRDRLWW 60
MNKLQSYFIASVLYVMTPHAFAQGTVTIYLPGEQQ LSVGPVENVVQLVTQPQLRDRLWW
Sbjct 1 MNKLQSYFIASVLYVMTPHAFAQGTVTIYLPGEQQPLSVGPVENVVQLVTQPQLRDRLWW 60
Query 61 PGALLTDSAAKAKALKDYQHVMAQLASWEAEADDDVAATIKSVRQQLLNLNITGRLPVKL 120
PG LLTDSAAKAKALKDYQHVMAQLASWEAEADDDVAATIKSVRQQLLNLNITGRLPVKL
Sbjct 61 PGVLLTDSAAKAKALKDYQHVMAQLASWEAEADDDVAATIKSVRQQLLNLNITGRLPVKL 120
Query 121 DPDFVRVDENSNPPLVGDYTLYTVQRPVTITLLGAVSGAGQLPWQAGRSVTDYLQDHPRL 180
DPDFVR+DENSNPPLVGDYTLYTVQRPVTITLLGAVSGAGQLPWQAGRSVTDYLQD PRL
Sbjct 121 DPDFVRMDENSNPPLVGDYTLYTVQRPVTITLLGAVSGAGQLPWQAGRSVTDYLQDTPRL 180
Query 181 AGADKNNVMVITPEGETVVAPVALWNKRHVEPPPGSQLWLGFSAHVLPEKYADLNDQIVS 240
AGADKNNVMVITPEGETVVAPVALWNKRHVEPPPGSQLWLGFSAHVLPEKYADLNDQIVS
Sbjct 181 AGADKNNVMVITPEGETVVAPVALWNKRHVEPPPGSQLWLGFSAHVLPEKYADLNDQIVS 240
Query 241 VLTQRVPD 248
VLTQRVPD
Sbjct 241 VLTQRVPD 248
>ref|ZP_07137361.1| conserved hypothetical protein [Escherichia coli MS 115-1]
gb|EFJ95361.1| conserved hypothetical protein [Escherichia coli MS 115-1]
Length=248
Score = 493 bits (1269), Expect = 1e-137, Method: Compositional matrix adjust.
Identities = 244/248 (98%), Positives = 246/248 (99%), Gaps = 0/248 (0%)
Query 1 MNKLQSYFIASVLYVMTPHAFAQGTVTIYLPGEQQTLSVGPVENVVQLVTQPQLRDRLWW 60
MNKLQSYFIASVLYVMTPHAFAQGTVTIYLPGEQQTLSVGPVENVVQLVTQPQLRDRLWW
Sbjct 1 MNKLQSYFIASVLYVMTPHAFAQGTVTIYLPGEQQTLSVGPVENVVQLVTQPQLRDRLWW 60
Query 61 PGALLTDSAAKAKALKDYQHVMAQLASWEAEADDDVAATIKSVRQQLLNLNITGRLPVKL 120
PGALLTDSAAKAKALKDYQHV+AQLASWEAEADDDVAATIKSVRQQLLNLNITGRLPVKL
Sbjct 61 PGALLTDSAAKAKALKDYQHVIAQLASWEAEADDDVAATIKSVRQQLLNLNITGRLPVKL 120
Query 121 DPDFVRVDENSNPPLVGDYTLYTVQRPVTITLLGAVSGAGQLPWQAGRSVTDYLQDHPRL 180
DPDFVRVDENSNPPLVGDYTLYTVQRPVTITLLGAVSGA QLPWQAGRSVTDYLQDHPRL
Sbjct 121 DPDFVRVDENSNPPLVGDYTLYTVQRPVTITLLGAVSGAEQLPWQAGRSVTDYLQDHPRL 180
Query 181 AGADKNNVMVITPEGETVVAPVALWNKRHVEPPPGSQLWLGFSAHVLPEKYADLNDQIVS 240
AGADKNNV+VITPEGETVVAPVALWNKRHVEPP GSQLWLGFSAHVLPEKYADLNDQIVS
Sbjct 181 AGADKNNVIVITPEGETVVAPVALWNKRHVEPPLGSQLWLGFSAHVLPEKYADLNDQIVS 240
Query 241 VLTQRVPD 248
VLTQRVPD
Sbjct 241 VLTQRVPD 248
>gb|EFZ75912.1| hypothetical protein ECRN5871_1089 [Escherichia coli RN587/1]
Length=248
Score = 491 bits (1263), Expect = 5e-137, Method: Compositional matrix adjust.
Identities = 243/248 (97%), Positives = 243/248 (97%), Gaps = 0/248 (0%)
Query 1 MNKLQSYFIASVLYVMTPHAFAQGTVTIYLPGEQQTLSVGPVENVVQLVTQPQLRDRLWW 60
MNKLQSYFIASVLYVMTPHAFAQG VTIYLPGEQQTLSVGPVENV QLVTQPQLRDRLWW
Sbjct 1 MNKLQSYFIASVLYVMTPHAFAQGMVTIYLPGEQQTLSVGPVENVAQLVTQPQLRDRLWW 60
Query 61 PGALLTDSAAKAKALKDYQHVMAQLASWEAEADDDVAATIKSVRQQLLNLNITGRLPVKL 120
PGALLTDSAAKAKALKDYQHVMAQLASWEAEADDDVAATIKSVRQQLLNLNITGRLPVKL
Sbjct 61 PGALLTDSAAKAKALKDYQHVMAQLASWEAEADDDVAATIKSVRQQLLNLNITGRLPVKL 120
Query 121 DPDFVRVDENSNPPLVGDYTLYTVQRPVTITLLGAVSGAGQLPWQAGRSVTDYLQDHPRL 180
PDFVRVDENSNPPLVGDYTLYTVQRPVTITLLGAVSGAGQLPW AGRSVTDYLQDH RL
Sbjct 121 APDFVRVDENSNPPLVGDYTLYTVQRPVTITLLGAVSGAGQLPWLAGRSVTDYLQDHTRL 180
Query 181 AGADKNNVMVITPEGETVVAPVALWNKRHVEPPPGSQLWLGFSAHVLPEKYADLNDQIVS 240
AGADKNNVMVITPEGETVVAPVALWNKRHVEPPPGSQLWLGFSAHVLPEKYADLNDQIVS
Sbjct 181 AGADKNNVMVITPEGETVVAPVALWNKRHVEPPPGSQLWLGFSAHVLPEKYADLNDQIVS 240
Query 241 VLTQRVPD 248
VLTQRVPD
Sbjct 241 VLTQRVPD 248
>gb|EGJ00865.1| hypothetical protein SD15574_1331 [Shigella dysenteriae 155-74]
Length=233
Score = 469 bits (1207), Expect = 2e-130, Method: Compositional matrix adjust.
Identities = 232/233 (99%), Positives = 232/233 (99%), Gaps = 0/233 (0%)
Query 16 MTPHAFAQGTVTIYLPGEQQTLSVGPVENVVQLVTQPQLRDRLWWPGALLTDSAAKAKAL 75
MTPHAFAQGTVTIYLPGEQQTLSVGPVENVVQLVTQPQLRDRLWWPGALLTDSAAKAKAL
Sbjct 1 MTPHAFAQGTVTIYLPGEQQTLSVGPVENVVQLVTQPQLRDRLWWPGALLTDSAAKAKAL 60
Query 76 KDYQHVMAQLASWEAEADDDVAATIKSVRQQLLNLNITGRLPVKLDPDFVRVDENSNPPL 135
KDYQHVMAQLASWEAEADDDVAATIKSVRQQLLNLNITGRLPVKL PDFVRVDENSNPPL
Sbjct 61 KDYQHVMAQLASWEAEADDDVAATIKSVRQQLLNLNITGRLPVKLAPDFVRVDENSNPPL 120
Query 136 VGDYTLYTVQRPVTITLLGAVSGAGQLPWQAGRSVTDYLQDHPRLAGADKNNVMVITPEG 195
VGDYTLYTVQRPVTITLLGAVSGAGQLPWQAGRSVTDYLQDHPRLAGADKNNVMVITPEG
Sbjct 121 VGDYTLYTVQRPVTITLLGAVSGAGQLPWQAGRSVTDYLQDHPRLAGADKNNVMVITPEG 180
Query 196 ETVVAPVALWNKRHVEPPPGSQLWLGFSAHVLPEKYADLNDQIVSVLTQRVPD 248
ETVVAPVALWNKRHVEPPPGSQLWLGFSAHVLPEKYADLNDQIVSVLTQRVPD
Sbjct 181 ETVVAPVALWNKRHVEPPPGSQLWLGFSAHVLPEKYADLNDQIVSVLTQRVPD 233
>ref|ZP_07783163.1| uncharacterized protein gfcC [Escherichia coli 2362-75]
gb|EFR14271.1| uncharacterized protein gfcC [Escherichia coli 2362-75]
Length=233
Score = 466 bits (1198), Expect = 2e-129, Method: Compositional matrix adjust.
Identities = 230/233 (98%), Positives = 230/233 (98%), Gaps = 0/233 (0%)
Query 16 MTPHAFAQGTVTIYLPGEQQTLSVGPVENVVQLVTQPQLRDRLWWPGALLTDSAAKAKAL 75
MTPHAFAQG VTIYLPGEQQTLSVGPVENV QLVTQPQLRDRLWWPGALLTDSAAKAKAL
Sbjct 1 MTPHAFAQGMVTIYLPGEQQTLSVGPVENVAQLVTQPQLRDRLWWPGALLTDSAAKAKAL 60
Query 76 KDYQHVMAQLASWEAEADDDVAATIKSVRQQLLNLNITGRLPVKLDPDFVRVDENSNPPL 135
KDYQHVMAQLASWEAEADDDVAATIKSVRQQLLNLNITGRLPVKLDPDFVRVDENSNPPL
Sbjct 61 KDYQHVMAQLASWEAEADDDVAATIKSVRQQLLNLNITGRLPVKLDPDFVRVDENSNPPL 120
Query 136 VGDYTLYTVQRPVTITLLGAVSGAGQLPWQAGRSVTDYLQDHPRLAGADKNNVMVITPEG 195
VGDYTLYTVQRPVTITLLGAVSGAGQLPW AGRSVTDYLQDHPRLAGADKNNVMVITPEG
Sbjct 121 VGDYTLYTVQRPVTITLLGAVSGAGQLPWLAGRSVTDYLQDHPRLAGADKNNVMVITPEG 180
Query 196 ETVVAPVALWNKRHVEPPPGSQLWLGFSAHVLPEKYADLNDQIVSVLTQRVPD 248
ETVVAPVALWNKRHVEPPPGSQLWLGFSAHVLPEKYADLNDQIVSVLTQRVPD
Sbjct 181 ETVVAPVALWNKRHVEPPPGSQLWLGFSAHVLPEKYADLNDQIVSVLTQRVPD 233
>ref|ZP_07681343.1| conserved hypothetical protein [Shigella dysenteriae 1617]
gb|EFP70903.1| conserved hypothetical protein [Shigella dysenteriae 1617]
Length=233
Score = 464 bits (1193), Expect = 7e-129, Method: Compositional matrix adjust.
Identities = 228/233 (97%), Positives = 230/233 (98%), Gaps = 0/233 (0%)
Query 16 MTPHAFAQGTVTIYLPGEQQTLSVGPVENVVQLVTQPQLRDRLWWPGALLTDSAAKAKAL 75
MTPHAFAQGTVTIYLPGEQQTLSVGPVENV QLVTQPQLRDRLWWPGALLTDSA KAKAL
Sbjct 1 MTPHAFAQGTVTIYLPGEQQTLSVGPVENVAQLVTQPQLRDRLWWPGALLTDSAVKAKAL 60
Query 76 KDYQHVMAQLASWEAEADDDVAATIKSVRQQLLNLNITGRLPVKLDPDFVRVDENSNPPL 135
KDYQHVMAQLASWEAEADDDVAATIKSVRQQLLNLNITGRLPVKL PDFVRVDENSNPPL
Sbjct 61 KDYQHVMAQLASWEAEADDDVAATIKSVRQQLLNLNITGRLPVKLAPDFVRVDENSNPPL 120
Query 136 VGDYTLYTVQRPVTITLLGAVSGAGQLPWQAGRSVTDYLQDHPRLAGADKNNVMVITPEG 195
VGDYTLYTVQRPVTITLLGAVSGAGQLPWQAGRSVTDYLQDHPRLAGADKNNV+VITPEG
Sbjct 121 VGDYTLYTVQRPVTITLLGAVSGAGQLPWQAGRSVTDYLQDHPRLAGADKNNVIVITPEG 180
Query 196 ETVVAPVALWNKRHVEPPPGSQLWLGFSAHVLPEKYADLNDQIVSVLTQRVPD 248
ETV+APVALWNKRHVEPPPGSQLWLGFSAHVLPEKYADLNDQIVSVLTQRVPD
Sbjct 181 ETVIAPVALWNKRHVEPPPGSQLWLGFSAHVLPEKYADLNDQIVSVLTQRVPD 233
>ref|ZP_02904453.1| group 4 capsule (G4C) polysaccharide, YmcB [Escherichia albertii
TW07627]
gb|EDS90115.1| group 4 capsule (G4C) polysaccharide, YmcB [Escherichia albertii
TW07627]
Length=248
Score = 449 bits (1156), Expect = 1e-124, Method: Compositional matrix adjust.
Identities = 221/248 (89%), Positives = 230/248 (92%), Gaps = 0/248 (0%)
Query 1 MNKLQSYFIASVLYVMTPHAFAQGTVTIYLPGEQQTLSVGPVENVVQLVTQPQLRDRLWW 60
MNKL SYFIASVLYV+TPHAFAQG+VT+YLPGE+Q LSV VENV QLVTQPQLRDRLWW
Sbjct 1 MNKLPSYFIASVLYVITPHAFAQGSVTVYLPGEKQALSVESVENVAQLVTQPQLRDRLWW 60
Query 61 PGALLTDSAAKAKALKDYQHVMAQLASWEAEADDDVAATIKSVRQQLLNLNITGRLPVKL 120
PGALLTDSAAKAKA KDYQHVMAQLASWEAEADDDVAATIK VRQQL NLNITGRL V+L
Sbjct 61 PGALLTDSAAKAKADKDYQHVMAQLASWEAEADDDVAATIKFVRQQLTNLNITGRLSVEL 120
Query 121 DPDFVRVDENSNPPLVGDYTLYTVQRPVTITLLGAVSGAGQLPWQAGRSVTDYLQDHPRL 180
DPDFVRVDE+SN PLVGDY LY VQRP TITLLGAVSGAGQLPW+AGRSV DYLQ HPRL
Sbjct 121 DPDFVRVDEDSNRPLVGDYALYAVQRPSTITLLGAVSGAGQLPWRAGRSVADYLQHHPRL 180
Query 181 AGADKNNVMVITPEGETVVAPVALWNKRHVEPPPGSQLWLGFSAHVLPEKYADLNDQIVS 240
AGAD NNV VITPEG+TVVAPVALWNKRHVEPPPGSQLWLGFS HVLPEKYADLN+QIVS
Sbjct 181 AGADSNNVFVITPEGKTVVAPVALWNKRHVEPPPGSQLWLGFSTHVLPEKYADLNNQIVS 240
Query 241 VLTQRVPD 248
VLTQRVPD
Sbjct 241 VLTQRVPD 248
>gb|EGI97539.1| hypothetical protein SB521682_1222 [Shigella boydii 5216-82]
Length=223
Score = 448 bits (1152), Expect = 4e-124, Method: Compositional matrix adjust.
Identities = 221/223 (99%), Positives = 223/223 (100%), Gaps = 0/223 (0%)
Query 26 VTIYLPGEQQTLSVGPVENVVQLVTQPQLRDRLWWPGALLTDSAAKAKALKDYQHVMAQL 85
+TIYLPGEQQTLSVGPVENVVQLVTQPQLRDRLWWPGALLTDSAAKAKALKDYQHVMAQL
Sbjct 1 MTIYLPGEQQTLSVGPVENVVQLVTQPQLRDRLWWPGALLTDSAAKAKALKDYQHVMAQL 60
Query 86 ASWEAEADDDVAATIKSVRQQLLNLNITGRLPVKLDPDFVRVDENSNPPLVGDYTLYTVQ 145
ASWEAEADDDVAATIKSVRQQLLNLNITGRLPVKLDPDFVRVDENSNPPLVGDYTLYTVQ
Sbjct 61 ASWEAEADDDVAATIKSVRQQLLNLNITGRLPVKLDPDFVRVDENSNPPLVGDYTLYTVQ 120
Query 146 RPVTITLLGAVSGAGQLPWQAGRSVTDYLQDHPRLAGADKNNVMVITPEGETVVAPVALW 205
RPVTITLLG+VSGAGQLPWQAGRSVTDYLQDHPRLAGADKNNVMVITPEGETVVAPVALW
Sbjct 121 RPVTITLLGSVSGAGQLPWQAGRSVTDYLQDHPRLAGADKNNVMVITPEGETVVAPVALW 180
Query 206 NKRHVEPPPGSQLWLGFSAHVLPEKYADLNDQIVSVLTQRVPD 248
NKRHVEPPPGSQLWLGFSAHVLPEKYADLNDQIVSVLTQRVPD
Sbjct 181 NKRHVEPPPGSQLWLGFSAHVLPEKYADLNDQIVSVLTQRVPD 223
>gb|EFW50291.1| YjbG polysaccharide synthesis-related protein [Shigella dysenteriae
CDC 74-1112]
Length=223
Score = 447 bits (1151), Expect = 5e-124, Method: Compositional matrix adjust.
Identities = 221/223 (99%), Positives = 223/223 (100%), Gaps = 0/223 (0%)
Query 26 VTIYLPGEQQTLSVGPVENVVQLVTQPQLRDRLWWPGALLTDSAAKAKALKDYQHVMAQL 85
+TIYLPGEQQTLSVGP+ENVVQLVTQPQLRDRLWWPGALLTDSAAKAKALKDYQHVMAQL
Sbjct 1 MTIYLPGEQQTLSVGPMENVVQLVTQPQLRDRLWWPGALLTDSAAKAKALKDYQHVMAQL 60
Query 86 ASWEAEADDDVAATIKSVRQQLLNLNITGRLPVKLDPDFVRVDENSNPPLVGDYTLYTVQ 145
ASWEAEADDDVAATIKSVRQQLLNLNITGRLPVKLDPDFVRVDENSNPPLVGDYTLYTVQ
Sbjct 61 ASWEAEADDDVAATIKSVRQQLLNLNITGRLPVKLDPDFVRVDENSNPPLVGDYTLYTVQ 120
Query 146 RPVTITLLGAVSGAGQLPWQAGRSVTDYLQDHPRLAGADKNNVMVITPEGETVVAPVALW 205
RPVTITLLGAVSGAGQLPWQAGRSVTDYLQDHPRLAGADKNNVMVITPEGETVVAPVALW
Sbjct 121 RPVTITLLGAVSGAGQLPWQAGRSVTDYLQDHPRLAGADKNNVMVITPEGETVVAPVALW 180
Query 206 NKRHVEPPPGSQLWLGFSAHVLPEKYADLNDQIVSVLTQRVPD 248
NKRHVEPPPGSQLWLGFSAHVLPEKYADLNDQIVSVLTQRVPD
Sbjct 181 NKRHVEPPPGSQLWLGFSAHVLPEKYADLNDQIVSVLTQRVPD 223
>gb|EFW53537.1| YjbG polysaccharide synthesis-related protein [Shigella boydii
ATCC 9905]
Length=223
Score = 447 bits (1151), Expect = 5e-124, Method: Compositional matrix adjust.
Identities = 221/223 (99%), Positives = 223/223 (100%), Gaps = 0/223 (0%)
Query 26 VTIYLPGEQQTLSVGPVENVVQLVTQPQLRDRLWWPGALLTDSAAKAKALKDYQHVMAQL 85
+TIYLPGEQQTLSVGPVENVV+LVTQPQLRDRLWWPGALLTDSAAKAKALKDYQHVMAQL
Sbjct 1 MTIYLPGEQQTLSVGPVENVVKLVTQPQLRDRLWWPGALLTDSAAKAKALKDYQHVMAQL 60
Query 86 ASWEAEADDDVAATIKSVRQQLLNLNITGRLPVKLDPDFVRVDENSNPPLVGDYTLYTVQ 145
ASWEAEADDDVAATIKSVRQQLLNLNITGRLPVKLDPDFVRVDENSNPPLVGDYTLYTVQ
Sbjct 61 ASWEAEADDDVAATIKSVRQQLLNLNITGRLPVKLDPDFVRVDENSNPPLVGDYTLYTVQ 120
Query 146 RPVTITLLGAVSGAGQLPWQAGRSVTDYLQDHPRLAGADKNNVMVITPEGETVVAPVALW 205
RPVTITLLGAVSGAGQLPWQAGRSVTDYLQDHPRLAGADKNNVMVITPEGETVVAPVALW
Sbjct 121 RPVTITLLGAVSGAGQLPWQAGRSVTDYLQDHPRLAGADKNNVMVITPEGETVVAPVALW 180
Query 206 NKRHVEPPPGSQLWLGFSAHVLPEKYADLNDQIVSVLTQRVPD 248
NKRHVEPPPGSQLWLGFSAHVLPEKYADLNDQIVSVLTQRVPD
Sbjct 181 NKRHVEPPPGSQLWLGFSAHVLPEKYADLNDQIVSVLTQRVPD 223
>pdb|3P42|A Chain A, Structure Of Gfcc (Ymcb), Protein Encoded By The E.
Coli Group 4 Capsule Operon
pdb|3P42|B Chain B, Structure Of Gfcc (Ymcb), Protein Encoded By The E.
Coli Group 4 Capsule Operon
pdb|3P42|C Chain C, Structure Of Gfcc (Ymcb), Protein Encoded By The E.
Coli Group 4 Capsule Operon
pdb|3P42|D Chain D, Structure Of Gfcc (Ymcb), Protein Encoded By The E.
Coli Group 4 Capsule Operon
Length=236
Score = 444 bits (1142), Expect = 5e-123, Method: Compositional matrix adjust.
Identities = 221/227 (97%), Positives = 222/227 (97%), Gaps = 0/227 (0%)
Query 22 AQGTVTIYLPGEQQTLSVGPVENVVQLVTQPQLRDRLWWPGALLTDSAAKAKALKDYQHV 81
AQG VTIYLPGEQQTLSVGPVENV QLVTQPQLRDRLWWPGALLTDSAAKAKALKDYQHV
Sbjct 2 AQGXVTIYLPGEQQTLSVGPVENVAQLVTQPQLRDRLWWPGALLTDSAAKAKALKDYQHV 61
Query 82 MAQLASWEAEADDDVAATIKSVRQQLLNLNITGRLPVKLDPDFVRVDENSNPPLVGDYTL 141
AQLASWEAEADDDVAATIKSVRQQLLNLNITGRLPVKLDPDFVRVDENSNPPLVGDYTL
Sbjct 62 XAQLASWEAEADDDVAATIKSVRQQLLNLNITGRLPVKLDPDFVRVDENSNPPLVGDYTL 121
Query 142 YTVQRPVTITLLGAVSGAGQLPWQAGRSVTDYLQDHPRLAGADKNNVMVITPEGETVVAP 201
YTVQRPVTITLLGAVSGAGQLPW AGRSVTDYLQDHPRLAGADKNNV VITPEGETVVAP
Sbjct 122 YTVQRPVTITLLGAVSGAGQLPWLAGRSVTDYLQDHPRLAGADKNNVXVITPEGETVVAP 181
Query 202 VALWNKRHVEPPPGSQLWLGFSAHVLPEKYADLNDQIVSVLTQRVPD 248
VALWNKRHVEPPPGSQLWLGFSAHVLPEKYADLNDQIVSVLTQRVP+
Sbjct 182 VALWNKRHVEPPPGSQLWLGFSAHVLPEKYADLNDQIVSVLTQRVPE 228
>gb|EFW71149.1| YjbG polysaccharide synthesis-related protein [Escherichia coli
WV_060327]
Length=223
Score = 440 bits (1132), Expect = 8e-122, Method: Compositional matrix adjust.
Identities = 218/223 (97%), Positives = 219/223 (98%), Gaps = 0/223 (0%)
Query 26 VTIYLPGEQQTLSVGPVENVVQLVTQPQLRDRLWWPGALLTDSAAKAKALKDYQHVMAQL 85
+TIYLPGEQQTLSVGPVENV QLVTQPQLRDRLWWPGALLTDSAAKAKALKDYQHVMAQL
Sbjct 1 MTIYLPGEQQTLSVGPVENVAQLVTQPQLRDRLWWPGALLTDSAAKAKALKDYQHVMAQL 60
Query 86 ASWEAEADDDVAATIKSVRQQLLNLNITGRLPVKLDPDFVRVDENSNPPLVGDYTLYTVQ 145
ASWEAEADDDVAATIKSVRQQLLNLNITGRLPVKLDPDFVRVDENSNPPLVGDYTLYTVQ
Sbjct 61 ASWEAEADDDVAATIKSVRQQLLNLNITGRLPVKLDPDFVRVDENSNPPLVGDYTLYTVQ 120
Query 146 RPVTITLLGAVSGAGQLPWQAGRSVTDYLQDHPRLAGADKNNVMVITPEGETVVAPVALW 205
RPVTITLLGAVSGAGQLPW AGRSVTDYLQDH RLAGADKNNVMVITPEGE VVAPVALW
Sbjct 121 RPVTITLLGAVSGAGQLPWLAGRSVTDYLQDHTRLAGADKNNVMVITPEGEAVVAPVALW 180
Query 206 NKRHVEPPPGSQLWLGFSAHVLPEKYADLNDQIVSVLTQRVPD 248
NKRHVEPPPGSQLWLGFSAHVLPEKYADLNDQIVSVLTQRVPD
Sbjct 181 NKRHVEPPPGSQLWLGFSAHVLPEKYADLNDQIVSVLTQRVPD 223
>gb|EGK28119.1| hypothetical protein SFK272_1492 [Shigella flexneri K-272]
gb|EGK39944.1| hypothetical protein SFK227_0667 [Shigella flexneri K-227]
Length=161
Score = 319 bits (818), Expect = 2e-85, Method: Compositional matrix adjust.
Identities = 157/158 (99%), Positives = 158/158 (100%), Gaps = 0/158 (0%)
Query 91 EADDDVAATIKSVRQQLLNLNITGRLPVKLDPDFVRVDENSNPPLVGDYTLYTVQRPVTI 150
+ADDDVAATIKSVRQQLLNLNITGRLPVKLDPDFVRVDENSNPPLVGDYTLYTVQRPVTI
Sbjct 4 QADDDVAATIKSVRQQLLNLNITGRLPVKLDPDFVRVDENSNPPLVGDYTLYTVQRPVTI 63
Query 151 TLLGAVSGAGQLPWQAGRSVTDYLQDHPRLAGADKNNVMVITPEGETVVAPVALWNKRHV 210
TLLGAVSGAGQLPWQAGRSVTDYLQDHPRLAGADKNNVMVITPEGETVVAPVALWNKRHV
Sbjct 64 TLLGAVSGAGQLPWQAGRSVTDYLQDHPRLAGADKNNVMVITPEGETVVAPVALWNKRHV 123
Query 211 EPPPGSQLWLGFSAHVLPEKYADLNDQIVSVLTQRVPD 248
EPPPGSQLWLGFSAHVLPEKYADLNDQIVSVLTQRVPD
Sbjct 124 EPPPGSQLWLGFSAHVLPEKYADLNDQIVSVLTQRVPD 161
>emb|CBX82236.1| Uncharacterized protein gfcC Group 4 capsule protein C homolog;
Flags: Precursor [Erwinia amylovora ATCC BAA-2158]
Length=251
Score = 219 bits (558), Expect = 3e-55, Method: Compositional matrix adjust.
Identities = 114/252 (45%), Positives = 166/252 (65%), Gaps = 5/252 (1%)
Query 1 MNKLQSYFIASVLYVMTPHAFAQGTVTIYLPGEQQTLSVGPVENVVQLVTQPQLRDRLWW 60
M K+ + +A +L V++ A A G V I+ PG+ Q L V + ++ QLVT P L + WW
Sbjct 1 MKKI-TILLAGILAVLSLQARADGKVNIFYPGQNQPLVVNHMADLEQLVTNPALAQKTWW 59
Query 61 PGALLTDSAAKAKALKDYQHVMAQLASW--EAEADDD--VAATIKSVRQQLLNLNITGRL 116
PG + + A A ++ Q ++A+L +W + DDD +AA +++VRQQ+ L +TGR
Sbjct 60 PGTAIGEKQATAGVIQQQQQLLARLQTWRDQLRNDDDGALAAAVENVRQQIAALKVTGRQ 119
Query 117 PVKLDPDFVRVDENSNPPLVGDYTLYTVQRPVTITLLGAVSGAGQLPWQAGRSVTDYLQD 176
V LDPD+VR+ +N L G+Y++YT+++P +ITL G + +G+ PW AGRS + YL +
Sbjct 120 FVNLDPDWVRLRPGANRRLEGEYSVYTLKKPTSITLAGVIENSGRTPWVAGRSASGYLSE 179
Query 177 HPRLAGADKNNVMVITPEGETVVAPVALWNKRHVEPPPGSQLWLGFSAHVLPEKYADLND 236
HPR++GA++N ++I+P GE PVA WN RH EP GS L++GFSA LP YADLN
Sbjct 180 HPRMSGAERNIALLISPGGEVSEVPVAYWNHRHTEPQAGSTLFVGFSAWTLPRAYADLNI 239
Query 237 QIVSVLTQRVPD 248
QIVSVLT R+PD
Sbjct 240 QIVSVLTHRIPD 251
>ref|YP_001909022.1| Conserved hypothetical protein YmcB [Erwinia tasmaniensis Et1/99]
emb|CAO98154.1| Conserved hypothetical protein YmcB [Erwinia tasmaniensis Et1/99]
Length=251
Score = 218 bits (555), Expect = 7e-55, Method: Compositional matrix adjust.
Identities = 113/245 (46%), Positives = 158/245 (64%), Gaps = 4/245 (1%)
Query 8 FIASVLYVMTPHAFAQGTVTIYLPGEQQTLSVGPVENVVQLVTQPQLRDRLWWPGALLTD 67
F+ + ++ A A G V I+ PG+ Q L V PV ++ QLVT P L + WWPG + +
Sbjct 7 FLTVIAVALSQLALADGRVNIFYPGQSQPLVVNPVADLEQLVTDPALAQKTWWPGTAIGE 66
Query 68 SAAKAKALKDYQHVMAQLASWE----AEADDDVAATIKSVRQQLLNLNITGRLPVKLDPD 123
A AL+ Q ++A+L +W E DD AAT+ +VR+Q+ L +TGR V LDPD
Sbjct 67 KLATVGALQQQQQLLARLQAWRDRLHNEGDDSQAATVDNVRRQIAVLKVTGRQFVNLDPD 126
Query 124 FVRVDENSNPPLVGDYTLYTVQRPVTITLLGAVSGAGQLPWQAGRSVTDYLQDHPRLAGA 183
+VR+ +N L G+Y++YT+ P +ITL GA+ G++PW AGRS +YL HPR++GA
Sbjct 127 WVRLRPQANRRLQGEYSVYTLNEPTSITLAGAIESTGKVPWAAGRSAVEYLAAHPRMSGA 186
Query 184 DKNNVMVITPEGETVVAPVALWNKRHVEPPPGSQLWLGFSAHVLPEKYADLNDQIVSVLT 243
+++ ++I+P GE PVA WN+RHVEP GS L++GFS LP YADLN QIVSVLT
Sbjct 187 ERSTALLISPGGEVTEIPVAYWNRRHVEPQAGSTLFIGFSTWTLPRAYADLNLQIVSVLT 246
Query 244 QRVPD 248
R+PD
Sbjct 247 HRIPD 251
>ref|YP_002650310.1| capsular polysaccharide protein [Erwinia pyrifoliae Ep1/96]
emb|CAX57108.1| Putative capsular polysaccharide protein [Erwinia pyrifoliae
Ep1/96]
emb|CAY75967.1| Uncharacterized protein ymcB precursor [Erwinia pyrifoliae DSM
12163]
Length=251
Score = 216 bits (549), Expect = 4e-54, Method: Compositional matrix adjust.
Identities = 110/245 (44%), Positives = 155/245 (63%), Gaps = 4/245 (1%)
Query 8 FIASVLYVMTPHAFAQGTVTIYLPGEQQTLSVGPVENVVQLVTQPQLRDRLWWPGALLTD 67
+A ++ +T A A V I+ PG+ Q L V + ++ QLVT P L ++ WWPG + +
Sbjct 7 LLAGIVAALTLQARADSQVNIFYPGQNQPLVVNHMADLQQLVTNPALAEKTWWPGTTIAE 66
Query 68 SAAKAKALKDYQHVMAQLASWE----AEADDDVAATIKSVRQQLLNLNITGRLPVKLDPD 123
A A A++ Q ++A+L +W E DD A + +VRQQ+ L +TGR V LDPD
Sbjct 67 KRATAVAIQQQQQLLARLQTWRDRLRNEGDDTQAVAVDNVRQQIAVLKVTGRQIVNLDPD 126
Query 124 FVRVDENSNPPLVGDYTLYTVQRPVTITLLGAVSGAGQLPWQAGRSVTDYLQDHPRLAGA 183
+VR+ N L G+Y++YT+ P +ITL G + G+ PW AGRS +YL HPR++GA
Sbjct 127 WVRLRPQDNRWLQGEYSVYTLNEPTSITLAGVIEKTGKTPWAAGRSAVEYLDAHPRMSGA 186
Query 184 DKNNVMVITPEGETVVAPVALWNKRHVEPPPGSQLWLGFSAHVLPEKYADLNDQIVSVLT 243
+++ ++I+P GE PVA WN+RHVEP GS L++GFSA LP YADLN QIVSVLT
Sbjct 187 ERSTALLISPGGEVTEIPVAYWNRRHVEPQAGSTLFIGFSAWTLPRAYADLNSQIVSVLT 246
Query 244 QRVPD 248
R+PD
Sbjct 247 HRIPD 251
>ref|YP_003537358.1| exported protein [Erwinia amylovora ATCC 49946]
emb|CBJ44932.1| putative exported protein [Erwinia amylovora ATCC 49946]
Length=251
Score = 215 bits (548), Expect = 4e-54, Method: Compositional matrix adjust.
Identities = 113/252 (44%), Positives = 165/252 (65%), Gaps = 5/252 (1%)
Query 1 MNKLQSYFIASVLYVMTPHAFAQGTVTIYLPGEQQTLSVGPVENVVQLVTQPQLRDRLWW 60
M K+ + +A +L V++ A A G V I+ PG+ Q L V + ++ QLVT P L + WW
Sbjct 1 MKKI-TILLAGILAVLSLQARADGKVNIFYPGQNQPLVVNHMADLEQLVTNPALAQKTWW 59
Query 61 PGALLTDSAAKAKALKDYQHVMAQLASW--EAEADDD--VAATIKSVRQQLLNLNITGRL 116
PG + + A A ++ Q ++A+L +W + DDD +AA +++VRQQ+ L +TGR
Sbjct 60 PGTAIGEKQATAGVIQQQQQLLARLQTWRDQLRNDDDGALAAAVENVRQQIAALKVTGRQ 119
Query 117 PVKLDPDFVRVDENSNPPLVGDYTLYTVQRPVTITLLGAVSGAGQLPWQAGRSVTDYLQD 176
V LDPD+VR+ +N L G+Y++YT+++P +ITL G + +G+ PW AGRS + YL +
Sbjct 120 FVNLDPDWVRLRPGANRRLEGEYSVYTLKKPTSITLAGVIENSGRTPWVAGRSASGYLSE 179
Query 177 HPRLAGADKNNVMVITPEGETVVAPVALWNKRHVEPPPGSQLWLGFSAHVLPEKYADLND 236
HPR++GA++N ++I+P GE PVA WN RH EP GS L++GFSA LP YADLN
Sbjct 180 HPRMSGAERNIALLISPGGEVSEVPVAYWNHRHTEPQAGSTLFVGFSAWTLPRAYADLNI 239
Query 237 QIVSVLTQRVPD 248
QIVSVLT +PD
Sbjct 240 QIVSVLTHWIPD 251
>gb|ADP10499.1| Putative capsular polysaccharide protein [Erwinia sp. Ejp617]
Length=252
Score = 211 bits (537), Expect = 8e-53, Method: Compositional matrix adjust.
Identities = 109/245 (44%), Positives = 152/245 (62%), Gaps = 4/245 (1%)
Query 8 FIASVLYVMTPHAFAQGTVTIYLPGEQQTLSVGPVENVVQLVTQPQLRDRLWWPGALLTD 67
+A ++ A A V I+ PG+ Q L V + ++ QLVT P L + WWPG + +
Sbjct 8 LLAGIVAAFALQARADSQVNIFYPGQNQPLVVNHMADLQQLVTNPALAQKTWWPGTTIGE 67
Query 68 SAAKAKALKDYQHVMAQLASWE----AEADDDVAATIKSVRQQLLNLNITGRLPVKLDPD 123
A A A++ Q ++A+L +W E DD AA + +VRQQ+ L +TGR V LDPD
Sbjct 68 KRATAVAIQQQQQLLARLQTWRDRLRNEGDDTQAAAVDNVRQQIAVLKVTGRQIVNLDPD 127
Query 124 FVRVDENSNPPLVGDYTLYTVQRPVTITLLGAVSGAGQLPWQAGRSVTDYLQDHPRLAGA 183
+VR+ N L G+Y++YT+ P +ITL G + G+ PW AGRS +YL HPR++GA
Sbjct 128 WVRLRPQDNRRLQGEYSVYTLNEPTSITLAGVIEKTGKTPWAAGRSAVEYLDAHPRMSGA 187
Query 184 DKNNVMVITPEGETVVAPVALWNKRHVEPPPGSQLWLGFSAHVLPEKYADLNDQIVSVLT 243
+++ ++I+P GE PVA WN+R VEP GS L++GFSA LP YADLN QIVSVLT
Sbjct 188 ERSTALLISPGGEVTEIPVAYWNRRQVEPQAGSTLFIGFSAWTLPRAYADLNSQIVSVLT 247
Query 244 QRVPD 248
R+PD
Sbjct 248 HRIPD 252
>ref|YP_003532687.1| hypothetical protein EAMY_3334 [Erwinia amylovora CFBP1430]
emb|CBA23497.1| Uncharacterized protein ymcB precursor [Erwinia amylovora CFBP1430]
Length=238
Score = 209 bits (532), Expect = 4e-52, Method: Compositional matrix adjust.
Identities = 108/238 (45%), Positives = 157/238 (65%), Gaps = 4/238 (1%)
Query 15 VMTPHAFAQGTVTIYLPGEQQTLSVGPVENVVQLVTQPQLRDRLWWPGALLTDSAAKAKA 74
+++ A A G V I+ PG+ Q L V + ++ QLVT P L + WWPG + + A A
Sbjct 1 MLSLQARADGKVNIFYPGQNQPLVVNHMADLEQLVTNPALAQKTWWPGTAIGEKQATAGV 60
Query 75 LKDYQHVMAQLASW--EAEADDD--VAATIKSVRQQLLNLNITGRLPVKLDPDFVRVDEN 130
++ Q ++A+L +W + DDD +AA +++VRQQ+ L +TGR V LDPD+VR+
Sbjct 61 IQQQQQLLARLQTWRDQLRNDDDGALAAAVENVRQQIAALKVTGRQFVNLDPDWVRLRPG 120
Query 131 SNPPLVGDYTLYTVQRPVTITLLGAVSGAGQLPWQAGRSVTDYLQDHPRLAGADKNNVMV 190
+N L G+Y++YT+++P +ITL G + +G+ PW AGRS + YL +HPR++GA++N ++
Sbjct 121 ANRRLEGEYSVYTLKKPTSITLAGVIENSGRTPWVAGRSASGYLSEHPRMSGAERNIALL 180
Query 191 ITPEGETVVAPVALWNKRHVEPPPGSQLWLGFSAHVLPEKYADLNDQIVSVLTQRVPD 248
I+P GE PVA WN RH EP GS L++GFSA LP YADLN QIVSVLT +PD
Sbjct 181 ISPGGEVSEVPVAYWNHRHTEPQAGSTLFVGFSAWTLPRAYADLNIQIVSVLTHWIPD 238
>ref|YP_003739692.1| conserved uncharacterized protein YmcB [Erwinia billingiae Eb661]
emb|CAX57832.1| conserved uncharacterized protein YmcB [Erwinia billingiae Eb661]
Length=251
Score = 205 bits (521), Expect = 6e-51, Method: Compositional matrix adjust.
Identities = 111/252 (44%), Positives = 161/252 (63%), Gaps = 5/252 (1%)
Query 1 MNKLQSYFIASVLYVMTPHAFAQGTVTIYLPGEQQTLSVGPVENVVQLVTQPQLRDRLWW 60
M K+ + +A + ++ + A+ VT+Y PG+ QT V +N+ QLV+ P L D+ WW
Sbjct 1 MKKI-TLLLAGISACVSLNVSAESQVTVYSPGQTQTSIVSHAQNLAQLVSSPALMDKTWW 59
Query 61 PGALLTDSAAKAKALKDYQHVMAQLASW----EAEADDDVAATIKSVRQQLLNLNITGRL 116
PG ++ + A A A++ Q V+A+L +W A+ D + AA + +V QQ+ + +TGR
Sbjct 60 PGTVIAEKLATAAAIQQQQQVLARLKAWSNQLHADGDSEQAAVVDNVWQQVSAVKVTGRQ 119
Query 117 PVKLDPDFVRVDENSNPPLVGDYTLYTVQRPVTITLLGAVSGAGQLPWQAGRSVTDYLQD 176
LDPD+VR+ N L GDY++YT+ +P +ITL G ++ +G+ PW GRSV DYLQD
Sbjct 120 LANLDPDWVRMRPAQNRRLEGDYSVYTLLKPTSITLAGVLANSGKTPWAPGRSVVDYLQD 179
Query 177 HPRLAGADKNNVMVITPEGETVVAPVALWNKRHVEPPPGSQLWLGFSAHVLPEKYADLND 236
H RL+G ++N V++I P GE PVA WN+RHVEP GS ++ GFS+ LP DLN
Sbjct 180 HDRLSGGERNFVVLIAPNGEVSDVPVAYWNRRHVEPEVGSIVYRGFSSWTLPGDDEDLNQ 239
Query 237 QIVSVLTQRVPD 248
QIVSVLT R+PD
Sbjct 240 QIVSVLTHRIPD 251
>ref|YP_001174974.1| hypothetical protein Ent638_0233 [Enterobacter sp. 638]
gb|ABP58923.1| protein of unknown function DUF1017 [Enterobacter sp. 638]
Length=245
Score = 203 bits (517), Expect = 2e-50, Method: Compositional matrix adjust.
Identities = 104/240 (43%), Positives = 151/240 (62%), Gaps = 1/240 (0%)
Query 9 IASVLYVMTPHAFAQGTVTIYLPGEQQTLSVGPVENVVQLVTQPQLRDRLWWPGALLTDS 68
+A ++ + P A++ GTV +Y P ++ ++ E+++ LV QP+L + WWPGA++++
Sbjct 7 VALIVTLAAPLAWSAGTVKVYTPDNKEPKTLSNAEHLIDLVGQPRLANS-WWPGAIISER 65
Query 69 AAKAKALKDYQHVMAQLASWEAEADDDVAATIKSVRQQLLNLNITGRLPVKLDPDFVRVD 128
A A A + +Q ++A+L + D D AA I +VRQQL + +TGR V LDPD VRV
Sbjct 66 QATAIAEQKHQALLARLTGLAEQEDGDTAAAINAVRQQLQAITVTGRQRVNLDPDEVRVT 125
Query 129 ENSNPPLVGDYTLYTVQRPVTITLLGAVSGAGQLPWQAGRSVTDYLQDHPRLAGADKNNV 188
EN NP L GDYTL+ V +P T+T+ G VS GQ P+ GR V YL + L+GA+ +
Sbjct 126 ENGNPTLEGDYTLWIVAKPSTVTVAGLVSSPGQKPFTPGRDVASYLDEQHLLSGAENSYA 185
Query 189 MVITPEGETVVAPVALWNKRHVEPPPGSQLWLGFSAHVLPEKYADLNDQIVSVLTQRVPD 248
VI P+G PVA WNKRH+EP PGS +++GF+ H + Y LN I+ LTQR+PD
Sbjct 186 WVIYPDGRRQNVPVAYWNKRHIEPMPGSVIFVGFADHFWTKAYDGLNTDILRSLTQRIPD 245
>ref|YP_003518538.1| YmcB [Pantoea ananatis LMG 20103]
gb|ADD75410.1| YmcB [Pantoea ananatis LMG 20103]
Length=247
Score = 200 bits (509), Expect = 1e-49, Method: Compositional matrix adjust.
Identities = 110/243 (45%), Positives = 153/243 (62%), Gaps = 0/243 (0%)
Query 6 SYFIASVLYVMTPHAFAQGTVTIYLPGEQQTLSVGPVENVVQLVTQPQLRDRLWWPGALL 65
+ +A + + T A A G VT++ P + Q++ V VEN+ QLVTQP L + W A++
Sbjct 5 TRLLAGMSLLTTLAAQAAGQVTVHAPHDTQSVQVNQVENLAQLVTQPALMTKTDWRRAVI 64
Query 66 TDSAAKAKALKDYQHVMAQLASWEAEADDDVAATIKSVRQQLLNLNITGRLPVKLDPDFV 125
+ A A A + YQ +AQL +W A++ + AA I +V QL + +TGR LDPD++
Sbjct 65 AERGATAVAQQQYQQTLAQLRAWRADSSGEQAAAIDAVIHQLSGIRVTGRQFTSLDPDWI 124
Query 126 RVDENSNPPLVGDYTLYTVQRPVTITLLGAVSGAGQLPWQAGRSVTDYLQDHPRLAGADK 185
R+ N L G Y LY Q ++ L G ++GAG + WQ G+SV DYL +HPRLAGA++
Sbjct 125 RLHTMDNRRLEGSYDLYLTQPSTSVLLFGPIAGAGAVNWQPGKSVRDYLSEHPRLAGAER 184
Query 186 NNVMVITPEGETVVAPVALWNKRHVEPPPGSQLWLGFSAHVLPEKYADLNDQIVSVLTQR 245
N +VI P+G T APVA WN RHVEP PGS + GFS+ LP + DLND++VSVLT R
Sbjct 185 NIAIVIAPDGTTREAPVAYWNHRHVEPEPGSIIMTGFSSWSLPGAFKDLNDRLVSVLTHR 244
Query 246 VPD 248
+PD
Sbjct 245 IPD 247
>dbj|BAK13483.1| hypothetical protein YmcB precursor YmcB [Pantoea ananatis AJ13355]
Length=247
Score = 200 bits (508), Expect = 2e-49, Method: Compositional matrix adjust.
Identities = 110/243 (45%), Positives = 154/243 (63%), Gaps = 0/243 (0%)
Query 6 SYFIASVLYVMTPHAFAQGTVTIYLPGEQQTLSVGPVENVVQLVTQPQLRDRLWWPGALL 65
+ +A + + T A A G VT++ P + Q++ V VEN+ QLVTQP L + W A++
Sbjct 5 TRLLAGMSLLTTLAAQAAGQVTVHAPHDTQSVQVNQVENLAQLVTQPALMTQTDWRRAVI 64
Query 66 TDSAAKAKALKDYQHVMAQLASWEAEADDDVAATIKSVRQQLLNLNITGRLPVKLDPDFV 125
+ A A A + YQ +AQL +W A++ + AA I +V +QL + +TGR LDPD++
Sbjct 65 AERGATAVAQQQYQQTLAQLRAWRADSSGEQAAAIDAVIRQLSGIRVTGRQFTSLDPDWI 124
Query 126 RVDENSNPPLVGDYTLYTVQRPVTITLLGAVSGAGQLPWQAGRSVTDYLQDHPRLAGADK 185
R+ N L G Y LY Q ++ L G ++GAG + WQ G+SV DYL +HPRLAGA++
Sbjct 125 RLHTMDNRRLEGSYDLYLTQPSTSVLLFGPIAGAGAVNWQPGKSVRDYLSEHPRLAGAER 184
Query 186 NNVMVITPEGETVVAPVALWNKRHVEPPPGSQLWLGFSAHVLPEKYADLNDQIVSVLTQR 245
N +VI P+G T APVA WN RHVEP PGS + GFS+ LP + DLND++VSVLT R
Sbjct 185 NIAIVIAPDGTTREAPVAYWNHRHVEPEPGSIIMTGFSSWSLPGAFKDLNDRLVSVLTHR 244
Query 246 VPD 248
+PD
Sbjct 245 IPD 247
>ref|YP_001436216.1| hypothetical protein ESA_00075 [Cronobacter sakazakii ATCC BAA-894]
gb|ABU75380.1| hypothetical protein ESA_00075 [Cronobacter sakazakii ATCC BAA-894]
Length=244
Score = 200 bits (508), Expect = 2e-49, Method: Compositional matrix adjust.
Identities = 110/246 (44%), Positives = 153/246 (62%), Gaps = 3/246 (1%)
Query 4 LQSYFIASVLYVMTPH-AFAQGTVTIYLPGEQQTLSVGPVENVVQLVTQPQLRDRLWWPG 62
+++ IAS+++ + AFA GT+T+Y Q L V +++ LV+QPQL WW G
Sbjct 1 MKTTLIASLIFSLGSFCAFADGTITVYR-DHAQPLKVSGAKHLADLVSQPQLAGS-WWLG 58
Query 63 ALLTDSAAKAKALKDYQHVMAQLASWEAEADDDVAATIKSVRQQLLNLNITGRLPVKLDP 122
A++++ A +A +Q ++ +LAS AE D A I VRQQL + +TGR V LDP
Sbjct 59 AVISERQASVEAQAQHQVLLNRLASLAAEEGGDDGAAINRVRQQLQAIKVTGRQRVILDP 118
Query 123 DFVRVDENSNPPLVGDYTLYTVQRPVTITLLGAVSGAGQLPWQAGRSVTDYLQDHPRLAG 182
D VRV ++NPPL GDY L+ +P TITL+G VS G+ + G+ VTDYL D RL+G
Sbjct 119 DRVRVRPHNNPPLEGDYELWVGPQPSTITLVGLVSAPGKKTFTPGKPVTDYLDDVSRLSG 178
Query 183 ADKNNVMVITPEGETVVAPVALWNKRHVEPPPGSQLWLGFSAHVLPEKYADLNDQIVSVL 242
A+++ +I P G PVA WNKRHVEP PGS L++GF+ H Y LN+QI+S L
Sbjct 179 AERSYAWLIQPTGRVEKVPVAYWNKRHVEPMPGSILYVGFADHTFTSAYDGLNEQIISSL 238
Query 243 TQRVPD 248
T R+PD
Sbjct 239 THRIPD 244
>gb|EGL72010.1| hypothetical protein CSE899_14477 [Cronobacter sakazakii E899]
Length=244
Score = 197 bits (502), Expect = 1e-48, Method: Compositional matrix adjust.
Identities = 109/246 (44%), Positives = 152/246 (61%), Gaps = 3/246 (1%)
Query 4 LQSYFIASVLYVMTPH-AFAQGTVTIYLPGEQQTLSVGPVENVVQLVTQPQLRDRLWWPG 62
+++ IAS+++ + AFA GT+T+Y Q L V +++ LV+QPQL WW G
Sbjct 1 MKTTLIASLIFSLGSFCAFADGTITVYR-DHAQPLKVSGAKHLADLVSQPQLAGS-WWLG 58
Query 63 ALLTDSAAKAKALKDYQHVMAQLASWEAEADDDVAATIKSVRQQLLNLNITGRLPVKLDP 122
A++++ A +A +Q ++ +LAS AE D A I V QQL + +TGR V LDP
Sbjct 59 AVISERQASVEAQAQHQVLLNRLASLAAEEGGDDGAAINRVHQQLQAIKVTGRQRVILDP 118
Query 123 DFVRVDENSNPPLVGDYTLYTVQRPVTITLLGAVSGAGQLPWQAGRSVTDYLQDHPRLAG 182
D VRV ++NPPL GDY L+ +P TITL+G VS G+ + G+ VTDYL D RL+G
Sbjct 119 DRVRVRPHNNPPLEGDYELWVGPQPSTITLVGLVSAPGKKTFTPGKPVTDYLDDVSRLSG 178
Query 183 ADKNNVMVITPEGETVVAPVALWNKRHVEPPPGSQLWLGFSAHVLPEKYADLNDQIVSVL 242
A+++ +I P G PVA WNKRHVEP PGS L++GF+ H Y LN+QI+S L
Sbjct 179 AERSYAWLIQPTGRVEKVPVAYWNKRHVEPMPGSILYVGFADHTFTSAYDGLNEQIISSL 238
Query 243 TQRVPD 248
T R+PD
Sbjct 239 THRIPD 244
>ref|YP_003610796.1| hypothetical protein ECL_00280 [Enterobacter cloacae subsp. cloacae
ATCC 13047]
gb|ADF59847.1| hypothetical protein ECL_00280 [Enterobacter cloacae subsp. cloacae
ATCC 13047]
Length=245
Score = 197 bits (502), Expect = 1e-48, Method: Compositional matrix adjust.
Identities = 104/240 (43%), Positives = 147/240 (61%), Gaps = 1/240 (0%)
Query 9 IASVLYVMTPHAFAQGTVTIYLPGEQQTLSVGPVENVVQLVTQPQLRDRLWWPGALLTDS 68
+A + TP A++ GTV +Y P Q ++ N++ LV QP+L + WW GA++ +
Sbjct 7 VALIASFATPLAWSAGTVKVYTPDSTQPKTLTNAGNLIDLVGQPRLANS-WWTGAVIAER 65
Query 69 AAKAKALKDYQHVMAQLASWEAEADDDVAATIKSVRQQLLNLNITGRLPVKLDPDFVRVD 128
A A + ++ ++A+L + D D AA I S+RQQLL L +TGR + LDPD VRV
Sbjct 66 QATVAAEQKHKALLARLTGLAEQEDGDDAAAINSLRQQLLALKVTGRQNINLDPDEVRVT 125
Query 129 ENSNPPLVGDYTLYTVQRPVTITLLGAVSGAGQLPWQAGRSVTDYLQDHPRLAGADKNNV 188
E NP L GDYTL+ +P TIT++G +S G+ P+ GR V YL + L+GAD +
Sbjct 126 EKGNPALEGDYTLWLPAQPSTITVMGLISSPGKKPFTPGRDVASYLDEQSLLSGADNSYA 185
Query 189 MVITPEGETVVAPVALWNKRHVEPPPGSQLWLGFSAHVLPEKYADLNDQIVSVLTQRVPD 248
VI P+G T APVA WNKRHVEP PGS +++GF+ H + Y LN I+ LTQR+P+
Sbjct 186 WVIYPDGNTQKAPVAYWNKRHVEPMPGSIIFVGFADHFWTKAYDGLNADILHSLTQRIPE 245
>ref|NP_709891.1| hypothetical protein SF4177 [Shigella flexneri 2a str. 301]
ref|NP_838790.1| hypothetical protein S3554 [Shigella flexneri 2a str. 2457T]
gb|AAN45598.1| orf, conserved hypothetical protein [Shigella flexneri 2a str.
301]
gb|AAP18601.1| hypothetical protein S3554 [Shigella flexneri 2a str. 2457T]
gb|ADA76472.1| hypothetical protein SFxv_4555 [Shigella flexneri 2002017]
gb|EGJ82360.1| hypothetical protein SF434370_3508 [Shigella flexneri 4343-70]
gb|EGJ82530.1| hypothetical protein SFK671_3975 [Shigella flexneri K-671]
gb|EGJ84232.1| hypothetical protein SF274771_3981 [Shigella flexneri 2747-71]
gb|EGJ94960.1| conserved protein [Shigella flexneri 2930-71]
gb|EGK18610.1| hypothetical protein SFK272_4141 [Shigella flexneri K-272]
gb|EGK18740.1| hypothetical protein SFK218_4325 [Shigella flexneri K-218]
gb|EGK33567.1| hypothetical protein SFK304_4211 [Shigella flexneri K-304]
gb|EGM58948.1| conserved protein [Shigella flexneri J1713]
Length=245
Score = 196 bits (497), Expect = 3e-48, Method: Compositional matrix adjust.
Identities = 103/244 (42%), Positives = 148/244 (60%), Gaps = 1/244 (0%)
Query 4 LQSYFIASVLYVMTPHAFAQGTVTIYLPGEQQTLSVGPVENVVQLVTQPQLRDRLWWPGA 63
++ +A +L V FA GTV ++ G + ++ E+++ LV QP+L + WWPGA
Sbjct 2 IKQTIVALLLSVGASSVFAAGTVKVFSNGSSEAKTLTGAEHLIDLVGQPRLANS-WWPGA 60
Query 64 LLTDSAAKAKALKDYQHVMAQLASWEAEADDDVAATIKSVRQQLLNLNITGRLPVKLDPD 123
++++ A A AL+ Q ++ +LA A++ D AA I ++RQQ+ L +TGR + LDPD
Sbjct 61 VISEELATAAALRQQQALLTRLAEQGADSSTDDAAAINALRQQIQALKVTGRQKINLDPD 120
Query 124 FVRVDENSNPPLVGDYTLYTVQRPVTITLLGAVSGAGQLPWQAGRSVTDYLQDHPRLAGA 183
VRV E NPPL G+YTL+ P T+TL G +S G P+ GR V YL D L+GA
Sbjct 121 IVRVAERGNPPLQGNYTLWVGPPPSTVTLFGLISHPGNQPFTPGRDVASYLSDQSLLSGA 180
Query 184 DKNNVMVITPEGETVVAPVALWNKRHVEPPPGSQLWLGFSAHVLPEKYADLNDQIVSVLT 243
D++ V+ P+G T APVA WNKRHVEP PGS +++G + V E LN I+ LT
Sbjct 181 DRSYAWVVYPDGRTQKAPVAYWNKRHVEPMPGSIIYVGLADSVWSETPDALNADILQTLT 240
Query 244 QRVP 247
QR+P
Sbjct 241 QRIP 244
>ref|YP_312940.1| hypothetical protein SSON_4206 [Shigella sonnei Ss046]
gb|AAZ90705.1| conserved hypothetical protein [Shigella sonnei Ss046]
Length=245
Score = 196 bits (497), Expect = 4e-48, Method: Compositional matrix adjust.
Identities = 103/244 (42%), Positives = 148/244 (60%), Gaps = 1/244 (0%)
Query 4 LQSYFIASVLYVMTPHAFAQGTVTIYLPGEQQTLSVGPVENVVQLVTQPQLRDRLWWPGA 63
++ +A +L V FA GTV ++ G + ++ E+++ LV QP+L + WWPGA
Sbjct 2 IKQTIVALLLSVGASSVFAAGTVKVFSNGSSEAKTLTGAEHLIDLVGQPRLANS-WWPGA 60
Query 64 LLTDSAAKAKALKDYQHVMAQLASWEAEADDDVAATIKSVRQQLLNLNITGRLPVKLDPD 123
++++ A A AL+ Q ++ +LA A++ D AA I ++RQQ+ L +TGR + LDPD
Sbjct 61 VISEELATAAALRQQQALLTRLAEQGADSSTDDAAAINALRQQIQALEVTGRQKINLDPD 120
Query 124 FVRVDENSNPPLVGDYTLYTVQRPVTITLLGAVSGAGQLPWQAGRSVTDYLQDHPRLAGA 183
VRV E NPPL G+YTL+ P T+TL G +S G P+ GR V YL D L+GA
Sbjct 121 IVRVAERGNPPLQGNYTLWVGPPPSTVTLFGLISHPGNQPFTPGRDVASYLSDQSLLSGA 180
Query 184 DKNNVMVITPEGETVVAPVALWNKRHVEPPPGSQLWLGFSAHVLPEKYADLNDQIVSVLT 243
D++ V+ P+G T APVA WNKRHVEP PGS +++G + V E LN I+ LT
Sbjct 181 DRSYAWVVYPDGRTQKAPVAYWNKRHVEPMPGSIIYVGLADSVWSETPDALNADILQTLT 240
Query 244 QRVP 247
QR+P
Sbjct 241 QRIP 244
>ref|YP_003232032.1| hypothetical protein ECO26_5143 [Escherichia coli O26:H11 str.
11368]
ref|YP_003237146.1| hypothetical protein ECO111_4850 [Escherichia coli O111:H- str.
11128]
dbj|BAI28292.1| conserved predicted protein [Escherichia coli O26:H11 str. 11368]
dbj|BAI38595.1| conserved predicted protein [Escherichia coli O111:H- str. 11128]
gb|EFZ41755.1| hypothetical protein ECEPECA14_2520 [Escherichia coli EPECa14]
gb|EFZ63178.1| hypothetical protein ECOK1180_3669 [Escherichia coli 1180]
Length=245
Score = 195 bits (496), Expect = 4e-48, Method: Compositional matrix adjust.
Identities = 103/244 (42%), Positives = 148/244 (60%), Gaps = 1/244 (0%)
Query 4 LQSYFIASVLYVMTPHAFAQGTVTIYLPGEQQTLSVGPVENVVQLVTQPQLRDRLWWPGA 63
++ F+A +L V FA GTV ++ G + ++ E+++ LV QP+L + WWPGA
Sbjct 2 IKQTFVALLLSVGASSVFAAGTVKVFSNGSSEAKTLTGAEHLIDLVGQPRLANS-WWPGA 60
Query 64 LLTDSAAKAKALKDYQHVMAQLASWEAEADDDVAATIKSVRQQLLNLNITGRLPVKLDPD 123
++++ A A AL+ Q ++ +LA A++ D AA I ++RQQ+ L +TGR + LDPD
Sbjct 61 VISEELATAAALRQQQALLTRLAEQGADSSTDDAAAINALRQQIQALKVTGRQKINLDPD 120
Query 124 FVRVDENSNPPLVGDYTLYTVQRPVTITLLGAVSGAGQLPWQAGRSVTDYLQDHPRLAGA 183
VRV E NPPL G+YTL+ P T+TL G +S G P+ GR V YL L+GA
Sbjct 121 IVRVAERGNPPLQGNYTLWVGPPPSTVTLFGLISHPGNQPFTPGRDVASYLSGQNLLSGA 180
Query 184 DKNNVMVITPEGETVVAPVALWNKRHVEPPPGSQLWLGFSAHVLPEKYADLNDQIVSVLT 243
D++ V+ P+G T APVA WNKRHVEP PGS +++G + V E LN I+ LT
Sbjct 181 DRSYAWVVYPDGRTQKAPVAYWNKRHVEPMPGSIIYVGLADSVWSETPDALNADILQTLT 240
Query 244 QRVP 247
QR+P
Sbjct 241 QRIP 244
>ref|YP_003212148.1| hypothetical protein CTU_37850 [Cronobacter turicensis z3032]
emb|CBA34102.1| Uncharacterized protein yjbG [Cronobacter turicensis z3032]
Length=235
Score = 195 bits (495), Expect = 6e-48, Method: Compositional matrix adjust.
Identities = 107/229 (46%), Positives = 143/229 (62%), Gaps = 2/229 (0%)
Query 20 AFAQGTVTIYLPGEQQTLSVGPVENVVQLVTQPQLRDRLWWPGALLTDSAAKAKALKDYQ 79
AFA GT+T+Y Q L V +++ LV+QPQL WW GA++++ A +A +Q
Sbjct 9 AFADGTITVYR-DHAQPLKVSGAKHLADLVSQPQLTGS-WWLGAVISERQASVEAQAQHQ 66
Query 80 HVMAQLASWEAEADDDVAATIKSVRQQLLNLNITGRLPVKLDPDFVRVDENSNPPLVGDY 139
++ +LAS AE D A I VRQQL + +TGR V LDPD VRV ++NPPL GDY
Sbjct 67 VLLNRLASLAAEEGGDDGAAINRVRQQLQAIKVTGRQRVILDPDRVRVRPHNNPPLEGDY 126
Query 140 TLYTVQRPVTITLLGAVSGAGQLPWQAGRSVTDYLQDHPRLAGADKNNVMVITPEGETVV 199
L+ +P TITL+G VS G+ + G+ VTDYL + RL+GA+++ +I P G
Sbjct 127 ELWVGPQPSTITLVGLVSAPGKKTFTPGKPVTDYLDEVSRLSGAERSYAWLIQPTGRVEK 186
Query 200 APVALWNKRHVEPPPGSQLWLGFSAHVLPEKYADLNDQIVSVLTQRVPD 248
PVA WNKRHVEP PGS L++GF+ H Y LN+QI+S LT RVPD
Sbjct 187 VPVAYWNKRHVEPMPGSILYVGFADHTFTSAYDGLNEQIISSLTHRVPD 235
>gb|EGC97468.1| hypothetical protein ECD227_3706 [Escherichia fergusonii ECD227]
Length=245
Score = 195 bits (495), Expect = 6e-48, Method: Compositional matrix adjust.
Identities = 102/244 (41%), Positives = 147/244 (60%), Gaps = 1/244 (0%)
Query 4 LQSYFIASVLYVMTPHAFAQGTVTIYLPGEQQTLSVGPVENVVQLVTQPQLRDRLWWPGA 63
++ A ++ V FA GTV ++ G + ++ E+++ LV QP+L + WWPGA
Sbjct 2 IKQTIAALIMSVGASSVFAAGTVKVFSNGSNEAKTLTGAEHLIDLVGQPRLANS-WWPGA 60
Query 64 LLTDSAAKAKALKDYQHVMAQLASWEAEADDDVAATIKSVRQQLLNLNITGRLPVKLDPD 123
++++ A A AL+ Q ++ +LA +E+ D AA I ++RQQ+ L +TGR + LDPD
Sbjct 61 VISEELATAAALRQQQALLTRLAELASESSTDDAAAINALRQQIQALKVTGRQKINLDPD 120
Query 124 FVRVDENSNPPLVGDYTLYTVQRPVTITLLGAVSGAGQLPWQAGRSVTDYLQDHPRLAGA 183
VRV E SNPPL G+YTL+ P T+TL G +S G P+ GR V YL L+GA
Sbjct 121 IVRVAERSNPPLQGNYTLWVGPPPSTVTLFGLISRPGNQPFTPGRDVASYLSGQSLLSGA 180
Query 184 DKNNVMVITPEGETVVAPVALWNKRHVEPPPGSQLWLGFSAHVLPEKYADLNDQIVSVLT 243
D++ V+ P+G T APVA WNKRHVEP PGS +++G + V E LN I+ LT
Sbjct 181 DRSYAWVVYPDGRTQKAPVAYWNKRHVEPMPGSIIYVGLADSVWSETPDALNADILQTLT 240
Query 244 QRVP 247
QR+P
Sbjct 241 QRIP 244
>ref|ZP_05969412.1| conserved hypothetical protein [Enterobacter cancerogenus ATCC
35316]
gb|EFC55292.1| conserved hypothetical protein [Enterobacter cancerogenus ATCC
35316]
Length=246
Score = 192 bits (489), Expect = 3e-47, Method: Compositional matrix adjust.
Identities = 102/247 (41%), Positives = 149/247 (60%), Gaps = 3/247 (1%)
Query 4 LQSYFIASVLYV--MTPHAFAQGTVTIYLPGEQQTLSVGPVENVVQLVTQPQLRDRLWWP 61
++ IA VL P A++ GTV +Y P Q ++ +++ LV QP+L + WWP
Sbjct 1 MKKTVIAIVLLAGFAAPLAWSAGTVKVYTPENAQPKTLTNAGHLLDLVGQPRLANS-WWP 59
Query 62 GALLTDSAAKAKALKDYQHVMAQLASWEAEADDDVAATIKSVRQQLLNLNITGRLPVKLD 121
GA++ + A +A + + ++A+L + D D AA I SVRQQL L +TGR + LD
Sbjct 60 GAVIGERQASVEAEQKHNALLARLTGLAGQEDGDDAAAINSVRQQLQALKVTGRQTINLD 119
Query 122 PDFVRVDENSNPPLVGDYTLYTVQRPVTITLLGAVSGAGQLPWQAGRSVTDYLQDHPRLA 181
PD VRV E NP L G+YTL+ +P T+T++G +S G+ P+ GR V YL + L+
Sbjct 120 PDVVRVAEKGNPALEGEYTLWLPTQPSTVTVMGLISSPGKKPFTPGRDVASYLDEQSLLS 179
Query 182 GADKNNVMVITPEGETVVAPVALWNKRHVEPPPGSQLWLGFSAHVLPEKYADLNDQIVSV 241
GAD + ++ P+G T APVA WNKRH+EP PGS +++GF+ H + Y LN I+
Sbjct 180 GADNSYAWIVYPDGHTQKAPVAYWNKRHIEPMPGSVIFVGFADHFWTKAYDGLNADILHS 239
Query 242 LTQRVPD 248
LTQR+PD
Sbjct 240 LTQRIPD 246
>ref|YP_001455401.1| hypothetical protein CKO_03889 [Citrobacter koseri ATCC BAA-895]
gb|ABV14965.1| hypothetical protein CKO_03889 [Citrobacter koseri ATCC BAA-895]
Length=245
Score = 192 bits (489), Expect = 3e-47, Method: Compositional matrix adjust.
Identities = 101/245 (41%), Positives = 147/245 (60%), Gaps = 1/245 (0%)
Query 4 LQSYFIASVLYVMTPHAFAQGTVTIYLPGEQQTLSVGPVENVVQLVTQPQLRDRLWWPGA 63
++ A L + +FA GTV ++ G + ++ E+++ LV QP+L + WWPGA
Sbjct 2 MKQTIAALALSLCASSSFAAGTVKVFAAGSTEPKTLTGAEHLIDLVGQPKLANS-WWPGA 60
Query 64 LLTDSAAKAKALKDYQHVMAQLASWEAEADDDVAATIKSVRQQLLNLNITGRLPVKLDPD 123
++++ A A A + Q ++ +LA++ AE D AA I ++RQQ+ L +TGR V LDPD
Sbjct 61 VISEERATATAQRQQQELLGRLAAFGAEKSGDDAAAINTLRQQVQTLKVTGRQLVNLDPD 120
Query 124 FVRVDENSNPPLVGDYTLYTVQRPVTITLLGAVSGAGQLPWQAGRSVTDYLQDHPRLAGA 183
VRV E NPPL G YTL+ +P ITL G VS G+ P+ GR V YL + L+GA
Sbjct 121 TVRVSERGNPPLQGHYTLWVGGQPTDITLFGLVSQPGKRPFSPGRDVASYLDEQSLLSGA 180
Query 184 DKNNVMVITPEGETVVAPVALWNKRHVEPPPGSQLWLGFSAHVLPEKYADLNDQIVSVLT 243
D++ V+ P+G T APVA WNKRHVEP PGS +++G + + LN I+ LT
Sbjct 181 DRSYAWVVYPDGRTQKAPVAYWNKRHVEPMPGSIIFVGLTDSLWSNTPDALNADILRTLT 240
Query 244 QRVPD 248
QR+P+
Sbjct 241 QRIPE 245
>ref|YP_003367162.1| hypothetical protein ROD_37221 [Citrobacter rodentium ICC168]
emb|CBG90427.1| putative exported protein [Citrobacter rodentium ICC168]
Length=245
Score = 192 bits (489), Expect = 3e-47, Method: Compositional matrix adjust.
Identities = 101/245 (41%), Positives = 149/245 (60%), Gaps = 1/245 (0%)
Query 4 LQSYFIASVLYVMTPHAFAQGTVTIYLPGEQQTLSVGPVENVVQLVTQPQLRDRLWWPGA 63
++ A ++ + FA GTV +++ G Q ++ E ++ LV QP+L + WWPGA
Sbjct 2 MKRTLFALLISLNAASVFAAGTVNVFIAGTPQAKTLTGAERLIDLVGQPRLANS-WWPGA 60
Query 64 LLTDSAAKAKALKDYQHVMAQLASWEAEADDDVAATIKSVRQQLLNLNITGRLPVKLDPD 123
++++ A A AL+ Q ++A+LA+ A D AA I ++RQQ+ L ITGR V LDPD
Sbjct 61 VISEEQATAAALRQQQELVARLAALSAGESGDDAAAINALRQQVQALRITGRQRVNLDPD 120
Query 124 FVRVDENSNPPLVGDYTLYTVQRPVTITLLGAVSGAGQLPWQAGRSVTDYLQDHPRLAGA 183
VRV E +NPPL G+YTL+ +P +TL G +S G+LP+ GR V YL+ L+GA
Sbjct 121 VVRVSERANPPLQGNYTLWVGPQPAEVTLFGLMSRPGKLPFMPGRDVVSYLEGQSLLSGA 180
Query 184 DKNNVMVITPEGETVVAPVALWNKRHVEPPPGSQLWLGFSAHVLPEKYADLNDQIVSVLT 243
D++ VI P+G + PVA WN+RHVEP PGS +++G V + LN I+ LT
Sbjct 181 DRSYAWVIYPDGRSQKVPVAYWNRRHVEPMPGSIIFVGLDDAVWSSEPDALNADILHTLT 240
Query 244 QRVPD 248
QR+P+
Sbjct 241 QRIPE 245
>gb|EGB70445.1| SLBB-domain-containing protein [Escherichia coli TW10509]
Length=245
Score = 192 bits (487), Expect = 6e-47, Method: Compositional matrix adjust.
Identities = 101/244 (41%), Positives = 146/244 (59%), Gaps = 1/244 (0%)
Query 4 LQSYFIASVLYVMTPHAFAQGTVTIYLPGEQQTLSVGPVENVVQLVTQPQLRDRLWWPGA 63
++ A ++ V FA GTV ++ G + ++ E+++ LV QP+L + WWPGA
Sbjct 2 IKQTIAALIMSVGASSVFAAGTVKVFSNGSNEAKTLTGAEHLIDLVGQPRLANS-WWPGA 60
Query 64 LLTDSAAKAKALKDYQHVMAQLASWEAEADDDVAATIKSVRQQLLNLNITGRLPVKLDPD 123
++++ A A AL+ Q ++ +LA +E+ D AA I ++RQQ+ L +TGR + LDPD
Sbjct 61 VISEELATAAALRQQQALLTRLAELASESSTDDAAAINALRQQIQALKVTGRQKINLDPD 120
Query 124 FVRVDENSNPPLVGDYTLYTVQRPVTITLLGAVSGAGQLPWQAGRSVTDYLQDHPRLAGA 183
VRV E NPPL G+YTL+ P T+TL G +S G P+ GR V YL D L+GA
Sbjct 121 IVRVAERGNPPLQGNYTLWVGPPPSTVTLFGLISRPGNQPFTPGRDVASYLSDQSLLSGA 180
Query 184 DKNNVMVITPEGETVVAPVALWNKRHVEPPPGSQLWLGFSAHVLPEKYADLNDQIVSVLT 243
D++ V+ P+G T APVA WNKRHVE PGS +++G + V E LN I+ LT
Sbjct 181 DRSYAWVVYPDGRTQKAPVAYWNKRHVELMPGSIIYVGLADSVWSETPDALNADILQTLT 240
Query 244 QRVP 247
QR+P
Sbjct 241 QRIP 244
>ref|ZP_08500193.1| group 4 capsule (G4C) polysaccharide, YmcB [Enterobacter hormaechei
ATCC 49162]
gb|EGK57100.1| group 4 capsule (G4C) polysaccharide, YmcB [Enterobacter hormaechei
ATCC 49162]
Length=245
Score = 191 bits (485), Expect = 1e-46, Method: Compositional matrix adjust.
Identities = 107/248 (43%), Positives = 155/248 (62%), Gaps = 3/248 (1%)
Query 1 MNKLQSYFIASVLYVMTPHAFAQGTVTIYLPGEQQTLSVGPVENVVQLVTQPQLRDRLWW 60
MN + + L + + A VT++ PG +T S + + +LVTQPQ + +WW
Sbjct 1 MNGHKKWLPGVGLSLFSLSALGASVVTVHQPG--KTWSAEQADTLSRLVTQPQFNN-VWW 57
Query 61 PGALLTDSAAKAKALKDYQHVMAQLASWEAEADDDVAATIKSVRQQLLNLNITGRLPVKL 120
GA + +A +A + Q V+A L +W+ A+D+ AAT+++V Q+ +L I GR V L
Sbjct 58 QGAAIATPSATRRAQQTQQQVLALLTAWQNRANDERAATVRAVAAQIRSLRIVGRQFVNL 117
Query 121 DPDFVRVDENSNPPLVGDYTLYTVQRPVTITLLGAVSGAGQLPWQAGRSVTDYLQDHPRL 180
DPD VR D + + PL G Y L+ P T+TL+GAV+ G+ W+ G S+ DYLQ PRL
Sbjct 118 DPDAVRTDAHGDRPLEGRYDLWLSPAPRTVTLMGAVATPGKRAWRPGASIRDYLQGQPRL 177
Query 181 AGADKNNVMVITPEGETVVAPVALWNKRHVEPPPGSQLWLGFSAHVLPEKYADLNDQIVS 240
AGAD+NNV VI P+G TVVA V WN RH+E PG+ LW+GF +P+ + LN+QIV+
Sbjct 178 AGADRNNVTVIDPDGSTVVAQVGYWNARHIEAEPGAVLWVGFDPRAVPDDFTGLNEQIVA 237
Query 241 VLTQRVPD 248
+LT+R+PD
Sbjct 238 LLTRRIPD 245
>ref|YP_001465525.1| hypothetical protein EcE24377A_4576 [Escherichia coli E24377A]
ref|YP_001460814.1| hypothetical protein EcHS_A4267 [Escherichia coli HS]
ref|ZP_07591769.1| protein of unknown function DUF1017 [Escherichia coli W]
gb|ABV08431.1| conserved hypothetical protein [Escherichia coli HS]
gb|ABV16853.1| conserved hypothetical protein [Escherichia coli E24377A]
gb|EFN38440.1| protein of unknown function DUF1017 [Escherichia coli W]
gb|ADT77679.1| conserved protein [Escherichia coli W]
gb|ADX52853.1| protein of unknown function DUF1017 [Escherichia coli KO11]
Length=245
Score = 190 bits (483), Expect = 1e-46, Method: Compositional matrix adjust.
Identities = 104/244 (42%), Positives = 150/244 (61%), Gaps = 1/244 (0%)
Query 4 LQSYFIASVLYVMTPHAFAQGTVTIYLPGEQQTLSVGPVENVVQLVTQPQLRDRLWWPGA 63
++ F+A +L V FA GTV ++ G + ++ E+++ LV QP+L + WWPGA
Sbjct 2 IKQTFVALLLSVGASSVFAAGTVKVFSNGSSEAKTLTGAEHLIDLVGQPRLANS-WWPGA 60
Query 64 LLTDSAAKAKALKDYQHVMAQLASWEAEADDDVAATIKSVRQQLLNLNITGRLPVKLDPD 123
++++ A A AL+ Q ++ +LA A++ D AA I ++RQQ+ L +TGR + LDPD
Sbjct 61 VISEELATAAALRQQQALLTRLAELAADSSADDAAAINALRQQIQALKVTGRQKINLDPD 120
Query 124 FVRVDENSNPPLVGDYTLYTVQRPVTITLLGAVSGAGQLPWQAGRSVTDYLQDHPRLAGA 183
VRV E NPPL G+YTL+ P T+TL G +S G+ P+ GR V YL D L+GA
Sbjct 121 IVRVAERGNPPLQGNYTLWVGPPPSTVTLFGLISRPGKQPFTPGRDVASYLSDQSLLSGA 180
Query 184 DKNNVMVITPEGETVVAPVALWNKRHVEPPPGSQLWLGFSAHVLPEKYADLNDQIVSVLT 243
D++ V+ P+G T APVA WNKRHVEP PGS +++G + V E LN I+ LT
Sbjct 181 DRSYAWVVYPDGRTQKAPVAYWNKRHVEPMPGSIIYVGLADSVWSETPDALNADILQTLT 240
Query 244 QRVP 247
QR+P
Sbjct 241 QRIP 244
>ref|ZP_07140265.1| hypothetical protein HMPREF9548_02441 [Escherichia coli MS 182-1]
ref|ZP_07223211.1| conserved hypothetical protein [Escherichia coli MS 78-1]
gb|EFK02824.1| hypothetical protein HMPREF9548_02441 [Escherichia coli MS 182-1]
gb|EFK71207.1| conserved hypothetical protein [Escherichia coli MS 78-1]
gb|EFW75431.1| YjbG polysaccharide synthesis-related protein [Escherichia coli
EC4100B]
Length=245
Score = 190 bits (482), Expect = 2e-46, Method: Compositional matrix adjust.
Identities = 104/244 (42%), Positives = 150/244 (61%), Gaps = 1/244 (0%)
Query 4 LQSYFIASVLYVMTPHAFAQGTVTIYLPGEQQTLSVGPVENVVQLVTQPQLRDRLWWPGA 63
++ F+A +L V FA GTV ++ G + ++ E+++ LV QP+L + WWPGA
Sbjct 2 IKQTFVALLLSVGASSVFAAGTVKVFSNGSSEAKTLTGAEHLIDLVGQPRLANS-WWPGA 60
Query 64 LLTDSAAKAKALKDYQHVMAQLASWEAEADDDVAATIKSVRQQLLNLNITGRLPVKLDPD 123
++++ A A AL+ Q ++ +LA A++ D AA I ++RQQ+ L +TGR + LDPD
Sbjct 61 VISEELATAAALRQQQALLTRLAEQGADSSADDAAAINALRQQIQALKVTGRQKINLDPD 120
Query 124 FVRVDENSNPPLVGDYTLYTVQRPVTITLLGAVSGAGQLPWQAGRSVTDYLQDHPRLAGA 183
VRV E NPPL G+YTL+ P T+TL G +S G+ P+ GR V YL D L+GA
Sbjct 121 IVRVAERGNPPLQGNYTLWVGPPPSTVTLFGLISRPGKQPFTPGRDVASYLSDQSLLSGA 180
Query 184 DKNNVMVITPEGETVVAPVALWNKRHVEPPPGSQLWLGFSAHVLPEKYADLNDQIVSVLT 243
D++ V+ P+G T APVA WNKRHVEP PGS +++G + V E LN I+ LT
Sbjct 181 DRSYAWVVYPDGRTQKAPVAYWNKRHVEPMPGSIIYVGLADSVWSETPDALNADILQTLT 240
Query 244 QRVP 247
QR+P
Sbjct 241 QRIP 244
>emb|CBK84486.1| SLBB-domain like (DUF1017) [Enterobacter cloacae subsp. cloacae
NCTC 9394]
Length=245
Score = 189 bits (480), Expect = 3e-46, Method: Compositional matrix adjust.
Identities = 99/240 (41%), Positives = 150/240 (62%), Gaps = 1/240 (0%)
Query 9 IASVLYVMTPHAFAQGTVTIYLPGEQQTLSVGPVENVVQLVTQPQLRDRLWWPGALLTDS 68
IA + + +P A++ GTV +Y P ++ ++ +++ LV QP+L + WW GA++++
Sbjct 7 IALLASLTSPLAWSAGTVQVYTPDSEKPKTLTNAGHLLDLVGQPRLA-KSWWTGAVISER 65
Query 69 AAKAKALKDYQHVMAQLASWEAEADDDVAATIKSVRQQLLNLNITGRLPVKLDPDFVRVD 128
A A + +Q ++A+L+ + D D AA I S+RQQL + +TGR V LDPD VRV
Sbjct 66 QATIVAEQKHQALLARLSGLAQQEDTDDAAAITSLRQQLQAVKVTGRQKVNLDPDEVRVA 125
Query 129 ENSNPPLVGDYTLYTVQRPVTITLLGAVSGAGQLPWQAGRSVTDYLQDHPRLAGADKNNV 188
EN NP L GDYTL+ +P T+T++G +S G+ P+ GR V YL + L+GAD +
Sbjct 126 ENGNPSLEGDYTLWLPAQPSTVTVMGLLSSPGKKPFTPGRDVASYLDEQSLLSGADNSYA 185
Query 189 MVITPEGETVVAPVALWNKRHVEPPPGSQLWLGFSAHVLPEKYADLNDQIVSVLTQRVPD 248
V+ P+G T APVA WNKRH+EP PGS +++GF+ H + Y LN I+ L QR+P+
Sbjct 186 WVVYPDGHTQKAPVAYWNKRHIEPMPGSIIFVGFADHFWTKAYDGLNADILRSLIQRIPE 245
>ref|ZP_08366591.1| conserved hypothetical protein [Escherichia coli TA143]
gb|EGI28742.1| conserved hypothetical protein [Escherichia coli TA143]
Length=245
Score = 188 bits (478), Expect = 6e-46, Method: Compositional matrix adjust.
Identities = 103/244 (42%), Positives = 149/244 (61%), Gaps = 1/244 (0%)
Query 4 LQSYFIASVLYVMTPHAFAQGTVTIYLPGEQQTLSVGPVENVVQLVTQPQLRDRLWWPGA 63
++ +A +L V FA GTV ++ G + ++ E+++ LV QP+L + WWPGA
Sbjct 2 IKQTIVALILSVGASSVFAAGTVKVFSNGSSEAKTLTGAEHLIDLVGQPRLANS-WWPGA 60
Query 64 LLTDSAAKAKALKDYQHVMAQLASWEAEADDDVAATIKSVRQQLLNLNITGRLPVKLDPD 123
++++ A A AL+ Q ++ +LA A++ D AA I ++RQQ+ L +TGR + LDPD
Sbjct 61 VISEELATAAALRQQQALLTRLAELAADSSADDAAAINALRQQIQALKVTGRQKINLDPD 120
Query 124 FVRVDENSNPPLVGDYTLYTVQRPVTITLLGAVSGAGQLPWQAGRSVTDYLQDHPRLAGA 183
VRV E NPPL G+YTL+ P T+TL G +S G+ P+ GR V YL D L+GA
Sbjct 121 IVRVAERGNPPLQGNYTLWVGPPPSTVTLFGLISRPGKQPFTPGRDVASYLSDQSLLSGA 180
Query 184 DKNNVMVITPEGETVVAPVALWNKRHVEPPPGSQLWLGFSAHVLPEKYADLNDQIVSVLT 243
D++ V+ P+G T APVA WNKRHVEP PGS +++G + V E LN I+ LT
Sbjct 181 DRSYAWVVYPDGRTQKAPVAYWNKRHVEPMPGSIIYVGLADSVWSETPDALNADILQTLT 240
Query 244 QRVP 247
QR+P
Sbjct 241 QRIP 244
>ref|ZP_08497921.1| hypothetical protein HMPREF9086_2183 [Enterobacter hormaechei
ATCC 49162]
gb|EGK61041.1| hypothetical protein HMPREF9086_2183 [Enterobacter hormaechei
ATCC 49162]
Length=245
Score = 188 bits (477), Expect = 8e-46, Method: Compositional matrix adjust.
Identities = 98/240 (40%), Positives = 150/240 (62%), Gaps = 1/240 (0%)
Query 9 IASVLYVMTPHAFAQGTVTIYLPGEQQTLSVGPVENVVQLVTQPQLRDRLWWPGALLTDS 68
+A + + +P A++ GTV +Y P ++ ++ +++ LV QP+L + WW GA++++
Sbjct 7 VALLASLASPLAWSAGTVQVYTPDSEKPKTLTNAGHLLDLVGQPRLA-KSWWTGAVISER 65
Query 69 AAKAKALKDYQHVMAQLASWEAEADDDVAATIKSVRQQLLNLNITGRLPVKLDPDFVRVD 128
A A + +Q ++A+L+ + D D AA I S+RQQL + +TGR V LDPD VRV
Sbjct 66 QATVVAEQKHQALLARLSGLAQQEDADDAAGINSLRQQLQAVKVTGRQKVNLDPDEVRVA 125
Query 129 ENSNPPLVGDYTLYTVQRPVTITLLGAVSGAGQLPWQAGRSVTDYLQDHPRLAGADKNNV 188
EN NP L GDYTL+ +P T+T++G +S G+ P+ GR V YL + L+GAD +
Sbjct 126 ENGNPSLEGDYTLWLPAQPSTVTVMGLLSSPGKKPFTPGRDVASYLDEQSLLSGADNSYA 185
Query 189 MVITPEGETVVAPVALWNKRHVEPPPGSQLWLGFSAHVLPEKYADLNDQIVSVLTQRVPD 248
V+ P+G T APVA WNKRH+EP PGS +++GF+ H + Y LN I+ L QR+P+
Sbjct 186 WVVYPDGHTQKAPVAYWNKRHIEPMPGSIIFVGFADHFWTKAYDGLNADILRSLIQRIPE 245
>ref|ZP_03048870.1| conserved hypothetical protein [Escherichia coli E110019]
ref|ZP_07124032.1| hypothetical protein HMPREF9536_04298 [Escherichia coli MS 84-1]
ref|ZP_07208830.1| hypothetical protein HMPREF9347_01280 [Escherichia coli MS 124-1]
gb|EDV89316.1| conserved hypothetical protein [Escherichia coli E110019]
gb|EFJ85444.1| hypothetical protein HMPREF9536_04298 [Escherichia coli MS 84-1]
gb|EFK69681.1| hypothetical protein HMPREF9347_01280 [Escherichia coli MS 124-1]
gb|EFU34662.1| conserved hypothetical protein [Escherichia coli MS 85-1]
Length=245
Score = 187 bits (475), Expect = 1e-45, Method: Compositional matrix adjust.
Identities = 103/244 (42%), Positives = 148/244 (60%), Gaps = 1/244 (0%)
Query 4 LQSYFIASVLYVMTPHAFAQGTVTIYLPGEQQTLSVGPVENVVQLVTQPQLRDRLWWPGA 63
++ +A +L V FA GTV ++ G + ++ E ++ LV QP+L + WWPGA
Sbjct 2 IKQTIVALILSVGASSVFAAGTVKVFSNGSSEAKTLTGAEYLIDLVGQPRLANS-WWPGA 60
Query 64 LLTDSAAKAKALKDYQHVMAQLASWEAEADDDVAATIKSVRQQLLNLNITGRLPVKLDPD 123
++++ A A AL+ Q ++ +LA A++ D AA I ++RQQ+ L +TGR + LDPD
Sbjct 61 VISEELATAAALRQQQALLTRLAELAADSSADDAAAINALRQQIQALKVTGRQKINLDPD 120
Query 124 FVRVDENSNPPLVGDYTLYTVQRPVTITLLGAVSGAGQLPWQAGRSVTDYLQDHPRLAGA 183
VRV E NPPL G+YTL+ P T+TL G +S G+ P+ GR V YL D L+GA
Sbjct 121 IVRVAERGNPPLQGNYTLWVGPPPSTVTLFGLISRPGKQPFTPGRDVASYLSDQSLLSGA 180
Query 184 DKNNVMVITPEGETVVAPVALWNKRHVEPPPGSQLWLGFSAHVLPEKYADLNDQIVSVLT 243
D++ V+ P+G T APVA WNKRHVEP PGS +++G + V E LN I+ LT
Sbjct 181 DRSYAWVVYPDGRTQKAPVAYWNKRHVEPMPGSIIYVGLADSVWSETPDALNADILQTLT 240
Query 244 QRVP 247
QR+P
Sbjct 241 QRIP 244
>ref|YP_001726928.1| hypothetical protein EcolC_4002 [Escherichia coli ATCC 8739]
ref|YP_003038179.1| hypothetical protein ECBD_4009 [Escherichia coli 'BL21-Gold(DE3)pLysS
AG']
ref|YP_003047072.1| hypothetical protein ECB_03900 [Escherichia coli B str. REL606]
ref|ZP_07145213.1| conserved hypothetical protein [Escherichia coli MS 187-1]
gb|ACA79601.1| protein of unknown function DUF1017 [Escherichia coli ATCC 8739]
emb|CAQ34377.1| conserved protein [Escherichia coli BL21(DE3)]
gb|ACT30994.1| protein of unknown function DUF1017 [Escherichia coli 'BL21-Gold(DE3)pLysS
AG']
gb|ACT41536.1| hypothetical protein ECB_03900 [Escherichia coli B str. REL606]
gb|ACT45691.1| hypothetical protein ECD_03900 [Escherichia coli BL21(DE3)]
gb|EFK25799.1| conserved hypothetical protein [Escherichia coli MS 187-1]
gb|EGB56130.1| SLBB-domain-containing protein [Escherichia coli H489]
gb|EGB65122.1| SLBB-domain-containing protein [Escherichia coli TA007]
Length=245
Score = 187 bits (475), Expect = 1e-45, Method: Compositional matrix adjust.
Identities = 103/244 (42%), Positives = 149/244 (61%), Gaps = 1/244 (0%)
Query 4 LQSYFIASVLYVMTPHAFAQGTVTIYLPGEQQTLSVGPVENVVQLVTQPQLRDRLWWPGA 63
++ F+A +L V FA GTV ++ G + ++ E+++ LV QP+L + WWPGA
Sbjct 2 IKQTFVALLLSVGASSVFAAGTVKVFSNGSSEAKTLTGAEHLIDLVGQPRLANS-WWPGA 60
Query 64 LLTDSAAKAKALKDYQHVMAQLASWEAEADDDVAATIKSVRQQLLNLNITGRLPVKLDPD 123
++++ A A AL+ Q ++ +LA A++ D AA I ++RQQ+ L +TGR + LDPD
Sbjct 61 VISEELATAAALRQQQALLTRLAELAADSSADDAAAINALRQQIQALKVTGRQKINLDPD 120
Query 124 FVRVDENSNPPLVGDYTLYTVQRPVTITLLGAVSGAGQLPWQAGRSVTDYLQDHPRLAGA 183
VRV E NPPL G+YTL+ P T+TL G +S G+ P+ GR V YL L+GA
Sbjct 121 IVRVAERGNPPLQGNYTLWVGPPPSTVTLFGLISRPGKQPFTPGRDVASYLSGQNLLSGA 180
Query 184 DKNNVMVITPEGETVVAPVALWNKRHVEPPPGSQLWLGFSAHVLPEKYADLNDQIVSVLT 243
D++ V+ P+G T APVA WNKRHVEP PGS +++G + V E LN I+ LT
Sbjct 181 DRSYAWVVYPDGRTQKAPVAYWNKRHVEPMPGSIIYVGLADSVWSETPDALNADILQTLT 240
Query 244 QRVP 247
QR+P
Sbjct 241 QRIP 244
>ref|NP_418452.1| conserved protein [Escherichia coli str. K-12 substr. MG1655]
ref|AP_004529.1| hypothetical protein [Escherichia coli str. K-12 substr. W3110]
ref|YP_001732805.1| hypothetical protein ECDH10B_4217 [Escherichia coli str. K-12
substr. DH10B]
29 more sequence titles
Length=245
Score = 187 bits (475), Expect = 1e-45, Method: Compositional matrix adjust.
Identities = 103/244 (42%), Positives = 149/244 (61%), Gaps = 1/244 (0%)
Query 4 LQSYFIASVLYVMTPHAFAQGTVTIYLPGEQQTLSVGPVENVVQLVTQPQLRDRLWWPGA 63
++ +A +L V FA GTV ++ G + ++ E+++ LV QP+L + WWPGA
Sbjct 2 IKQTIVALLLSVGASSVFAAGTVKVFSNGSSEAKTLTGAEHLIDLVGQPRLANS-WWPGA 60
Query 64 LLTDSAAKAKALKDYQHVMAQLASWEAEADDDVAATIKSVRQQLLNLNITGRLPVKLDPD 123
++++ A A AL+ Q ++ +LA A++ D AA I ++RQQ+ L +TGR + LDPD
Sbjct 61 VISEELATAAALRQQQALLTRLAEQGADSSADDAAAINALRQQIQALKVTGRQKINLDPD 120
Query 124 FVRVDENSNPPLVGDYTLYTVQRPVTITLLGAVSGAGQLPWQAGRSVTDYLQDHPRLAGA 183
VRV E NPPL G+YTL+ P T+TL G +S G+ P+ GR V YL D L+GA
Sbjct 121 IVRVAERGNPPLQGNYTLWVGPPPSTVTLFGLISRPGKQPFTPGRDVASYLSDQSLLSGA 180
Query 184 DKNNVMVITPEGETVVAPVALWNKRHVEPPPGSQLWLGFSAHVLPEKYADLNDQIVSVLT 243
D++ V+ P+G T APVA WNKRHVEP PGS +++G + V E LN I+ LT
Sbjct 181 DRSYAWVVYPDGRTQKAPVAYWNKRHVEPMPGSIIYVGLADSVWSETPDALNADILQTLT 240
Query 244 QRVP 247
QR+P
Sbjct 241 QRIP 244
>ref|YP_001882714.1| hypothetical protein SbBS512_E4533 [Shigella boydii CDC 3083-94]
gb|ACD09338.1| conserved hypothetical protein [Shigella boydii CDC 3083-94]
Length=245
Score = 187 bits (474), Expect = 2e-45, Method: Compositional matrix adjust.
Identities = 103/244 (42%), Positives = 149/244 (61%), Gaps = 1/244 (0%)
Query 4 LQSYFIASVLYVMTPHAFAQGTVTIYLPGEQQTLSVGPVENVVQLVTQPQLRDRLWWPGA 63
++ F+A +L V FA GTV ++ G + ++ E+++ LV QP+L + WWPGA
Sbjct 2 IKQTFVALLLSVGASSVFAAGTVKVFSNGSSEAKTLTGAEHLIDLVGQPRLANS-WWPGA 60
Query 64 LLTDSAAKAKALKDYQHVMAQLASWEAEADDDVAATIKSVRQQLLNLNITGRLPVKLDPD 123
++++ A A AL+ Q ++ +LA A++ D AA I ++RQQ+ L +TGR + LDPD
Sbjct 61 VISEELATAAALRQQQALLTRLAEQAADSSADDAAAINALRQQIQALKVTGRQKINLDPD 120
Query 124 FVRVDENSNPPLVGDYTLYTVQRPVTITLLGAVSGAGQLPWQAGRSVTDYLQDHPRLAGA 183
VRV E NPPL G+YTL+ P T+TL G +S G+ P+ GR V YL L+GA
Sbjct 121 IVRVAERGNPPLQGNYTLWVGPPPSTVTLFGLISRPGKQPFTPGRDVASYLSGQNLLSGA 180
Query 184 DKNNVMVITPEGETVVAPVALWNKRHVEPPPGSQLWLGFSAHVLPEKYADLNDQIVSVLT 243
D++ V+ P+G T APVA WNKRHVEP PGS +++G + V E LN I+ LT
Sbjct 181 DRSYAWVVYPDGRTQKAPVAYWNKRHVEPMPGSIIYVGLADSVWSETPDALNADILQTLT 240
Query 244 QRVP 247
QR+P
Sbjct 241 QRIP 244
>ref|YP_002295592.1| hypothetical protein ECSE_4317 [Escherichia coli SE11]
dbj|BAG79841.1| conserved hypothetical protein [Escherichia coli SE11]
gb|EGB86303.1| hypothetical protein HMPREF9542_04277 [Escherichia coli MS 117-3]
Length=245
Score = 187 bits (474), Expect = 2e-45, Method: Compositional matrix adjust.
Identities = 103/244 (42%), Positives = 148/244 (60%), Gaps = 1/244 (0%)
Query 4 LQSYFIASVLYVMTPHAFAQGTVTIYLPGEQQTLSVGPVENVVQLVTQPQLRDRLWWPGA 63
++ +A +L V FA GTV ++ G + ++ E ++ LV QP+L + WWPGA
Sbjct 2 IKQTIVALILSVGASSVFAAGTVKVFSNGSSEAKTLTGAEYLIDLVGQPRLANS-WWPGA 60
Query 64 LLTDSAAKAKALKDYQHVMAQLASWEAEADDDVAATIKSVRQQLLNLNITGRLPVKLDPD 123
++++ A A AL+ Q ++ +LA A++ D AA I ++RQQ+ L +TGR + LDPD
Sbjct 61 VISEELATAAALRQQQALLTRLAEQGADSSADDAAAINALRQQIQALKVTGRQKINLDPD 120
Query 124 FVRVDENSNPPLVGDYTLYTVQRPVTITLLGAVSGAGQLPWQAGRSVTDYLQDHPRLAGA 183
VRV E NPPL G+YTL+ P T+TL G +S G+ P+ GR V YL D L+GA
Sbjct 121 IVRVAERGNPPLQGNYTLWVGPPPSTVTLFGLISRPGKQPFTPGRDVASYLSDQSLLSGA 180
Query 184 DKNNVMVITPEGETVVAPVALWNKRHVEPPPGSQLWLGFSAHVLPEKYADLNDQIVSVLT 243
D++ V+ P+G T APVA WNKRHVEP PGS +++G + V E LN I+ LT
Sbjct 181 DRSYAWVVYPDGRTQKAPVAYWNKRHVEPMPGSIIYVGLADSVWSETPDALNADILQTLT 240
Query 244 QRVP 247
QR+P
Sbjct 241 QRIP 244
>gb|EFW53980.1| YjbG polysaccharide synthesis-related protein [Shigella boydii
ATCC 9905]
gb|EGI89476.1| hypothetical protein SD15574_5023 [Shigella dysenteriae 155-74]
Length=245
Score = 186 bits (473), Expect = 2e-45, Method: Compositional matrix adjust.
Identities = 103/244 (42%), Positives = 149/244 (61%), Gaps = 1/244 (0%)
Query 4 LQSYFIASVLYVMTPHAFAQGTVTIYLPGEQQTLSVGPVENVVQLVTQPQLRDRLWWPGA 63
++ F+A +L V FA GTV ++ G + ++ E+++ LV QP+L + WWPGA
Sbjct 2 IKQTFVALLLSVGASSVFAAGTVKVFSNGSSEAKTLTGAEHLIDLVGQPRLANS-WWPGA 60
Query 64 LLTDSAAKAKALKDYQHVMAQLASWEAEADDDVAATIKSVRQQLLNLNITGRLPVKLDPD 123
++++ A A AL+ Q ++ +LA A++ D AA I ++RQQ+ L +TGR + LDPD
Sbjct 61 VISEELATAAALRQQQALLTRLAEQGADSSADDAAAINALRQQIQALKVTGRQKINLDPD 120
Query 124 FVRVDENSNPPLVGDYTLYTVQRPVTITLLGAVSGAGQLPWQAGRSVTDYLQDHPRLAGA 183
VRV E NPPL G+YTL+ P T+TL G +S G+ P+ GR V YL L+GA
Sbjct 121 IVRVAERGNPPLQGNYTLWVGPPPSTVTLFGLISRPGKQPFTPGRDVASYLSGQNLLSGA 180
Query 184 DKNNVMVITPEGETVVAPVALWNKRHVEPPPGSQLWLGFSAHVLPEKYADLNDQIVSVLT 243
D++ V+ P+G T APVA WNKRHVEP PGS +++G + V E LN I+ LT
Sbjct 181 DRSYAWVVYPDGRTQKAPVAYWNKRHVEPMPGSIIYVGLADSVWSETPDALNADILQTLT 240
Query 244 QRVP 247
QR+P
Sbjct 241 QRIP 244
>ref|YP_003933062.1| hypothetical protein Pvag_3494 [Pantoea vagans C9-1]
gb|ADO11613.1| Uncharacterized protein ymcB precursor [Pantoea vagans C9-1]
Length=247
Score = 186 bits (473), Expect = 2e-45, Method: Compositional matrix adjust.
Identities = 98/229 (42%), Positives = 136/229 (59%), Gaps = 0/229 (0%)
Query 20 AFAQGTVTIYLPGEQQTLSVGPVENVVQLVTQPQLRDRLWWPGALLTDSAAKAKALKDYQ 79
A A V ++ P + + ++ QL T P L+ W ++ + A A A + YQ
Sbjct 19 AQATAQVIVHAPHNGGQAELSQIADLSQLATLPPLQANTDWRRTVIAERGASAVAQQQYQ 78
Query 80 HVMAQLASWEAEADDDVAATIKSVRQQLLNLNITGRLPVKLDPDFVRVDENSNPPLVGDY 139
+ L +W A++ D AA I V +QL +N+TGR LDPD++R+ N L G Y
Sbjct 79 QTLGALRAWRADSSGDRAAAIDEVIRQLSAINVTGRQFTPLDPDWIRLHPADNRRLEGSY 138
Query 140 TLYTVQRPVTITLLGAVSGAGQLPWQAGRSVTDYLQDHPRLAGADKNNVMVITPEGETVV 199
L+ ++ LLGA+SGAG++ WQ G+SV DYL+ HP L+GA++N V VI+P G T
Sbjct 139 DLWLQTPSDSVLLLGALSGAGKVSWQPGKSVRDYLEGHPSLSGAERNFVTVISPSGATQQ 198
Query 200 APVALWNKRHVEPPPGSQLWLGFSAHVLPEKYADLNDQIVSVLTQRVPD 248
PVA WN RH E PGS +WLGFS+ LP Y DLND+I+SVLT R+PD
Sbjct 199 VPVAYWNHRHAEVEPGSVIWLGFSSWSLPGSYEDLNDRILSVLTHRIPD 247
>emb|CBG37221.1| conserved hypothetical protein [Escherichia coli 042]
Length=244
Score = 186 bits (472), Expect = 3e-45, Method: Compositional matrix adjust.
Identities = 102/244 (41%), Positives = 149/244 (61%), Gaps = 1/244 (0%)
Query 4 LQSYFIASVLYVMTPHAFAQGTVTIYLPGEQQTLSVGPVENVVQLVTQPQLRDRLWWPGA 63
++ +A +L V FA GTV ++ G + ++ E+++ LV QP+L + WWPGA
Sbjct 2 IKQTIVALILSVGASSVFAAGTVKVFSNGSSEAKTLTGAEHLIDLVGQPRLANS-WWPGA 60
Query 64 LLTDSAAKAKALKDYQHVMAQLASWEAEADDDVAATIKSVRQQLLNLNITGRLPVKLDPD 123
++++ A A AL+ Q ++ +LA A++ D AA I ++RQQ+ L +TGR + LDPD
Sbjct 61 VISEELATAAALRQQQALLTRLAEQGADSSADDAAAINALRQQIQALKVTGRQKINLDPD 120
Query 124 FVRVDENSNPPLVGDYTLYTVQRPVTITLLGAVSGAGQLPWQAGRSVTDYLQDHPRLAGA 183
VRV E NPPL G+YTL+ P T+TL G +S G+ P+ GR V YL D L+GA
Sbjct 121 IVRVAERGNPPLQGNYTLWVGPPPSTVTLFGLISRPGKQPFTPGRDVASYLSDQSLLSGA 180
Query 184 DKNNVMVITPEGETVVAPVALWNKRHVEPPPGSQLWLGFSAHVLPEKYADLNDQIVSVLT 243
D++ V+ P+G T APVA WNKRHVEP PGS +++G + V + LN I+ LT
Sbjct 181 DRSYAWVVYPDGRTQKAPVAYWNKRHVEPMPGSIIYVGLADSVWSKTPDALNADILQTLT 240
Query 244 QRVP 247
QR+P
Sbjct 241 QRIP 244
>emb|CAP78488.1| Uncharacterized protein yjbG [Escherichia coli LF82]
gb|ADR29433.1| hypothetical protein NRG857_20125 [Escherichia coli O83:H1 str.
NRG 857C]
gb|EFU56817.1| conserved hypothetical protein [Escherichia coli MS 16-3]
gb|EFW68025.1| YjbG polysaccharide synthesis-related protein [Escherichia coli
WV_060327]
Length=245
Score = 186 bits (472), Expect = 3e-45, Method: Compositional matrix adjust.
Identities = 102/244 (41%), Positives = 148/244 (60%), Gaps = 1/244 (0%)
Query 4 LQSYFIASVLYVMTPHAFAQGTVTIYLPGEQQTLSVGPVENVVQLVTQPQLRDRLWWPGA 63
++ +A +L V FA GTV ++ G + ++ E+++ LV QP+L + WWPGA
Sbjct 2 IKQTIVALILSVGASSVFAAGTVKVFSNGSSEAKTLTGAEHLIDLVGQPRLANS-WWPGA 60
Query 64 LLTDSAAKAKALKDYQHVMAQLASWEAEADDDVAATIKSVRQQLLNLNITGRLPVKLDPD 123
++++ A A AL+ Q ++ +LA A++ D AA I ++RQQ+ L +TGR + LDPD
Sbjct 61 VISEELATAAALRQQQALLTRLAELAADSSADDAAAINALRQQIQALKVTGRQKINLDPD 120
Query 124 FVRVDENSNPPLVGDYTLYTVQRPVTITLLGAVSGAGQLPWQAGRSVTDYLQDHPRLAGA 183
VRV E NPPL G+YTL+ P T+TL G +S G+ + GR V YL D L+GA
Sbjct 121 IVRVAERGNPPLQGNYTLWVGTPPSTVTLFGLISRPGKQSFTPGRDVASYLSDQSLLSGA 180
Query 184 DKNNVMVITPEGETVVAPVALWNKRHVEPPPGSQLWLGFSAHVLPEKYADLNDQIVSVLT 243
D++ V+ P+G T APVA WNKRHVEP PGS +++G + V E LN I+ LT
Sbjct 181 DRSYAWVVYPDGRTQKAPVAYWNKRHVEPMPGSIIYVGLADSVWSETPDALNADILQTLT 240
Query 244 QRVP 247
QR+P
Sbjct 241 QRIP 244
>ref|ZP_06939706.1| hypothetical protein EcolOP_27029 [Escherichia coli OP50]
Length=245
Score = 186 bits (472), Expect = 3e-45, Method: Compositional matrix adjust.
Identities = 103/244 (42%), Positives = 148/244 (60%), Gaps = 1/244 (0%)
Query 4 LQSYFIASVLYVMTPHAFAQGTVTIYLPGEQQTLSVGPVENVVQLVTQPQLRDRLWWPGA 63
++ F+A +L V FA GTV ++ G + ++ E ++ LV QP+L + WWPGA
Sbjct 2 IKQTFVALLLSVGASSVFAAGTVKVFSNGSSEAKTLTGAEYLIDLVGQPRLANS-WWPGA 60
Query 64 LLTDSAAKAKALKDYQHVMAQLASWEAEADDDVAATIKSVRQQLLNLNITGRLPVKLDPD 123
++++ A A AL+ Q ++ +LA A++ D AA I ++RQQ+ L +TGR + LDPD
Sbjct 61 VISEELATAAALRQQQALLTRLAELAADSSADDAAAINALRQQIQALKVTGRQKINLDPD 120
Query 124 FVRVDENSNPPLVGDYTLYTVQRPVTITLLGAVSGAGQLPWQAGRSVTDYLQDHPRLAGA 183
VRV E NPPL G+YTL+ P T+TL G +S G+ P+ GR V YL L+GA
Sbjct 121 IVRVAERGNPPLQGNYTLWVGPPPSTVTLFGLISRPGKQPFTPGRDVASYLSGQNLLSGA 180
Query 184 DKNNVMVITPEGETVVAPVALWNKRHVEPPPGSQLWLGFSAHVLPEKYADLNDQIVSVLT 243
D++ V+ P+G T APVA WNKRHVEP PGS +++G + V E LN I+ LT
Sbjct 181 DRSYAWVVYPDGRTQKAPVAYWNKRHVEPMPGSIIYVGLADSVWSETPDALNADILQTLT 240
Query 244 QRVP 247
QR+P
Sbjct 241 QRIP 244
>ref|ZP_07152932.1| conserved hypothetical protein [Escherichia coli MS 21-1]
gb|EFK20313.1| conserved hypothetical protein [Escherichia coli MS 21-1]
Length=245
Score = 186 bits (471), Expect = 3e-45, Method: Compositional matrix adjust.
Identities = 102/244 (41%), Positives = 147/244 (60%), Gaps = 1/244 (0%)
Query 4 LQSYFIASVLYVMTPHAFAQGTVTIYLPGEQQTLSVGPVENVVQLVTQPQLRDRLWWPGA 63
++ +A +L FA GTV ++ G + ++ E+++ LV QPQL + WWPGA
Sbjct 2 IKQTIVALLLSAGASSVFAAGTVKVFSNGSSEAKTLTGAEHLIDLVGQPQLANS-WWPGA 60
Query 64 LLTDSAAKAKALKDYQHVMAQLASWEAEADDDVAATIKSVRQQLLNLNITGRLPVKLDPD 123
++++ A A AL+ Q ++ +LA A++ D AA I ++RQQ+ L +TGR + LDPD
Sbjct 61 VISEELATAAALRQQQALLTRLAELAADSSADDAAAINALRQQIQALKVTGRQKINLDPD 120
Query 124 FVRVDENSNPPLVGDYTLYTVQRPVTITLLGAVSGAGQLPWQAGRSVTDYLQDHPRLAGA 183
VRV E NPPL G+YTL+ P T+TL G +S G+ + GR V YL D L+GA
Sbjct 121 IVRVAERGNPPLQGNYTLWVGPPPSTVTLFGLISRPGKQSFTPGRDVASYLSDQSLLSGA 180
Query 184 DKNNVMVITPEGETVVAPVALWNKRHVEPPPGSQLWLGFSAHVLPEKYADLNDQIVSVLT 243
D++ V+ P+G T APVA WNKRHVEP PGS +++G + V E LN I+ LT
Sbjct 181 DRSYAWVVYPDGRTQKAPVAYWNKRHVEPMPGSIIYVGLADSVWSETSDALNADILQTLT 240
Query 244 QRVP 247
QR+P
Sbjct 241 QRIP 244
>ref|YP_543535.1| hypothetical protein UTI89_C4596 [Escherichia coli UTI89]
ref|YP_859620.1| hypothetical protein APECO1_2440 [Escherichia coli APEC O1]
ref|YP_002394011.1| hypothetical protein ECS88_4501 [Escherichia coli S88]
ref|ZP_04534064.1| conserved hypothetical protein [Escherichia sp. 3_2_53FAA]
gb|ABE10004.1| hypothetical protein YjbG precursor [Escherichia coli UTI89]
gb|ABJ03496.1| conserved hypothetical protein [Escherichia coli APEC O1]
emb|CAR05662.1| conserved hypothetical protein [Escherichia coli S88]
gb|EEH87754.1| conserved hypothetical protein [Escherichia sp. 3_2_53FAA]
gb|ADE92199.1| conserved hypothetical protein [Escherichia coli IHE3034]
gb|ADN73381.1| hypothetical protein UM146_20255 [Escherichia coli UM146]
gb|EFU47234.1| conserved hypothetical protein [Escherichia coli MS 110-3]
gb|EGB46309.1| SLBB-domain-containing protein [Escherichia coli H252]
gb|EGB50295.1| SLBB-domain-containing protein [Escherichia coli H263]
Length=245
Score = 186 bits (471), Expect = 4e-45, Method: Compositional matrix adjust.
Identities = 102/244 (41%), Positives = 147/244 (60%), Gaps = 1/244 (0%)
Query 4 LQSYFIASVLYVMTPHAFAQGTVTIYLPGEQQTLSVGPVENVVQLVTQPQLRDRLWWPGA 63
++ +A +L V FA GTV ++ G + ++ E+++ LV QP+L + WWPGA
Sbjct 2 IKQTIVALILSVGASSVFAAGTVKVFSNGSSEAKTLTGAEHLIDLVGQPRLANS-WWPGA 60
Query 64 LLTDSAAKAKALKDYQHVMAQLASWEAEADDDVAATIKSVRQQLLNLNITGRLPVKLDPD 123
++++ A A AL+ Q ++ +LA A++ D AA I ++RQQ+ L +TGR + LDPD
Sbjct 61 VISEELATAAALRQQQALLTRLAELAADSSADDAAAINALRQQIQALKVTGRQKINLDPD 120
Query 124 FVRVDENSNPPLVGDYTLYTVQRPVTITLLGAVSGAGQLPWQAGRSVTDYLQDHPRLAGA 183
VRV E NPPL G+YTL+ P T+TL G +S G+ + GR V YL D L+GA
Sbjct 121 IVRVAERGNPPLQGNYTLWVGTPPSTVTLFGLISRPGKQSFTPGRDVASYLSDQSLLSGA 180
Query 184 DKNNVMVITPEGETVVAPVALWNKRHVEPPPGSQLWLGFSAHVLPEKYADLNDQIVSVLT 243
D++ V+ P G T APVA WNKRHVEP PGS +++G + V E LN I+ LT
Sbjct 181 DRSYAWVVYPSGRTQKAPVAYWNKRHVEPMPGSIIYVGLADSVWSETPDALNADILQTLT 240
Query 244 QRVP 247
QR+P
Sbjct 241 QRIP 244
>ref|ZP_06660155.1| YjbG polysaccharide synthesis protein [Escherichia coli B185]
gb|EFF03249.1| YjbG polysaccharide synthesis protein [Escherichia coli B185]
Length=245
Score = 186 bits (471), Expect = 4e-45, Method: Compositional matrix adjust.
Identities = 102/244 (41%), Positives = 148/244 (60%), Gaps = 1/244 (0%)
Query 4 LQSYFIASVLYVMTPHAFAQGTVTIYLPGEQQTLSVGPVENVVQLVTQPQLRDRLWWPGA 63
++ F+A +L V FA GTV ++ G + ++ E+++ LV QP+L + WWPGA
Sbjct 2 IKQTFVALLLSVGASSVFAAGTVKVFSNGSSEAKTLTGAEHLIDLVGQPRLANS-WWPGA 60
Query 64 LLTDSAAKAKALKDYQHVMAQLASWEAEADDDVAATIKSVRQQLLNLNITGRLPVKLDPD 123
++++ A A AL+ Q ++ +LA A++ D AA I ++RQQ+ L +TGR + LDPD
Sbjct 61 VISEELATAAALRQQQALLTRLAELAADSSADDAAAINALRQQIQALKVTGRQKINLDPD 120
Query 124 FVRVDENSNPPLVGDYTLYTVQRPVTITLLGAVSGAGQLPWQAGRSVTDYLQDHPRLAGA 183
VRV E +NPPL G+YTL+ P T+TL G +S G P+ GR V YL L+GA
Sbjct 121 IVRVAERANPPLQGNYTLWVGPPPSTVTLFGLISRPGNQPFTPGRDVASYLSGQSLLSGA 180
Query 184 DKNNVMVITPEGETVVAPVALWNKRHVEPPPGSQLWLGFSAHVLPEKYADLNDQIVSVLT 243
D++ V+ P+G T APVA WNKRHVEP PGS +++G + V E LN I+ L
Sbjct 181 DRSYAWVVYPDGRTQKAPVAYWNKRHVEPMPGSIIYVGLADSVWSETPDALNADILQTLA 240
Query 244 QRVP 247
QR+P
Sbjct 241 QRIP 244
>ref|ZP_08350965.1| conserved hypothetical protein [Escherichia coli M605]
dbj|BAI57425.1| conserved hypothetical protein [Escherichia coli SE15]
gb|EGH36867.1| YjbG polysaccharide synthesis-related protein [Escherichia coli
AA86]
gb|EGI13421.1| conserved hypothetical protein [Escherichia coli M605]
gb|AEG39028.1| Hypothetical protein ECNA114_4184 [Escherichia coli NA114]
Length=245
Score = 186 bits (471), Expect = 4e-45, Method: Compositional matrix adjust.
Identities = 102/244 (41%), Positives = 147/244 (60%), Gaps = 1/244 (0%)
Query 4 LQSYFIASVLYVMTPHAFAQGTVTIYLPGEQQTLSVGPVENVVQLVTQPQLRDRLWWPGA 63
++ +A +L V FA GTV ++ G ++ E+++ LV QP+L + WWPGA
Sbjct 2 IKQTIVALILSVGASSVFAAGTVKVFSNGSSDAKTLTGAEHLIDLVGQPRLANS-WWPGA 60
Query 64 LLTDSAAKAKALKDYQHVMAQLASWEAEADDDVAATIKSVRQQLLNLNITGRLPVKLDPD 123
++++ A A AL+ Q +M ++A A++ D AA I ++RQQ+ L +TGR + LDPD
Sbjct 61 VISEELATAAALRQQQALMTRMAELAADSSADDAAAINALRQQIQALKVTGRQKINLDPD 120
Query 124 FVRVDENSNPPLVGDYTLYTVQRPVTITLLGAVSGAGQLPWQAGRSVTDYLQDHPRLAGA 183
VRV E NPPL G+YTL+ P T+TL G +S G+ + GR V YL D L+GA
Sbjct 121 IVRVAERGNPPLQGNYTLWVGPPPSTVTLFGLISRPGKQSFTPGRDVASYLSDQSLLSGA 180
Query 184 DKNNVMVITPEGETVVAPVALWNKRHVEPPPGSQLWLGFSAHVLPEKYADLNDQIVSVLT 243
D++ V+ P+G T APVA WNKRHVEP PGS +++G + V E LN I+ LT
Sbjct 181 DRSYAWVVYPDGRTQKAPVAYWNKRHVEPMPGSIIYVGLADSVWSETPDALNADILQTLT 240
Query 244 QRVP 247
QR+P
Sbjct 241 QRIP 244
>ref|YP_001746417.1| hypothetical protein EcSMS35_4489 [Escherichia coli SMS-3-5]
gb|ACB18208.1| conserved hypothetical protein [Escherichia coli SMS-3-5]
Length=245
Score = 185 bits (470), Expect = 4e-45, Method: Compositional matrix adjust.
Identities = 102/244 (41%), Positives = 147/244 (60%), Gaps = 1/244 (0%)
Query 4 LQSYFIASVLYVMTPHAFAQGTVTIYLPGEQQTLSVGPVENVVQLVTQPQLRDRLWWPGA 63
++ +A +L FA GTV ++ G + ++ E+++ LV QPQL + WWPGA
Sbjct 2 IKQTIVALLLSAGASSVFAAGTVKVFSNGSSEAKTLTGAEHLIDLVGQPQLANS-WWPGA 60
Query 64 LLTDSAAKAKALKDYQHVMAQLASWEAEADDDVAATIKSVRQQLLNLNITGRLPVKLDPD 123
++++ A A AL+ Q ++ +LA A++ D AA I ++RQQ+ L +TGR + LDPD
Sbjct 61 VISEELATAAALRQQQALLTRLAELAADSSADDAAAINALRQQIQALKVTGRQKINLDPD 120
Query 124 FVRVDENSNPPLVGDYTLYTVQRPVTITLLGAVSGAGQLPWQAGRSVTDYLQDHPRLAGA 183
VRV E NPPL G+YTL+ P T+TL G +S G+ P+ GR V YL L+GA
Sbjct 121 IVRVAERGNPPLQGNYTLWVGPPPSTVTLFGLISRPGKQPFTPGRDVASYLSGQNLLSGA 180
Query 184 DKNNVMVITPEGETVVAPVALWNKRHVEPPPGSQLWLGFSAHVLPEKYADLNDQIVSVLT 243
D++ V+ P+G T APVA WNKRHVEP PGS +++G + V E LN I+ LT
Sbjct 181 DRSYAWVVYPDGRTQKAPVAYWNKRHVEPMPGSIIYVGLADSVWSETSDALNADILQTLT 240
Query 244 QRVP 247
QR+P
Sbjct 241 QRIP 244
>ref|ZP_08356834.1| conserved hypothetical protein [Escherichia coli M718]
gb|EGI18290.1| conserved hypothetical protein [Escherichia coli M718]
Length=245
Score = 185 bits (469), Expect = 6e-45, Method: Compositional matrix adjust.
Identities = 103/244 (42%), Positives = 148/244 (60%), Gaps = 1/244 (0%)
Query 4 LQSYFIASVLYVMTPHAFAQGTVTIYLPGEQQTLSVGPVENVVQLVTQPQLRDRLWWPGA 63
++ +A +L V T FA GTV ++ G + ++ E+++ LV QP+L + WWPGA
Sbjct 2 IKQTIVALLLSVGTSSVFAAGTVKVFSNGSGEAKTLTGAEHLIDLVGQPRLANS-WWPGA 60
Query 64 LLTDSAAKAKALKDYQHVMAQLASWEAEADDDVAATIKSVRQQLLNLNITGRLPVKLDPD 123
++++ A A AL+ Q ++ +LA A++ D AA I ++RQQ+ L +TGR + LDPD
Sbjct 61 VISEELATAAALRQQQALLTRLAELAADSSADDAAAINALRQQIQALKVTGRQKINLDPD 120
Query 124 FVRVDENSNPPLVGDYTLYTVQRPVTITLLGAVSGAGQLPWQAGRSVTDYLQDHPRLAGA 183
VRV E NPPL G+YTL+ P T+TL G +S G+ P+ GR V YL L+GA
Sbjct 121 IVRVAERGNPPLQGNYTLWVGPPPSTVTLFGLISRPGKQPFTPGRDVASYLSGQNLLSGA 180
Query 184 DKNNVMVITPEGETVVAPVALWNKRHVEPPPGSQLWLGFSAHVLPEKYADLNDQIVSVLT 243
D++ V+ P G T APVA WNKRHVEP PGS +++G + V E LN I+ LT
Sbjct 181 DRSYAWVVYPHGRTQKAPVAYWNKRHVEPMPGSIIYVGLADSVWSEMPDALNADILQTLT 240
Query 244 QRVP 247
QR+P
Sbjct 241 QRIP 244
>ref|YP_002415168.1| hypothetical protein ECUMN_4561 [Escherichia coli UMN026]
ref|ZP_06646764.1| YjbG polysaccharide synthesis protein [Escherichia coli FVEC1412]
ref|ZP_06988080.1| YjbG polysaccharide synthesis protein [Escherichia coli FVEC1302]
ref|ZP_07118071.1| conserved hypothetical protein [Escherichia coli MS 198-1]
emb|CAR15678.1| conserved hypothetical protein [Escherichia coli UMN026]
gb|EFF02596.1| YjbG polysaccharide synthesis protein [Escherichia coli FVEC1412]
gb|EFI22031.1| YjbG polysaccharide synthesis protein [Escherichia coli FVEC1302]
gb|EFJ72462.1| conserved hypothetical protein [Escherichia coli MS 198-1]
Length=245
Score = 185 bits (469), Expect = 6e-45, Method: Compositional matrix adjust.
Identities = 102/244 (41%), Positives = 148/244 (60%), Gaps = 1/244 (0%)
Query 4 LQSYFIASVLYVMTPHAFAQGTVTIYLPGEQQTLSVGPVENVVQLVTQPQLRDRLWWPGA 63
++ +A +L V FA GTV ++ G + ++ E+++ LV QP+L + WWPGA
Sbjct 2 IKQTIVALILSVGASSVFAAGTVKVFSNGSSEAKTLTGAEHLIDLVGQPRLANS-WWPGA 60
Query 64 LLTDSAAKAKALKDYQHVMAQLASWEAEADDDVAATIKSVRQQLLNLNITGRLPVKLDPD 123
++++ A A AL+ Q ++ +LA A++ D AA I ++RQQ+ L +TGR + LDPD
Sbjct 61 VISEELATAAALRQQQALLTRLAEQGADSSADDAAAINALRQQIQALKVTGRQKINLDPD 120
Query 124 FVRVDENSNPPLVGDYTLYTVQRPVTITLLGAVSGAGQLPWQAGRSVTDYLQDHPRLAGA 183
VRV E NPPL G+YTL+ P T+TL G +S G+ P+ GR V YL L+GA
Sbjct 121 IVRVAERGNPPLQGNYTLWVGPPPSTVTLFGLISRPGKQPFTPGRDVASYLSGQNLLSGA 180
Query 184 DKNNVMVITPEGETVVAPVALWNKRHVEPPPGSQLWLGFSAHVLPEKYADLNDQIVSVLT 243
D++ V+ P+G T APVA WNKRHVEP PGS +++G + V E LN I+ LT
Sbjct 181 DRSYAWVVYPDGRTQKAPVAYWNKRHVEPMPGSIIYVGLADSVWSETPDALNADILQTLT 240
Query 244 QRVP 247
QR+P
Sbjct 241 QRIP 244
>ref|YP_004214318.1| hypothetical protein Rahaq_3601 [Rahnella sp. Y9602]
gb|ADW75191.1| protein of unknown function DUF1017 [Rahnella sp. Y9602]
Length=247
Score = 184 bits (468), Expect = 8e-45, Method: Compositional matrix adjust.
Identities = 102/245 (41%), Positives = 141/245 (57%), Gaps = 9/245 (3%)
Query 8 FIASVLYVMTPHAFAQGTVTIYLPGEQ----QTLSVGPVENVVQLVTQPQLRDRLWWPGA 63
FI S L M AF++G V +Y Q Q LS ++++ Q++ + L + W PG
Sbjct 8 FIFSALPGM---AFSEGNVAVYTSASQGQPAQVLS--HIKDMRQMMAESDLIRQSWSPGT 62
Query 64 LLTDSAAKAKALKDYQHVMAQLASWEAEADDDVAATIKSVRQQLLNLNITGRLPVKLDPD 123
++ AA +A + Y + QL +W A D TI V QQL L +TGR +D D
Sbjct 63 VIAVPAATPEAQQQYLSMQNQLKAWRATESGDTQQTINRVIQQLQGLQVTGRQFTPMDAD 122
Query 124 FVRVDENSNPPLVGDYTLYTVQRPVTITLLGAVSGAGQLPWQAGRSVTDYLQDHPRLAGA 183
+ + +N L GDY +YT RP ++ LLGAVSGAG+ PW AGR++ +YL DH L+GA
Sbjct 123 LILNNNAANRQLQGDYRVYTATRPNSVLLLGAVSGAGKQPWVAGRTIREYLADHQFLSGA 182
Query 184 DKNNVMVITPEGETVVAPVALWNKRHVEPPPGSQLWLGFSAHVLPEKYADLNDQIVSVLT 243
+ N +VI P G V PVA WN RH E PGS +W+GFS LP ++ +LN IVSVL
Sbjct 183 NLNEAVVIDPNGTIRVVPVAYWNYRHAEAQPGSIIWVGFSDWTLPRQFKNLNQHIVSVLA 242
Query 244 QRVPD 248
R+P+
Sbjct 243 HRIPE 247
>gb|EFX17993.1| hypothetical protein ECO2687_20949 [Escherichia coli O157:H-
str. H 2687]
Length=245
Score = 184 bits (468), Expect = 9e-45, Method: Compositional matrix adjust.
Identities = 102/244 (41%), Positives = 148/244 (60%), Gaps = 1/244 (0%)
Query 4 LQSYFIASVLYVMTPHAFAQGTVTIYLPGEQQTLSVGPVENVVQLVTQPQLRDRLWWPGA 63
++ +A +L V FA GTV ++ G + ++ E+++ LV QP+L + WWPGA
Sbjct 2 IKQTIVALLLSVRASSVFAAGTVKVFSNGSSEAKTLTGAEHLIDLVGQPRLANS-WWPGA 60
Query 64 LLTDSAAKAKALKDYQHVMAQLASWEAEADDDVAATIKSVRQQLLNLNITGRLPVKLDPD 123
++++ A A AL+ Q ++ +LA A++ D AA I ++RQQ+ L +TGR + LDPD
Sbjct 61 VISEELATAAALRQQQALLTRLAEQGADSSADDAAAINALRQQIQALKVTGRQKINLDPD 120
Query 124 FVRVDENSNPPLVGDYTLYTVQRPVTITLLGAVSGAGQLPWQAGRSVTDYLQDHPRLAGA 183
VRV E +NPPL G+YTL+ P T+TL G +S G P+ GR V YL L+GA
Sbjct 121 IVRVAERANPPLQGNYTLWVGPPPSTVTLFGLISRPGNQPFTPGRDVASYLSGQSLLSGA 180
Query 184 DKNNVMVITPEGETVVAPVALWNKRHVEPPPGSQLWLGFSAHVLPEKYADLNDQIVSVLT 243
D++ V+ P+G T APVA WNKRHVEP PGS +++G + V E LN I+ LT
Sbjct 181 DRSYAWVVYPDGRTQKAPVAYWNKRHVEPMPGSIIYVGLADSVWSEMPDALNADILQTLT 240
Query 244 QRVP 247
QR+P
Sbjct 241 QRIP 244
>gb|EGC12490.1| SLBB-domain-containing protein [Escherichia coli E1167]
Length=245
Score = 184 bits (468), Expect = 9e-45, Method: Compositional matrix adjust.
Identities = 102/244 (41%), Positives = 147/244 (60%), Gaps = 1/244 (0%)
Query 4 LQSYFIASVLYVMTPHAFAQGTVTIYLPGEQQTLSVGPVENVVQLVTQPQLRDRLWWPGA 63
++ +A +L V FA GTV ++ G + ++ E ++ LV QP+L + WWPGA
Sbjct 2 IKQTIVALILSVGASSVFAAGTVKVFSNGSSEAKTLTGAEYLIDLVGQPRLANS-WWPGA 60
Query 64 LLTDSAAKAKALKDYQHVMAQLASWEAEADDDVAATIKSVRQQLLNLNITGRLPVKLDPD 123
++++ A A AL+ Q ++ +LA A++ D AA I ++RQQ+ L +TGR + LDPD
Sbjct 61 VISEELATAAALRQQQALLTRLAELAADSSADDAAAINALRQQIQALKVTGRQKINLDPD 120
Query 124 FVRVDENSNPPLVGDYTLYTVQRPVTITLLGAVSGAGQLPWQAGRSVTDYLQDHPRLAGA 183
VRV E NPPL G+YTL+ P T+TL G +S G+ P+ GR V YL L+GA
Sbjct 121 IVRVAERGNPPLQGNYTLWVGPPPSTVTLFGLISRLGKQPFTPGRDVASYLSGQNLLSGA 180
Query 184 DKNNVMVITPEGETVVAPVALWNKRHVEPPPGSQLWLGFSAHVLPEKYADLNDQIVSVLT 243
D++ V+ P+G T APVA WNKRHVEP PGS +++G + V E LN I+ LT
Sbjct 181 DRSYAWVVYPDGRTQKAPVAYWNKRHVEPMPGSIIYVGLADSVWSETPDALNADILQTLT 240
Query 244 QRVP 247
QR+P
Sbjct 241 QRIP 244
>ref|ZP_08386356.1| conserved hypothetical protein [Escherichia coli H299]
gb|EGI48189.1| conserved hypothetical protein [Escherichia coli H299]
Length=245
Score = 184 bits (468), Expect = 9e-45, Method: Compositional matrix adjust.
Identities = 102/244 (41%), Positives = 148/244 (60%), Gaps = 1/244 (0%)
Query 4 LQSYFIASVLYVMTPHAFAQGTVTIYLPGEQQTLSVGPVENVVQLVTQPQLRDRLWWPGA 63
++ +A +L V FA GTV ++ G + ++ E+++ LV QP+L + WWPGA
Sbjct 2 IKQTIVALILSVGASSVFAAGTVKVFSNGSSEAKTLTGAEHLIDLVGQPRLANS-WWPGA 60
Query 64 LLTDSAAKAKALKDYQHVMAQLASWEAEADDDVAATIKSVRQQLLNLNITGRLPVKLDPD 123
++++ A A AL+ Q ++ +LA A++ D AA I ++RQQ+ L +TGR + LDPD
Sbjct 61 VISEELATAAALRQQQALLTRLAELAADSSADDAAAINALRQQIQALKVTGRQKINLDPD 120
Query 124 FVRVDENSNPPLVGDYTLYTVQRPVTITLLGAVSGAGQLPWQAGRSVTDYLQDHPRLAGA 183
VRV E NPPL G+YTL+ P T+TL G +S G+ P+ GR V YL L+GA
Sbjct 121 IVRVAERGNPPLQGNYTLWVGPPPSTVTLFGLISRPGKQPFIPGRDVASYLSGQNLLSGA 180
Query 184 DKNNVMVITPEGETVVAPVALWNKRHVEPPPGSQLWLGFSAHVLPEKYADLNDQIVSVLT 243
D++ V+ P+G T APVA WNKRHVEP PGS +++G + V E LN I+ LT
Sbjct 181 DRSYAWVVYPDGRTQKAPVAYWNKRHVEPMPGSIIYVGLADSVWSETPDALNADILQTLT 240
Query 244 QRVP 247
QR+P
Sbjct 241 QRIP 244
>ref|YP_002389495.1| hypothetical protein ECIAI1_4253 [Escherichia coli IAI1]
emb|CAR01003.1| conserved hypothetical protein [Escherichia coli IAI1]
Length=245
Score = 184 bits (467), Expect = 1e-44, Method: Compositional matrix adjust.
Identities = 102/244 (41%), Positives = 148/244 (60%), Gaps = 1/244 (0%)
Query 4 LQSYFIASVLYVMTPHAFAQGTVTIYLPGEQQTLSVGPVENVVQLVTQPQLRDRLWWPGA 63
++ +A +L V FA GTV ++ G + ++ E+++ LV QP+L + WWPGA
Sbjct 2 IKQTIVALLLSVGASSVFAAGTVKVFSNGSSEAKTLTGAEHLIDLVGQPRLANS-WWPGA 60
Query 64 LLTDSAAKAKALKDYQHVMAQLASWEAEADDDVAATIKSVRQQLLNLNITGRLPVKLDPD 123
++++ A A AL+ Q ++ +LA A++ D AA I ++RQQ+ L +TGR + LDPD
Sbjct 61 VISEELATAAALRQQQALLTRLAEQAADSSADDAAAINALRQQIQALKVTGRQKINLDPD 120
Query 124 FVRVDENSNPPLVGDYTLYTVQRPVTITLLGAVSGAGQLPWQAGRSVTDYLQDHPRLAGA 183
VRV E NPPL G+YTL+ P T+TL G +S G+ P+ GR V YL L+GA
Sbjct 121 IVRVAERGNPPLQGNYTLWVGPPPSTVTLFGLISRPGKQPFTPGRDVASYLSGQNLLSGA 180
Query 184 DKNNVMVITPEGETVVAPVALWNKRHVEPPPGSQLWLGFSAHVLPEKYADLNDQIVSVLT 243
D++ V+ P+G T APVA WNKRHVEP PGS +++G + V E LN I+ LT
Sbjct 181 DRSYAWVVYPDGRTQKAPVAYWNKRHVEPMPGSIIYVGLADSVWSETPDALNADILQTLT 240
Query 244 QRVP 247
QR+P
Sbjct 241 QRIP 244
>ref|ZP_03029269.1| conserved hypothetical protein [Escherichia coli B7A]
ref|YP_002405400.1| hypothetical protein EC55989_4516 [Escherichia coli 55989]
ref|ZP_06664741.1| hypothetical protein ECCG_02649 [Escherichia coli B088]
16 more sequence titles
Length=245
Score = 184 bits (467), Expect = 1e-44, Method: Compositional matrix adjust.
Identities = 102/244 (41%), Positives = 147/244 (60%), Gaps = 1/244 (0%)
Query 4 LQSYFIASVLYVMTPHAFAQGTVTIYLPGEQQTLSVGPVENVVQLVTQPQLRDRLWWPGA 63
++ +A +L V FA GTV ++ G + ++ E ++ LV QP+L + WWPGA
Sbjct 2 IKQTIVALILSVGASSVFAAGTVKVFSNGSSEAKTLTGAEYLIDLVGQPRLANS-WWPGA 60
Query 64 LLTDSAAKAKALKDYQHVMAQLASWEAEADDDVAATIKSVRQQLLNLNITGRLPVKLDPD 123
++++ A A AL+ Q ++ +LA A++ D AA I ++RQQ+ L +TGR + LDPD
Sbjct 61 VISEELATAAALRQQQALLTRLAELAADSSADDAAAINALRQQIQALKVTGRQKINLDPD 120
Query 124 FVRVDENSNPPLVGDYTLYTVQRPVTITLLGAVSGAGQLPWQAGRSVTDYLQDHPRLAGA 183
VRV E NPPL G+YTL+ P T+TL G +S G+ P+ GR V YL L+GA
Sbjct 121 IVRVAERGNPPLQGNYTLWVGPPPSTVTLFGLISRPGKQPFTPGRDVASYLSGQNLLSGA 180
Query 184 DKNNVMVITPEGETVVAPVALWNKRHVEPPPGSQLWLGFSAHVLPEKYADLNDQIVSVLT 243
D++ V+ P+G T APVA WNKRHVEP PGS +++G + V E LN I+ LT
Sbjct 181 DRSYAWVVYPDGRTQKAPVAYWNKRHVEPMPGSIIYVGLADSVWSETPDALNADILQTLT 240
Query 244 QRVP 247
QR+P
Sbjct 241 QRIP 244
>gb|EFZ75386.1| hypothetical protein ECRN5871_1899 [Escherichia coli RN587/1]
Length=245
Score = 184 bits (467), Expect = 1e-44, Method: Compositional matrix adjust.
Identities = 102/244 (41%), Positives = 146/244 (59%), Gaps = 1/244 (0%)
Query 4 LQSYFIASVLYVMTPHAFAQGTVTIYLPGEQQTLSVGPVENVVQLVTQPQLRDRLWWPGA 63
++ +A +L FA GTV ++ G + ++ E+++ LV QPQL + WWPGA
Sbjct 2 IKQTIVALLLSAGASSVFAAGTVKVFSNGSSEAKTLTGAEHLIDLVGQPQLANS-WWPGA 60
Query 64 LLTDSAAKAKALKDYQHVMAQLASWEAEADDDVAATIKSVRQQLLNLNITGRLPVKLDPD 123
++++ A A AL Q ++ +LA A++ D AA I ++RQQ+ L +TGR + LDPD
Sbjct 61 VISEELATAAALHQQQALLTRLAELAADSSADDAAAINALRQQIQALKVTGRQKINLDPD 120
Query 124 FVRVDENSNPPLVGDYTLYTVQRPVTITLLGAVSGAGQLPWQAGRSVTDYLQDHPRLAGA 183
VRV E NPPL G+YTL+ P T+TL G +S G+ + GR V YL D L+GA
Sbjct 121 IVRVAERGNPPLQGNYTLWVGTPPSTVTLFGLISRPGKQSFTPGRDVASYLSDQSLLSGA 180
Query 184 DKNNVMVITPEGETVVAPVALWNKRHVEPPPGSQLWLGFSAHVLPEKYADLNDQIVSVLT 243
D++ V+ P+G T APVA WNKRHVEP PGS +++G + V E LN I+ LT
Sbjct 181 DRSYAWVVYPDGRTQKAPVAYWNKRHVEPMPGSIIYVGLADSVWSETPDALNADILQTLT 240
Query 244 QRVP 247
QR+P
Sbjct 241 QRIP 244
>gb|EGC06312.1| SLBB-domain-containing protein [Escherichia fergusonii B253]
Length=245
Score = 184 bits (467), Expect = 1e-44, Method: Compositional matrix adjust.
Identities = 101/244 (41%), Positives = 147/244 (60%), Gaps = 1/244 (0%)
Query 4 LQSYFIASVLYVMTPHAFAQGTVTIYLPGEQQTLSVGPVENVVQLVTQPQLRDRLWWPGA 63
++ A ++ V FA GTV ++ G + ++ E+++ LV QP+L + WWPGA
Sbjct 2 IKQTIAAMIMSVGASSVFAAGTVKVFSNGSNEAKTLTGAEHLIDLVGQPRLANS-WWPGA 60
Query 64 LLTDSAAKAKALKDYQHVMAQLASWEAEADDDVAATIKSVRQQLLNLNITGRLPVKLDPD 123
++++ A A AL+ Q ++ +LA A++ D AA I ++RQQ+ L +TGR + LDPD
Sbjct 61 VISEELATAAALRQQQALLTRLAELAADSSADDAAAINALRQQIQALKVTGRQKINLDPD 120
Query 124 FVRVDENSNPPLVGDYTLYTVQRPVTITLLGAVSGAGQLPWQAGRSVTDYLQDHPRLAGA 183
VRV E NPPL G+YTL+ P T+TL G +S G+ P+ GR V YL L+GA
Sbjct 121 IVRVAERGNPPLQGNYTLWVGPPPSTVTLFGLISRPGKQPFTPGRDVASYLSGQSLLSGA 180
Query 184 DKNNVMVITPEGETVVAPVALWNKRHVEPPPGSQLWLGFSAHVLPEKYADLNDQIVSVLT 243
D++ V+ P+G T APVA WNKRHVEP PGS +++G + V E LN I+ LT
Sbjct 181 DRSYAWVVYPDGRTQKAPVAYWNKRHVEPMPGSIIYVGLADSVWSETPDALNADILQTLT 240
Query 244 QRVP 247
QR+P
Sbjct 241 QRIP 244
>ref|ZP_02805425.1| conserved hypothetical protein [Escherichia coli O157:H7 str.
EC4076]
ref|ZP_03081176.1| hypothetical protein EscherichcoliO157_04915 [Escherichia coli
O157:H7 str. EC4024]
ref|ZP_03248902.1| conserved hypothetical protein [Escherichia coli O157:H7 str.
EC4206]
ref|ZP_03260742.1| conserved hypothetical protein [Escherichia coli O157:H7 str.
EC4042]
ref|YP_003080865.1| hypothetical protein ECSP_5104 [Escherichia coli O157:H7 str.
TW14359]
ref|YP_003502263.1| hypothetical protein G2583_4853 [Escherichia coli O55:H7 str.
CB9615]
gb|EDU70838.1| conserved hypothetical protein [Escherichia coli O157:H7 str.
EC4076]
gb|EDZ75967.1| conserved hypothetical protein [Escherichia coli O157:H7 str.
EC4206]
gb|EDZ88227.1| conserved hypothetical protein [Escherichia coli O157:H7 str.
EC4042]
gb|ACI74041.1| hypothetical protein ECs5011 [Escherichia coli]
gb|ACI74043.1| hypothetical protein ECs5011 [Escherichia coli]
gb|ACI74044.1| hypothetical protein ECs5011 [Escherichia coli]
gb|ACI74045.1| hypothetical protein ECs5011 [Escherichia coli]
gb|ACT74789.1| conserved protein [Escherichia coli O157:H7 str. TW14359]
gb|ADD59279.1| hypothetical protein G2583_4853 [Escherichia coli O55:H7 str.
CB9615]
gb|EFX08428.1| hypothetical protein ECO5101_15931 [Escherichia coli O157:H7
str. G5101]
gb|EFX13215.1| hypothetical protein ECO9389_13358 [Escherichia coli O157:H-
str. 493-89]
gb|EFX22826.1| hypothetical protein ECO7815_22192 [Escherichia coli O55:H7 str.
3256-97 TW 07815]
gb|EFX28170.1| hypothetical protein ECO5905_03803 [Escherichia coli O55:H7 str.
USDA 5905]
gb|EFX32679.1| hypothetical protein ECOSU61_10673 [Escherichia coli O157:H7
str. LSU-61]
Length=245
Score = 184 bits (467), Expect = 1e-44, Method: Compositional matrix adjust.
Identities = 102/244 (41%), Positives = 148/244 (60%), Gaps = 1/244 (0%)
Query 4 LQSYFIASVLYVMTPHAFAQGTVTIYLPGEQQTLSVGPVENVVQLVTQPQLRDRLWWPGA 63
++ +A +L V FA GTV ++ G + ++ E+++ LV QP+L + WWPGA
Sbjct 2 IKQTIVALLLSVGASSVFAAGTVKVFSNGSSEAKTLTGAEHLIDLVGQPRLANS-WWPGA 60
Query 64 LLTDSAAKAKALKDYQHVMAQLASWEAEADDDVAATIKSVRQQLLNLNITGRLPVKLDPD 123
++++ A A AL+ Q ++ +LA A++ D AA I ++RQQ+ L +TGR + LDPD
Sbjct 61 VISEELATAAALRQQQALLTRLAEQGADSSADDAAAINALRQQIQALKVTGRQKINLDPD 120
Query 124 FVRVDENSNPPLVGDYTLYTVQRPVTITLLGAVSGAGQLPWQAGRSVTDYLQDHPRLAGA 183
VRV E +NPPL G+YTL+ P T+TL G +S G P+ GR V YL L+GA
Sbjct 121 IVRVAERANPPLQGNYTLWVGPPPSTVTLFGLISRPGNQPFTPGRDVASYLSGQSLLSGA 180
Query 184 DKNNVMVITPEGETVVAPVALWNKRHVEPPPGSQLWLGFSAHVLPEKYADLNDQIVSVLT 243
D++ V+ P+G T APVA WNKRHVEP PGS +++G + V E LN I+ LT
Sbjct 181 DRSYAWVVYPDGRTQKAPVAYWNKRHVEPMPGSIIYVGLADSVWSEMPDALNADILQTLT 240
Query 244 QRVP 247
QR+P
Sbjct 241 QRIP 244
>ref|NP_756848.1| hypothetical protein c4996 [Escherichia coli CFT073]
ref|YP_672097.1| hypothetical protein ECP_4246 [Escherichia coli 536]
ref|ZP_03033573.1| conserved hypothetical protein [Escherichia coli F11]
25 more sequence titles
Length=245
Score = 184 bits (467), Expect = 1e-44, Method: Compositional matrix adjust.
Identities = 101/244 (41%), Positives = 147/244 (60%), Gaps = 1/244 (0%)
Query 4 LQSYFIASVLYVMTPHAFAQGTVTIYLPGEQQTLSVGPVENVVQLVTQPQLRDRLWWPGA 63
++ +A +L FA GTV ++ G + ++ E+++ LV QP+L + WWPGA
Sbjct 2 IKQTIVALLLSAGASSVFAAGTVKVFSNGSSEAKTLTGAEHLIDLVGQPRLANS-WWPGA 60
Query 64 LLTDSAAKAKALKDYQHVMAQLASWEAEADDDVAATIKSVRQQLLNLNITGRLPVKLDPD 123
++++ A A AL+ Q ++ +LA A++ D AA I ++RQQ+ L +TGR + LDPD
Sbjct 61 VISEELATAAALRQQQALLTRLAELAADSSADDAAAINALRQQIQALKVTGRQKINLDPD 120
Query 124 FVRVDENSNPPLVGDYTLYTVQRPVTITLLGAVSGAGQLPWQAGRSVTDYLQDHPRLAGA 183
VRV E NPPL G+YTL+ P T+TL G +S G+ + GR V YL D L+GA
Sbjct 121 IVRVAERGNPPLQGNYTLWVGTPPSTVTLFGLISRPGKQSFTPGRDVASYLSDQSLLSGA 180
Query 184 DKNNVMVITPEGETVVAPVALWNKRHVEPPPGSQLWLGFSAHVLPEKYADLNDQIVSVLT 243
D++ V+ P+G T APVA WNKRHVEP PGS +++G + V E LN I+ LT
Sbjct 181 DRSYAWVVYPDGRTQKAPVAYWNKRHVEPMPGSIIYVGLADSVWSETPDALNADILQTLT 240
Query 244 QRVP 247
QR+P
Sbjct 241 QRIP 244
>ref|ZP_05433479.1| hypothetical protein ShiD9_11915 [Shigella sp. D9]
ref|ZP_08393156.1| conserved hypothetical protein [Shigella sp. D9]
gb|EGJ06441.1| conserved hypothetical protein [Shigella sp. D9]
Length=245
Score = 184 bits (466), Expect = 1e-44, Method: Compositional matrix adjust.
Identities = 102/244 (41%), Positives = 147/244 (60%), Gaps = 1/244 (0%)
Query 4 LQSYFIASVLYVMTPHAFAQGTVTIYLPGEQQTLSVGPVENVVQLVTQPQLRDRLWWPGA 63
++ +A +L V FA GTV ++ G + ++ E ++ LV QP+L + WWPGA
Sbjct 2 IKQTIVALILSVGASSVFAAGTVKVFSNGSSEAKTLTGAEYLIDLVGQPRLANS-WWPGA 60
Query 64 LLTDSAAKAKALKDYQHVMAQLASWEAEADDDVAATIKSVRQQLLNLNITGRLPVKLDPD 123
++++ A A AL+ Q ++ +LA A++ D AA I ++RQQ+ L +TGR + LDPD
Sbjct 61 VISEELATAAALRQQQALLTRLAEQGADSSADDAAAINALRQQIQALKVTGRQKINLDPD 120
Query 124 FVRVDENSNPPLVGDYTLYTVQRPVTITLLGAVSGAGQLPWQAGRSVTDYLQDHPRLAGA 183
VRV E NPPL G+YTL+ P T+TL G +S G+ P+ GR V YL L+GA
Sbjct 121 IVRVAERGNPPLQGNYTLWVGPPPSTVTLFGLISRPGKQPFTPGRDVASYLSGQNLLSGA 180
Query 184 DKNNVMVITPEGETVVAPVALWNKRHVEPPPGSQLWLGFSAHVLPEKYADLNDQIVSVLT 243
D++ V+ P+G T APVA WNKRHVEP PGS +++G + V E LN I+ LT
Sbjct 181 DRSYAWVVYPDGRTQKAPVAYWNKRHVEPMPGSIIYVGLADSVWSETPDALNADILQTLT 240
Query 244 QRVP 247
QR+P
Sbjct 241 QRIP 244
>ref|NP_290662.1| hypothetical protein Z5626 [Escherichia coli O157:H7 EDL933]
ref|NP_313038.1| hypothetical protein ECs5011 [Escherichia coli O157:H7 str. Sakai]
ref|ZP_03440396.1| conserved hypothetical protein [Escherichia coli O157:H7 str.
TW14588]
gb|AAG59227.1|AE005635_7 orf, hypothetical protein [Escherichia coli O157:H7 str. EDL933]
dbj|BAB38434.1| hypothetical protein [Escherichia coli O157:H7 str. Sakai]
gb|ACI74042.1| hypothetical protein ECs5011 [Escherichia coli]
gb|EEC28957.1| conserved hypothetical protein [Escherichia coli O157:H7 str.
TW14588]
gb|EGD70232.1| YjbG polysaccharide synthesis-related protein [Escherichia coli
O157:H7 str. 1044]
Length=245
Score = 184 bits (466), Expect = 1e-44, Method: Compositional matrix adjust.
Identities = 102/244 (41%), Positives = 147/244 (60%), Gaps = 1/244 (0%)
Query 4 LQSYFIASVLYVMTPHAFAQGTVTIYLPGEQQTLSVGPVENVVQLVTQPQLRDRLWWPGA 63
++ +A +L V FA GTV ++ G + ++ E+++ LV QP+L + WWPGA
Sbjct 2 IKQTIVALLLSVGASSVFAAGTVKVFSNGSSEAKTLTGAEHLIDLVGQPRLANS-WWPGA 60
Query 64 LLTDSAAKAKALKDYQHVMAQLASWEAEADDDVAATIKSVRQQLLNLNITGRLPVKLDPD 123
++++ A A AL+ Q ++ +LA A++ D AA I ++RQQ+ L +TGR + LDPD
Sbjct 61 VISEELATAAALRQQQALLTRLAEQGADSSADDAAAINALRQQIQALKVTGRQKINLDPD 120
Query 124 FVRVDENSNPPLVGDYTLYTVQRPVTITLLGAVSGAGQLPWQAGRSVTDYLQDHPRLAGA 183
VRV E +NPPL G+YTL+ P T+TL G +S G P+ GR V YL L GA
Sbjct 121 IVRVAERANPPLQGNYTLWVGPPPSTVTLFGLISRPGNQPFTPGRDVASYLSGQSLLGGA 180
Query 184 DKNNVMVITPEGETVVAPVALWNKRHVEPPPGSQLWLGFSAHVLPEKYADLNDQIVSVLT 243
D++ V+ P+G T APVA WNKRHVEP PGS +++G + V E LN I+ LT
Sbjct 181 DRSYAWVVYPDGRTQKAPVAYWNKRHVEPMPGSIIYVGLADSVWSEMPDALNADILQTLT 240
Query 244 QRVP 247
QR+P
Sbjct 241 QRIP 244
>ref|YP_405613.1| hypothetical protein SDY_4220 [Shigella dysenteriae Sd197]
ref|ZP_07678832.1| conserved hypothetical protein [Shigella dysenteriae 1617]
gb|ABB64122.1| conserved hypothetical protein [Shigella dysenteriae Sd197]
gb|EFP73168.1| conserved hypothetical protein [Shigella dysenteriae 1617]
Length=245
Score = 183 bits (465), Expect = 2e-44, Method: Compositional matrix adjust.
Identities = 102/244 (41%), Positives = 148/244 (60%), Gaps = 1/244 (0%)
Query 4 LQSYFIASVLYVMTPHAFAQGTVTIYLPGEQQTLSVGPVENVVQLVTQPQLRDRLWWPGA 63
++ +A +L V FA GTV ++ G + ++ E+++ LV QP+L + WWPGA
Sbjct 2 IKQTIVALLLSVGASSVFAAGTVKVFSNGSSEAKTLTGAEHLIDLVGQPRLANS-WWPGA 60
Query 64 LLTDSAAKAKALKDYQHVMAQLASWEAEADDDVAATIKSVRQQLLNLNITGRLPVKLDPD 123
++++ A A AL+ Q ++ +LA A++ D AA I ++RQQ+ L +TGR + LDPD
Sbjct 61 VISEELATAAALRQQQALLTRLAEQGADSSADDAAAINALRQQIQVLKVTGRQKINLDPD 120
Query 124 FVRVDENSNPPLVGDYTLYTVQRPVTITLLGAVSGAGQLPWQAGRSVTDYLQDHPRLAGA 183
VRV E +NPPL G+YTL+ P T+TL G +S G P+ GR V YL L+GA
Sbjct 121 IVRVAERANPPLQGNYTLWVGPPPSTVTLFGLISRPGNQPFTPGRDVASYLSGQSLLSGA 180
Query 184 DKNNVMVITPEGETVVAPVALWNKRHVEPPPGSQLWLGFSAHVLPEKYADLNDQIVSVLT 243
D++ V+ P+G T APVA WNKRHVEP PGS +++G + V E LN I+ LT
Sbjct 181 DRSYAWVVYPDGRTQKAPVAYWNKRHVEPMPGSIIYVGLADSVWSETPDALNADILQTLT 240
Query 244 QRVP 247
QR+P
Sbjct 241 QRIP 244
>ref|YP_002385133.1| hypothetical protein EFER_4120 [Escherichia fergusonii ATCC 35469]
emb|CAQ91542.1| conserved hypothetical protein [Escherichia fergusonii ATCC 35469]
Length=245
Score = 183 bits (465), Expect = 2e-44, Method: Compositional matrix adjust.
Identities = 101/244 (41%), Positives = 147/244 (60%), Gaps = 1/244 (0%)
Query 4 LQSYFIASVLYVMTPHAFAQGTVTIYLPGEQQTLSVGPVENVVQLVTQPQLRDRLWWPGA 63
++ A ++ V FA GTV ++ G + ++ E+++ LV QP+L + WWPGA
Sbjct 2 IKQTIAALIMSVGASSVFAAGTVKVFSNGSNEAKTLTGAEHLIDLVGQPRLANS-WWPGA 60
Query 64 LLTDSAAKAKALKDYQHVMAQLASWEAEADDDVAATIKSVRQQLLNLNITGRLPVKLDPD 123
++++ A A AL+ Q ++ +LA A++ D AA I ++RQQ+ L +TGR + LDPD
Sbjct 61 VISEELATAAALRQQQALLTRLAELAADSSADDAAAINALRQQIQALKVTGRQKINLDPD 120
Query 124 FVRVDENSNPPLVGDYTLYTVQRPVTITLLGAVSGAGQLPWQAGRSVTDYLQDHPRLAGA 183
VRV E NPPL G+YTL+ P T+TL G +S G+ P+ GR V YL L+GA
Sbjct 121 IVRVAERGNPPLQGNYTLWVGPPPSTVTLFGLISRPGKQPFTPGRDVASYLSGQSLLSGA 180
Query 184 DKNNVMVITPEGETVVAPVALWNKRHVEPPPGSQLWLGFSAHVLPEKYADLNDQIVSVLT 243
D++ V+ P+G T APVA WNKRHVEP PGS +++G + V E LN I+ LT
Sbjct 181 DRSYAWVVYPDGRTQKAPVAYWNKRHVEPMPGSIIYVGLADSVWSETPDALNADILQTLT 240
Query 244 QRVP 247
QR+P
Sbjct 241 QRIP 244
>ref|YP_219090.1| hypothetical protein SC4103 [Salmonella enterica subsp. enterica
serovar Choleraesuis str. SC-B67]
ref|YP_002639789.1| hypothetical protein SPC_4285 [Salmonella enterica subsp. enterica
serovar Paratyphi C strain RKS4594]
gb|AAX68009.1| putative periplasmic protein [Salmonella enterica subsp. enterica
serovar Choleraesuis str. SC-B67]
gb|ACN48348.1| hypothetical protein SPC_4285 [Salmonella enterica subsp. enterica
serovar Paratyphi C strain RKS4594]
gb|EFZ08743.1| Uncharacterized protein yjbG [Salmonella enterica subsp. enterica
serovar Choleraesuis str. SCSA50]
Length=245
Score = 183 bits (465), Expect = 2e-44, Method: Compositional matrix adjust.
Identities = 95/239 (39%), Positives = 140/239 (58%), Gaps = 1/239 (0%)
Query 10 ASVLYVMTPHAFAQGTVTIYLPGEQQTLSVGPVENVVQLVTQPQLRDRLWWPGALLTDSA 69
A L + AFA GTVT++ G + ++ E ++ LV QP+L WWP A++ +
Sbjct 8 ALALAFIASSAFASGTVTVFTQGNSEPKTLTDAERLLDLVGQPRLATS-WWPAAVIGEEQ 66
Query 70 AKAKALKDYQHVMAQLASWEAEADDDVAATIKSVRQQLLNLNITGRLPVKLDPDFVRVDE 129
A + Q ++ +LA+ AE D D A I ++R+Q+ + +TGR V LDPD VRV E
Sbjct 67 TTVTARQQQQELLGRLAALSAEEDGDAAGAINTLRRQIQAVKVTGRQFVNLDPDVVRVSE 126
Query 130 NSNPPLVGDYTLYTVQRPVTITLLGAVSGAGQLPWQAGRSVTDYLQDHPRLAGADKNNVM 189
NPPL G YTL+ +P +TL G +S G P+ GR V YL+D L+GAD++
Sbjct 127 RGNPPLQGHYTLWVGPQPTQVTLFGLISQPGSQPFVPGRDVASYLEDQRLLSGADRSYAW 186
Query 190 VITPEGETVVAPVALWNKRHVEPPPGSQLWLGFSAHVLPEKYADLNDQIVSVLTQRVPD 248
V+ P+G + APVA WNKRH+EP PGS +++GF+ + +N I+ LTQR+P+
Sbjct 187 VVYPDGRSQKAPVAYWNKRHIEPMPGSIIFVGFADSLWRGTPEAMNADILHTLTQRIPE 245
>ref|YP_003943655.1| hypothetical protein Entcl_4138 [Enterobacter cloacae SCF1]
gb|ADO50371.1| protein of unknown function DUF1017 [Enterobacter cloacae SCF1]
Length=246
Score = 182 bits (463), Expect = 3e-44, Method: Compositional matrix adjust.
Identities = 97/248 (39%), Positives = 151/248 (60%), Gaps = 3/248 (1%)
Query 1 MNKLQSYFIASVLYVMTPHAFAQGTVTIYLPGEQQTLSVGPVENVVQLVTQPQLRDRLWW 60
MNKL + F+ L + ++A GTV +Y+ G Q ++ ++ LV QP+L WW
Sbjct 1 MNKLPALFL--TLGMAAAPSWASGTVDVYMNGATQPKTLADAARLIDLVEQPRLAGS-WW 57
Query 61 PGALLTDSAAKAKALKDYQHVMAQLASWEAEADDDVAATIKSVRQQLLNLNITGRLPVKL 120
PGA++ A AL+ Q ++++LA+ A ++ D AA I ++RQQL + + GR + L
Sbjct 58 PGAVIAAQPQTAVALQQKQALLSRLATLAARSNGDDAAAINALRQQLQAVRVVGRQFISL 117
Query 121 DPDFVRVDENSNPPLVGDYTLYTVQRPVTITLLGAVSGAGQLPWQAGRSVTDYLQDHPRL 180
DPD VR + +NPPL G Y+L+ +P T+TL G +S G++ + GR + YL D L
Sbjct 118 DPDQVRAGQLNNPPLEGKYSLWVGPQPGTVTLFGLISRPGKVAFTPGRDIASYLDDVSLL 177
Query 181 AGADKNNVMVITPEGETVVAPVALWNKRHVEPPPGSQLWLGFSAHVLPEKYADLNDQIVS 240
+GA +++ VI P+G T APVA WNKRHVEP PGS +++GF+ + +Y +LN ++
Sbjct 178 SGAGRSDAWVIYPDGRTEKAPVAYWNKRHVEPMPGSTIFVGFADALWTTQYDELNADVLR 237
Query 241 VLTQRVPD 248
L QR+P+
Sbjct 238 ALAQRIPE 245
>gb|EFZ53016.1| hypothetical protein SS53G_2453 [Shigella sonnei 53G]
Length=220
Score = 182 bits (462), Expect = 4e-44, Method: Compositional matrix adjust.
Identities = 95/220 (43%), Positives = 136/220 (61%), Gaps = 1/220 (0%)
Query 28 IYLPGEQQTLSVGPVENVVQLVTQPQLRDRLWWPGALLTDSAAKAKALKDYQHVMAQLAS 87
++ G + ++ E+++ LV QP+L + WWPGA++++ A A AL+ Q ++ +LA
Sbjct 1 MFSNGSSEAKTLTGAEHLIDLVGQPRLANS-WWPGAVISEELATAAALRQQQALLTRLAE 59
Query 88 WEAEADDDVAATIKSVRQQLLNLNITGRLPVKLDPDFVRVDENSNPPLVGDYTLYTVQRP 147
A++ D AA I ++RQQ+ L +TGR + LDPD VRV E NPPL G+YTL+ P
Sbjct 60 QGADSSTDDAAAINALRQQIQALEVTGRQKINLDPDIVRVAERGNPPLQGNYTLWVGPPP 119
Query 148 VTITLLGAVSGAGQLPWQAGRSVTDYLQDHPRLAGADKNNVMVITPEGETVVAPVALWNK 207
T+TL G +S G P+ GR V YL D L+GAD++ V+ P+G T APVA WNK
Sbjct 120 STVTLFGLISHPGNQPFTPGRDVASYLSDQSLLSGADRSYAWVVYPDGRTQKAPVAYWNK 179
Query 208 RHVEPPPGSQLWLGFSAHVLPEKYADLNDQIVSVLTQRVP 247
RHVEP PGS +++G + V E LN I+ LTQR+P
Sbjct 180 RHVEPMPGSIIYVGLADSVWSETPDALNADILQTLTQRIP 219
>gb|EFS13220.1| uncharacterized protein gfcC [Shigella flexneri 2a str. 2457T]
gb|EGK33796.1| hypothetical protein SFK227_4042 [Shigella flexneri K-227]
Length=220
Score = 182 bits (462), Expect = 4e-44, Method: Compositional matrix adjust.
Identities = 95/220 (43%), Positives = 136/220 (61%), Gaps = 1/220 (0%)
Query 28 IYLPGEQQTLSVGPVENVVQLVTQPQLRDRLWWPGALLTDSAAKAKALKDYQHVMAQLAS 87
++ G + ++ E+++ LV QP+L + WWPGA++++ A A AL+ Q ++ +LA
Sbjct 1 MFSNGSSEAKTLTGAEHLIDLVGQPRLANS-WWPGAVISEELATAAALRQQQALLTRLAE 59
Query 88 WEAEADDDVAATIKSVRQQLLNLNITGRLPVKLDPDFVRVDENSNPPLVGDYTLYTVQRP 147
A++ D AA I ++RQQ+ L +TGR + LDPD VRV E NPPL G+YTL+ P
Sbjct 60 QGADSSTDDAAAINALRQQIQALKVTGRQKINLDPDIVRVAERGNPPLQGNYTLWVGPPP 119
Query 148 VTITLLGAVSGAGQLPWQAGRSVTDYLQDHPRLAGADKNNVMVITPEGETVVAPVALWNK 207
T+TL G +S G P+ GR V YL D L+GAD++ V+ P+G T APVA WNK
Sbjct 120 STVTLFGLISHPGNQPFTPGRDVASYLSDQSLLSGADRSYAWVVYPDGRTQKAPVAYWNK 179
Query 208 RHVEPPPGSQLWLGFSAHVLPEKYADLNDQIVSVLTQRVP 247
RHVEP PGS +++G + V E LN I+ LTQR+P
Sbjct 180 RHVEPMPGSIIYVGLADSVWSETPDALNADILQTLTQRIP 219
>ref|ZP_02685432.1| conserved hypothetical protein [Salmonella enterica subsp. enterica
serovar Hadar str. RI_05P066]
gb|EDZ34654.1| conserved hypothetical protein [Salmonella enterica subsp. enterica
serovar Hadar str. RI_05P066]
Length=245
Score = 182 bits (462), Expect = 4e-44, Method: Compositional matrix adjust.
Identities = 95/239 (39%), Positives = 141/239 (58%), Gaps = 1/239 (0%)
Query 10 ASVLYVMTPHAFAQGTVTIYLPGEQQTLSVGPVENVVQLVTQPQLRDRLWWPGALLTDSA 69
A L + AFA GTVT++ G + ++ E ++ LV QP+L WWP A++ +
Sbjct 8 ALALAFIASSAFASGTVTVFTQGNSEPKTLTDAERLLDLVGQPRLATS-WWPAAVIGEEQ 66
Query 70 AKAKALKDYQHVMAQLASWEAEADDDVAATIKSVRQQLLNLNITGRLPVKLDPDFVRVDE 129
A A + Q ++ +LA+ AE + D AA I ++R+Q+ + +TGR V LDPD VRV E
Sbjct 67 ATVTARQQQQELLGRLAALSAEENGDAAAAINTLRRQIQAVKVTGRQFVNLDPDVVRVSE 126
Query 130 NSNPPLVGDYTLYTVQRPVTITLLGAVSGAGQLPWQAGRSVTDYLQDHPRLAGADKNNVM 189
NPPL G YTL+ +P +TL G +S G P+ GR V YL+ L+GAD++
Sbjct 127 RGNPPLQGHYTLWVGPQPTQVTLFGLISQPGSQPFVPGRDVASYLEGQRLLSGADRSYAW 186
Query 190 VITPEGETVVAPVALWNKRHVEPPPGSQLWLGFSAHVLPEKYADLNDQIVSVLTQRVPD 248
V+ P+G + APVA WNKRH+EP PGS +++GF+ + +N I+ LTQR+P+
Sbjct 187 VVYPDGRSQKAPVAYWNKRHIEPMPGSIIFVGFADSLWRGTPEAMNADILHTLTQRIPE 245
>ref|ZP_03059326.1| conserved hypothetical protein [Escherichia coli B171]
ref|YP_003224603.1| hypothetical protein ECO103_4776 [Escherichia coli O103:H2 str.
12009]
gb|EDX31378.1| conserved hypothetical protein [Escherichia coli B171]
dbj|BAI33469.1| conserved predicted protein [Escherichia coli O103:H2 str. 12009]
gb|EFZ47328.1| hypothetical protein ECE128010_2426 [Escherichia coli E128010]
Length=245
Score = 182 bits (461), Expect = 5e-44, Method: Compositional matrix adjust.
Identities = 101/244 (41%), Positives = 146/244 (59%), Gaps = 1/244 (0%)
Query 4 LQSYFIASVLYVMTPHAFAQGTVTIYLPGEQQTLSVGPVENVVQLVTQPQLRDRLWWPGA 63
++ +A +L V FA GTV ++ G + ++ E ++ LV QP+L + WWPGA
Sbjct 2 IKQTIVALILSVGASSVFAAGTVKVFSNGSSEAKTLTGAEYLIDLVGQPRLANS-WWPGA 60
Query 64 LLTDSAAKAKALKDYQHVMAQLASWEAEADDDVAATIKSVRQQLLNLNITGRLPVKLDPD 123
++++ A A A + Q ++ +LA A++ D AA I ++RQQ+ L +TGR + LDPD
Sbjct 61 VISEELATAAASRQQQALLTRLAELAADSSADDAAAINALRQQIQALKVTGRQKINLDPD 120
Query 124 FVRVDENSNPPLVGDYTLYTVQRPVTITLLGAVSGAGQLPWQAGRSVTDYLQDHPRLAGA 183
VRV E NPPL G+YTL+ P T+TL G +S G+ P+ GR V YL L+GA
Sbjct 121 IVRVAERGNPPLQGNYTLWVGPPPSTVTLFGLISRPGKQPFTPGRDVASYLSGQNLLSGA 180
Query 184 DKNNVMVITPEGETVVAPVALWNKRHVEPPPGSQLWLGFSAHVLPEKYADLNDQIVSVLT 243
D++ V+ P+G T APVA WNKRHVEP PGS +++G + V E LN I+ LT
Sbjct 181 DRSYAWVVYPDGRTQKAPVAYWNKRHVEPMPGSIIYVGLADSVWSETPDALNADILQTLT 240
Query 244 QRVP 247
QR+P
Sbjct 241 QRIP 244
>ref|ZP_02809628.1| conserved hypothetical protein [Escherichia coli O157:H7 str.
EC869]
ref|ZP_05937737.1| hypothetical protein EscherichiacoliO157_02451 [Escherichia coli
O157:H7 str. FRIK2000]
ref|ZP_05949524.1| hypothetical protein EscherichiacoliO157EcO_14370 [Escherichia
coli O157:H7 str. FRIK966]
gb|EDU93445.1| conserved hypothetical protein [Escherichia coli O157:H7 str.
EC869]
Length=245
Score = 182 bits (461), Expect = 6e-44, Method: Compositional matrix adjust.
Identities = 101/244 (41%), Positives = 147/244 (60%), Gaps = 1/244 (0%)
Query 4 LQSYFIASVLYVMTPHAFAQGTVTIYLPGEQQTLSVGPVENVVQLVTQPQLRDRLWWPGA 63
++ +A +L V FA GTV ++ G + ++ E+++ LV QP+L + WWPGA
Sbjct 2 IKQTIVALLLSVGASSVFAAGTVKVFSNGSSEAKTLTGAEHLIDLVGQPRLANS-WWPGA 60
Query 64 LLTDSAAKAKALKDYQHVMAQLASWEAEADDDVAATIKSVRQQLLNLNITGRLPVKLDPD 123
++++ A A AL+ Q ++ +LA A++ D AA I ++ QQ+ L +TGR + LDPD
Sbjct 61 VISEELATAAALRQQQALLTRLAEQGADSSADDAAAINALHQQIQALKVTGRQKINLDPD 120
Query 124 FVRVDENSNPPLVGDYTLYTVQRPVTITLLGAVSGAGQLPWQAGRSVTDYLQDHPRLAGA 183
VRV E +NPPL G+YTL+ P T+TL G +S G P+ GR V YL L+GA
Sbjct 121 IVRVAERANPPLQGNYTLWVGPPPSTVTLFGLISRPGNQPFTPGRDVASYLSGQSLLSGA 180
Query 184 DKNNVMVITPEGETVVAPVALWNKRHVEPPPGSQLWLGFSAHVLPEKYADLNDQIVSVLT 243
D++ V+ P+G T APVA WNKRHVEP PGS +++G + V E LN I+ LT
Sbjct 181 DRSYAWVVYPDGRTQKAPVAYWNKRHVEPMPGSIIYVGLADSVWSEMPDALNADILQTLT 240
Query 244 QRVP 247
QR+P
Sbjct 241 QRIP 244
>ref|YP_002410323.1| hypothetical protein ECIAI39_4450 [Escherichia coli IAI39]
emb|CAR20556.1| conserved hypothetical protein [Escherichia coli IAI39]
Length=245
Score = 181 bits (459), Expect = 1e-43, Method: Compositional matrix adjust.
Identities = 99/244 (40%), Positives = 145/244 (59%), Gaps = 1/244 (0%)
Query 4 LQSYFIASVLYVMTPHAFAQGTVTIYLPGEQQTLSVGPVENVVQLVTQPQLRDRLWWPGA 63
++ +A +L FA GTV ++ G + ++ E+++ LV QP+L + WWPGA
Sbjct 2 IKQTIVALLLSAGVSSVFAAGTVKVFSNGSSEAKTLTGAEHLIDLVGQPRLANS-WWPGA 60
Query 64 LLTDSAAKAKALKDYQHVMAQLASWEAEADDDVAATIKSVRQQLLNLNITGRLPVKLDPD 123
++++ A A AL+ Q ++ +LA A++ D A I ++RQQ+ L +TGR + LDPD
Sbjct 61 VISEELATAAALRQQQALLTRLAELTADSSADDADAINALRQQIQALKVTGRQKINLDPD 120
Query 124 FVRVDENSNPPLVGDYTLYTVQRPVTITLLGAVSGAGQLPWQAGRSVTDYLQDHPRLAGA 183
VRV E NPPL G+YTL+ P T+ L G +S G+ P+ GR V YL L+GA
Sbjct 121 IVRVAERGNPPLQGNYTLWVGPPPSTVMLFGLISRPGKQPFTPGRDVASYLSGQNLLSGA 180
Query 184 DKNNVMVITPEGETVVAPVALWNKRHVEPPPGSQLWLGFSAHVLPEKYADLNDQIVSVLT 243
D++ V+ P+G T APVA WNKRHVEP PGS +++G + V E LN I+ LT
Sbjct 181 DRSYAWVVYPDGRTQKAPVAYWNKRHVEPMPGSIIYVGLADSVWSETPDALNADILQTLT 240
Query 244 QRVP 247
QR+P
Sbjct 241 QRIP 244
>gb|EGB61141.1| SLBB-domain-containing protein [Escherichia coli M863]
gb|EGE62083.1| hypothetical protein ECSTEC7V_4766 [Escherichia coli STEC_7v]
Length=245
Score = 181 bits (458), Expect = 1e-43, Method: Compositional matrix adjust.
Identities = 100/244 (40%), Positives = 146/244 (59%), Gaps = 1/244 (0%)
Query 4 LQSYFIASVLYVMTPHAFAQGTVTIYLPGEQQTLSVGPVENVVQLVTQPQLRDRLWWPGA 63
++ A ++ V FA GTV ++ G + ++ E+++ LV QP+L + WWPGA
Sbjct 2 IKQTIAALIMSVGASSVFAAGTVKVFSNGSNEAKTLTGAEHLIDLVGQPRLANS-WWPGA 60
Query 64 LLTDSAAKAKALKDYQHVMAQLASWEAEADDDVAATIKSVRQQLLNLNITGRLPVKLDPD 123
++++ A A AL+ Q ++ +LA A++ D AA I ++RQQ+ L +TGR + LD D
Sbjct 61 VISEELATAAALRQQQALLTRLAELAADSSADDAAAINALRQQIQALKVTGRQKINLDSD 120
Query 124 FVRVDENSNPPLVGDYTLYTVQRPVTITLLGAVSGAGQLPWQAGRSVTDYLQDHPRLAGA 183
VRV E NPPL G+YTL+ P T+TL G +S G+ + GR V YL D L+GA
Sbjct 121 IVRVAERGNPPLQGNYTLWVGPPPSTVTLFGLISRPGKQSFTPGRDVASYLSDQSLLSGA 180
Query 184 DKNNVMVITPEGETVVAPVALWNKRHVEPPPGSQLWLGFSAHVLPEKYADLNDQIVSVLT 243
D++ V+ P+G T APVA WNKRHVEP PGS +++G + V E LN I+ LT
Sbjct 181 DRSYAWVVYPDGRTQKAPVAYWNKRHVEPMPGSIIYVGLADSVWSETPDALNADILQTLT 240
Query 244 QRVP 247
QR+P
Sbjct 241 QRIP 244
>ref|ZP_07185547.1| conserved hypothetical protein [Escherichia coli MS 69-1]
gb|EFJ81563.1| conserved hypothetical protein [Escherichia coli MS 69-1]
Length=245
Score = 181 bits (458), Expect = 1e-43, Method: Compositional matrix adjust.
Identities = 100/240 (41%), Positives = 144/240 (60%), Gaps = 1/240 (0%)
Query 8 FIASVLYVMTPHAFAQGTVTIYLPGEQQTLSVGPVENVVQLVTQPQLRDRLWWPGALLTD 67
+A +L V FA GTV ++ G + ++ E+++ LV QP+L + WWPGA++++
Sbjct 6 IVALILSVGASSVFAAGTVKVFSNGSSEAKTLTGAEHLIDLVGQPRLANS-WWPGAVISE 64
Query 68 SAAKAKALKDYQHVMAQLASWEAEADDDVAATIKSVRQQLLNLNITGRLPVKLDPDFVRV 127
A A AL+ Q ++ +LA A++ D AA I ++RQQ+ L +TGR + LDPD VRV
Sbjct 65 ELATAAALRQQQALLTRLAELAADSSADDAAAINALRQQIQALKVTGRQKINLDPDIVRV 124
Query 128 DENSNPPLVGDYTLYTVQRPVTITLLGAVSGAGQLPWQAGRSVTDYLQDHPRLAGADKNN 187
E NPPL G+YTL+ P T+ L G +S G+ P+ R V YL L+GAD++
Sbjct 125 AERGNPPLQGNYTLWVGPPPSTVMLFGLISRPGKQPFTPSRDVASYLSGQNLLSGADRSY 184
Query 188 VMVITPEGETVVAPVALWNKRHVEPPPGSQLWLGFSAHVLPEKYADLNDQIVSVLTQRVP 247
V+ P+G T APVA WNKRHVEP PGS +++G + V E LN I+ LTQR+P
Sbjct 185 AWVVYPDGRTQKAPVAYWNKRHVEPMPGSIIYVGLADSVWSETPDALNADILQTLTQRIP 244
>ref|ZP_03217856.1| conserved hypothetical protein [Salmonella enterica subsp. enterica
serovar Javiana str. GA_MM04042433]
gb|EDZ08514.1| conserved hypothetical protein [Salmonella enterica subsp. enterica
serovar Javiana str. GA_MM04042433]
Length=244
Score = 180 bits (457), Expect = 2e-43, Method: Compositional matrix adjust.
Identities = 97/239 (40%), Positives = 141/239 (58%), Gaps = 2/239 (0%)
Query 10 ASVLYVMTPHAFAQGTVTIYLPGEQQTLSVGPVENVVQLVTQPQLRDRLWWPGALLTDSA 69
A L + AFA GTVT++ G + ++ E ++ LV QP+L WWP A++ +
Sbjct 8 ALALAFVASSAFASGTVTVFTQGNSEPKTLTDAERLLDLVGQPRLATS-WWPAAVIGEEQ 66
Query 70 AKAKALKDYQHVMAQLASWEAEADDDVAATIKSVRQQLLNLNITGRLPVKLDPDFVRVDE 129
A A + Q ++ +LA+ AE D D AA I ++R+Q+ L +TGR V LDPD VRV E
Sbjct 67 ATVTARQQQQELLGRLAALSAEEDGDAAA-INTLRRQIQALKVTGRQFVNLDPDVVRVSE 125
Query 130 NSNPPLVGDYTLYTVQRPVTITLLGAVSGAGQLPWQAGRSVTDYLQDHPRLAGADKNNVM 189
NPPL G YTL+ +P +TL G +S G P+ GR V YL+ L+GAD++
Sbjct 126 RGNPPLQGHYTLWVGPQPTQVTLFGLISQPGSQPFVPGRDVASYLEGQRLLSGADRSYAW 185
Query 190 VITPEGETVVAPVALWNKRHVEPPPGSQLWLGFSAHVLPEKYADLNDQIVSVLTQRVPD 248
V+ P+G + APVA WNKRH+EP PGS +++GF+ + +N I+ LTQR+P+
Sbjct 186 VVYPDGRSQKAPVAYWNKRHIEPMPGSIIFVGFADSLWRGTPEAINADILHTLTQRIPE 244
>ref|ZP_03044141.1| conserved hypothetical protein [Escherichia coli E22]
gb|EDV83910.1| conserved hypothetical protein [Escherichia coli E22]
Length=245
Score = 180 bits (457), Expect = 2e-43, Method: Compositional matrix adjust.
Identities = 101/244 (41%), Positives = 145/244 (59%), Gaps = 1/244 (0%)
Query 4 LQSYFIASVLYVMTPHAFAQGTVTIYLPGEQQTLSVGPVENVVQLVTQPQLRDRLWWPGA 63
++ +A +L V FA GTV ++ G + ++ E ++ LV QP L + WWPGA
Sbjct 2 IKQTIVALILSVGASSVFAAGTVKVFSNGSSEAKTLTGAEYLIDLVGQPWLANS-WWPGA 60
Query 64 LLTDSAAKAKALKDYQHVMAQLASWEAEADDDVAATIKSVRQQLLNLNITGRLPVKLDPD 123
++++ A A A + Q ++ +LA A++ D AA I ++RQQ+ L +TGR + LDPD
Sbjct 61 VISEELATAAASRQQQALLTRLAELAADSSADDAAAINALRQQIQALKVTGRQKINLDPD 120
Query 124 FVRVDENSNPPLVGDYTLYTVQRPVTITLLGAVSGAGQLPWQAGRSVTDYLQDHPRLAGA 183
VRV E NPPL G+YTL+ P T+TL G +S G+ P+ GR V YL L+GA
Sbjct 121 IVRVAERGNPPLQGNYTLWVGPPPSTVTLFGLISRPGKQPFTPGRDVASYLSGQNLLSGA 180
Query 184 DKNNVMVITPEGETVVAPVALWNKRHVEPPPGSQLWLGFSAHVLPEKYADLNDQIVSVLT 243
D++ V+ P+G T APVA WNKRHVEP PGS +++G + V E LN I+ LT
Sbjct 181 DRSYAWVVYPDGRTQKAPVAYWNKRHVEPMPGSIIYVGLADSVWSETPDALNADILQTLT 240
Query 244 QRVP 247
QR+P
Sbjct 241 QRIP 244
>ref|ZP_06651602.1| predicted protein [Escherichia coli B354]
gb|EFF14495.1| predicted protein [Escherichia coli B354]
Length=245
Score = 180 bits (456), Expect = 2e-43, Method: Compositional matrix adjust.
Identities = 100/244 (40%), Positives = 146/244 (59%), Gaps = 1/244 (0%)
Query 4 LQSYFIASVLYVMTPHAFAQGTVTIYLPGEQQTLSVGPVENVVQLVTQPQLRDRLWWPGA 63
++ +A +L V FA GTV ++ G + ++ E+++ LV QP+L + WWPGA
Sbjct 2 IKQTIVALILSVGASSVFAAGTVKVFSNGSSEAKTLTGAEHLIDLVGQPRLANS-WWPGA 60
Query 64 LLTDSAAKAKALKDYQHVMAQLASWEAEADDDVAATIKSVRQQLLNLNITGRLPVKLDPD 123
++++ A A AL+ Q ++ +LA A++ D AA I ++RQQ+ L +TGR + LD D
Sbjct 61 VISEELATAAALRQQQALLTRLAELTADSSADDAAAINALRQQIQALKVTGRQKINLDSD 120
Query 124 FVRVDENSNPPLVGDYTLYTVQRPVTITLLGAVSGAGQLPWQAGRSVTDYLQDHPRLAGA 183
VRV E NPPL G+YTL+ P T+TL G +S G+ P+ GR V YL L+ A
Sbjct 121 IVRVAERGNPPLQGNYTLWVGPPPSTVTLFGLISRPGKQPFTPGRDVASYLSGQNLLSSA 180
Query 184 DKNNVMVITPEGETVVAPVALWNKRHVEPPPGSQLWLGFSAHVLPEKYADLNDQIVSVLT 243
D++ V+ P+G T APVA WNKRHVEP PGS +++G + V E LN I+ LT
Sbjct 181 DRSYAWVVYPDGRTQKAPVAYWNKRHVEPMPGSIIYVGLADSVWSETPDALNADILQTLT 240
Query 244 QRVP 247
QR+P
Sbjct 241 QRIP 244
>ref|ZP_06356425.1| conserved hypothetical protein [Citrobacter youngae ATCC 29220]
gb|EFE05535.1| conserved hypothetical protein [Citrobacter youngae ATCC 29220]
Length=245
Score = 180 bits (456), Expect = 2e-43, Method: Compositional matrix adjust.
Identities = 100/245 (40%), Positives = 147/245 (60%), Gaps = 1/245 (0%)
Query 4 LQSYFIASVLYVMTPHAFAQGTVTIYLPGEQQTLSVGPVENVVQLVTQPQLRDRLWWPGA 63
++ +A V + P FA G V + + G + + E+++ LV QP+L + WWPGA
Sbjct 2 IKRAVMALVFSLSVPSVFAAGDVKVMIAGSAEPKILTGAEHLIDLVGQPRLSNS-WWPGA 60
Query 64 LLTDSAAKAKALKDYQHVMAQLASWEAEADDDVAATIKSVRQQLLNLNITGRLPVKLDPD 123
++++ A +A + Q ++A+LA+ AE D AA I ++RQQ+ L +TGR + LDPD
Sbjct 61 VISEERATVEAERQQQALLARLAALSAEESGDDAAAINALRQQIQALKVTGRQKINLDPD 120
Query 124 FVRVDENSNPPLVGDYTLYTVQRPVTITLLGAVSGAGQLPWQAGRSVTDYLQDHPRLAGA 183
VRV E NPPL G+YTL+ +P ITL G +S G+ P+ GR V YL RL+GA
Sbjct 121 VVRVSERGNPPLQGNYTLWVGPQPTDITLFGLLSRPGKQPFMPGRDVASYLDGQSRLSGA 180
Query 184 DKNNVMVITPEGETVVAPVALWNKRHVEPPPGSQLWLGFSAHVLPEKYADLNDQIVSVLT 243
D++ VI P+G T P+A WNKRHVEP PGS +++G S V ++N I+ LT
Sbjct 181 DRSYAWVIYPDGRTQKVPIAYWNKRHVEPMPGSIIFVGLSDAVWSSTPDEINADILRTLT 240
Query 244 QRVPD 248
QR+P+
Sbjct 241 QRIPE 245
>ref|YP_001572429.1| hypothetical protein SARI_03458 [Salmonella enterica subsp. arizonae
serovar 62:z4,z23:-- str. RSK2980]
gb|ABX23287.1| hypothetical protein SARI_03458 [Salmonella enterica subsp. arizonae
serovar 62:z4,z23:--]
Length=245
Score = 180 bits (456), Expect = 2e-43, Method: Compositional matrix adjust.
Identities = 94/239 (39%), Positives = 140/239 (58%), Gaps = 1/239 (0%)
Query 10 ASVLYVMTPHAFAQGTVTIYLPGEQQTLSVGPVENVVQLVTQPQLRDRLWWPGALLTDSA 69
A L + AFA GTVT+++ G + ++ ++ LV QP+L WWP A++ +
Sbjct 8 ALALAFVASSAFASGTVTVFIKGHNEPKTLTDAGRLLDLVGQPRLATS-WWPAAVIGEEK 66
Query 70 AKAKALKDYQHVMAQLASWEAEADDDVAATIKSVRQQLLNLNITGRLPVKLDPDFVRVDE 129
A A A + Q ++ +LA+ E + D AA I ++R+Q+ + +TGR V LDPD VRV E
Sbjct 67 ATATARQQQQELLGRLAALSTEENGDAAAAINALRRQIQAVKVTGRQFVNLDPDVVRVSE 126
Query 130 NSNPPLVGDYTLYTVQRPVTITLLGAVSGAGQLPWQAGRSVTDYLQDHPRLAGADKNNVM 189
NPPL G+YTL+ +P ITL G +S G P+ GR V YL L+GAD++
Sbjct 127 RGNPPLQGNYTLWVGPQPTQITLFGLISRPGSQPFIPGRDVASYLDGQRLLSGADRSYAW 186
Query 190 VITPEGETVVAPVALWNKRHVEPPPGSQLWLGFSAHVLPEKYADLNDQIVSVLTQRVPD 248
V+ P+G + APVA WNKRH+EP PGS +++GF++ +N + LTQR+P+
Sbjct 187 VVYPDGRSQKAPVAYWNKRHIEPMPGSIIFVGFASSRWRGTPEAINADTLHTLTQRIPE 245
>ref|NP_458522.1| hypothetical protein STY4420 [Salmonella enterica subsp. enterica
serovar Typhi str. CT18]
ref|NP_807734.1| hypothetical protein t4130 [Salmonella enterica subsp. enterica
serovar Typhi str. Ty2]
ref|ZP_02656274.1| conserved hypothetical protein [Salmonella enterica subsp. enterica
serovar Kentucky str. CDC 191]
ref|ZP_03075142.1| conserved hypothetical protein [Salmonella enterica subsp. enterica
serovar Kentucky str. CVM29188]
ref|ZP_03348876.1| hypothetical protein Salmoneentericaenterica_25318 [Salmonella
enterica subsp. enterica serovar Typhi str. E00-7866]
ref|ZP_03351503.1| hypothetical protein Salmonentericaenterica_11323 [Salmonella
enterica subsp. enterica serovar Typhi str. E01-6750]
ref|ZP_03379652.1| hypothetical protein SentesTy_21220 [Salmonella enterica subsp.
enterica serovar Typhi str. J185]
ref|ZP_03384439.1| hypothetical protein SentesT_20139 [Salmonella enterica subsp.
enterica serovar Typhi str. M223]
ref|ZP_06544161.1| hypothetical protein Salmonellentericaenterica_05466 [Salmonella
enterica subsp. enterica serovar Typhi str. E98-3139]
pir||AG1013 probable exported protein STY4420 [imported] - Salmonella enterica
subsp. enterica serovar Typhi (strain CT18)
emb|CAD09208.1| putative exported protein [Salmonella enterica subsp. enterica
serovar Typhi]
gb|AAO71594.1| putative exported protein [Salmonella enterica subsp. enterica
serovar Typhi str. Ty2]
gb|EDX44361.1| conserved hypothetical protein [Salmonella enterica subsp. enterica
serovar Kentucky str. CVM29188]
gb|EDZ21036.1| conserved hypothetical protein [Salmonella enterica subsp. enterica
serovar Kentucky str. CDC 191]
Length=244
Score = 179 bits (453), Expect = 4e-43, Method: Compositional matrix adjust.
Identities = 96/239 (40%), Positives = 141/239 (58%), Gaps = 2/239 (0%)
Query 10 ASVLYVMTPHAFAQGTVTIYLPGEQQTLSVGPVENVVQLVTQPQLRDRLWWPGALLTDSA 69
A L + AFA GTVT++ G + ++ E ++ LV QP+L WWP A++ +
Sbjct 8 ALALAFVASSAFASGTVTVFTQGNSEPKTLTDAERLLDLVGQPRLATS-WWPAAVIGEEQ 66
Query 70 AKAKALKDYQHVMAQLASWEAEADDDVAATIKSVRQQLLNLNITGRLPVKLDPDFVRVDE 129
A A + Q ++ +LA+ AE D D AA I ++R+Q+ + +TGR V LDPD VRV E
Sbjct 67 ATVTARQQQQELLGRLAALSAEEDGDAAA-INTLRRQIQAVKVTGRQFVNLDPDVVRVSE 125
Query 130 NSNPPLVGDYTLYTVQRPVTITLLGAVSGAGQLPWQAGRSVTDYLQDHPRLAGADKNNVM 189
NPPL G YTL+ +P +TL G +S G P+ GR V YL+ L+GAD++
Sbjct 126 RGNPPLQGHYTLWVGPQPTQVTLFGLISQPGSQPFVPGRDVASYLEGQRLLSGADRSYAW 185
Query 190 VITPEGETVVAPVALWNKRHVEPPPGSQLWLGFSAHVLPEKYADLNDQIVSVLTQRVPD 248
V+ P+G + APVA WNKRH+EP PGS +++GF+ + +N I+ LTQR+P+
Sbjct 186 VVYPDGRSQKAPVAYWNKRHIEPMPGSIIFVGFADSLWRGTPEAINADILHTLTQRIPE 244
>ref|ZP_07380293.1| protein of unknown function DUF1017 [Pantoea sp. aB]
gb|EFM18299.1| protein of unknown function DUF1017 [Pantoea sp. aB]
Length=247
Score = 179 bits (453), Expect = 5e-43, Method: Compositional matrix adjust.
Identities = 94/222 (42%), Positives = 129/222 (58%), Gaps = 0/222 (0%)
Query 27 TIYLPGEQQTLSVGPVENVVQLVTQPQLRDRLWWPGALLTDSAAKAKALKDYQHVMAQLA 86
T++ P + + ++ QLVT P L+ W + + A A A + YQ + L
Sbjct 26 TVHTPHNGGQAELSQITDLSQLVTLPPLQVNTDWRSTFIAERGATAVARQQYQQTLGALR 85
Query 87 SWEAEADDDVAATIKSVRQQLLNLNITGRLPVKLDPDFVRVDENSNPPLVGDYTLYTVQR 146
+W A++ D AA I V +QL + + GR LDPD++R+ N L G Y LY
Sbjct 86 AWRADSSGDRAAAIDEVIRQLSAIKVAGRQFTSLDPDWIRLHPADNRRLEGSYDLYLQAP 145
Query 147 PVTITLLGAVSGAGQLPWQAGRSVTDYLQDHPRLAGADKNNVMVITPEGETVVAPVALWN 206
++ LLGA+SGAG++ WQ G+SV DYL H L+GA++N V VI P G T PVA WN
Sbjct 146 TDSVLLLGALSGAGKVSWQPGKSVRDYLDGHDALSGAERNFVTVIAPSGATQQVPVAYWN 205
Query 207 KRHVEPPPGSQLWLGFSAHVLPEKYADLNDQIVSVLTQRVPD 248
+RH E PGS +WLGFS+ LP DLND+I+SVLT R+PD
Sbjct 206 RRHAEVEPGSVIWLGFSSWSLPGSDEDLNDRILSVLTHRIPD 247
>ref|ZP_03357288.1| hypothetical protein SentesTyphi_01726 [Salmonella enterica subsp.
enterica serovar Typhi str. E02-1180]
Length=244
Score = 177 bits (450), Expect = 1e-42, Method: Compositional matrix adjust.
Identities = 96/239 (40%), Positives = 140/239 (58%), Gaps = 2/239 (0%)
Query 10 ASVLYVMTPHAFAQGTVTIYLPGEQQTLSVGPVENVVQLVTQPQLRDRLWWPGALLTDSA 69
A L + AFA GTVT++ G + ++ E ++ LV QP+L WWP A++ +
Sbjct 8 ALALAFVASSAFASGTVTVFTQGNSEPKTLTDAERLLDLVGQPRLATS-WWPAAVIGEEQ 66
Query 70 AKAKALKDYQHVMAQLASWEAEADDDVAATIKSVRQQLLNLNITGRLPVKLDPDFVRVDE 129
A A + Q ++ +LA+ AE D D AA I ++R+Q+ + +TGR V LDPD VRV E
Sbjct 67 ATVTARQQQQELLGRLAALSAEEDGDAAA-INTLRRQIQAVKVTGRQFVNLDPDVVRVSE 125
Query 130 NSNPPLVGDYTLYTVQRPVTITLLGAVSGAGQLPWQAGRSVTDYLQDHPRLAGADKNNVM 189
NPPL G YTL+ +P +TL G +S G P+ GR V YL+ L+GAD +
Sbjct 126 RGNPPLQGHYTLWVGPQPTQVTLFGLISQPGSQPFVPGRDVASYLEGQRLLSGADCSYAW 185
Query 190 VITPEGETVVAPVALWNKRHVEPPPGSQLWLGFSAHVLPEKYADLNDQIVSVLTQRVPD 248
V+ P+G + APVA WNKRH+EP PGS +++GF+ + +N I+ LTQR+P+
Sbjct 186 VVYPDGRSQKAPVAYWNKRHIEPMPGSIIFVGFADSLWRGTPEAINADILHTLTQRIPE 244
>ref|YP_004114116.1| hypothetical protein Pat9b_0234 [Pantoea sp. At-9b]
gb|ADU67560.1| protein of unknown function DUF1017 [Pantoea sp. At-9b]
Length=245
Score = 177 bits (449), Expect = 1e-42, Method: Compositional matrix adjust.
Identities = 102/248 (41%), Positives = 147/248 (59%), Gaps = 3/248 (1%)
Query 1 MNKLQSYFIASVLYVMTPHAFAQGTVTIYLPGEQQTLSVGPVENVVQLVTQPQLRDRLWW 60
M K+ S L+ +A A V ++ + TL V +N+ QL++ P + WW
Sbjct 1 MKKITSLLAGLALFTAG-NALADSQVIVHDGPHRATLQVDHAQNLSQLLSNPAIHT--WW 57
Query 61 PGALLTDSAAKAKALKDYQHVMAQLASWEAEADDDVAATIKSVRQQLLNLNITGRLPVKL 120
PG ++ + AA A A + Q ++A L +W+A+ + AA I +V QQL +TGR L
Sbjct 58 PGTVIAEHAATAVAKQQQQQLLADLRAWQADNSGERAAAIGAVIQQLAATPVTGRQFTSL 117
Query 121 DPDFVRVDENSNPPLVGDYTLYTVQRPVTITLLGAVSGAGQLPWQAGRSVTDYLQDHPRL 180
DPD+VR+ +N L G Y LYT+ P + +LGA+ G++ WQ GR+V YL DH RL
Sbjct 118 DPDWVRLRPEANRILQGSYDLYTLAAPTQVLVLGALEHPGKVSWQPGRTVRSYLADHDRL 177
Query 181 AGADKNNVMVITPEGETVVAPVALWNKRHVEPPPGSQLWLGFSAHVLPEKYADLNDQIVS 240
+G +++ VI P GET P+A WN RHVE PGS +WLGFS+ LP +DLND+++S
Sbjct 178 SGGERSFATVIAPSGETQQVPIAYWNHRHVEVEPGSIIWLGFSSWSLPWGQSDLNDRMIS 237
Query 241 VLTQRVPD 248
VLT R+PD
Sbjct 238 VLTHRIPD 245
>ref|ZP_08376264.1| conserved hypothetical protein [Escherichia coli TA280]
gb|EGI38697.1| conserved hypothetical protein [Escherichia coli TA280]
Length=245
Score = 177 bits (448), Expect = 2e-42, Method: Compositional matrix adjust.
Identities = 99/244 (40%), Positives = 145/244 (59%), Gaps = 1/244 (0%)
Query 4 LQSYFIASVLYVMTPHAFAQGTVTIYLPGEQQTLSVGPVENVVQLVTQPQLRDRLWWPGA 63
++ +A +L V FA GTV ++ G + ++ E+++ LV QP+L + WW GA
Sbjct 2 IKQTIVALLLSVGASSVFAAGTVKVFSNGSSEAKTLTGAEHLIDLVGQPRLANS-WWSGA 60
Query 64 LLTDSAAKAKALKDYQHVMAQLASWEAEADDDVAATIKSVRQQLLNLNITGRLPVKLDPD 123
++++ A A AL+ Q ++ +LA A++ D AA I ++RQQ+ L +TGR + LDPD
Sbjct 61 VISEELATAAALRQQQALLTRLAELTADSSADDAAAINALRQQIQALKVTGRQKINLDPD 120
Query 124 FVRVDENSNPPLVGDYTLYTVQRPVTITLLGAVSGAGQLPWQAGRSVTDYLQDHPRLAGA 183
VRV E NPPL G+YTL+ P T+ L G +S G+ + GR V YL L+GA
Sbjct 121 IVRVAERGNPPLQGNYTLWVGPPPSTVMLFGLISRPGKQSFTPGRDVASYLSGQNLLSGA 180
Query 184 DKNNVMVITPEGETVVAPVALWNKRHVEPPPGSQLWLGFSAHVLPEKYADLNDQIVSVLT 243
D++ V+ P+G T APVA WNKRHVEP PGS +++G + V E LN I+ LT
Sbjct 181 DRSYAWVVYPDGRTQKAPVAYWNKRHVEPMPGSIIYVGLADSVWSETPDALNADILQTLT 240
Query 244 QRVP 247
QR+P
Sbjct 241 QRIP 244
>ref|ZP_04558593.1| conserved hypothetical protein [Citrobacter sp. 30_2]
gb|EEH96041.1| conserved hypothetical protein [Citrobacter sp. 30_2]
Length=245
Score = 176 bits (445), Expect = 4e-42, Method: Compositional matrix adjust.
Identities = 96/228 (42%), Positives = 141/228 (61%), Gaps = 1/228 (0%)
Query 21 FAQGTVTIYLPGEQQTLSVGPVENVVQLVTQPQLRDRLWWPGALLTDSAAKAKALKDYQH 80
FA G+V ++ ++ ++ E+++ LV QP+L + WWPGA++++ A +A + Q
Sbjct 19 FAAGSVKVFTSASEEPKTLTGAEHLLDLVGQPRLSNS-WWPGAVISEERATMEAGRQQQA 77
Query 81 VMAQLASWEAEADDDVAATIKSVRQQLLNLNITGRLPVKLDPDFVRVDENSNPPLVGDYT 140
++A+LA+ AE D AA I ++RQQ+ L +TGR + LDPD VRV E NPPL G+YT
Sbjct 78 LLARLAALSAEESGDDAAAINTLRQQIQALKVTGRQKINLDPDVVRVSERGNPPLQGNYT 137
Query 141 LYTVQRPVTITLLGAVSGAGQLPWQAGRSVTDYLQDHPRLAGADKNNVMVITPEGETVVA 200
L+ +P ITL G +S G+ P+ GR V YL RL+GAD++ VI P+G T
Sbjct 138 LWVGAQPTHITLFGLLSHPGKQPFMPGRDVASYLDGQSRLSGADRSFAWVIYPDGRTQKV 197
Query 201 PVALWNKRHVEPPPGSQLWLGFSAHVLPEKYADLNDQIVSVLTQRVPD 248
VA WNKRHVEP PGS +++G S V ++N I+ LTQR+P+
Sbjct 198 SVAYWNKRHVEPMPGSIIYVGLSDAVWSSTSDEINADILRTLTQRIPE 245
>ref|ZP_02346309.1| conserved hypothetical protein [Salmonella enterica subsp. enterica
serovar Saintpaul str. SARA29]
gb|EDZ10637.1| conserved hypothetical protein [Salmonella enterica subsp. enterica
serovar Saintpaul str. SARA29]
Length=244
Score = 175 bits (444), Expect = 5e-42, Method: Compositional matrix adjust.
Identities = 94/239 (39%), Positives = 139/239 (58%), Gaps = 2/239 (0%)
Query 10 ASVLYVMTPHAFAQGTVTIYLPGEQQTLSVGPVENVVQLVTQPQLRDRLWWPGALLTDSA 69
A L + AFA GTVT++ G + ++ E ++ LV QP+L WWP A++ +
Sbjct 8 ALALAFIASSAFASGTVTVFTQGNSEPKTLTDAERLLDLVGQPRLATS-WWPAAVIGEEQ 66
Query 70 AKAKALKDYQHVMAQLASWEAEADDDVAATIKSVRQQLLNLNITGRLPVKLDPDFVRVDE 129
A A + Q ++ +LA+ AE + D AA I ++R+Q+ + +TGR V LDPD VRV E
Sbjct 67 ATVTARQQQQELLGRLAALSAEEEGDAAA-INTLRRQIQAVKVTGRQFVNLDPDVVRVSE 125
Query 130 NSNPPLVGDYTLYTVQRPVTITLLGAVSGAGQLPWQAGRSVTDYLQDHPRLAGADKNNVM 189
NPPL G YTL+ P +TL G +S G P+ GR V YL+ L+GAD++
Sbjct 126 RGNPPLQGHYTLWVGPEPTQVTLFGLISQPGSQPFVPGRDVASYLEGQRLLSGADRSYAW 185
Query 190 VITPEGETVVAPVALWNKRHVEPPPGSQLWLGFSAHVLPEKYADLNDQIVSVLTQRVPD 248
V+ P+G + APVA WNKRH+EP PGS +++GF+ + +N I+ L QR+P+
Sbjct 186 VVYPDGRSQKAPVAYWNKRHIEPMPGSIIFVGFADSLWRGTPEAINADILHTLKQRIPE 244
>ref|ZP_07183334.1| conserved hypothetical protein [Escherichia coli MS 196-1]
gb|EFI90597.1| conserved hypothetical protein [Escherichia coli MS 196-1]
gb|AEE59357.1| conserved hypothetical protein [Escherichia coli UMNK88]
Length=220
Score = 173 bits (438), Expect = 2e-41, Method: Compositional matrix adjust.
Identities = 95/220 (43%), Positives = 137/220 (62%), Gaps = 1/220 (0%)
Query 28 IYLPGEQQTLSVGPVENVVQLVTQPQLRDRLWWPGALLTDSAAKAKALKDYQHVMAQLAS 87
++ G + ++ E+++ LV QP+L + WWPGA++++ A A AL+ Q ++ +LA
Sbjct 1 MFSNGSSEAKTLTGAEHLIDLVGQPRLANS-WWPGAVISEELATAAALRQQQALLTRLAE 59
Query 88 WEAEADDDVAATIKSVRQQLLNLNITGRLPVKLDPDFVRVDENSNPPLVGDYTLYTVQRP 147
A++ D AA I ++RQQ+ L +TGR + LDPD VRV E NPPL G+YTL+ P
Sbjct 60 QGADSSADDAAAINALRQQIQALKVTGRQKINLDPDIVRVAERGNPPLQGNYTLWVGPPP 119
Query 148 VTITLLGAVSGAGQLPWQAGRSVTDYLQDHPRLAGADKNNVMVITPEGETVVAPVALWNK 207
T+TL G +S G+ P+ GR V YL D L+GAD++ V+ P+G T APVA WNK
Sbjct 120 STVTLFGLISRPGKQPFTPGRDVASYLSDQSLLSGADRSYAWVVYPDGRTQKAPVAYWNK 179
Query 208 RHVEPPPGSQLWLGFSAHVLPEKYADLNDQIVSVLTQRVP 247
RHVEP PGS +++G + V E LN I+ LTQR+P
Sbjct 180 RHVEPMPGSIIYVGLADSVWSETPDALNADILQTLTQRIP 219
>ref|YP_002218119.1| hypothetical protein SeD_A4618 [Salmonella enterica subsp. enterica
serovar Dublin str. CT_02021853]
gb|ACH75379.1| conserved hypothetical protein [Salmonella enterica subsp. enterica
serovar Dublin str. CT_02021853]
gb|EGE32261.1| Putative periplasmic protein [Salmonella enterica subsp. enterica
serovar Dublin str. SD3246]
Length=245
Score = 172 bits (435), Expect = 5e-41, Method: Compositional matrix adjust.
Identities = 97/239 (40%), Positives = 142/239 (59%), Gaps = 1/239 (0%)
Query 10 ASVLYVMTPHAFAQGTVTIYLPGEQQTLSVGPVENVVQLVTQPQLRDRLWWPGALLTDSA 69
A L + AFA GTVT++ G + ++ E ++ LV QP+L WWP A++ +
Sbjct 8 ALALAFVASSAFASGTVTVFTQGNSEPKTLTDAERLLDLVGQPRLATS-WWPAAVIGEEQ 66
Query 70 AKAKALKDYQHVMAQLASWEAEADDDVAATIKSVRQQLLNLNITGRLPVKLDPDFVRVDE 129
A A + Q ++ +LA+ AE D D AA I ++R+Q+ + +TGR V LDPD VRV E
Sbjct 67 ATVTARQQQQELLGRLAALGAEEDGDAAAAINTLRRQIQAVKVTGRQFVNLDPDVVRVSE 126
Query 130 NSNPPLVGDYTLYTVQRPVTITLLGAVSGAGQLPWQAGRSVTDYLQDHPRLAGADKNNVM 189
NPPL G YTL+ +P +TL G +S G P+ GR V YL+D L+GAD++
Sbjct 127 RGNPPLQGHYTLWVGPQPTQVTLFGLISQPGSQPFVPGRDVASYLEDQRLLSGADRSYAW 186
Query 190 VITPEGETVVAPVALWNKRHVEPPPGSQLWLGFSAHVLPEKYADLNDQIVSVLTQRVPD 248
V+ P+G + APVA WNKRH+EP PGS +++GF+ + +N I+ LTQR+P+
Sbjct 187 VVYPDGRSQKAPVAYWNKRHIEPMPGSIIFVGFADSLWRGTPEAMNADILHTLTQRIPE 245
>ref|ZP_03213760.1| conserved hypothetical protein [Salmonella enterica subsp. enterica
serovar Virchow str. SL491]
gb|EDZ02791.1| conserved hypothetical protein [Salmonella enterica subsp. enterica
serovar Virchow str. SL491]
Length=245
Score = 172 bits (435), Expect = 6e-41, Method: Compositional matrix adjust.
Identities = 97/239 (40%), Positives = 142/239 (59%), Gaps = 1/239 (0%)
Query 10 ASVLYVMTPHAFAQGTVTIYLPGEQQTLSVGPVENVVQLVTQPQLRDRLWWPGALLTDSA 69
A L + AFA GTVT++ G + ++ E ++ LV QP+L WWP A++ +
Sbjct 8 ALALAFVASSAFASGTVTVFTQGNSEPKTLTDAERLLDLVGQPRLATS-WWPAAVIGEEQ 66
Query 70 AKAKALKDYQHVMAQLASWEAEADDDVAATIKSVRQQLLNLNITGRLPVKLDPDFVRVDE 129
A A + Q ++ +LA+ AE D D AA I ++R+Q+ + +TGR V LDPD VRV E
Sbjct 67 ATVTARQQQQELLGRLAALGAEEDGDAAAAINTLRRQIQAVKVTGRQFVNLDPDVVRVSE 126
Query 130 NSNPPLVGDYTLYTVQRPVTITLLGAVSGAGQLPWQAGRSVTDYLQDHPRLAGADKNNVM 189
NPPL G YTL+ +P +TL G +S G P+ GR V YL+D L+GAD++
Sbjct 127 RGNPPLQGHYTLWVGPQPTQVTLFGLISQPGSQPFVPGRDVASYLEDQRLLSGADRSYAW 186
Query 190 VITPEGETVVAPVALWNKRHVEPPPGSQLWLGFSAHVLPEKYADLNDQIVSVLTQRVPD 248
V+ P+G + APVA WNKRH+EP PGS +++GF+ + +N I+ LTQR+P+
Sbjct 187 VVYPDGRSQKAPVAYWNKRHIEPMPGSIIFVGFADSLWRGTPKAINADILHTLTQRIPE 245
>ref|YP_001591314.1| hypothetical protein SPAB_05204 [Salmonella enterica subsp. enterica
serovar Paratyphi B str. SPB7]
ref|ZP_02700750.1| conserved hypothetical protein [Salmonella enterica subsp. enterica
serovar Newport str. SL317]
gb|ABX70481.1| hypothetical protein SPAB_05204 [Salmonella enterica subsp. enterica
serovar Paratyphi B str. SPB7]
gb|EDX49107.1| conserved hypothetical protein [Salmonella enterica subsp. enterica
serovar Newport str. SL317]
Length=245
Score = 172 bits (435), Expect = 6e-41, Method: Compositional matrix adjust.
Identities = 97/239 (40%), Positives = 142/239 (59%), Gaps = 1/239 (0%)
Query 10 ASVLYVMTPHAFAQGTVTIYLPGEQQTLSVGPVENVVQLVTQPQLRDRLWWPGALLTDSA 69
A L + AFA GTVT++ G + ++ E ++ LV QP+L WWP A++ +
Sbjct 8 ALALAFVASSAFASGTVTVFTQGNSEPKTLTDAERLLDLVGQPRLATS-WWPAAVIGEEQ 66
Query 70 AKAKALKDYQHVMAQLASWEAEADDDVAATIKSVRQQLLNLNITGRLPVKLDPDFVRVDE 129
A A + Q ++ +LA+ AE D D AA I ++R+Q+ + +TGR V LDPD VRV E
Sbjct 67 ATVTARQQQQELLGRLAALGAEEDGDAAAAINTLRRQIQAVKVTGRQFVNLDPDVVRVSE 126
Query 130 NSNPPLVGDYTLYTVQRPVTITLLGAVSGAGQLPWQAGRSVTDYLQDHPRLAGADKNNVM 189
NPPL G YTL+ +P +TL G +S G P+ GR V YL+D L+GAD++
Sbjct 127 RGNPPLQGHYTLWVGPQPTQVTLFGLISQPGSQPFVPGRDVASYLEDQRLLSGADRSYAW 186
Query 190 VITPEGETVVAPVALWNKRHVEPPPGSQLWLGFSAHVLPEKYADLNDQIVSVLTQRVPD 248
V+ P+G + APVA WNKRH+EP PGS +++GF+ + +N I+ LTQR+P+
Sbjct 187 VVYPDGRSQKAPVAYWNKRHIEPMPGSIIFVGFADSLWRGTPEAINADILHTLTQRIPE 245
>ref|ZP_02664389.1| conserved hypothetical protein [Salmonella enterica subsp. enterica
serovar Schwarzengrund str. SL480]
ref|YP_002117102.1| hypothetical protein SeSA_A4415 [Salmonella enterica subsp. enterica
serovar Schwarzengrund str. CVM19633]
gb|ACF92324.1| conserved hypothetical protein [Salmonella enterica subsp. enterica
serovar Schwarzengrund str. CVM19633]
gb|EDY27341.1| conserved hypothetical protein [Salmonella enterica subsp. enterica
serovar Schwarzengrund str. SL480]
Length=245
Score = 171 bits (433), Expect = 1e-40, Method: Compositional matrix adjust.
Identities = 97/239 (40%), Positives = 141/239 (58%), Gaps = 1/239 (0%)
Query 10 ASVLYVMTPHAFAQGTVTIYLPGEQQTLSVGPVENVVQLVTQPQLRDRLWWPGALLTDSA 69
A L + AFA GTVT++ G + ++ E ++ LV QP+L WWP A++ +
Sbjct 8 ALALAFVASSAFASGTVTVFTQGNSEPKTLTDAERLLDLVGQPRLATS-WWPAAVIGEEQ 66
Query 70 AKAKALKDYQHVMAQLASWEAEADDDVAATIKSVRQQLLNLNITGRLPVKLDPDFVRVDE 129
A A + Q ++ +LA+ AE D D AA I ++R+Q+ L +TGR V LDPD VRV E
Sbjct 67 ATVTARQQQQELLGRLAALSAEEDGDAAAAINTLRRQIQALKVTGRQFVNLDPDVVRVSE 126
Query 130 NSNPPLVGDYTLYTVQRPVTITLLGAVSGAGQLPWQAGRSVTDYLQDHPRLAGADKNNVM 189
NPPL G YTL+ +P +TL G +S G P+ GR V YL+ L+GAD++
Sbjct 127 RGNPPLQGHYTLWVGPQPTQVTLFGLISQPGSQPFVPGRDVASYLEGQRLLSGADRSYAW 186
Query 190 VITPEGETVVAPVALWNKRHVEPPPGSQLWLGFSAHVLPEKYADLNDQIVSVLTQRVPD 248
V+ P+G + APVA WNKRH+EP PGS +++GF+ + +N I+ LTQR+P+
Sbjct 187 VVYPDGRSQKAPVAYWNKRHIEPMPGSIIFVGFADSLWRGTPEAMNADILHTLTQRIPE 245
>gb|EFY12299.1| hypothetical protein SEEM315_09434 [Salmonella enterica subsp.
enterica serovar Montevideo str. 315996572]
gb|EFY15329.1| hypothetical protein SEEM971_09698 [Salmonella enterica subsp.
enterica serovar Montevideo str. 495297-1]
gb|EFY18999.1| hypothetical protein SEEM973_18592 [Salmonella enterica subsp.
enterica serovar Montevideo str. 495297-3]
gb|EFY24011.1| hypothetical protein SEEM974_00967 [Salmonella enterica subsp.
enterica serovar Montevideo str. 495297-4]
gb|EFY27944.1| hypothetical protein SEEM201_05803 [Salmonella enterica subsp.
enterica serovar Montevideo str. 515920-1]
gb|EFY34303.1| hypothetical protein SEEM202_03119 [Salmonella enterica subsp.
enterica serovar Montevideo str. 515920-2]
gb|EFY39165.1| hypothetical protein SEEM954_05801 [Salmonella enterica subsp.
enterica serovar Montevideo str. 531954]
gb|EFY40139.1| hypothetical protein SEEM054_18897 [Salmonella enterica subsp.
enterica serovar Montevideo str. NC_MB110209-0054]
gb|EFY44779.1| hypothetical protein SEEM675_22469 [Salmonella enterica subsp.
enterica serovar Montevideo str. OH_2009072675]
gb|EFY50949.1| hypothetical protein SEEM965_15608 [Salmonella enterica subsp.
enterica serovar Montevideo str. CASC_09SCPH15965]
gb|EFY55832.1| hypothetical protein SEEM19N_07614 [Salmonella enterica subsp.
enterica serovar Montevideo str. 19N]
gb|EFY58419.1| hypothetical protein SEEM801_16871 [Salmonella enterica subsp.
enterica serovar Montevideo str. 81038-01]
gb|EFY62230.1| hypothetical protein SEEM507_02454 [Salmonella enterica subsp.
enterica serovar Montevideo str. MD_MDA09249507]
gb|EFY68583.1| hypothetical protein SEEM877_14809 [Salmonella enterica subsp.
enterica serovar Montevideo str. 414877]
gb|EFY71972.1| hypothetical protein SEEM867_14018 [Salmonella enterica subsp.
enterica serovar Montevideo str. 366867]
gb|EFY76423.1| hypothetical protein SEEM180_11783 [Salmonella enterica subsp.
enterica serovar Montevideo str. 413180]
gb|EFY80724.1| hypothetical protein SEEM600_18332 [Salmonella enterica subsp.
enterica serovar Montevideo str. 446600]
gb|EFZ81150.1| hypothetical protein SEEM581_05484 [Salmonella enterica subsp.
enterica serovar Montevideo str. 609458-1]
gb|EFZ82227.1| hypothetical protein SEEM501_17299 [Salmonella enterica subsp.
enterica serovar Montevideo str. 556150-1]
gb|EFZ87261.1| hypothetical protein SEEM460_10347 [Salmonella enterica subsp.
enterica serovar Montevideo str. 609460]
gb|EFZ91272.1| hypothetical protein SEEM020_11755 [Salmonella enterica subsp.
enterica serovar Montevideo str. 507440-20]
gb|EFZ98124.1| hypothetical protein SEEM6152_13293 [Salmonella enterica subsp.
enterica serovar Montevideo str. 556152]
gb|EGA00435.1| hypothetical protein SEEM0077_08473 [Salmonella enterica subsp.
enterica serovar Montevideo str. MB101509-0077]
gb|EGA06500.1| hypothetical protein SEEM0047_17570 [Salmonella enterica subsp.
enterica serovar Montevideo str. MB102109-0047]
gb|EGA10732.1| hypothetical protein SEEM0055_01306 [Salmonella enterica subsp.
enterica serovar Montevideo str. MB110209-0055]
gb|EGA13280.1| hypothetical protein SEEM0052_05865 [Salmonella enterica subsp.
enterica serovar Montevideo str. MB111609-0052]
gb|EGA20316.1| hypothetical protein SEEM3312_18136 [Salmonella enterica subsp.
enterica serovar Montevideo str. 2009083312]
gb|EGA21702.1| hypothetical protein SEEM5258_20322 [Salmonella enterica subsp.
enterica serovar Montevideo str. 2009085258]
gb|EGA25723.1| hypothetical protein SEEM1156_12037 [Salmonella enterica subsp.
enterica serovar Montevideo str. 315731156]
gb|EGA32096.1| hypothetical protein SEEM9199_15729 [Salmonella enterica subsp.
enterica serovar Montevideo str. IA_2009159199]
gb|EGA38321.1| hypothetical protein SEEM8282_03575 [Salmonella enterica subsp.
enterica serovar Montevideo str. IA_2010008282]
gb|EGA42107.1| hypothetical protein SEEM8283_08901 [Salmonella enterica subsp.
enterica serovar Montevideo str. IA_2010008283]
gb|EGA43343.1| hypothetical protein SEEM8284_10847 [Salmonella enterica subsp.
enterica serovar Montevideo str. IA_2010008284]
gb|EGA49478.1| hypothetical protein SEEM8285_07750 [Salmonella enterica subsp.
enterica serovar Montevideo str. IA_2010008285]
gb|EGA53872.1| hypothetical protein SEEM8287_19916 [Salmonella enterica subsp.
enterica serovar Montevideo str. IA_2010008287]
Length=245
Score = 171 bits (432), Expect = 1e-40, Method: Compositional matrix adjust.
Identities = 97/239 (40%), Positives = 141/239 (58%), Gaps = 1/239 (0%)
Query 10 ASVLYVMTPHAFAQGTVTIYLPGEQQTLSVGPVENVVQLVTQPQLRDRLWWPGALLTDSA 69
A L + AFA GTVT++ G + ++ E ++ LV QP+L WWP A++ +
Sbjct 8 ALALAFVASSAFASGTVTVFTQGNSEPKTLTDAERLLDLVGQPRLATS-WWPAAVIGEEQ 66
Query 70 AKAKALKDYQHVMAQLASWEAEADDDVAATIKSVRQQLLNLNITGRLPVKLDPDFVRVDE 129
A A + Q ++ +LA+ AE D D AA I ++R+Q+ L +TGR V LDPD VRV E
Sbjct 67 ATVTARQQQQELLGRLAALSAEEDGDAAAAINTLRRQIQALKVTGRQFVNLDPDVVRVSE 126
Query 130 NSNPPLVGDYTLYTVQRPVTITLLGAVSGAGQLPWQAGRSVTDYLQDHPRLAGADKNNVM 189
NPPL G YTL+ +P +TL G +S G P+ GR V YL+ L+GAD++
Sbjct 127 RGNPPLQGHYTLWVGPQPTQVTLFGLISQPGSQPFVPGRDVASYLEGQRLLSGADRSYAW 186
Query 190 VITPEGETVVAPVALWNKRHVEPPPGSQLWLGFSAHVLPEKYADLNDQIVSVLTQRVPD 248
V+ P+G + APVA WNKRH+EP PGS +++GF+ + +N I+ LTQR+P+
Sbjct 187 VVYPDGRSQKAPVAYWNKRHIEPMPGSIIFVGFADSLWRGTPEAINADILHTLTQRIPE 245
>ref|ZP_03069392.1| conserved hypothetical protein [Escherichia coli 101-1]
ref|ZP_07788357.1| uncharacterized protein gfcC [Escherichia coli 1827-70]
gb|EDX39839.1| conserved hypothetical protein [Escherichia coli 101-1]
gb|EFP98976.1| uncharacterized protein gfcC [Escherichia coli 1827-70]
Length=220
Score = 171 bits (432), Expect = 1e-40, Method: Compositional matrix adjust.
Identities = 94/220 (42%), Positives = 136/220 (61%), Gaps = 1/220 (0%)
Query 28 IYLPGEQQTLSVGPVENVVQLVTQPQLRDRLWWPGALLTDSAAKAKALKDYQHVMAQLAS 87
++ G + ++ E+++ LV QP+L + WWPGA++++ A A AL+ Q ++ +LA
Sbjct 1 MFSNGSSEAKTLTGAEHLIDLVGQPRLANS-WWPGAVISEELATAAALRQQQALLTRLAE 59
Query 88 WEAEADDDVAATIKSVRQQLLNLNITGRLPVKLDPDFVRVDENSNPPLVGDYTLYTVQRP 147
A++ D AA I ++RQQ+ L +TGR + LDPD VRV E NPPL G+YTL+ P
Sbjct 60 LAADSSADDAAAINALRQQIQALKVTGRQKINLDPDIVRVAERGNPPLQGNYTLWVGPPP 119
Query 148 VTITLLGAVSGAGQLPWQAGRSVTDYLQDHPRLAGADKNNVMVITPEGETVVAPVALWNK 207
T+TL G +S G+ P+ GR V YL L+GAD++ V+ P+G T APVA WNK
Sbjct 120 STVTLFGLISRPGKQPFTPGRDVASYLSGQNLLSGADRSYAWVVYPDGRTQKAPVAYWNK 179
Query 208 RHVEPPPGSQLWLGFSAHVLPEKYADLNDQIVSVLTQRVP 247
RHVEP PGS +++G + V E LN I+ LTQR+P
Sbjct 180 RHVEPMPGSIIYVGLADSVWSETPDALNADILQTLTQRIP 219
>ref|YP_153099.1| hypothetical protein SPA4041 [Salmonella enterica subsp. enterica
serovar Paratyphi A str. ATCC 9150]
ref|YP_002144588.1| hypothetical protein SSPA3750 [Salmonella enterica subsp. enterica
serovar Paratyphi A str. AKU_12601]
gb|AAV79787.1| putative exported protein [Salmonella enterica subsp. enterica
serovar Paratyphi A str. ATCC 9150]
emb|CAR62034.1| putative exported protein [Salmonella enterica subsp. enterica
serovar Paratyphi A str. AKU_12601]
Length=245
Score = 170 bits (431), Expect = 1e-40, Method: Compositional matrix adjust.
Identities = 96/239 (40%), Positives = 141/239 (58%), Gaps = 1/239 (0%)
Query 10 ASVLYVMTPHAFAQGTVTIYLPGEQQTLSVGPVENVVQLVTQPQLRDRLWWPGALLTDSA 69
A L + AFA GTVT++ G + ++ E ++ LV QP+L WWP A++ +
Sbjct 8 ALALAFIASSAFASGTVTVFTQGNSEPKTLTDAERLLDLVGQPRLATS-WWPAAVIGEEQ 66
Query 70 AKAKALKDYQHVMAQLASWEAEADDDVAATIKSVRQQLLNLNITGRLPVKLDPDFVRVDE 129
A A + Q ++ +LA+ AE D D AA I ++R+Q+ + +TGR V LDPD VRV E
Sbjct 67 ATVTARQQQQELLGRLAALSAEEDGDAAAAINTLRRQIQAVKVTGRQFVNLDPDVVRVSE 126
Query 130 NSNPPLVGDYTLYTVQRPVTITLLGAVSGAGQLPWQAGRSVTDYLQDHPRLAGADKNNVM 189
NPPL G Y L+ +P +TL G +S G P+ GR V YL+D L+GAD++
Sbjct 127 RGNPPLQGHYMLWVGPQPTQVTLFGLISQPGSQPFVPGRDVASYLEDQRLLSGADRSYAW 186
Query 190 VITPEGETVVAPVALWNKRHVEPPPGSQLWLGFSAHVLPEKYADLNDQIVSVLTQRVPD 248
V+ P+G + APVA WNKRH+EP PGS +++GF+ + +N I+ LTQR+P+
Sbjct 187 VVYPDGRSQKAPVAYWNKRHIEPMPGSIIFVGFADSLWRGTPEAINADILHTLTQRIPE 245
>ref|ZP_02799370.2| conserved hypothetical protein [Escherichia coli O157:H7 str.
EC4196]
ref|ZP_02775866.2| conserved hypothetical protein [Escherichia coli O157:H7 str.
EC4113]
ref|ZP_02780183.2| conserved hypothetical protein [Escherichia coli O157:H7 str.
EC4401]
ref|ZP_02791273.2| conserved hypothetical protein [Escherichia coli O157:H7 str.
EC4486]
ref|ZP_02823339.2| conserved hypothetical protein [Escherichia coli O157:H7 str.
EC508]
ref|ZP_03253885.1| conserved hypothetical protein [Escherichia coli O157:H7 str.
EC4045]
ref|YP_002273547.1| hypothetical protein ECH74115_5507 [Escherichia coli O157:H7
str. EC4115]
gb|EDU33657.1| conserved hypothetical protein [Escherichia coli O157:H7 str.
EC4196]
gb|EDU53135.1| conserved hypothetical protein [Escherichia coli O157:H7 str.
EC4113]
gb|EDU75958.1| conserved hypothetical protein [Escherichia coli O157:H7 str.
EC4401]
gb|EDU82489.1| conserved hypothetical protein [Escherichia coli O157:H7 str.
EC4486]
gb|EDU97565.1| conserved hypothetical protein [Escherichia coli O157:H7 str.
EC508]
gb|EDZ82520.1| conserved hypothetical protein [Escherichia coli O157:H7 str.
EC4045]
gb|ACI36304.1| conserved hypothetical protein [Escherichia coli O157:H7 str.
EC4115]
gb|EGD64359.1| YjbG polysaccharide synthesis-related protein [Escherichia coli
O157:H7 str. 1125]
Length=220
Score = 170 bits (431), Expect = 2e-40, Method: Compositional matrix adjust.
Identities = 94/220 (42%), Positives = 136/220 (61%), Gaps = 1/220 (0%)
Query 28 IYLPGEQQTLSVGPVENVVQLVTQPQLRDRLWWPGALLTDSAAKAKALKDYQHVMAQLAS 87
++ G + ++ E+++ LV QP+L + WWPGA++++ A A AL+ Q ++ +LA
Sbjct 1 MFSNGSSEAKTLTGAEHLIDLVGQPRLANS-WWPGAVISEELATAAALRQQQALLTRLAE 59
Query 88 WEAEADDDVAATIKSVRQQLLNLNITGRLPVKLDPDFVRVDENSNPPLVGDYTLYTVQRP 147
A++ D AA I ++RQQ+ L +TGR + LDPD VRV E +NPPL G+YTL+ P
Sbjct 60 QGADSSADDAAAINALRQQIQALKVTGRQKINLDPDIVRVAERANPPLQGNYTLWVGPPP 119
Query 148 VTITLLGAVSGAGQLPWQAGRSVTDYLQDHPRLAGADKNNVMVITPEGETVVAPVALWNK 207
T+TL G +S G P+ GR V YL L+GAD++ V+ P+G T APVA WNK
Sbjct 120 STVTLFGLISRPGNQPFTPGRDVASYLSGQSLLSGADRSYAWVVYPDGRTQKAPVAYWNK 179
Query 208 RHVEPPPGSQLWLGFSAHVLPEKYADLNDQIVSVLTQRVP 247
RHVEP PGS +++G + V E LN I+ LTQR+P
Sbjct 180 RHVEPMPGSIIYVGLADSVWSEMPDALNADILQTLTQRIP 219
>gb|EFW51577.1| YjbG polysaccharide synthesis-related protein [Shigella dysenteriae
CDC 74-1112]
Length=220
Score = 170 bits (431), Expect = 2e-40, Method: Compositional matrix adjust.
Identities = 94/220 (42%), Positives = 136/220 (61%), Gaps = 1/220 (0%)
Query 28 IYLPGEQQTLSVGPVENVVQLVTQPQLRDRLWWPGALLTDSAAKAKALKDYQHVMAQLAS 87
++ G + ++ E+++ LV QP+L + WWPGA++++ A A AL+ Q ++ +LA
Sbjct 1 MFSNGSSEAKTLTGAEHLIDLVGQPRLANS-WWPGAVISEELATAAALRQQQALLTRLAE 59
Query 88 WEAEADDDVAATIKSVRQQLLNLNITGRLPVKLDPDFVRVDENSNPPLVGDYTLYTVQRP 147
A++ D AA I ++RQQ+ L +TGR + LDPD VRV E NPPL G+YTL+ P
Sbjct 60 QAADSSADDAAAINALRQQIQALKVTGRQKINLDPDIVRVAERGNPPLQGNYTLWVGPPP 119
Query 148 VTITLLGAVSGAGQLPWQAGRSVTDYLQDHPRLAGADKNNVMVITPEGETVVAPVALWNK 207
T+TL G +S G+ P+ GR V YL L+GAD++ V+ P+G T APVA WNK
Sbjct 120 STVTLFGLISRPGKQPFTPGRDVASYLSGQNLLSGADRSYAWVVYPDGRTQKAPVAYWNK 179
Query 208 RHVEPPPGSQLWLGFSAHVLPEKYADLNDQIVSVLTQRVP 247
RHVEP PGS +++G + V E LN I+ LTQR+P
Sbjct 180 RHVEPMPGSIIYVGLADSVWSETPDALNADILQTLTQRIP 219
>ref|YP_002043474.1| hypothetical protein SNSL254_A4566 [Salmonella enterica subsp.
enterica serovar Newport str. SL254]
gb|ACF62021.1| conserved hypothetical protein [Salmonella enterica subsp. enterica
serovar Newport str. SL254]
Length=245
Score = 170 bits (431), Expect = 2e-40, Method: Compositional matrix adjust.
Identities = 96/239 (40%), Positives = 141/239 (58%), Gaps = 1/239 (0%)
Query 10 ASVLYVMTPHAFAQGTVTIYLPGEQQTLSVGPVENVVQLVTQPQLRDRLWWPGALLTDSA 69
A L + AFA GTVT++ G + ++ E ++ LV QP+L WWP A++ +
Sbjct 8 ALALAFIASSAFASGTVTVFTQGNSEPKTLTDAERLLDLVGQPRLATS-WWPAAVIGEEQ 66
Query 70 AKAKALKDYQHVMAQLASWEAEADDDVAATIKSVRQQLLNLNITGRLPVKLDPDFVRVDE 129
A A + Q ++ +LA+ AE D D AA I ++R+Q+ + +TGR V LDPD VRV E
Sbjct 67 ATVTARQQQQELLGRLAALSAEEDGDAAAAINTLRRQIQAVKVTGRQLVNLDPDVVRVSE 126
Query 130 NSNPPLVGDYTLYTVQRPVTITLLGAVSGAGQLPWQAGRSVTDYLQDHPRLAGADKNNVM 189
NPPL G YTL+ +P +TL G +S G P+ GR V YL+ L+GAD++
Sbjct 127 RGNPPLQGHYTLWVGPQPTQVTLFGLISQPGSQPFVPGRDVASYLEGQRLLSGADRSYAW 186
Query 190 VITPEGETVVAPVALWNKRHVEPPPGSQLWLGFSAHVLPEKYADLNDQIVSVLTQRVPD 248
V+ P+G + APVA WNKRH+EP PGS +++GF+ + +N I+ LTQR+P+
Sbjct 187 VVYPDGRSQKAPVAYWNKRHIEPMPGSIIFVGFADSLWRGTPEAMNADILHTLTQRIPE 245
>ref|ZP_03064544.1| conserved hypothetical protein [Shigella dysenteriae 1012]
gb|EDX35540.1| conserved hypothetical protein [Shigella dysenteriae 1012]
Length=220
Score = 170 bits (431), Expect = 2e-40, Method: Compositional matrix adjust.
Identities = 94/220 (42%), Positives = 136/220 (61%), Gaps = 1/220 (0%)
Query 28 IYLPGEQQTLSVGPVENVVQLVTQPQLRDRLWWPGALLTDSAAKAKALKDYQHVMAQLAS 87
++ G + ++ E+++ LV QP+L + WWPGA++++ A A AL+ Q ++ +LA
Sbjct 1 MFSNGSSEAKTLTGAEHLIDLVGQPRLANS-WWPGAVISEELATAAALRQQQALLTRLAE 59
Query 88 WEAEADDDVAATIKSVRQQLLNLNITGRLPVKLDPDFVRVDENSNPPLVGDYTLYTVQRP 147
A++ D AA I ++RQQ+ L +TGR + LDPD VRV E NPPL G+YTL+ P
Sbjct 60 QGADSSADDAAAINALRQQIQALKVTGRQKINLDPDIVRVAERGNPPLQGNYTLWVGPPP 119
Query 148 VTITLLGAVSGAGQLPWQAGRSVTDYLQDHPRLAGADKNNVMVITPEGETVVAPVALWNK 207
T+TL G +S G+ P+ GR V YL L+GAD++ V+ P+G T APVA WNK
Sbjct 120 STVTLFGLISRPGKQPFTPGRDVASYLSGQNLLSGADRSYAWVVYPDGRTQKAPVAYWNK 179
Query 208 RHVEPPPGSQLWLGFSAHVLPEKYADLNDQIVSVLTQRVP 247
RHVEP PGS +++G + V E LN I+ LTQR+P
Sbjct 180 RHVEPMPGSIIYVGLADSVWSETPDALNADILQTLTQRIP 219
>ref|YP_002149138.1| hypothetical protein SeAg_B4481 [Salmonella enterica subsp. enterica
serovar Agona str. SL483]
gb|ACH50713.1| conserved hypothetical protein [Salmonella enterica subsp. enterica
serovar Agona str. SL483]
Length=245
Score = 170 bits (431), Expect = 2e-40, Method: Compositional matrix adjust.
Identities = 96/245 (39%), Positives = 143/245 (58%), Gaps = 1/245 (0%)
Query 4 LQSYFIASVLYVMTPHAFAQGTVTIYLPGEQQTLSVGPVENVVQLVTQPQLRDRLWWPGA 63
++ A L + AFA GTVT++ G + ++ E ++ LV QP+L WWP A
Sbjct 2 MKRMISALALAFIASSAFASGTVTVFTQGNSEPKTLTDAERLLDLVGQPRLATS-WWPAA 60
Query 64 LLTDSAAKAKALKDYQHVMAQLASWEAEADDDVAATIKSVRQQLLNLNITGRLPVKLDPD 123
++ + A A + Q ++ +LA+ AE D D AA I ++R+Q+ + +TGR V LDPD
Sbjct 61 VIGEEQATVTARQQQQELLGRLAALSAEEDGDAAAAINTLRRQIQAVKVTGRQFVNLDPD 120
Query 124 FVRVDENSNPPLVGDYTLYTVQRPVTITLLGAVSGAGQLPWQAGRSVTDYLQDHPRLAGA 183
VRV E NPPL G YTL+ +P +TL G +S G P+ GR V YL+ L+GA
Sbjct 121 VVRVSERGNPPLQGHYTLWVGPQPTQVTLFGLISQPGSQPFVPGRDVASYLEGQRLLSGA 180
Query 184 DKNNVMVITPEGETVVAPVALWNKRHVEPPPGSQLWLGFSAHVLPEKYADLNDQIVSVLT 243
D++ V+ P+G + APVA WNKRH+EP PGS +++GF+ + +N I+ LT
Sbjct 181 DRSYAWVVYPDGRSQKAPVAYWNKRHIEPMPGSIIFVGFADSLWRGTPEAINADILHTLT 240
Query 244 QRVPD 248
QR+P+
Sbjct 241 QRIPE 245
>ref|ZP_02785878.2| conserved hypothetical protein [Escherichia coli O157:H7 str.
EC4501]
gb|EDU86949.1| conserved hypothetical protein [Escherichia coli O157:H7 str.
EC4501]
gb|EFW65521.1| YjbG polysaccharide synthesis-related protein [Escherichia coli
O157:H7 str. EC1212]
Length=220
Score = 170 bits (430), Expect = 2e-40, Method: Compositional matrix adjust.
Identities = 94/220 (42%), Positives = 135/220 (61%), Gaps = 1/220 (0%)
Query 28 IYLPGEQQTLSVGPVENVVQLVTQPQLRDRLWWPGALLTDSAAKAKALKDYQHVMAQLAS 87
++ G + ++ E+++ LV QP+L + WWPGA++++ A A AL+ Q ++ +LA
Sbjct 1 MFSNGSSEAKTLTGAEHLIDLVGQPRLANS-WWPGAVISEELATAAALRQQQALLTRLAE 59
Query 88 WEAEADDDVAATIKSVRQQLLNLNITGRLPVKLDPDFVRVDENSNPPLVGDYTLYTVQRP 147
A++ D AA I ++RQQ+ L +TGR + LDPD VRV E +NPPL G+YTL+ P
Sbjct 60 QGADSSADDAAAINALRQQIQALKVTGRQKINLDPDIVRVAERANPPLQGNYTLWVGPPP 119
Query 148 VTITLLGAVSGAGQLPWQAGRSVTDYLQDHPRLAGADKNNVMVITPEGETVVAPVALWNK 207
T+TL G +S G P+ GR V YL L GAD++ V+ P+G T APVA WNK
Sbjct 120 STVTLFGLISRPGNQPFTPGRDVASYLSGQSLLGGADRSYAWVVYPDGRTQKAPVAYWNK 179
Query 208 RHVEPPPGSQLWLGFSAHVLPEKYADLNDQIVSVLTQRVP 247
RHVEP PGS +++G + V E LN I+ LTQR+P
Sbjct 180 RHVEPMPGSIIYVGLADSVWSEMPDALNADILQTLTQRIP 219
>ref|ZP_02833337.1| conserved hypothetical protein [Salmonella enterica subsp. enterica
serovar Weltevreden str. HI_N05-537]
gb|EDZ28933.1| conserved hypothetical protein [Salmonella enterica subsp. enterica
serovar Weltevreden str. HI_N05-537]
emb|CBY98390.1| Uncharacterized protein yjbG Flags: Precursor [Salmonella enterica
subsp. enterica serovar Weltevreden str. 2007-60-3289-1]
Length=245
Score = 169 bits (429), Expect = 3e-40, Method: Compositional matrix adjust.
Identities = 96/239 (40%), Positives = 141/239 (58%), Gaps = 1/239 (0%)
Query 10 ASVLYVMTPHAFAQGTVTIYLPGEQQTLSVGPVENVVQLVTQPQLRDRLWWPGALLTDSA 69
A L + AFA GTVT++ G + ++ E ++ LV QP+L WWP A++ +
Sbjct 8 ALALAFVASSAFASGTVTVFTQGNSEPKTLTDAERLLDLVGQPRLATS-WWPAAVIGEEQ 66
Query 70 AKAKALKDYQHVMAQLASWEAEADDDVAATIKSVRQQLLNLNITGRLPVKLDPDFVRVDE 129
A A + Q ++ +LA+ AE D D AA I ++R+Q+ + +TGR V LDPD VRV E
Sbjct 67 ATVTARQQQQELLGRLAALGAEEDGDAAAAINTLRRQIQAVKVTGRQFVNLDPDVVRVSE 126
Query 130 NSNPPLVGDYTLYTVQRPVTITLLGAVSGAGQLPWQAGRSVTDYLQDHPRLAGADKNNVM 189
NPPL G YTL+ +P +TL G +S G P+ GR V YL+ L+GAD++
Sbjct 127 RGNPPLQGHYTLWVGPQPTQVTLFGLISQLGSQPFVPGRDVASYLEGQRLLSGADRSYAW 186
Query 190 VITPEGETVVAPVALWNKRHVEPPPGSQLWLGFSAHVLPEKYADLNDQIVSVLTQRVPD 248
V+ P+G + APVA WNKRH+EP PGS +++GF+ + +N I+ LTQR+P+
Sbjct 187 VVYPDGRSQKAPVAYWNKRHIEPMPGSIIFVGFADSLWRGTPEAINADILHTLTQRIPE 245
>ref|NP_463089.1| periplasmic protein [Salmonella enterica subsp. enterica serovar
Typhimurium str. LT2]
ref|ZP_02572509.1| conserved hypothetical protein [Salmonella enterica subsp. enterica
serovar 4,[5],12:i:- str. CVM23701]
ref|ZP_02668549.1| conserved hypothetical protein [Salmonella enterica subsp. enterica
serovar Heidelberg str. SL486]
ref|YP_002048215.1| hypothetical protein SeHA_C4566 [Salmonella enterica subsp. enterica
serovar Heidelberg str. SL476]
ref|ZP_03161919.1| conserved hypothetical protein [Salmonella enterica subsp. enterica
serovar Saintpaul str. SARA23]
gb|AAL23048.1| putative periplasmic protein [Salmonella enterica subsp. enterica
serovar Typhimurium str. LT2]
gb|ACF68307.1| conserved hypothetical protein [Salmonella enterica subsp. enterica
serovar Heidelberg str. SL476]
gb|EDY22720.1| conserved hypothetical protein [Salmonella enterica subsp. enterica
serovar Saintpaul str. SARA23]
gb|EDZ16971.1| conserved hypothetical protein [Salmonella enterica subsp. enterica
serovar 4,[5],12:i:- str. CVM23701]
gb|EDZ24092.1| conserved hypothetical protein [Salmonella enterica subsp. enterica
serovar Heidelberg str. SL486]
emb|CBG27185.1| putative exported protein [Salmonella enterica subsp. enterica
serovar Typhimurium str. D23580]
gb|ACY91417.1| putative periplasmic protein [Salmonella enterica subsp. enterica
serovar Typhimurium str. 14028S]
emb|CBW20248.1| putative exported protein [Salmonella enterica subsp. enterica
serovar Typhimurium str. SL1344]
dbj|BAJ39302.1| hypothetical protein STMDT12_C43590 [Salmonella enterica subsp.
enterica serovar Typhimurium str. T000240]
gb|EFX48161.1| YjbG polysaccharide synthesis-related protein [Salmonella enterica
subsp. enterica serovar Typhimurium str. TN061786]
gb|ADX19993.1| Uncharacterized protein yjbG [Salmonella enterica subsp. enterica
serovar Typhimurium str. ST4/74]
gb|AEF10023.1| putative periplasmic protein [Salmonella enterica subsp. enterica
serovar Typhimurium str. UK-1]
Length=245
Score = 169 bits (427), Expect = 4e-40, Method: Compositional matrix adjust.
Identities = 96/239 (40%), Positives = 140/239 (58%), Gaps = 1/239 (0%)
Query 10 ASVLYVMTPHAFAQGTVTIYLPGEQQTLSVGPVENVVQLVTQPQLRDRLWWPGALLTDSA 69
A L + AFA GTVT++ G + ++ E ++ LV QP+L WWP A++ +
Sbjct 8 ALALAFVASSAFASGTVTVFTQGNSEPKTLTDAERLLDLVGQPRLATS-WWPAAVIGEEQ 66
Query 70 AKAKALKDYQHVMAQLASWEAEADDDVAATIKSVRQQLLNLNITGRLPVKLDPDFVRVDE 129
A A + Q ++ +LA AE D D AA I ++R+Q+ + +TGR V LDPD VRV E
Sbjct 67 ATVTARQQQQELLGRLAVLSAEEDGDAAAAINTLRRQIQAVKVTGRQFVNLDPDVVRVSE 126
Query 130 NSNPPLVGDYTLYTVQRPVTITLLGAVSGAGQLPWQAGRSVTDYLQDHPRLAGADKNNVM 189
NPPL G Y L+ +P +TL G +S G P+ GR V YL+D L+GAD++
Sbjct 127 RGNPPLQGHYMLWVGPQPTQVTLFGLISQPGSQPFVPGRDVASYLEDQRLLSGADRSYAW 186
Query 190 VITPEGETVVAPVALWNKRHVEPPPGSQLWLGFSAHVLPEKYADLNDQIVSVLTQRVPD 248
V+ P+G + APVA WNKRH+EP PGS +++GF+ + +N I+ LTQR+P+
Sbjct 187 VVYPDGRSQKAPVAYWNKRHIEPMPGSIIFVGFADSLWRGTPEAINADILHTLTQRIPE 245
>ref|ZP_06537804.1| hypothetical protein Salmonellaentericaenterica_23277 [Salmonella
enterica subsp. enterica serovar Typhi str. AG3]
Length=229
Score = 168 bits (426), Expect = 6e-40, Method: Compositional matrix adjust.
Identities = 89/214 (41%), Positives = 129/214 (60%), Gaps = 2/214 (0%)
Query 10 ASVLYVMTPHAFAQGTVTIYLPGEQQTLSVGPVENVVQLVTQPQLRDRLWWPGALLTDSA 69
A L + AFA GTVT++ G + ++ E ++ LV QP+L WWP A++ +
Sbjct 8 ALALAFVASSAFASGTVTVFTQGNSEPKTLTDAERLLDLVGQPRLATS-WWPAAVIGEEQ 66
Query 70 AKAKALKDYQHVMAQLASWEAEADDDVAATIKSVRQQLLNLNITGRLPVKLDPDFVRVDE 129
A A + Q ++ +LA+ AE D D AA I ++R+Q+ + +TGR V LDPD VRV E
Sbjct 67 ATVTARQQQQELLGRLAALSAEEDGDAAA-INTLRRQIQAVKVTGRQFVNLDPDVVRVSE 125
Query 130 NSNPPLVGDYTLYTVQRPVTITLLGAVSGAGQLPWQAGRSVTDYLQDHPRLAGADKNNVM 189
NPPL G YTL+ +P +TL G +S G P+ GR V YL+ L+GAD++
Sbjct 126 RGNPPLQGHYTLWVGPQPTQVTLFGLISQPGSQPFVPGRDVASYLEGQRLLSGADRSYAW 185
Query 190 VITPEGETVVAPVALWNKRHVEPPPGSQLWLGFS 223
V+ P+G + APVA WNKRH+EP PGS +++GF+
Sbjct 186 VVYPDGRSQKAPVAYWNKRHIEPMPGSIIFVGFA 219
>ref|ZP_03373074.1| hypothetical protein SentesTyp_23395 [Salmonella enterica subsp.
enterica serovar Typhi str. E98-2068]
Length=230
Score = 168 bits (426), Expect = 6e-40, Method: Compositional matrix adjust.
Identities = 89/214 (41%), Positives = 129/214 (60%), Gaps = 2/214 (0%)
Query 10 ASVLYVMTPHAFAQGTVTIYLPGEQQTLSVGPVENVVQLVTQPQLRDRLWWPGALLTDSA 69
A L + AFA GTVT++ G + ++ E ++ LV QP+L WWP A++ +
Sbjct 8 ALALAFVASSAFASGTVTVFTQGNSEPKTLTDAERLLDLVGQPRLATS-WWPAAVIGEEQ 66
Query 70 AKAKALKDYQHVMAQLASWEAEADDDVAATIKSVRQQLLNLNITGRLPVKLDPDFVRVDE 129
A A + Q ++ +LA+ AE D D AA I ++R+Q+ + +TGR V LDPD VRV E
Sbjct 67 ATVTARQQQQELLGRLAALSAEEDGDAAA-INTLRRQIQAVKVTGRQFVNLDPDVVRVSE 125
Query 130 NSNPPLVGDYTLYTVQRPVTITLLGAVSGAGQLPWQAGRSVTDYLQDHPRLAGADKNNVM 189
NPPL G YTL+ +P +TL G +S G P+ GR V YL+ L+GAD++
Sbjct 126 RGNPPLQGHYTLWVGPQPTQVTLFGLISQPGSQPFVPGRDVASYLEGQRLLSGADRSYAW 185
Query 190 VITPEGETVVAPVALWNKRHVEPPPGSQLWLGFS 223
V+ P+G + APVA WNKRH+EP PGS +++GF+
Sbjct 186 VVYPDGRSQKAPVAYWNKRHIEPMPGSIIFVGFA 219
>ref|YP_002228797.1| hypothetical protein SG4066 [Salmonella enterica subsp. enterica
serovar Gallinarum str. 287/91]
ref|YP_002246029.1| hypothetical protein SEN3991 [Salmonella enterica subsp. enterica
serovar Enteritidis str. P125109]
emb|CAR39836.1| putative exported protein [Salmonella enterica subsp. enterica
serovar Gallinarum str. 287/91]
emb|CAR35557.1| putative exported protein [Salmonella enterica subsp. enterica
serovar Enteritidis str. P125109]
gb|EGE36491.1| Putative exported protein [Salmonella enterica subsp. enterica
serovar Gallinarum str. SG9]
Length=245
Score = 168 bits (425), Expect = 8e-40, Method: Compositional matrix adjust.
Identities = 95/245 (38%), Positives = 142/245 (57%), Gaps = 1/245 (0%)
Query 4 LQSYFIASVLYVMTPHAFAQGTVTIYLPGEQQTLSVGPVENVVQLVTQPQLRDRLWWPGA 63
++ A L + FA GTVT++ G + ++ E ++ LV QP+L WWP A
Sbjct 2 MKRMISALALAFIASSVFASGTVTVFTQGNSEPKTLTDAERLLDLVGQPRLATS-WWPAA 60
Query 64 LLTDSAAKAKALKDYQHVMAQLASWEAEADDDVAATIKSVRQQLLNLNITGRLPVKLDPD 123
++ + A A + Q ++ +LA+ AE D D AA I ++R+Q+ + +TGR V LDPD
Sbjct 61 VIGEEQATVTARQQQQELLGRLAALGAEEDGDAAAAINTLRRQIQAVKVTGRQLVNLDPD 120
Query 124 FVRVDENSNPPLVGDYTLYTVQRPVTITLLGAVSGAGQLPWQAGRSVTDYLQDHPRLAGA 183
VRV E NPPL G YTL+ +P +TL G +S G P+ GR V YL+ L+GA
Sbjct 121 VVRVSERGNPPLQGHYTLWVGPQPTQVTLFGLISQPGSQPFVPGRDVASYLEGQRLLSGA 180
Query 184 DKNNVMVITPEGETVVAPVALWNKRHVEPPPGSQLWLGFSAHVLPEKYADLNDQIVSVLT 243
D++ V+ P+G + APVA WNKRH+EP PGS +++GF+ + +N I+ LT
Sbjct 181 DRSYAWVVYPDGRSQKAPVAYWNKRHIEPMPGSIIFVGFADSLWRGTPKAINADILHTLT 240
Query 244 QRVPD 248
QR+P+
Sbjct 241 QRIPE 245
>ref|ZP_04656864.1| hypothetical protein SentesTe_18130 [Salmonella enterica subsp.
enterica serovar Tennessee str. CDC07-0191]
Length=245
Score = 168 bits (425), Expect = 8e-40, Method: Compositional matrix adjust.
Identities = 95/239 (39%), Positives = 140/239 (58%), Gaps = 1/239 (0%)
Query 10 ASVLYVMTPHAFAQGTVTIYLPGEQQTLSVGPVENVVQLVTQPQLRDRLWWPGALLTDSA 69
A L + FA GTVT++ G + ++ E ++ LV QP+L WWP A++ +
Sbjct 8 ALALAFIASSVFASGTVTVFTQGNSEPKTLTDAERLLDLVGQPRLATS-WWPAAVIGEEQ 66
Query 70 AKAKALKDYQHVMAQLASWEAEADDDVAATIKSVRQQLLNLNITGRLPVKLDPDFVRVDE 129
A A + Q ++ +LA+ AE D D AA I ++R+Q+ + +TGR V LDPD VRV E
Sbjct 67 ATVTARQQQQELLGRLAALSAEEDGDAAAAINTLRRQIQAVKVTGRQFVNLDPDVVRVSE 126
Query 130 NSNPPLVGDYTLYTVQRPVTITLLGAVSGAGQLPWQAGRSVTDYLQDHPRLAGADKNNVM 189
NPPL G YTL+ +P +TL G +S G P+ GR V YL+ L+GAD++
Sbjct 127 RGNPPLQGHYTLWVGPQPTQVTLFGLISQPGSQPFVPGRDVASYLEGQRLLSGADRSYAW 186
Query 190 VITPEGETVVAPVALWNKRHVEPPPGSQLWLGFSAHVLPEKYADLNDQIVSVLTQRVPD 248
V+ P+G + APVA WNKRH+EP PGS +++GF+ + +N I+ LTQR+P+
Sbjct 187 VVYPDGRSQKAPVAYWNKRHIEPMPGSIIFVGFADSLWRGTPEAINADILHTLTQRIPE 245
>gb|EGI88734.1| hypothetical protein SB521682_4838 [Shigella boydii 5216-82]
Length=220
Score = 168 bits (425), Expect = 9e-40, Method: Compositional matrix adjust.
Identities = 93/220 (42%), Positives = 135/220 (61%), Gaps = 1/220 (0%)
Query 28 IYLPGEQQTLSVGPVENVVQLVTQPQLRDRLWWPGALLTDSAAKAKALKDYQHVMAQLAS 87
++ G + ++ E+++ LV QP+L + WWPGA++++ A A AL+ Q ++ +LA
Sbjct 1 MFSNGSSEAKTLTGAEHLIDLVGQPRLANS-WWPGAVISEELATAAALRQQQALLTRLAE 59
Query 88 WEAEADDDVAATIKSVRQQLLNLNITGRLPVKLDPDFVRVDENSNPPLVGDYTLYTVQRP 147
A++ D AA I ++RQQ+ L +T R + LDPD VRV E NPPL G+YTL+ P
Sbjct 60 QGADSSADDAAAINALRQQIQALKVTSRQKINLDPDIVRVAERGNPPLQGNYTLWVGPPP 119
Query 148 VTITLLGAVSGAGQLPWQAGRSVTDYLQDHPRLAGADKNNVMVITPEGETVVAPVALWNK 207
T+TL G +S G+ P+ GR V YL L+GAD++ V+ P+G T APVA WNK
Sbjct 120 STVTLFGLISRPGKQPFTPGRDVASYLSGQNLLSGADRSYAWVVYPDGRTQKAPVAYWNK 179
Query 208 RHVEPPPGSQLWLGFSAHVLPEKYADLNDQIVSVLTQRVP 247
RHVEP PGS +++G + V E LN I+ LTQR+P
Sbjct 180 RHVEPMPGSIIYVGLADSVWSETPDALNADILQTLTQRIP 219
>ref|ZP_03364691.1| hypothetical protein SentesTyph_17318 [Salmonella enterica subsp.
enterica serovar Typhi str. E98-0664]
Length=187
Score = 149 bits (377), Expect = 3e-34, Method: Compositional matrix adjust.
Identities = 78/188 (41%), Positives = 114/188 (60%), Gaps = 1/188 (0%)
Query 61 PGALLTDSAAKAKALKDYQHVMAQLASWEAEADDDVAATIKSVRQQLLNLNITGRLPVKL 120
P A++ + A A + Q ++ +LA+ AE D D AA I ++R+Q+ + +TGR V L
Sbjct 1 PAAVIGEEQATVTARQQQQELLGRLAALSAEEDGDAAA-INTLRRQIQAVKVTGRQFVNL 59
Query 121 DPDFVRVDENSNPPLVGDYTLYTVQRPVTITLLGAVSGAGQLPWQAGRSVTDYLQDHPRL 180
DPD VRV E NPPL G YTL+ +P +TL G +S G P+ GR V YL+ L
Sbjct 60 DPDVVRVSERGNPPLQGHYTLWVGPQPTQVTLFGLISQPGSQPFVPGRDVASYLEGQRLL 119
Query 181 AGADKNNVMVITPEGETVVAPVALWNKRHVEPPPGSQLWLGFSAHVLPEKYADLNDQIVS 240
+GAD++ V+ P+G + APVA WNKRH+EP PGS +++GF+ + +N I+
Sbjct 120 SGADRSYAWVVYPDGRSQKAPVAYWNKRHIEPMPGSIIFVGFADSLWRGTPEAINADILH 179
Query 241 VLTQRVPD 248
LTQR+P+
Sbjct 180 TLTQRIPE 187
>gb|EFW61835.1| YjbG polysaccharide synthesis-related protein [Shigella flexneri
CDC 796-83]
gb|EGI92902.1| hypothetical protein SB359474_4677 [Shigella boydii 3594-74]
Length=185
Score = 149 bits (375), Expect = 5e-34, Method: Compositional matrix adjust.
Identities = 82/184 (44%), Positives = 114/184 (61%), Gaps = 0/184 (0%)
Query 64 LLTDSAAKAKALKDYQHVMAQLASWEAEADDDVAATIKSVRQQLLNLNITGRLPVKLDPD 123
++++ A A AL+ Q ++ +LA A++ D AA I ++RQQ+ L +TGR + LDPD
Sbjct 1 MISEELATAAALRQQQALLTRLAEQAADSSADDAAAINALRQQIQALKVTGRQKINLDPD 60
Query 124 FVRVDENSNPPLVGDYTLYTVQRPVTITLLGAVSGAGQLPWQAGRSVTDYLQDHPRLAGA 183
VRV E NPPL G+YTL+ P T+TL G +S G+ P+ GR V YL L+GA
Sbjct 61 IVRVAERGNPPLQGNYTLWVGPPPSTVTLFGLISRPGKQPFTPGRDVASYLSGQNLLSGA 120
Query 184 DKNNVMVITPEGETVVAPVALWNKRHVEPPPGSQLWLGFSAHVLPEKYADLNDQIVSVLT 243
D++ V+ P+G T APVA WNKRHVEP PGS +++G + V E LN I+ LT
Sbjct 121 DRSYAWVVYPDGRTQKAPVAYWNKRHVEPMPGSIIYVGLADSVWSETPDALNADILQTLT 180
Query 244 QRVP 247
QR+P
Sbjct 181 QRIP 184
>ref|YP_410318.1| hypothetical protein SBO_4056 [Shigella boydii Sb227]
gb|ABB68490.1| conserved hypothetical protein [Shigella boydii Sb227]
Length=174
Score = 141 bits (356), Expect = 9e-32, Method: Compositional matrix adjust.
Identities = 78/173 (45%), Positives = 107/173 (61%), Gaps = 0/173 (0%)
Query 75 LKDYQHVMAQLASWEAEADDDVAATIKSVRQQLLNLNITGRLPVKLDPDFVRVDENSNPP 134
++ Q ++ +LA A++ D AA I ++RQQ+ L +TGR + LDPD VRV E NPP
Sbjct 1 MRQQQALLTRLAEQAADSSADDAAAINALRQQIQALKVTGRQKINLDPDIVRVAERGNPP 60
Query 135 LVGDYTLYTVQRPVTITLLGAVSGAGQLPWQAGRSVTDYLQDHPRLAGADKNNVMVITPE 194
L G+YTL+ P T+TL G +S G+ P+ GR V YL L+GAD++ V+ P+
Sbjct 61 LQGNYTLWVGPPPSTVTLFGLISRPGKQPFTPGRDVASYLSGQNLLSGADRSYAWVVYPD 120
Query 195 GETVVAPVALWNKRHVEPPPGSQLWLGFSAHVLPEKYADLNDQIVSVLTQRVP 247
G T APVA WNKRHVEP PGS +++G + V E LN I+ LTQR+P
Sbjct 121 GRTQKAPVAYWNKRHVEPMPGSIIYVGLADSVWSETPDALNADILQTLTQRIP 173
>ref|ZP_03338398.1| hypothetical protein Salmonelentericaenterica_16032 [Salmonella
enterica subsp. enterica serovar Typhi str. 404ty]
Length=144
Score = 125 bits (314), Expect = 5e-27, Method: Compositional matrix adjust.
Identities = 63/144 (43%), Positives = 89/144 (61%), Gaps = 0/144 (0%)
Query 105 QQLLNLNITGRLPVKLDPDFVRVDENSNPPLVGDYTLYTVQRPVTITLLGAVSGAGQLPW 164
+Q+ + +TGR V LDPD VRV E NPPL G YTL+ +P +TL G +S G P+
Sbjct 1 RQIQAVKVTGRQFVNLDPDVVRVSERGNPPLQGHYTLWVGPQPTQVTLFGLISQPGSQPF 60
Query 165 QAGRSVTDYLQDHPRLAGADKNNVMVITPEGETVVAPVALWNKRHVEPPPGSQLWLGFSA 224
GR V YL+ L+GAD++ V+ P+G + APVA WNKRH+EP PGS +++GF+
Sbjct 61 VPGRDVASYLEGQRLLSGADRSYAWVVYPDGRSQKAPVAYWNKRHIEPMPGSIIFVGFAD 120
Query 225 HVLPEKYADLNDQIVSVLTQRVPD 248
+ +N I+ LTQR+P+
Sbjct 121 SLWRGTPEAINADILHTLTQRIPE 144
>gb|EDA50530.1| hypothetical protein GOS_1989170 [marine metagenome]
Length=179
Score = 125 bits (313), Expect = 8e-27, Method: Compositional matrix adjust.
Identities = 66/179 (36%), Positives = 105/179 (58%), Gaps = 5/179 (2%)
Query 75 LKDYQHVMAQLASWEAE--ADDDVAATIKSVRQQLLN---LNITGRLPVKLDPDFVRVDE 129
+ D ++++ L + + + D ++ A ++S +Q L+ LN+TGRLP+ +DP R E
Sbjct 1 IADKMNLLSDLKALQVQWMRDGNMGAWVQSSQQLLIEIDRLNVTGRLPIAIDPVINRAHE 60
Query 130 NSNPPLVGDYTLYTVQRPVTITLLGAVSGAGQLPWQAGRSVTDYLQDHPRLAGADKNNVM 189
+ NP L GDYTL+ R I G ++GA + + G + DY Q + LAGAD
Sbjct 61 DKNPLLSGDYTLFISPRSQFIYFTGLINGASRQLLREGAGLADYWQAYSLLAGADLAQAY 120
Query 190 VITPEGETVVAPVALWNKRHVEPPPGSQLWLGFSAHVLPEKYADLNDQIVSVLTQRVPD 248
+I P GE + PVA+WN+ H EP G+ L++GF +LP KY +LN +I ++L R+P+
Sbjct 121 LIQPTGEVSLVPVAVWNQLHREPMAGATLFVGFDTDLLPAKYKNLNLRIANLLANRIPE 179
>gb|EBB46452.1| hypothetical protein GOS_229183 [marine metagenome]
Length=180
Score = 124 bits (310), Expect = 2e-26, Method: Compositional matrix adjust.
Identities = 66/180 (36%), Positives = 102/180 (56%), Gaps = 1/180 (0%)
Query 4 LQSYFIASVLYVMTPHAFAQGTVTIYLPGEQQTLSVGPVENVVQLVTQPQLRDRLWWPGA 63
++ A + FA G+V + G + ++ E+++ LV QP+L + WWPGA
Sbjct 2 MKQMITALTFSLCAASVFAAGSVKVITTGSTEAKTLTGAEHLLDLVGQPRLSNS-WWPGA 60
Query 64 LLTDSAAKAKALKDYQHVMAQLASWEAEADDDVAATIKSVRQQLLNLNITGRLPVKLDPD 123
++++ A +A + Q ++A+LA+ AEA + A I ++RQQ+ L + GR + LDPD
Sbjct 61 VISEERATTEAQRQQQALLARLATLSAEASGEDAGAINALRQQIQALKVAGRQTINLDPD 120
Query 124 FVRVDENSNPPLVGDYTLYTVQRPVTITLLGAVSGAGQLPWQAGRSVTDYLQDHPRLAGA 183
VR E NPPL G+YTL+ +P ITLLG +S G+ + GR V YL RL+GA
Sbjct 121 VVRTSEPGNPPLQGNYTLWVGPQPTDITLLGLLSHTGKQLFIPGRDVASYLDGQHRLSGA 180
>ref|ZP_03830735.1| hypothetical protein PcarcW_05039 [Pectobacterium carotovorum
subsp. carotovorum WPP14]
Length=260
Score = 121 bits (303), Expect = 1e-25, Method: Compositional matrix adjust.
Identities = 84/258 (32%), Positives = 135/258 (52%), Gaps = 25/258 (9%)
Query 9 IASVLYVMTPHAFAQGTVTIYLPGEQQTLSVGPVENVVQL------VTQPQLRDRLWWPG 62
IA++L +++ A +T+ P Q+T++V +++ +L V+ PQ + W
Sbjct 7 IATLLLLVSGVA-TSAQLTVKSP--QETIAVVKLDDGTRLEKFYEQVSWPQ---NINWQT 60
Query 63 ALLTDSAAKAKALKDYQHVMAQLAS----WEAEADDDVAATIKSVRQQLLNLNITGRLPV 118
A ++D A K ++ +LA W D D+A + +R+ + +N+ GR+
Sbjct 61 AFISDFATTQKVRAQGDVLLQKLAELETRWRNSGDGDLAISAWLLRKTINPINVAGRIRT 120
Query 119 KLDPDFVRVDENSNPPLVGDYTLYTVQRPVTITLLGAVSGA---------GQLPWQAGRS 169
LDPD VRV +N PLVG+Y LY ++L+G V+ A G++ +AG S
Sbjct 121 DLDPDRVRVYIENNRPLVGEYALYVAPHDDKLSLIGLVNTAADVGELETSGKVALRAGWS 180
Query 170 VTDYLQDHPRLAGADKNNVMVITPEGETVVAPVALWNKRHVEPPPGSQLWLGFSAHVLPE 229
V +YL LAGAD + +I G+ P+ALWN++HVEP G L++GF+ VLP+
Sbjct 181 VDNYLSGRRLLAGADNSYGYLIAGNGKWRKVPLALWNRQHVEPAAGETLFIGFNPSVLPQ 240
Query 230 KYADLNDQIVSVLTQRVP 247
+ LNDQ+ L R P
Sbjct 241 DMSSLNDQLADYLANRTP 258
>ref|YP_003260364.1| hypothetical protein Pecwa_3011 [Pectobacterium wasabiae WPP163]
gb|ACX88757.1| protein of unknown function DUF1017 [Pectobacterium wasabiae
WPP163]
Length=260
Score = 119 bits (299), Expect = 3e-25, Method: Compositional matrix adjust.
Identities = 68/201 (33%), Positives = 105/201 (52%), Gaps = 13/201 (6%)
Query 60 WPGALLTDSAAKAKALKDYQHVMAQLAS----WEAEADDDVAATIKSVRQQLLNLNITGR 115
W A ++D A ++ +LA W D D+A + +R+ + +N+ GR
Sbjct 58 WQTAFISDFATTQNVRAQGDTLLQKLAELETRWRNSGDGDLAISAWLLRKAISPINVAGR 117
Query 116 LPVKLDPDFVRVDENSNPPLVGDYTLYTVQRPVTITLLGAVSGA---------GQLPWQA 166
+ LDPD VRV +N PLVG+Y LY ++L+G V+ + G++ +A
Sbjct 118 IHTNLDPDRVRVYIENNRPLVGEYALYVAPHDDKLSLIGLVNTSADVGELETSGKVALRA 177
Query 167 GRSVTDYLQDHPRLAGADKNNVMVITPEGETVVAPVALWNKRHVEPPPGSQLWLGFSAHV 226
G SV DYL LAGAD + +I G+ P+ALWN++H EP G +++GF+ V
Sbjct 178 GSSVEDYLAGRRLLAGADNSYGYLIGGNGQWRKVPLALWNRQHTEPAAGETIFIGFNPSV 237
Query 227 LPEKYADLNDQIVSVLTQRVP 247
LP+ + LNDQ+ L R+P
Sbjct 238 LPQDMSSLNDQLADYLANRIP 258
>ref|YP_003016906.1| hypothetical protein PC1_1323 [Pectobacterium carotovorum subsp.
carotovorum PC1]
gb|ACT12370.1| protein of unknown function DUF1017 [Pectobacterium carotovorum
subsp. carotovorum PC1]
Length=260
Score = 119 bits (298), Expect = 5e-25, Method: Compositional matrix adjust.
Identities = 82/258 (31%), Positives = 134/258 (51%), Gaps = 25/258 (9%)
Query 9 IASVLYVMTPHAFAQGTVTIYLPGEQQTLSVGPVENVVQL------VTQPQLRDRLWWPG 62
IA++L +++ A +T+ P QQT++V +++ +L V PQ + W
Sbjct 7 IATLLLLVSGVA-TSAQLTVKSP--QQTIAVVKLDDGTRLEKFYEQVPWPQ---NINWQT 60
Query 63 ALLTDSAAKAKALKDYQHVMAQLAS----WEAEADDDVAATIKSVRQQLLNLNITGRLPV 118
A ++D A K ++ +LA W D D+A + +R+ + +N+ GR+
Sbjct 61 AFISDFATTQKVRAQGDVLLQKLAELETRWRNSGDGDLAISAWLLRKTINPINVAGRIST 120
Query 119 KLDPDFVRVDENSNPPLVGDYTLYTVQRPVTITLLGAVSGA---------GQLPWQAGRS 169
LDPD VRV +N PLVG+Y LY ++L+G V+ + G++ +AG S
Sbjct 121 DLDPDRVRVYAENNRPLVGEYALYVAPHDDKLSLIGLVNTSADVGELETSGKVALRAGWS 180
Query 170 VTDYLQDHPRLAGADKNNVMVITPEGETVVAPVALWNKRHVEPPPGSQLWLGFSAHVLPE 229
V +YL +AGAD + +I G+ P+ALWN++H+EP G L++GF+ VLP+
Sbjct 181 VENYLSGRRLIAGADNSYGYLIGGNGQWRKVPLALWNRQHIEPAAGETLFIGFNPAVLPQ 240
Query 230 KYADLNDQIVSVLTQRVP 247
+ LNDQ+ L R P
Sbjct 241 DMSSLNDQLADYLANRTP 258
>gb|EFU99994.1| conserved hypothetical protein [Escherichia coli 3431]
Length=248
Score = 118 bits (295), Expect = 1e-24, Method: Compositional matrix adjust.
Identities = 70/241 (29%), Positives = 114/241 (47%), Gaps = 5/241 (2%)
Query 13 LYVMTPHAFAQGTVTIYLPG-EQQTLSVGPVENVVQLVTQPQLRDRLWWPGALLTDSAAK 71
L + + A V++Y G +QQ L + + L+ + + ++W A +
Sbjct 8 LLLFSSVCMADANVSLYFNGNQQQNLILSSDARLDTLLQSSHIPENVYWRSAQIATPEQH 67
Query 72 AKALKDYQHVMAQLAS----WEAEADDDVAATIKSVRQQLLNLNITGRLPVKLDPDFVRV 127
++ ++++L + W E A + + QQ+ L+++GRLP+ +DP V
Sbjct 68 KVIMQRQSALLSELQTIETLWRNEGKQKNADSTAKLYQQIAKLHLSGRLPITIDPYQVLR 127
Query 128 DENSNPPLVGDYTLYTVQRPVTITLLGAVSGAGQLPWQAGRSVTDYLQDHPRLAGADKNN 187
+ NP L G Y LY R I L G +S LP + G V Y Q GAD +
Sbjct 128 SKADNPRLDGQYQLYLASRASKIALFGLISALPNLPLEPGFGVDQYWQRSALQPGADTAH 187
Query 188 VMVITPEGETVVAPVALWNKRHVEPPPGSQLWLGFSAHVLPEKYADLNDQIVSVLTQRVP 247
V +I P G PVA+WNK H EP PG+ L +GF LP ++ +N +I ++ +VP
Sbjct 188 VWLIQPTGNIEKVPVAVWNKLHREPMPGATLLVGFDESQLPTRFEGINRRIAEIIANKVP 247
Query 248 D 248
+
Sbjct 248 E 248
>ref|ZP_03826818.1| hypothetical protein PcarbP_09375 [Pectobacterium carotovorum
subsp. brasiliensis PBR1692]
Length=260
Score = 117 bits (294), Expect = 1e-24, Method: Compositional matrix adjust.
Identities = 81/258 (31%), Positives = 134/258 (51%), Gaps = 25/258 (9%)
Query 9 IASVLYVMTPHAFAQGTVTIYLPGEQQTLSVGPVENVVQL------VTQPQLRDRLWWPG 62
IA++L +++ A +T+ P QQT++V +++ +L V PQ + W
Sbjct 7 IATLLLLVSGVA-TSAQLTVKSP--QQTIAVVKLDDGTRLEKFYEQVPWPQ---NINWQT 60
Query 63 ALLTDSAAKAKALKDYQHVMAQLAS----WEAEADDDVAATIKSVRQQLLNLNITGRLPV 118
A ++D A K ++ +LA W D D+A + +R+ + +N+ GR+
Sbjct 61 AFISDFATTQKVRAQGDVLLQKLAELETRWRNSGDGDLAISAWLLRKTINPINVAGRIRT 120
Query 119 KLDPDFVRVDENSNPPLVGDYTLYTVQRPVTITLLGAVSGA---------GQLPWQAGRS 169
LDPD VRV +N PLVG+Y LY ++L+G V+ + G++ +AG S
Sbjct 121 DLDPDRVRVYAENNRPLVGEYALYVAPHDDKLSLIGLVNTSADVGELETSGKVALRAGWS 180
Query 170 VTDYLQDHPRLAGADKNNVMVITPEGETVVAPVALWNKRHVEPPPGSQLWLGFSAHVLPE 229
+YL LAGAD + +I G+ P+ALWN++H+EP G L++GF+ VLP+
Sbjct 181 AENYLSGRRLLAGADNSYGYLIAGNGQWRKVPLALWNRQHIEPAAGETLFIGFNPAVLPQ 240
Query 230 KYADLNDQIVSVLTQRVP 247
+ + LN+Q+ L R P
Sbjct 241 EMSSLNEQLADYLANRTP 258
>ref|YP_049553.1| hypothetical protein ECA1447 [Pectobacterium atrosepticum SCRI1043]
emb|CAG74357.1| putative exported protein [Pectobacterium atrosepticum SCRI1043]
Length=260
Score = 117 bits (293), Expect = 1e-24, Method: Compositional matrix adjust.
Identities = 80/258 (31%), Positives = 135/258 (52%), Gaps = 25/258 (9%)
Query 9 IASVLYVMTPHAFAQGTVTIYLPGEQQTLSVGPVENVVQL------VTQPQLRDRLWWPG 62
IA++L +++ A + +T+ P Q+T++V +++ +L V+ PQ + W
Sbjct 7 IATLLLLISGVAMS-AQLTVKSP--QETIAVVKLDDGTRLEKFYEQVSWPQ---NINWQT 60
Query 63 ALLTDSAAKAKALKDYQHVMAQLAS----WEAEADDDVAATIKSVRQQLLNLNITGRLPV 118
A ++D A K ++ +LA W D D+A + +R+ + +N+ GR+
Sbjct 61 AFISDFATTQKVRVQGDVLLQKLAELETRWRNSGDGDLAISAWLLRKAINPINVAGRIRT 120
Query 119 KLDPDFVRVDENSNPPLVGDYTLYTVQRPVTITLLGAVSGA---------GQLPWQAGRS 169
LDPD VRV +N PLVG+Y LY + L+G V+ + G + +AG S
Sbjct 121 NLDPDRVRVYSENNRPLVGEYALYVAPHDDKLALIGLVNTSADVGELETSGNVVLRAGWS 180
Query 170 VTDYLQDHPRLAGADKNNVMVITPEGETVVAPVALWNKRHVEPPPGSQLWLGFSAHVLPE 229
V +YL LAGAD + +I G+ P+ALWN++H+EP G +++GF+ VLP+
Sbjct 181 VENYLVGRRLLAGADNSYGYLIGGNGQWRKVPLALWNRQHIEPAAGETIFIGFNPSVLPQ 240
Query 230 KYADLNDQIVSVLTQRVP 247
+ LN+Q+ L R+P
Sbjct 241 DMSSLNEQLADYLANRIP 258
>ref|YP_003005145.1| hypothetical protein Dd1591_2844 [Dickeya zeae Ech1591]
gb|ACT07666.1| protein of unknown function DUF1017 [Dickeya zeae Ech1591]
Length=259
Score = 117 bits (292), Expect = 2e-24, Method: Compositional matrix adjust.
Identities = 70/214 (32%), Positives = 100/214 (46%), Gaps = 13/214 (6%)
Query 47 QLVTQPQLRDRLWWPGALLTDSAAKAKALKDYQHVMAQLAS----WEAEADDDVAATIKS 102
Q Q D++ W ALLT+ K + + + A+L W+ E D D A
Sbjct 45 QFYGQMAFPDKVNWQTALLTNERVTEKVREKGEKLQARLYQLQLVWQVEGDGDWAIAAWY 104
Query 103 VRQQLLNLNITGRLPVKLDPDFVRVDENSNPPLVGDYTLYTVQRPVTITLLGAVSGA--- 159
+ Q L +N GR+ +DP+ VR+ + N PLVG+YTLY L G +S
Sbjct 105 MAQALKQVNTVGRIRASIDPEIVRLSQRDNRPLVGNYTLYLSPYRQQFFLFGLISTGIDI 164
Query 160 ------GQLPWQAGRSVTDYLQDHPRLAGADKNNVMVITPEGETVVAPVALWNKRHVEPP 213
+ Q G SV Y+ L G D + +I+ EG P+A+WN RH EP
Sbjct 165 GTPHVFKDIDLQPGWSVEQYIGRRRFLPGGDSRDGYLISGEGHWRKVPLAVWNSRHHEPA 224
Query 214 PGSQLWLGFSAHVLPEKYADLNDQIVSVLTQRVP 247
G L++GF +LPE + LN+QI L R+P
Sbjct 225 AGETLFIGFDPSILPEGFTSLNEQIADYLANRIP 258
>ref|YP_003882187.1| hypothetical protein Dda3937_03274 [Dickeya dadantii 3937]
gb|ADM97630.1| hypothetical protein Dda3937_03274 [Dickeya dadantii 3937]
Length=259
Score = 115 bits (288), Expect = 6e-24, Method: Compositional matrix adjust.
Identities = 70/214 (32%), Positives = 99/214 (46%), Gaps = 13/214 (6%)
Query 47 QLVTQPQLRDRLWWPGALLTDSAAKAKALKDYQHVMAQLAS----WEAEADDDVAATIKS 102
Q Q D+ W ALLT+ K + + + A+L W+AE D D A
Sbjct 45 QFYGQTAFPDKANWQTALLTNERVTEKVREKGEKLQARLYQLQLVWQAEGDGDWAIAAWY 104
Query 103 VRQQLLNLNITGRLPVKLDPDFVRVDENSNPPLVGDYTLYTVQRPVTITLLGAVSGA--- 159
+ Q L +N GR+ +DPD VR+ N PLVG+YTLY L G +S
Sbjct 105 MAQALKQVNTVGRIRASIDPDIVRLSPRDNRPLVGNYTLYLSPYRDQFFLFGLISTGVDI 164
Query 160 ------GQLPWQAGRSVTDYLQDHPRLAGADKNNVMVITPEGETVVAPVALWNKRHVEPP 213
+ Q+G SV Y+ L G D + +I G P+A+WN++H EP
Sbjct 165 GTPNVFKDIDLQSGWSVEQYIGRRRFLPGGDNRDGYLIAGNGHWRKVPLAVWNRQHNEPA 224
Query 214 PGSQLWLGFSAHVLPEKYADLNDQIVSVLTQRVP 247
G L++GF +LPE + LN+QI L R+P
Sbjct 225 AGETLFIGFDPSILPEGFTSLNEQIADYLANRIP 258
>ref|YP_691475.1| hypothetical protein SFV_4186 [Shigella flexneri 5 str. 8401]
gb|ABF06170.1| conserved hypothetical protein [Shigella flexneri 5 str. 8401]
Length=213
Score = 114 bits (284), Expect = 2e-23, Method: Compositional matrix adjust.
Identities = 65/164 (39%), Positives = 98/164 (59%), Gaps = 6/164 (3%)
Query 4 LQSYFIASVLYVMTPHAFAQGTVTIYLPGEQQTLSVGPVENVVQLVTQPQLRDRLWWPGA 63
++ +A +L V FA GTV ++ G + ++ E+++ LV QP+L + WWPGA
Sbjct 2 IKQTIVALLLSVGASSVFAAGTVKVFSNGSSEAKTLTGAEHLIDLVGQPRLANS-WWPGA 60
Query 64 LLTDSAAKAKALKDYQHVMAQLASWEAEADDDVAATIKSVRQQLLNLNITGRLPVKLDPD 123
++++ A A AL+ Q ++ +LA A++ D AA I ++RQQ+ L +TGR + LDPD
Sbjct 61 VISEELATAAALRQQQALLTRLAEQGADSSTDDAAAINALRQQIQALKVTGRQKINLDPD 120
Query 124 FVRVDENSNPPLVGDYTLYTVQRPVTITLLGAVSGAGQLP-WQA 166
VRV E NPPL G+YTL+ V P TL G +S G P WQ+
Sbjct 121 IVRVAERGNPPLQGNYTLW-VGPP---TLFGLISHPGNQPSWQS 160
>ref|ZP_06715180.1| conserved hypothetical protein [Edwardsiella tarda ATCC 23685]
gb|EFE22503.1| conserved hypothetical protein [Edwardsiella tarda ATCC 23685]
Length=248
Score = 114 bits (284), Expect = 2e-23, Method: Compositional matrix adjust.
Identities = 61/205 (29%), Positives = 109/205 (53%), Gaps = 4/205 (1%)
Query 48 LVTQPQLRDRLWWPGALLTDSAAKAKALKDYQHVMAQLAS----WEAEADDDVAATIKSV 103
L+ +L ++W A +T A + + + ++++L + W + ++ A + + +
Sbjct 44 LLNDSRLPSDIYWRSAQITTPAHQVAIKQQRRALLSELGALETLWRQQGEEAWAESTRHL 103
Query 104 RQQLLNLNITGRLPVKLDPDFVRVDENSNPPLVGDYTLYTVQRPVTITLLGAVSGAGQLP 163
+QL L +TGRLP+ LDP + + NP LVGDY L+ R + +LG + LP
Sbjct 104 IRQLSALRLTGRLPIVLDPRQAQRSQADNPRLVGDYQLFIAPRRAQVMMLGLIHALPTLP 163
Query 164 WQAGRSVTDYLQDHPRLAGADKNNVMVITPEGETVVAPVALWNKRHVEPPPGSQLWLGFS 223
+ V++Y + L AD +V +I P G+ PVA+WNKR EP G+ + +GF
Sbjct 164 LIPAQGVSEYWRRDALLPAADSAHVWLIQPTGDISQVPVAVWNKRLREPMAGASILIGFD 223
Query 224 AHVLPEKYADLNDQIVSVLTQRVPD 248
+ LP ++ +N +I +++ RVP+
Sbjct 224 PNTLPSRFQGINQRIAEIISNRVPE 248
>ref|YP_962878.1| hypothetical protein Sputw3181_1486 [Shewanella sp. W3-18-1]
gb|ABM24324.1| protein of unknown function DUF1017 [Shewanella sp. W3-18-1]
Length=261
Score = 112 bits (280), Expect = 5e-23, Method: Compositional matrix adjust.
Identities = 68/189 (35%), Positives = 101/189 (53%), Gaps = 3/189 (1%)
Query 58 LWWPGALLTDSAAKAKALKDYQHVMAQLASW-EAEADDDVAATIKSVRQQLLNLNITGRL 116
++W GA L D A Q V++QLA EA D + + Q L + + R+
Sbjct 73 IYWLGAALVDLENTAVLETKRQQVLSQLAQMGEATNDSRYITKLAQLAQFLRGIKLGQRV 132
Query 117 PVKLDPDFVRVDENSNPPLVGDYTLYTVQRPVTITLLGAVSGAGQLPWQAGRSVTDYLQD 176
LD D +R+ E+ N + G Y L RP TITL+GAVS G +PWQ+ S DYLQ
Sbjct 133 MQPLDIDAIRITESYNAIIEGKYQLVLPPRPSTITLVGAVSQTGNMPWQSQASSKDYLQQ 192
Query 177 HPRLAGADKNNVMVITPEGETVVAPVALWNKRHVEPPPGSQLWLGFSAHVLPEKYADLND 236
L A+ + V +I P+G + P+A WN + + PG+ L++ FS+ L + YA+LN+
Sbjct 193 AGLLENAETSFVWIIQPDGNAIRQPIAYWNYQAQDIAPGATLFVEFSS--LFDGYANLNE 250
Query 237 QIVSVLTQR 245
I+ +L R
Sbjct 251 NIIELLKNR 259
>gb|ADV54996.1| protein of unknown function DUF1017 [Shewanella putrefaciens
200]
Length=261
Score = 111 bits (278), Expect = 1e-22, Method: Compositional matrix adjust.
Identities = 67/189 (35%), Positives = 101/189 (53%), Gaps = 3/189 (1%)
Query 58 LWWPGALLTDSAAKAKALKDYQHVMAQLASW-EAEADDDVAATIKSVRQQLLNLNITGRL 116
++W GA L D A Q V++QLA EA D + + Q L + + R+
Sbjct 73 IYWLGAALVDLENTAVLETKRQQVLSQLAQMGEATNDSRYITKLAQLAQFLRGIKLGQRV 132
Query 117 PVKLDPDFVRVDENSNPPLVGDYTLYTVQRPVTITLLGAVSGAGQLPWQAGRSVTDYLQD 176
LD D +R+ E+ N + G Y L RP TITL+GAVS G +PWQ+ S DYL+
Sbjct 133 MQPLDIDAIRITESYNAIIEGKYQLVLPPRPSTITLVGAVSQTGNMPWQSQASSKDYLKQ 192
Query 177 HPRLAGADKNNVMVITPEGETVVAPVALWNKRHVEPPPGSQLWLGFSAHVLPEKYADLND 236
L A+ + V +I P+G + P+A WN + + PG+ L++ FS+ L + YA+LN+
Sbjct 193 AGLLENAETSFVWIIQPDGNAIRQPIAYWNHQAQDIAPGATLFVEFSS--LFDGYANLNE 250
Query 237 QIVSVLTQR 245
I+ +L R
Sbjct 251 NIIELLKNR 259
>ref|YP_001184042.1| hypothetical protein Sputcn32_2522 [Shewanella putrefaciens CN-32]
gb|ABP76243.1| protein of unknown function DUF1017 [Shewanella putrefaciens
CN-32]
Length=261
Score = 110 bits (274), Expect = 3e-22, Method: Compositional matrix adjust.
Identities = 66/189 (34%), Positives = 101/189 (53%), Gaps = 3/189 (1%)
Query 58 LWWPGALLTDSAAKAKALKDYQHVMAQLASW-EAEADDDVAATIKSVRQQLLNLNITGRL 116
++W GA L D A Q V++QLA EA D + + Q L + + R+
Sbjct 73 IYWLGAALVDLENTAVLETKRQQVLSQLAQMGEATNDSRYITKLAQLAQFLRGIKLGQRV 132
Query 117 PVKLDPDFVRVDENSNPPLVGDYTLYTVQRPVTITLLGAVSGAGQLPWQAGRSVTDYLQD 176
LD D +R+ E+ N + G Y L RP TITL+GAV+ G +PWQ+ S DYL+
Sbjct 133 MQPLDIDAIRITESYNAIIEGKYQLVLPPRPSTITLVGAVTQTGNMPWQSQTSSKDYLKQ 192
Query 177 HPRLAGADKNNVMVITPEGETVVAPVALWNKRHVEPPPGSQLWLGFSAHVLPEKYADLND 236
L A+ + V +I P+G + P+A WN + + PG+ L++ FS+ L + YA+LN+
Sbjct 193 AGLLENAETSFVWIIQPDGNAIRQPIAYWNYQAQDIAPGATLFVEFSS--LFDGYANLNE 250
Query 237 QIVSVLTQR 245
I+ +L R
Sbjct 251 NIIELLKNR 259
>ref|YP_737462.1| hypothetical protein Shewmr7_1406 [Shewanella sp. MR-7]
gb|ABI42405.1| protein of unknown function DUF1017 [Shewanella sp. MR-7]
Length=261
Score = 108 bits (271), Expect = 6e-22, Method: Compositional matrix adjust.
Identities = 64/189 (33%), Positives = 103/189 (54%), Gaps = 3/189 (1%)
Query 58 LWWPGALLTDSAAKAKALKDYQHVMAQLASWEAEADDD-VAATIKSVRQQLLNLNITGRL 116
++W GA L D AK QHV+ QLA+ +AD+ A + Q L N+ + R+
Sbjct 73 IYWLGATLLDLQNTAKLEATRQHVLQQLANMGQQADNSQYIAKLSKFAQFLRNIKLGQRV 132
Query 117 PVKLDPDFVRVDENSNPPLVGDYTLYTVQRPVTITLLGAVSGAGQLPWQAGRSVTDYLQD 176
LD D +R+ + NP + G + L RP T+T++GAV+ G+ WQ+ S +YL+
Sbjct 133 NQPLDLDLIRITDAYNPIIDGQFLLVLPPRPTTVTVVGAVAQTGEQEWQSRASSKNYLKQ 192
Query 177 HPRLAGADKNNVMVITPEGETVVAPVALWNKRHVEPPPGSQLWLGFSAHVLPEKYADLND 236
L A+ + V +I P+G + P+A WN + + PG+ L++ FSA L + Y+ LN+
Sbjct 193 AGLLDNAENSFVWIIQPDGNAIRQPIAYWNHQAQDIAPGAVLFVEFSA--LFDGYSTLNN 250
Query 237 QIVSVLTQR 245
I+ +L R
Sbjct 251 NIIELLKNR 259
>ref|YP_733476.1| hypothetical protein Shewmr4_1341 [Shewanella sp. MR-4]
gb|ABI38419.1| protein of unknown function DUF1017 [Shewanella sp. MR-4]
Length=261
Score = 106 bits (264), Expect = 3e-21, Method: Compositional matrix adjust.
Identities = 63/189 (33%), Positives = 102/189 (53%), Gaps = 3/189 (1%)
Query 58 LWWPGALLTDSAAKAKALKDYQHVMAQLASWEAEADDD-VAATIKSVRQQLLNLNITGRL 116
++W GA L D AK QHV+ QLA+ +AD+ A + Q L N+ + R+
Sbjct 73 IYWLGATLLDLQNTAKLEATRQHVLQQLANMGQQADNSQYIAKLSKFAQFLRNIKLGQRV 132
Query 117 PVKLDPDFVRVDENSNPPLVGDYTLYTVQRPVTITLLGAVSGAGQLPWQAGRSVTDYLQD 176
LD D +R+ + NP + G + L RP T+T++GAV+ + WQ+ S +YL+
Sbjct 133 NQPLDLDLIRITDAYNPIIDGQFLLVLPPRPTTVTVVGAVAQTSEQEWQSRASSKNYLKQ 192
Query 177 HPRLAGADKNNVMVITPEGETVVAPVALWNKRHVEPPPGSQLWLGFSAHVLPEKYADLND 236
L A+ + V +I P+G + P+A WN + + PG+ L++ FSA L + Y+ LN+
Sbjct 193 AGLLDNAENSFVWIIQPDGNAIRQPIAYWNHQAQDIAPGAVLFVEFSA--LFDGYSTLNN 250
Query 237 QIVSVLTQR 245
I+ +L R
Sbjct 251 NIIELLKNR 259
>ref|YP_001051215.1| hypothetical protein Sbal_2862 [Shewanella baltica OS155]
gb|ABN62346.1| protein of unknown function DUF1017 [Shewanella baltica OS155]
gb|AEH14691.1| protein of unknown function DUF1017 [Shewanella baltica OS117]
Length=261
Score = 105 bits (261), Expect = 9e-21, Method: Compositional matrix adjust.
Identities = 62/189 (32%), Positives = 101/189 (53%), Gaps = 3/189 (1%)
Query 58 LWWPGALLTDSAAKAKALKDYQHVMAQLASWEAEADDDV-AATIKSVRQQLLNLNITGRL 116
++W GA L D A Q V++QLA DD A + + Q + +L + R+
Sbjct 73 IYWLGAALLDIQNTAALETKRQQVLSQLAKMGQVKDDSAYIAKLAKLAQLIRSLQLGQRV 132
Query 117 PVKLDPDFVRVDENSNPPLVGDYTLYTVQRPVTITLLGAVSGAGQLPWQAGRSVTDYLQD 176
LD D +R++++ N + G + L RP T+T+LGAV+ G L WQ ++ DYL+
Sbjct 133 MQPLDIDLIRINDSYNSLIDGRFLLVLPPRPSTVTVLGAVAQTGDLAWQGQKTSKDYLKQ 192
Query 177 HPRLAGADKNNVMVITPEGETVVAPVALWNKRHVEPPPGSQLWLGFSAHVLPEKYADLND 236
L A+ + V +I P+G+ + P+A WN + + PG+ L++ FS+ L + Y LN+
Sbjct 193 AGLLDNAETSFVWIIQPDGKAIKQPIAYWNHQEQDIAPGASLYVEFSS--LFDDYTQLNE 250
Query 237 QIVSVLTQR 245
IV +L R
Sbjct 251 NIVELLRNR 259
>ref|YP_003332847.1| hypothetical protein Dd586_1258 [Dickeya dadantii Ech586]
gb|ACZ76142.1| protein of unknown function DUF1017 [Dickeya dadantii Ech586]
Length=259
Score = 105 bits (261), Expect = 9e-21, Method: Compositional matrix adjust.
Identities = 63/214 (29%), Positives = 101/214 (47%), Gaps = 13/214 (6%)
Query 47 QLVTQPQLRDRLWWPGALLTDSAAKAKALKDYQHVMAQLA----SWEAEADDDVAATIKS 102
Q ++ D + W ALLT+ + +A + + + A+L +W+ + D D A
Sbjct 45 QFYSRITFADNVNWQTALLTNESVTQQAKEAGEKLQARLYQLQLAWQIDGDGDWAIAAWY 104
Query 103 VRQQLLNLNITGRLPVKLDPDFVRVDENSNPPLVGDYTLYTVQRPVTITLLGAVSGA--- 159
+ Q L +N GR+ + PD +R+ N PLVG+YTLY L G +S
Sbjct 105 MAQALKQVNAVGRIRASIAPDIIRLSPRKNRPLVGNYTLYLSPYRQQFFLFGLISTGIDI 164
Query 160 ------GQLPWQAGRSVTDYLQDHPRLAGADKNNVMVITPEGETVVAPVALWNKRHVEPP 213
+ + G SV Y+ L G D +I+ +G P+A+WN++H EP
Sbjct 165 GTPNVFKDIDLKPGWSVEQYIGRRRFLPGGDNREGYLISGDGHWRKVPLAIWNRQHHEPA 224
Query 214 PGSQLWLGFSAHVLPEKYADLNDQIVSVLTQRVP 247
G L++GF +LP+ ++ LN QI L R+P
Sbjct 225 AGETLFIGFDPSILPDGFSSLNAQIADYLANRIP 258
>gb|EDA46671.1| hypothetical protein GOS_1996150 [marine metagenome]
Length=292
Score = 104 bits (260), Expect = 1e-20, Method: Compositional matrix adjust.
Identities = 61/189 (32%), Positives = 102/189 (53%), Gaps = 3/189 (1%)
Query 58 LWWPGALLTDSAAKAKALKDYQHVMAQLASWEAEADDD-VAATIKSVRQQLLNLNITGRL 116
++W GA L D+ A Q ++ QLA+ + D+ A + Q L N+ + R+
Sbjct 104 VYWLGAALLDTKNTATLELIRQQILQQLANMGQQTDNSQYIAKLSKFAQFLRNIKLGQRV 163
Query 117 PVKLDPDFVRVDENSNPPLVGDYTLYTVQRPVTITLLGAVSGAGQLPWQAGRSVTDYLQD 176
LD D +R+ + NP + G + L RP TIT++GAV+ G+ W + S DYL+
Sbjct 164 NQPLDLDLIRITDAYNPIMDGQFLLVLPPRPTTITVVGAVAQTGEQKWVSRTSSKDYLKQ 223
Query 177 HPRLAGADKNNVMVITPEGETVVAPVALWNKRHVEPPPGSQLWLGFSAHVLPEKYADLND 236
L A+ + V +I P+G+T+ P+A WN + ++ PG+ L++ FS L + Y+ LN+
Sbjct 224 AGLLENAENSFVWIIQPDGKTIRQPIAYWNHQSMDIAPGAILFVEFSG--LFDDYSTLNN 281
Query 237 QIVSVLTQR 245
I+ +L R
Sbjct 282 NIIELLKNR 290
>ref|YP_869034.1| hypothetical protein Shewana3_1394 [Shewanella sp. ANA-3]
gb|ABK47628.1| protein of unknown function DUF1017 [Shewanella sp. ANA-3]
Length=262
Score = 104 bits (259), Expect = 1e-20, Method: Compositional matrix adjust.
Identities = 62/190 (32%), Positives = 101/190 (53%), Gaps = 4/190 (2%)
Query 58 LWWPGALLTDSAAKAKALKDYQHVMAQLASWEAEADDD--VAATIKSVRQQLLNLNITGR 115
++W GA L D A Q V+ +LA+ +AD++ A + Q L N+ + R
Sbjct 73 IYWLGAALLDIHNTAALETTRQQVLQKLANMGQQADNNSQYIAKLSKFAQFLRNIKLGQR 132
Query 116 LPVKLDPDFVRVDENSNPPLVGDYTLYTVQRPVTITLLGAVSGAGQLPWQAGRSVTDYLQ 175
+ LD D +R+ + NP + G + L RP T+T++GAVS G+ WQ+ S +YLQ
Sbjct 133 VNQPLDLDLIRITDAYNPIIDGQFLLVLPPRPSTVTVVGAVSQTGEQAWQSQTSSREYLQ 192
Query 176 DHPRLAGADKNNVMVITPEGETVVAPVALWNKRHVEPPPGSQLWLGFSAHVLPEKYADLN 235
L A+ + V +I P+G + P+A WN + + PG+ L++ FS L + Y+ LN
Sbjct 193 QAGLLENAENSFVWIIQPDGNAIRQPIAYWNHQAQDIAPGATLFVEFSG--LFDDYSTLN 250
Query 236 DQIVSVLTQR 245
+ I+ +L R
Sbjct 251 NNIIELLKNR 260
>gb|EDA79313.1| hypothetical protein GOS_1936285 [marine metagenome]
Length=293
Score = 103 bits (257), Expect = 2e-20, Method: Compositional matrix adjust.
Identities = 61/189 (32%), Positives = 102/189 (53%), Gaps = 3/189 (1%)
Query 58 LWWPGALLTDSAAKAKALKDYQHVMAQLASWEAEADDD-VAATIKSVRQQLLNLNITGRL 116
++W GA L D+ A Q ++ +LAS +AD+ A + Q L N+ + R+
Sbjct 105 VYWLGAALLDTKNTATLELIRQQILQRLASLGQQADNSQYIAKLSKFAQFLRNIKLGQRV 164
Query 117 PVKLDPDFVRVDENSNPPLVGDYTLYTVQRPVTITLLGAVSGAGQLPWQAGRSVTDYLQD 176
LD D +R+ + NP + G + L RP TIT++GAV+ G+ W + S DYL+
Sbjct 165 NQPLDLDLIRITDAYNPIIDGQFLLVLPPRPTTITVVGAVAQTGEQKWVSRTSSKDYLKQ 224
Query 177 HPRLAGADKNNVMVITPEGETVVAPVALWNKRHVEPPPGSQLWLGFSAHVLPEKYADLND 236
L A+ + V +I P+G+ + P+A WN + ++ PG+ L++ FS L + Y+ LN+
Sbjct 225 AGLLENAENSFVWIIQPDGKAIRQPIAYWNHQSMDIAPGAILFVEFSG--LFDDYSTLNN 282
Query 237 QIVSVLTQR 245
I+ +L R
Sbjct 283 NIIELLKNR 291
>ref|YP_001555433.1| hypothetical protein Sbal195_3008 [Shewanella baltica OS195]
gb|ABX50173.1| protein of unknown function DUF1017 [Shewanella baltica OS195]
gb|ADT95167.1| protein of unknown function DUF1017 [Shewanella baltica OS678]
Length=261
Score = 103 bits (257), Expect = 3e-20, Method: Compositional matrix adjust.
Identities = 62/189 (32%), Positives = 101/189 (53%), Gaps = 3/189 (1%)
Query 58 LWWPGALLTDSAAKAKALKDYQHVMAQLASWEAEADDDV-AATIKSVRQQLLNLNITGRL 116
++W GA L D KA Q V++QLA DD A + + Q + +L + R+
Sbjct 73 IYWLGAALLDIQNKAALETKRQQVLSQLAKMGQVKDDSAYIAKLAKLAQLIRSLQLGQRV 132
Query 117 PVKLDPDFVRVDENSNPPLVGDYTLYTVQRPVTITLLGAVSGAGQLPWQAGRSVTDYLQD 176
LD D +R++++ N + G + L RP T+T+LGAV + L WQ ++ DYL+
Sbjct 133 MQPLDIDLIRINDSYNSLIDGRFLLVLPPRPSTVTVLGAVEQSRDLAWQGQKTSKDYLKQ 192
Query 177 HPRLAGADKNNVMVITPEGETVVAPVALWNKRHVEPPPGSQLWLGFSAHVLPEKYADLND 236
L A+ + V +I P+G+ + P+A WN + + PG+ L++ FS+ L + Y LN+
Sbjct 193 AGLLDNAETSFVWIIQPDGKAIKQPIAYWNHQTKDIAPGATLYVEFSS--LFDDYTKLNE 250
Query 237 QIVSVLTQR 245
IV +L R
Sbjct 251 NIVELLRNR 259
>ref|ZP_07391524.1| protein of unknown function DUF1017 [Shewanella baltica OS183]
gb|EFM15713.1| protein of unknown function DUF1017 [Shewanella baltica OS183]
gb|AEG10767.1| protein of unknown function DUF1017 [Shewanella baltica BA175]
Length=261
Score = 103 bits (256), Expect = 3e-20, Method: Compositional matrix adjust.
Identities = 61/189 (32%), Positives = 99/189 (52%), Gaps = 3/189 (1%)
Query 58 LWWPGALLTDSAAKAKALKDYQHVMAQLASWEAEADDDV-AATIKSVRQQLLNLNITGRL 116
++W GA L D A + +++QLA ADD + A + + Q L + + R+
Sbjct 73 IYWLGAALLDIQNTAALEAKRKEILSQLAQMGQAADDSIYTAKLAKLAQFLRQIKLGQRV 132
Query 117 PVKLDPDFVRVDENSNPPLVGDYTLYTVQRPVTITLLGAVSGAGQLPWQAGRSVTDYLQD 176
L D +R++++ NP + G + L QRP T+ ++GAV+ AG W+ S DYL
Sbjct 133 MQPLHLDLIRINDSYNPLVDGRFELILPQRPTTVLVMGAVAKAGSFEWKVNASSKDYLAK 192
Query 177 HPRLAGADKNNVMVITPEGETVVAPVALWNKRHVEPPPGSQLWLGFSAHVLPEKYADLND 236
L AD + V +I P+G+ + P+A WN + + PG+ L++ FS L E Y+ LN
Sbjct 193 AMPLENADNSFVWIIQPDGKALKQPIAYWNAQVQDIAPGAVLYVEFSD--LVEDYSTLNA 250
Query 237 QIVSVLTQR 245
I+ +L R
Sbjct 251 NIIELLRNR 259
>ref|ZP_08567183.1| YjbG polysaccharide synthesis protein [Shewanella sp. HN-41]
gb|EGM69329.1| YjbG polysaccharide synthesis protein [Shewanella sp. HN-41]
Length=261
Score = 103 bits (256), Expect = 3e-20, Method: Compositional matrix adjust.
Identities = 61/189 (32%), Positives = 96/189 (50%), Gaps = 3/189 (1%)
Query 58 LWWPGALLTDSAAKAKALKDYQHVMAQLASW-EAEADDDVAATIKSVRQQLLNLNITGRL 116
++W GA L D A Q V++QL EA D A + + Q + + + R+
Sbjct 73 IYWLGAALLDIKNTASLETKRQQVLSQLIQMGEATDDSHYIAQLAKLAQFIRKIKLGQRV 132
Query 117 PVKLDPDFVRVDENSNPPLVGDYTLYTVQRPVTITLLGAVSGAGQLPWQAGRSVTDYLQD 176
LD D +R++++ N L G + L RP T+TLLGAV G WQA +YL+
Sbjct 133 IQPLDIDLIRINQSFNAVLDGRFLLVLPPRPTTVTLLGAVEQMGSYEWQANIDSKEYLKQ 192
Query 177 HPRLAGADKNNVMVITPEGETVVAPVALWNKRHVEPPPGSQLWLGFSAHVLPEKYADLND 236
L A+ + + +I P+G+ + P+A WN + + PG+ L++ FS+ L + Y LN
Sbjct 193 AGLLGNAETSTIWIIQPDGKAIKQPIAYWNHQEQDIAPGASLYVEFSS--LFDDYTQLNK 250
Query 237 QIVSVLTQR 245
IV +L R
Sbjct 251 NIVELLRNR 259
>gb|EDA62109.1| hypothetical protein GOS_1968443 [marine metagenome]
Length=268
Score = 102 bits (254), Expect = 5e-20, Method: Compositional matrix adjust.
Identities = 61/189 (32%), Positives = 100/189 (52%), Gaps = 3/189 (1%)
Query 58 LWWPGALLTDSAAKAKALKDYQHVMAQLASWEAEADD-DVAATIKSVRQQLLNLNITGRL 116
++W GA L D A Q V+ QLA+ + D+ + A + + Q L + + R+
Sbjct 80 IYWLGASLLDIKNTAALEATRQQVLQQLATMGEQIDNGNYIAKLSKLAQFLRTIKLGQRI 139
Query 117 PVKLDPDFVRVDENSNPPLVGDYTLYTVQRPVTITLLGAVSGAGQLPWQAGRSVTDYLQD 176
LD D +R+ + NP + GD+ L RP T+T++GAV+ G+ PWQ+ S DY+
Sbjct 140 NQPLDLDLIRITDAYNPVIDGDFLLVLPPRPTTVTVVGAVAQTGEQPWQSRASSKDYINQ 199
Query 177 HPRLAGADKNNVMVITPEGETVVAPVALWNKRHVEPPPGSQLWLGFSAHVLPEKYADLND 236
L A+ + V +I P+G + P+A WN + + PG+ L++ FS L +A LN+
Sbjct 200 AGLLDNAENSFVWIIQPDGNAIKQPIAYWNYQAQDIAPGAILFVEFSE--LFADHAKLNN 257
Query 237 QIVSVLTQR 245
I+ +L R
Sbjct 258 NIIELLKNR 266
>ref|YP_857379.1| putative periplasmic protein [Aeromonas hydrophila subsp. hydrophila
ATCC 7966]
gb|ABK37777.1| putative periplasmic protein [Aeromonas hydrophila subsp. hydrophila
ATCC 7966]
Length=252
Score = 101 bits (251), Expect = 1e-19, Method: Compositional matrix adjust.
Identities = 76/259 (29%), Positives = 130/259 (50%), Gaps = 23/259 (8%)
Query 3 KLQSYFIASVLYVMTPHAFAQGTVTIYLPGEQQTLSVGPVENVV---QLVTQPQLRD--- 56
K+ + F+ S+L +T A+A T++I+ G+ PV+++ Q+ L D
Sbjct 2 KIINIFLISIL--VTNPAWADATISIWWKGK-------PVKDLYYAKQITLSSALSDPAI 52
Query 57 ---RLWWPGALLTDSAAKAKALKDYQHVMAQLA----SWEAEADDDVAATIKSVRQQLLN 109
+WP ++ A + + Q ++A L W + +A+ + QQL
Sbjct 53 LSYDSYWPVGQISTPARQQELEHQRQVLLADLTVLSKMWSDMGEPSLASATLQLLQQLQQ 112
Query 110 LNITGRLPVKLDPDFVRVDENSNPPLV-GDYTLYTVQRPVTITLLGAVSGAGQLPWQAGR 168
L +TGR V +DPD + + P++ G Y LY R + + G V G P G
Sbjct 113 LELTGRFDVSVDPDVNQARAGVDAPILKGHYQLYLAPRHPEVQIAGLVKQIGGAPLLPGA 172
Query 169 SVTDYLQDHPRLAGADKNNVMVITPEGETVVAPVALWNKRHVEPPPGSQLWLGFSAHVLP 228
+ +Y + L G D V +++P G+ PVA+WN+RHVE PG+ L++GFS VLP
Sbjct 173 GLREYWKRKDILEGGDPAGVYLVSPSGKYDWFPVAIWNERHVEAMPGATLFVGFSPDVLP 232
Query 229 EKYADLNDQIVSVLTQRVP 247
++Y +LN++I+++ R+P
Sbjct 233 KQYQNLNERILTLFANRMP 251
>ref|ZP_08570204.1| SLBB-domain like (DUF1017) [Rheinheimera sp. A13L]
gb|EGM78158.1| SLBB-domain like (DUF1017) [Rheinheimera sp. A13L]
Length=253
Score = 98.2 bits (243), Expect = 1e-18, Method: Compositional matrix adjust.
Identities = 74/255 (29%), Positives = 119/255 (46%), Gaps = 14/255 (5%)
Query 1 MNKLQSYFIASVLYVMTPHAFAQGTVTIYLPGEQQTLSVGPVENVV--QLVTQPQLRDRL 58
MN+L + A + + A V + G Q P +V QL T P L
Sbjct 1 MNRLSCFLFALAIVFANTVSSATPVVLVEHQGNYQGFFDRPRLGLVVSQLNTSPSL---- 56
Query 59 WWPGALL--TDSAAKAKALKDYQHVMAQLA----SWEAEADDDVAATIKSVRQQLLNLNI 112
+WP A L D K K + + ++ QLA ++ ++D +AA+++ + + + + +
Sbjct 57 YWPAAKLFKVDVETKLKLEQQRKELLNQLALLKHEFQQDSDSGMAASVEKLEKDISSWEL 116
Query 113 TGRLPVKLDPDFVRVDENSNPPL-VGDYTLYTVQRPVTITLLGAVSGAGQLPWQAGRSVT 171
G + + LDPD VR ++ NP L G Y L RP + + G V P + SV
Sbjct 117 AGNMNLALDPDRVRAKKSLNPLLSAGQYKLVVGARPTELQIEGLVD-EQMTPLRNAVSVD 175
Query 172 DYLQDHPRLAGADKNNVMVITPEGETVVAPVALWNKRHVEPPPGSQLWLGFSAHVLPEKY 231
YL L G + V +I G+ +A LWNK H E PG+ L++ F LP+ +
Sbjct 176 SYLDTISILDGGSSSFVYIIPASGKISIAKTGLWNKHHQEVLPGTVLFIPFEQRHLPDVF 235
Query 232 ADLNDQIVSVLTQRV 246
+ +N+QIV +L +V
Sbjct 236 SHINEQIVELLLHKV 250
>ref|NP_718714.1| polysaccharide synthesis-related protein [Shewanella oneidensis
MR-1]
gb|AAN56158.1|AE015753_4 polysaccharide synthesis-related protein [Shewanella oneidensis
MR-1]
Length=261
Score = 97.8 bits (242), Expect = 1e-18, Method: Compositional matrix adjust.
Identities = 58/189 (30%), Positives = 99/189 (52%), Gaps = 3/189 (1%)
Query 58 LWWPGALLTDSAAKAKALKDYQHVMAQLASWEAEADDD-VAATIKSVRQQLLNLNITGRL 116
++W GA L D+ A Q V+ QLA + D+ A + Q L N+ + R+
Sbjct 73 IYWLGAALLDTKNTAVLEVTRQQVLQQLADMGEQVDNSQYIAKLAKFAQFLRNIKLGQRV 132
Query 117 PVKLDPDFVRVDENSNPPLVGDYTLYTVQRPVTITLLGAVSGAGQLPWQAGRSVTDYLQD 176
LD D +R+ + NP + GD+ L RP T++++GAV+ G+ W + S DY+
Sbjct 133 NQPLDLDLIRITDAYNPVIDGDFLLVLPPRPTTVSVVGAVAQTGEQTWLSQASSKDYINQ 192
Query 177 HPRLAGADKNNVMVITPEGETVVAPVALWNKRHVEPPPGSQLWLGFSAHVLPEKYADLND 236
L A+ + V +I P+G + P+A WN + + PG+ L++ F+A L + + +LN+
Sbjct 193 AGLLDNAENSFVWIIQPDGNAIKQPIAYWNHQAQDIAPGAILFVEFTA--LFDDHTELNN 250
Query 237 QIVSVLTQR 245
I+ +L R
Sbjct 251 NIIELLKNR 259
>ref|YP_001367076.1| hypothetical protein Shew185_2879 [Shewanella baltica OS185]
gb|ABS09013.1| protein of unknown function DUF1017 [Shewanella baltica OS185]
Length=261
Score = 94.0 bits (232), Expect = 2e-17, Method: Compositional matrix adjust.
Identities = 60/189 (31%), Positives = 93/189 (49%), Gaps = 3/189 (1%)
Query 58 LWWPGALLTDSAAKAKALKDYQHVMAQLAS-WEAEADDDVAATIKSVRQQLLNLNITGRL 116
++W GA L D A + +++LA EA D A + + Q L + + R+
Sbjct 73 IYWLGAALLDIQNTAVLENKRKEALSELAKVGEASNDSIYIAKLAKLAQFLRQIKLGKRV 132
Query 117 PVKLDPDFVRVDENSNPPLVGDYTLYTVQRPVTITLLGAVSGAGQLPWQAGRSVTDYLQD 176
LD D +R+ ++ NP L G + L RP TI + GAV+ G W+ S DYL
Sbjct 133 MQPLDLDLIRITDSYNPLLDGRFELVLPPRPTTIVVAGAVARTGSYEWKLNTSSKDYLAK 192
Query 177 HPRLAGADKNNVMVITPEGETVVAPVALWNKRHVEPPPGSQLWLGFSAHVLPEKYADLND 236
L +D + V +I P+G + P+A WN + + PG+ ++L FS L E Y+ LN
Sbjct 193 AQPLENSDGDFVWIIQPDGNAIKQPIAYWNAQAQDIAPGAVIYLEFSD--LLEDYSTLNA 250
Query 237 QIVSVLTQR 245
I+ +L R
Sbjct 251 NIIELLRNR 259
>gb|EGK28123.1| hypothetical protein SFK272_1496 [Shigella flexneri K-272]
gb|EGK39941.1| hypothetical protein SFK227_0664 [Shigella flexneri K-227]
Length=55
Score = 93.6 bits (231), Expect = 3e-17, Method: Compositional matrix adjust.
Identities = 44/46 (95%), Positives = 46/46 (100%), Gaps = 0/46 (0%)
Query 48 LVTQPQLRDRLWWPGALLTDSAAKAKALKDYQHVMAQLASWEAEAD 93
+VTQPQLRDRLWWPGALLTDSAAKAKALKDYQHVMAQLASWEAEA+
Sbjct 1 MVTQPQLRDRLWWPGALLTDSAAKAKALKDYQHVMAQLASWEAEAE 46
>ref|YP_002357425.1| hypothetical protein Sbal223_1497 [Shewanella baltica OS223]
gb|ACK46002.1| protein of unknown function DUF1017 [Shewanella baltica OS223]
Length=261
Score = 93.6 bits (231), Expect = 3e-17, Method: Compositional matrix adjust.
Identities = 60/189 (31%), Positives = 93/189 (49%), Gaps = 3/189 (1%)
Query 58 LWWPGALLTDSAAKAKALKDYQHVMAQLAS-WEAEADDDVAATIKSVRQQLLNLNITGRL 116
++W GA L D A + +++LA EA D A + + Q L + + R+
Sbjct 73 IYWLGAALLDIQNTAVLENKRKEALSELAKVGEASNDSIYIAKLAKLVQFLRQIKLGKRV 132
Query 117 PVKLDPDFVRVDENSNPPLVGDYTLYTVQRPVTITLLGAVSGAGQLPWQAGRSVTDYLQD 176
LD D +R+ ++ NP L G + L RP TI + GAV+ G W+ S DYL
Sbjct 133 MQPLDLDLIRITDSYNPLLDGRFELVLPPRPTTIVVAGAVARTGSYEWKLNTSSKDYLAK 192
Query 177 HPRLAGADKNNVMVITPEGETVVAPVALWNKRHVEPPPGSQLWLGFSAHVLPEKYADLND 236
L +D + V +I P+G + P+A WN + + PG+ ++L FS L E Y+ LN
Sbjct 193 AQPLENSDGDFVWIIQPDGNAIKQPIAYWNAQAQDIAPGAVIYLEFSD--LLEDYSTLNA 250
Query 237 QIVSVLTQR 245
I+ +L R
Sbjct 251 NIIELLRNR 259
>ref|YP_455825.1| hypothetical protein SG2145 [Sodalis glossinidius str. 'morsitans']
dbj|BAE75420.1| hypothetical protein [Sodalis glossinidius str. 'morsitans']
Length=164
Score = 91.7 bits (226), Expect = 1e-16, Method: Compositional matrix adjust.
Identities = 44/103 (42%), Positives = 65/103 (63%), Gaps = 0/103 (0%)
Query 86 ASWEAEADDDVAATIKSVRQQLLNLNITGRLPVKLDPDFVRVDENSNPPLVGDYTLYTVQ 145
A + D ++A + VR QL + ITGR V LDPD++R+ +N L G+Y++YT+
Sbjct 62 AELRGDNDGELADLVDRVRAQLAAMRITGRQFVPLDPDWIRLRSEANRRLSGEYSVYTLS 121
Query 146 RPVTITLLGAVSGAGQLPWQAGRSVTDYLQDHPRLAGADKNNV 188
RP +I ++G + AG P+Q GR V +YLQ H R +GA+KN V
Sbjct 122 RPTSIQVVGVTTPAGPQPYQPGRDVAEYLQTHQRFSGAEKNVV 164
>ref|YP_004433358.1| hypothetical protein Glaag_1129 [Glaciecola agarilytica 4H-3-7+YE-5]
gb|AEE22090.1| protein of unknown function DUF1017 [Glaciecola sp. 4H-3-7+YE-5]
Length=265
Score = 90.5 bits (223), Expect = 2e-16, Method: Compositional matrix adjust.
Identities = 61/195 (31%), Positives = 101/195 (51%), Gaps = 6/195 (3%)
Query 58 LWWPGALL--TDSAAKAKALKDYQHVMAQLASWEAEADDDVAATIKSVRQ--QLLNL-NI 112
++WP A L T + A Q ++ +L + +D A + ++ Q Q++N +
Sbjct 70 IYWPAAALYETQESKVAPLHAQRQRLIEKLTTLHQRFANDDRALLSAIDQLTQVVNSWQL 129
Query 113 TGRLPVKLDPDFVRVDENSNPPLV-GDYTLYTVQRPVTITLLGAVSGAGQLPWQAGRSVT 171
R P+K+D D R+ NP L GDYTL R I + GAV+ + QA + V+
Sbjct 130 GKRSPIKIDLDLARIQPPKNPLLTEGDYTLSAKPRSNKIFITGAVNQTQVVAHQAYQDVS 189
Query 172 DYLQDHPRLAGADKNNVMVITPEGETVVAPVALWNKRHVEPPPGSQLWLGFSAHVLPEKY 231
Y+ R+ A+++ V VI +G + AP A WNK+H E PGS L++ F+ + +
Sbjct 190 HYVPASARIDKANQDYVYVIQADGRVIFAPTAYWNKQHQEVMPGSLLFVPFNTSLFHPEL 249
Query 232 ADLNDQIVSVLTQRV 246
A++ND +VS+ R+
Sbjct 250 AEVNDLVVSLAKNRL 264
>ref|YP_203541.1| hypothetical protein VF_0158 [Vibrio fischeri ES114]
gb|AAW84653.1| hypothetical protein VF_0158 [Vibrio fischeri ES114]
Length=253
Score = 88.6 bits (218), Expect = 7e-16, Method: Compositional matrix adjust.
Identities = 61/249 (24%), Positives = 113/249 (45%), Gaps = 11/249 (4%)
Query 6 SYFIASVLYVMTPHAFAQGT--VTIYLPGEQQTLSVGPVENVVQLVTQPQLRDRLWWPGA 63
S F+ S+L + +A T V++ LP + L+ + Q++ + + GA
Sbjct 5 SSFVFSLLLSASTVTYASSTQAVSVTLPNQNLVLNYSQPVRLEQVILDANAQVNFYSLGA 64
Query 64 LLTDSAAKAKALKDYQHVMAQLASWEAE-----ADDDVAATIKSVRQQLLNLNITGRLPV 118
L+D+ + + + QL+S E A+++ + + QL + GR+
Sbjct 65 ALSDNQLQKDIDNLRNNSIEQLSSLSRETSLFSANNEFKRSATQIISQLEHHTFVGRIFS 124
Query 119 KLDPDFVRVDENSNPPLVGDYTLYTVQRPVTITLLGAVSGAGQL--PWQAGRSVTDYLQD 176
LD D +R++E NP L DY L+ RP +++ GA+ + P+ S+ DYL
Sbjct 125 SLDLDLIRINEKLNPILNADYQLFVHPRPTSVSFFGAIDSESSISVPFIEHASIDDYLDS 184
Query 177 HPRLAGADKNNVMVITPEGETVVAPVALWNKRHVEPPPGSQLWLGFSAHVLPEKYADLND 236
P + AD + + VI P+G ++W + PG+ +++ LP ++ LN
Sbjct 185 LPLSSTADTSTIYVIQPDGVVQTTEFSVWQNKPAYLAPGASVYIPLGG--LPSDFSSLNT 242
Query 237 QIVSVLTQR 245
IV +L +
Sbjct 243 SIVQLLRNK 251
>ref|ZP_01218708.1| hypothetical polysaccharide synthesis-related protein [Photobacterium
profundum 3TCK]
gb|EAS44622.1| hypothetical polysaccharide synthesis-related protein [Photobacterium
profundum 3TCK]
Length=289
Score = 88.6 bits (218), Expect = 8e-16, Method: Compositional matrix adjust.
Identities = 63/194 (32%), Positives = 99/194 (51%), Gaps = 16/194 (8%)
Query 57 RLWWPGALLTDSAAKAKALKDYQHV----MAQLAS-WEAEADDDVAATIKSVRQQLLNLN 111
R++W GA L + + QHV + QLA+ W++E V ++ QQL L
Sbjct 105 RIYWTGAALFHAFPHPQ-----QHVVVAQLNQLATHWQSEQQQAVL----NLSQQLAQLM 155
Query 112 ITGRLPVKLDPDFVRVDENSNPPLVGDYTLYTVQRPVTITLLGAVSGAGQLPWQAGRSVT 171
R+ LD D VR+++ SN + + TL RP I +LGA+ WQ
Sbjct 156 TGERIFTSLDYDNVRLNKQSNTLITQNLTLILPPRPERILVLGALEKPVWTKWQTRLDAE 215
Query 172 DYLQDHPRLAGADKNNVMVITPEGETVVAPVALWNKRHVEPPPGSQLWLGFSAHVLPEKY 231
YL+ L+ A+K++ VI P+G P A WN+ H + PG+ ++LGFS+ LP+ +
Sbjct 216 AYLKQSKPLSNANKSDAWVIQPDGTVEQHPTAYWNRDHHDIAPGAIVYLGFSS--LPDGF 273
Query 232 ADLNDQIVSVLTQR 245
LN+ I+++L R
Sbjct 274 ETLNEDIINLLRNR 287
>ref|YP_130871.1| polysaccharide synthesis-like protein [Photobacterium profundum
SS9]
emb|CAG21069.1| hypothetical polysaccharide synthesis-related protein [Photobacterium
profundum SS9]
Length=293
Score = 87.0 bits (214), Expect = 2e-15, Method: Compositional matrix adjust.
Identities = 54/189 (28%), Positives = 89/189 (47%), Gaps = 2/189 (1%)
Query 57 RLWWPGALLTDSAAKAKALKDYQHVMAQLASWEAEADDDVAATIKSVRQQLLNLNITGRL 116
R++W GA L + + + W E ++ QQ+ L R+
Sbjct 105 RIYWTGAALFQAFPHPQQQTVVDQINQLATYWHNEQQQPQQQAALNLSQQIEQLTTGERI 164
Query 117 PVKLDPDFVRVDENSNPPLVGDYTLYTVQRPVTITLLGAVSGAGQLPWQAGRSVTDYLQD 176
LD D +R+++ +N + D TL RP I +LGA+ WQ YL+
Sbjct 165 FTSLDYDDIRLNKQANTLITDDLTLILPPRPERILVLGALDKPIWAEWQTRLDAEAYLKQ 224
Query 177 HPRLAGADKNNVMVITPEGETVVAPVALWNKRHVEPPPGSQLWLGFSAHVLPEKYADLND 236
L+ A+ +N VI P+G P+A WN+ H + PG+ ++LGFS+ LP+ + LN+
Sbjct 225 AKSLSNANNSNAWVIQPDGTVEQHPIAYWNRDHHDIAPGAIVYLGFSS--LPKGFETLNE 282
Query 237 QIVSVLTQR 245
I+++L R
Sbjct 283 DIINLLRNR 291
>ref|YP_002154921.1| hypothetical protein VFMJ11_0150 [Vibrio fischeri MJ11]
gb|ACH64877.1| conserved hypothetical protein [Vibrio fischeri MJ11]
Length=253
Score = 86.7 bits (213), Expect = 3e-15, Method: Compositional matrix adjust.
Identities = 60/249 (24%), Positives = 113/249 (45%), Gaps = 11/249 (4%)
Query 6 SYFIASVLYVMTPHAFAQGT--VTIYLPGEQQTLSVGPVENVVQLVTQPQLRDRLWWPGA 63
S F+ S+L + +A T V++ LP + L+ + Q++ + + GA
Sbjct 5 SSFVFSLLLSASTVTYASSTQAVSVTLPNQNLVLNYSQPVRLEQVILDANAQMNFYSLGA 64
Query 64 LLTDSAAKAKALKDYQHVMAQLASWEAE-----ADDDVAATIKSVRQQLLNLNITGRLPV 118
+L+D+ + + + QL+S E A+++ + + QL + GR+
Sbjct 65 VLSDNQLQKDIDNLRNNSIEQLSSLSRETSLFSANNEFKRSATQIISQLEHHTFVGRIFS 124
Query 119 KLDPDFVRVDENSNPPLVGDYTLYTVQRPVTITLLGAVSGAGQL--PWQAGRSVTDYLQD 176
LD D +R++E NP L DY L+ RP +++ GA+ + P+ + DYL
Sbjct 125 PLDLDLIRINEKLNPILNADYQLFVHPRPTSVSFFGAIDSESSISVPFIEHAGIDDYLGS 184
Query 177 HPRLAGADKNNVMVITPEGETVVAPVALWNKRHVEPPPGSQLWLGFSAHVLPEKYADLND 236
P + AD + + VI P+G ++W + PG+ +++ LP ++ LN
Sbjct 185 LPLSSTADTSTIYVIQPDGVVQTTEFSVWQNKPAYLAPGASVYIPLGG--LPSDFSSLNT 242
Query 237 QIVSVLTQR 245
IV +L +
Sbjct 243 SIVQLLRNK 251
>ref|YP_662763.1| hypothetical protein Patl_3203 [Pseudoalteromonas atlantica T6c]
gb|ABG41709.1| protein of unknown function DUF1017 [Pseudoalteromonas atlantica
T6c]
Length=247
Score = 81.3 bits (199), Expect = 1e-13, Method: Compositional matrix adjust.
Identities = 55/210 (26%), Positives = 102/210 (48%), Gaps = 19/210 (9%)
Query 48 LVTQPQLRDRLWWPGALL------TDSAAKAKALKDYQHVMAQLASWEAEADDDVAATIK 101
++++ L ++WP A L T A+ +L + + + L S E K
Sbjct 44 VLSKLDLNTNIYWPSAALFIPNDVTLERARRSSLSNLNILASHLPSDTHEQ--------K 95
Query 102 SVRQQLLN----LNITGRLPVKLDPDFVRVDENSNPPLVGDYTLYTV-QRPVTITLLGAV 156
SV L+N ++ RL VK+D D R+ E NP Y + ++ +R ++ ++GAV
Sbjct 96 SVFTNLINELEHWHLANRLSVKIDYDLARISEAHNPQFDNGYYVVSLHERQESVEIIGAV 155
Query 157 SGAGQLPWQAGRSVTDYLQDHPRLAGADKNNVMVITPEGETVVAPVALWNKRHVEPPPGS 216
+ + V++YL + A+++ V++I +G + AP A WN++H E PGS
Sbjct 156 TNTVETKHLPHTDVSEYLNSANIASFANRDTVIIIQADGRIIEAPTAYWNRQHQEVMPGS 215
Query 217 QLWLGFSAHVLPEKYADLNDQIVSVLTQRV 246
+++ F + +Y +LN IV++ R+
Sbjct 216 IIYVPFKESLFTPQYKELNQLIVTLAKNRL 245
>ref|ZP_06054113.1| hypothetical polysaccharide synthesis-related protein [Grimontia
hollisae CIP 101886]
gb|EEY71428.1| hypothetical polysaccharide synthesis-related protein [Grimontia
hollisae CIP 101886]
Length=271
Score = 80.9 bits (198), Expect = 2e-13, Method: Compositional matrix adjust.
Identities = 57/189 (30%), Positives = 87/189 (46%), Gaps = 4/189 (2%)
Query 60 WPGALLTDSAAKAKALKDYQHVMAQLASWEAEADDD--VAATIKSVRQQLLNLNITGRLP 117
W GA L + + + V L + AE DD A + ++ + N RLP
Sbjct 84 WQGAALFNHVLSPETAQLLDSVKQDLKTLRAEWADDPTYANAVDALIDYVENATFRERLP 143
Query 118 VKLDPDFVRVDENSNPPLVGDYTLYTVQRPVTITLLGAVSGAGQLPWQAGRSVTDYLQDH 177
+ LD D ++NP + G TL R ++++GAV+ LP+ A S +YL
Sbjct 144 LPLDEDHYLAGSHTNPLITGKVTLILPSRAKQVSVIGAVTQPHTLPFTALTSAREYLTQS 203
Query 178 PRLAGADKNNVMVITPEGETVVAPVALWNKRHVEPPPGSQLWLGFSAHVLPEKYADLNDQ 237
P L ++V VI+P GE + A WN +H PG+ +++ F LP A LND
Sbjct 204 PTLNRFGISDVAVISPNGELAIHHTAYWNAQHQNVAPGAVIFVPFQR--LPFGLASLNDT 261
Query 238 IVSVLTQRV 246
+ +L RV
Sbjct 262 LPRLLQHRV 270
>gb|EBT31304.1| hypothetical protein GOS_7294385 [marine metagenome]
Length=115
Score = 80.1 bits (196), Expect = 3e-13, Method: Compositional matrix adjust.
Identities = 42/114 (36%), Positives = 67/114 (58%), Gaps = 2/114 (1%)
Query 132 NPPLVGDYTLYTVQRPVTITLLGAVSGAGQLPWQAGRSVTDYLQDHPRLAGADKNNVMVI 191
NP + G + L RP TIT++GAV+ G+ W +G S DYLQ L A+ + V +I
Sbjct 2 NPIINGQFLLVLPPRPTTITVVGAVAQTGEQSWVSGVSSKDYLQQAGLLENAENSFVWII 61
Query 192 TPEGETVVAPVALWNKRHVEPPPGSQLWLGFSAHVLPEKYADLNDQIVSVLTQR 245
P+G+ + P+A WN + ++ PG+ L++ FS L + Y+ LN+ I+ +L R
Sbjct 62 QPDGKAIRQPIAYWNHQSMDIAPGAILFVEFSG--LFDDYSTLNNNIIELLKNR 113
>ref|YP_002261809.1| exported protein [Aliivibrio salmonicida LFI1238]
emb|CAQ77958.1| exported protein [Aliivibrio salmonicida LFI1238]
Length=254
Score = 80.1 bits (196), Expect = 3e-13, Method: Compositional matrix adjust.
Identities = 57/196 (29%), Positives = 101/196 (51%), Gaps = 16/196 (8%)
Query 62 GALLTDSAAKAKALKDYQHVMAQLA---------SWEAEADDDVAATIKSVRQQLLNLNI 112
G +L+D K ++ + + QL+ SW E+ + +++ + QL L+
Sbjct 63 GLILSDDEKKNTVIQQQEKLSQQLSKLGEYSPFLSW-TESTYQLNSSL--LINQLAQLSF 119
Query 113 TGRLPVKLDPDFVRVDENSNPPLVGDYTLYTVQRPVTITLLG-AVSGAGQ-LPWQAGRSV 170
RL + LD D +R+ + +NP L G ++L+ +RP TIT+LG +S Q L + SV
Sbjct 120 VSRLFIPLDIDEIRIKKENNPLLSGQFSLFVPERPTTITVLGLTLSPKPQTLSYIENGSV 179
Query 171 TDYLQDHPRLAGADKNNVMVITPEGETVVAPVALWNKRHVEPPPGSQLWLGFSAHVLPEK 230
DYL + + A+ + V VI P+G +A W K V PG+ +++GF+ LP+
Sbjct 180 KDYLHNVDVSSQANTSQVYVIQPDGVVQIASNNQWQKNTVSIAPGATIFIGFNE--LPDS 237
Query 231 YADLNDQIVSVLTQRV 246
+ ++ I+ +L +V
Sbjct 238 LSSIHQDIIQLLRNKV 253
>gb|ECC92702.1| hypothetical protein GOS_5462754 [marine metagenome]
Length=116
Score = 76.3 bits (186), Expect = 5e-12, Method: Compositional matrix adjust.
Identities = 40/114 (35%), Positives = 66/114 (57%), Gaps = 2/114 (1%)
Query 132 NPPLVGDYTLYTVQRPVTITLLGAVSGAGQLPWQAGRSVTDYLQDHPRLAGADKNNVMVI 191
NP + G + L RP TIT++GAV+ G+ W + S DYL+ L A+ + V +I
Sbjct 3 NPIIDGQFLLVLPPRPTTITVVGAVAQTGEQKWVSRTSSKDYLKQAGLLENAENSFVWII 62
Query 192 TPEGETVVAPVALWNKRHVEPPPGSQLWLGFSAHVLPEKYADLNDQIVSVLTQR 245
P+G+ + P+A WN + ++ PG+ L++ FS L + Y+ LN+ I+ +L R
Sbjct 63 QPDGKAIRQPIAYWNHQSMDIAPGAILFVEFSG--LFDDYSTLNNNIIELLKNR 114
>gb|EBT37533.1| hypothetical protein GOS_7284195 [marine metagenome]
Length=127
Score = 73.9 bits (180), Expect = 2e-11, Method: Compositional matrix adjust.
Identities = 43/125 (34%), Positives = 63/125 (50%), Gaps = 2/125 (1%)
Query 123 DFVRVDENSNPPL-VGDYTLYTVQRPVTITLLGAVSGAGQLPWQAGRSVTDYLQDHPRLA 181
D VR ++ NP L G Y L RP + L G V+ + SV YL L
Sbjct 1 DIVRAKKSLNPLLSAGQYKLLVNVRPTVVQLEGLVA-EKNIALADAVSVVHYLDSVSVLD 59
Query 182 GADKNNVMVITPEGETVVAPVALWNKRHVEPPPGSQLWLGFSAHVLPEKYADLNDQIVSV 241
G + + +IT G+ ++A LWN + E PGS L++ F LPE ++D+N+QIV +
Sbjct 60 GGSSSYLYIITAAGKVLIAKTGLWNNTYQEVSPGSLLFVPFEQRFLPEAFSDINEQIVEL 119
Query 242 LTQRV 246
L +V
Sbjct 120 LLHKV 124
>ref|YP_004565064.1| hypothetical protein VAA_02509 [Vibrio anguillarum 775]
gb|ABI93956.1| WbfC [Listonella anguillarum]
gb|AEH32022.1| Hypothetical exported protein [Vibrio anguillarum 775]
Length=289
Score = 73.9 bits (180), Expect = 2e-11, Method: Compositional matrix adjust.
Identities = 64/256 (25%), Positives = 114/256 (44%), Gaps = 16/256 (6%)
Query 4 LQSYFIASVLYVMTPHAFAQGTVTIYLPGEQQ---TLSVGPVENVVQLV-----TQPQLR 55
L +F+ VL ++P A++ + + +LS V V +LV QL
Sbjct 38 LMKHFLLGVLCALSPAAYSAPASVVEVRSADTPALSLSFASVPRVDELVINALNASAQLP 97
Query 56 DRLWWPGALLTDSAAKAKALKDYQHVMAQLASWEAEADDDVAATIKSVRQQLLNLNITGR 115
+ W + L D ++ + Q V+ + E+ A + +++QL R
Sbjct 98 ADIDWLSSALFDVSSP---YQKKQQVLLAITHQESLAASAHKKRWRQLKEQLRARTFAQR 154
Query 116 LPVKLDPDFVRVDENSNPPLVGDYTLYTVQRPVTITLLGAVSGAGQLPWQAGRSVTDYLQ 175
+ LDPD R+ + NP L G + L P T+++LG V AG + W+ Y++
Sbjct 155 IFTPLDPDITRITASQNPKLQGQWLLSLNALPTTVSVLGNVKQAGDMAWKPRTDAGHYVR 214
Query 176 DHPRLAGADKNNVMVITPEGETVVAPVALWNKRHVEPPPGSQLWLGF---SAHVLPE-KY 231
LA D + V VI P+G + +A WN + PG+ L++ F + + P+
Sbjct 215 S-AGLAETDISQVWVIQPDGHASLHDIAYWNHDFQDIAPGATLYVPFPIETTSLYPQYSL 273
Query 232 ADLNDQIVSVLTQRVP 247
++ND +V +L ++P
Sbjct 274 HNVNDIVVELLRNQLP 289
>ref|ZP_05879679.1| hypothetical protein VFA_003816 [Vibrio furnissii CIP 102972]
gb|EEX39519.1| hypothetical protein VFA_003816 [Vibrio furnissii CIP 102972]
Length=220
Score = 72.4 bits (176), Expect = 6e-11, Method: Compositional matrix adjust.
Identities = 55/166 (33%), Positives = 75/166 (45%), Gaps = 3/166 (1%)
Query 59 WWPGALLTDSAAKAKALKDYQHVMAQLASWEAEADDDVAATIKSVRQQLLNLNITGRLPV 118
+ G L A KA + Y V A+L++ VAA K + QL R +
Sbjct 34 YHEGYRLFSGAKADKAQELYAGVKARLSALLNNDTYRVAA--KQLLTQLSGYQYGYREKL 91
Query 119 KLDPDFVRVDENSNPPLVGDYTLYTVQRPVTITLLGAVSGAGQLPWQAGRSVTDYLQDHP 178
LD D VR+ NP L G + L QRP +I L G +S Q+ + A +V DY+
Sbjct 92 NLDVDAVRLKPEFNPLLPGQFQLELAQRPNSIALFG-LSEQQQMTFNANFTVADYIARSR 150
Query 179 RLAGADKNNVMVITPEGETVVAPVALWNKRHVEPPPGSQLWLGFSA 224
++ VI+PEGE A WN H PGS L +GF+A
Sbjct 151 SPHKKHHSDAWVISPEGEITHVGYAYWNNAHTHLKPGSALLVGFNA 196
>ref|YP_154958.1| hypothetical protein IL0568 [Idiomarina loihiensis L2TR]
gb|AAV81409.1| Fusion of WbfC- and WbfB-like uncharacterized domains involved
in polysaccharide synthesis [Idiomarina loihiensis L2TR]
Length=955
Score = 72.4 bits (176), Expect = 6e-11, Method: Compositional matrix adjust.
Identities = 53/198 (26%), Positives = 93/198 (46%), Gaps = 8/198 (4%)
Query 59 WWPGALLTDSAAKAKALKDYQHVMAQLAS----WEAEADDDVAATIKSVRQQLLNLNITG 114
+WP L + KA+ + ++AQL W++ + + A + ++ Q+ + +
Sbjct 51 YWPLVRLVKTDDKAEIEQQRNQILAQLTELEQYWQSRRETEKAQSAALLKSQVKSWQLGK 110
Query 115 RLPVKLDPDFVRVDENSNPPL-VGDYTLYTVQRPVTITLLGAVSGAGQLPWQAGRSVTDY 173
+L ++ + R + SNP L G+Y LY +RP ++ + G VS G + +G++V D+
Sbjct 111 QLWGQISIENARTELASNPLLPAGEYKLYVPERPDSVHVYGVVSTPGDYRYASGKTVADW 170
Query 174 LQ---DHPRLAGADKNNVMVITPEGETVVAPVALWNKRHVEPPPGSQLWLGFSAHVLPEK 230
L D L A + V + + A A + + E PG LWLGF + LP K
Sbjct 171 LNSIDDSEGLGAAYNKSQAVRIRQNQQQTADWAYYKQSDAELLPGDILWLGFEPNQLPPK 230
Query 231 YADLNDQIVSVLTQRVPD 248
+ LN I +L V +
Sbjct 231 FESLNADIRDLLMHFVAN 248
>gb|ADT85366.1| hypothetical periplasmic protein [Vibrio furnissii NCTC 11218]
Length=252
Score = 72.4 bits (176), Expect = 6e-11, Method: Compositional matrix adjust.
Identities = 67/225 (29%), Positives = 98/225 (43%), Gaps = 9/225 (4%)
Query 6 SYFIASVLYVMTPHAFAQGTVTIYLPGEQQTLSVG-PVE-NVVQLVTQPQLRDR----LW 59
++ + + V + A+A T+ LP +Q L PV + V L Q Q + +
Sbjct 7 TWLLLCLTSVSSTSAWANTPTTVTLPMQQVILQYNQPVRLDRVLLDAQQQANQKNHLNTY 66
Query 60 WPGALLTDSAAKAKALKDYQHVMAQLASWEAEADDDVAATIKSVRQQLLNLNITGRLPVK 119
G L A KA + Y V +L++ +AA K + QL R +
Sbjct 67 HEGYRLFSGAKADKAQELYASVKERLSTLLNNETYRIAA--KQLLTQLSGYQYGYREKLN 124
Query 120 LDPDFVRVDENSNPPLVGDYTLYTVQRPVTITLLGAVSGAGQLPWQAGRSVTDYLQDHPR 179
LD D VR+ NP L G + L QRP +I L G +S Q+ + A +V DY+
Sbjct 125 LDVDAVRLKPEFNPLLPGQFQLELAQRPNSIALFG-LSEQQQMTFNANFTVADYIARSRS 183
Query 180 LAGADKNNVMVITPEGETVVAPVALWNKRHVEPPPGSQLWLGFSA 224
++ VI+PEGE A WN H PGS L +GF+A
Sbjct 184 PHKKHHSDAWVISPEGEITHVGYAYWNNAHTHLKPGSALLVGFNA 228
>ref|ZP_05883360.1| polysaccharide synthesis-related protein [Vibrio metschnikovii
CIP 69.14]
gb|EEX35778.1| polysaccharide synthesis-related protein [Vibrio metschnikovii
CIP 69.14]
Length=274
Score = 70.1 bits (170), Expect = 3e-10, Method: Compositional matrix adjust.
Identities = 52/178 (29%), Positives = 82/178 (46%), Gaps = 15/178 (8%)
Query 80 HVMAQLASWEAEADDDVAATIKSVRQQLLNLNITGRLPVKLDPDFVRVDENSNPPLVGDY 139
V+ QLA + D A + +R QL R + LDPD +RV + NP L G +
Sbjct 100 QVLKQLALQQDNISADQARAWEQLRNQLRRSEFAQREFIPLDPDVIRVVDRHNPLLKGHF 159
Query 140 TLY--TVQRPVTITLLGAVSGAGQLPWQAGRSVTDYLQD----HPRLAGADKNNVMVITP 193
L+ + +++ GAV+ + WQA S DY Q +PRL+ VMVI P
Sbjct 160 ALHLPLLSDQTNVSVWGAVNEPTRFEWQANFSAKDYAQQAQWINPRLSA-----VMVIQP 214
Query 194 EGETVVAPVALWNKRHVEPPPGSQLWLGFSAHVLP----EKYADLNDQIVSVLTQRVP 247
G+ PV W + + PG+ +++ F+ +L + N Q+V +L +P
Sbjct 215 NGDVQSHPVGYWQSQPLPVQPGAIIYVPFTRSLLASLSRSELDQTNQQVVELLRHLLP 272
>ref|YP_002415894.1| hypothetical protein VS_0210 [Vibrio splendidus LGP32]
emb|CAV17242.1| Conserved hypothetical protein [Vibrio splendidus LGP32]
Length=313
Score = 68.9 bits (167), Expect = 8e-10, Method: Compositional matrix adjust.
Identities = 71/277 (25%), Positives = 121/277 (43%), Gaps = 45/277 (16%)
Query 8 FIASVLYVMTPHAFAQGTVTIYLPGEQQTLSVGPVENVVQLVTQPQLRDRL-------WW 60
++++ V+TP + TI LP + TL ++Q++ + ++
Sbjct 42 YVSAASNVVTPDGSSLMKTTIELPLQGVTLEYKANVRLLQVLDDANASSNVNDNSSIGYF 101
Query 61 P-GALLTD--SAAKAKALKDYQ----HVMAQLASWEAEADDDVAATIKSVRQQLLNLNIT 113
P A L D +A KA KD + +V QL ++ E + K V+QQL +
Sbjct 102 PLSAQLFDKTNAESNKANKDIEAKKRNVFNQLDAFSVEEPE-----AKLVKQQLASFQYL 156
Query 114 GRLPVKLDPDFVRVDENSNPPLVGD---------------------YTLYTVQRPVTITL 152
R+ ++LD + V + NP LV ++LY QRP +I L
Sbjct 157 NRVFIELDRNAVISQSDKNPLLVSSSHTNKPSSAAIKRASTSQTQAFSLYLPQRPTSIQL 216
Query 153 LGAVSGAGQLPWQAGRSVTDYLQDHPR-LAG--ADKNNVMVITPEGETVVAPVALWNKRH 209
+GA+ + + ++ DYL P G ADK+ V+ P+G A WN++
Sbjct 217 MGAMKASVTMNLIEHGTLNDYLDALPNGFIGESADKSVAYVVQPDGVVRTIQYAYWNEQS 276
Query 210 VEPPPGSQLWLGFSAHVLPEKYADLNDQIVSVLTQRV 246
PG+ +++ F + LP +Y+ LN IV +L +V
Sbjct 277 AYFAPGAIVFMAF--YSLPSEYSTLNQDIVDLLRHKV 311
>ref|ZP_01982747.1| conserved hypothetical protein [Vibrio cholerae 623-39]
gb|EDL72570.1| conserved hypothetical protein [Vibrio cholerae 623-39]
Length=256
Score = 68.6 bits (166), Expect = 1e-09, Method: Compositional matrix adjust.
Identities = 50/173 (28%), Positives = 80/173 (46%), Gaps = 5/173 (2%)
Query 79 QHVMAQLASWEAEADDDVAATIKSVRQQLLNLNITGRLPVKLDPDFVRVDENSNPPLVGD 138
QHV+ LA ++ A A +++R QL R+ +DPD+ R+ NP L G
Sbjct 84 QHVLLVLAHQQSAAPMAQKALWETLRDQLRLSAFAKRIFTPVDPDWTRLAAQDNPRLNGQ 143
Query 139 YTLYTVQRPVTITLLGAVSGAGQLPWQAGRSVTDYLQDHPRLAGADKNNVMVITPEGETV 198
+ L R +++ GAV G + W+ +S DY Q L + ++VI P+GE
Sbjct 144 WLLTLNPRVAEVSVYGAVKKPGDVTWRGRQSAKDYAQA-AGLVDEKISEIVVIQPDGEVQ 202
Query 199 VAPVALWNKRHVEPPPGSQLWLGFSAH---VLPEK-YADLNDQIVSVLTQRVP 247
VA WN E PG+ +++ H P+ D N ++ +L R+P
Sbjct 203 KHSVAYWNMTFAEVAPGAIVYVPMPLHDSSFYPDTPSTDANQLVIELLRNRLP 255
>ref|ZP_06053682.1| hypothetical polysaccharide synthesis-related protein [Grimontia
hollisae CIP 101886]
gb|EEY70997.1| hypothetical polysaccharide synthesis-related protein [Grimontia
hollisae CIP 101886]
Length=277
Score = 68.6 bits (166), Expect = 1e-09, Method: Compositional matrix adjust.
Identities = 70/263 (26%), Positives = 111/263 (42%), Gaps = 31/263 (11%)
Query 9 IASVLYVMTPHAFAQG-----TVTIYLPGEQQ-TLSVGPVENVVQLV------------- 49
I +V+TPHA TVT + ++ +LS + QLV
Sbjct 20 IVLFCFVLTPHAAIATEETAVTVTSSVDADKHFSLSFNNAPRISQLVSEGASVIRAHISE 79
Query 50 TQPQLRDRLWWPGA-LLTDSA-----AKAKALKDYQHVMAQLASWEAEADDDVAATIKSV 103
T+ D ++W GA L +D A + ++ + + +A L W +++ A + S+
Sbjct 80 TKAHSTDSIYWTGAGLFSDDEDTSLNASSSSVINKLNKLADL--WYSDSKKRDA--VLSL 135
Query 104 RQQLLNLNITGRLPVKLDPDFVRVDENSNPPLVGDYTLYTVQRPVTITLLGAVSGAGQLP 163
R + RL V LD D NP + G + RP I + GAV+ ++P
Sbjct 136 RDFIAASTFKPRLTVTLDEDVYLAGTQQNPLMKGRFYFQLPSRPTHIWMTGAVAKTQKIP 195
Query 164 WQAGRSVTDYLQDHPRLAGADKNNVMVITPEGETVVAPVALWNKRHVEPPPGSQLWLGFS 223
+ A +DYL + L +NV VI P GE V+ WN + PG+ +++ F
Sbjct 196 FSAPFFASDYLNNTTTLNTFGISNVSVIQPNGELETHHVSYWNHQPAGLAPGAIIFVPFQ 255
Query 224 AHVLPEKYADLNDQIVSVLTQRV 246
LP LN +I +L RV
Sbjct 256 H--LPTDLDVLNQEIPRLLQHRV 276
>ref|ZP_01950873.1| WbfC protein [Vibrio cholerae 1587]
gb|EAY32675.1| WbfC protein [Vibrio cholerae 1587]
Length=256
Score = 68.2 bits (165), Expect = 1e-09, Method: Compositional matrix adjust.
Identities = 50/173 (28%), Positives = 80/173 (46%), Gaps = 5/173 (2%)
Query 79 QHVMAQLASWEAEADDDVAATIKSVRQQLLNLNITGRLPVKLDPDFVRVDENSNPPLVGD 138
QHV+ LA ++ A A +++R QL R+ +DPD+ R+ NP L G
Sbjct 84 QHVLLVLAHQQSAAPMAQKALWETLRDQLRLSAFAKRIFTPVDPDWTRLAAQDNPRLNGQ 143
Query 139 YTLYTVQRPVTITLLGAVSGAGQLPWQAGRSVTDYLQDHPRLAGADKNNVMVITPEGETV 198
+ L R +++ GAV G + W+ +S DY Q L + ++VI P+GE
Sbjct 144 WLLTLNPRVAEVSVYGAVKKPGDVTWRGRQSAKDYAQA-AGLVDEQISEIVVIQPDGEVQ 202
Query 199 VAPVALWNKRHVEPPPGSQLWLGFSAH---VLPEK-YADLNDQIVSVLTQRVP 247
VA WN E PG+ +++ H P+ D N ++ +L R+P
Sbjct 203 KHSVAYWNMTFAEVAPGAIVYVPMPLHDSSFYPDTPSTDANQLVIELLRNRLP 255
>ref|ZP_03348517.1| hypothetical protein Salmoneentericaenterica_23372 [Salmonella
enterica subsp. enterica serovar Typhi str. E00-7866]
Length=33
Score = 67.0 bits (162), Expect = 2e-09, Method: Compositional matrix adjust.
Identities = 33/33 (100%), Positives = 33/33 (100%), Gaps = 0/33 (0%)
Query 70 AKAKALKDYQHVMAQLASWEAEADDDVAATIKS 102
AKAKALKDYQHVMAQLASWEAEADDDVAATIKS
Sbjct 1 AKAKALKDYQHVMAQLASWEAEADDDVAATIKS 33
>ref|ZP_06154782.1| protein of unknown function DUF1017 [Photobacterium damselae
subsp. damselae CIP 102761]
gb|EEZ40479.1| protein of unknown function DUF1017 [Photobacterium damselae
subsp. damselae CIP 102761]
Length=272
Score = 67.0 bits (162), Expect = 3e-09, Method: Compositional matrix adjust.
Identities = 42/139 (30%), Positives = 71/139 (51%), Gaps = 3/139 (2%)
Query 108 LNLNITGR-LPVKLDPDFVRVDENSNPPLVGDYTLYTVQRPVTITLLGAVSGAGQLPWQA 166
LN N+ R L LD DFVR++ +NP L G Y L P + +LGAV+ ++ WQ
Sbjct 134 LNQNVVHRRLWQNLDYDFVRLNIANNPQLQGSYQLVLPTTPNKVLVLGAVATPTEVMWQP 193
Query 167 GRSVTDYLQDHPRLAGADKNNVMVITPEGETVVAPVALWNKRHVEPPPGSQLWLGFSAHV 226
S + L + +++ + VI P+G+ +A WN+ H + PG+ L++ +
Sbjct 194 RISAAELLTQVNVIDEHNRSQISVIQPDGQIETHSIAYWNQNHKDIAPGATLYVHYEQAF 253
Query 227 LPEKYADLNDQIVSVLTQR 245
+ + DLN ++ +L R
Sbjct 254 --DLHRDLNQFVIQLLQNR 270
>ref|ZP_05118573.1| conserved hypothetical protein [Vibrio parahaemolyticus 16]
gb|EED27588.1| conserved hypothetical protein [Vibrio parahaemolyticus 16]
Length=242
Score = 66.6 bits (161), Expect = 4e-09, Method: Compositional matrix adjust.
Identities = 45/152 (29%), Positives = 74/152 (48%), Gaps = 5/152 (3%)
Query 94 DDVAATIKSVRQQLLNLNITGRLPVKLDPDFVRVDENSNPPLVGDYTLYTVQRPVTITLL 153
++VA K QQ+ + + R + LD D +R D NP L G+Y L T +R T++
Sbjct 92 EEVAGVFK---QQIQSWTVAYREKIDLDFDQIRTDAADNPMLQGNYELITPKRTRTLSFE 148
Query 154 GAVSGAGQLPWQAGRSVTDYLQDHPRLAGADKNNVMVITPEGETVVAPVALWNKRHVEPP 213
GA+ + + ++ YL L A + VI P+G V A WN ++
Sbjct 149 GALYTPQDVEFDESFPLSGYLSRLNLLKSAHPSYAWVIYPDGNVVRRGYAYWNSQNTSLT 208
Query 214 PGSQLWLGFSAHVLPEKYADLNDQIVSVLTQR 245
PGS +++GF++ ++ L QIV ++T R
Sbjct 209 PGSVIFIGFNSS--NKEVQQLEQQIVQLITMR 238
>ref|ZP_05888474.1| hypothetical protein VIC_004993 [Vibrio coralliilyticus ATCC
BAA-450]
gb|EEX30697.1| hypothetical protein VIC_004993 [Vibrio coralliilyticus ATCC
BAA-450]
Length=246
Score = 66.6 bits (161), Expect = 4e-09, Method: Compositional matrix adjust.
Identities = 47/144 (32%), Positives = 67/144 (46%), Gaps = 4/144 (2%)
Query 103 VRQQLLNLNITGRLPVKLDPDFVRVDENSNPPLVGDYTLYTVQRPVTITLLGAVSGAGQL 162
+R+Q+ ++ R V LD DFVR+ ++NP L G Y RP I L G
Sbjct 103 LREQVKTWSVGYRENVSLDLDFVRLTPSANPMLSGHYQFEYPDRPTNIHLEGLFFSTTMP 162
Query 163 PWQAGRSVTDYLQDHPRLAGADKNNVMVITPEGETVVAPVALWNKRHVEPPPGSQLWLGF 222
+ +V DYL L+ A + VI P+G A WN + PPGS ++LGF
Sbjct 163 EARPDWTVKDYLSTSRVLSSASNSYAWVIYPDGNYKQVGFAYWNDENTPLPPGSSIFLGF 222
Query 223 SAHVLPEK-YADLNDQIVSVLTQR 245
+ P K + L IVS++ R
Sbjct 223 NN---PSKELSQLEQDIVSLIAWR 243
>ref|ZP_00989941.1| hypothetical protein V12B01_23145 [Vibrio splendidus 12B01]
gb|EAP95066.1| hypothetical protein V12B01_23145 [Vibrio splendidus 12B01]
Length=326
Score = 65.5 bits (158), Expect = 8e-09, Method: Compositional matrix adjust.
Identities = 49/168 (29%), Positives = 78/168 (46%), Gaps = 26/168 (15%)
Query 103 VRQQLLNLNITGRLPVKLDPDFVRVDENSNPPLVGD---------------------YTL 141
VRQQL + R+ ++LD + V + NP LV ++L
Sbjct 159 VRQQLASFQYLNRVFIELDRNAVISQSDKNPLLVSSSHTNKPSSTAIKRASTSQTQAFSL 218
Query 142 YTVQRPVTITLLGAVSGAGQLPWQAGRSVTDYLQDHPR-LAG--ADKNNVMVITPEGETV 198
Y QRP +I L+GA+ + + ++ DYL P G ADK+ V+ P+G
Sbjct 219 YLPQRPTSIQLMGAMKESVTMNLIEHGTLNDYLDALPNGFIGESADKSVAYVVQPDGLVQ 278
Query 199 VAPVALWNKRHVEPPPGSQLWLGFSAHVLPEKYADLNDQIVSVLTQRV 246
A WN++ V PG+ +++ F + LP +Y+ LN IV +L +V
Sbjct 279 TIQYAYWNEQPVYLAPGAIVFMAF--YSLPSEYSTLNQDIVDLLRHKV 324
>ref|ZP_04961240.1| Periplasmic protein involved in polysaccharide export [Vibrio
cholerae AM-19226]
gb|EDN15638.1| Periplasmic protein involved in polysaccharide export [Vibrio
cholerae AM-19226]
Length=256
Score = 65.1 bits (157), Expect = 1e-08, Method: Compositional matrix adjust.
Identities = 48/173 (27%), Positives = 78/173 (45%), Gaps = 5/173 (2%)
Query 79 QHVMAQLASWEAEADDDVAATIKSVRQQLLNLNITGRLPVKLDPDFVRVDENSNPPLVGD 138
QHV+ LA ++ A A +++R QL R+ +DPD+ R+ NP L G
Sbjct 84 QHVLLVLAHQQSAAPMAQKAIWETLRDQLRLSAFAKRIFTPIDPDWTRLAAQDNPRLNGQ 143
Query 139 YTLYTVQRPVTITLLGAVSGAGQLPWQAGRSVTDYLQDHPRLAGADKNNVMVITPEGETV 198
+ L R +++ GAV G + W+ +S DY Q L + ++VI P+G
Sbjct 144 WLLTLNPRVAEVSVYGAVKKPGDVTWRGRQSAKDYAQA-AGLVDEQISEIVVIQPDGVVQ 202
Query 199 VAPVALWNKRHVEPPPGSQLWLGFSAHVLPEKY----ADLNDQIVSVLTQRVP 247
VA WN E PG+ +++ H + D N ++ +L R+P
Sbjct 203 KHSVAYWNMTFAEVAPGAIVYVPMPLHDSSFYHDTPSTDANQLVIELLRNRLP 255
>ref|ZP_04919294.1| WbfC protein [Vibrio cholerae V51]
gb|EAZ50134.1| WbfC protein [Vibrio cholerae V51]
Length=288
Score = 64.7 bits (156), Expect = 1e-08, Method: Compositional matrix adjust.
Identities = 56/197 (28%), Positives = 87/197 (44%), Gaps = 18/197 (9%)
Query 60 WPGALLTDSAAKAKALKDYQHVMAQLASWEAEADDDVAATIKSVRQQLLNLNITGRLPVK 119
WP A L D + A + V+ +LA ++ A A S+ QL RL +
Sbjct 100 WPSAGLFD---LSHAFLFKRDVLLKLADQQSSAPPTQQALWASLIAQLRQAEFAKRLFIS 156
Query 120 LDPDFVRVDENSNPPLVGDYTLYTVQRPVTITLLGAVSGAGQLPWQAGRSVTDYLQDHPR 179
+DPD+ R+ NP L G + L + +++ GAV+ G + W S DY Q
Sbjct 157 VDPDWTRIAPQHNPRLNGSWLLTLNSKSTQVSVYGAVNQPGDVIWHNRLSAKDYAQ-AAG 215
Query 180 LAGADKNNVMVITPEGETVVAPVALWNKRHVEPPPGSQLWLGFSAHVLPEKY-------- 231
L + ++VI P+G VA WN+ E PG+ +++ LP K
Sbjct 216 LIDEQISEIVVIQPDGIAQKHAVAYWNQDFNEVAPGAIVYVP-----LPLKRAFFDPTVT 270
Query 232 -ADLNDQIVSVLTQRVP 247
ADLN ++ +L R+P
Sbjct 271 DADLNQLVIELLRNRLP 287
>dbj|BAA33618.1| unknown [Vibrio cholerae]
Length=288
Score = 64.7 bits (156), Expect = 1e-08, Method: Compositional matrix adjust.
Identities = 56/197 (28%), Positives = 87/197 (44%), Gaps = 18/197 (9%)
Query 60 WPGALLTDSAAKAKALKDYQHVMAQLASWEAEADDDVAATIKSVRQQLLNLNITGRLPVK 119
WP A L D + A + V+ +LA ++ A A S+ QL RL +
Sbjct 100 WPSAGLFD---LSHAFLFKRDVLLKLADQQSSAPPTQQALWASLIAQLRQAEFAKRLFIS 156
Query 120 LDPDFVRVDENSNPPLVGDYTLYTVQRPVTITLLGAVSGAGQLPWQAGRSVTDYLQDHPR 179
+DPD+ R+ NP L G + L + +++ GAV+ G + W S DY Q
Sbjct 157 VDPDWTRIAPQHNPRLNGSWLLTLNSKSTQVSVYGAVNQPGDVIWHNRLSAKDYAQ-AAG 215
Query 180 LAGADKNNVMVITPEGETVVAPVALWNKRHVEPPPGSQLWLGFSAHVLPEKY-------- 231
L + ++VI P+G VA WN+ E PG+ +++ LP K
Sbjct 216 LIDEQISEIVVIQPDGIAQKHAVAYWNQDFNEVAPGAIVYVP-----LPLKRAFFDPTVT 270
Query 232 -ADLNDQIVSVLTQRVP 247
ADLN ++ +L R+P
Sbjct 271 DADLNQLVIELLRNRLP 287
>ref|NP_933122.1| hypothetical protein VV0329 [Vibrio vulnificus YJ016]
dbj|BAC93093.1| putative exported protein [Vibrio vulnificus YJ016]
Length=265
Score = 64.3 bits (155), Expect = 2e-08, Method: Compositional matrix adjust.
Identities = 47/166 (28%), Positives = 73/166 (43%), Gaps = 3/166 (1%)
Query 59 WWPGALLTDSAAKAKALKDYQHVMAQLASWEAEADDDVAATIKSVRQQLLNLNITGRLPV 118
+ G L + +A+A QHV QL E D+D + + L R +
Sbjct 82 FHEGFQLFNLDKQAEADAQLQHVRQQLI--ELAKDEDYRQASQLLLTLLEKHQYGYRENI 139
Query 119 KLDPDFVRVDENSNPPLVGDYTLYTVQRPVTITLLGAVSGAGQLPWQAGRSVTDYLQDHP 178
LD D VR+ + NP L G Y L R + LLG + +P+ A V DY+
Sbjct 140 NLDIDAVRLKADLNPALPGHYALKQASRENKVLLLGLID-QKTVPFSADLDVADYIATST 198
Query 179 RLAGADKNNVMVITPEGETVVAPVALWNKRHVEPPPGSQLWLGFSA 224
+K+ VI P G + + WN +H+ PGS +++GF++
Sbjct 199 LNNNGNKSEAWVIAPNGNSSKVGYSYWNNQHMSVLPGSTIFIGFNS 244
>gb|ABI85351.1| hypothetical protein [Vibrio cholerae]
Length=256
Score = 64.3 bits (155), Expect = 2e-08, Method: Compositional matrix adjust.
Identities = 48/173 (27%), Positives = 78/173 (45%), Gaps = 5/173 (2%)
Query 79 QHVMAQLASWEAEADDDVAATIKSVRQQLLNLNITGRLPVKLDPDFVRVDENSNPPLVGD 138
QHV+ LA ++ A A +++R QL R+ +DPD+ R+ NP L G
Sbjct 84 QHVLLVLAHQQSAAPMAQKALWETLRDQLRLSAFAKRIFTPVDPDWTRLAAQDNPRLNGQ 143
Query 139 YTLYTVQRPVTITLLGAVSGAGQLPWQAGRSVTDYLQDHPRLAGADKNNVMVITPEGETV 198
+ L R +++ GAV G + W+ +S DY Q L + ++VI P+G
Sbjct 144 WLLTLNPRVAEVSVYGAVKKPGDVTWRGRQSAKDYAQA-AGLVDEQISEIVVIQPDGVVQ 202
Query 199 VAPVALWNKRHVEPPPGSQLWLGFSAHVLPEKY----ADLNDQIVSVLTQRVP 247
VA WN E PG+ +++ H + D N ++ +L R+P
Sbjct 203 KHSVAYWNMTFAEVAPGAIVYVPMPLHDSSFYHDTPSTDANQLVIELLRNRLP 255
>ref|YP_004190015.1| YjbG polysaccharide synthesis-related protein [Vibrio vulnificus
MO6-24/O]
gb|ADV87812.1| YjbG polysaccharide synthesis-related protein [Vibrio vulnificus
MO6-24/O]
Length=265
Score = 64.3 bits (155), Expect = 2e-08, Method: Compositional matrix adjust.
Identities = 47/166 (28%), Positives = 73/166 (43%), Gaps = 3/166 (1%)
Query 59 WWPGALLTDSAAKAKALKDYQHVMAQLASWEAEADDDVAATIKSVRQQLLNLNITGRLPV 118
+ G L + +A+A QHV QL E D+D + + L R +
Sbjct 82 FHEGFQLFNLDKQAEADAQLQHVRQQLI--ELAKDEDYRQASQLLLTLLEKHQYGYRENI 139
Query 119 KLDPDFVRVDENSNPPLVGDYTLYTVQRPVTITLLGAVSGAGQLPWQAGRSVTDYLQDHP 178
LD D VR+ + NP L G Y L R + LLG + +P+ A V DY+
Sbjct 140 NLDIDAVRLKADLNPALPGHYALKQASRENKVLLLGLID-QKTVPFSADLDVADYIATST 198
Query 179 RLAGADKNNVMVITPEGETVVAPVALWNKRHVEPPPGSQLWLGFSA 224
+K+ VI P G + + WN +H+ PGS +++GF++
Sbjct 199 LNNNGNKSEAWVIAPNGNSSKVGYSYWNNQHMSVLPGSTIFIGFNS 244
>ref|ZP_06943634.1| periplasmic protein [Vibrio cholerae RC385]
gb|EFH72958.1| periplasmic protein [Vibrio cholerae RC385]
Length=256
Score = 64.3 bits (155), Expect = 2e-08, Method: Compositional matrix adjust.
Identities = 48/173 (27%), Positives = 78/173 (45%), Gaps = 5/173 (2%)
Query 79 QHVMAQLASWEAEADDDVAATIKSVRQQLLNLNITGRLPVKLDPDFVRVDENSNPPLVGD 138
QHV+ LA ++ A +++R QL R+ +DPD+ R+ NP L G
Sbjct 84 QHVLLVLAHQQSAAPMTQKVLWETLRDQLRLSAFAKRIFTPIDPDWTRLAAQDNPRLNGQ 143
Query 139 YTLYTVQRPVTITLLGAVSGAGQLPWQAGRSVTDYLQDHPRLAGADKNNVMVITPEGETV 198
+ L R +++ GAV G + W+ +S DY Q L + ++VI P+G
Sbjct 144 WLLTLNPRVAEVSVYGAVKKPGDVTWRGRQSAKDYAQA-AGLVDEQISEIVVIQPDGVVQ 202
Query 199 VAPVALWNKRHVEPPPGSQLWLGFSAH---VLPEK-YADLNDQIVSVLTQRVP 247
VA WN E PG+ +++ H P+ D N ++ +L R+P
Sbjct 203 KHSVAYWNMTFAEVAPGAIVYVPMPLHDSSFYPDTPSTDANQLVIELLRNRLP 255
>ref|ZP_01978991.1| conserved hypothetical protein [Vibrio cholerae MZO-2]
gb|EDM54080.1| conserved hypothetical protein [Vibrio cholerae MZO-2]
Length=256
Score = 63.5 bits (153), Expect = 3e-08, Method: Compositional matrix adjust.
Identities = 48/173 (27%), Positives = 78/173 (45%), Gaps = 5/173 (2%)
Query 79 QHVMAQLASWEAEADDDVAATIKSVRQQLLNLNITGRLPVKLDPDFVRVDENSNPPLVGD 138
QHV+ LA ++ A +++R QL R+ +DPD+ R+ NP L G
Sbjct 84 QHVLLVLAHQQSAAPMAQKVLWETLRDQLRLSAFAKRIFTPVDPDWTRLAAQDNPRLNGQ 143
Query 139 YTLYTVQRPVTITLLGAVSGAGQLPWQAGRSVTDYLQDHPRLAGADKNNVMVITPEGETV 198
+ L R +++ GAV G + W+ +S DY Q L + ++VI P+G
Sbjct 144 WLLTLNPRVAEVSVYGAVKKPGDVTWRGRQSAKDYAQA-AGLVDEQISEIVVIQPDGVVQ 202
Query 199 VAPVALWNKRHVEPPPGSQLWLGFSAH---VLPEK-YADLNDQIVSVLTQRVP 247
VA WN E PG+ +++ H P+ D N ++ +L R+P
Sbjct 203 KHSVAYWNMTFAEVAPGAIVYVPMPLHDSSFYPDTPSTDANQLVIELLRNRLP 255
>ref|ZP_05715654.1| hypothetical protein VMD_07000 [Vibrio mimicus VM573]
gb|EEW11844.1| hypothetical protein VMD_07000 [Vibrio mimicus VM573]
Length=256
Score = 63.5 bits (153), Expect = 3e-08, Method: Compositional matrix adjust.
Identities = 48/173 (27%), Positives = 78/173 (45%), Gaps = 5/173 (2%)
Query 79 QHVMAQLASWEAEADDDVAATIKSVRQQLLNLNITGRLPVKLDPDFVRVDENSNPPLVGD 138
QHV+ +A ++ A A + +R QL R+ +DPD+ R+ NP L G
Sbjct 84 QHVLLVMAHQQSSAPIQQKALWEKMRSQLRLSAFAKRVFTPIDPDWTRIAPPDNPRLNGQ 143
Query 139 YTLYTVQRPVTITLLGAVSGAGQLPWQAGRSVTDYLQDHPRLAGADKNNVMVITPEGETV 198
+ L R +++ GAV G + W++ ++ DY Q L ++VI P+G
Sbjct 144 WLLTLNPRVGEVSVYGAVHKPGDVTWRSRQTAKDYAQA-AGLIDEKIAEIVVIQPDGVVQ 202
Query 199 VAPVALWNKRHVEPPPGSQLWLGFSAH---VLPEK-YADLNDQIVSVLTQRVP 247
V VA WN E PG+ +++ H P D N ++ +L R+P
Sbjct 203 VHSVAYWNMHFAEVAPGAIVYVPLPLHDSSFYPNTPNTDANQLVIELLRNRLP 255
>ref|NP_759771.1| hypothetical protein VV1_0794 [Vibrio vulnificus CMCP6]
gb|AAO09298.1| hypothetical protein VV1_0794 [Vibrio vulnificus CMCP6]
Length=265
Score = 63.2 bits (152), Expect = 3e-08, Method: Compositional matrix adjust.
Identities = 49/178 (27%), Positives = 78/178 (43%), Gaps = 25/178 (14%)
Query 67 DSAAKAKALKDYQHVMAQLASWEAEADDDVAATIKSVRQQLLNL---------------- 110
+ ++ A+AL H QL + +A+ D A ++ VRQQL+ L
Sbjct 72 NGSSNAQALSF--HEGFQLFNLNKQAETD--ALLQHVRQQLIELAKDEDYRQASQLLLTL 127
Query 111 ----NITGRLPVKLDPDFVRVDENSNPPLVGDYTLYTVQRPVTITLLGAVSGAGQLPWQA 166
R + LD D VR+ + NP L G Y L R + LLG + +P+ A
Sbjct 128 LEKHQYGYRENINLDIDAVRLKADLNPALPGHYALKQASRENKVLLLGLID-QKTVPFSA 186
Query 167 GRSVTDYLQDHPRLAGADKNNVMVITPEGETVVAPVALWNKRHVEPPPGSQLWLGFSA 224
V DY+ +K+ VI P G + + WN +H+ PGS +++GF++
Sbjct 187 DLDVADYIATSTLNNNGNKSEAWVIAPNGNSSKVGYSYWNNQHMSVLPGSTIFIGFNS 244
>ref|ZP_08103789.1| putative periplasmic protein [Vibrio sinaloensis DSM 21326]
gb|EGA69190.1| putative periplasmic protein [Vibrio sinaloensis DSM 21326]
Length=257
Score = 63.2 bits (152), Expect = 4e-08, Method: Compositional matrix adjust.
Identities = 36/143 (25%), Positives = 77/143 (53%), Gaps = 2/143 (1%)
Query 103 VRQQLLNLNITGRLPVKLDPDFVRVDENSNPPLVGDYTLYTVQRPVTITLLGAVSGAGQL 162
V +Q+ N ++ R ++LD D +R ++NP L G+ + +R ++ G + ++
Sbjct 113 VIEQVENWDVVYRELIELDFDTIRTQPSANPMLQGNLEFISPKRSQELSFEGLLFPPQKV 172
Query 163 PWQAGRSVTDYLQDHPRLAGADKNNVMVITPEGETVVAPVALWNKRHVEPPPGSQLWLGF 222
P+ A + +++Y + L+ A + +I P G V A A WN++ + PGS +++GF
Sbjct 173 PFDASQPLSEYFRKLNLLSNAHPSYAWIIYPNGHFVRAGYAYWNEQKTQLTPGSAVFIGF 232
Query 223 SAHVLPEKYADLNDQIVSVLTQR 245
++ L + + ++IV +++ R
Sbjct 233 NSEDL--EIQKIEERIVQLISMR 253
>ref|ZP_05240723.1| conserved hypothetical protein [Vibrio cholerae MO10]
emb|CAA62137.1| WbfC protein [Vibrio cholerae]
dbj|BAA33588.1| unknown [Vibrio cholerae]
gb|EET25492.1| conserved hypothetical protein [Vibrio cholerae MO10]
Length=288
Score = 63.2 bits (152), Expect = 4e-08, Method: Compositional matrix adjust.
Identities = 55/197 (27%), Positives = 86/197 (43%), Gaps = 18/197 (9%)
Query 60 WPGALLTDSAAKAKALKDYQHVMAQLASWEAEADDDVAATIKSVRQQLLNLNITGRLPVK 119
WP A L D + A + V+ +LA ++ A A S+ QL RL +
Sbjct 100 WPSAGLFD---LSHAFLFKRDVLLKLADQQSSAPPTQQALWASLIAQLRQAEFAKRLFIS 156
Query 120 LDPDFVRVDENSNPPLVGDYTLYTVQRPVTITLLGAVSGAGQLPWQAGRSVTDYLQDHPR 179
+DPD+ R+ NP L G + L + +++ GAV+ G + W S DY
Sbjct 157 VDPDWTRIAPQHNPRLNGSWLLTLNSKSTQVSVYGAVNQPGDVIWHNRLSAKDYAHA-AG 215
Query 180 LAGADKNNVMVITPEGETVVAPVALWNKRHVEPPPGSQLWLGFSAHVLPEKY-------- 231
L + ++VI P+G VA WN+ E PG+ +++ LP K
Sbjct 216 LIDEQISEIVVIQPDGIAQKHAVAYWNQDFNEVAPGAIVYVP-----LPLKRAFFDPTVT 270
Query 232 -ADLNDQIVSVLTQRVP 247
ADLN ++ +L R+P
Sbjct 271 DADLNQLVIELLRNRLP 287
>ref|ZP_04402220.1| polysaccharide synthesis-related protein [Vibrio cholerae TMA
21]
gb|EEO15214.1| polysaccharide synthesis-related protein [Vibrio cholerae TMA
21]
Length=256
Score = 62.8 bits (151), Expect = 4e-08, Method: Compositional matrix adjust.
Identities = 48/173 (27%), Positives = 78/173 (45%), Gaps = 5/173 (2%)
Query 79 QHVMAQLASWEAEADDDVAATIKSVRQQLLNLNITGRLPVKLDPDFVRVDENSNPPLVGD 138
QHV+ LA ++ A +++R QL R+ +DPD+ R+ NP L G
Sbjct 84 QHVLLVLAHQQSAAPMVQKVLWETLRDQLRLSAFAKRIFTPVDPDWTRLAAQDNPRLNGQ 143
Query 139 YTLYTVQRPVTITLLGAVSGAGQLPWQAGRSVTDYLQDHPRLAGADKNNVMVITPEGETV 198
+ L R +++ GAV G + W+ +S DY Q L + ++VI P+G
Sbjct 144 WLLTLNPRVAEVSVYGAVKKPGDVTWRGRQSAKDYAQ-AAGLVDEQISEIVVIQPDGVVQ 202
Query 199 VAPVALWNKRHVEPPPGSQLWLGFSAH---VLPEK-YADLNDQIVSVLTQRVP 247
VA WN E PG+ +++ H P+ D N ++ +L R+P
Sbjct 203 KHSVAYWNMTFAEVAPGAIVYVPMPLHDSSFYPDTPSTDANQLVIELLRNRLP 255
>ref|ZP_04717494.1| hypothetical protein AmacA2_21203 [Alteromonas macleodii ATCC
27126]
Length=249
Score = 62.4 bits (150), Expect = 6e-08, Method: Compositional matrix adjust.
Identities = 46/191 (24%), Positives = 90/191 (47%), Gaps = 4/191 (2%)
Query 59 WWPGALLTDSAAKAKALKDYQHVMAQLASW--EAEADDDVAATIKSVRQQLLNLNITGRL 116
+WP A D A KA ++ + ++Q++ + +AD + ++++ Q+ + ++ R+
Sbjct 57 YWPSASAFD-LANPKAEEEKEIALSQISGLLNQFDADSETHKALQNLYDQVSSWTVSTRI 115
Query 117 PVKLDPDFVRVDENSNPPLV-GDYTLYTVQRPVTITLLGAVSGAGQLPWQAGRSVTDYLQ 175
+ + + R+ NP G Y + RP + GAV G Q+ SV +
Sbjct 116 DMPISYNRARLFFEDNPMFQPGKYWIRLNGRPDVVHFSGAVVKPGAYKHQSDTSVYTAVH 175
Query 176 DHPRLAGADKNNVMVITPEGETVVAPVALWNKRHVEPPPGSQLWLGFSAHVLPEKYADLN 235
+ AD+++V VI P G +A WN + PGSQ+++ S+ + K LN
Sbjct 176 TVKKAVDADRSHVYVIDPMGNIEEKGIAYWNLDFGQLMPGSQVYVPISSELFSNKLKQLN 235
Query 236 DQIVSVLTQRV 246
+++ ++ RV
Sbjct 236 ERVAALAVHRV 246
>gb|ECV65415.1| hypothetical protein GOS_2858926 [marine metagenome]
Length=237
Score = 62.0 bits (149), Expect = 7e-08, Method: Compositional matrix adjust.
Identities = 46/191 (24%), Positives = 90/191 (47%), Gaps = 4/191 (2%)
Query 59 WWPGALLTDSAAKAKALKDYQHVMAQLASW--EAEADDDVAATIKSVRQQLLNLNITGRL 116
+WP A D A KA ++ + ++Q++ + +AD + ++++ Q+ + ++ R+
Sbjct 45 YWPSASAFD-LANPKAEEEKKIALSQISGLLNQFDADSETHKALQNLYDQVSSWTVSTRI 103
Query 117 PVKLDPDFVRVDENSNPPLV-GDYTLYTVQRPVTITLLGAVSGAGQLPWQAGRSVTDYLQ 175
+ + + R+ NP G Y + RP + GAV G Q+ SV +
Sbjct 104 DMPISYNRARLFFEDNPMFQPGKYWIRLNGRPDVVHFSGAVVKPGAYKHQSDTSVYTAVH 163
Query 176 DHPRLAGADKNNVMVITPEGETVVAPVALWNKRHVEPPPGSQLWLGFSAHVLPEKYADLN 235
+ AD+++V VI P G +A WN + PGSQ+++ S+ + K LN
Sbjct 164 TVKKAVDADRSHVYVIDPMGNIEEKGIAYWNLDFGQLMPGSQVYVPISSELFSNKLKQLN 223
Query 236 DQIVSVLTQRV 246
+++ ++ RV
Sbjct 224 ERVAALAVHRV 234
>ref|YP_002873221.1| hypothetical protein PFLU3660 [Pseudomonas fluorescens SBW25]
emb|CAY49907.1| putative membrane protein [Pseudomonas fluorescens SBW25]
Length=256
Score = 60.8 bits (146), Expect = 2e-07, Method: Compositional matrix adjust.
Identities = 55/176 (31%), Positives = 77/176 (43%), Gaps = 8/176 (4%)
Query 71 KAKALKDYQ--HVMAQLASWEAEADDDVAATIKSVRQQLLNLNITGRLPVKLDPDFVRVD 128
KA L D Q H A +A E D A + QQ+ L +TGR LDP V V
Sbjct 77 KAGVLFDLQTLHQAALVAGRE-----DRARAAAQLYQQVQALPVTGRQAAVLDPVAVEVG 131
Query 129 ENSNPPLVGDYTLYTVQRPVTITLLGAVSGAGQLPWQAGRSVTDYLQDHPRLAGADKNNV 188
N P+ L RP ++ +LGAV+ A L + + + YL P AD + +
Sbjct 132 FAPNLPVSSGDRLVYPPRPDSVRVLGAVARACTLAFAPLQQGSAYLDACPASKAADTDYL 191
Query 189 MVITPEGETVVAPVALWNKRHVEPP-PGSQLWLGFSAHVLPEKYADLNDQIVSVLT 243
+I P+G +A WN+ PP PGS L + + L +LN Q+ L
Sbjct 192 WLIQPDGHVTRLGIAPWNREEGAPPAPGSTLLVPIRSDDLDPPTPELNQQLAEFLA 247
>ref|ZP_06040321.1| polysaccharide synthesis-related protein [Vibrio mimicus MB-451]
gb|EEY39705.1| polysaccharide synthesis-related protein [Vibrio mimicus MB-451]
Length=256
Score = 60.5 bits (145), Expect = 2e-07, Method: Compositional matrix adjust.
Identities = 47/173 (27%), Positives = 77/173 (44%), Gaps = 5/173 (2%)
Query 79 QHVMAQLASWEAEADDDVAATIKSVRQQLLNLNITGRLPVKLDPDFVRVDENSNPPLVGD 138
QHV+ +A ++ A A + +R QL R+ +DPD+ R+ N L G
Sbjct 84 QHVLLVMAHQQSSAPIQQKALWEKMRSQLRLSAFAKRVFTPIDPDWTRIAPQDNHRLNGQ 143
Query 139 YTLYTVQRPVTITLLGAVSGAGQLPWQAGRSVTDYLQDHPRLAGADKNNVMVITPEGETV 198
+ L R +++ GAV G + W++ ++ DY Q L ++VI P+G
Sbjct 144 WLLTLNPRVGEVSVYGAVHKPGDVTWRSRQTAKDYAQA-AGLIDEKIAEIVVIQPDGVVQ 202
Query 199 VAPVALWNKRHVEPPPGSQLWLGFSAH---VLPEK-YADLNDQIVSVLTQRVP 247
V VA WN E PG+ +++ H P D N ++ +L R+P
Sbjct 203 VHSVAYWNMNFAEVAPGAIVYVPIPLHDSSFYPNTPNTDANQLVIELLRNRLP 255
>ref|ZP_07774980.1| hypothetical protein PFWH6_2379 [Pseudomonas fluorescens WH6]
gb|EFQ63928.1| hypothetical protein PFWH6_2379 [Pseudomonas fluorescens WH6]
Length=255
Score = 58.5 bits (140), Expect = 8e-07, Method: Compositional matrix adjust.
Identities = 48/155 (30%), Positives = 72/155 (46%), Gaps = 3/155 (1%)
Query 91 EADDDVAATIKSVRQQLLNLNITGRLPVKLDPDFVRVDENSNPPL-VGDYTLYTVQRPVT 149
E D AA + +Q+ L +TGR LDP + V + PL GD +Y RP T
Sbjct 93 EGRDTRAALSARLYKQVERLPVTGRQVAVLDPIALEVGFALDSPLDEGDRLIYPA-RPST 151
Query 150 ITLLGAVSGAGQLPWQAGRSVTDYLQDHPRLAGADKNNVMVITPEGETVVAPVALWNKRH 209
+ + GAV Q+P+ A + Y + L+ A + V +I P+G VA WN
Sbjct 152 VEVWGAVEQTCQVPYAAAQEAWVYARRCAILSDAQSDYVWLIQPDGHVRRLGVAPWNHEE 211
Query 210 -VEPPPGSQLWLGFSAHVLPEKYADLNDQIVSVLT 243
V P PGS++ + + L +LN+Q+ L
Sbjct 212 GVMPAPGSRILVPIRSDDLQSPTPELNEQLAEFLA 246
>ref|YP_004469043.1| hypothetical protein ambt_18740 [Alteromonas sp. SN2]
gb|AEF05241.1| hypothetical protein ambt_18740 [Alteromonas sp. SN2]
Length=247
Score = 58.2 bits (139), Expect = 1e-06, Method: Compositional matrix adjust.
Identities = 44/190 (23%), Positives = 85/190 (44%), Gaps = 2/190 (1%)
Query 59 WWPGALLTD-SAAKAKALKDYQHVMAQLASWEAEADDDVAATIKSVRQQLLNLNITGRLP 117
+WP + + + + A A+ KD ++ + A+ D ++++ QQ+ + ++ R+
Sbjct 55 YWPTSGIYNLNDAYAEREKDAVLSEIRMVMKDYNANSDTYRALENLYQQVSSWTVSTRVI 114
Query 118 VKLDPDFVRVDENSNPPLV-GDYTLYTVQRPVTITLLGAVSGAGQLPWQAGRSVTDYLQD 176
+ + R+ NP G Y L RP + G V G S+ +
Sbjct 115 TPISYNRARLIAEENPMFQPGRYLLRISPRPSVVHFSGLVIKPGAYRHGNDLSIFSTAKS 174
Query 177 HPRLAGADKNNVMVITPEGETVVAPVALWNKRHVEPPPGSQLWLGFSAHVLPEKYADLND 236
+ + ADK++V VITP GE +A WN + + PGSQ+++ + + LN
Sbjct 175 VTKASDADKSHVFVITPMGEIEKRGIAYWNIDYSQLMPGSQIYVPITGQIFSSTLDALNT 234
Query 237 QIVSVLTQRV 246
+I ++ R+
Sbjct 235 RIANLAVHRI 244
>ref|YP_001339659.1| hypothetical protein Mmwyl1_0790 [Marinomonas sp. MWYL1]
gb|ABR69724.1| conserved hypothetical protein [Marinomonas sp. MWYL1]
Length=263
Score = 57.0 bits (136), Expect = 3e-06, Method: Compositional matrix adjust.
Identities = 56/225 (24%), Positives = 96/225 (42%), Gaps = 12/225 (5%)
Query 21 FAQGTVTIYLPGEQQTLSVGPVENVVQLVTQP--QLRDRLWWPGALLTDSAAKAKALKDY 78
+Q + +Y P + TLS + Q++T QL + + G L D + + + +
Sbjct 42 LSQEQIALYYPQQPVTLSYPQEVRLSQVLTDAYAQLNYQPYSLGTALIDLSKQQQIDEKK 101
Query 79 QHVMAQLASWEAEADDDVAATIKSVRQQLLNLNITGRLPVKLDPDFVRVDENSNPPLVGD 138
++ QL A + +A + S L+ R ++ DP VRV+ +P L G
Sbjct 102 HAILKQLQDINTPASNYIAKKLNS-------LDFVYRERIETDPSKVRVNPKYDPMLKGV 154
Query 139 YTLYTVQRPVTITLLGAVS-GAGQLPWQAGRSVTDYLQDHPRLAGADKNNVMVITPEGET 197
Y L+ +RP I L+ A L A ++ DYL + ++ +I +
Sbjct 155 YQLFLPKRPQHIYLINADDHNYLTLKLTANSNLKDYLAEQFEANRYTYDSAWIIQANQDV 214
Query 198 VVAPVALWNKRHVEPPPGSQLWLGFSAHVLPEKYADLNDQIVSVL 242
A W + PG+ +++G + LPEKY DLN I +L
Sbjct 215 YRATDIQWKGKLYFLSPGAIVFIGLTD--LPEKYRDLNADIAHLL 257
>gb|EDJ38382.1| hypothetical protein GOS_1705098 [marine metagenome]
Length=838
Score = 56.2 bits (134), Expect = 4e-06, Method: Compositional matrix adjust.
Identities = 42/173 (24%), Positives = 83/173 (47%), Gaps = 8/173 (4%)
Query 54 LRDRLWWPGALLTDSAAKAK-------ALKDYQHVMAQLASWEAEADDDVAATIKSVRQQ 106
L ++ + GA+ T ++ + + A ++ MA++ S E + D ++ + + +
Sbjct 627 LTEQAFADGAIFTRASERRQEANRNRQAAREVDQAMARVLSREDDVDSNLVVMSERLANE 686
Query 107 LLNLNITGRLPVKLDPDFVRVDENSNPPLVGDYTLYTVQRPVTITLLGAVSGAGQLPWQA 166
L GR+ V+ DP + + + L G +Y ++ +T+ + G V L +++
Sbjct 687 LREAETLGRITVEADPQILSQRPDLDVLLQGGDHIYYPEQTLTVRVSGEVQSPSALMFES 746
Query 167 GRSVTDYLQDHPRL-AGADKNNVMVITPEGETVVAPVALWNKRHVEPPPGSQL 218
G++ +DYL+D A ADK+ V+ P+G V+ WN V PGS +
Sbjct 747 GKTASDYLRDAGGFTALADKSRSFVVHPDGSARPLRVSSWNYDPVTILPGSTI 799
>ref|YP_002890759.1| hypothetical protein Tmz1t_3793 [Thauera sp. MZ1T]
gb|ACR02382.1| protein of unknown function DUF940 membrane lipoprotein putative
[Thauera sp. MZ1T]
Length=1261
Score = 55.5 bits (132), Expect = 9e-06, Method: Compositional matrix adjust.
Identities = 50/183 (27%), Positives = 87/183 (47%), Gaps = 10/183 (5%)
Query 68 SAAKAKA-LKDYQHVMAQLASWEAEADDDVAATIKSVRQQLLNLNITGRLPVKL-DPDFV 125
SA +A+A LK Q ++AQL W A T + + L +L +TGR+P+ + D ++
Sbjct 87 SARQAQAELK--QALLAQL--WSARDLQADETTRQRLADWLRSLPVTGRVPLAIVDARWL 142
Query 126 RVDENSNPPLVGDYTLYTVQRPVTITLLGAVSGAGQLPWQAGRSVTDYLQDHPRLAGADK 185
+ + + +P L + L QRP T+T+L + + G YLQ A ++
Sbjct 143 QANPDQDPILAPGHALVLPQRPGTVTVLADDGRPCAVVHRPGTQARGYLQACIGGAASEA 202
Query 186 NNVMVITPEGETVVAPVALWNKR-HVEPPPGSQLWLGFSAHVLPEKYADLNDQIVSVLTQ 244
+ ++ P+G +A WN EP PG+ +W + PE D +D++ + L
Sbjct 203 DMAWIVQPDGRVQRFGIATWNAEPQSEPAPGAWIWAPRRSAAWPE---DFSDRLATFLAT 259
Query 245 RVP 247
+ P
Sbjct 260 QSP 262
>gb|EBF39314.1| hypothetical protein GOS_9605035 [marine metagenome]
Length=354
Score = 55.1 bits (131), Expect = 9e-06, Method: Compositional matrix adjust.
Identities = 38/147 (25%), Positives = 72/147 (48%), Gaps = 1/147 (0%)
Query 73 KALKDYQHVMAQLASWEAEADDDVAATIKSVRQQLLNLNITGRLPVKLDPDFVRVDENSN 132
+A ++ MA++ S E + D ++ + + +L GR+ V+ DP + + +
Sbjct 169 QAAREVDQAMARVLSREDDVDSNLVVMSERLANELREAETLGRITVEADPQILSQRPDLD 228
Query 133 PPLVGDYTLYTVQRPVTITLLGAVSGAGQLPWQAGRSVTDYLQDHPRL-AGADKNNVMVI 191
L G +Y ++ +T+ + G V L +++G++ +DYL+D A ADK+ V+
Sbjct 229 VLLQGGDHIYYPEQTLTVRVSGEVQSPSALMFESGKTASDYLRDAGGFTALADKSRSFVV 288
Query 192 TPEGETVVAPVALWNKRHVEPPPGSQL 218
P+G V+ WN V PGS +
Sbjct 289 HPDGSARPLRVSSWNYDPVTILPGSTI 315
>gb|ECO71570.1| hypothetical protein GOS_4328030 [marine metagenome]
Length=158
Score = 54.7 bits (130), Expect = 1e-05, Method: Compositional matrix adjust.
Identities = 35/155 (22%), Positives = 73/155 (47%), Gaps = 1/155 (0%)
Query 93 DDDVAATIKSVRQQLLNLNITGRLPVKLDPDFVRVDENSNPPLV-GDYTLYTVQRPVTIT 151
D + + ++ QQ+ + ++ R+ + + + R+ NP G Y + RP +
Sbjct 1 DSETHKALDNLFQQVASWTVSTRINMPISYNRARLLIEENPMFQPGKYWIRLNGRPNVVH 60
Query 152 LLGAVSGAGQLPWQAGRSVTDYLQDHPRLAGADKNNVMVITPEGETVVAPVALWNKRHVE 211
GA+ G + S+ + + + AD+++V +I+P G+ +A WN +
Sbjct 61 FSGAILKPGAYQHLSDTSIYTAVNTVKKASDADRSHVYLISPTGQVEEKGIAYWNLDFSQ 120
Query 212 PPPGSQLWLGFSAHVLPEKYADLNDQIVSVLTQRV 246
PGSQ+++ S + K LN+++V++ RV
Sbjct 121 IMPGSQVYIPISNELFSNKLKRLNERVVALAVHRV 155
>ref|ZP_06173722.1| conserved hypothetical protein [Vibrio harveyi 1DA3]
gb|EEZ89852.1| conserved hypothetical protein [Vibrio harveyi 1DA3]
Length=243
Score = 52.8 bits (125), Expect = 6e-05, Method: Compositional matrix adjust.
Identities = 33/141 (23%), Positives = 64/141 (45%), Gaps = 4/141 (2%)
Query 86 ASWEAEADDDVAATIKSVRQQLLNL----NITGRLPVKLDPDFVRVDENSNPPLVGDYTL 141
A E D+ K + +L N + R+ +D D VR+++ +NP L G Y L
Sbjct 80 ARLSKERVLDLMKEYKLNKTELFNFIQRSGTSKRVISNIDLDTVRLNKKNNPLLKGKYIL 139
Query 142 YTVQRPVTITLLGAVSGAGQLPWQAGRSVTDYLQDHPRLAGADKNNVMVITPEGETVVAP 201
+R + + +LG +P + S++ L D L ++ ++I P+G+ +
Sbjct 140 SVGEREIPLLVLGNTPKVATIPTEENMSLSRLLNDETSLFKNLEHAPVLIYPDGKLTQSH 199
Query 202 VALWNKRHVEPPPGSQLWLGF 222
+ W + PPG+ +++ F
Sbjct 200 IGNWKTKSYSLPPGTIIYIPF 220
>gb|ECG95961.1| hypothetical protein GOS_3683526 [marine metagenome]
Length=231
Score = 52.0 bits (123), Expect = 1e-04, Method: Compositional matrix adjust.
Identities = 42/165 (25%), Positives = 82/165 (49%), Gaps = 9/165 (5%)
Query 64 LLTDSAAKAKALKDYQHVMAQLASWEAEADD----DVAATIKSVRQQLLNLNITGRLPVK 119
L T+ + A + Y+ + L++ +A + AA+ + + ++L N ++GR+ +
Sbjct 32 LNTEKINRMAADELYKSFLDNLSNLSGKATSGSALEGAASTRLIMEELKNSPVSGRVSAE 91
Query 120 LDPDFVRVDENSNPPLVGDYTLYTVQRPVT-ITLLGAVSGAGQLPWQAGRSVTDYLQDHP 178
D + + D S ++ D T+ V + + G VS G + ++ G+ V+ Y++
Sbjct 92 FDINVLEEDP-SKDVVLQDGDKITIPEFVNQVYIFGEVSSEGTVRFEKGQPVSFYIEKKG 150
Query 179 RLAG-ADKNNVMVITPEGET--VVAPVALWNKRHVEPPPGSQLWL 220
+G AD+ NV V+ P GET V V + R +E PGS +++
Sbjct 151 GFSGFADERNVFVLNPNGETFKVSKNVFMRQGRDIEIYPGSVIFV 195
>ref|ZP_07742696.1| putative periplasmic protein [Vibrio caribbenthicus ATCC BAA-2122]
gb|EFP97003.1| putative periplasmic protein [Vibrio caribbenthicus ATCC BAA-2122]
Length=243
Score = 51.2 bits (121), Expect = 2e-04, Method: Compositional matrix adjust.
Identities = 43/181 (23%), Positives = 76/181 (41%), Gaps = 6/181 (3%)
Query 65 LTDSAAKAKALKDYQHVMAQLASWEAEADDDVAATIKSVRQQLLNLNITGRLPVKLDPDF 124
L D+ + +A V+A+L E D V I ++Q N R+ LD D
Sbjct 66 LFDNQKQKEAQMLQNSVLARLQQIETTTDYPVEQLIADIKQ----WNTGYRIKTSLDYDA 121
Query 125 VRVDENSNPPLVGDYTLYTVQRPVTITLLGAVSGAGQLPWQAGRSVTDYLQDHPRLAGAD 184
+R++ +P L G + QR + L+G ++ Q+ S+ ++D L AD
Sbjct 122 IRINPELDPLLSGHFEFTFPQRDHKVELIGLITQPTQVSITDYSSIAALMRDIVPLPHAD 181
Query 185 KNNVMVITPEGETVVAPVALWNKRHVEPPPGSQLWLGFSAHVLPEKYADLNDQIVSVLTQ 244
+ V V+ P G A WN S +++GF + ++ L I+ +++
Sbjct 182 PSFVWVVHPNGYAERVGYAYWNNAATRLTSNSTIYVGFDSD--SDQLTSLEKDIIKLISM 239
Query 245 R 245
R
Sbjct 240 R 240
>ref|YP_349560.1| hypothetical protein Pfl01_3831 [Pseudomonas fluorescens Pf0-1]
gb|ABA75569.1| putative membrane protein [Pseudomonas fluorescens Pf0-1]
Length=243
Score = 51.2 bits (121), Expect = 2e-04, Method: Compositional matrix adjust.
Identities = 45/150 (30%), Positives = 74/150 (49%), Gaps = 4/150 (2%)
Query 97 AATIKSVRQQLLNLNITGRLPVKLDPDFVRVDENSNPPL-VGDYTLYTVQRPVTITLLGA 155
AA + + +Q+ + +TGR LDP V V N L GD +Y +R + +LGA
Sbjct 87 AALAQRLAEQVRQMPVTGRQIADLDPVAVEVGFARNIRLDDGDQLIYP-KRVDEVEVLGA 145
Query 156 VSGAGQLPWQAGRSVTDYLQDHPRL-AGADKNNVMVITPEGETVVAPVALWNKRHVEPP- 213
V+ +LP+Q + +YLQ L A AD + + +I P GE+ +A WN+ + P
Sbjct 146 VAEPCRLPYQPLQEAREYLQGCTLLEADADADYLWLIQPNGESRRVGIAHWNRESGQMPV 205
Query 214 PGSQLWLGFSAHVLPEKYADLNDQIVSVLT 243
GS++ + L +LN Q+ ++
Sbjct 206 AGSKILVPVKNDDLDPPVPELNQQLAELIA 235
>ref|ZP_01074260.1| hypothetical protein MED121_15079 [Marinomonas sp. MED121]
gb|EAQ67261.1| hypothetical protein MED121_15079 [Marinomonas sp. MED121]
Length=256
Score = 50.8 bits (120), Expect = 2e-04, Method: Compositional matrix adjust.
Identities = 32/137 (23%), Positives = 66/137 (48%), Gaps = 6/137 (4%)
Query 94 DDVAATIKSVRQQLLNLNITGRLPVKLDPDFVRVDENSNPPLVGDYTLYTVQRPVTITLL 153
+D T K++ L N R + LD + +++D+NSNP + G Y L + P I ++
Sbjct 102 NDKVETKKNLLSILEKHNFKKREEITLDLEKIQLDKNSNPIIQGKYELMLPRTPNYIIVI 161
Query 154 GAVSGAG--QLPWQAGRSVTDYL---QDHPRLAGADKN-NVMVITPEGETVVAPVALWNK 207
+ +LP + DY+ +D + K+ + ++ + ET++ V+ WN
Sbjct 162 DPSNSKDLIKLPLKYNYDFEDYINFYKDGFNIKNKIKSEKITIVQADKETIIPKVSYWND 221
Query 208 RHVEPPPGSQLWLGFSA 224
++ PG+ +++G +
Sbjct 222 KNYYLSPGAFIYIGIES 238
>gb|EBP46974.1| hypothetical protein GOS_7906243 [marine metagenome]
Length=231
Score = 47.0 bits (110), Expect = 0.003, Method: Compositional matrix adjust.
Identities = 31/106 (29%), Positives = 52/106 (49%), Gaps = 2/106 (1%)
Query 114 GRLPVKLDPDFVRVDENSNPPLVGDYTLYTVQRPVTITLLGAVSGAGQLPWQAGRSVTDY 173
GR + D ++ D N PL+G LY +RP +I ++G V A L + ++ DY
Sbjct 90 GRQTISADILTLKTDPYKNIPLMGGDELYIPKRPNSINVVGEVLNATTLNFHPDYALEDY 149
Query 174 LQDHPRLAG-ADKNNVMVITPEGETVVAPVALWNKRHVEPPPGSQL 218
++ L AD+NN+ ++ P+G L+ K+ PGS +
Sbjct 150 IEMSGGLTNYADQNNIYIVKPDGSAYTHKKTLF-KKDRNLLPGSMI 194
>ref|ZP_06180024.1| hypothetical protein VMC_14540 [Vibrio alginolyticus 40B]
gb|EEZ83725.1|