Comparative analysis of the protein sequences encoded in the genomes of three families of large DNA viruses that replicate, completely or partly, in the cytoplasm of eukaryotic cells (poxviruses, asfarviruses, and iridoviruses) and phycodnaviruses that replicate in the nucleus reveals 9 genes that are shared by all of these viruses and 22 more genes that are present in at least three of the four compared viral families. Although orthologous proteins from different viral families typically show weak sequence similarity, because of which some of them have not been identified previously, at least five of the conserved genes appear to be synapomorphies (shared derived characters) that unite these four viral families, to the exclusion of all other known viruses and cellular life forms. Cladistic analysis with the genes shared by at least two viral families as evolutionary characters supports the monophyly of poxviruses, asfarviruses, iridoviruses, and phycodnaviruses. The results of genome comparison allow a tentative reconstruction of the ancestral viral genome and suggest that the common ancestor of all of these viral families was a nucleocytoplasmic virus with an icosahedral capsid, which encoded complex systems for DNA replication and transcription, a redox protein involved in disulfide bond formation in virion membrane proteins, and probably inhibitors of apoptosis. The conservation of the disulfide-oxidoreductase, a major capsid protein, and two virion membrane proteins indicates that the odd-shaped virions of poxviruses have evolved from the more common icosahedral virion seen in asfarviruses, iridoviruses, and phycodnaviruses. |
------------------------------KILAN------------------------------------------------------------------------------ # KilAN+ kilAC ----------------------------------------------------- 125404 KILA_BPP1 kilA N 1-128 + 143-266 (kilAC) # kilA N + Bro-aC ---------------------------------------------------- 9964424 AMV110 9964414 AMV100 9964341 AMV027 9964550 AMV236 9964470 AMV156 15078718 CIV006L 231-352 (CIV029) 9634794 FPV124 N1R/p28 gene family prote. + BROC (217-271) 2407299 17K ORF [Heliothis armigera. + BROC. 9964426 AMV112 + BROC 9964338 AMV024 + BROC # kilAN+ Orf11D3 ---------------------------- 9635595 Orf11 [Pseudomonas phage D3] >gi|889... 13559861 unknown [Bacteriophage HK620] >gi|1.. + Orf``D3 C-terminus (105-161) # kilAN + T5orf172 ------------------------------ 15079027 CIV315L + Bro-E Cterminus 96-201 # kilA N (the Cs are distinct and not conserved ----------- 11281012 hypothetical protein NMB0900 + C- nothing 11345968 phage-related protein XF2294 + C-nothing 11290039 hypothetical protein NMA1544 [imported] .... C distinct but not conserved 9634745 ORF FPV075 N1R/p28 gene family prote..... C-distinct but not conserved 9634833 C- distinct but not conserved FPV163 9634829 ORF FPV159 N1R/p28 gene family prote.. C-distinct but not conserved 9634825 ORF FPV155 N1R/p28 gene family prote.... C distinct but not conserved... low complexity 9634906 ORF FPV236 N1R/p28 gene family prote..C distinct but not conserved... low complexity 9634918 ORF FPV248 N1R/p28 gene family prote..--- little extension.. nothing 9634831 ORF FPV161 N1R/p28 gene family prote..----little extension.. nothing 1777419 ORF4 [Fowlpox virus].........little extension.. nothing 9964446 AMV132- C matches low complexity region C -terminal to MSV199-T5orf172 # kilaN + CIV029R/BROC ----------------- 15079025 313L 10-130 (kilAN) + 130-237 (MSV199) 237-353 (CIV029R/BROC) #P63C + kilAN ------------ 9634189 Gp73 [Bacteriophage HK97] >gi|690161.. kilaN inserted into 9632501 (or 4499795) 933W orf12 #kilAN + RING --------------- 9634827 ORF FPV157 N1R/p28 gene family prote.... 2 RINGS or 1 zinc + RING 9634820 ORF FPV150 N1R/p28 gene family prote.. 9633890 gp143R [Rabbit fibroma virus] >gi|46.. 12085126 143R protein [Yaba-like disease vir.. 9633779 m143R [Myxoma virus] >gi|6523998|gb|.. 6682986 Yb-C4R [Yaba monkey tumor .. ----------------------------------MSV199 like-------------------------------------------------------------------------------------------------------------------------- # MSV199like solo ------------ 9631447 MSV199 fragment of MSV198 CIV200R fragment # MSV199like motif + UVRC -------------------------------- 15078859 CIV146R 1-118 (UVRC domain) 143-243 (MSV199like motif) # N-MSV199like motif + CIV029R/BROC like -------------------- 15079179 CIV468L 1-177 + C 177-376 (CIV029R/BROC) 15079099 CIV388R 1-175 + C 226-344 (CIV029) 15078923 CIV211L 1-180 + C 259-381 (CIV029R/BROC) 15078924 CIV212L 30-155 + C 238-360 (CIV029R/BROC) 15078950 CIV238R N+ 86-230 (MSV199like) + C 315-436 (29R/BROC) 15078732 CIV019R N + 136-274 (MSV199like) 15078861 148R _CIV 61-191 (MSV199like???) + C _ 265-330 (414) + BROC (330-414) # MSV199+C Bro-e C terminus -------------------------- 9964508 AMV194 N? + 66-215 (MSV199like) + C Bro-e (252-358 9631448 MSV198 1-155 (MSV199like) + C Bro-e (192-292) 9631453 MSV191 1-120 (MSV199like) + C 9964523 AMV209 N + 72-215 (MSV199like) + C Bro-e 257-356 9964521 AMV207 77-228 + C 270-369 Bro-eC 15079131 CIV420R 1-154 + C Bro-e (234-327) + 9631537 52-144(T5orf172 + MSV021 (MSV199like) 105-250 + -------------------------------T5orf172--------------------------------------------------------------------------------------------------------------------------------- # BRO-e C-terminus Looks like a uvrC nuclease to me ------------------------------------------------- 9631041 Ld-bro-f [Lymantria dispar nucleopol.. 10-129 (solo) 281258 hypothetical protein - phage T5 >gi|579090.. 65-158 probably solo 93750 hypothetical protein 172 - phage T5 65-158 probably solo 7474985 hypothetical protein yeeC - Bacillus subt.. N--nothing + 265-374 14194257 hypothetical pro.. N--nothing + 137-233 (Unidentified bacterium) 8346568| phage P27 N--nothing + 266-376 11345564 hypothetical protein NMB1132, NMB1170 [i.. 2-73 + C low complexity 15079171 460R CIV 4-88 --------------------------------- BRON------------------------------------------------------------------------------------------------------------------------- # P22ANT-N + BroA like N-terminus + P22ARC ------------------------------------------ 9635550 P22-ant 130-207 + -199-272 # Bro-A N + T5orf172 -------------------------- 15079001 CIV 289L N-BroN (1-120) 186-299 Bro-e C 15078913 CIV 201R N-Bro N (1-188) 188-301 Bro-eC 13751084 (AJ309235) Bro-I protein [Bombyx mor.. 1-111 (BRON) + 111-220 9630900 BRO-b [Bombyx mori nuclear polyhedro.. 1-111 (BRON) + 111-218 9630956 BRO-e [Bombyx mori nuclear polyhedro.. 1-111 (BRON) + 111-220 9631082 Ld-bro-k [Lymantria dispar nucleopol.. 1-108 (BRON) + 108-217 9631117 Ld-bro-m [Lymantria dispar nucleopol. . 1-102 (BRON) + 137-233 9631452 ORF MSV194 ALI motif gene family pro.. 1-100 (BRON) + 175-290 9631535 ORF MSV023 ALI motif gene family pro.. ~1-100 (BRON) + 145-257 12597544 Heliocoverpa armigera nucleopo.. 1-145 BRON+ 145-243 9635380 ORF130 [Xestia c-nigrum granulovirus.. 1-84 BRON 84-197 C 9631042 Ld-bro-g [Lymantria dispar nucleopol... 17-100 (Bro-aN) + 100-222 13242588 Esv-1-117 # BroA-N + kilAC --------------------------- 13095813 bIL309 BRON + 137-247(kilAC) 1395130 LL-H _ BRON + 152-258 (kilAC) 1362213 2-139 (BRON) + 139-247(kilAC).. 1251473 prophage CP-933N 1-123 (BRON) + 117-229 (kilAC) 14246624 1-139 (BRON) + 139-252(kilAC) 14251162 BK5-T 1-138 (BRON) + 140-256 (kilAC) 9635686 phiPV83 1-143 (BRON) + 143-256 (kilAC) 1353522 ORF5_r1t 1-137 (BRON) + 139-255 (kilAC) 13622137 putative antirepressor - p... 6-92 + kilAC # BRON+ P63 -------------- >gi|15320633 p63 Bacteriophage Mx8 : Myxococcus xanthus #BroA like N-terminus + P22ARC ------------------------------------------------------------ 12514734 putative antire - 4-122+ 191-264 1175791 HI1418 11-124 + - 137-194 # Bro-aN -------------------------------------------------------------------------------------------------- 9964369 AMV055 [Amsacta moorei entomopoxviru... solo (BRON) 9631040 Ld-bro-e [Lymantria dispar nucleopol... 1-82 (solo) 9631451 ORF MSV195 ALI motif gene family pro... solo (BRON) 9631397 ORF MSV226 hypothetical protein [Mel... 3-95 (solo) 1395127 putative [Bacteriophage LL-H] solo -- truncated? 12697190 putative antirepressor [N... 5-90 + C or probably solo 6599316 Broa-N solo 13623111 hypothetical protein - pha... 8-89 + C -- nothing 7480004 othetical protein SCGD3.15 - Streptomy... 17-120 + C nothing 11349554 othetical protein PA2423 [imported] -... N-nothing 99-251 (bro) + C nothing 11349113 othetical protein PA1153 [imported] -... 1-139 9964576 AMV262 [Amsacta moorei entomopoxviru... 1-100 (BRON) + C nothing 9635312 ORF62 [Xestia c-nigrum granulovirus]... 28-121 + C nothing AMV055 #BRON duplication ------------------------ 11068085 PxORF82 peptide [Plutella xylostell... 120-200(BRON), 250-325 (BRON) 13160526 F274292) unknown [Culex nigripalpus... 36-100(BRON), 149-256 (BRON) + C nothing # Bro-aN + BROC ----- 9799895 hypothetical protein [Antica... 1-112 + C \ 93042 othetical protein ORF2, ptp-region [impo... 1-113 + C | 10442572 38.7 kD-like pr. +295-390 (b | 347406 24 kDa ORF [Autographa califor... 45-139 + C + 147-206 \ 9627755 AcOrf-13 peptide [Autograph +215-326 | 5565846 AcMNPV ORF13 homo. +107-207 | 9627744 baculovirus repeated ORF [Autographa... 1-113 + C 133- | 9629950 unknown [Orgyia pseudotsuga BRON ? +214-316 | 9635364 ORF114 [Xestia c-nigrum granulovirus... N-nothing 211-384 + C..427 9635326 ORF76 [Xestia c-nigrum granulovirus]... N + 143-236 9635409 ORF159 [Xestia c-nigrum granulovirus. Broa N 48-153 + 281-392 (BROC) 9631120 Ld-bro-n [Lymantria dispar nucleopol... 1-116 + C | 9631081 Ld-bro-j [Lymantria dispar nucleopol... 1-113 + C / 9631128 Ld-bro-p [Lymantria dispar nucleopol. BRON 1-134 + 63-178 (BROC) 9630998 Ld-bro-a [Lymantria dispar nucleopol. Broa N 1-102 +216-327 (BROC) 9631113 Ld-bro-l [Lymantria dispar nucleopol. Broa N 1-108 +222-333 (BROC) 9631121 Ld-bro-o [Lymantria dispar nucleopol. Broa N 1-112 +205-316 (BROC) 9630999 Ld-bro-b [Lymantria dispar nucleopol. Broa N (1-113) +202-313 (BROC) 12597545 bro [Heliocoverpa armigera nucleopo. 1-107 BRON + BRON (183-284)+391-502 (BROC)- duplication of BRON 13751087 (AJ309236) Bro-II protein [Bombyx mo. BRON (1-115) +197-306 (BROC) 9630821 AcMNPV orf13 [Bombyx mori n 49-143 (BRON)+ 219-330 | 13751089 Bro-III protein [Bombyx m... 1-114 + C | 9630901 BRO-c [Bombyx mori nuclear polyhedro. BRON +195-304 (BROC) 9630839 BRO-a [Bombyx mori nuclear polyhedro. BRON (1-115) +195-304 (BROC) 9630955 BRO-d=AcMNPV orf2 [Bombyx mori nucle... 1-115 + C | 9635359 ORF109 [Xestia c-nigrum granulovirus. BRON 1-82 +189-296 (BROC) 7672865 bro-a [Spodoptera. BRON( 1-113) +200-309 (BROC) 12597608 38.7kd [Heliocoverpa armig + 292-382 / 15213135 unknown [Epiphyas postvitt... 83 2e-15 9634234 ORF13 38.7kD [Spodoptera exigua nucl... | 9964371 AMV057 210-290 -(CIV029R/BROC) 9964491 AMV177 217-297 - (CIV029R/BROC) 9964489 AMV175 203-283- (CIV029R/BROC) 3510491 orf6Heliolithis BRON + C 190-271 (CIV029R/BROC) *BROC solo ---------------- 9630054 Orgyia pseudotsugata single. C-terminus solo-- NO BRO 15213228 unknown [Epiphyas postvitt... 146 9e-35 NO BRO-JUST THE C-terminal region and even that may be fragmented 9631089 LdOrf-122 peptide [Lymantria + 97-176--solo 2760643 CIV029R and Bro-a C can be unified 15078742 11931724 DpAV4 (59-181) 11931708 DpAV4 (91-194) 11931709 DpAV4 (15-97) Xylella specific family BRON-BRON-BRON + C XF1559 (that has no bro) ------------------------------ 11362500 phage-related protein XF2524 [imported] ... 32-122, 166-253, 283-371 (BRON) + C 11362477 phage-related protein XF0684 [imported] ... 6-96 (BRON) 140-226 (BRON) 256-344 (BRON) + C 11362484 phage-related protein XF1663 [imported] ... 12-120, 130-237 Only 2 Bro domains 11362478 phage-related protein XF0704 [imported] ... 17-104 + C 11362483 phage-related protein XF1645 [imported] ... N + 122-210 + C C XF0704 11362060 hypothetical protein XF2506 [imported] -... N 1-74 XF2129/XF1645 + 189-279 (BRON) + C XF0704 XF0704 + XF2129 = XF1645 and XF2506 XF1559+ #BRON- gp30 like ----------------------- 9633590 P43 [Bacteriophage APSE-1] >gi|61180... 1-96 + C 9630500 gp30 [Bacteriophage N15] >gi|7521545... 7-101 (BRON)+ C # Broa-N -- C synapomorphic to this group ----------------------------------------- 9635310 ORF60 [Xestia c-nigrum granulovirus]... 1-114 + C + C 10442560 Orf60-like proti... 12-110 + C+C 12597590 bro [Heliocoverpa armigera nucleopo... 12-110 + C + C 9635381 ORF131 [Xestia c-nigrum granulovirus... 1-93 + C +C 9631038 Ld-bro-c [Lymantria dispar nucleopol... 1-112 + C +C 9631039 Ld-bro-d [Lymantria dispar nucleopol... 1-112 + C 9631080 Ld-bro-i [Lymantria dispar nucleopol... 1-112 + C missing the middle domain # Broa-NC ----------------------------------------- 9635363 ORF113 [Xestia c-nigrum granulovirus... N + 146-239+ C 14602336 ORF99 similar to XcGV ORF113 [Cydia... 150-240 (BRON) + BRON # Broa-N + C- SinR like HTH ------------------ 13623110 hypothetical protein - pha... N + 88-182 + C - HTH # BRON + Vsr --------------- 15078782 069L [Chilo iridescent virus] >gi|7... 9631450 ORF MSV196 4-75 BRON + C vsr nuclease (101-196) 9631533 ORF MSV026 1-69 (BRON) + C vsr nuclease (101-196) 9631534 ORF MSV024 8-79 + C vsr nuclease 9631444 ORF MSV204 ALI motif gene family pro.1-91 + C vsr nuclease #VSR nuclease ------------- 9631531 ORF MSV028 hypothetical protein [Mel.. (solo VSR). 9964571 AMV257 [Amsacta moorei entomopoxviru...(solo VSR) 9631399 ORF MSV229 leucine rich repeat gene ... -----------------------------------------phi31orf238N---------------------------------------------------------------------------------------------------------------------------------------------------------------- #phi31orf238N + kilAC ------- 2897108 tr thermophilus phageTP-J34 + 112-236 (238) -- N - phi31orf238N 1-111 9632967 Strthe_orf287_Sfi21 - + N--phi31orf238N 46-116 13622110 Spy Mgas_ 116-240 (242) --- + N phi31orf238N 1-116 14247767, 13701788, 8918426, 1370718, 9635199 Sa 132-350 -- + N phi31orf238N 10-124 ORF11_phi ETA_131-246 (250) - -- N phi31orf238N 10-123 5823644 A118_136-260-- N 9-108 N phi31orf238N 1-111 13701788 anti repressor [Staphyloc 10-123 + C kilAC #kilA middle + kilAC Ant1/2 : Originally may have had a RHAdomain N-terminal to it as it seems to have a region C-terminus in RHAat its N (the DELE motif) ------------------ 137444, 15742 kilA middle + 225-321 (kilAC) 12514711 kila middle + kilAC # phi31orf238N domain + shares a C-terminal domain with orf6_BPbIL285 -------------- gi|7239197, 12724417, 13095749, 13487806 Orf238 1-103 + C # phi31orf238N + P22ARC ----------------------------------------- 9632546, 9633476, 1175795 hypothetical protein [Bacteriophage 8-114 + C unknown (191-264)- CP-933N like antipreressor C 11354036 NMA1293 4-128 + C 11138338 wonder if this is truncated shares a small domain wiht phi31orf238N proteins + KilAC #phi31orf238N + phiSLT orf 81a --------- 15024925 Cab 1-98 + C- phiSLT orf 81a (which is a solo) -12719400 ------------------------------------------RHA------------------------------------------------------------------------------------------------------------------------------------------------------------------------------- #RHA+ kilAC ------------------ 9630225 SpbC2 1-109 + 133-244 (kilAC) 13559853 Roi_HK620 23-116 + 143-233 (kilAC) 1197729 Roi_HK022 24-117 + 118-234 (kilAC) -- check extreme C-terminus, 9634208 (RHAidentical) 9632502 Roi_BP-933W 48-118 + 119-235 (kilAC) 4499797: orf14_933W ilAN 48-118 + 119-235 (kilAC) gi:2668765 Roi_H-19B kilA N 23-116 + 102-233 (kilAC) gi:9633432 roi_VT2-Sa kilA N 26-118 + 119-235 (kilAC) gi:13360660 coli kilA N 24-116 + 117-233 (kilAC), 13360660 coli kilA N 24-116 + 117-233 (kilAC) 9634208 RHA+ KilA # RHA domain ------------- 12719399 antirepressor [Staphylococcus aureu 2-108 --probably solo 2120256 rha protein - phage phi-80 >gi|1019108|gb 28-124---probably solo 12722192 unknown [Pasteurella multo 26-125 + C-nothing 13095661 Orf3 [bacteriophage bIL311] >gi|127 14-100 \subfamily 14972611 hypothetical protein [Stre 3-102 / 13095876 anti-repressor [bacteriophage bIL31 8-127 RHA+ a C-terminal region specific to 14972611 and this protein 9633589 P42 [Bacteriophage APSE-1] >gi|61180 12-101 + kilA middle domain #RHA+ P4ASH-N ------------ 421263 hypothetical protein 179 - Shigella flexne 1-106 (P4ASH only part N)+ 106-168 (RHA) #RHA+ Orf11D3 -------------- 1175786 HYPOTHETICAL PROTEIN HI1412 >gi| 10-114 + Orf11D3 114-173 9635533 unknown protein [Enterobacteria phag 39-139 + Orf11D3 144-197 -------------------------------------------------------------------------------------------------------------------------------------- -----------------------------------------ALIGNMENTS----------------------------------------------------------------------------------------------- 1. kilA N-terminus PHD Sec. Structure -EEEEE--------EEE-------------HHHHHHHHH--------HHHHH----------------------------------------EEEEEE------------HHHHHHHHHHHHH------HH--HHHHHHHHHHHHHH-- NMA1544_Nm_11290039 LIPRVESG---EIIPQRMSD-------GYINATALCKSVG----KSYSDYRQLQSTNHFLNELKAQTG---------------------LSEQQLIQQRIGGEPSL--QGSWVHPYLAINLAQ------WLSPAFAVKVSTWVHEWMSG \ NMB0900_Nm_11281012 NVSVLNFG---NTPVSFRQD-------GFLNATAIASHFG----KLPKDYLKSEQTQQYISALAENLSVRRKIL---------------TEANQIVIVKRGGSE----QGTWLHPKLAIHFAR------WLNPKFAVWCDEQIEILLNG /kilAN solo AMV132_AMV_9964446 NYWCLHIN---DFNLIYNKKL------NLYNASRVCDIYE----KNIHIWLE-ENYDYTIKYLKIKEI---------------------NDHVSIINNNKESSL----NGLYVSEHILLGISI------WISEECYYKCINIILHNHDI-- has a C-terminus which matches the C-terminus of MSV-T5orf172 KILA_BPP1_125404 STTLPVIC---GVEITTDRA-------GRYNLNALHRASGLGAHKAPAQWLRTLSAKQLIEELEKET----------------------MQNCIVSFEGRGG-------GTFAHELLAVEYAG------WISPAFRLKVNQTFIDYRTG | kilAN + kilAC HKBK_BPhk620_13559861 -MKAITLF---NTPIRVDES-------GMICLTDMWKASGKSESESPYHYLRNKQTKEFLAELEKN-----------------------HESVVFTERGVHG-------GTYGGKFVAYDYAA------WLNPGFKYAAYKVLDDYFTG |kilAN + ORF11D3-C Orf11_D3__9635595 NVIPFHYQ---GKPVRFNSD-------GWINATDIAAAHG----MRLDNWLRNKETEAYIEALARHLNTSD------------------SRDLIRGQRGRGG-------GTWLHPKLAVAFAR------WISPDFAVWADLHIDALLRG |kilAN + ORF11D3-C XF2294_Xf_11345968 TTQQLAIN---SLPIR-EQD-------GLYSLNDFHKASGGAVRHRPSEFLRLDKTKALVVELTNSPEFVSSIKGGA------------PHLFVRKEKGRAG-------STFACRELAIAYAS------WISPAFQLKVIRVFLASVVV |C-terminus Gp73_HK97_9634189 NIIPIDFE---GHPMRFSDD-------GWFDATAAADKFN----KEPAQWLRLPETVRYIEALKSRYGNIT------------------YVKTSRARKDRGG-------GTWLHPKLAVRFAR------WLSVDFEIWCDEQIDAIIQG |Fused to p63C -EEEEEEE---EEEEEEE-------------HHHHHHH--------HHHHHHHHHHHHHHHHHHHH-------------------------EEEEEE---------EEEEE---HHHHHHHHH------HHEEEEE----EEEEEEEE- CIV315L_CIV_15079027 NFYYGLFR---DFKLVVDKNT------ECFNATKLCNSGG----KQFRQWTRLEKSKKLMEYYSRRG----------------------SQQMYEIKGDNKDQLVTQTTGTYAPIDFFEDIKR------WIQLPKASSASGVVYVVTTS kilAN + Bro-eC FPV124_FPV_9634794 RFCYIKYD---KFDLIMMKEN------RFINATKLCKLGG----KDFHRWKRLDGSKELMIKVNEMN-EMWKSAPPPPDL---------GGIIIEVNG-SNQYTEYDIAGSYVHQDLIPHIAS------WISPLFALKVSKIISCYVSG \ AMV112_AMV_9964426 SYYYGLFG---DFKLVIDKTT------GCFNATKLCNLSG----KRFRNWIRLDRSKQLLKYMENYRSSYV------------------SVGFYEVKGDNNNKTSKEITGQYVPKEVILDISS------WISVEFYLKCNDIIINYYNN | AMV024_AMV__9964338 SYYYGLFG---DFKLVIDKTT------GCFNATKLCNLGG----KKFKQWKRLEKSQELIDYIKNNRGGDP------------------HPGFYETKGDNKDENVKKITGCYVPKEVILDISS------WISVEFYLKCNDIIINYYNT | CIV006L_CIV_15078718 TFYKGLFG---DFPLIVDKKT------GCFNATKLCVLGG----KRFVDWNKTLRSKKLIQYYETRCDIKT------------------ESLLYEIKGDNNDEITKQITGTYLPKEFILDIAS------WISVEFYDKCNNIIINYFVN | CIV313L-CIV_15079025 NFYYGLFG---DFKLVVDKNT------ECFNATKLCNSGG----KRFRDWTKLEKSKKLMEYYKGRRDDHRG-----------------GSNFYEVKGDNKDDEVSKTTGQYVKKELILDIAS------WISTEFYDKCNQIVIDFFVV | kilAN + Broa-c AMV110_AMV__9964424 SYYYGLFG---DFKLVIDKTT------GCFNATKLCNLGG----KQYRDWKRLEKSKELIKTLINVRRENS------------------RVWEYNIISNNNHEIHKQYTGYYVSKDLILDIAS------WIAPEFYLKCNDIIINYYNN | AMV100_AMV__9964414 TFYSAHIN---SYQLVIDKKT------GFFNASYVCIKNY----RKINNWLNNKKTIKLIKYYMNLLNNKNNN----------------NNKIKYKIVDKYDNIN----GIYLHPILLNHLLD------WINIKINNKYN--IIDYIIL / FPV161_FPV_9634831 GFLILYYD---SIEIIVMSCN------HFINISALLAKKN----KDFNEWLKIESFREIIDTLDKIN--YDLGQRYCEEPYGASHSSVIIEVKASNLIDDRTA------GFYVHKDLIPYILT------CISIPFSLKVVRVLDTYIGE \ FPV236_FPV_9634906 YFMSMKLL---DVEVVIMRSN------GFVNITRLCNLEG----KDFNDWKQLESSRRLLNTLKDNN--KLHDP-----------------IINIRHTRIKIN------GEYVSQLLLDYVIP------WISPYVATRVSILMRYYRRC | FPV155_FPV_9634825 EFCYIQYS---GFHLVMMISN------CYINASKLCDT------KDFKKWLRLDSSLSLLQEIENTN---FPSEKKFSIKNSK------SVIILEKYYHEEVE------GYYIHPDILPHIVG------WLSPTFAISMSKFINGYISN | C-nothing FPV159_FPV_9634829 KFSYIIYD---KIKIIIMKSN------NYVNATRLCELRG----RKFTNWKKLSESKILVDNVKKIN---DKTNQLKTDMI--------IYVKDIDHKGRDTC------GYYVHQDLVSSISN------WISPLFAVKVNKIINYYICN | FPV248_FPV_9634918 NFCKLSYE---DIEIIMMKEN------EYINATRLCSSRG----RDILDWMSKESSVELINELDRIN---RSCNDYYDY----------RGIVLNVVSDSETS------ELYVHRDLILHISH------WISPLFSLKVVKFINSYIQD / FPV163_FPV_9634833 HFCYIKYD---GITLTMMKDN------GYINATQLCMLGN----KDFKEWIKLDHSIELIKEIEKNI--NKETTKYVKAVISV------RSDYYNSETSNDIK------GFYIHGNIMPHICA------WISSKFAIKVSNIVHNYLND FPV075_FPV_9634745 NFCFINYA---NIEVIMLKYN------GYINATKICDLGN----KNFRQWCRLESSKKLIKTLNYKN---GIYNKAVLE----------IGLASNSAYKYELV------GTYVHIDLVPHIIC------WVFPSIALNFSKILNSYLSN FPV157_FPV_9634827 SFDSIKYR---DIKVIIMKNN------GYVNCSKLCKMRN----KYFSRWLRLSTSKALLDIYNNKS---VDNA---------------IVKVYGKGKKLIIT------GFYLKQNMIRYVIE------WIGDDFTNDIYKMINFYNAL \ + RING FPV150_FPV_9634820 EYRVIEDN---GFSIILLKHT------EYINVTKLCKIHN----KEFYRWKRLISAGRIIETVSRDISNQGFESPL---------------VYVNRKGNKEFY------GFYAHPQLALYIAK------WISEDIFNKIKHLINSYTIS | p28_Ectro_1360841 LQYIDEPN---DIRLPVCIIRNINNITYFINITKINPDLA----NQFRAWKKRIAGRDYMTNLSRDT--GIQQSKL-------------TETIRNCQKNRNIY------GLYIHYNLVINVVI-----DWITDVIVQSILRGLVNWYIA | D6R_VAR_885801 LQYIDEPN---DIRLTVCIIQNINNITYYINITKINPHLA----NQFRAWKKRIAGRDYMTNLSRDT--GIQQSNL-------------TETIRNCQKNRNIY------GLYIHYNLVINVVI-----DWITDVIVQSILRGLVNWYID | YH22_VV_140731 LQYIDEPN---DIRLTVCIIRNINNITYYINITKINTHLA----NQFRAWKKRIAGRDYMTNLSRDT--GIQQSKL-------------TETIRNCQKNRNIY------GLYIHYNLVINVVI-----DWITDVIVQSILRGLVNWYIA -- no RING- truncated PHDSec Str -EEEEE------EEEEEE----------EEEEHHHHHHHH----HHHHHHHHHHHHHHHHHHHHHH----------------------EEEEEEEE----EEE------EEEHHHHHHHHHHHHHH---HHHHHHHHHHHHHHHHHHH 1MB1_Sc_3402004 VDVYEFIH---STGSIMKRKK-----DDWVNATHILKA------ANFAKAKRTR----ILEKEVLK----------------------ETHEKV--QGGFGKY-----QGTWVPLNIAKQLAEKFSVYDQLKPLFDFTQTDGSASP MBP1_Kla_729994 VDVYEFIH---PTGSIMKRKA-----DNWVNATHILKA------AKFPKAKRTR----ILEKEVIT----------------------DTHEKV--QGGFGKY-----QGTWIPLELASKLAEKFEVLDELKPLFDFTQQEGSASP PCT1_Sp_11346262 VEVYECFI---KGVSVMRRRR-----DSWLNATQILKV------ADFDKPQRTR----VLERQVQI----------------------GAHEKV--QGGYGKY-----QGTWVPFQRGVDLATKYKVDGIMSPILSLDIDEGKAIA SCT1_Sp_464742 VEVFEYTI---NGFPLMKRCH-----DNWLNATQILKI------AELDKPRRTR----ILEKFAQK----------------------GLHEKI--QGGCGKY-----QGTWVPSERAVELAHEYNVFDLIQPLIEYS---GSAFM SWI4_Sc_666106 TDVYECYIRGFETKIVMRRTK-----DDWINITQVFKI------AQFSKTKRTK----ILEKESND----------------------MQHEKV--QGGYGRF-----QGTWIPLDSAKFLVNKYEIIDPVVNSILTFQFDPNNPP CC10_Sp_115906 MKYMELSC---GDNVALRRCP-----DSYFNISQILRL------AGTSSSENAK----ELDDIIES----------------------GDYENV--DSKHPQI-----DGVWVPYDRAISIAKRYGVYEILQPLISFNLDLFPKFS EFG1_Cal_1169477 TLCYQVDA---NNVSVVRRAD-----NNMINGTKLLNV------AQMTRGRRDG----ILKSE-------------------------KVRHVV--KIGSMHL-----KGVWIPFERALAMAQREQIVDMLYPLFVRDIKRVIQTG Sok2_Sc__6323658 TLCYQVEA---NGISVVRRAD-----NDMVNGTKLLNV------TKMTRGRRDG----ILKAE-------------------------KIRHVV--KIGSMHL-----KGVWIPFERALAIAQREKIADYLYPLFIRDIQSVLKQN Phd1-_Sc_6322808 TICYQVEA---NGISVVRRAD-----NNMINGTKLLNV------TKMTRGRRDG----ILRSE-------------------------KVREVV--KIGSMHL-----KGVWIPFERAYILAQREQILDHLYPLFVKDIESIVDAR MGF-1_Yaly_5139660 TLCFQVEA---RGICVARRED-----NDMINGTKLLNV------AGMTRGRRDG----ILKGE-------------------------KLRHVV--KAGAMHL-----KGVWIPYDRALEFANKEKIIDLLFPLFVRDIKSVLYHP AM1_Nc_1517923 SLCFQVEA---RGICVARRED-----NAMINGTKLLNV------AGMTRGRRDG----ILKSE-------------------------KVRHVV--KIGPMHL-----KGVWIPFERALDFANKEKITELLYPLFVHNIGALLYHP StuA_Eni_549002 SLCYQVEA---KGVCVARRED-----NGMINGTKLLNV------AGMTRGRRDG----ILKSE-------------------------KVRNVV--KIGPMHL-----KGVWIPFDRALEFANKEKITDLLYPLFVQHISNLLYHP SPBC19C7_10_Sp_7491471 LKCTNPES--KVPHFLMRMAK-----DSSISATSMFRS------AFPKATQEEE----DLEMRW------------------------IRDNLN--PIEDKRV-----AGLWVPPADALALAKDYSMTPFINALLEASSTPSTYAT G6G8_4_Nc__12802359 SGIFKSSP---PSYFLMRRSQ-----DGYISATGMFKA------TFPYASQEEE----EAERKY------------------------IKSIPT--TSSEETA-----GNVWIPPEQALILAEEYQITPWIRALLDPSDIAVTATD 1MB1 Sec structure EEEEEEEE-----EEEEEEE---------EEEHHHHHH----------HHHHHH----HHHHH----------------------------EEE---------------EEEE-HHHHHHHHHH---HH---HH------------ 2. kilAC-terminus roi1 roi2 --------------- * * PHD Sec Str. ------------HHH-HHH-----EEEHHHHHHHHH----HHHHHHHHHHHH--EEE-----------HHHH---EEEEEEEEEE-----EEEEEEEE----HHHHHHHHHHH------ SA1801_SaN315_13701788 LETKIERDKPKIVFADAVATTKTSILVGELAKIIKQNGINIGQRRLFEWLRQNGFLIKRKGVDYNMPTQYSMERELFEIKETSITHSDGHTSISKTPKVTGKGQQYFVNKFLGEKQTS \ ORF11_BPETA_8918426 LETKIERDKPKIVFADAVATTKTSILVGELAKIIKQNGINIGQRRLFEWLRQNGFLIKRKGVDYNMPTQYSMERELFEIKETSITHSDGHTSISKTPKVTGKGQQYFVNKFLGETQTT | orf238_BPTP-J34_2897108 LEAQIEADRPKVLFADAVSASKSSCLIGELAKILKQNGINIGQNKLFQWLRSNGYLISRRGDSWNQPTQKSMQLGLFELKKTNINHADGHTTTNTTTKVTGKGQQYFINKFLNQERLT | orf287_BPSfi21_9632967 LEAQIEADRPKVLFADAVSASKSSCLIGELAKILKQNGINIGRNKLFQWLRSNGYLISRRGDSWNQPTQKSMQLGLFELKKTNINHADGHTTTNTTTKVTGKGQQYFINKFLNQERLT | phi31orf238N + kilAC SPy0946_Spy_13622110 LEAQIEADRPKVLFADAVSASHTSILVGELAKLLKQNGVNIGATRLFTWLRKHGYLIKRNGRDWNMPTQKSVELGLIRVKETSITHSDGHITVSKTPLVTGKGQQYFINKFLNQEYLP | ORF42_A118_5823644 ALNQIEEQKPKVIFADAVQTSENTVLVKDLATILKQNGLDIGQNRLFEWLRGSGYLLN-KGTYYNKPSQKAMNLGLFEQKTHIHTDRNGLMVTTYTPRVTGKGQVYLLNKLLEEHGLV / ORF169a_BPmv4_11138338 LTLQLEESNKKASYLDIILGTPDLLATTQIAAD-----YGYSARTFNQLLKEVGIQH--KVNGQWILYKAYMGKGYVQSKSFAFKDRKGHDRSKPSTYWTQKGRKLIYDVLKENGTLP |shares a small domain with phi31orf238N KILA_BPP1_125404 LEQKMLMDAPKVEFAERVATASG-VLIGNYAKV-----LGLGQNYLFTWLRDNGILIA-TGERRNVPKQEYISRGYFTLKETVIDTSNG-SRISFTTRITGKGQQWLMKRLLDAGVLV \+kilAN yoqD_BPSPBC2_9630225 LQEQLTLAEPKVEKYDRFLNTDGLMKIGQVAKAIGI--KGMGQNNLFRFLRENKVLI--DGTNKNAPYQKYVERGFFQVKTQETS-----VGIKTITLVTPKGADFIVDLLKKHGHKR \ ROI_HK022_1197729 LENQLAIAAPKVEFADRVGEASG-ILIGNFAKV-----VGIGPNKLFAWMRDHKILIA-SGSRRNVPMQEYMDRGYFTVKETAVNTNHG-IQISFTTKITGRGQQWLTRKFVDNGMLK | orf14_BP933W_4499797 LEKQLALAAPKVEFADRVGEASG-ILIGNFAKVV-----GIGPNKLFAWMRDHKILIA-SGARRNVPMQEYMDRGYFTVKETAVNTNHG-IQISFTTKITGRGQQWLTRKLLDNGMLK | ROI_BP933W_9632502 LEKQLALAAPKVEFADRVGEASG-ILIGNFAKV-----VGIGPNKLFAWMRDHKILIA-SGARRNVPMQEYMDRGYFTVKETAVNTNHG-IQISFTTKITGRGQQWLTRKLLDNGMLK | RHA+ KilAC ROI_BPH-19B_2668765 LEKQLALAAPKVEFADRVGEASG-ILIGNFAKVV-----GIGPNKLFAWMRDHKILIA-SGARRNVPMQEYMDRGYFTVKETAVNTNHG-IQISFTTKITGRGQQWLTRKLLDNGMLK | ROI_BPHK97_9634208 LENQLAIAAPKVEFADRVGEASG-ILIGNFAKV-----VGIGPNKLFAWMRDHKILIA-SGARRNVPMQEYMERGYFTVKETAVNTNHG-IQISFTTKITGRGQQWLTRKLLDNGMLK | ROI_BPHK620_13559853 LENQLAIAAPKVEFADRVGEASG-ILIGNFAKV-----VGIGPNKLFAWMRDHKILIA-SGSRRNVPMQEYMERGYFTVKETAVNTNHG-IQISFTTKITGRGQQWLTRKLLDNGMLK / Ant1_BPP1_137444 LEQQLVAAAPKVDFADRVSVANG-ILIGNFAKV-----VGLKQNALFSWLRQNGILMA-FGARKNVPRQQYINAGYFTVKEVVLDDENG-YQIRLTPN-------------------- \ ANT2_BPP1_15742 LEQQLVAAAPKVDFADRVSVANG-ILIGNFAKVV-----GLKQNALFSWLRQNGILMA-FGARKNVPRQQYINAGYFTVKEVVLDDENG-YQIRLTPN-------------------- | truncated? kila middle + kilAC Z1797_Ec_12514711 LENQLAIAAPKAEFVDNYVEASGLMGFREVAKLL-----GIKETDFRLFLLENGIMYR--LAGKMTPYSHHLDAGRFSVKTGEA----GNGHAFTQVKFTPKGVQWIAGLLAAWRATA / ORF23_BPRLT_1353540 SITYVPIEK-K-----NIILSNQEISYSEFIELLELNNIKMSKIMFLKFMRDRRITIDEKGKFYNFPTAFSIEMGIMLLSSTTKENVQ-----KYIPKITIEGQKYFIEKFHYMIEDK | kilAC solo ORF5_BPRLT_1353522 LNIELAAATEKTTYLDLILESPDDILITQIAQD-----YGFSAVKFNRILNELRIQR--KVNKQWVLYSRYMGKGYIGSRTQNYVDSKGQERTSITTTWKQKGRKFLYETLKKHGYLP \ ORF38_BPBK5-T_14251162 LNLELAAATEKTTYLDLILEIPDDILITQIAQD-----YGFSAVKLNRILNELRIQR--KVNKQWVLYSRYMGKGYIGSRTQNYVDSKGQERTSITTTWKQKGRKFLYETLKKHGYLP | SAV0855_Sa_14246624 LQQEIGELKPKADYVDEILKSTGTLATTQIAADY-----GISAQKLNKLLHEARLQRK-VNKQWVLYSEHM-GKSYTDSDTITIVRSDGREDTVLQTRWTQKGRLKIHEIMTEFGYEA | + Broa-N orf8_BPbIL309_13095813 LAVENQIMQPKAQYFDDLVERNLLTSFRDTAKML-----KVGQKQLIDWLLENKYIYR-DKKNKLMPYAQY-NNDLFEIKESKGATN---SWKGAQTLITPVGRETFNLLLN-EYKAS | ORF291_BPLL-H_1395130 AEQKLSEAKPKLDYVDKILASKKTILTTHLATDY-----GCSAVAFNRMLCDKKIQRK-VRDTYVLYSQYQ-GHGWTHTFARAIKTKHG-QEIKEQMEWTQKGNIGLYELLKDRFGLL | orf9_BPphiPV83_9635686 LQQQVEVNKPKVLFADSVAGSDNSILVGELAKILKQNGVDIGQNRLFKWLRNNGYLIKKSGESYNLPTQKSMDLKILDIKKRIINNPDGSSKVSRTPKVTGKGQQYFVNKFLGETQTT | SPy0980_Strpy_13622137 LSVENMVMKPKADYFDDLVDRNLLTSFRETAKQL-----KVKERRFIQFLLDKKYVYR-DKKGKLMPFADK-NNGLFEVKESVNEKTN---WAGTQTLITPKGRETFRLLFI------ / onsensus/85% Lp.bl.....Kh.ahD.l..sp..h.h.phAp.......sh....h..hhpp..hbb......b....p.....shhp.+p....p.p.......ps.hp.cG...h...h....... 3. ORF11CD3 -------------- PHD Sec. Str. ---HHHHHHHHHHHHHH-HHHHHHHHHHHHHHHHHH-HHHHHHHHHHHHHHHHH---- HI1412_Hi_1175786 GYSLMHKYNELCIEHKAKKAFASLCGKGLREW-KGDKPVLEATLKLFEDKMQIELPIK |+ RHA orf201_BPP22_9635533 SESYEAERNAIMLEYMKEKDVASMSGRLLNRWGKIKKPQLLARIGRLEQHGQTVIPGL |+ RHA orf11_BPD3_9635595 ELTEKQAFDRACKQLEDGRQLASLHGKGLADW-KFKKPMLEHRVDEMRDRLQMVLGLE |+ kilAN hkbK_HK620_13559861 RNSLSAQLNMKCHEFDQKKDMASFCGQGLAAW-RYTKPVLVAEINSLANQLQITIPGL |+ kilAN ANT_BP7888_10799917 RMSVMEELNQACADMKRDKNIASVFATGLNEW-KQVKSAHVSKIRTLINEANLLIDFV | + P22 ANT-N- near identical or identical, 9632512, 13361651, 12516389 ANT_BP933W_9632512 KMSVMEELNQACADMKRDKNIASVFATGLNEW-KQVKAAHVSKIRTLVNEANMLIDFV || + P22 ANT-N- near identical or identical, 9632512, 13361651, 12516389 consensus/100% ..o....ts..h.p....+.hASh.sp.L..W.+..Ks....pl..h.pp.p..ls.. 4. P22 AR-N ------------------ ANT_BP7888_10799917 MNMMTVPFHGDSLYVVNHNGEPYVPMKPVVAGMGLAWQSQLAKL-RQRFASTITEIVMVAEDGKRRNMVSLPLRKLAGWLQTINPNKVKPEIRGKVIQYQEECDDVLYEYWTKGFVVNPR|P22ANT-N + ORF11D3C ANT_BPP22_9635550 VNTSYVPFNGQHVLTAMVAGVAYVAMKPVVDNIGLSWSSQVQKLLKMKDKFNYVDIDMVAGDMKKRLMGCIPLKKLNGWLFSINPEKVRADIRDKLIKYQEECFTVLYDYWTKGKAENPR| P22 ANT-N + Bro-AN+ P22ANTC 5. RHA roi3 ---------------- * PHD Sec. Str. ----------------------EE--HHHHHHHHHHHHHHHHHHHHHHHH---------EEE----------------------EEEEE--------EEEEEEEE----EEEEEEEEE----HHHHHHHHHHHHHHHHHH-- Orf3_BPbIL311_13095661 MKGLAFLTSP------DLSKAEVVTNHVVIAEYAGIERKSVRRLINNHKNDF------ENFGRL----------------RF-EITTLPDSR---GQKVKIYQLNRNQAMLMITYLDNTEVVRNFKIALVKRFDEMEKELYA \ SP1134_Spn__14972611 M-ELVY----------MDGKKEPYTTSEIIAECAEVQHHTITRLIRENKADF------EELGIL----------------GF-K-IHKLDTR---GQPKKSYILNEQQATFLITYLKNTETVRQFKLNLVKAFFEMREELS- / orf14_bIL310_13095876 MNEITV------SLDVIIKNKNVIVSSLSVAKAFDRQHSHILRSIEDIKRDWDSLIQSKNGLNRNIIPLKSQGNKQVIFTDYFKESEYIAEN---GRLVKFFEMNRNGFMLLANSFNGKRIL-PIKLAFIERFDELE----- | shares a C-terminal domain with 14972611 PM1774_Pm__12722192 LQGAESAVNNAVFPKVFHKETVAMTDSLKVAHYFGKRHDNLLNTIKNLGC-------SDEFRLL----------------NF-KESYYLNEQ---NKKQPMFYMTQDGFTLLVMGFTGKKAM-QFKEQYIKEFNEMKKRLAT |solo RHA_BPphi-80_2120256 MNNPSVIPAFDFREMVTTLDNKIITTSLKVADYFGKRHKDVLRAIRNLKC-------SDDFTQR----------------NF-APIDFIDKN---GDVQPMYNITRDGCMMLVMGFTGKTAA-AVKECYINAFNWMAEQLN- |solo orf179_Sf_421263 MATILTLSHP----DATIENGRAVTTSVAVAEFFRKMHKNVIQKIETLEC-------SPEFNRL----------------NF-KPVTYTDAK---AKNAQCTKSPKTASFSW------------------------------ + ASH-N orf182_BPphiSLT_12719399 MQAL----------QIVEQNETHYVDSREVAEMIGKRHDNLVRDIKGYIKVLED---SSKLSSH----------------NFFEESTYVNSQ---NKVQPCYLLTKKGCDIVANKMTGSKGI-LFTATYVDAFHKMDEYIKQ |solo ORF201_9BPP22_9635533 MNELIANHDFDFRQLVTAAEGQPVTDTFQIAKAFGKRHADVLRALKNCHC-------SEDFRRA----------------HF-CVSEKINNLGIFDKKQIYYRMDFSGFVMLVMGFNGAKAD-AVKEAYINAFNWMSAELR- \ + ORF11D3-C P42_BPAPSE-1__9633589 MQNLIT-------------FQSLTMSSLEIAELVNKRHDNVKRTIETLAK-------SEIIQLP----------------QS-EKVENKQSNSPNR-FTEVFIFEGEQGKRDSIIVVAQLC-PEFTACLVDRWQELEQKLNT|+ kilA middle yoqD_BPSPBc2_9630225 -MESYL--------TVIEQNGQLLVDSREVAEMVGKRHTDLLRSIDGYVAILL----NAKLRS----------------VEFFLESTYKDAT---GRSLKHFHLTRKGCDMVANKMTGAKGV-LFTAQYVSKFEEMEKALKA \ hkbC_BPHK620_13559853 MNELIN-------------GNAIKMTSIEIAELVGKRHDNVKRTIETLAK-------NGVIRLP----------------QI-EVSERINNLGFNV-QYEHYVFEGEQGKRDSIVVVAQLS-PEFTARLVDRWRELEETAVN | Roi_BPH-19B_2668765 MNELIN-------------GNAIKMTSIEIAELVGKRHDNVKRTIETLAK-------NGVIRLP----------------QI-EVSERINNLGFNV-QYEHYVFEGEQGKRDSIVVVAQLS-PEFTARLVDRWRELEGATAK | ROI_BPVT2-Sa_9633432 MNELIN-------------SNAIKMTSIEIAELVGSRHDKVKQSIERLAV-------RGVIRNP----------------PM-VVFEKINNLGLLR-GVEAYVFEGEQGKRDSIIVVAQLS-PEFTARLVDRWRELEGATAK | +kilAC ROI_BP933W_9632502 MNELIN-------------SNAIKMTSIEIAELVGSQHGNVRISIERLAK-------RGVIQLP----------------SM-QKVENKQTISPNK-FTSVYIFEGEQGKRGSIIVVAQLS-PEFTARLVDRWRELEGATAK | Roi_BPHK022_1197729 MNELIN-------------GNAIKMTSIEIAELVESRHSNVKVSIDRLVK-------RGVIKPP----------------AL-QHTNIINDLGVITGKRDFYVFEGEQGKRDSIIVVAQLS-PEFTARLVDRWRELEEAAVN | ROI_BPHK97__9634208 MNELIN-------------GNAIKMTSIEIAELVESRHSNVKVSIDRLVK-------RGVIKPP----------------AL-QHTNIINDLGVITGKRDFYVFEGEKGKRDSIIVVAQLS-PEFTARLVDRWRELEEAAV- / consensus/100% .........................sp..lAchh..b+.pl...lp.......................................................a.bp.p...b....h.s......hp..blp.a..b...... http://www.bmm.icnet.uk/servers/3dpssm/output/1f3d9c24585b2b85.job_summary.html 6. orf6N ------------- PHD Sec. Str. -----EE-----EEEEHHHHHHH----HHHHHHH------------EEEEE----HHHHH--------EEE-----------------------EEEE---HHHHH--- N.orf6_BPbIL285_13095686 MNELQITELNGQRVLTTQQIAEGYGTDSASITKNFNNNKSRFKEGKHFFLLQGADLKEFK---NNIQNLDV----------------VGNRAPKLYLWTEKGALLHAKS \ +ORF6C N.ORF6_BPTP901-1_13786537 MNELQITELNGQRVLTTQQIAEGYGTDSASITKNFNNNKSRFKEGKHFFLLQGADLKEFK---NNIQNLDV----------------VGNRAPKLYLWTEKGALLHAKS / orf13_GMSE-1_12276103 NTQLPVIEYQGQRVITTELLAQGYGAEVKSIHMNFTRNKSRFEETKHYFLLQGEELKAFI---NYPTNCGL----------------VDKRSPSLVLWTGRGS------ \ H0107_Ec_7649857 VETLSPITHNQIPVITTELLAQLYGTEPVRIRQNHHENKVRFVEGKHFFKVVGNDLKELRVALNYSQNLRVTLSNSQNLQPSLRGLQISPKARSLILWTERGAARHAKR /solo ANT_BPVT2-Sa_9633431 VETLSPITHNQIPVITTELLAQLYGTEPVRIRQNHHENKVRFVEGKHFFKVVGNDLKELRVALNYSQNLRVTLSNSQNLQPSLRGLQISPKARSLILWTERGAARHAKM | + P22ARC 7. ORF6C : the proteins are almost identical but at least two of them are fusedto orf6N ------------- PHD Sec. Str. -------HHHHHHHHHH------HHHHHHHHHHHHHHHH-----HHHHHHHHHHHHEEEEE----HHHHHHHHHHHHHHHHHHHHHH---EEEEE------HHHHHHHHHHHH-----EEEEE------------ C.orf6_BPbIL285_13095686 EKQLPQTPEQQIALLAQGNVNLNKKVEQIENSVLDLTDRFGLPSNKAKVLQKKVASKVYMFTGGKYSNAHKKLGAKVFREFYKDLNNRFDVVKYSDIPLSRYDEATEYLDMWQPSFNTTLEIRGLNSQTSFDFEE \ + orf6N C.ORF6_BPTP901-1_13786537 EKQLPQTPEQQIALLAQGNVNLNKKVEQIENSVLDLTDRFGLPSNKAKVLQKKVASKVYMFTGGKYSNAHKKLGAKVFREFYKDLNNRFDVVKYSDIPLSRYDEATEYLDMWQPSFNTTLEIRGLNSQTSFDFEE / ORF55_BPpi3_12724417 QQLLPQTPEQQIALLAQGNVNLNKKVEQIENSVLDLTDRFGLPSNKAKVLQKKVASKVYMFTGGKYSNAHKKLGAKVFREFYKDLNNRFDVVKYSDIPLSRYDEATEYLDMWQPSFNTTLEIRGLNSQTSFDFEA \ ORF6_BPTuc2009_13487806 QQLLPQTPEQQIALLARGNVNLNKKVERIENSVLDLTDRFGLPSNKAKVLQKKVASKVYMFTGGKYSNAHKKLGAKVFREFYKDLNNRFDVVKYSDIPLSRYDEATEYLDMWQPSFNTTLEIRGLNSQTSFDFEE | + phi31orf238N orf6_BPbIL286_13095749 QHLLPQTPEQQIALLAQGNVNLNKKVEQIENSVLDLTDRFGLPSNKAKVLQKKVASKVYMFTGGKYSNAHKKLGAKVFREFYKDLNNRFDVVKYSDIPLSRYDEATEYLDMWQPSFNTTLEIRGLNSQTSFDFEA | orf238_BPphi31.1_7239197 QQLLPQTPEQQIALLAQGNVNLNKKVEQIENSVLDLTVRFGLPSNKAKVLQKKVASKVYMFTGGKYSNAHKKLGAKVFREFYKDLNNRFDVVKYSDIPLSRYDEATEYLDMWQPSFNTMLEIRGLNSQTSLSNYQ / 8. phi31orf238N --------------------- PHD Sec. Str. --EEEEEEE------EE--HHHHHH------HHHHHHHHHHH----------EEEEEEEE-----------------EEEEEHHHHHHHHHHH----HHHHHHHHHHHHH-- orf238_BPphi31_1_7239197 MNQLITITQNENNEQVVSGRELHQFLGV-KTRYNDWFED-MVKYG-FTENVDFIGFTEKRV-KPQG-----GRPSVDHALKLDMAKEISMIQRNEKGKQARQYFIEVEKELK\ + ORF6C orf6_BPTuc2009_13487806 MNQLITITQNENNDQVVSGRELHEFLGV-KTRYNDWFED-MVKYG-FTENVDFIGFTEKRV-KPQG-----GRPSVDHALKLDMAKEISMIQRNEKGKQARQYFIEVEKELK | orf6_BPbIL286_13095749 MNQLITITQNENNDQVVSGRELHEFLDI-TERYSTWFER-MLKYG-FVENIDFVGC--KVF-NTLA-----KQELQDHALKIDMAKEISMIQRNEKGKQARQYFIEVEKELK | P55_BPpi3_12724417 MNQLITITQNENNDQVVSGRELHEFLDI-TERYSTWFER-MLKYG-FVENIDFVGC--KVF-NTLA-----KQELQDHALKIDMAKEISMIQRNEKGKQARQYFIEVEKELK / orf238_BPTP-J34_2897108 MNELINITLNENQEPVVSGRQLHKALGV-KTAYKDWFPR-MTEYG-FTDGEDFSSFLSKSTG---------GRPSQDHIIKLDMAKEIAMIQRTDKGKEVRQYFIQVEKDFN \ + kilAC SPy0946_Spy_13622110 MNQLINVTLNENQEPVVSGRDLHKVLEI-KTQYTKWLER-MSEYG-FVENEDFMAISQKRL-TAQGN----QTEYTDHVLKLDMAKEIAMLQRNEKSKEVRKYFIQVEKDFN | SA1801_Sa_13701788 IGEMFNIQEKENGEIAISGRELHQALEV-KTPYKKWFER-MSDYG-FEENIDYIVTDIFVH-NPLGG----RQNQTDHALTLDTAKEIAMIQRSEPGKRARQYFIQVEKAWN | ORF11_BPphiETA_8918426 IGEMFNIQEKENGEIAISGRELHQALEV-KTAYKDWFPR-MLKYG-FEENTDYTAIAQKRA-TAQGN----MTHYIDHALTLDTAKEIAMIQRSEPGKRARQYFIQVEKAWN | orf34_BPphiPVL_9635199 IGEMFNIQEKENGEIAISARELYKALEV-KKRFSAWAEI-NLKH--FKENRDFTSVLTSTV-VNNGA----VRQLEDYALTLDVAKHVAMMSGTEKGFDFREYFIQVEKAWN | ORF42_BPA118_5823644 ANEMLPVLENEKGEKFVNARTLHEKLMT-TTKFADWIKRRIRQYG-FVENEDFFSLLKNEK-RAIG-----GTTSIDYIFTLDSGKELAMVENTEQGRAIRKYFIEVEKQAR | SAV1994_Sa_14247767 IGEMFNIQEKENGEIAISGRELHQALEV-STRYDKWFER-MTEYG-FENGIDFISQVEKVH-GQKRAR---TYEQVNHILTLDTAKEIAMIQRSEPGKRARQYFIQVEKAWN | orf287_BPsfi21_9632967 MNELINVTLDKNNEPIVSARQLHKTLEV-KTRFSQWVEQ-NFKI--FKENEDFSSVVTTTQQNQYGG----TKELQDYAVTIRMAEHLAMMSKTNKGHEVREYFIKVEKDFN / L0142_BP933W_9632546 RIPVFNGTIANETTLLVNARDLHTFLGV-GKRFASWITERIEEYG-FVENQDYIAISQKREIGY-------GRGKKDYHLTLDTAKETAMVERNEKGRQIRRYFIECEKKLR \ + P22ARC orf80_BPVT2-Sa_9633476 LIPVFNGTIANETTLLVNARDLHTFLGV-GKRFASWITERIEEYG-FVENQDYIAISQKREIGY-------GRGKKDYHLTLDTAKETAMVERNEKGRQIRRYFIECEKKLR | HI1422_Hi_1175795 LIPVFNGLIQNQPVQLCNARELHAFVES-KQQYTDWIKNRINEYG-FIQDEDYLVITERTN----------GRPRKEYHITLDMGKELGMVERNERGRQIRQYFIRCERTLK | NMA1293_Nm_11354036 LIPTVSGQLDNQTQALVDAHDLHKFLGV-ETPFSKWIQRRIEEYG-FTQALDFIGVDKIVR-TEAGFFGQRDKTVQGYYLSLDMAKELCMVERNDKGRQARRYFIEMEKQAK / CAC1945_Cab_15024925 MENLIRIS----DKGLVSAKELYLGLGLNKTNWSRWYPKNIQSNEFFKENIDWIGVRHNDE----------GNETMDFAISIEFAKHIAMMAKTEKSHEYRNYFIKCENKLK + phi-SLT -orf81a consensus/100% ................hss+pLH..l....p.a..W......p...F.ps.Da.s......................a..plc.scc.sM...sp.s..hRpYFIbhEp..p 9. P22ARC -------------------- PHD Sec. Str. -------HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH------EE--HHHHHHHHHHHHHHHHHHHH--- L0142_BP933W_9632546 QAEPQQQFTDEEIILLCYMQLWMEKAQDLSKHLYPIMKELNSSYTNKLYDIAFETIYMVTKNRDVLLREAARLD \ orf80_BPVT2-Sa_9633476 QAEPQQQFTDEEIILLCYMQLWMEKAQDLSKHLYPIMKELNSSYTNKLYDIAFETIYMVTKNRDVLLREAARLD | HI1422_Hi_1175795 PEKFTHEFTEFEIETLVWLLIGHHQMNTLLGQLEKPLDAIGSNLHPAVYSYWKEYGRQYKDALPTIKRLMAPFK | + phi31orf238N HI1418_Hi_1175791 EKKFSFEFTEYELQQLVWLWFAFMRGIVTFQHIEKAFKALGSNMSGDIYGQAYEYLSVYAQQTKS--------- / Z1818_Ec_12514734 QEKSTNELSAKEANSLVWLWDYANRSQALFRELYPALKQIQSNYSGRCYDYGHEFSYVIGMARDVLINHTRDVD | + Bro-a N ANT_BPP22_9635550 QEKKTNDLSAKEANSLVWLWDYANRSQALFRELYPAMRQIQSNYSGKCYDYGHEFSYIIGIARDVLINHTRDVD |P22ANTN + Broa-N + P22ARC ANT_BPVT2-Sa_9633431 QEKKLNGLSAKETDSLVWLWDYANRSQALFRELYPALKLIQSGYSGICHDYGYEFSYIIGRARGVLINHTRDID | +orf6N at N 12514711 1-102 start from here (kilAC) 10. ASH ------------------- ECs2630_Ec_13362098 TQKNRLPCRNRSGYISAAPHKTGAGILNPIQSKAHNRASGFFVRTV------------LPRLFRVRIMAGRTGPTSVGPDSLLSGVENPVRLASP-RFSTL-DGELFL--------------------------------------------------------------------- + C low complexity ash_BPP4_75898 ------------------------------MVWCVVSRADGIPCIL-----------PASAHYAAESMVAQAGQPPGWPVSCEAGILTPVWAIAI-ERENS-GDSVICYSQEAAIMATTLTPSHPEFVFVFAAVRRADRHPRICMLRTVAGDERSARRSLVRDYVLSLAARLPVVEV ASH_BPphi-R73_93828 ------------------------------MVWRVVCRAGMIL--------------FAIACYATESMVAQAGQPPGWPVFFEAGIPTPVWAIAI-ERRNS-GDSSYLLLEGDGLMATTLTPSHPEFVFVFAAVRRADRHPRICMLRTVAGDERSARRSLVRDYVLSLAARLPVVEV Z0337_Ec_12513051 YLYSGLLTVVISRYSFSAVAKSAAGIGVPYNLLATIDAPCVFFYVVAQAQPFSGLWCLCLHHGSIEIMVVRAGQPSGWPVSNKAGYANPVRAATS-EIGVS-GGSNNRYLLEAAIMATILTPSHPQYVFVFAAIRRADTHPRICMLRTVSCDERSARRLLVRDYVLSLSARLPAGEV Orf179_Sf_421263 MLNVAIENQNGWNYSAPAPHKTGAGIATPTMTTAHNRAQAVF---------------LCVKHSHIQIMVGRAGQPQGWPVSVVTGCSNPVRLTTH-EIATS-GGESFKLTIEAAIMATILTLSHPD--------------------------------------------------- + RHA, the other paralogs dont seem to have this.. I wonder if this is an artificial fusion ORF199_Sf_312621 MLNVAIENQNGWNYSASAPHKTGAGRGNPNVTRAHSRAEAVF---------------LCVMHSSIQIMVGCAGQSQDWPGSRVTGISTPVRLTTL-MVVENLGGELINLSLEDAIMATIPALSHPD--------------------------------------------------- gp32_BPN15_9630502 -----------------------------------------------------------------------MGPTSVGPVSSCTGVENPVWATTPIEILNS-GGSTLYKIGM-----------HTMFKFKFAAVVRTDKKSHIHRLSTIASSEREARRQFASRFVLVLSARIPVSEV Reconstruction: ancestor of Ecs2630/Z0337 + P4ASH protein insert = Z0337, Eca2630 secondary loss of C C-terminal loss = ORF199 + RHA= ORF179 gp32, secondary loss of linker region or P4ASH plus extension = ECs2630 11. BRON ----------------------------------- EEEEEEEEEEE-------------------------HHHHHHHHHHHHHH--H-HHHH---------------------------HHHHHHHHHHHHHHHHH--------------------------------EEEEEE---HHHHHHH-HHH-------HHHHHHHHHHHHHHHHHHHHHHHHHH MSV226_MSV_9631397 EKVP----FVI------------------KK-DNETWYNMLDII-KILGYKKKL-HLHA---------------------------SLLNKNNK-KKFYQLLTK---------NTLKNKYFKY------TNVQKNRIFINEVALFYILLSSKKE--------NAII--CKNYVFG--NLFKLENLNL |solo AMV055_AMV_9964369 TFNEI---FKFN------NKSIDVI----GT-LNNPWFCGKDVL-NILEYEKSSFKKIL---------------------------QRLKESYK-KSYREILYKV-----------GDNLSP-----TLNGNNSKIIYINDSGLYTLIMNFNLN--------NAIV--FKEYVI------------- | AMV262_AMV_9964576 TIIKQ--IYISDT-----KEKYNIYIYVDIK-TKLSYFISNDIL-KILTESTDN-IY-----------------------------KYCEKSD---IFKWINIHN---------------------NIPSNISDETILINKNGLNNIISKLNNE--------KSNH--FRKWLND--IDINIIIKNE |solo SCGD3_15_Scoe_7480004 -DVSD---FVYA------ATGARVR-RLTMP-GGSHWFPAADVC-KELGYTTTR-KALL---------------------------DHVPEEHR-DSLETVTG------------SHSLSIPAG-----RKWRRDLQLIDLQGLILLVNACTKP--------ACAP--FKQWVA---EVVETVQREG |Clong but nothing significant ORF1_Nm_12697190 -MNEI---FNFH------GQEVRTL---T-I-DDEPWFVGKDVA-DILGYSKAR-NAIA---------------------------LHVDEEDA-LK-QGI--------------------------PTSGGTQDMLIINESGLYSLILSSKLP--------QARE--FKRWVTS--EVLPAIRKQ- |solo ANT_LcBPA2_6599316 NELQH---FDFK------GRQVRTV---V-V-DNEPMFVGKDIA-EVLGYSKPA-NAVN---------------------------KYVPDKFK-GVTKL---------------------------MTPGGKQDFVVIAEPGLYKLVFKSDMP--------NADE--FTDWVAE--KVLPSIRKHG | fragment of kilAC PA2423_Pa_11349554 -QLAP--HYFFRQ-----QRLLRA----LLI-DDQAWFVLDDFA-RLIEHSQPE-QMLA----------------------------RLDDDQARR--ESL-------------------------RSERGEDQAQWLISESGAYAALIYQQRG--------DGGE--LRRWLSG--EVVPELRSAT |N-nothing + Bro + C nothing PA1153_Pa_11349113 TLLQPS-RFTHH------HRVLRA---VL-L-DEEGWFVLSDLV-RLLGRYLGG-RAPAALCDEAPWPLATAEQRERLFALCHALERHLDTDQWRLAWL---------------------------HDERHGPRQDCLVSESGLYALLWLAAPG--------AARG--LRRWVSG--SVLPRLRSQS SPy2128_Spy_13623111 NKTE-----TWN------GYTIRF----VEH-QGEWWAVLADIA-KALDL-NPK--FIK---------------------------QRLGDE----------------------VVSNNHV-----TDSLGRQQEMLIVNEFGIYETIFSSRKK--------EAKT--FKLWVFE--TIKQLRQSTG | solo ORF5_BPRLT_1353522 KELQN---FNFN------NLPVRTV---L-I-NDEPWFVGKDVA-IAIGYKNFR-DALK---------------------------SHVKDKYK-RESRI---------------------------TTPSGVQSVTVISEPGLYQLAGESKLP--------SAEP--FQDWVYE--EVLPTIRSTE \ orf8_BPbIL309_13095813 KELQN---FT--------NGIFNLD--VKVD-GENILFSAEQAA-KAMGITQVK-NGK----------------------------EYV----K---WERVNSYL-----------PNS---------P--EVGKGSFISEPMVYKLAFKANNA--------VSEK--FTDWLAV--EVLPTIRKHG | orf9_BPphiPV_9635686 QALQT---FNFK------ELPVRTV----EI-ENEPYFVGKDIA-EILGYARTD-NAIR---------------------------NHVDSEDK-LTHQF---------------------------SASGQNRNMIIINESGLYSLIFDASKQSKNEKIRETARK--FKRWVTS--DVLPAIRKHG | + kilAC ORF38_BK5T_14251162 NELQN---FNFN------NLPVRTV---L-I-NDEPWFVGKDVA-IAIGYKNFR-DALK---------------------------SHVKDKYK-RESRI---------------------------TTPSGVQSVTVISEPGLYQLAGESKLP--------SAEP--FQDWVYE--EVLPTIRKHG | ORF291_BPLLH_1395130 NEVQI---FENN------GRGISLP--VKEV-GGQVYFEAEAAA-IGLGITT-----------------------------------EVNGDTY-VRWPRINSYL-------------GFATSG------KKIKKGDWITEPQFYKLAFKASND--------VAEK--FQDWVAS--EVLPSIRKHG | SPy0980_Spy_13622137 MELQV---FTNEQ-----FGEVRT----ATI-NNQIYFNLNDCC-QILELSNPR-KTIE---------------------------R-LNKDG--VTTSDII-------------------------DSLGRTQQANFINESNFYKLVFQSRKP--------EAEK--FADWVTS--EVLPSIRKH- | SAV0855_Sa_14246624 QALQT---FNFE------ELPVRTL---E-V-DGEPYFIGKDVA-DILGYANGR-DALS---------------------------KHVDEDDK-KVLTSRNTTL-----------------------ENLPNRGLTAVNESGLYSLIFSSKLE--------SAKR--FKRWVTS--DVLPAIRKYG / Z1818_Ec_12514734 NDFTI---FKFG------DSEIRVI----NK-CGEPWFVAKDVC-DALALTNSR-KALT---------------------------ALDDDE-KGVTLSY----------------------------TLGGEQNLSIVSESGMYTLVLRCRDA---VNKGSVPHK--FRKWVTA--EVLPSIRKHG | + p22ARC gp30_BPN15_9630500 KALSV---FSFQE-----SHPIRVV---L-V-GGDPWFVALDIC-AALNIANPS-DALR---------------------------K-LDHDEK-LTLGLTEAQ-----------------------KLDRMAREVNVVSESGLYTIILRCRDA---VKQGTTAWR--FRKWVTN--EVLPAIRKNG \N15 gp30 like BRON P43_BPAPSE1_9633590 --MTT---LVFR------NTVLET----ISH-NGQIWFTSSVLA-KALQYSSSK-SV-------------------------------TDLYHK--NSDEFADHM--------SKVVDST-------TLGKSRNKTRIFSLRGAHLIAIFSRTP--------VAKE--FRKWVLD--ILDKQTVNQT / XF2506_Xf_11362060 QSIIP---FDFH------SHAVRVV---M-R-DGNPWFVATDVC-TALGYRNPS-KAVA---------------------------DHLDDDEK-SNQSLGLA-----------------------------GKPVIIISESGLYALVLRSRKP--------EARK--FSKWVTS--EVLPSIRKTC \ M_XF2524_Xf_11362500 NAITP---FQFE------SHAVRT---VVDD-HGEVWFVGKDVA-DVLGYTNHN-KALG---------------------------DHCRGVTK-CYPIL---------------------------DSLGRSRETRIISEPDMLRLIVSSKLP--------AAER--FERWVFE--ELLPTLRKTG | C_XF2524_11362500 NAITP---FQFE------SKDVRIQ---LDE-ASAPWFNANDVC-AVLEFGNPH-QAIE---------------------------SHVDADDL-QKLEVI--------------------------DALGRTQRANHINESGLYALIMGSTKP--------AAKR--FKRWVTS--EVLPTLRKTG | Xylella specific fusion N_XF2524_Xf_11362500 MNAPSEFTLQFE------SHAVRVQ---VDE-AGTPWFNANDIC-TAVELLNPC-AALA---------------------------QHVGARNV--SKRKII-------------------------DTIGRTQRANYLNEPGMLTLLIGSTKE--------AAKR--LRRWLIS--EALPAAAVQK | XF0684_Xf_11362477 NAITP---FQFE------SHAVRT---VVDD-HGEVWFVGKDVA-DVLGYTNHN-KALG---------------------------DHCKGVPK-RYPL----------------------------QTPGGIQEIRIISEPDMLRLIVSSKLP--------AAER--FERWVTS--EVLPTIHKT- | XF1663_Xf_11362484 NAITP---FHFE------SQAVRT---VVDD-HGEVWFVGKDVA-DVLGYANHN-DALG---------------------------AHCKGVAK-RYPLP---------------------------DSLGRLQYFRIISEPDMFRLIAGSKLP--------AAER--FERWVFE--GVLPTIHKTG | XF1645_Xf_11362483 TLPAS---VDFS------DVSLTI----IDH-DGIPYLTAADLA-RALGYKDAS-AVLR---------------------------IYSRHTDE-FTSEM-------------SLTVNLTVKG---FGCGNSEKPVRLFSPRGCHLVAMFARTS--------VAAA--FRRWVLDVLEVLPSIRKTG | XF0704_Xf_11362478 TQLPA--AVCFS------GKSLS----IIDR-DGVPHLTAADLA-RALGYKDTS-AVLR---------------------------IYSRHTDE-FTYQM-------------SLVVNLTVKG---FGSGNSEKPVRLFSPRGCHLVAMFARTS--------VAAA--FRRWVLDVLEVLPSIRKTG / MSV194_MSV_9631452 MDLDN---LIFN------NKKIHIA----IY-ENKPYFKGKDIA-EILEYKDTN-DAIK---------------------------KHVDDDDK-SKYEDLINR--------PGILP----------SLTYNEKNTIYISESGLYSLILSSKKS--------EAKI--FKKWITN--EVLPNIRKHG \ + T5orf172 BROE_BMNV_9630956 VKIGK---FKFG------EDTFTLR-YVLGG-EQPVRFVARDIA-NKLKFKNTK-KAIR---------------------------DHVDGKYK-CTFEQACI----------NISKEKHVKQG---NPLYLQTQTILLDKIGVIQLFMRSKMT--------NAAE--LQNWFYE--HVLPQCTARQ | orf117_ESV_13242588 DILQT---FVFN------NTRHKVV-ILRDE-NDDPLFKASDIG-KILSIKNIH-TSMI---------------------------D-LHDDDK--AIRTA--------------------------STPGGEQKTVFVTEKGVYKLIMRSRKP--------VAKP--FQDWVF---EVLKTIRKRG | BROM_LdNV_9631117 MALTK---VNFV------SGPLEVF-TVQDD-EQENWMAANPFA-ETLKYNNCN-KAIR---------------------------IHVSANNQ-KTLEELNID-----------------KSQ--VLPRNVQAKTKFINMNGVIELLLASQMQ--------QAKE--FRYWMTN--VKFAETSADP | BROK_LDNV_9631082 VKIGQ---FRFG------EDAFTLR-YVLAA-EQPVKFVAKDIA-RSLKYEKPA-NAIA---------------------------KHVDDKYK-SAFEQLCF-------------DDLRVKQG---DPLYLHKSTILIDKIGVIQLFMRSKLH--------NAAE--LQNWFYE--RVLPQCTARQ | BROI_BmNV_13751084 VKIGE---FKFG------EDTFTLR-YVLDA-EQQVKFVAKDIA-SSLKYVNCK-QAVI---------------------------VNVDNKYK-TTYEQACI----------NISKENRVKQG---DPLYLQSQTILLDKIGVIQLFMRSKMT--------NAAE--LQNWFYE--HVLPQCTARQ | Brob_BMNV_9630900 VKIGQ---FKFG------QDEFTLR-YVLGD-EQPVKFVAKDIA-RSLKYVNYE-KAVR---------------------------VHVDVKYK-TTYEQACI----------NISKENRVKHG---DPLYLSPQTILLDKIGVIQLFMRSKMH--------NAAE--LQNWFYE--HVLPQCTASA | BROG_LdNV_9631042 THLQH---FEASL---DDGVKFECW-GVVTP-DGKVACKLKEFM-DFLGYKEVN-SAYK----------------------------MIPKEWK-VYWHKLQDDL----------CVDS---------SVDLHPRNVFVYEPGMYAFMTRSGSP--------LAKW--CMGFLYD--VVVPTLKKNQ | ORF130_XnNV_9635380 --------------------------------MDKLLYTGHGVA-ESLGYKCPR-RALY---------------------------DHVKPQWR-KTWAEIKKL---------TFFNEAL-------LPSNWQPNTVFITEAGVYALINKSKLA--------GAEI--FREWLFD--TIIPQMRRAK | bro_HaNV_12597544 MSLTK---IQFG------DKEVET--YTVDF-NGEKWMVANPFA-EALNYSRAN-KAIL---------------------------EKVSDGNQ-KTFDQIKPYR--------IVHDGTGESSV---IPRNMKPNTKFINRAGVFELIMSSQME--------YARQ--FRYWLSS--VKLNTTVETD | 201R_CIV_15078913 YMTIT---INGN------EHQIKLA----GI-IEDPYFCGKDVC-TILGYKDKE-QALR---------------------------KRVKSKHK-KSLSELFEKK----LPVVTTGNFFLGTQN---ELSYHEGKSIYINEPGLYNLIMSSEAP--------FAEQ--FQDMVYE--KILPSIRKYG | 289L_CIV_15079001 YMTIT---FCNQ------EHQIKLA----GT-VDTPYFCGKDVC-KVLGYKDIK-DALK---------------------------KHVDREDK-LPLSEIKKVG-------GTAPPTFLGQTY--AYLSHNDGRAVYISEGGLYSLIMSSEAP--------FAKD--FRRLVCN--VILPSIRKFG | MSV023_MSV_9631535 DLIS----------------KINI----ITY-NNCSYYKAKDIA-DILNYKSVD-YFIK---------------------------KYVKNEHK-INYE-----------------------------------STIYVNNSGLYYIMFKSKKH--------EAEK--FQNWIKE--ENLPEIENNK / AMV175_AMV_9964489 TFNEI---FNYN------DVKIKVI----GT-INNPWFCGKNIL-KALEYSDDSHNKIL---------------------------NRLDDKFK-DNMYNILSSV-----------RDNLS------MTKNNKNKAIYLNEPGIYYIILHCTKD--------SAKG--FQDFILF--DLLPTIRKRT \ AMV057_AMV_9964371 NFNNI---FKFN------NISINII----GS-LDNPWFKGKDILIDGLEYTDQSAKCVL---------------------------KRLNTSFK-KSYNDIISVE-------GNLPP-----------TKNNDNKAIYVNEAGLYYIILHCTKD--------SAKG--FQNYILF--DLLPSIRKRA | orf6_HaEPV_3510491 --MKS---FKYK------NINIDVL----GD-INYPWFNGKNILIDGLQYTEQSAKCVL---------------------------KRLESKFK-NKLSDIICVG------GNLPPTGNLDKIS--NITRHNDGKAIYINEAGLYYIIIHCTKE--------SAKP--FQDYILF--DLLPSIRKLA | AMV177_AMV_9964491 NFNKI---FKFK------DTDIKIN----GT-IDQPWFCLKDIIIYGFGYTKESYKSIL---------------------------KELNNSYK-KSLYDIIVEG---------------GKTP---PTKNNENKAIYVNESGLYYIVFQCTKD--------SAKD--FQKYILD--ELLPSIRKLA | BROA_LDNV_9630998 MALSK---VEFV------NGPLEVF-TVQDD-KQENWMAANPFA-ETLKYLNVN-RAIR---------------------------VHVSKHNQ-KTLDELQSD------------RNGL-------ITSSLHPQTKFINRAGVFELISASEMP--------AAKR--FKQWNAN--DLLPSLCREG | BROC BROL_LDNV_9631113 MALSK---VEFV------NGPLEVF-TVQDE-NQEKWMVANPFA-EALGYTRLN-YAVT---------------------------QHVSVVNQ-KTYEEFKSQG-------STATDDS---SL---LPRNIQAKTKFINQAGVFELIGASEMP--------AAKR--FKTWNTN--DLLPTLCAEG | BroO_LdNV_9631121 MALTK---VEFV------NGPLEVF-TVQDE-NQEKWMVANPFA-ESLKYAIPH-IAIS---------------------------KFVSTVNQ-KTYEELRSMR---ITSRITSTDDS---SL---LPRNVQAKTKFINRAGVFELISASEMP--------AAKR--FKTWNTN--DLLPTLCAEG | BRO_HaNV_12597545 MSLTK---IQFG------DKEVETY--TVDF-NGEKWMVANPFA-EALSYSNVN-RAIR---------------------------VHVSEKNQ-QNYEEFKSDR--------VGLTDSV--TS---LPRNIQAKTKFINRAGVFELINASDMP--------GAKR--FQAWNNN--DLLPSLCQEG | ORF109_XnNV_9635359 -------------------------------------MVANPFA-EALNYSNVN-RAIR---------------------------VHVSNQNQ-KCMEELRSDR-------CGLTDDS---SC---LPRNIQAKTKFINRAGVFELINASEMP--------AAKR--FKAWNSN--DLLPTLCTDG | ORF159_XnNV_9635409 ARKQK---FLYC------NEELNVI-TQVDE-FGEPWMVANPFA-TVLQYYKPN-DAVR---------------------------KHVSEWNV-KSYEDFRSRR------IGADDSSHWVDE----ITSSLHPKTKFINRAGLFELIQSSRMP--------KAQE--FKNWVNS--DLLPKLCQEG | BRO-d_BmNPV_9630955 VKIGQ---FKFG------QDTFTLR-YVLEQGNPQVKFVAKDIA-SSLKYGNCK-DAVS---------------------------RHVDKKYK-YTYSESGARL-------PPSAPNSVAKQG---DPLYLQPHTVLITKSGVIQLIMKSKLP--------YAIE--LQEWLLE--EVIPQVLCT- | Essential for lytic infection Bro-III_BmNPV_13751089 VKIGE---FKFG------EDTFTLR-YVLEQGNQQVKFVAKDIA-ISLKYASYE-KAVR---------------------------VHVDGKYK-STFEHAG-QI-------GHHAPNSVAKQG---DPLYLHPRTVLITKSGVIQLIMKSKLP--------YAIE--LQEWLLE--EVIPQVLCT- | ORF2_AcNPV_9627744 VKIGE---FKFG------EDTFNLR-YVLER-DQQVRFVAKDVA-NSLKYTVCD-KAIR---------------------------VHVDNKYK-SLFEQTI-QN-------GGPTSNSVVKRG---DPLYLQPHTVLITKSGVIQLIMKSKLP--------YAIE--LQEWLLE--EVIPQVLCT- | AntgemNPV_9799895 VKIGQ---FKFG------EDVFTLR-YVLDR--DIVKFVAKDIA-NSLKHTNAA-EAVR---------------------------NHVDIKYK-TTYEQGE-TV-------SHPASTSLVKRG---DPLYLQPHTVLITKSGVIQLIMKSKLP--------YAVE--LQEWLLE--EVIPQVLCT- | ORF153_LdNPV_9631120 VKIGE---FKFG------EDTFTLR-YVLEK-DQQVKFVARDVA-VSLRYERPA-DAVS---------------------------KHVDIKYK-STYAELGRQI-----ADPTLNVKLIVKKG---DPLYLQPHTVLITKSGVIQLIMKSKLP--------YAVE--LQEWLLE--EVIPQVLCT- | BRO-c__BmNPV_9630901 VKIGE---FKFG------EDTFTLR-YVLGD-EQPVRFVAKDIA-SSLKYVNCE-RAIR---------------------------VHVDGKYK-STFEHAD-QI-------QHHAPDSVAKQG---DPLYLHPHTVLITKSGVIQLIMKSKLP--------YAIE--LQEWLLE--EVIPQVLCT- | BRO-a_BMNV_9630839 VKIGE---FKFG------EDTFTLR-YVLEQGNLQVKFVAKDIA-SSLKYVNCK-QAVI---------------------------VNVDKKYK-TTYSESGSIP-------YTPAPDNVVKQG---DPLYLQPHTVLITKEGVIQLIMKSKLP--------YAVE--LQAWLLE--EVIPQVLCTG | bro-a_SpLNV_7672865 VKIGE---FKFG------EDTFSLR-YVLER-DQPLKFVAKDVA-ASLKYQDAK-RAIK---------------------------IHVDDKYR-STFEHGG-QI-------APLVSNALAKQG---DPLYLHPHTVLITKEGVIQLIMKSKLP--------YAVE--LQAWLLE--EVIPQVLCTG | BROB_LDNV_9630999 VKIGQ---FKFG------EEEFTLR-YVLER-DQSIKFVAKDVA-ASLKYVDCK-QAVR---------------------------INVDDKYK-FTFEQGCVP--------HTLASDSVAKQG---DPLYLHPHTVLITKEGVIQLIMKSKLP--------YAVE--LQAWLLE--EVIPQVLCTG | BROP_LDNV_9631128 VKMGE---FRFG------EDVFRLR-YVL---NDPVKFVAKDVA-GSLKYQDAK-RAIR---------------------------IHVDDKYK-STFEHGE-IR-------SHLASNALAKQG---DPLYLHPHTVLITKEGVIQLIMKSKLP--------YAVE--LQAWLLE--EVIPQVLCTG | ORF114_XnNV_9635364 RKQV----ILFQ------NEPVEVVFSDKTGPDGLVYYF-FEVT-PFARLMNVD-NPL----------------------------SKIDSQHV-IVVEEPVTA----------ADTNNW-------AVRNNTRSTTLVSEAGLYQLMFTGKPV--------TVRQGMVRNWLFD--IVLPTVKQFT | BROJ_LDNV_9631081 VKIGQ---FKFG------QDTFTLR-YVLGG-EQQVKFVAKDIA-SNLKHANCA-EAVR---------------------------KHVDGKYK-STFEHGE-IR-------SHLASNALAKQG---DPLYLHPHTVLVTKEGVIQLIMKSKLP--------YAVE--LQAWLLE--EVIPQVLCTG | ORF13_SeNV_9634234 -KTKR---LQFDD-----QFSFTVD-YIF---NDEVWIAGNKLA-EGLGFREPQ-TAID---------------------------EFVDGKYK-RTINELVFN-----------------------NSVDDTNGLVCVNKHGVLQLIDRLDFK--------NKAE--FTAWIIE--EVYVELENKF | 38_7kd_HzNV_10442572 LERKR---INFDD-----QFSFTVR-HLTR--NQQMWMIGSDFA-SGIGFDEPE-FVVD---------------------------NYVSNHNK-ICLETLIFG-----------------KRV--EIENDDVKRSMCINRDGCLQLLNHIEFA--------NKSE--FIAWLVT--YAFDKLYSHM / MSV195_MSV_9631451 MNLDN---LIFN------NKKIHI---VIDN-NNKVLFKAKNCA-EILKYTNPL-KAIR---------------------------DHVRQKHQ-ISFKNINMN-------------DSF-------ILNNIHPDTIFITESDFYSLISK------------------------------------- | solo MSV196_MSV_9631450 ---------------------MNI--YVAIF-NNKSYFRAKDCA-SILEFKHTK-DAIR---------------------------HYVSNGNK-IKFKNINIR-----------------------SKKYIHPHTVFINNFGLIELILKHKSI--------VHHN-IIDKLICK-FDLNVDLNITP \ + vsr nuclease MSV024_MSV_9631534 MDLMQ---------------GI----HVINY-NDNLYFKAIDIA-KLLKHKNIY-RAIK---------------------------YKISDCNK-TLYKNISNT-----------------------NLSYKKNKMVYINKLGLIELIKESTTI--------VSPM-VINGLINKFNLNLDLPIKFI | MSV026_MSV_9631533 DIISN----------------IKT----INV-NNCLYFKGEDCA-KILKYKNTY-GAIR---------------------------NNVSKNNK-IKFE---------------------------------KNNDIYINKLGLSELIIKHKSI--------VSTN-TINTLIHNFNLNLDLFEKKK | MSV204_MSV_9631444 MEI-----INYN------NNQIHL---LYTT-IGEVYYKGKDIA-KILRYIDTK-KVIR---------------------------NNVLSTNK-VNYSTLIKNV----------SING--------QLHKTPHHTIFINTKGLKNLFDMP------------VKR--LSN--KEINDLIEFLNSHN | 069L_CIV_15078782 GGLRA--IFNLD------GVTLDTP--IMGT-WDKPVFFGKEIA-EFLGFKKPK-DALQ---------------------------KHVKPKYK-TTLSKVLEKK---------LDTEPV---------SYNEGKRVLLYKEGVVELIKKTRLV--------GIEN--KIDALIE--AFELNLNVVH / ORF62_XnNV_9635312 IVKKT---FTSD------KKKWELY-NITSC-PYHFYYEAYPIA-KLLCNKHPE-LAIK---------------------------NYVDRSCC-KIYEELKRWFRPYCIFQSVGSPCSPGPNN---QPIHWQSNTLFINKDGIISLINNSTLP--------VAHE--FKRWFLA--QRHDEAEVFK |solo ORF60like_HzNV_10442560 LVNRK---CKLG----------EVW-ITEIE-ENRFLCSGHGVA-EALGYKCPR-RALY---------------------------DHVKPQWR-KTWAEIKGVL------NQHSLVTSSDSIE---MPLNWQPNTLFITEAGIYALIMRSKLP--------AAEE--FQSWLFE--EVLPELRRTG \ BRO_HzNV_12597590 LVNRK---CKLG----------EVW-ITEIE-ENRFLCSGHGVA-EALGYKCPR-RALY---------------------------DHVKPQWR-KTWAEIKGVL------NQHSLVTSSDSIE---MPLNWQPNTLFITEAGIYALIMRSKLP--------AAEE--FQSWLFE--EVLPELRRTG | ORF60_XnNV_9635310 LVNRK---CNMG------GINADIW-LTQME-MDKFLYMGHSIA-KSVGYANPQ-KAIR---------------------------DHVRPEWR-KTWSEIVDGT------NRSPLVTSFNDSH---LPANWQPNTVFITEAGVWALIIKSKLP--------AAEK--FQKWLFE--EVLPELRRTG | ORF131_XnNV_9635381 -----------------------------ME-MDKFLYMGHSIA-KSVGYANPQ-KAIR---------------------------DHVRPEWR-KTWSEIVDGT------NRSPLVTSFNDSH---LPANWQPNTVFITEAGVWALIIKSKLP--------AAEK--FQKWLFE--EVLPELRRTG | BRON + C synapomorphy shared by this group BROD_LdNV_9631039 MALQR---FEFPMSADEDESKFECW-GIVMP-DGSVAVKLKELA-EFLNYEDVK-KAYK----------------------------LVPDEWK-ITWNILQNKL-------EPSRPHLVAPST---TPANWQPETLFVLEPGVYALMARSTKP--------MAKE--KMKYVYE--TILPTIRKTG | BROI_LdNV_9631080 MALQR---FVFPMSADEDGAKFECW-GVVMP-DGDVAVKLKELA-LFLGYADVK-MSYK----------------------------HVPDEWK-ITWKNLQNKL-------ASKRHQLVAPPT---TPANWHPETLFVLEPGVYALLARSNKP--------LAKE--RMKFVYE--TILPTIRKTG | BROC_LdNV_9631038 MALQR---FEFPMSADEDESKFECW-GVVMP-DGSVAVKLKELA-LFLGYADVK-MSYK----------------------------LIPEEWK-ITWKNLQNKL-------ASKRHQLVAPPT---TPANWHPETLFVLEPGVYALMARSTKP--------MAKE--KMKFVYE--TILPTIRKTG / CnBV__13160526 FQLQN---WDVD------DKSVVLRLYIHPI-TNEPWVVAADLA-RCLGYEKYR-QTHT----------------------------RILAAFKRKLSDLVHTEP-----FSGTVESEVARLEGAPVELSSRERDIVVVNEGGIHQMLIGSRLP--------NVQK--YKELVFG--KILPAARARG \BRON duplication PxORF82_PxNV_11068085 VGCSV--GILFD----------KLH--YIVI-DGVVWFKLNQIC-KYFD--------IP---------------------------KQCPD-YNIITWYTLSKRL----------------KSN-----ITWKLNTIMISDMGVYKLLIIKNEI--------IAEE--FYH------KRLHELRSTG / ORF99_CpGV_14602336 ESVDSVCGVL--------PSNIEF----FSV-NERTYFKGLDVA-RHLKCSPS--YTIN---------------------------KYVADTDM-VLWGDLRRYV-----------HDKYVWTN---CKNHWKDNTIFLKETGVKQLCIATQGD-------DKLYQ-EMMDGVYNYDSGDEQVVYAK |bro duplication SPy2127_Spy__13623110 QVITT---TNFH------GQPLDIY----GD-IQEPLFLARAVA-EMIDYTKTS-QGYY---------------------DVQAMLRKVDEDEK---LKGMAL--------------EGTTKN------FRSGQKVWFLTEHGLYEVLMRSNKP--------KAKE--FRKAVK---NILKEIRLNG | SinR like HTH p63_BPMx8_15320633 PTPEMPKPFLFEGS----TRIRVVVDE-----AGEPWFVAQDIA-HALEYRMAS-D-LT---------------------------RLLKPHHL-RTHAV---------------------------RTNRGERSATIISEPAMYRAVFLSKSK--------KAEP--FQEWVTS--DVLRSIRKTG |p63C consensus/85% ........h............hp.............bh..pshh...L.h.p....sh..............................l..p.b........................................p..hlsc.Ghh.lh..sp...........s....hb.hh.....hls.h.... 12. T5orf172 ------------------------ PHD Sec Str. ------EE---------HHHHHHHH----------EEEEEEEE---------HHHHHHHHHHHH-------------------------HHHHHHHHHHHH orf172_BPT5__93750 PAWKNQYKIGMSQN---PKERLAQYQTYSPY-RD--YKLEHWS-FWF---DKRKGEKLIHQYFKDLK--------------EHEWFSINSRDLSKYLERINSSSD \ yeeC_Bs_7474985 SSIKNLYKIGFTTG--SVENRIRNAENQSTYLYAPVEIVTTYQVFNM---NASKFETAIHHALENNNLDVSILGANGKMLVPKEWFVVTLEDLQAVIDEIVMMVH | orf240SM63E2_14194257 MRSAKRYKIGKSNS---PSRRYREVRLDLP---DA-TILVHTI-PTD---DPSGIEAYWHRRFADKRV------------RDTEFFNLTASDVTAFKRRKYQ--- |solos or with insignificant extensions CIV460R_CIV_15079171 YEPLDIYKIGCTKD---INRRLKTMNASRI-SFDK-FFIVNQI-QTF---HYFKLEQGLHKLLKKYRL-------------NNEFFQCNVNIIEKAISDYANNNV | BROF_LdNV_9631041 YRDRRIYKIGRTAS---PADRLCALNTGRA-DDF--LYFEHVS-PDLGHEASVRVERLMHDSLAPLR-------------MHGDSFN------------------ | NMB1170_Nm_11345564 TVIKGVYKIGISDV-SNFEGRMRHLENNGYANVAG-LERILAV-KTD---NYKEKENLLHEIFSKSRI------------GDTELFAVDENLVKRLFLSLRGEIV | ORF1_BPP27_8346568 SFGENVYKVGMTRR-LEPMDRVKELGDASV-PFD--FDVHAMI-SCD---DAPALEKALHDYLERYRV--------NKVNLRKEFFRVELEKIIEVVKHHHGNIE / CIV315L_CIV_15079027 LQVHNVFKIGYTKN---FEERLKTFNDYRH-SLEPQFFAVAIY-DTD---NAKKLETTIHKKLKDFRS-------------EGEFFQVELSVIKEAFLKEDCCLK | + kilAN AMV209_AMV_9964523 YASINNFKVGKTDN---LSSRQSNFNSSHI-DQDE-FYICFYQ-KVY---NMSKTENLIHDLLEDFR-----------DKKRKEIFIIHYTYLLDIINLVIKNIN \ AMV207_AMV_9964521 YAMINNFKVGKTDN---LSSRQSNFNSSHN-TEDE-FYICYYE-KVF---NISKTENLIHDLLDNFR-----------DKKRKEIFVIHYKYLLDMVNLVIKNIN | MSV198_MSV_9631448 YAKLNTFKIGKTDN--LISKRQSQLNNSHT-SFDK-IYICYYE-AVY---NPNKVEQIIHDVLESFR-----------DSSNNEFFILHYKYLLNIVKLIIKNIN | +MSV199 AMV194_AMV_9964508 YAAQNRFKIGGVENNNLIKPRLSTYNSRSA-EGDE-WYYTYIK-NIN---NYKHFENRFWSVMSSFR-----------DKKDKEIIVLYYNDLINIFNFISENYN | CIV420R_CIV_15079131 YAAQHRFKVGGVEGRRRLRGRLSDYNGRSA-SGDE-WYFCHLI-DVA---DFRKAEGRIEDIIGKFR-----------DKKDKEIYIMPYRKLLKVIELICQNYT | MSV021_MSV_9631537 YKEKNIYKIGYTND---VVGKLVKMNSNRL-KFEQ-FYYVKIY-KVN---NIFSIQNYIYKKLYPYI-------------LNYPYLNCD----LNVITNAMENID / orf117_ESV_13242588 EDNSVLVKIGSTKN---IRARTTGLVNEFG------SMAIFRIFECD---RYEEFEKSLHKHNDIKRY---RFKKPINGKRSMEVFNMTKEELQRAVNIAGSNVC \ BROM_LdNV_9631117 LQTVDAYKIGYTHD---LHDRIAELNVASP--LD--FKPVFVY-DTA---TPRRLEQQLHNYFLDKR-------------IKREFYKLDKEDLLMLPVVCNKLCA | ORF59_HaNV_12597544 LQMIDAYKIGYTFD---LTARLNELNVASP--LD--FKSVFVR-ESS---NPYDLEQKLHRHFHESRI-------------KREFFKLTEEDLALLPLICDNLLA | CIV289L_CIV_15079001 YQQQHKFKVGGVQTFDLLKSRLTQYNSGES-DSEA-HFFIYIR-KTV---NYRSIEHAIKGLLSGFR-----------ENQSNELYIMHYDWLVKFVDAIMDGNA | CIV201R_CIV_15078913 YQQHHKFKVGGVQSFKDLKSRLTQYNSGES-NSEA-HFFIYVR-KTV---SYRSIEHIIKGLLSGFR-----------ENQSNELYIMHCDWLVKFLDAIMDGNA | MSV194_MSV_9631452 DLSKNIFKIGKTNI-NSIKNRLSTYNTGAS---DP-YYYVFYK-EVY---DATKIEKDFNTLMNRYN---INVTSPNKTKLNNELYKLYYLDLEYVLNAVIDSND | MSV023_MSV_9631535 DLSNNIFKIGKTNI-NSIKNRLSTYNTGAS---DP-YYYVFYK-NVY---DGNKIEKEFNYLMNRYN------VLLNNNKINVELYKLYFPDLEYVLNAVIDSND | +BRON BROB_BmNV__9630900 YAERNLFKIGQTTN---LTRRLATLNCGRA-DDDQMQYVLQTE-PTV---HHTLLEKLMKQELRPYR-------------NSGEVYCTDFEHIKRALESCLPHCS | BROI_BmNV_13751084 YAERNLFKIGQTTN---LTRRLATLNCGRA-DDDQMQYVLQTE-PTV---HHTLLEKLMKQELRPYR-------------NSGEVYCTDFEHIKRALESCLPRCS | BROK_LdNV_9631082 YAERNLFKIGQTTN---LTRRLAALNCGRA-DDDQMRYVLQTE-PTV---HHTLLEKLMKQELRPYR-------------NSGEVYCTDFEHIKRALESCLRRCS | BROE_BmNV_9630956 -AERNLFKIGQTTN---LTRRLVSLNCGRA-DDDQMRYVLQTE-PTV---HHTLLEKLMKQELRPYR-------------NSGEVYCTDFEHIKRALETCLPHCS | ORF130_XnGV_9635380 YKSKHIYKIGTSRS---PAKRVRQLNCGRP-YDL--LILDHCQAAAD---QGFIVEALMLNEYKTQQ-------------LHGEWVQFADNKQYQSAKKKLDEFI | BROG_LdNV_9631042 NRERNLYRIGRTAS---PTALLCFLNEDRH-EDR--FYLDYVS-PDVSREGSVRAERMIREHIESLQ-------------THGDFYQFATKEALDLMREAIVKIQ / consensus/90% ....p.aKlG.s........R...bps..........bh..h...s....p...hE..b...b...p..............p.-hb.h.......h...h..... http://www.bmm.icnet.uk/servers/3dpssm/output/a47ba25a21f354b7.job_summary.html 13. MSV199 ---------------- PHD Sec. Str. HHHHHH------HHHHHHHHHHHHHHHHHH------------EEEHHHHHHHH------------------HHH------------------------------------------------HHHHHHHHHHH----EEEEE----------HHHHHHHHHHH---------EEEEE---HHHHHHHHH----HHHHHHHHHHHHHHH- CIV146R_CIV_15078859 LIDIFIEEEQN-----------FGTILNE--MTCQ-------HKIYISKKLLKWIGYEG--------------DYKK------------------------------------------------QRDSFKKLLKRHNIDFEELKSNDIECEN--YPEIKVDMANL-SNGVISQSKWLILNIYNFKYI------------------------ |+UVRC MSV199_MSV_9631447 MLNIFEFIEQN-NFEINLG-SWFNEIWLP--LFNK-------TELLITLNILHFIHYGTSKSVLDGNTT---LNYRE------------------------------------------------LKRDFEKILNNNKIKYKKIKYEEIVNNKNYYELVKNEIKNI-TPNNLNKSTWFILDVLQFKMLIMRLSTNVAKEICEYYVTLENILH |solo MSV198_MSV_9631448 MLNIFEFIEQN-NFDIKLG-PWFNEIWIP--LFNE-------TELLITLNILNFIHYGTSNVVLDYHPM---RNNTN------------------------------------------------LKRDFEKILNNNKINYKKIKYNDIINNEDYYNKVKEEIENI-RPCNLEKSTWFILSVDEFKMLIMRLSTNVAIEVREYFILLEKILF \ AMV209_AMV_9964523 FVDIFTFITNN-DYDFKLG-SWFKDIWYP--LFEE-------KDVLITNDILTFIYYFPEG---SQPPP---EMFKG------------------------------------------------YKKNLIDSLNNYNIKFIEIDYKHEYVLT--NKKLKNEIKFI-TPNNILRKRWIILSVENFKLLIMRLNTKSAHYIREYYLFIEGLLY | AMV194_AMV_9964508 FVDIFTFIKNN-NYEFKLG-EWFIDIWYP--LFER-------KDVLITNKILYFIHYGISGG-DTHPPL---EKYRL------------------------------------------------MRKDLEKILKNYNINYIKIKYYKNIDID--YNFLIDEIKNI-TPNNIIQKTWIKLSVKNFKKLILKIRTAIADDIRDYYITLEEILY | AMV207_AMV_9964521 LMDVSTFITYN-NYDIELG-SWFKDIWFP--LFNK-------KNVVITNEILNFIYNFQVGKCFPTYNL---DNYIQ------------------------------------------------YKKDYRSFLKKNNIEYNIIKYDENILNK--YNILKSELKLY-DKHALVQKTWLILSVDDFKESIMMMNNNNSKMIRKYYIKIEKILF | +T5orf172 MSV021_MSV_9631537 VNNIFSIQNY---IYKKLYPYILNYPYLNCDLNVITN-----AMENIDKSLLSNNLY---------------TEYQN------------------------------------------------LKDNFETILTTNRIKFKKLKYHEMSDEN--REMLNSEVIKL-SMSELANTTWCILKTSDFKNLILQINTLPVEEIREYYLLIEKILL | MSV191_MSV_9631453 MEHIYEYIENKQNENIIMN-PWIKDICLP--MYNK-------SNVLITSSILKFLYFGPKIPINDSPGYIYVDEYKKNEIYAIYYSKDNIEFPCKIKINVDDMVLVKNYLCYKLSEYKYGDSGELFKCDFDIILRAMEIPYN-----NELLTKNLEKVLIEKNITY-SKSEYNDSFRLIVHIDQFKLLINKLN---IDILSKPYEAVEKIIQ | CIV420R_CIV_15079131 MTDLFTYIKDK-NIAIDLNSKWFQELWYP--LSKK-------TGSIITTRLLEWMGYSG--------------EYKL------------------------------------------------QRQNFKRLLDNNNIPYEEIYHNDDRFLE--HPSMIYEIEQT-DKKQIKQKRWITLEMRNFKKAILRLNTKNAEVIRDYYLNLEEACF / CIV468L_CIV_15079179 LLDIFKFIEIT-NFDLD--PIMTNWFWQV--MVNN-------HSTHLGRVVLEWFGYEG--------------EDSN------------------------------------------------QKQKFIDMLKRNKIPYKQLKHTDNEIEL--YPSIKEEMTLLPHKGAIASSKWLVMEPFNIKMAMLRLNTKNADIIKRYYIKMEELIR \ CIV238R_CIV_15078950 ILD-----SAMNESKIKLDISWFFDNYMDQELTNVMNYFDGEEPIHINTVVLEWFGYEG--------------DLRT------------------------------------------------QKRKFIDMLKRNSIPYKELTSKE-EIEL--YPTIKEEILSLPHKGAIACSKWLVMKPYDIKIAMLRLNTKNSQIIKQYYIKMEELVR | CIV212L_CIV_15078924 LMDLETFIDTT-GFEKD--PIMNDYFWQI--MVTK-------QRTHLSAMLLQCLGYEG--------------EFRV------------------------------------------------QQQHFKRFLKSNNIHPLELTSSDPDIKN--YPTIQDEMKLL-KPNVISNRKWLIVEPREFKKVIMKLNTKHGDRIREYYLCLEEL-- | CIV388R_CIV_15079099 YLEIETFMDVI-GFVKD--PVMTDYFWHI--MVDN-------HCRHLATVLLECLGYEG--------------TYNK------------------------------------------------QQYAIKRFLKSNRINYSELSSDDPQIDL--YPTIKEEMKNM-KPNAIACRKWLIIEPREFKKVIMKLNTKNGDNIREYYIRLEELIK | +BROC CIV019R_CIV_15078732 KLEINEFIDLFIG---------EENKWNK--MFDSDL-----SGIHISSLILNQLGYEG--------------EFKN------------------------------------------------QQTCFKRFLKRNNIIIQEFSSSNPELKL--YPSIQEEMKNM-KTNVIANRKWLIANPRDLKKIIMKLNTKNGDAIREYYICMDELVQ | CIV211L_CIV_15078923 LLDIPSFMKVA-GIEFD--PIMFNHFWQV--LVDNGD-----RLPHVGETTLNWLGYEG--------------VFTK------------------------------------------------QKEKFINMLKRNQISFKELSYQDNEIQL--YPSIQKEMLLLPNESAKTKSKWLLMNPDDFKMAIMGLKTKNSEKIKRYYVTLEKTMK | CIV148R_CIV_15078861 IVDIIKFVEIT-NFDID--PFMIDKFWHT--MYDN-------SLLYISRDILEWMGYTG--------------EFGE------------------------------------------------QRKAFKKLLKRKNINFTELSNNDPTKHL--YPEIQKDSLLL-SNAVVSQSKWIIMNSDDFKDSILMLNTKNSGKIRKYYRSFEKLLK / consensus/85% h.pl.phbp....b.b.....h...ba....h.pp.......p...ls..lLphh.Y................php.................................................bbpphbphLpp.pI.a.blp.pc.......b..lbp-hb.h.p.s.l.pppWhlhp..phKbhhhblps..s..lpcYY..hEphh. http://www.bmm.icnet.uk/servers/3dpssm/output/e8c2fb22ee6b2ed4.job_summary.html 14. CIV029R / BROC ---------------------------------------- PHD Sec. Str. -----------------------HHHHHHHHHHHHHHHHHHHHHH---------------------HHHHHHHHHHHHHHHHHHHHH--------------HHH----------------------EEEEEE-----------EEEEE---------------------EEEEE--------HHHHHHHHHHHH---- 029R_CIV_15078742 -------------------------------------------------------------------------------*MVERLGI--------------AVED-------RSPK-LRKQAIRERFVLFKKNTERVE--KYEYYAIRGQSIYINGRLSKLQSERYPKMIILLDIFCQPNPRNLFLRFKERIDGKSEW \ ORF17_DpAV4_11931709 ------------------------------------------------------------------MGVQLDETNEQLNEMNNKLDV--------------AVED-------RAPI-PEDQSKVERFVFLKRPNE-----NYPYYAIRAQAASTKTAIRK-QQKEFGAIELLLDFETHPNTKTYYNRIKWR*------ | solos ORF116_OpNV_9630054 ---------------------------------------------------------------------MLEDKDRRIQELYASLLE---------------MSE-------RAVQYPAKGHQTP-MLCVARE-------FNCLRAITGQKVHVTKMKREL-TD---AAELVIDA-MRPNPQVDLNNFVNRV*----- | ORF103_EpNV_15213228 --MSPPDLTPMEKLLE-----SIENQIKIK-DEQLRKNNEMLERYIML----------------------LEEKNKRIEELYRSLME---------------MTD-------RVVQYPAKSYQTP-MLCMTRE-------FNCLRAITGQKVHVNKMKRDL-TT---AAEIIIDS-VRPNPQVDFNNIVNYVESEFKE | ORF3_DpAV4_11931724 GHGDHPQSQHCIG-------YEPETGGEGIGGGEIREQRRLLNRLA-------------------QMGIQLNETNEQLNEMNNKLDV--------------ACED-------RAPI-PDDCSKVERFVFLKRRTS-----DYPYYAIRAKRRARRRPIRK-QQNEFGVINILLDFETHPNTKTYYNRINWALNKRGVK | ORF122_LdNV_9631089 TDSGYDEDYEEEEDEE----QNAILAHLRATNASIREIQQKLQTLEKI---------GGILNRADADADADDDLSFL------DEPD--------------VEPD-KPPVGATVKF-PRDATKHPWLTVLAKEVRREGAVATEIAFATSRAA-ASARKRKYS-----DMSLIYQG-VHPNPQLAVCCITEEWQERGLS | P20_LsNV_2760643 FFVNTKYFVDMEAH-------IETQQYLIK-----SIADKDVIIQHK-------------DAQIAELLNAILLANSQCMSLSKRLVD--------------IVQD-------VVVKPQNCQLLHA-LAVCELS-------CNKFAFLRTQLRSLKRSIKRLQRAEQHEPTIIYQSEYVPNSINILNKIKEQLPKDKFT / ORF10_EpNV_15213135 LAIKTDKGYDCDD-------VRDNIKTVLKHIKTLNVNSDKFINAHKLFENQVCARFEQLEQRLETLERVPDA--------PTMP---------------------------GVIF-PRDVNKHQHLAVFVNQERG----NTQIGFARGQEEYFRKRKLEFEEE---DMHKMLET-VHPNPQMAVQCIKDRFISNGYK \ ORF12_OpNV_9629950 LAVGANKDHDRDN---LLDKIEAVLNHVKTLNTNSDKFISAHKSFKLEVGARFE-QFEQRLQTLDTKLNALQCA----APTRTAP---------------------------GVVF-PRDVTKHPHLAVFMGRVEDRG--VTQIAFARGQEEHFRKRKLEFEE----GMDVDVRG-RAPNPLLAVHCIKEEFANGGHK | ORF13_ACNV_9627755 --LHIQTEGERDDLRDKIESVLKHVKKLNANSEKFMVTHETFKNEVGN-------RFEQFELRLHELDAKLNML-QSAEKLKTAVVAE-------------SKNG-------TVTF-PRDITKHQHLAVFSERIDD----RIKLAFVLGQERHFRKRKMRFED----DMEVLYDG-VHPNPLLAIQCINEKLYDKHYK |solos more closely related to the ones with BRON.. maybe truncated or maybe an ancestral solo orf13_BmNV_9630821 --LHLQTEGERDDLRDKIESVLKHVKKLNTNSEKFMVTHETFKNDVGN-------RFEQFELRLNELDAKLNML-QSAEKLKTAIVTE-------------SKNG-------TVTF-PRDITKHQHLAIFSERIDD----RIKLAFVLGQERHFRKRKMRFED----DMEVLYDG-VHPNPLLAIQCINEKLYDKHYK | ORF13H_MbNV_5565846 ----------------------QVIEKFDAFDRRVAELNDKMNMYEN----------VDDLYRRLREHHRTLERPQHMSF--LSSSNTIN-----------DDHDQRCIRFDTVRF-PRDTSKHPRLSVFVKPVEEG---GTKVAFVAGQQRRICALKRKYS-----DMEMIYDS-VHPNPQLAMQCINEELDLKNLD / BROA_BmNV_9630998 DDIIVEKDKIIVAK-------TEQNQQLAS---ALQEANQNLIEANKG---------------LMTAFNMINDARKETAQLANRMAD--------------IAQD-------VITKPSDPRLCHS-LAVCSLG-------GDQYAFLRPQKRNLKRSLDRLSVD---NREIVYKSEYVPNAMNVLNKVKESLPRDKFK \ BROL_LdNV_9631113 KEIICKKDEIIAVK-------EDENKKLTI---SLQETNQNLIIANKG--------LLQAFEIINEARKDSENARKETAQLANRMAD--------------IAQD-------VITKPSDPRLCHS-LAVCSLG-------GDQYAFLRPQKRNMKRSLDRLSVD---SREIVYKSEYVPNAMNVLNKVKENLPRDKFK | ORF109_XnGV_9635359 KQKIVEKDTIIAVK-------DEENKKLTV---ALQDANQNLIEANKG---------------LLQAFNIINEARKETAQLANRMAD--------------IAQD-------VIAKPSDPQLLHS-LAVCAMG-------GDQYAFVRPQKRSL----DRLSVD---EKDIVYRSDYVPNAMNVLNKVKEALPKEKYK | ORF60_HaNV_12597545 KLMLSHKDELLAVK-------DKENEALTV---ALQNANHNLAVANQG---------------LLKAFDVVNDARKETAEIAKRMAD--------------IAQD-------VIAKPSDPQLLHS-LAVCSMG-------GDQYAFLRPQKRSLKRSLDRLSVD---EKDIVYKSDYVPNSMNVLNKVKERLPKEKYK | BROP_LdNV_9631128 IAEESILRNEIVAK-------TEENKQLAT---ALIEANGKIILFAGA--------LVEANAGLLLANKNLHDANQTIGQMANRMAD--------------IAQD-------VIAKPSNPNLCHS-LAVCALG-------GDQYAFLRPQKRNMKRSLDRLSVD---NREIVFKREYVPNAINVLNKVKESLPRDKFK | BroO_LdNV_9631121 AVHVATNEGREAPW-------MKDLEEFKV---VLAEKDRKIDKLTNA--------LIQSNEKNNTLTQALIAVTERTDKLANRIID--------------LAQD-------VVTKPSNPNLCHS-LAVCALG-------GDQYAFLRPQKRNMKRSLDRLSVD---NREIVFKSEYVPNAMNVLNKVKENLPRDKFK | ORF159_XnGV_9635409 TVALQESNQKLVIT-------TEKLTDANE---KLTETNNKLVTLATA--------LVSANEGLIKANTMLNDARVETAQLANRMAD--------------VAQD-------VIAKPSDPQLLHS-LAVCSMG-------GDQYAFLRPQKRSLKRSLNRLSVD---DSQILFKSDYVPNSMNVLNKVKENLPKDKFK | 38_7_HaNV_12597608 RADHHSANENMHK---------SILGKVGDIENRLSELDHKISAIEK----------IDVLYNHLKNYHRLQTNNSN------DTALY-------------SEED---NFVNGFRL-PRDSSKHPHLGVLVRSVDQH---NTEIEFLTGQRNYYQTRKRKLK-----SGDLIYDA-VHPNPQVAVHRFNEELDMKNLS | BROA_BmNV_9630839 ------PAVKMDTN-YGVI--EELNKKLAFASESLAEANEKIIHFANA--------LVTANAGLVQANTMLNEARRETAQLANRMAD--------------IAQD-------VIAKPNNPQLLHS-LAVCALG-------GEKYAFLRAQKRSLNRSIKRLG-----SSDVVFSSDYVPNAMNVLNKVKETLPRNQYK | BROA_SlNV_7672865 ------PAVEMDAN-YGAI--EELNKKLTFASESLAKANEKIIHFANALVTANT-GLVQANAMLNEARKDCENARRETAQLANRMAD--------------IAQD-------VIAKPDNPQLLHS-LAVCALG-------GEEYAFLRAQKRSLNRSIKRLG-----SSDVVFSSDYVPNAMNVLNKVKETLPRNQYK | + BRON BROC_BmNV_9630901 ------PAVEMDTN-DVIAKIDDLTQKLTVANADLAEANRSLILFANE--------MIVARRDAETARQDCENARRETAQLANRMAD--------------IAQD-------VIAKPSNPQLCHS-LAVCDVG-------NNEFAFLRPQKRSLGRSLKRLG-----SNDVIFSSDYVPNSMNVLNKVKEAIPRNKFK | BROII_BmNV_13751087 ------PAVEMDTN-NDIAKIDDLTQKLTVANADLAEANRSLILFANE--------MIVARRDAETARKDCENARRETAQLANRMAD--------------IAQD-------VIAKPSNPQLCHS-LAVCDVG-------NNEFAFLRPQKRSLGRSLKRLG-----SNDVIFSSDYVPNSMNVLNKVKEAIPRNKFK | BROB_LdNV_9630999 ------PAVKMDTS-GALVKIDDLTAKLTEANANLMEANKSLIVFANEMIVARR-DAETARQDCEAARQDCEAARRETAQLANRMAD--------------IAQD-------VIAKPADPRLRHT-LAVCEIG-------QNEYAFLRPQKRNFRQSLNRLSVD---DRNVVFKSEYVPNAMNVLNKVKESLPRDKFK | BRON_LdNV_9631120 ELVKKQEFIERIVA-------IKDKQIEAK-DLQVTRVMTDLNRMYTGFQETMQR----KDEIMQQKDAQVTELV-------AKVVD---------------LSE-------RAVQYPADERKHP-VLCVARD-------GTTFMAIAGQKSYVRSQKHKRNID---AASVVAEA-TRPNPTVDWNNATHRLPAKKTK | BRO_AcNV_9627744 ELVKKQEFIERIVA-------IKDKQIEAK-DLQVTRVMTDLNRMYTGFQETMQK----KDEIMQKKDAQVTDLV-------AKVVD---------------LSD-------RAVQYPADKRKHP-VLCVTRD-------GTTFTAITGQKTYVENQKHKRNIN---VANIVVEN-IRPNPTVDWNNATDRLQAKRSK | ORF2_AcNV_93042 ELVKKQEFIERIVA-------IKDKQIEAK-DLQVTRVMTDLNRMYTGFQETMQK----KDEIMQKKDAQVTDLV-------AKVVD---------------LSD-------RAVQYPADKRKHP-VLCVTRD-------GTTFTAITGQLTYVEMQLHLRMIM---VANIVVEN-IRPNPTVDWNNATDRLQAKRSK | BROD_BmNV_9630955 ELFKKQEFIERIIA-------IKDKQIEAK-DLQVTRVMTDLNRMYTGFQETMQR----KDEMMHKKDELLQVKDTQVSNLIAKMID---------------LSD-------RAVQYPADKRKHP-VLCVTRD-------GTTFTAITGQKTYVESQKHKRNID---AANIVVEN-IRPNPTVDWNNATDRLQSKRSK | BROIII_BmNV_13751089 ELFKKQEFIERIIA-------IKDKQIEAK-DLQVTRVMTDLNRMYTGFQETMQR----KDEMMHKKDELLQVKDTQVSNLIAKMID---------------LSD-------RAVQYPADKRKHP-VLCVTRD-------GTTFTAITGQKTYVESQKHKRNID---AANIVVEN-IRPNPTVDWNNATDRLQSKRSK | BROJ_LdNV_9631081 -----EQFQETMQK-----KDEQFKETIQKKDEQFKETIQKKDEQFQE----------IIQKKDAQLQETIQRKDEQIARLIDAAMD---------------LSS-------RAVQYPADERKHP-VLCVARD-------GTTFHGIAGQRRYVQSQKRKLGVK---DDDLVLET-RRPNPALDWTNATHTTSAVKRSK | ORF114_XnGV_9635364 -VYRERELESKTNQ------LANKEKQLKNALSLIEFKENQLSEVISL-------TQKKDIQLEQQFTMLSSLMGKHIKKIE--ISD---------------SDD------------ELPQNHDT-VLMIVREN------NTTFKGIAAKRRYVDQQKQKLRYH---ESMIVVHS-KRPDPKRDWNAAMDIVVELGVK | AMV175_AMV_9964489 RKRTQKKYIDIINN------KQDKIDILSIKLDNISKQNNELLTQNQ-----------LALNKLQELGINLIETKEEIKDVKDKLNV--------------VIED-------RNVK-PKEVKLQHKYLLLKNKII-----NNEYKFIRAQDQYIKTNKSNWLE----KHNVIIDEKYNPNPIDMCSRLKSKIYELDKI | AMV177_AMV_9964491 ------QKCKIDELFN---QNKKIISQNNELINKTEYQNNEILKLNKQ--------NQLALNKLQELGINLIETKEEIKDVKDKLNV--------------VIED-------RNVK-PKEVKLQHKYLLLKNKII-----NNEYKFIRAQDQYIKTNKSNWLE----KHNVIIDEKYNPNPIDMCSRLKSKIYELDKI | AMV057_AMV_9964371 MDIISNQKDKIDD-------LFKKIDNQSLEINNISKQNNELLTQNQ-----------LALNKLQELGINLIETKEEIKDVKDKLNV--------------VIED-------RNVK-PKEVKLQHKYLLLKNKII-----NNEYKFIRAQDQYIKTNKSNWLE----KHNVIIDEKYNPNPIDMCSRLKSKIYELDKI | orf6_HaEV_3510491 LDIINNKQDKID-------ILTQDLEEIKNQNILTIEQNNKLLQQNQ-----------LALNKLQELGINLIESKEEIKSINNRIDT--------------IIVD-------RNIK-PSNPKLHHKYLLLKNKN------KNEYKFIRAQDKYIKNNKSLWLE----KYNTIIEEKYNPNPIDLCSRLKDKIAKLNPQ | ORF13_SeNV_9634234 ----------------------QVIERFEWFNVQISELNKKMSTLNN----------VDELYRRLQDYHKSNNINTTMFSNNASSSTALSSSSSYENNLMGGIVDNEHTRYETVRF-PRDTSKHPRLSVFVKPSEE----GTDIAFITAQQRRHNALKRKFN-----DMEMIYDS-VHPNPQLAMHCINEELDIKQFN | 38_7l_HzNV_10442572 RADHHSANENMHK---------SILGKVGDIENRLSELDHKISAIEK----------IDVLYNHLKNYHRLQTNNSN------DTALY-------------SEED---NFVNGFRL-PRDSSKHPHLGVLVRSVDQH---NTEIEFLTGQRNYYQTRKRKLK-----SGDLIYDA-VHPNPQVAVHRFNEELDMKNLS / 468L_CIV_15079179 MKITNHRQENMLIE------SHNMLRSMGVEIKDIRHENNDLLDQNN------------------ELLERVDDVLQKVNTVQKKLDI--------------SVED-------RAPQ-PDKNTRRERFLLLKRNND-----TFPYYTIRAQEINARKALKR-QRNMYTDVTVLLDIVCHPNTKTFYVRIKDDLKSKGVE \ + MSV199 238R_CIV_15078950 QEEERKLDRLMLTE------SRNMLQTMGIEIKTVKYNNNNLIDQNN------------------ELLERVDEVLHKVDVVQTKLNI--------------SVED-------RAPQ-PDKNKRRERFLLLKRNDE-----NYPYYTIRAQDINAKKALKR-QKDMFSDVTILLDLICHPNTKTFYVRIKDDLKKKGVE | 211L_CIV_15078923 RQYMQKMGITLED-------TREEVKKVNIQNKDIKAQNEEIKAQN-------------------------EDLAFDLSDVRDRLIE--------------AAED-------RSPK-LETKPLRERFVIIKRKDS-----SFPYYAIRGQDVYVKGRLTHFKNTRYPELKIIFDTNYQPNPRNLYIRFKELKDERFII | 148R_CIV_15078861 RLERKQSEERSIK-------QEQLLLSIGYNLKELQEQKEEDTQKID-----------VLIDQNEDLKQNIEETNDKLDSVVEKLGI--------------AVED-------RAPR-LKRASIRERFVLFKKNNSTNE--IYQYYAIRGQSVYVNGRLSKLQSEKYPDMIILIDIICQPNPRNLFLRFKERIDGKPEW | 388R_CIV_15079099 --EYSLYFKEREAQ-----------IEKQKSQFHIETLEKKLDEMK-----------LEAEKRHDELLDKVEEVQYDLNVVGEKLDI--------------AVED-------RAPK-VKAELLRERFVVLNRNDKRA---SCQYYVMRGQDHYINGKIFSYK-NLHPNLKIIFDISCQPNPRNLFVRFKELKDNRFKV | 212L_CIV_15078924 ----CVYFKEREAKLQIT-TLEQKLEQMNITMIEMKEEMNLSMEEHAD-------KLDTLVDQNEELKLDVSEANEKLETVTHKLGI--------------AVED-------RSPR-LEQKPLRERFVLFKRNVKNA---RFQYYAIRGQSIYVNGRLTL-YNERYPNLEIIIDIFCQPNPRNLFLRFKNYVKDDERF / 313L_CIV_15079025 IYASKRQEQMLLE-------SHNLLKSMGIEVKDIKEQNNELLNEVG-----------ELREDNNELQEQVENVQEQIQKVQVKLEI--------------SVED-------RAPQ-PDKRGKKERFILLKRNDE-----HYPYYTIRAQDINAKKAVKR-QQGKYEEVLILLDLVCHPNTKTFYVRIKDDLKKKGVK \ 006L_CIV_15078718 QEKNDKIDELILFSKRMEEDRKKDREMMIKQEKMLRELGIHLEDVSSQ--------NNELIEKVDEQVEQNAVLNFKIDNIQNKLEI--------------AVED-------RAPQ-PKQNLKRERFILLKRNDD-----YYPYYTIRAQDINARSALKR-QKNLYNEVSVLLDLTCHPNSKTLYVRVKDELKQKGVV | AMV112_AMV_9964426 TNIIEENEITIKQK-------DDKIDELIQINKRIEEQNIKLLKLAE-----------KQNIKLDEISDELDETNYKLDTLTQTVEEN-------------ILPD-------RNIQ-PNDINLKHNLVIY-KKI------NNIIKITRAQNKYINKIKIS-------EDNIIIKE-YVPNPIDFINRMKLYCIDLNKK | kilAN AMV110_AMV_9964424 IKQKDDKIDELNNKLD---IIITTNKILEQKSTNLENINNKLLKLAE-----------KQNIKLDEISDELDETNYKLDTLTQTVEEN-------------ILPD-------RNIQ-PNDINLKHNLVIY-KKI------NNIIKITRAQNKYINKIKIS-------EDNIIIKE-YVPNPIDFINRMKLYCIDLNKK | AMV024_AMV_9964338 INIVEDKELEINDLNKKLSDIINQNNKILESNKNLENQNKKLLKLAE-----------KQNIKLDEIGDELDETNFKLDTLTQTVEEN-------------ILPD-------RNIS-PKDVNLKHNLVIY-KN-------NNEIKIIRAQNKYINKIKIL-------DENIIIKE-YVPNPIDFINRMKLYCVDINKK | FPV124_FPV_9634794 HKFNNKYDKDTLE-------LKELYREQRKEAKSLRKINERIEEKYDK-------DTRELKQGLKELKDENKELKFEL----KKIEER--------------LRD-------KVIN-PFSPNKHHRLVILQKKID-----NNSFKTLRLQAERLNQEMNKY------KTNILYFL-MHTNLTQYPVLIG*-------- / consensus/95% ...........................................................................................................................p..h.hh.............h..h.st.......t............hlhp....PNs...h.ph.......... consensus/90% ............................................................................p......................................t.......p..hhhh.............h..h.sQp..hp..t.p.........pllhp....PNs...h.php..h...... consensus/85% ..................................h....pph..................................p......ph...................s..........t.s....tc..hhhh.............h.hlpsQp..hp..t.p.........pllhc..h.PNst..h.phpp.h...... http://www.bmm.icnet.uk/servers/3dpssm/output/d454e5b32aaf504f.job_summary.html #P63C ----------------------- p63_BPMx8_15320633 QAVAERFLGVGLAPYAKRFPTPFYEGIFRLRGWPWHGPGTP--RPGVIAYWTNDLVYERLAP-ELLRLLRERNPMDKDTGRRAAKHHQLLSEDIGHPALAVAAKVDALNLPLEQNQVRVVFNWLQ | + BRON orf12_BP933W_4499795 --ILEAFVAKEIQPYITTFPADYYEELFRLRGLE-YPPENPRFRPQYFGVLTNDIVYKRLAP-NILEELKKQNV----KASKGTKLFQGLTPNIGYQKL \ + kilAN Gp73_BPHK97_9634189 ------FLLDKSQPWEKRFSDPFYSAMFKMSGLPRHRPGR---RPSLFGMISAKWVYGPVLPPEVYAEVKRR-------LAAGDKIHQHLKPD / PHAGES: HOST phiPV83, phi ETA, phiSLT S aureus bIL285, bIL286, bIL311 pi3, BPphi31_1, TP901-1, RLT, BK5-T LL-H Tuc2009 Lactococcus LcBPA2 Lactobacillus TP-J34 Sfi21 Streptococcus thermophilus A118: Listeria monocytogenes HK97, N15 , HK620 , HK022 933W, VT2-Sa, phi-R73 P22 H-19B, P1 Escherichia coli D3 Pseudomonas APSE-1, GMSE-1 Acyrthosiphon pisum Endosymbionts Mx8 Myxococcus xanthus XF2506_Xf_11362060 M_XF2524_Xf_11362500 C_XF2524_11362500 N_XF2524_Xf_11362500 XF0684_Xf_11362477 XF1663_Xf_11362484 XF1645_Xf_11362483 XF0704_Xf_11362478