SUPPLEMENTARY MATERIAL
BEN: A novel domain in chromatin proteins and DNA viruses



Saraswathi Abhiman, Lakshminarayan M. Iyer, and L. Aravind*

* Address for correspondence: L. Aravind ([email protected])

National Center for Biotechnology Information, National Library of Medicine, National Institutes of Health, Bethesda, MD 20894, USA





We report a previously unidentified α-helical module, the BEN domain, in several animal proteins such as BANP/SMAR1, NAC1 and the Drosophila mod(mdg4) isoform C, in the chordopoxvirus virosomal protein E5R and in several proteins of polydnaviruses. Contextual analysis suggests that the BEN domain mediates protein-DNA and protein-protein interactions during chromatin organization and transcription. The presence of BEN domains in a poxviral early virosomal protein and in polydnaviral proteins suggests a possible role for them in organization of viral DNA during replication or transcription.


  1. Materials and Methods
  2. Comprehensive multiple alignment of the BEN domain
  3. Comprehensive list of proteins with BEN domains


MATERIALS AND METHODS

Profile-based searches were conducted using the PSI-BLAST and HMMER (Eddy, 1998) programs. PSI-BLAST (Altschul, et al., 1997)searches were performed against the nonredundant (NR) database of protein sequences (National Center for Biotechnology Information [NCBI], NIH, Bethesda, MD, USA) and a locally compiled database of unfinished eukaryotic genomes, with either a single sequence or an alignment used as query, with the default profile inclusion expectation (e) value set to a threshold of 0.01. Most searches were iterated until convergence. A statistical correction for compositional bias was used to reduce false positives (Schaffer, et al., 2001). In order to exhaustively recover all orthologs of the BEN domain, sequences recovered by each of these searches were further used as queries for PSI-BLAST searches. A single linkage clustering was of the retrived proteins was then obtained using the BLASTCLUST program (ftp://ftp.ncbi.nih.gov/blast/documents/blastclust.html). A comprehensive multiple alignment was generated using the KALIGN program (Lassmann and Sonnhammer, 2005) and was further adjusted manually based on PSI-BLAST results, secondary structure predictions from and multiple alignments of individual families. Protein secondary structure was predicted using the JPRED program (Cuff, et al., 1998) that uses information extracted from a PSSM, HMM, and the input seed alignment. Neighbor joining and UPGMA trees were constructed using the MEGA software (Kumar, et al., 2004) to evaluate the inter- and intra-familial relationships in BEN domain containing proteins. For domain context analysis, a library of profiles for various transcription factor and chromatin domains was prepared by extracting all alignments from the PFAM (Finn et al., 2006) and SMART (Letunic et al., 2006) databases and updating them by adding new members from the NR database. HMMs with the HMMer package (Eddy, 1998)and PSSMs with PSI-BLAST (Altschul et al., 1997) were then made using these updated alignments. Profile searches were then conducted on a locally compiled database of proteins with BEN domains using the HMMer and PSI-BLAST programs. This database included BEN domain containing proteins from eukaryotes with completed genomes, as well as unfinished genomes with a high coverage. PSI-BLAST searches were conducted with a default profile inclusion expectation (e) value threshold of 0.01 (unless specified otherwise), and were iterated until convergence. The in-house TASS package (Anantharaman V, Balaji S, Aravind L; unpublished) was used for the automation of all large-scale sequence analysis procedures.

References

  • Altschul, S.F., Madden, T.L., Schaffer, A.A., Zhang, J., Zhang, Z., Miller, W. and Lipman, D.J. (1997) Gapped BLAST and PSI-BLAST: a new generation of protein database search programs, Nucleic Acids Res, 25, 3389-3402.
  • Cuff, J.A., Clamp, M.E., Siddiqui, A.S., Finlay, M. and Barton, G.J. (1998) JPred: a consensus secondary structure prediction server, Bioinformatics, 14, 892-893.
  • Eddy, S.R. (1998) Profile hidden Markov models, Bioinformatics, 14, 755-763.
  • Finn, R.D., Mistry, J., Schuster-Bockler, B., Griffiths-Jones, S., Hollich, V., Lassmann, T., Moxon, S., Marshall, M., Khanna, A., Durbin, R., Eddy, S.R., Sonnhammer, E.L. and Bateman, A. (2006) Pfam: clans, web tools and services, Nucleic Acids Res, 34, D247-251.
  • Kumar, S., Tamura, K. and Nei, M. (2004) MEGA3: Integrated software for Molecu-lar Evolutionary Genetics Analysis and sequence alignment, Brief Bioinform, 5, 150-163.
  • Lassmann, T. and Sonnhammer, E.L. (2005) Kalign--an accurate and fast multiple sequence alignment algorithm, BMC Bioinformatics, 6, 298.
  • Letunic, I., Copley, R.R., Pils, B., Pinkert, S., Schultz, J. and Bork, P. (2006) SMART 5: domains in the context of genomes and networks, Nucleic Acids Res, 34, D257-260.
  • Schaffer, A.A., Aravind, L., Madden, T.L., Shavirin, S., Spouge, J.L., Wolf, Y.I., Koonin, E.V. and Altschul, S.F. (2001) Improving the accuracy of PSI-BLAST protein database searches with composition-based statistics and other refinements, Nucleic Acids Res, 29, 2994-3005.




2. Comprehensive multiple alignment of the BEN domain

Secondary Structure - - - - - - H H H H H H H H H - - - - - - - - - - - - - H H H H H H H H H H H - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - H H H H H H H H H H H H H H - - - - - - - - - - - - - - - - - H H H H H H H - - H H H H H H H H - -
insv_Dmel_24581162 P N N T C V P A S V F E N I N W S - - - - - - V C - - - S L A T R K L L V T I F D R E T L A T H - S M T G K P S P A F - - - - - - - - - - - - K D Q D K P L K R M L D P G K I Q D I I F A V T H K C N A S E K - - - - - - - - - - - E V R N A I T - - T K C A D E N K M M 259-356\Insensitive-like
AgaP_ENSANGG00000025789_Agam_118791739 S N N T L V P K R A L E A V R W H - - - - - - S Y - - - K F G T R K L L Q M L F T R E T L A S C - S L S G R P C P A R I N - - - - - - - - - - E V D R P V K G - A L P P K V V A D I V E Y V M K K C N V E E C - - - - - - - - - - - H V R G V I T - - N K C A D E N K M L 619-717|
AaeL_AAEL001891_Aaeg_157124999 P N N T M V K K Q I L R A I N W T - - - - - - N Y - - - K A A T R K L L M T L F S R E V L A S H - T L T G R P S P A F M - - - - - - - - - - - G D R A K P V K D K L D Q K I I A D I I S I V A K T C S V S E P - - - - - - - - - - - M V R T A I T - - T K C A D E N K M S 500-598|
LOC661906_Tcas_91091380 G E G I E I Y E D Q L R S V K W N - - - - - - D Y - - - R K L T R G L A T I L F S P A E L A T C - S V T G Q R W S R A G S - - - - - - - - - - - G E R P V K P - A L D R S K V Q A I I S Y V A T R F P M V E I S - - - - - - - - - - R I K Q V L A - - Y K C K E N S T A F 356-454|
Dpul1000028485_Dpul_Dpul1000028485 I Q Q S G M E E E Q F D I H S R P - - - - - - K D - - F S K V T K N L V M G L F N N D E L M T S - T L T G N P K - - - - - - - - - - - - - - - - - D K D V L P N V L D K P K I T L I H N Y V T A R F P G T T I S - - - - - - - - - - Q V N K A I G - - E K L R D Y R K S K 312-406|
LOC724266_Amel_110759165 G E G I A I C E E Q L R A V K W S - - - - - - D Y - - - R K L T R G L A A I L F S P T E L A T C - S V T G Q R W S R A G T - - - - - - - - - - A T E R P V K P - A L D K A K V Q A I I S Y V T S R F P T V D V S - - - - - - - - - - S V K Q V L A - - Y K C K E N S T A L 27-126|
bsg25A_Dmel_1930012 P N G T E V S R I S L S A I N W D - - - - - - M T - - G P S I T R K L L C E I F D R D T L A H H - T L S G K P S P A F - - - - - - - - - - - - R D C A R P S K Q Q L D P L K V A D L V Y L M T N S L D M T P R - - - - - - - - - - - E V R T A I T - - T K C A D E N K M L 102-200|
Dpse_GA11475_Dpse_125986627 P N G T Q V S R N S L S A L N W G - - - - - - M T - - G P S I T R K L L C E I F D R D T L A H H - T L S G K P S P A F - - - - - - - - - - - - R D C A R P S K Q Q L D P L K V A D L V Y L M T N T L D M T P R - - - - - - - - - - - E V R T A I T - - T K C A D E N K M L 254-352|
CG9883_Dmel_19920584 P N G T Q I T A H Q Y G E V F W T - - - - - - N A - - - P V A T R C L L C V V F S S D E L A T H - T L T G K P S P A F - - - - - - - - - - - - Y G R E R P P K L Q L D Q R K V D D I V V C V R N R T G G K E R - - - - - - - - - - - V I R A T I T - - T K C A D T A K K Y 268-365|
LOC792774_Drer_125835679 P N N S - I T G E Q F A H V N C T - - - - - - D P - - - K K A T K D L L V A V F G R N I L G T H - C Y T G K C S K A F - - - - - - - - - - - - R - - D K A V K P R L D S K K S L T S S V R F F L Q K I F W K - - - - - - - - - - - - Y C H I L R Y - - - - - - - - - - - - 50-128/note_fragmented
Dpul1000009773_Dpul_Dpul1000009773 G T K F P K T M - K V V E K L S K D A - - - - - - - K I N L F I T M L L Y Y L F T E E F M A K T - S V T N K R N K S A - - - - - - - - - - - - - - S P S E K G - Q L D E K E L R E I I D V L N M R C P P S D V D - - - - - A Q E K - F I R T T V W - - V K F N N V A A K L 130-230
Dpul1000017063_Dpul_Dpul1000017063 P S K F S I R R - E I A K T I Y N R V - - - - G R - K M S L F T R K M M D F L F P V S Y L T S H - R M T D N - - - - - - - - - - - - - - - - - - - G Y S G K Q - A A D K Q H I S E L I N C V L S F F P E E K E S - - - - - - - - - - A I R T M I R - - - - - - - - - - - - 147-230
C1orf165_Hsap_13375807 G S G I W V D E E K W H Q L Q V T Q - - - - - G D - - - S K Y T K N L A V M I W G T D V L K N R - S V T G V A T K K - - - - - - - - - - - - - K K D A V P K P - P L S P R K L S I V R E C L Y D R I A Q E T - - - - - - - - - - - - V D E T E I A - - Q R L S K V N K Y I 133-228\C1orf165
2310026E23Rik_Mmus_28175231 G S G I W V D E E K W H Q L Q V T Q - - - - - G D - - - S K Y T K N L A V M I W G T D V L K N R - S V T G V A T K K - - - - - - - - - - - - - K K D A I P K P - P L S P H K L S I V R E C L Y D R I A Q E T - - - - - - - - - - - - V D E T E I A - - Q R L S K V N K Y I 302-397|
LOC100086099_Oana_149452637 G S G I W V D E E K W H Q L Q V T Q - - - - - G D - - - S K Y T K N L A V M I W G T D V L K N R - S V T G V A T K K - - - - - - - - - - - - - K K D A V P K P - P L S P H K L S I V R E C L Y D R I A Q E T - - - - - - - - - - - - V D E T E I A - - Q R L S K V N K Y I 282-377|
LOC566161_Drer_125823408 G G G I W V D E E K W H Q L Q R T Q - - - - - G D - - - S K F T K N L A V M I W G T E T L K N R - S V T G V A T K K - - - - - - - - - - - - - K K D A L P K P - P L S P S K L K I V R E C L Y D R V S Q E T - - - - - - - - - - - - A D S A E I T - - Q R L S K V N K Y I 273-368|
LOC424628_Ggal_118094515 G S G I W V D E E K W H Q L Q V T Q - - - - - G D - - - S K Y T K N L A V M I W G T D V L K N R - S V T G V A T K K - - - - - - - - - - - - - K K D A V P K P - P L S P H K L S I V R E C L Y D R I A Q E T - - - - - - - - - - - - V D E T E I A - - Q R L S K V N K Y I 297-392|
LOC792358_Drer_125843659 G D N I W I R E E V W K K I K S A A - - - - - K D - - - S L F V K E M A V A L W G T A T L K T R - S V S G K E C P T - - - - - - - - - - - - - - K N S S A K P - P L T P S K L Q V L R V C F S D W L K Q K E - - - - - - - - - - - - P E R K E R E - - K R E N Q V G Y Y I 107-201|
AaeL_AAEL003998_Aaeg_157104034 V R D S L I P Y Q T M V D I D S V E - - P G E K Y - - D L R F V S K L A L A L W G H E R L A V S - S V T G R K S N N A - - - - - - - - - - - - S N N S T P S I - Q L E P E K L S F I K E K V Y H R A M Q E T N D - - - - - - - - - - R V Q A M A R - - F D D S R I N R L L 364-466|
AaeL_AAEL002989_Aaeg_157167939 T M H P A I P E I S T E F L Q N L N Y N S G P G E K G D R L F I S K L A V A V F G V D V L V N S - S V T G K P S N A - - - - - - - - - - - - - H H N I P P K P - P L C P E K I A A I E A K L Y E R V E Q E V G R - - - - - - - - - - S N R A E L L - Q R S G E K V V R L V 233-339|
AaeL_AAEL003984_Aaeg_157104040 E C P G Y I P A F R L K M F S D M A - - - - - N S - - - D Y L F V K S I M E D L W P D G F A G R - S V T G R A S N N A S G R S G K A T L P T A P P E Q S P K V - P L E A H K V E Y I R D R L L E R R I L L G - - - - - - - - - - - - D D R I K A I - - H E C K Q A N K L M 288-396|
AaeL_AAEL003016_Aaeg_157167933 K S N F V D P K V I E K I N A E C P V G - - L E G - - D G K F I S A L A D Q L W T R D Q L A A R - S V R G K Q C Y R - - - - - - - - - - - - - H P E S S T H I - P A S P T K V S F I H D K L M Q R I N L E E - - - - - - - - - - - - R E D D D P R R K L Q S R V S H K L N 304-405|
AaeL_AAEL003004_Aaeg_157167937 T T V E D I P I T R Q E L E A I N D R - - - S S S - - D S M F V G M L I T R L V A P N E L I N M - S C T G H A S L R F - - - - A R L K K P D G S P M Y P P T E - K I D P R I F D F V C N K V A E R T A M R I G - - - - - - - - - - - L D D I H T I R K N S D E R I I K R Y 302-412|
AaeL_AAEL013916_Aaeg_157138093 L T F T T D P E I D M T I N Q I D Q V N K A F K S - - D M N F G Q N L A L F F Y G A Q T L K E M - S V T G T A T H R - - - - - - - - - - - - - F K N A Q P K T - G I S P K K L S F I H E K V Q Q R V A A R V G V S - - - - - - - - - N L S L I N A V - A H P T I V N K I A 23-126/
mod(mdg4)_Agam_119112359 G S S V Y I A T K D L L S I Y T S - - - - - - K P - - - A V Y T G R L I Q L M F G L D T L K I S - C L D S K E R V - - - - - - - - - - - - - - - - N T D L V P - - L D P T T L E A V I T H I V D V F Q Q Q K Q H I T P G - - - - - - M V R N F I R - - N R L D L L R T N L 449-545\mod(mdg4)
mod(mdg4)_Dmel_24648736 G S R V F V S K V A L A K A Y I P - - - - - - M P - - - M I Y T C R V M D L V I G K D K L - V R - I A Q H E E T T - - - - - - - - - - - - - - - - - - - - - - - - - D K D L I Q D I I T H V C K V F A L R G N Q L T P S - - - - - - A V Q E F I D - - H K L S T L K L M P 441-529/
LOC590134_Spur_115729679 G N T F A M D Q L D Y D D A H Q G C S - - - - G P - - - K S L T I R L L T K V F T R E E L A R S - N F R G G E V Y T - - - - - - - - - - - - - G K E W V T K E S L K R K L A F Q A I I A Q V G Q Q F P G S T A T - - - - - T L F Q K E I C E A V N - - T K C R K T E R V - 524-627
NAC1_Hsap_16418383 G T N V Y I T R A Q L M N C H V S A G T - - - R H - - - K V L L R R L L A S F F D R N T L A N S - C G T G I R S S T - - - - - - - - - - - - - - - N D P R R K - P L D S R V L H A V K Y Y C Q N F A P N F K E S - - - - - - - - - - E M N A I A A - - D M C T N A R R V V 374-471\NAC1
LOC100020016_Mdom_126308327 N S G V Y I T H H Q L D D L S Q V S T D - - - K P - - - K L M T R R M L D Y F F S R E T L A R S - S A T G Q R I A H N N - - - - - - - - - - - - T T M E K P I - R L P V A V V N A I K E Y V T K V C G R G C - - - - - - - - - - - - N F N A V I N - - S K C G T S R R A V 349-447|
LOC495228_Xlae_148236339 S S G V Y I T Y Q Q L E D L S H I P P D - - - K P - - - K L M T R R L L D Y F F S R E T L A R S - S A T G Q R I A H N N - - - - - - - - - - - - T T M E K P L - R L P D K V V T A I K A Y V T R A C G R G C - - - - - - - - - - - - N F N A V I N - - S K C G T S R R A V 348-446|
LOC721794_Mmul_109109824 G S G V Y I T R G Q L M N C H L C A G V - - - K H - - - K V L L R R L L A T F F D R N T L A N S - C G T G I R S S T - - - - - - - - - - - - - - - S D P S R K - P L D S R V L N A V K L Y C Q N F A P S F K E S - - - - - - - - - - E M N V I A A - - D M C T N A R R V R 350-447/
CCDC4_Hsap_83287964 N Y P V Y I T S K Q W D E A V N S S K - - - - K D - - G R R L L R Y L I R F V F T T D E L K Y S - C G L G K R K R S V Q - - - - - - - - - - - S G E T G P E R R P L D P V K V T C L R E F I R M H C T S N P - - - - - - - D W W M - P S E E Q I N - - K V F S D A V G H A 386-490\CCDC4
LOC702395_Mmul_109074112 N Y P V Y I T S K Q W D E A V N S S K - - - - K D - - G R R L L R Y L I R F V F T T D E L K Y S - C G L G K R K R S V Q - - - - - - - - - - - S G E T G P E R R P L D P V K V T C L R E F I R M H C T S N P - - - - - - - D W W M - P S E E Q I N - - K V F S D A V G H A 386-490|
LOC100015448_Mdom_126331687 N Y P V Y I T S K Q W D E A V N S S K - - - - K D - - G R R L L R Y L I R F V F T T D E L K Y S - C G L G K R K R S V Q - - - - - - - - - - - S G E T G P E R R P L D P V K V T C L R E F I R M H C T S N P - - - - - - - D W W M - P S E E Q I N - - K V F S D A V G H A 401-505|
GSTEN:00029264:G:001_Tnig_47226171 N Y P L F I T N K Q W D E A V N S S K - - - - K D - - G R R L L R Y L I R F V F T T D E L K F S - C G L G K R K R S V H - - - - - - - - - - - S G D S G L E R R P L N P V K V S C L R E F I R M H C A S N P - - - - - - - D W W M - P S E E Q I N - - K V F S D A V G H A 255-359|
LOC560711_Drer_125843107 D Y D V F I P K A Q L D S I L L N Y T - - - - R S - - G S L L F R K L V C A F F D D T T L A N S - L P N G K R K R - - - - - - - - - - - - - - - G L N D T R K - G L D Q N I V G A I K V F T E K Y C T A N G I E K L P G P R D W V - Q I L Q D Q I - - K L A R R R L K R G 267-373|
C10orf30_Hsap_21618768 G F D V F M P K S Q L D S I L S N Y T - - - - R S - - G S L L F R K L V C A F F D D K T L A N S - L P N G K R K R - - - - - - - - - - - - - - - G L N D N R K - G L D Q N I V G A I K V F T E K Y C T A N H V D K L P G P R D W V - Q I L Q D Q I - - K L A R R R L K R G 239-345|
LOC100014309_Mdom_126340438 G F D V F M P K S Q L D S I L S N Y T - - - - R S - - G S L L F R K L V C A F F D D K T L A N S - L P N G K R K R - - - - - - - - - - - - - - - G L N D N R K - G L D Q N I V G A I K V F T E K Y C T A N H V D K L P G P R D W V - Q I L Q D Q I - - K L A R R R L K R G 286-392|
DUF1172_Mmus_26343845 G F D V F M P K S Q L D S I L S N Y T - - - - R S - - G S L L F R K L V C A F F D D K T L A N S - L P N G K R K R - - - - - - - - - - - - - - - G F N D N R K - G L D Q N I V G A I K V F T E N Y C T A N H V D K L P G P R D W V - Q I L Q D Q I - - K L A R R R L K R G 289-395|
LOC419037_Ggal_118081981 G F D V F M P K S Q L D S I L S N Y T - - - - R S - - G S L L F R K L V C A F F D D Q T L A N S - L P N G K R K R - - - - - - - - - - - - - - - G L N D N R K - G L D Q N I V G A I K V F T E R Y C T A N H V D K L P G P R D W V - Q I L Q D Q I - - K L A R R R L K R G 286-392/
CcBV_3.4_CcBV_57753424 R T G V Y V K R K E L K R C I R E S - - - - - N D - - C R T L A R L L L T E V F S Q N A L S V C - T W T G G K A K A F N - - - - - - - - - - - S V N I D I R P - G L D E N A R M V L L T F V E Q H - G K K - - - - - - - - C G W S M A N T S A V M - - S T I R T K I N D I 1112-1213\polydnavirus
CcBV_3.3_CcBV_57753423 Y D D C G G D K D V K A H C K L A A - - - - - K E - - S K E L A R M L L V E I F S Q S A L N V C - S L T S V R A N A F D - - - - - - - - - - - I S G T N V R P - G L D E K A R I T I L S F V K E H - A L E - - - - - - - - K N W S P F D S Q S V I - - N S L R S K I Q D E 487-588|
CcBVs6gp3_CcBV_57753417 Q R G V W V S Y G D L K Y C Q Q V S - - - - - K D - - C K S L A R R L L L A V F N R K A L S V C - L S I T E R A Q A S D - - - - - - - - - - - N V G S N A R P - E L D D H A C T V L L N F V L E H - G L Q - - - - - - - - R G W N - T D I Q P I L - - N T L H S K I Q E I 1083-1183|
CcBV_9.1_CcBV_57659251 Q S G V Y V S C G D L K Y C Q Q V S - - - - - K D - - C K S L A R R L L P Q V F N R N A L S V C - S S M S E K A Q A S N - - - - - - - - - - - N V G S N I R P - D L D D H A C S V L L N F V L E H - G L Q - - - - - - - - R G W N - T D L Q P I L - - S T L H N K M Q E I 586-686|
CcBV_12.2_CcBV_57753397 H S R V Y I D A I Q L S N I K I M S - - - - - K D - - S K T L A R S L L L E I F T E N A L S I C - S L T G K K A N A F D - - - - - - - - - - - L E G T S V R P - G L D E H A R T V L L N Y V K K H - A T G - - - - - - - - Q K W V E F D S Q L I L - - N T I R N K M Q E M 369-470|
CcBVs18gp6_CcBV_57753384 Y S G L Y V N A T N L K H C N E L A - - - - - M D - - C K S L A Q L L M L E V F S E S A L K V C - S L T G A K A T C F R - - - - - - - - - - - G T K T D V R P - G L D K D E R A I L V R Y V E I Y - G E K - - - - - - - - Q R W C T E D H R A I I - - N V M R N K L Y S S 150-251|
CcBVs18gp7_CcBV_57753385 G T D V Y I E V S K L N F C M R S S - - - - - K K - - C T E L T R L L T K H V F T E F A L S K C - K Y I N - - - - - - - - - - - - - - - - - - N V K G R E N L - R L D T G A V T A I I Y F V S A Y - G Y N - - - - - - - - H S W R P S N E K S I K - - A A M R Y E L I S A 325-419|
CcBV_20.2_CcBV_57659442 Q S G V Y V S Y G D L K Y C Q Q V S - - - - - K D - - C K S L A R R L L P Q V F N R N A L S V C S S M S E K A Q A S N - - - - - - - - - - - - N V G S T I R P - D L D D H A C S V L L N F V L Q H - G L Q - - - - - - - - R G W N - T D L Q P I L - - S T L H S K M Q E I 981-1081|
CcBV_23.1_CcBV_57659520 Q S D V Y V S Y G D L K Y C Q Q V S - - - - - K D - - C K S L A L R L L P A V F N S K A L S V C L S I T E R A Q A S D - - - - - - - - - - - - N V R S N A R P - E L D D H A C S A L L N F V L E H - G L Q - - - - - - - - R G W N - T D L Q P I L - - S T L H S K I Q E I 800-900|
CcBV_25.2_CcBV_57659551 Q S G V Y V S Y G D L K Y C Q Q V S - - - - - K D - - C Q S L A K R L L S A V F N R K A L S V C F S M T E K A Q A P D - - - - - - - - - - - - N V V S N I R P - E L D D N A C T V L L N F V L E H - G L Q - - - - - - - - R G W N - T D L Q P I L - - S T L H C K I Q E I 665-765|
CpBV-HP501_CpPV_69951893 Q S G V Y I S Y G D L I Y C Q Q V S - - - - - K D - - C K S L A L R L L S A I F N R K A L S V C L S M T E R A Q A S D - - - - - - - - - - - - D V R S N I R P - E L D D H A C S V L L N F V V E H - G L Q - - - - - - - - R G W N - T D L Q P I L - - S I L H S K I Q E I 1259-1359|
CpBV-HP3702_CpPV_118139754 Q S G V Y V S K G D L L Y C Q Q V S - - - - - K D - - C K S L A Q R L L P Q V F S R N A L S I C L S M S E K A Q A A N - - - - - - - - - - - - N V G S S I R P - D L D D H A C S V L L N F V I E H - G L Q - - - - - - - - R G W N - T D L E P I L - - S I L H S K M Q E I 711-811|
CpBV-HP3301_CpPV_62903502 N T G V Y I K D S E L N F C L Q R S - - - - - R R - - S T H L S R L L T K H V F A E S A L N K C R Y L N K K R V G Y H - - - - - - - - - - - - - - - - - - S L - Q L D T G A I T A I I H F V L T Y - G Y V - - - - - - - - H N W R P S S E K S I K - - A A I R Y Q L I L A 325-420|
CpBV-HP3302_CpPV_56554931 Y S G V Y V N A L K L K Y C N E L A - - - - - T D - - C R S L T R M L M V E V F S N S A L K V C S L T G A R P T F Y R - - - - - - - - - - - - G P K T E V R P - G L D Q D A R D I L V K Y V E M Y - G K L - - - - - - - - K G W C T E D R R V I I - - N T M R S K L C S S 156-257|
CpBV-4801_CpPV_118139774 Q S G V Y V S K G D L L Y C Q Q V S - - - - - K D - - C K S L A Q R L L P Q V F S R N A L S I C L S M S E K A Q A A N - - - - - - - - - - - - N V G S S I R P - D L D D H A C S V L L N F V I E H - G L Q - - - - - - - - R G W N - T D L E P I L - - S I L H S K M E E I 979-1079|
CpBV-HP301_CpPV_69951585 Q T G V Y V S Y G D L I Y C Q Q V S - - - - - K D - - C K S L A L R L L T A V F N R K A L S V C L S M T E K V Q A P D - - - - - - - - - - - - D V G S N I R P - E L D D H A C S A L L N F V V E L - G L Q - - - - - - - - R G W N - T D L Q P I L - - S T L H S K I Q E I 1266-1366|
CpBV-HP5101_CpPV_118139786 Q S G V Y V S K G D L L Y C Q Q V S - - - - - K D - - C K S L A Q R L L P Q V F S R N A L S I C L S M S E K A Q A A N - - - - - - - - - - - - N V G S S I R P - D L D D H A C S V L L N F V I E H - G L Q - - - - - - - - R G W N - T D L E P I L - - S I L H S K M E E I 979-1079|
CpBV-HP1102_CpPV_69952192 Q S G V Y I S Y G D L I Y C Q Q V S - - - - - K D - - C K S L A L R L L S A I F N R K A L S V C L S M T E R A Q A S D - - - - - - - - - - - - D V R S N I R P - E L D D H A C S V L L N F V V E H - G L Q - - - - - - - - R G W N - T D L Q P I L - - S I L H S K I Q E I 1237-1337|
CpBV-HP1001_CpPV_118139710 Q S G V Y V S K G D L L Y C Q Q V S - - - - - K D - - C K S L A Q R L L P Q V F S R N A L S I C L S M S E K A Q A A N - - - - - - - - - - - - N V G S S I R P - D L D D H A C S V L L N F V I E H - G L Q - - - - - - - - R G W N - T D L E P I L - - S I L H S K M Q E I 726-826|
CpBV-HP402_CpPV_69951773 Q S G V Y V S K G D L L Y C Q Q V S - - - - - K D - - C K S L A Q R L L P Q V F S R N A L S I C L S M S E K A Q A A N - - - - - - - - - - - - N V G S S I R P - D L D D H A C S V L L N F V I E H - G L Q - - - - - - - - R G W N - T D L E P I L - - S I L H S K M Q E I 726-826|
DUF1172_CpPV_118139720 N S R V Y I D A I Q L S N I K I T S - - - - - K D - - S K T L A R T L L L E I F T E N A L S I C S L T G K K A N A F D - - - - - - - - - - - - L E G T S V R P - G L D E H A R T V L L D Y V K K H - S A E - - - - - - - - Q N W V E F D A Q L I S - - N T I R N K M Q E M 535-636|
GIP_L1_00570_Gind_117935418 Q S G I Y V S Y G E L K Y C Q Q V S - - - - - K D - - C K S L A R R L L P E V F N R K A L G V C L S M S E K A H A S N - - - - - - - - - - - - N V G S N L R P - E L D K H A S K V L L N F V I D Y - G L H - - - - - - - - C G W N - T D S K P I L - - S T L H S K I Q E T 785-885|
GIP_L1_00370_Gind_117935398 Q S D I Y V S Y G E L K Y C Q Q V S - - - - - K D - - C K S L A R R L L P E V F N R K A L G V C L S M S E K A Q A S N - - - - - - - - - - - - N V G S N L R P - E L D E H A S K V L L N F V I D Y - G L Q - - - - - - - - C G W N - T D L K P I L - - K T L H S K V Q E I 787-887|
GIP_L1_00580_Gind_117935419 Q S D I Y V S Y G E L K Y C Q Q V S - - - - - K D - - C K S L A R R L L T E V F N K K A L S V C L S M S E K A Q A S N - - - - - - - - - - - - N V G S N L R P - E L D E H A S K V L L N F V I D Y - G L Q - - - - - - - - C G W N - T D L K P I L - - D T L H S K I Q D I 955-1055|
MdBV_sBgp1_MdBV_66391199 H T N V Y I N A I K L S N C K R L S - - - - - K D - - C K S L A R L L L V E I F T K S A L T I C S L T G S R A R A Y D - - - - - - - - - - - - V E G A T I R P - G L D E T A R T V L L T Y V E E Y - G R E - - - - - - - - K G W I T L D T Q S I Q - - N S I R N K M Q E F 142-243/
C6orf65_Hsap_148806920 E K Q F Q I E K W Q I A R C N - - - - - - - - K S - K P Q K F I N D L M Q V L Y T N E Y M A T H - S L T G A K S S T S R - - - - - - - - - - - - - D K A V K P - A M N Q N E V Q E I I G V T K Q L F P N T D D V - - - - - S I R R - M I G Q K L N - - - N C T K K P N L S 171-270\C6orf65
Dpul1000022193_Dpul_Dpul1000022193 P N A L K N A V S Y G K G G K K - - - - - - - K T - D M N A M I N V I I E D L W D R K F M S E H - S L S G N K A K N D K S - - - - - - - - - - - - D K P A K P - A L P T N S V Q A I I N Y V S E F W R K Q Y T I - - - - - - T L E - A K H I R S A - - - N S T K L S T K N 197-297|
LOC794392_Drer_125831342 - Y T E F I T P - E L L E R C N T - - - - - - G T - T A Q K L T N D L L R G L Y E R E C L A S H - S I S G V V Y N K - - - - - - - - - - - - - - - R G Q P K P - A L P T E E V Q A I L R T V Q Y F F P G K T D A - - - - - E I K G - Y I R Q K L Q - - N E A K R L R K K P 202-300|
LOC712498_Mmul_109071583 E K Q F Q I E K W Q I A R C N - - - - - - - - K S - K P Q K F I N D L M Q V L Y T N E Y M A T H - S L T G A K S S T S R - - - - - - - - - - - - - D K A V K P - A M N Q N E V Q E I I G V T K Q L F P N T D D V - - - - - S I R R - M I G Q K L N - - - N C T K K P N V S 169-268|
B230209C24Rik_Mmus_148682498 E K Q F T I E R W Q I A R C N - - - - - - - - K S - K P Q K F I N D L M Q V L Y T N E Y M A T H - S L T G A K S S T S R - - - - - - - - - - - - - D K V V K P - A M N Q N E V Q E I I G V T K Q V F P S A D D V - - - - - S I R R - M I G Q K L N - - - N C T K K P N A S 175-274|
LOC775985_Ggal_118088941 E K Q F K I E K W Q I A L C N - - - - - - - - K S - K P Q K F I N D L M Q A L Y T H E Y M A T H - S L T G A K S S S S K - - - - - - - - - - - - - D K A A K P - A M N Q N E V Q E I I G I T K Q L F P N T D D A - - - - - L I R R - M M G Q K L N - - - N C T K K P I L S 170-269|
LOC100018717_Mdom_126310289 D K E F K I E K W Q I A L C N - - - - - - - - K S - K P Q K F I N D I M Q A L Y T N E Y M A T H - S L T G A K S S S S K - - - - - - - - - - - - - E K A A K P - A M N Q N E V Q E I I G V T K Q L F P N T D D A - - - - - L I R R - M M G Q K L N - - - N S T K K P I L S 249-348/
BANP_Hsap_74729731 V R C A I I P S - D M L H I S T N C - - - - - R T - - A E K M A L T L L D Y L F H R E V Q A V S - N L S G Q G K H - - - - - - - - - - - - - - - - - - - G K K - Q L D P L T I Y G I R C H L F Y K F G I T E - - - - - - - S D W Y - R I K Q S I D - - S K C R T A W R R K 255-348\BANP/SMAR1
SMAR1_Mmus_10312104 V R C A I I P S - D M L H I S T N C - - - - - R T - - A E K M A L T L L D Y L F H R E V Q A V S - N L S G Q G K H - - - - - - - - - - - - - - - - - - - G K K - Q L D P L T I Y G I R C H L F Y K F G I T E - - - - - - - S D W Y - R I K Q S I D - - S K C R T A W R R K 237-330|
Bflo1000030747_Bflo_Bflo1000030747 V R C P I T P S - D L L H I H Q T C - - - - - R T - - A E R M A L I L L D Y L F D R E T Q A M S - N I S G M G R H - - - - - - - - - - - - - - - - - - - G K K - Q L D P L M I Y G I R C H L I S K F G I T D - - - - - - - A D W H - R I K Q N I D - - S K C R T S F R R R 233-326|
LOC575996_Spur_115728493 V R C K I N P T - E M V H I M N M Y - - - - - K T - - A D K L A L K L L D L L F D K E M Q A V S - N L S G T G K H - - - - - - - - - - - - - - - - - - - K K K - K L D P L L I Y G I H C H L V K H C G I T H - - - - - - - E D W Y - R I R Q N I D - - S K C R T A F R R K 278-371|
Caps100OO0016701_Caps_Caps100OO0016701 V R V P I T P S - D L L H I H S N C - - - - - R T - - P E K M A L S L L D Y L F D R D T Q A T S - N L S G M G K H - - - - - - - - - - - - - - - - - - - G K K - Q L D P L M I Y G I R C H L I Q R F G I T E - - - - - - - Q D W H - R I K Q N I D - - S K C R T A F R R R 228-321|
LOC409936_Amel_48138870 Y A D L L P S P - E I I S T W A S - - - - - - S R - P M D H L A C D L M K I L F T P E E R I L C - N V N G K M - - - - - - - - - - - - - - - - - - - - - G K Q - Q F D S N K I H L I R E V L L H F S G I A P - - - - - - - - N S V - E W E E T W K - - - N C V T K I D T S 276-365|
Lgig1000012318_Lgig_Lgig1000012318 V R T P I S K S - D L L H I H S N C - - - - - R T - - S E K M A L T L L D Y L F D R D T Q A N S - N L S G M G R H - - - - - - - - - - - - - - - - - - - G K K - Q L D P L M I Y G I R C H L I H R F N I S E - - - - - - - G D W H - R I K Q N I D - - S K C R T A F R R R 189-282/
Lgig1000009088_Lgig_Lgig1000009088 D P W M I T M D - S L K D I K F Q S - - - - - K T - - P E V F A R N L M Y R M F N I G E L K - - - - - D G M - - - - - - - - - - - - - - - - - - - - - - K K G - C L N V T K L K I I Q E L T F D M F T G M E - - - - - - - V S W S - D C V S M I N - - D S I R Y S I - L L 301-386
Lgig1000015905_Lgig_Lgig1000015905 D V S F F T E T - I L L H L T S V S - - - - - S R - - P E V F A R N L L Y R M F T L A E L Q N S - S V Y G R G F T - - - - - - - - - - - - - - - - N E Q R N T - A L N V D K I R L I R E L T F V L Y S N V R - - - - - - - F S W R - N C V L W M D - - K S I K Y V N K L L 510-606
Bflo1000009049_Bflo_Bflo1000009049 H G V A I Y P H - L L T K A R N S G - - - - - K T - - P E Q F F K I L M G L Y F T E E E L F R G - N L T G G - - - - - - - - - - - - - - - - - - - G D R N H Q - A L N P A I L G A I L A E T R L Q Y E G K Q V - - - - - - - - - - - G I Y K V V N - - E K C C R V R A T V 332-422
Dpul1000023096_Dpul_Dpul1000023096 H M S S E D L - - D Y C N M M A R N - - - - - - N - - F T K M I S L M M G K V F T V E E L T T K - S L T G K - - - - - - - - - - - - - - - - - - - - R T T K P - A L P V D K V N A V A K Y I L K R H P D K H I G - - - - - - - - - - E F N Q K V T - - N Y L R D Q A A K S 266-354
Dpul1000001605_Dpul_Dpul1000001605 A L V Q F L S E E Q L N K A C L H T S - - - - - - - N Y K K I I G N L M E C L F T E E E M L S C - S V T G I N T - - - - - - - - - - - - - - - - - - - - S K Q - P L D A C K T S A L I D F I T A N F P H V T V P - - - - - - - - - - Q V K Q R M G - - N K L R D K R Q L Q 168-259
Dpul1000013633_Dpul_Dpul1000013633 L E H M S E E - - D F E C V N M T A R - - - - N S - - F S K L I A R L M D K M F K E E E L A T H - S L S G L R I - - - - - - - - - - - - - - - - - - - - T K P - P L P A N K V A A V S R Y V L K K H P E K T T T - - - - - - - - - - D F N A K V T - - N N L R D K A - L K 241-330
Dpul1000023017_Dpul_Dpul1000023017 Y K H G S K E - - T T D A I C A T H D - - - - S N - - S N K M A G N L L G A L F T T E E L L N S - T V K G S K E - - - - - - - - - - - - - - - - - - - - H P N - I L E A E K M V L I R D F I M K K F P G P T A V - - - - - - - - - - K I N N A F G - - V K I R N F R N S K 26-116
NEMVEDRAFT_v1g232490_Nvec_156390312 P H I S D A E L Q S L R D E K R K - - - - - - K P - - - E N L A V V L L R R L T T R Q E R E G R - T V C G F - - - - - - - - - - - - - - - - - - - - - - G G S - G L D N D V V Q D I R R Y F Y R A L P D F P Q - - - - - - D K W G - Q C I S A M N - - S Y L R G T R R K R 285-375>
Bflo1000016838_Bflo_Bflo1000016838 E Q G V V T Y P Y I L A Q A K N K S - - - - - K T - - P E Q F F K I L M G V Y F T E E E L L N G - N L H G G G T H - - - - - - - - - - - - - - - - - - - - - Q - A L S P A I I S A I L T E T K K Q Y G G V Q V - - - - - - - - - - - K L Y R V V N - - E K C G R M R A T L 51-140\
Bflo1000018302_Bflo_Bflo1000018302 E Q G V V T Y P Y I L A Q A K N K S - - - - - K T - - P E Q F F K I L M G I Y F T E E E L L N G - N L H G G G T H - - - - - - - - - - - - - - - - - - - - - Q - A L S P A I I S A I L T E T K K Q Y G G V Q V - - - - - - - - - - - K L Y R V V N - - E K C G R M R A T L 140-229|
Bflo1000028913_Bflo_Bflo1000028913 E H G V L T Y P Y I L T Q A K N K A - - - - - K T - - P E Q F F K E L M G I Y F T E D E L F R G - N L H G G G N N - - - - - - - - - - - - - - - - - - - - - E - A L N P A I I G A I L A E T K R Q Y G G V K V - - - - - - - - - - - R L Y R V V N - - E K C G R V R A T C 672-761/
CXorf20_Hsap_23503281_2 W R N I R M P C - S V L T L A K T - - - - - - K S C - A S L S A R Y L I Q K L F T K D V L V Q S - N V Y G N L K H - - - - - - - - - - - - - - - - - - - G L C - A L D P N K I S A L R E F L Q E N Y P I C D L S E - - N G R D W K - S C V T S I N - - S G I R S L R H D V 667-765\Cxorf20
LOC100003955_Drer_125851480 L R K V W I P Q - C V Y K E V F K - - - - - - E T - E P Q K A V A P V L Y S I F P I S T L S C S - A V T G N P E K - - - - - - - - - - - - - - - - - - - G I Q - Q L D P N K I E A L R E F L A E M F P Q F D V S V - - R G V A W A - Q C L G V I N - - - S I T K N L K K T 383-480|
Lgig1000009086_Lgig_Lgig1000009086 H T N L I T E D - V L Q Y I K L R S - - - - - K T - - P E I F A R N L M Y R L F T I Q E L K G C - N V Y G K G Y I - - - - - - - - - - - - - - - - N G I R N G - H L N P D I V K I I Y R L T Y R M F P F V K - - - - - - - F S W Q - N C V R W M N - - K S I R Y V T S L I 389-485|
zgc:113423_Drer_71834604_1 E R K V F I S S - F I L Q R A G K - - - - - - M T - R P S A A V R Y L S R N I F T T K E L S Q S - S T T G N P S R - - - - - - - - - - - - - - - - - - - C L L - R L D T N K V D A I R E W A V K R Y P K F D L R E - - S G K D W K - V C L A V I N - - S T A R Y Y R F M D 239-337|
zgc:113423_Drer_71834604_2 H R G V K V P E - F A L S A A H L - - - - - - R T - R P E L V A R Y L I R F I F P E D V L V N S - N V Y G G V R R - - - - - - - - - - - - - - - - - - - G I H - A L D H N K I S A L R E H L S E R F P W M K L Q E - - D G S D W K - V C V G S I N - - S A I R K F R Y E R 415-513|
LOC100085785_Oana_149638389_1 W R N V Q L P F - S V I Y V A K G - - - - - - K S - R P E L S A R Y L I R H L F T E E V L V K S - N V Y G N L E R - - - - - - - - - - - - - - - - - - - G M C - P L D C N R I N A L R D F L Q E N Y P S F D L T E - - T G Y D W K - A C V A A I N - - S T I R S L R H D H 646-744|
LOC100085785_Oana_149638389_2 K R N V K V L G - T Y L M K A R Q - - - - - - K T - K P K Y A A R Y L V R V L F P K E T L L C S - - V M G V S A R - - - - - - - - - - - - - - - - - - - G R R - T L D P N K V A A I R E F L A T F F P N Y D L S E - - Y G R D W K - T C I T N V N A M I R C L C C E T K I 464-563|
LOC772331_Ggal_118084100_1 W R N V Q L P V - S V I Y V A K G - - - - - - K S - R P E L S A R Y L V R H L F T E D V L V K S - N V Y G N L E R - - - - - - - - - - - - - - - - - - - G M S - P L D C N R I N A L R D F L Q E N Y P S F D L K E - - T G Y D L K - A C V D A M N - - - S T V S S F R C D 492-589|
LOC772331_Ggal_118084100_2 V R N V K V L G - N Y L M K A R Q - - - - - - K T - K P K Y A A R Y L V R V L F P K E A L L C S - - V I G V S T Q - - - - - - - - - - - - - - - - - - - G W C - S L D P N K M N A I R E F L A A N F P N Y D L S E - - Y G K D W K - T C I T H I N A M I R C L H S E T K I 245-344|
LOC694277_Mmul_109130110_1 W R N I R M P Y - S V L T L A K A - - - - - - K S C - A S L S A R Y L I H K L F T K D V L I Q S - N V Y G S L K H - - - - - - - - - - - - - - - - - - - G L Y - A L D P N K I S A L Q E F L Q E N Y P I C D L S E - - N G R D W K - L C V T S I N - - S S I R S L R H N V 811-909|
LOC694277_Mmul_109130110_2 K R N V R V L K - T H L L A V R N - - - - - - M A - K P K Q A A C Y L V R I L F S K E I L I S N - S V D I H L K D - - - - - - - - - - - - - - - - - - - S Q - - S L D P N K M A A L R E Y L A T T F P T C D L C E - - H G K D W Q - D C I S G I N S M I Y C L C S E A K S 627-726|
Bflo1000017051_Bflo_Bflo1000017051 A R G V A V T S A Q M A K A W G T S S - - - - T T - - N T A R A R A M A G F L F S T S E M T N N - S V N G D A E K - - - - - - - - - - - - - - - - - - - G W G - K L D Q N R V E A I I E W T T E V A T T S L T R Q - - - - - - - - - A L I S A L N - - G K C R N A L A V T 346-440|
Bflo1000020219_Bflo_Bflo1000020219 A R G V A V T S A Q M A K A W G T S S - - - - T T - - N T A R A R A M A G F L F S T S E M T N N - S V N G D A E K - - - - - - - - - - - - - - - - - - - G W G - K L D Q N R V E A I I E W T T E V A T T S L T R Q - - - - - - - - - A L I S A L N - - G K C R N V R N P P 294-388|
Bflo1000008622_Bflo_Bflo1000008622 W R Q V A V P A P V I A Q V A S L Q S E T P D K S - - R A A K V R A L M D A I F K T E E M V A N - N T D G S V K D - - - - - - - - - - - - - - - - - - - G L G - R L D E N K L N A I R E H L V T T E P G N Q D P H - - E P G - Y K T K V H K V I N - - L R C R K V R H I V 333-437/
LOC764357_Spur_115613065 R I Q M V M Q D S R W E E M T P - Y A - - - - G A - - P R L A I A L A R Y C I F G T K I L I R S - S V T G R N - - - - - - - - - - - - - - - - - - - - - S K N - P L D F A G L R K I K H L L F Q K Y G S R C S P V - E F E V I W K - T S R E S I S - - Q L C K R L R R K Y 966-1064\NEMVEDRAFT_v1g243017_group
NEMVEDRAFT_v1g207147_Nvec_156383936_1 H P D V T L P R - D L Y K K A L Q - - - - - - S S - L P N L Y T L A L C D L L F S V E T L E K S - T L T G F G - - - - - - - - - - - - - - - - - - - - - G T E - P L D P N I I A A I K D E V V D R F G V G K S S D - E S R T I W A - Q C Q N S I S - - S R C K N L R S K I 421-518|
NEMVEDRAFT_v1g207147_Nvec_156383936_2 S F K V E L Q E - D G N S A C A V - - - - - - Y S - S P Q Q Y V L D R V K I V I G D E V L L K S - S L T G C S - - - - - - - - - - - - - - - - - - - - - R T E - R L N T E V L K E I K D D V I A R F L A D K S D K - E K E Q A W A - A C L L D I A - - D Y C E S L R S A C 286-383|
NEMVEDRAFT_v1g243017_Nvec_156383934_1 D P N V T M S L A D Y K V I I K - - - - - - - N H - D P K S Y V I A L A E K L F G R A V L A E Y - T V T G R S - - - - - - - - - - - - - - - - - - - - - N T P - K L D K E I L F A I K A D V I E R F A S D R S P A - E Q E S L W Y - E C L L C L S - - T R C R T L R Y R T 172-269|
NEMVEDRAFT_v1g243017_Nvec_156383934_2 P G V T L T T A Q Y Q E L G K - - - - - - - - L S - H C N K F V I A L A E M L F G V E T L S T A - R V T N N P - - - - - - - - - - - - - - - - - - - - - A R S - Q L D P K I L R A I K G E I I K R F G I G M T P Y - D Q E K L W S - D S F T S I A - - L K C R T L R C A R 310-406|
NEMVEDRAFT_v1g243017_Nvec_156383934_3 D P S V T L P R H K F D S L R T A - - - - - - - - - H P N K Y A I G L A E N L F G R E I M A R S - T V T G Q G - - - - - - - - - - - - - - - - - - - - - R T S - R L D E K I L C A I K A D V L T Y F A A D N T P A - Q Q E Q L W A - A C I Q S I A - - A K C K T L R Y A K 476-572|
NEMVEDRAFT_v1g243017_Nvec_156383934_4 Y Q D V T L P L D E F R Q I T V - - - - - - - E I - E P S N Y A V A L A V R L F P D E V L E R A - - A A G E - - - - - - - - - - - - - - - - - - - - - - G T R - S L D D T I L K A I K A D V L G R F A A E K T P E - E R D L I W D - N C L A A I T - - Q R I R N P L L G K 604-699/
GSTEN:00013760:G:001_Tnig_47209384 L R K I N S H M E K I L F E N C K R S A - - - G V P R A D R Y A S Y V F R Y L V P Y N K Y C E W - V T K V N Y - - - - - - - - - - - - - - - - - N G L M G K E - A L P T N V R R A M R L Y I E R R F P T L S C - - - - - - D H W R - E I R D A I N - - E I L R V K R K P E 221-322\Xpat-like
xpat-A_Xlae_148222226 L P D I I L N P L D G K K L V S M L R S - - - S N Y E P H R F A E L L F Q H H V P H S L F Q L W - A N K V N F - - - - - - - - - - - - - - - - - D G S R G K L - G L P R N L M I D I L H Q T S K R F V - L G P - - - - - - K E K R - K I K T R L N - - L L L R T R Q D R A 187-287|
si:ch211-173p18.1_Drer_113195580 H P N F D F H P P S K Q Q L N A L F N Q - - - S Q K R A D R Y G C L L F R V I V P Q T K Y K E W - A S N T N W - - - - - - - - - - - - - - - - - D G S R G K C - A L P V N L R S F I I D T V S Q R F P G L S D - - - - - - V D R K - C I K D R I N - - E F L R S P R N T A 1090-1191|
LOC100007097_Drer_125805454 H P D L L F S P P T Q E H I D M L M V Q - - - S H G R P D Q Y G C L L F R A V V P D Q R Y A E W - A S T T N W - - - - - - - - - - - - - - - - - D G S R G K V - A L P V N L K Q F I R D S V I K R F P N L T S - - - - - - A D S K - R I K D R V N - - E F L R S P R T S V 169-270|
LOC796612_Drer_125835388 H P D L L F S P P T Q E H I D M L M V Q - - - S H G R P D R Y G C L L F R A V V P D Q R Y A E W - A S T T N W - - - - - - - - - - - - - - - - - D G L R G K V - A L P V N L K Q F V R D S V I K R Y P N L T S - - - - - - A D S K - R I K D R V N - - E F L R S P R T S V 77-178/
KIAA1553_Hsap_10047171_1 P P E Y Q L T A A E L K Q I V D Q S - - - - - L S - - G G D L A C R L L V Q L F P E L F S D V - - D F S R G C S A - - - - - - - - - - - - - - - C G F A A K R - K L E S L H L Q L I R N Y V E V Y Y P S V K - - - - - D T A V W Q A E C L P Q L N - - D F F S R F W A Q R 85-185\KIAA1553/E5R-like
KIAA1553_Hsap_10047171_2 A S D H V V D T Q D L T E F L D E A - - - - - S S - - P G D F A V F L L H R L F P E L F D H R - - K L G E Q Y S C - - - - - - - - - - - - - - - Y G D G G K Q - E L D P Q R L Q I I R N Y T E I Y F P D M Q - - - - - E E E A W L Q Q C A Q R I N - - D E L E G L G L D A 229-329|
KIAA1553_Hsap_10047171_3 G A D C L L S K E Q L R S I Y E S S - - - - - L S - - I G N F A S R L L V H L F P E L F T H E - - N L R K Q Y N C - - - - - - - - - - - - - - - S G S L G K K - Q L D P S R I K L I R H Y V Q L L Y P R A K - - - - - N D R V W T L E F V G K L D - - E R C R R R D T E Q 392-492|
KIAA1553_Hsap_10047171_4 P S P Y L L S D K E V R E I V Q Q S - - - - - L S - - V G N F A A R L L V R L F P E L F T A E - - N L R L Q Y N H - - - - - - - - - - - - - - - S G A C N K K - Q L D P T R L R L I R H Y V E A V Y P V E K - - - - - M E E V W H Y E C I P S I D - - E R C R R P N R K K 558-658|
LOC100021452_Mdom_126310385_1 P P E Y Q L T A A E L K Q I V D Q S - - - - - S S - - G G D L A C R L L V Q L F P E L F S E G - - E F N R S C A T - - - - - - - - - - - - - - - C G F V N K K - K L E S L H L Q L I R N Y V E V C Y P S V K - - - - - N T A V W Q V E C L P Q V N - - D F F N R F W A Q R 251-351|
LOC100021452_Mdom_126310385_2 A S D H V L D A Q D L N E F L D E A - - - - - S S - - P G E F S V F L L H R L F P E L F D H R - - K L A E R Y S C - - - - - - - - - - - - - - - Y G D G S K Q - E L D P Q R L Q I I R R Y T E I Y F P D V Q - - - - - E E E A W Q Q Q C A Q R I N - - D E L E G L C L D G 395-495|
LOC100021452_Mdom_126310385_3 M S D Y L L N K E Q I R S I Y E S S - - - - - L S - - I G N F A S R L L V H L F P E L F T H E - - N L R K Q Y N C - - - - - - - - - - - - - - - S G S L G K K - Q L D P S R I K L I R H Y V Q L L Y P R A K - - - - - N D R V W T L E F V G K L D - - E R C R R R D T E Q 557-657|
LOC100021452_Mdom_126310385_4 P S L Y L L S D K E V R E I V Q Q S - - - - - L S - - V G N F A A R L L V R L F P E L F T P E - - N L R L Q Y N H - - - - - - - - - - - - - - - S G A C N K K - Q L D P T R L R L I R H Y V E A V Y P V E K - - - - - M E E V W H Y E C I P S I D - - E R C R R P N R K K 723-823|
GSTEN:00016974:G:001_Tnig_47220120_1 P Q E Y L L S R E Q L R N I Y E C S - - - - - L S - - I G N F A S R L L V L M F P E L F T Q E - - N A R R R Y N C - - - - - - - - - - - - - - - S G S L G K K - Q L D P V R V N L I R H Y V Q L V Y P Q A Q - - - - - N D R V W M A E F V G K L D - - E R C R R R E T E Q 529-629|
NEMVEDRAFT_v1g222421_Nvec_156343605_1 T R M K E L L R - W A E K V K Q - - - - - - - K S C S V G N F A T N L V K L L F T K E E L L N R - N C T G S R - - - - - - - - - - - - - - - - - - - - - G K T - A L D A D K L N F V R Y C T F R L Y P M D V V E - - - Q E S V W R K K C V V S I D - - E F L R R G N R T R 279-375|
NEMVEDRAFT_v1g243810_Nvec_156379688_1 R P Q F A S R S - A V M Q I K C - - - - - - - K S C S I G N F S V Q L L R Y I F Q G E E L S N K - N C S G T R - - - - - - - - - - - - - - - - - - - - - G K E - Q I D P V K L Q F I K Q T V Y E H Y N I P T E E - - - K A T T W R - H C I R A M D - - E F L R R P K K E R 169-264|
LOC584784_Spur_115651987 A W E K L S M G V I C H L Y E R A I - - - - T K G - - - - N F A R S V L R K L V D D D I L V K S - T C S G K R G R P Y - - - - - - - - - - - - - - - - - - E Q - A I D P D I L Q Y A L E T T Y D V Y G V E E H A - - - K E K C R R - E C V Q S I D - - S H C R Q L F N S Q 323-421|
LOC421778_Ggal_118088656_1 P P E Y T L T S A E L K Q L M D Q S - - - - - T S - - G G D L A C R L L V Q L F P E L F S D D E F N R N C S A - - - - - - - - - - - - - - - - - C G F P N K R - K L E S L H L Q L I R S Y V E V C Y P S V K - - - - - S T A V W Q V E C L P Q L N - - D F F N R F W A Q R 168-268|
LOC421778_Ggal_118088656_2 A S D Y I L D A Q D L N E F L D E A - - - - - S S - - P G E F C V F L L H R L F P E L F D H R - - K L A E R Y S C - - - - - - - - - - - - - - - Y G D S G K Q - L L D P H R L Q I I R R Y T E I Y F P D V Q - - - - - E E E A W L Q Q C V Q R I N - - E E L E N V Y M D G 312-412|
LOC421778_Ggal_118088656_3 V P D Y L L N K E Q I K N I Y E S S - - - - - L S - - I G N F A S R L L V L L F P E L F T H E - - N L R K Q Y N C - - - - - - - - - - - - - - - S G S L G K K - Q L D P T R I K L I R H Y V Q I L Y P R A K - - - - - N D R V W M L E F V G K L D - - E R C R R R D T E Q 474-574|
LOC421778_Ggal_118088656_4 P S P Y L L T D K E V R E I V Q Q S - - - - - L S - - V G N F A A R L L V R L F P E L F T P E - - N L R L Q Y N H - - - - - - - - - - - - - - - S G A C N K K - Q L D P I R L R L I R H Y V E A V Y P V E K - - - - - M E E V W H Y E C I P S I D - - E R C R R P N R K K 640-740|
MC036R_MCV_9628968_1 A L E M I P S P A E L C H L A H - - - - - - - C S T S C A D M A R R V L L R L Y P E V V C G A - - D S E - - - - - - - - - - - - - - - - - - - - - - - - - A E - - L P A I Y F D A V R A C V S E Y Y P L V C - - - - - D E Y V W Q H E G L L P L R - - E F V L R C R L V R 18-107|
MC036R_MCV_9628968_2 P A W A G P V T L D I Y E C A S - - - - - - - S V S S P G E L A V L L L H K V F Q E L F D A R - - Q L R R C Y S C - - - - - - - - - - - - - - - Y G D G R T H - C L D P A R L Q L I R H C V A L C F P S M S - - - - - D D G E W V R E C V S R V N - - S E L T G E E L M D 634-734|
MC036R_MCV_9628968_3 S C V P L P T R A H L R K M Y G - - - - - - - A S R S I Y N F A V R M L V Y M F P E L F T A E - - N L H T H F N C - - - - - - - - - - - - - - - Y G S M G K R - R L D P L R L R L L R H Y V Q L L H P A A R - - - - - N E R V W I T K F L A C L D - - E R C R R R C A R T 807-907|
MC036R_MCV_9628968_4 P A Q Y L I S A K R V K E L A R - - - - - - - R S L C P G H F A A Q L T V M L F P E L F S S C - - T E R Q K F S C - - - - - - - - - - - - - - - A G S D E H L - R L D P V R V R L I R H Y V R A V C L P G A - - - - - F E R T W E A E C V P S I D - - A R C Q Q P G L R R 935-1035|
DpV84gp044_DVW_115503111_1 Y N K L I K I D Y H L S K I C K - - - - - - - M S V N P Y S M V E A L M N Y M F P D L F E K D - - N R Y T F Y R C - - - - - - - - - - - - - - - N E S K K Y C - Q L S S K K I N L M K I L L E N R F K I - - - - - - - N E D T W Q - E L K K F I D E - K I C G N A S S N I 179-277|
SFV_s031R_RfV_9633840_2 C D R V K M I Y G H I H E I E R - - - - - - - V A V N E Y A M T K S L L H Y V F P N L F N D D - - K H H L F Y R C D K - - - - - - - - - - - - - - - V D G L G - V L S S K K L N L I R V I L E N R F K I - - - - - - - G K Q K W T - M L K K Y I D - - T V C S T G K P L L 164-261|
m031R_MV_9633667_2 C N K V R M I Y G H I N E I E R - - - - - - - V A V N E Y S M A K S L L H Y V F P N L F N D D - - K H H L F Y R C T K - - - - - - - - - - - - - - - M D G L G - V L P S K K L N L I R V I L E N K F K I - - - - - - - S K R K W T - M L K K Y I D - - T V C A T G K L R V 165-262|
YMTVg36R_YmtV_38229199_1 K K V V L K I D S H L S M I I K - - - - - - - E S N N Y Y S M A R S L M N Y M F P N L F E E D - - Q R H I F Y R Y N - - - - - - - - - - - - - - - - V K G F C - Q L S E K K I S L I K L L I K K R F P I - - - - - - - N E N E W Q - H I K E Y I N - - V I C A T P K R N N 121-217|
36R_YdV_12085019_2 Q K M I V K I D S H L S M I I N - - - - - - - E S N N Y Y S M T R S L M N Y M F P N L F E D D - - Q R H L F Y R Y N - - - - - - - - - - - - - - - - V Q G F C - P L S E K K I S L I K L L I K K K F S I - - - - - - - N E N D W L - N I T K Y I N - - I I C A T P K H N K 119-215|
E5R_VVC_137623_2 N Q K T Y K L F S D I S A I G K - - - - - - - A S Q N P S K M V Y A L L L Y M F P N L F G D D - - H R F I R Y R M H P - - - - - - - - - - - - - M S K I K H K - I F S P F K L N L I R I L V E E R F Y N N E C - - - - R S N K W R - I I G T Q V D - - K M L I A E S D K Y 102-204|
m031R_MV_9633667_3 S Y Q G D S V D - E L K T L V L - - - - - - - S S F S L V D L T E K L I K T T F P E V V K S - - - G E G H N Y R C Y P - - - - - - - - - - - - - - - D G T H Q - G L D P E R V I D M C Y K A R V A T D S E - - - - - - S V V D V H N A I V E T V N - - R F L I R S E K K V 281-378|
SFV_s031R_RfV_9633840_3 N Y Q G V S V D - E L K T F V F - - - - - - - S S F S L V D L T E K L T K A M F P E V I K S - - - G M G H T Y R C Y P - - - - - - - - - - - - - - - D G T H Q - G L D L K R V I D L C Y K V R V S T D S E - - - - - - C D I D V H N A I V E T V D - - R Y L I R S E K R V 280-377|
LSDV035_LsdVN_15150475_2 N Y R S K Y V S - D F K R I V S - - - - - - - S S F S L V D L T E K I T K K T F S N I F R N - - - E I S H L Y K F N A - - - - - - - - - - - - - - - E K N H L - A L D E Y K K M R M C N K I I S A I D Y P - - - - - - N K D H I Y S A I I E T V N - - N Y L D N P P K K L 291-388|
36R_YdV_12085019_3 N E N F R N V D - T L K E L V A - - - - - - - N S F S M V D L T E K L T K A T F Y N L F K N - - - R T S N K Y Q C Y A - - - - - - - - - - - - - - - K D N F M - G L N Q T K L I N M F G Y I K L A V D C D - - - - - - D Y N V F F N A C I Y T I N - - K Y L L K S K K V I 236-333|
DpV83gp044_MdPV_62637422_2 L T D D N N D D L F L L K S I K - - - - - - - S S F S L V D L T E K I I K K K F D Y I F K N - - - R M N E K Y R F Y F - - - - - - - - - - - - - - - D G N H I - G L N Q I K I M Q I Y N K I K L F V D H N - - - - - - D D E L V F N S F V Y S V D - - M C L S T P R K I L 298-396|
E5R_VVC_137623_3 I K G K S E E D - T L F I K Q M V E Q C - - - V T S - - Q E L V E K V L K I L F R D L F K S - - - G E Y K A Y R Y D - - - - - - - - - - - - - D D V E N G F I - G L D T L K - L N I V H D I V E P C M P V - - - - - - R R P V A K I L C K E M V N - - K Y F E N P L H I I 218-318|
m031R_MV_9633667_1 - - - - - M E G - D Y L I R P G E - - - - - - K Q - - - A S Y A C R L L G I L T K H S T Y P - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - P E - - - - E Y F P - L V R S I M S M Y N T L I K - - - - - D D V I W F R E I A P Y L Y - - E Y T M Y K Q N A R 1-75|
SFV_s031R_RfV_9633840_1 - - - - - M E G - D Y L I R S G E - - - - - - K Q - - - A S Y A C R L L G I L T K H S T F P - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - S E - - - - E Y F P - L V R S I M S M Y N T L I K - - - - - D D I I W F K E I T P H L Y - - E Y V M Y K Q N V N 1-75|
LSDV035_LsdVN_15150475_1 D F S I N A K M - E N N T P P N H F E - - - - K A - - - S V Y A C R L F K Q F Y N E K K Y D - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - L D - - - - - K C F L K V R N S M S Q Y Y S M V G - - - - - D D L V W N R E I L S Y L Y - - E Y I A Y K N N A N 22-103|
DpV83gp044_MdPV_62637422_1 - - - - - - M E - D V E I N S N E - - - - - - K I - - - T A Y V C R L F K E F Y I K K Y N Y - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - K K - - - L D L C I S N I R K K V S K F Y P M I N - - - - - D D I I W N K E F M P L L Y - - E F I A Y K K N S S 1-76|
36R_YdV_12085019_1 - - - - - - M - - N L H A K Y N E - - - - - - K G - - - Y E Y A C R L I R L I C G S K I N S - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - L N - - - V E K C I N Y I R S K V C I Y F Q L A K - - - - - D D M V W D K E F L S Y I R - - E Y V I F Y N Q K N 1-75|
E5R_VVC_137623_1 - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - M - L I I V L W L Y G Y N F I M S G S Q C P M I - - - - - N D D S F T L K R K Y Q I D - - S A E S T M K M D K 1-47/
consensus/70% . p . h b h . . . p h . . h . . . . . . . . . p s . . . p p h s p . L h . . l F s p p . h . . p . s h p . . . p . . . . . . . . . . . . . . . . . . . . . p . . . L s . p b l p h l b p h l b p . h s . . . . . . . . . . . . b . . . h . p . l . . . p . h p p . . p . .
3. Comprehensive list of proteins with BEN domains
Note: sequence ids from unfinished genomes are not linked and are prefixed by the species abbreviations.
GI               Gene name                length  Species                                     Taxon                                 Description

# 38;NAC1
76621504         LOC525378                 532    Bos taurus                              metazoa>vertebrata                       PREDICTED: similar to NAC1 protein [Bos taurus].
76673942         LOC528023                 585    Bos taurus                              metazoa>vertebrata                       PREDICTED: similar to BTB (POZ) domain containing 14A [Bos taurus].
73967574         LOC491255                 586    Canis lupus familiaris                  metazoa>vertebrata                       PREDICTED: similar to BTB (POZ) domain containing 14A [Canis
73986836         LOC484918                 517    Canis lupus familiaris                  metazoa>vertebrata                       PREDICTED: similar to transcriptional repressor NAC1 [Canis
125812793        LOC562747                 563    Danio rerio                             metazoa>vertebrata>actinopterygii        PREDICTED: hypothetical protein [Danio rerio].
68444635         LOC572093                 559    Danio rerio                             metazoa>vertebrata>actinopterygii        PREDICTED: hypothetical protein [Danio rerio].
125823175        LOC559520                 570    Danio rerio                             metazoa>vertebrata>actinopterygii        PREDICTED: hypothetical protein [Danio rerio].
149737961        LOC100066292              585    Equus caballus                          metazoa>vertebrata                       PREDICTED: similar to BTB (POZ) domain containing 14A [Equus
149756840        LOC100063244              538    Equus caballus                          metazoa>vertebrata                       PREDICTED: similar to tripartite motif-containing 65 [Equus
71897295         BTBD14A                   578    Gallus gallus                           metazoa>vertebrata                       BTB (POZ) domain containing 14A [Gallus gallus].
21389533         BTBD14A                   587    Homo sapiens                            metazoa>vertebrata                       BTB (POZ) domain containing 14A [Homo sapiens].
16418383         BTBD14B                   527    Homo sapiens                            metazoa>vertebrata                       transcriptional repressor NAC1 [Homo sapiens].
109126540        LOC720682                 373    Macaca mulatta                          metazoa>vertebrata                       PREDICTED: similar to transcriptional repressor NAC1, partial
109109824        LOC721794                 586    Macaca mulatta                          metazoa>vertebrata                       PREDICTED: similar to BTB (POZ) domain containing 14A [Macaca
126302629        LOC100016365              580    Monodelphis domestica                   metazoa>vertebrata                       PREDICTED: hypothetical protein [Monodelphis domestica].
126323172        LOC100021158              521    Monodelphis domestica                   metazoa>vertebrata                       PREDICTED: similar to NAC1 protein [Monodelphis domestica].
126308327        LOC100020016              455    Monodelphis domestica                   metazoa>vertebrata                       PREDICTED: similar to LOC495228 protein [Monodelphis domestica].
18380977         Btbd14a                   586    Mus musculus                            metazoa>vertebrata                       Btbd14a protein [Mus musculus].
74192140         -                         557    Mus musculus                            metazoa>vertebrata                       unnamed protein product [Mus musculus].
12849997         -                         514    Mus musculus                            metazoa>vertebrata                       unnamed protein product [Mus musculus].
81886163         Btbd14b                   514    Mus musculus                            metazoa>vertebrata                       BTB/POZ domain-containing protein 14B (Nucleus accumbens-1)
80861477         Btbd14a                   586    Mus musculus                            metazoa>vertebrata                       BTB (POZ) domain containing 14A isoform 2 [Mus musculus].
26335495         -                         376    Mus musculus                            metazoa>vertebrata                       unnamed protein product [Mus musculus].
31543309         Btbd14b                   514    Mus musculus                            metazoa>vertebrata                       transcriptional repressor NAC1 [Mus musculus].
149641882        LOC100081168              585    Ornithorhynchus anatinus                metazoa>vertebrata                       PREDICTED: hypothetical protein isoform 2 [Ornithorhynchus
149641880        LOC100081168              584    Ornithorhynchus anatinus                metazoa>vertebrata                       PREDICTED: hypothetical protein isoform 1 [Ornithorhynchus
149635584        LOC100079529              490    Ornithorhynchus anatinus                metazoa>vertebrata                       PREDICTED: similar to NAC1 protein [Ornithorhynchus anatinus].
114627504        BTBD14A                   586    Pan troglodytes                         metazoa>vertebrata                       PREDICTED: BTB (POZ) domain containing 14A isoform 2 [Pan
34853069         Btbd14a                   585    Rattus norvegicus                       metazoa>vertebrata                       PREDICTED: similar to BTB (POZ) domain containing 14A isoform 2
19705547         Btbd14b                   514    Rattus norvegicus                       metazoa>vertebrata                       BTB (POZ) domain containing 14B [Rattus norvegicus].
62531002         Btbd14a                   564    Rattus norvegicus                       metazoa>vertebrata                       Btbd14a protein [Rattus norvegicus].
119368225        Btbd14a                   585    Rattus norvegicus                       metazoa>vertebrata                       BTB/POZ domain-containing protein 14A.
47212212         GSTEN:00011446:G:001      503    Tetraodon nigroviridis                  metazoa>vertebrata>actinopterygii        unnamed protein product [Tetraodon nigroviridis].
47210295         GSTEN:00011739:G:001      488    Tetraodon nigroviridis                  metazoa>vertebrata>actinopterygii        unnamed protein product [Tetraodon nigroviridis].
47214905         GSTEN:00023726:G:001      547    Tetraodon nigroviridis                  metazoa>vertebrata>actinopterygii        unnamed protein product [Tetraodon nigroviridis].
47226870         GSTEN:00027189:G:001      119    Tetraodon nigroviridis                  metazoa>vertebrata>actinopterygii        unnamed protein product [Tetraodon nigroviridis].
148236339        LOC495228                 502    Xenopus laevis                          metazoa>vertebrata                       hypothetical protein LOC495228 [Xenopus laevis].
156718004        LOC100125188              577    Xenopus tropicalis                      metazoa>vertebrata                       hypothetical protein LOC100125188 [Xenopus tropicalis].
# 48;BANP/SMAR1
Caps1000016701   Caps1000016701            490    Capitella spI                           metazoa>annelida                         estExt_Genewise1.C_5360012
Bflo1000030747   Bflo1000030747            502    Branchiostoma floridae                  metazoa>chordata                         estExt_fgenesh2_pg.C_2250034
115728493        LOC575996                 506    Strongylocentrotus purpuratus           metazoa>echinodermata                    PREDICTED: similar to MGC139655 protein [Strongylocentrotus
48138870         LOC409936                 580    Apis mellifera                          metazoa>hexapoda                         PREDICTED: similar to Broad-complex core-protein isoform 6 [Apis
Lgig1000012318   Lgig1000012318            430    Lottia gigantea                         metazoa>mollusca                         fgenesh2_pg.C_sca_40000059
115498006        MGC139655                 503    Bos taurus                              metazoa>vertebrata                       hypothetical protein LOC513446 [Bos taurus].
73956941         LOC479618                 490    Canis lupus familiaris                  metazoa>vertebrata                       PREDICTED: similar to BTG3 associated nuclear protein isoform b
66472486         banp                      508    Danio rerio                             metazoa>vertebrata>actinopterygii        BTG3 associated nuclear protein [Danio rerio].
55963413         CH211-93F2.4              508    Danio rerio                             metazoa>vertebrata>actinopterygii        novel protein (zgc:111954) [Danio rerio].
149699458        LOC100056298              467    Equus caballus                          metazoa>vertebrata                       PREDICTED: similar to Btg3 associated nuclear protein isoform 2
149699455        LOC100056298              548    Equus caballus                          metazoa>vertebrata                       PREDICTED: similar to Btg3 associated nuclear protein isoform 1
149699461        LOC100056298              489    Equus caballus                          metazoa>vertebrata                       PREDICTED: similar to Btg3 associated nuclear protein isoform 3
118096572        BANP                      531    Gallus gallus                           metazoa>vertebrata                       PREDICTED: hypothetical protein [Gallus gallus].
7018460          DKFZp761H172              250    Homo sapiens                            metazoa>vertebrata                       hypothetical protein [Homo sapiens].
119615776        BANP                      338    Homo sapiens                            metazoa>vertebrata                       BTG3 associated nuclear protein, isoform CRA_e [Homo sapiens].
10435880         -                         197    Homo sapiens                            metazoa>vertebrata                       unnamed protein product [Homo sapiens].
119615770        BANP                      276    Homo sapiens                            metazoa>vertebrata                       BTG3 associated nuclear protein, isoform CRA_a [Homo sapiens].
119615772        BANP                      250    Homo sapiens                            metazoa>vertebrata                       BTG3 associated nuclear protein, isoform CRA_c [Homo sapiens].
119615773        BANP                      364    Homo sapiens                            metazoa>vertebrata                       BTG3 associated nuclear protein, isoform CRA_d [Homo sapiens].
74729731         BANP                      545    Homo sapiens                            metazoa>vertebrata                       Protein BANP (Btg3-associated nuclear protein)
7020713          -                         442    Homo sapiens                            metazoa>vertebrata                       unnamed protein product [Homo sapiens].
109698609        BANP                      491    Homo sapiens                            metazoa>vertebrata                       BTG3 associated nuclear protein isoform b [Homo sapiens].
113426372        LOC730809                 293    Homo sapiens                            metazoa>vertebrata                       PREDICTED: similar to BANP homolog [Homo sapiens].
119615771        BANP                      306    Homo sapiens                            metazoa>vertebrata                       BTG3 associated nuclear protein, isoform CRA_b [Homo sapiens].
17986266         BANP                      469    Homo sapiens                            metazoa>vertebrata                       BTG3 associated nuclear protein isoform a [Homo sapiens].
89065641         LOC648196                 86     Homo sapiens                            metazoa>vertebrata                       PREDICTED: similar to BANP homolog [Homo sapiens].
113430836        LOC650749                 152    Homo sapiens                            metazoa>vertebrata                       PREDICTED: similar to BTG3 associated nuclear protein isoform a
119570734        hCG_1790309               125    Homo sapiens                            metazoa>vertebrata                       hCG1790309 [Homo sapiens].
109129466        LOC696125                 597    Macaca mulatta                          metazoa>vertebrata                       PREDICTED: similar to BTG3 associated nuclear protein isoform b
109129468        LOC696125                 466    Macaca mulatta                          metazoa>vertebrata                       PREDICTED: similar to BTG3 associated nuclear protein isoform b
109129476        LOC696125                 469    Macaca mulatta                          metazoa>vertebrata                       PREDICTED: similar to BTG3 associated nuclear protein isoform a
126304950        LOC100025705              730    Monodelphis domestica                   metazoa>vertebrata                       PREDICTED: similar to BTG3 associated nuclear protein [Monodelphis
124248564        Banp                      548    Mus musculus                            metazoa>vertebrata                       Btg3 associated nuclear protein isoform 2 [Mus musculus].
74152151         -                         506    Mus musculus                            metazoa>vertebrata                       unnamed protein product [Mus musculus].
148679724        Banp                      508    Mus musculus                            metazoa>vertebrata                       Btg3 associated nuclear protein, isoform CRA_b [Mus musculus].
10312104         -                         548    Mus musculus                            metazoa>vertebrata                       SMAR1 [Mus musculus].
158534069        Banp                      545    Mus musculus                            metazoa>vertebrata                       Btg3 associated nuclear protein isoform 1 [Mus musculus].
3641352          -                         509    Mus musculus                            metazoa>vertebrata                       putative transcription factor [Mus musculus].
15426473         Banp                      200    Mus musculus                            metazoa>vertebrata                       Banp protein [Mus musculus].
149642170        LOC100076573              556    Ornithorhynchus anatinus                metazoa>vertebrata                       PREDICTED: similar to Btg3 associated nuclear protein isoform 2
149642168        LOC100076573              473    Ornithorhynchus anatinus                metazoa>vertebrata                       PREDICTED: similar to Btg3 associated nuclear protein isoform 1
157822243        Banp_predicted            548    Rattus norvegicus                       metazoa>vertebrata                       Btg3 associated nuclear protein [Rattus norvegicus].
149038376        Banp_predicted            509    Rattus norvegicus                       metazoa>vertebrata                       Btg3 associated nuclear protein (predicted), isoform CRA_b [Rattus
149038377        Banp_predicted            192    Rattus norvegicus                       metazoa>vertebrata                       Btg3 associated nuclear protein (predicted), isoform CRA_c [Rattus
47219810         GSTEN:00022821:G:001      201    Tetraodon nigroviridis                  metazoa>vertebrata>actinopterygii        unnamed protein product [Tetraodon nigroviridis].
148226396        MGC85000                  509    Xenopus laevis                          metazoa>vertebrata                       hypothetical protein LOC734310 [Xenopus laevis].
89272778         banp                      509    Xenopus tropicalis                      metazoa>vertebrata                       BTG3 associated nuclear protein [Xenopus tropicalis].
45361235         banp                      510    Xenopus tropicalis                      metazoa>vertebrata                       BTG3 associated nuclear protein [Xenopus tropicalis].
# 68;E5R/KIAA1553
335839           -                         341    Variola major virus                     dsDNA viruses, no RNA stage>poxviridae   ORF1.
401340           E5R                       341    Vaccinia virus (strain Dairen I)        dsDNA viruses, no RNA stage>poxviridae   Protein E5.
44971413         RPXV050                   341    Rabbitpox virus                         dsDNA viruses, no RNA stage>poxviridae   RPXV050 [Rabbitpox virus].
56713424         m8071R                    341    Vaccinia virus                          dsDNA viruses, no RNA stage>poxviridae   hypothetical protein m8071R [Vaccinia virus].
5830607          C5R                       341    Variola minor virus                     dsDNA viruses, no RNA stage>poxviridae   C5R protein [Variola minor virus].
66275858         E5R                       341    Vaccinia virus                          dsDNA viruses, no RNA stage>poxviridae   abundant component of virosome [Vaccinia virus].
9627567          E5R                       341    Variola virus                           dsDNA viruses, no RNA stage>poxviridae   hypothetical protein VARVgp046 [Variola virus].
113195239        TATV_DAH68_062            332    Taterapox virus                         dsDNA viruses, no RNA stage>poxviridae   hypothetical protein TATV_DAH68_062 [Taterapox virus].
1335802          -                         332    Taterapox virus                         dsDNA viruses, no RNA stage>poxviridae   E5 protein.
111184245        -                         331    Horsepox virus                          dsDNA viruses, no RNA stage>poxviridae   HSPV062 [Horsepox virus].
137623           E5R                       331    Vaccinia virus Copenhagen               dsDNA viruses, no RNA stage>poxviridae   Protein E5.
22164650         EVM045                    331    Ectromelia virus                        dsDNA viruses, no RNA stage>poxviridae   EVM045 [Ectromelia virus].
2772747          MVA052R                   331    Vaccinia virus                          dsDNA viruses, no RNA stage>poxviridae   putative 39.1k protein [Vaccinia virus].
30519435         F5R                       331    Cowpox virus                            dsDNA viruses, no RNA stage>poxviridae   F5R protein [Cowpox virus].
88854101         List057                   331    Vaccinia virus                          dsDNA viruses, no RNA stage>poxviridae   hypothetical protein List057 [Vaccinia virus].
90819721         VACV-DUKE-069             331    Vaccinia virus                          dsDNA viruses, no RNA stage>poxviridae   VACV-DUKE-069 [Vaccinia virus].
1335792          -                         329    Camelpox virus 903                      dsDNA viruses, no RNA stage>poxviridae   E5 protein [Camelpox virus 903].
18640291         CamMLVgp057               329    Camelpox virus                          dsDNA viruses, no RNA stage>poxviridae   hypothetical protein; CMLV057 [Camelpox virus].
1335798          -                         319    Cowpox virus                            dsDNA viruses, no RNA stage>poxviridae   E5 protein.
20178437         CPXV071 CDS               319    Cowpox virus                            dsDNA viruses, no RNA stage>poxviridae   CPXV071 protein [Cowpox virus].
90660299         CPXV_GER91_3_066          319    Cowpox virus                            dsDNA viruses, no RNA stage>poxviridae   unknown [Cowpox virus].
47088380         ACAM3000_MVA_052          317    Vaccinia virus                          dsDNA viruses, no RNA stage>poxviridae   virosome component [Vaccinia virus].
94487330         VARV_NIG69_001_049        317    Variola virus                           dsDNA viruses, no RNA stage>poxviridae   hypothetical protein VARV_NIG69_001_049 [Variola virus].
6969704          -                         257    Vaccinia virus Tian Tan                 dsDNA viruses, no RNA stage>poxviridae   TE6R [Vaccinia virus (strain Tian Tan)].
1335800          -                         256    Ectromelia virus                        dsDNA viruses, no RNA stage>poxviridae   E5 protein.
37551505         VACCL3_071                189    Vaccinia virus                          dsDNA viruses, no RNA stage>poxviridae   virosome component [Vaccinia virus].
38348927         VACAC2_071                189    Vaccinia virus                          dsDNA viruses, no RNA stage>poxviridae   virosome component [Vaccinia virus].
88900678         VACV_198                  189    Vaccinia virus                          dsDNA viruses, no RNA stage>poxviridae   VACV198 [Vaccinia virus].
68449331         MPXV_LIB1970_184_057      133    Monkeypox virus                         dsDNA viruses, no RNA stage>poxviridae   unknown [Monkeypox virus].
62637422         DpV83gp044                414    Mule deer poxvirus                      dsDNA viruses, no RNA stage>poxviridae   Hypothetical protein DpV83gp044 [Mule deer poxvirus].
115503111        DpV84gp044                412    Deerpox virus W-1170-84                 dsDNA viruses, no RNA stage>poxviridae   hypothetical protein DpV84gp044 [Deerpox virus W-1170-84].
15150475         LSDV035                   402    Lumpy skin disease virus NI-2490        dsDNA viruses, no RNA stage>poxviridae   LSDV035 RNA polymerase subunit [Lumpy skin disease virus NI-2490].
22595571         LD035                     402    Lumpy skin disease virus NW-LW          dsDNA viruses, no RNA stage>poxviridae   hypothetical protein [Lumpy skin disease virus NW-LW].
9633667          m031R                     393    Myxoma virus                            dsDNA viruses, no RNA stage>poxviridae   m31R [Myxoma virus].
9633840          SFV_s031R                 392    Rabbit fibroma virus                    dsDNA viruses, no RNA stage>poxviridae   gp031R [Rabbit fibroma virus].
148912913        GTPV_gp032                377    Goatpox virus Pellor                    dsDNA viruses, no RNA stage>poxviridae   hypothetical protein GTPV_gp032 [Goatpox virus Pellor].
21492489         SPPV_32                   377    Sheeppox virus                          dsDNA viruses, no RNA stage>poxviridae   hypothetical protein SPPV_32 [Sheeppox virus].
22595729         LW036                     377    Lumpy skin disease virus                dsDNA viruses, no RNA stage>poxviridae   RNA polymerase subunit [lumpy skin disease virus].
157939658        36R                       352    Tanapox virus                           dsDNA viruses, no RNA stage>poxviridae   hypothetical protein TANV_36R [Tanapox virus].
12085019         36R                       348    Yaba-like disease virus                 dsDNA viruses, no RNA stage>poxviridae   36R protein [Yaba-like disease virus].
38229199         YMTVg36R                  342    Yaba monkey tumor virus                 dsDNA viruses, no RNA stage>poxviridae   36R [Yaba monkey tumor virus].
68448729         MPXV_USA2003_044_057.5    69     Monkeypox virus                         dsDNA viruses, no RNA stage>poxviridae   unknown [Monkeypox virus].
68448930         MPXV_RCG2003_358_057      67     Monkeypox virus                         dsDNA viruses, no RNA stage>poxviridae   unknown [Monkeypox virus].
2105231          B-N'.1                    228    Molluscum contagiosum virus subtype 1   dsDNA viruses, no RNA stage>poxviridae   hypothetical protein [Molluscum contagiosum virus subtype 1].
68449132         MPXV_ZAI1979_005_058      64     Monkeypox virus                         dsDNA viruses, no RNA stage>poxviridae   unknown [Monkeypox virus].
68448728         MPXV_USA2003_044_057      62     Monkeypox virus                         dsDNA viruses, no RNA stage>poxviridae   unknown [Monkeypox virus].
37551506         VACCL3_072                149    Vaccinia virus                          dsDNA viruses, no RNA stage>poxviridae   virosome component [Vaccinia virus].
9628968          MC036R                    1057   Molluscum contagiosum virus             dsDNA viruses, no RNA stage>poxviridae   MC036R [Molluscum contagiosum virus].
2105203          B1-8.2                    171    Molluscum contagiosum virus subtype 1   dsDNA viruses, no RNA stage>poxviridae   hypothetical protein [Molluscum contagiosum virus subtype 1].
2105178          B2-2R                     172    Molluscum contagiosum virus subtype 1   dsDNA viruses, no RNA stage>poxviridae   hypothetical protein [Molluscum contagiosum virus subtype 1].
126310385        LOC100021452              835    Monodelphis domestica                   metazoa>vertebrata                       PREDICTED: similar to RP11-59I9.2 [Monodelphis domestica].
73973961         LOC481949                 831    Canis lupus familiaris                  metazoa>vertebrata                       PREDICTED: hypothetical protein XP_539070 [Canis familiaris].
47220120         GSTEN:00016974:G:001      829    Tetraodon nigroviridis                  metazoa>vertebrata>actinopterygii        unnamed protein product [Tetraodon nigroviridis].
76625770         LOC525371                 829    Bos taurus                              metazoa>vertebrata                       PREDICTED: similar to RP11-59I9.2 [Bos taurus].
109072266        LOC703032                 828    Macaca mulatta                          metazoa>vertebrata                       PREDICTED: hypothetical protein [Macaca mulatta].
114608851        LOC472087                 828    Pan troglodytes                         metazoa>vertebrata                       PREDICTED: similar to RP11-59I9.2 [Pan troglodytes].
122937295        KIAA1553                  828    Homo sapiens                            metazoa>vertebrata                       hypothetical protein LOC57673 [Homo sapiens].
124297093        KIAA1553                  828    Homo sapiens                            metazoa>vertebrata                       KIAA1553 [Homo sapiens].
149722859        LOC100066484              828    Equus caballus                          metazoa>vertebrata                       PREDICTED: similar to RP11-59I9.2 [Equus caballus].
109510284        LOC683923                 826    Rattus norvegicus                       metazoa>vertebrata                       PREDICTED: hypothetical protein [Rattus norvegicus].
39841055         AK122525                  825    Mus musculus                            metazoa>vertebrata                       hypothetical protein LOC331623 [Mus musculus].
28972782         mKIAA1553                 775    Mus musculus                            metazoa>vertebrata                       mKIAA1553 protein [Mus musculus].
118088656        LOC421778                 752    Gallus gallus                           metazoa>vertebrata                       PREDICTED: similar to RP11-59I9.2 [Gallus gallus].
119568786        hCG_1646472               743    Homo sapiens                            metazoa>vertebrata                       hCG1646472 [Homo sapiens].
10047171         KIAA1553                  670    Homo sapiens                            metazoa>vertebrata                       KIAA1553 protein [Homo sapiens].
156392775        NEMVEDRAFT_v1g241212      418    Nematostella vectensis                  metazoa                                  predicted protein [Nematostella vectensis].
156343605        NEMVEDRAFT_v1g222421      417    Nematostella vectensis                  metazoa                                  hypothetical protein NEMVEDRAFT_v1g222421 [Nematostella vectensis].
149636136        LOC100080938              227    Ornithorhynchus anatinus                metazoa>vertebrata                       PREDICTED: similar to KIAA1553 protein [Ornithorhynchus anatinus].
# 26;CCDC4
149253925        EG666938                  745    Mus musculus                            metazoa>vertebrata                       PREDICTED: hypothetical protein [Mus musculus].
119922782        LOC614525                 709    Bos taurus                              metazoa>vertebrata                       PREDICTED: hypothetical protein [Bos taurus].
109500586        LOC687246                 671    Rattus norvegicus                       metazoa>vertebrata                       PREDICTED: hypothetical protein [Rattus norvegicus].
109499670        LOC681008                 665    Rattus norvegicus                       metazoa>vertebrata                       PREDICTED: hypothetical protein [Rattus norvegicus].
126331687        LOC100015448              545    Monodelphis domestica                   metazoa>vertebrata                       PREDICTED: hypothetical protein isoform 1 [Monodelphis domestica].
148762950        CCDC4                     534    Homo sapiens                            metazoa>vertebrata                       coiled-coil domain containing 4 [Homo sapiens].
109074112        LOC702395                 530    Macaca mulatta                          metazoa>vertebrata                       PREDICTED: hypothetical protein isoform 3 [Macaca mulatta].
83287964         CCDC4                     530    Homo sapiens                            metazoa>vertebrata                       Coiled-coil domain-containing protein 4.
114593835        LOC471183                 528    Pan troglodytes                         metazoa>vertebrata                       PREDICTED: hypothetical protein isoform 1 [Pan troglodytes].
109074116        LOC702395                 495    Macaca mulatta                          metazoa>vertebrata                       PREDICTED: hypothetical protein isoform 1 [Macaca mulatta].
114593839        LOC471183                 488    Pan troglodytes                         metazoa>vertebrata                       PREDICTED: hypothetical protein isoform 3 [Pan troglodytes].
109074114        LOC702395                 437    Macaca mulatta                          metazoa>vertebrata                       PREDICTED: hypothetical protein isoform 2 [Macaca mulatta].
62122398         -                         437    Homo sapiens                            metazoa>vertebrata                       hypothetical protein [Homo sapiens].
114593837        LOC471183                 435    Pan troglodytes                         metazoa>vertebrata                       PREDICTED: hypothetical protein isoform 2 [Pan troglodytes].
113416074        CCDC4                     431    Homo sapiens                            metazoa>vertebrata                       PREDICTED: coiled-coil domain containing 4 [Homo sapiens].
34532244         -                         416    Homo sapiens                            metazoa>vertebrata                       unnamed protein product [Homo sapiens].
118763953        CCDC4                     405    Homo sapiens                            metazoa>vertebrata                       CCDC4 protein [Homo sapiens].
126331689        LOC100015448              405    Monodelphis domestica                   metazoa>vertebrata                       PREDICTED: hypothetical protein isoform 2 [Monodelphis domestica].
149703028        LOC100063015              376    Equus caballus                          metazoa>vertebrata                       PREDICTED: hypothetical protein [Equus caballus].
118090572        LOC422777                 374    Gallus gallus                           metazoa>vertebrata                       PREDICTED: hypothetical protein [Gallus gallus].
149253745        LOC677011                 367    Mus musculus                            metazoa>vertebrata                       PREDICTED: hypothetical protein [Mus musculus].
119613407        CCDC4                     326    Homo sapiens                            metazoa>vertebrata                       coiled-coil domain containing 4 [Homo sapiens].
125830684        LOC566937                 318    Danio rerio                             metazoa>vertebrata>actinopterygii        PREDICTED: hypothetical protein [Danio rerio].
118763959        CCDC4                     312    Homo sapiens                            metazoa>vertebrata                       CCDC4 protein [Homo sapiens].
73974946         LOC482119                 283    Canis lupus familiaris                  metazoa>vertebrata                       PREDICTED: hypothetical protein XP_539239 [Canis familiaris].
47226171         GSTEN:00029264:G:001      405    Tetraodon nigroviridis                  metazoa>vertebrata>actinopterygii        unnamed protein product [Tetraodon nigroviridis].
# 27;C10orf30
149437008        LOC100085310              608    Ornithorhynchus anatinus                metazoa>vertebrata                       PREDICTED: similar to Uncharacterized protein C10orf30
73949108         LOC487127                 608    Canis lupus familiaris                  metazoa>vertebrata                       PREDICTED: hypothetical protein XP_544255 [Canis familiaris].
114629532        LOC450314                 567    Pan troglodytes                         metazoa>vertebrata                       PREDICTED: similar to Uncharacterized protein C10orf30 [Pan
109820782        C10orf30                  519    Homo sapiens                            metazoa>vertebrata                       Uncharacterized protein C10orf30.
155029542        C10orf30                  468    Homo sapiens                            metazoa>vertebrata                       hypothetical protein LOC222389 isoform 1 [Homo sapiens].
21757423         -                         468    Homo sapiens                            metazoa>vertebrata                       unnamed protein product [Homo sapiens].
149743708        LOC100069127              458    Equus caballus                          metazoa>vertebrata                       PREDICTED: similar to Uncharacterized protein C10orf30 [Equus
31542577         E130319B15Rik             434    Mus musculus                            metazoa>vertebrata                       hypothetical protein LOC209645 [Mus musculus].
26343845         -                         433    Mus musculus                            metazoa>vertebrata                       unnamed protein product [Mus musculus].
66396495         E130319B15Rik             433    Mus musculus                            metazoa>vertebrata                       RIKEN cDNA E130319B15 gene [Mus musculus].
126340438        LOC100014309              431    Monodelphis domestica                   metazoa>vertebrata                       PREDICTED: similar to Uncharacterized protein C10orf30 [Monodelphis
118081981        LOC419037                 430    Gallus gallus                           metazoa>vertebrata                       PREDICTED: similar to Uncharacterized protein C10orf30 [Gallus
109506168        RGD1305898_predicted      419    Rattus norvegicus                       metazoa>vertebrata                       PREDICTED: similar to BTB (POZ) domain containing 14B [Rattus
109505216        RGD1305898_predicted      418    Rattus norvegicus                       metazoa>vertebrata                       PREDICTED: similar to BTB (POZ) domain containing 14B [Rattus
148676000        E130319B15Rik             417    Mus musculus                            metazoa>vertebrata                       RIKEN cDNA E130319B15 [Mus musculus].
26335345         -                         401    Mus musculus                            metazoa>vertebrata                       unnamed protein product [Mus musculus].
47216031         GSTEN:00033310:G:001      400    Tetraodon nigroviridis                  metazoa>vertebrata>actinopterygii        unnamed protein product [Tetraodon nigroviridis].
126632819        CH211-220F12.4-001        387    Danio rerio                             metazoa>vertebrata>actinopterygii        novel protein [Danio rerio].
125843107        LOC560711                 379    Danio rerio                             metazoa>vertebrata>actinopterygii        PREDICTED: similar to Uncharacterized protein C10orf30 [Danio
155029544        C10orf30                  374    Homo sapiens                            metazoa>vertebrata                       hypothetical protein LOC222389 isoform 2 [Homo sapiens].
149021070        RGD1305898_predicted      371    Rattus norvegicus                       metazoa>vertebrata                       similar to hypothetical protein FLJ40283 (predicted) [Rattus
21618768         C10orf30                  365    Homo sapiens                            metazoa>vertebrata                       C10orf30 protein [Homo sapiens].
83405406         LOC504404                 364    Bos taurus                              metazoa>vertebrata                       LOC504404 protein [Bos taurus].
119606694        C10orf30                  361    Homo sapiens                            metazoa>vertebrata                       chromosome 10 open reading frame 30, isoform CRA_d [Homo sapiens].
109091119        LOC720932                 227    Macaca mulatta                          metazoa>vertebrata                       PREDICTED: hypothetical protein, partial [Macaca mulatta].
33990622         E130319B15Rik             141    Mus musculus                            metazoa>vertebrata                       E130319B15Rik protein [Mus musculus].
55959077         C10orf30                  117    Homo sapiens                            metazoa>vertebrata                       chromosome 10 open reading frame 30 [Homo sapiens].
# 38;Polydnavirus
69951585         CpBV-HP301                1376   Cotesia plutellae polydnavirus          dsDNA viruses, no RNA stage              hypothetical protein ORF301 [Cotesia plutellae polydnavirus].
69951893         CpBV-HP501                1369   Cotesia plutellae polydnavirus          dsDNA viruses, no RNA stage              hypothetical protein ORF501 [Cotesia plutellae polydnavirus].
69952192         CpBV-HP1102               1347   Cotesia plutellae polydnavirus          dsDNA viruses, no RNA stage              hypothetical protein ORF1102 [Cotesia plutellae polydnavirus].
54109737         CcBV_6.3                  1196   Cotesia congregata bracovirus           dsDNA viruses, no RNA stage              hypothetical protein [Cotesia congregata bracovirus].
57753417         CcBVs6gp3                 1196   Cotesia congregata bracovirus           dsDNA viruses, no RNA stage              hypothetical protein CcBVs6gp3 [Cotesia congregata bracovirus].
54109813         CcBV_20.2                 1091   Cotesia congregata bracovirus           dsDNA viruses, no RNA stage              hypothetical protein [Cotesia congregata bracovirus].
57659442         CcBV_20.2                 1091   Cotesia congregata bracovirus           dsDNA viruses, no RNA stage              hypothetical protein CcBV_20.2 [Cotesia congregata bracovirus].
118139774        -                         1089   Cotesia plutellae polydnavirus          dsDNA viruses, no RNA stage              hypothetical protein [Cotesia plutellae polydnavirus].
118139786        -                         1089   Cotesia plutellae polydnavirus          dsDNA viruses, no RNA stage              hypothetical protein [Cotesia plutellae polydnavirus].
117935419        GIP_L1_00580              1065   Glyptapanteles indiensis                metazoa>hexapoda                         hypothetical protein GIP_L1_00580 [Glyptapanteles indiensis].
54109822         CcBV_23.1                 910    Cotesia congregata bracovirus           dsDNA viruses, no RNA stage              hypothetical protein [Cotesia congregata bracovirus].
57659520         CcBV_23.1                 910    Cotesia congregata bracovirus           dsDNA viruses, no RNA stage              hypothetical protein CcBV_23.1 [Cotesia congregata bracovirus].
117935398        GIP_L1_00370              897    Glyptapanteles indiensis                metazoa>hexapoda                         hypothetical protein GIP_L1_00370 [Glyptapanteles indiensis].
117935418        GIP_L1_00570              895    Glyptapanteles indiensis                metazoa>hexapoda                         hypothetical protein GIP_L1_00570 [Glyptapanteles indiensis].
118139710        -                         836    Cotesia plutellae polydnavirus          dsDNA viruses, no RNA stage              hypothetical protein [Cotesia plutellae polydnavirus].
69951773         CpBV-HP402                836    Cotesia plutellae polydnavirus          dsDNA viruses, no RNA stage              hypothetical protein ORF402 [Cotesia plutellae polydnavirus].
118139754        -                         821    Cotesia plutellae polydnavirus          dsDNA viruses, no RNA stage              hypothetical protein [Cotesia plutellae polydnavirus].
54109828         CcBV_25.2                 775    Cotesia congregata bracovirus           dsDNA viruses, no RNA stage              unnamed protein product [Cotesia congregata bracovirus].
57659551         CcBV_25.2                 775    Cotesia congregata bracovirus           dsDNA viruses, no RNA stage              hypothetical protein CcBV_25.2 [Cotesia congregata bracovirus].
54109750         CcBV_9.1                  696    Cotesia congregata bracovirus           dsDNA viruses, no RNA stage              hypothetical protein [Cotesia congregata bracovirus].
54109869         CcBV_33.2                 696    Cotesia congregata bracovirus           dsDNA viruses, no RNA stage              hypothetical protein [Cotesia congregata bracovirus].
57659251         CcBV_9.1                  696    Cotesia congregata bracovirus           dsDNA viruses, no RNA stage              hypothetical protein CcBV_9.1 [Cotesia congregata bracovirus].
57659718         CcBV_33.2                 696    Cotesia congregata bracovirus           dsDNA viruses, no RNA stage              hypothetical protein CcBV_33.2 [Cotesia congregata bracovirus].
56554931         CpBV-HP3302               265    Cotesia plutellae polydnavirus          dsDNA viruses, no RNA stage              hypothetical protein ORF3302 [Cotesia plutellae polydnavirus].
54109800         CcBV_18.6                 259    Cotesia congregata bracovirus           dsDNA viruses, no RNA stage              hypothetical protein [Cotesia congregata bracovirus].
57753384         CcBVs18gp6                259    Cotesia congregata bracovirus           dsDNA viruses, no RNA stage              hypothetical protein CcBVs18gp6 [Cotesia congregata bracovirus].
56788738         -                         248    Microplitis demolitor bracovirus        dsDNA viruses, no RNA stage              unknown [Microplitis demolitor bracovirus].
66391199         MdBV_sBgp1                248    Microplitis demolitor bracovirus        dsDNA viruses, no RNA stage              hypothetical protein MdBV_sBgp1 [Microplitis demolitor bracovirus].
118139720        -                         646    Cotesia plutellae polydnavirus          dsDNA viruses, no RNA stage              DUF-like 1 [Cotesia plutellae polydnavirus].
54109767         CcBV_12.2                 480    Cotesia congregata bracovirus           dsDNA viruses, no RNA stage              unnamed protein product [Cotesia congregata bracovirus].
57753397         CcBV_12.2                 480    Cotesia congregata bracovirus           dsDNA viruses, no RNA stage              hypothetical protein CcBV_12.2 [Cotesia congregata bracovirus].
62903502         CpBV-HP3301               427    Cotesia plutellae polydnavirus          dsDNA viruses, no RNA stage              hypothetical protein ORF3301 [Cotesia plutellae polydnavirus].
54109801         CcBV_18.7                 423    Cotesia congregata bracovirus           dsDNA viruses, no RNA stage              hypothetical protein [Cotesia congregata bracovirus].
57753385         CcBVs18gp7                423    Cotesia congregata bracovirus           dsDNA viruses, no RNA stage              hypothetical protein CcBVs18gp7 [Cotesia congregata bracovirus].
54109731         CcBV_3.3                  675    Cotesia congregata bracovirus           dsDNA viruses, no RNA stage              hypothetical protein [Cotesia congregata bracovirus].
57753423         CcBV_3.3                  675    Cotesia congregata bracovirus           dsDNA viruses, no RNA stage              hypothetical protein CcBV_3.3 [Cotesia congregata bracovirus].
54109732         CcBV_3.4                  1223   Cotesia congregata bracovirus           dsDNA viruses, no RNA stage              unnamed protein product [Cotesia congregata bracovirus].
57753424         CcBV_3.4                  1223   Cotesia congregata bracovirus           dsDNA viruses, no RNA stage              hypothetical protein CcBV_3.4 [Cotesia congregata bracovirus].
# 27;C1orf165
119890242        LOC535329                 484    Bos taurus                              metazoa>vertebrata                       PREDICTED: similar to C1orf165 protein isoform 1 [Bos taurus].
148698742        -                         477    Mus musculus                            metazoa>vertebrata                       RIKEN cDNA 2310026E23, isoform CRA_c [Mus musculus].
109004715        LOC711150                 421    Macaca mulatta                          metazoa>vertebrata                       PREDICTED: hypothetical protein isoform 1 [Macaca mulatta].
157822145        LOC362564                 421    Rattus norvegicus                       metazoa>vertebrata                       hypothetical protein LOC362564 [Rattus norvegicus].
28175231         2310026E23Rik             421    Mus musculus                            metazoa>vertebrata                       2310026E23Rik protein [Mus musculus].
30794456         2310026E23Rik             421    Mus musculus                            metazoa>vertebrata                       hypothetical protein LOC67621 [Mus musculus].
114556452        LOC743875                 420    Pan troglodytes                         metazoa>vertebrata                       PREDICTED: hypothetical protein LOC743875 isoform 2 [Pan
118094515        LOC424628                 417    Gallus gallus                           metazoa>vertebrata                       PREDICTED: similar to C1orf165 protein isoform 1 [Gallus gallus].
74195386         -                         417    Mus musculus                            metazoa>vertebrata                       unnamed protein product [Mus musculus].
125823408        LOC566161                 405    Danio rerio                             metazoa>vertebrata>actinopterygii        PREDICTED: hypothetical protein [Danio rerio].
94574193         LOC535329                 402    Bos taurus                              metazoa>vertebrata                       LOC535329 protein [Bos taurus].
149452637        LOC100086099              401    Ornithorhynchus anatinus                metazoa>vertebrata                       PREDICTED: similar to chromosome 1 open reading frame 165
148228384        MGC131330                 393    Xenopus laevis                          metazoa>vertebrata                       hypothetical protein LOC734853 [Xenopus laevis].
119627253        -                         336    Homo sapiens                            metazoa>vertebrata                       chromosome 1 open reading frame 165, isoform CRA_a [Homo sapiens].
73977019         LOC475363                 336    Canis lupus familiaris                  metazoa>vertebrata                       PREDICTED: hypothetical protein XP_532587 isoform 1 [Canis
148698740        -                         309    Mus musculus                            metazoa>vertebrata                       RIKEN cDNA 2310026E23, isoform CRA_a [Mus musculus].
109004718        LOC711150                 252    Macaca mulatta                          metazoa>vertebrata                       PREDICTED: hypothetical protein isoform 2 [Macaca mulatta].
13375807         C1orf165                  252    Homo sapiens                            metazoa>vertebrata                       hypothetical protein LOC79656 [Homo sapiens].
149693661        LOC100051340              252    Equus caballus                          metazoa>vertebrata                       PREDICTED: similar to chromosome 1 open reading frame 165 [Equus
47225633         GSTEN:00028816:G:001      242    Tetraodon nigroviridis                  metazoa>vertebrata>actinopterygii        unnamed protein product [Tetraodon nigroviridis].
125843659        LOC792358                 215    Danio rerio                             metazoa>vertebrata>actinopterygii        PREDICTED: hypothetical protein [Danio rerio].
157167939        AaeL_AAEL002989           385    Aedes aegypti                           metazoa>hexapoda                         hypothetical protein AaeL_AAEL002989 [Aedes aegypti].
157167933        AaeL_AAEL003016           423    Aedes aegypti                           metazoa>hexapoda                         hypothetical protein AaeL_AAEL003016 [Aedes aegypti].
157104034        AaeL_AAEL003998           484    Aedes aegypti                           metazoa>hexapoda                         conserved hypothetical protein [Aedes aegypti].
157138093        AaeL_AAEL013916           155    Aedes aegypti                           metazoa>hexapoda                         hypothetical protein AaeL_AAEL013916 [Aedes aegypti].
157104040        AaeL_AAEL003984           408    Aedes aegypti                           metazoa>hexapoda                         hypothetical protein AaeL_AAEL003984 [Aedes aegypti].
157167937        AaeL_AAEL003004           443    Aedes aegypti                           metazoa>hexapoda                         hypothetical protein AaeL_AAEL003004 [Aedes aegypti].
# 15;C6orf65
126310289        LOC100018717              381    Monodelphis domestica                   metazoa>vertebrata                       PREDICTED: similar to chromosome 6 open reading frame 65
118088941        LOC775985                 371    Gallus gallus                           metazoa>vertebrata                       PREDICTED: similar to RIKEN cDNA B230209C24 gene [Gallus gallus].
148682498        B230209C24Rik             285    Mus musculus                            metazoa>vertebrata                       RIKEN cDNA B230209C24, isoform CRA_b [Mus musculus].
40254314         B230209C24Rik             281    Mus musculus                            metazoa>vertebrata                       hypothetical protein LOC320705 [Mus musculus].
114607978        LOC743853                 279    Pan troglodytes                         metazoa>vertebrata                       PREDICTED: similar to RIKEN cDNA B230209C24 gene [Pan troglodytes].
148806920        C6orf65                   279    Homo sapiens                            metazoa>vertebrata                       hypothetical protein LOC221336 [Homo sapiens].
149732645        LOC100069928              279    Equus caballus                          metazoa>vertebrata                       PREDICTED: similar to chromosome 6 open reading frame 65 [Equus
157817985        RGD1310392_predicted      279    Rattus norvegicus                       metazoa>vertebrata                       hypothetical protein LOC363212 [Rattus norvegicus].
73973424         LOC610495                 279    Canis lupus familiaris                  metazoa>vertebrata                       PREDICTED: hypothetical protein XP_848011 [Canis familiaris].
76649778         LOC504789                 279    Bos taurus                              metazoa>vertebrata                       PREDICTED: similar to RIKEN cDNA B230209C24 gene [Bos taurus].
109071583        LOC712498                 277    Macaca mulatta                          metazoa>vertebrata                       PREDICTED: hypothetical protein [Macaca mulatta].
149046420        rCG_22515                 252    Rattus norvegicus                       metazoa>vertebrata                       rCG22515, isoform CRA_a [Rattus norvegicus].
26337447         -                         251    Mus musculus                            metazoa>vertebrata                       unnamed protein product [Mus musculus].
26336657         -                         219    Mus musculus                            metazoa>vertebrata                       unnamed protein product [Mus musculus].
22137383         C6orf65                   181    Homo sapiens                            metazoa>vertebrata                       C6orf65 protein [Homo sapiens].
# 16; Insensitive
17946138         -                         376    Drosophila melanogaster                 metazoa>hexapoda                         RE55538p [Drosophila melanogaster].
24581162         insv                      376    Drosophila melanogaster                 metazoa>hexapoda                         insensitive CG3227-PA [Drosophila melanogaster].
33328999         -                         375    Drosophila yakuba                       metazoa>hexapoda                         CG3227 [Drosophila yakuba].
125984436        Dpse\GA16802              371    Drosophila pseudoobscura                metazoa>hexapoda                         GA16802-PA [Drosophila pseudoobscura].
125986627        Dpse\GA11475              364    Drosophila pseudoobscura                metazoa>hexapoda                         GA11475-PA [Drosophila pseudoobscura].
24581706         Bsg25A                    363    Drosophila melanogaster                 metazoa>hexapoda                         Blastoderm-specific gene 25A CG12205-PA [Drosophila melanogaster].
1930012          bsg25A                    209    Drosophila melanogaster                 metazoa>hexapoda                         blastoderm-specific protein 25A [Drosophila melanogaster].
19920584         CG9883                    381    Drosophila melanogaster                 metazoa>hexapoda                         CG9883-PA [Drosophila melanogaster].
125984438        Dpse\GA22097              379    Drosophila pseudoobscura                metazoa>hexapoda                         GA22097-PA [Drosophila pseudoobscura].
118791739        AgaP_ENSANGG00000025789   749    Anopheles gambiae str. PEST             metazoa>hexapoda                         ENSANGP00000029795 [Anopheles gambiae str. PEST].
157013740        AgaP_AGAP009151           512    Anopheles gambiae str. PEST             metazoa>hexapoda                         AGAP009151-PA [Anopheles gambiae str. PEST].
158299900        AgaP_AGAP009151           512    Anopheles gambiae str. PEST             metazoa>hexapoda                         AGAP009151-PA [Anopheles gambiae str. PEST].
157124999        AaeL_AAEL001891           656    Aedes aegypti                           metazoa>hexapoda                         hypothetical protein AaeL_AAEL001891 [Aedes aegypti].
110759165        LOC724266                 151    Apis mellifera                          metazoa>hexapoda                         PREDICTED: hypothetical protein [Apis mellifera].
91091380         LOC661906                 463    Tribolium castaneum                     metazoa>hexapoda                         PREDICTED: similar to Broad-complex core-protein isoform 6
125835679        LOC792774                 410    Danio rerio                             metazoa>vertebrata>actinopterygii        PREDICTED: hypothetical protein, partial [Danio rerio].
# 10; CXorf20
109130110        LOC694277                 930    Macaca mulatta                          metazoa>vertebrata                       PREDICTED: hypothetical protein LOC139105 [Macaca mulatta].
23503281         CXorf20                   799    Homo sapiens                            metazoa>vertebrata                       hypothetical protein LOC139105 [Homo sapiens].
71153248         CXorf20                   799    Homo sapiens                            metazoa>vertebrata                       Uncharacterized protein CXorf20.
149638389        LOC100085785              784    Ornithorhynchus anatinus                metazoa>vertebrata                       PREDICTED: similar to Chromosome X open reading frame 20
66910950         CXorf20                   645    Homo sapiens                            metazoa>vertebrata                       CXorf20 protein [Homo sapiens].
118084100        LOC772331                 629    Gallus gallus                           metazoa>vertebrata                       PREDICTED: similar to chromosome X open reading frame 20 [Gallus
114687922        LOC738581                 320    Pan troglodytes                         metazoa>vertebrata                       PREDICTED: similar to CXorf20 protein [Pan troglodytes].
71834604         zgc:113423                522    Danio rerio                             metazoa>vertebrata>actinopterygii        hypothetical protein LOC569178 [Danio rerio].
125868980        LOC795930                 40     Danio rerio                             metazoa>vertebrata>actinopterygii        PREDICTED: hypothetical protein, partial [Danio rerio].
125851480        LOC100003955              484    Danio rerio                             metazoa>vertebrata>actinopterygii        PREDICTED: hypothetical protein [Danio rerio].
# 5; Xpat-like Vertebrate specific group
113195580        si:ch211-173p18.1         1523   Danio rerio                             metazoa>vertebrata>actinopterygii        hypothetical protein LOC327248 [Danio rerio].
47209384         GSTEN:00013760:G:001      334    Tetraodon nigroviridis                  metazoa>vertebrata>actinopterygii        unnamed protein product [Tetraodon nigroviridis].***
125805454        LOC100007097              278    Danio rerio                             metazoa>vertebrata>actinopterygii        PREDICTED: hypothetical protein [Danio rerio].
148222226        xpat-A                    293    Xenopus laevis                          metazoa>vertebrata                       Xpat protein [Xenopus laevis].
125835388        LOC796612                 186    Danio rerio                             metazoa>vertebrata>actinopterygii        PREDICTED: hypothetical protein [Danio rerio].
# 3;NEMVEDRAFT_v1g243017-like
156383934        NEMVEDRAFT_v1g243017      703    Nematostella vectensis                  metazoa                                  predicted protein [Nematostella vectensis].
115613065        LOC764357                 1092   Strongylocentrotus purpuratus           metazoa>echinodermata                    PREDICTED: hypothetical protein [Strongylocentrotus purpuratus].
156383936        NEMVEDRAFT_v1g207147      680    Nematostella vectensis                  metazoa                                  predicted protein [Nematostella vectensis].
# 2;
115944443        LOC584784                 1545   Strongylocentrotus purpuratus           metazoa>echinodermata                    PREDICTED: hypothetical protein [Strongylocentrotus purpuratus].
115651987        LOC584784                 1537   Strongylocentrotus purpuratus           metazoa>echinodermata                    PREDICTED: hypothetical protein, partial [Strongylocentrotus
# 2;
Bflo1000017051   Bflo1000017051            457    Branchiostoma floridae                  metazoa>chordata                         fgenesh2_pg.scaffold_262000003
Bflo1000020219   Bflo1000020219            399    Branchiostoma floridae                  metazoa>chordata                         fgenesh2_pg.scaffold_435000004
# 2;
125831342        LOC794392                 326    Danio rerio                             metazoa>vertebrata>actinopterygii        PREDICTED: similar to chromosome 6 open reading frame 65 [Danio
68370250         LOC560328                 326    Danio rerio                             metazoa>vertebrata>actinopterygii        PREDICTED: similar to chromosome 6 open reading frame 65 [Danio
# 2; mod(mdg4)-like
24648736         mod(mdg4)                 534    Drosophila melanogaster                 metazoa>hexapoda                         modifier of mdg4 CG32491-PC, isoform C [Drosophila melanogaster].
119112359        AgaP_AGAP003439           567    Anopheles gambiae str. PEST             metazoa>hexapoda                         AGAP003439-PC [Anopheles gambiae str. PEST].

# 1; Unclustered sequences
156391225        NEMVEDRAFT_v1g241523      434    Nematostella vectensis                  metazoa                                  predicted protein [Nematostella vectensis].
156379688        NEMVEDRAFT_v1g243810      314    Nematostella vectensis                  metazoa                                  predicted protein [Nematostella vectensis].
156390312        NEMVEDRAFT_v1g232490      384    Nematostella vectensis                  metazoa                                  predicted protein [Nematostella vectensis].
Bflo1000009049   Bflo1000009049            454    Branchiostoma floridae                  metazoa>chordata                         fgenesh2_pg.scaffold_100000089
Bflo1000028913   Bflo1000028913            3253   Branchiostoma floridae                  metazoa>chordata                         estExt_fgenesh2_pg.C_1400008
Bflo1000018302   Bflo1000018302            258    Branchiostoma floridae                  metazoa>chordata                         fgenesh2_pg.scaffold_317000006
Bflo1000008622   Bflo1000008622            446    Branchiostoma floridae                  metazoa>chordata                         fgenesh2_pg.scaffold_91000091
Bflo1000016838   Bflo1000016838            213    Branchiostoma floridae                  metazoa>chordata                         fgenesh2_pg.scaffold_256000003
Dpul1000022193   Dpul1000022193            342    Daphnia pulex                           metazoa>crustacea                        NCBI_GNO_15600040
Dpul1000017063   Dpul1000017063            231    Daphnia pulex                           metazoa>crustacea                        NCBI_GNO_5400065
Dpul1000009773   Dpul1000009773            240    Daphnia pulex                           metazoa>crustacea                        fgenesh1_pg.C_scaffold_21000028
Dpul1000001605   Dpul1000001605            276    Daphnia pulex                           metazoa>crustacea                        PASA_GEN_0400244
Dpul1000023096   Dpul1000023096            364    Daphnia pulex                           metazoa>crustacea                        NCBI_GNO_9600071
Dpul1000013633   Dpul1000013633            340    Daphnia pulex                           metazoa>crustacea                        fgenesh1_pg.C_scaffold_14000271
Dpul1000028485   Dpul1000028485            410    Daphnia pulex                           metazoa>crustacea                        NCBI_GNO_77000002
Dpul1000023017   Dpul1000023017            127    Daphnia pulex                           metazoa>crustacea                        NCBI_GNO_3400152
115729679        LOC590134                 967    Strongylocentrotus purpuratus           metazoa>echinodermata                    PREDICTED: similar to AGL019Wp [Strongylocentrotus purpuratus].
Lgig1000015905   Lgig1000015905            607    Lottia gigantea                         metazoa>mollusca                         fgenesh2_pg.C_sca_241000004
Lgig1000009088   Lgig1000009088            417    Lottia gigantea                         metazoa>mollusca                         fgenesh2_pg.C_sca_5000107
Lgig1000009086   Lgig1000009086            728    Lottia gigantea                         metazoa>mollusca                         fgenesh2_pg.C_sca_5000102
118100723        LOC419250                 364    Gallus gallus                           metazoa>vertebrata                       PREDICTED: hypothetical protein [Gallus gallus].
109148956        LOC708797                 73     Macaca mulatta                          metazoa>vertebrata                       PREDICTED: hypothetical protein, partial [Macaca mulatta].
114665043        LOC749569                 964    Pan troglodytes                         metazoa>vertebrata                       PREDICTED: similar to RUN domain containing 2A [Pan troglodytes].
114664059        LOC454354                 889    Pan troglodytes                         metazoa>vertebrata                       PREDICTED: hypothetical protein [Pan troglodytes].
114664064        LOC735459                 389    Pan troglodytes                         metazoa>vertebrata                       PREDICTED: hypothetical protein [Pan troglodytes].
149066506        rCG_60259                 89     Rattus norvegicus                       metazoa>vertebrata                       rCG60259 [Rattus norvegicus].
119607544        hCG_15529                 176    Homo sapiens                            metazoa>vertebrata                       hCG15529, isoform CRA_a [Homo sapiens].
119570859        hCG_2042710               55     Homo sapiens                            metazoa>vertebrata                       hCG2042710 [Homo sapiens].
47222379         GSTEN:00025040:G:001      336    Tetraodon nigroviridis                  metazoa>vertebrata>actinopterygii        unnamed protein product [Tetraodon nigroviridis].
47201697         GSTEN:00001457:G:001      158    Tetraodon nigroviridis                  metazoa>vertebrata>actinopterygii        unnamed protein product [Tetraodon nigroviridis].
47219811         GSTEN:00022822:G:001      229    Tetraodon nigroviridis                  metazoa>vertebrata>actinopterygii        unnamed protein product [Tetraodon nigroviridis].
47188408         GSTEN:00036805:G:001      51     Tetraodon nigroviridis                  metazoa>vertebrata>actinopterygii        unnamed protein product [Tetraodon nigroviridis].