NCBI Home Page NCBI Site Search page NCBI Guide that lists and describes the NCBI resources
Conserved domains on  [gi|1039772322|ref|XP_017176552|]
View 

biorientation of chromosomes in cell division protein 1-like 1 isoform X6 [Mus musculus]

Protein Classification

UBX domain-containing protein; FRMD7 family protein( domain architecture ID 13708535)

UBX domain-containing protein similar to Schizosaccharomyces pombe protein C17C9.11c| FRMD7 family protein similar to Homo sapiens FERM domain-containing protein 7 (FRMD7) that plays a role in neurite development, and N-terminal region of Homo sapiens FERM, ARHGEF and pleckstrin domain-containing protein 1/2 that functions as guanine nucleotide exchange factor for RAC1

Graphical summary

 Zoom to residue level

show extra options »

Show site features     Horizontal zoom: ×

List of domain hits

Name Accession Description Interval E-value
COMPASS-Shg1 pfam05205
COMPASS (Complex proteins associated with Set1p) component shg1; The Shg1 subunit is one of ...
55-148 2.24e-28

COMPASS (Complex proteins associated with Set1p) component shg1; The Shg1 subunit is one of the eight subunits of the COMPASS complex, complex associated with SET1, conserved in yeasts and in other eukaryotes up to humans. It is associated with the region of the Set1 protein that is N-terminal to the C-terminus, ie Set1-560-900. The function of Shg1 seems to be to slightly inhibit histone 3 lysine 4 (H3K4) di- and tri-methylation, and it is a pioneer protein. The COMPASS complex functions to methylate the fourth lysine of Histone 3 and for silencing of genes close to the telomeres of chromosomes.


:

Pssm-ID: 461586 [Multi-domain]  Cd Length: 100  Bit Score: 111.13  E-value: 2.24e-28
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039772322   55 IVNHLKSQGLFDQFRRDCLADVDTKPAYQNLRQRVDNFVANHLATHTWSPHLNKNQLRNNIRQQVLKSGML---ESGIDR 131
Cdd:pfam05205    2 LVHEFKKKGGFDKLRKDILADFDTSDAYQNLLQRLEEIVESEVERDPSLLSKNRGKAAALIEGAIDRSDVYkkaEAGVDR 81
                           90
                   ....*....|....*....
gi 1039772322  132 IISQVV--DPKINHTFRPQ 148
Cdd:pfam05205   82 LIDQVLdiEPKIREIRRPE 100
PTZ00121 super family cl31754
MAEBL; Provisional
303-1041 4.46e-18

MAEBL; Provisional


The actual alignment was detected with superfamily member PTZ00121:

Pssm-ID: 173412 [Multi-domain]  Cd Length: 2084  Bit Score: 92.13  E-value: 4.46e-18
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039772322  303 KDAQQESTDPKIKSMDKGEKKPDGNEKGERKKEKKEKTEKK---IDHSKRNEDTQKVKDERQAKD-KEVESTKLPSE--K 376
Cdd:PTZ00121  1086 DNRADEATEEAFGKAEEAKKTETGKAEEARKAEEAKKKAEDarkAEEARKAEDARKAEEARKAEDaKRVEIARKAEDarK 1165
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039772322  377 SNSRARAAEGTKEDCSLLDSDVDGLTDI----TVSSVHTSDLSSFEEDTEEEVVVSESMEEGEITSEDEEKNKQNKAKVQ 452
Cdd:PTZ00121  1166 AEEARKAEDAKKAEAARKAEEVRKAEELrkaeDARKAEAARKAEEERKAEEARKAEDAKKAEAVKKAEEAKKDAEEAKKA 1245
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039772322  453 PGDSSDGKARGVRHAYVHKPYLYSKYYSDSDDELTVEQRRQSIAKEKEErlLRRRINREKLEEKRKQKAEKTKSSKVKSQ 532
Cdd:PTZ00121  1246 EEERNNEEIRKFEEARMAHFARRQAAIKAEEARKADELKKAEEKKKADE--AKKAEEKKKADEAKKKAEEAKKADEAKKK 1323
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039772322  533 GKSTVDLEDSSAKTLEPKAPRIKEVLKERKVLEKKVALSKRRRKDSRNVDENSKKKPQA----EEESKEALKTTEYCEKE 608
Cdd:PTZ00121  1324 AEEAKKKADAAKKKAEEAKKAAEAAKAEAEAAADEAEAAEEKAEAAEKKKEEAKKKADAakkkAEEKKKADEAKKKAEED 1403
                          330       340       350       360       370       380       390       400
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039772322  609 KASSKDLRHTHGKGEPSRPARRLSESLHSADENKTESKVEREYKRRTSTPVILEGAQEETDTRDGKKQPERSETNVEETQ 688
Cdd:PTZ00121  1404 KKKADELKKAAAAKKKADEAKKKAEEKKKADEAKKKAEEAKKADEAKKKAEEAKKAEEAKKKAEEAKKADEAKKKAEEAK 1483
                          410       420       430       440       450       460       470       480
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039772322  689 KQKSTLKNEKYQKKDDPETHGKGLPKKEAKSAKERPEKEKAQSEDKpsSKHKHKGDSVHKMSdetELHSSEKGETEESVR 768
Cdd:PTZ00121  1484 KADEAKKKAEEAKKKADEAKKAAEAKKKADEAKKAEEAKKADEAKK--AEEAKKADEAKKAE---EKKKADELKKAEELK 1558
                          490       500       510       520       530       540       550       560
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039772322  769 KqGQQTKLSSDDRTERKSKHKSERRLSVLGRDGKPVSEYTIKTDEHARKDNKKE-KHLSSEKSKAEHKSRrssDSKLQKD 847
Cdd:PTZ00121  1559 K-AEEKKKAEEAKKAEEDKNMALRKAEEAKKAEEARIEEVMKLYEEEKKMKAEEaKKAEEAKIKAEELKK---AEEEKKK 1634
                          570       580       590       600       610       620       630       640
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039772322  848 ALSSKQHSVTSQKRSESCSEDKCETDSTNADSSFKPEELPHKERRRTKSLLEDKVVSKSKSKGQSKQTKAAETEAQEgvt 927
Cdd:PTZ00121  1635 VEQLKKKEAEEKKKAEELKKAEEENKIKAAEEAKKAEEDKKKAEEAKKAEEDEKKAAEALKKEAEEAKKAEELKKKE--- 1711
                          650       660       670       680       690       700       710       720
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039772322  928 rqvttpKPDKEKNTEDNDTERQRKFKLEDRTSEEtvtdpalENTVSSAHSAQKDSGHRAKLASIKEKHKTDKDSTSSKLE 1007
Cdd:PTZ00121  1712 ------AEEKKKAEELKKAEEENKIKAEEAKKEA-------EEDKKKAEEAKKDEEEKKKIAHLKKEEEKKAEEIRKEKE 1778
                          730       740       750
                   ....*....|....*....|....*....|....
gi 1039772322 1008 RKVSDGHRSRSLKHSNKDMKKKEENKPDDKNGKE 1041
Cdd:PTZ00121  1779 AVIEEELDEEDEKRRMEVDKKIKDIFDNFANIIE 1812
 
Name Accession Description Interval E-value
COMPASS-Shg1 pfam05205
COMPASS (Complex proteins associated with Set1p) component shg1; The Shg1 subunit is one of ...
55-148 2.24e-28

COMPASS (Complex proteins associated with Set1p) component shg1; The Shg1 subunit is one of the eight subunits of the COMPASS complex, complex associated with SET1, conserved in yeasts and in other eukaryotes up to humans. It is associated with the region of the Set1 protein that is N-terminal to the C-terminus, ie Set1-560-900. The function of Shg1 seems to be to slightly inhibit histone 3 lysine 4 (H3K4) di- and tri-methylation, and it is a pioneer protein. The COMPASS complex functions to methylate the fourth lysine of Histone 3 and for silencing of genes close to the telomeres of chromosomes.


Pssm-ID: 461586 [Multi-domain]  Cd Length: 100  Bit Score: 111.13  E-value: 2.24e-28
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039772322   55 IVNHLKSQGLFDQFRRDCLADVDTKPAYQNLRQRVDNFVANHLATHTWSPHLNKNQLRNNIRQQVLKSGML---ESGIDR 131
Cdd:pfam05205    2 LVHEFKKKGGFDKLRKDILADFDTSDAYQNLLQRLEEIVESEVERDPSLLSKNRGKAAALIEGAIDRSDVYkkaEAGVDR 81
                           90
                   ....*....|....*....
gi 1039772322  132 IISQVV--DPKINHTFRPQ 148
Cdd:pfam05205   82 LIDQVLdiEPKIREIRRPE 100
PTZ00121 PTZ00121
MAEBL; Provisional
303-1041 4.46e-18

MAEBL; Provisional


Pssm-ID: 173412 [Multi-domain]  Cd Length: 2084  Bit Score: 92.13  E-value: 4.46e-18
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039772322  303 KDAQQESTDPKIKSMDKGEKKPDGNEKGERKKEKKEKTEKK---IDHSKRNEDTQKVKDERQAKD-KEVESTKLPSE--K 376
Cdd:PTZ00121  1086 DNRADEATEEAFGKAEEAKKTETGKAEEARKAEEAKKKAEDarkAEEARKAEDARKAEEARKAEDaKRVEIARKAEDarK 1165
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039772322  377 SNSRARAAEGTKEDCSLLDSDVDGLTDI----TVSSVHTSDLSSFEEDTEEEVVVSESMEEGEITSEDEEKNKQNKAKVQ 452
Cdd:PTZ00121  1166 AEEARKAEDAKKAEAARKAEEVRKAEELrkaeDARKAEAARKAEEERKAEEARKAEDAKKAEAVKKAEEAKKDAEEAKKA 1245
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039772322  453 PGDSSDGKARGVRHAYVHKPYLYSKYYSDSDDELTVEQRRQSIAKEKEErlLRRRINREKLEEKRKQKAEKTKSSKVKSQ 532
Cdd:PTZ00121  1246 EEERNNEEIRKFEEARMAHFARRQAAIKAEEARKADELKKAEEKKKADE--AKKAEEKKKADEAKKKAEEAKKADEAKKK 1323
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039772322  533 GKSTVDLEDSSAKTLEPKAPRIKEVLKERKVLEKKVALSKRRRKDSRNVDENSKKKPQA----EEESKEALKTTEYCEKE 608
Cdd:PTZ00121  1324 AEEAKKKADAAKKKAEEAKKAAEAAKAEAEAAADEAEAAEEKAEAAEKKKEEAKKKADAakkkAEEKKKADEAKKKAEED 1403
                          330       340       350       360       370       380       390       400
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039772322  609 KASSKDLRHTHGKGEPSRPARRLSESLHSADENKTESKVEREYKRRTSTPVILEGAQEETDTRDGKKQPERSETNVEETQ 688
Cdd:PTZ00121  1404 KKKADELKKAAAAKKKADEAKKKAEEKKKADEAKKKAEEAKKADEAKKKAEEAKKAEEAKKKAEEAKKADEAKKKAEEAK 1483
                          410       420       430       440       450       460       470       480
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039772322  689 KQKSTLKNEKYQKKDDPETHGKGLPKKEAKSAKERPEKEKAQSEDKpsSKHKHKGDSVHKMSdetELHSSEKGETEESVR 768
Cdd:PTZ00121  1484 KADEAKKKAEEAKKKADEAKKAAEAKKKADEAKKAEEAKKADEAKK--AEEAKKADEAKKAE---EKKKADELKKAEELK 1558
                          490       500       510       520       530       540       550       560
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039772322  769 KqGQQTKLSSDDRTERKSKHKSERRLSVLGRDGKPVSEYTIKTDEHARKDNKKE-KHLSSEKSKAEHKSRrssDSKLQKD 847
Cdd:PTZ00121  1559 K-AEEKKKAEEAKKAEEDKNMALRKAEEAKKAEEARIEEVMKLYEEEKKMKAEEaKKAEEAKIKAEELKK---AEEEKKK 1634
                          570       580       590       600       610       620       630       640
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039772322  848 ALSSKQHSVTSQKRSESCSEDKCETDSTNADSSFKPEELPHKERRRTKSLLEDKVVSKSKSKGQSKQTKAAETEAQEgvt 927
Cdd:PTZ00121  1635 VEQLKKKEAEEKKKAEELKKAEEENKIKAAEEAKKAEEDKKKAEEAKKAEEDEKKAAEALKKEAEEAKKAEELKKKE--- 1711
                          650       660       670       680       690       700       710       720
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039772322  928 rqvttpKPDKEKNTEDNDTERQRKFKLEDRTSEEtvtdpalENTVSSAHSAQKDSGHRAKLASIKEKHKTDKDSTSSKLE 1007
Cdd:PTZ00121  1712 ------AEEKKKAEELKKAEEENKIKAEEAKKEA-------EEDKKKAEEAKKDEEEKKKIAHLKKEEEKKAEEIRKEKE 1778
                          730       740       750
                   ....*....|....*....|....*....|....
gi 1039772322 1008 RKVSDGHRSRSLKHSNKDMKKKEENKPDDKNGKE 1041
Cdd:PTZ00121  1779 AVIEEELDEEDEKRRMEVDKKIKDIFDNFANIIE 1812
PspC_subgroup_1 NF033838
pneumococcal surface protein PspC, choline-binding form; The pneumococcal surface protein PspC, ...
527-943 6.95e-05

pneumococcal surface protein PspC, choline-binding form; The pneumococcal surface protein PspC, as described in Streptococcus pneumoniae, is a repetitive and highly variable protein, recognized by a conserved N-terminal domain and also by genomic location. This form, subgroup 1, has variable numbers of a choline-binding repeat in the C-terminal region, and is also known as choline-binding protein A. The other form, subgroup 2, is anchored covalently after cleavage by sortase at a C-terminal LPXTG site.


Pssm-ID: 468201 [Multi-domain]  Cd Length: 684  Bit Score: 48.47  E-value: 6.95e-05
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039772322  527 SKVKSQGKSTVDLEDSSAKTLEPKAPRIKEVLKERKVLEKKVALSKRRRKDSRNVDENSK---------KKPQAEEESKE 597
Cdd:NF033838    33 GVVHAEEVRGGNNPTVTSSGNESQKEHAKEVESHLEKILSEIQKSLDKRKHTQNVALNKKlsdikteylYELNVLKEKSE 112
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039772322  598 ALKTTEYCEKEKASSKDLrhthgKGEPSRPARRLSESLHSADENKTESKVEREYKRR---TSTPVILEGAQEETDTRDGK 674
Cdd:NF033838   113 AELTSKTKKELDAAFEQF-----KKDTLEPGKKVAEATKKVEEAEKKAKDQKEEDRRnypTNTYKTLELEIAESDVEVKK 187
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039772322  675 KQPERSETNVEETQkqkstlkNEKYQKKDDPETHGKglpKKEAKSAKE-RPEKEKAQSEDkpsskhKHKGDSVHKMSDET 753
Cdd:NF033838   188 AELELVKEEAKEPR-------DEEKIKQAKAKVESK---KAEATRLEKiKTDREKAEEEA------KRRADAKLKEAVEK 251
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039772322  754 ELHSSEKGETEESVrKQGQQTKLSSDDRTERKSKHKS-----ERRLSVLGRDGKPVSEYTIKTDEHARK--DNKKEKHLS 826
Cdd:NF033838   252 NVATSEQDKPKRRA-KRGVLGEPATPDKKENDAKSSDssvgeETLPSPSLKPEKKVAEAEKKVEEAKKKakDQKEEDRRN 330
                          330       340       350       360       370       380       390       400
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039772322  827 --SEKSKAEHKSRRSSDSKLQKDALSSKQHSVTSQKRSESCSEDKCETDSTNADSS----FKPEELPHKERRRTKSLLED 900
Cdd:NF033838   331 ypTNTYKTLELEIAESDVKVKEAELELVKEEAKEPRNEEKIKQAKAKVESKKAEATrlekIKTDRKKAEEEAKRKAAEED 410
                          410       420       430       440
                   ....*....|....*....|....*....|....*....|....*
gi 1039772322  901 KVVSKSKSKGQSKQTKAAETEA--QEGVTRQVTTPKPDKEKNTED 943
Cdd:NF033838   411 KVKEKPAEQPQPAPAPQPEKPApkPEKPAEQPKAEKPADQQAEED 455
Semenogelin pfam05474
Semenogelin; This family consists of several mammalian secreted seminal proteins including ...
634-957 3.61e-04

Semenogelin; This family consists of several mammalian secreted seminal proteins including semenogelin I and II. Freshly ejaculated human semen has the appearance of a loose gel in which the predominant structural protein components are the seminal vesicle secreted semenogelins (Sg). This family also includes seminal vesicle secretory protein 3A from mouse, which has been shown to be involved in the coagulation of semen resulting in the formation of the copulatory plug.


Pssm-ID: 368458 [Multi-domain]  Cd Length: 582  Bit Score: 46.02  E-value: 3.61e-04
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039772322  634 SLHSADENKTESKVEREYKRRTSTPVIlegAQEETDTRDGKKQPERSETNVEETQKQKSTLKNEKYQKKDDPETHGkglP 713
Cdd:pfam05474  177 SASGAQKGRTQGGSQSSYVLQTEELVV---NKQQRETKNSHQNKGHYQNVVDVREEHSSKLQTSLHPAHQDRLQHG---P 250
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039772322  714 KKEAKSAKERPEKEKAQSEDKPSSKHKHKGDSVHKMS------DETELHSSEKgeteeSVRKQGQQTKLSSddRTERKSK 787
Cdd:pfam05474  251 KDIFTTQDELLVYNKNQHQTKNLSQDQEHGRKAHKISypssrtEERQLHHGEK-----SVQKDVSKGSISI--QTEEKIH 323
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039772322  788 HKSERRLSVLGRDgkpvseytiktDEHARKDNKkekhLSSEKSKAEHKSRRSSDSKLQKdALSSKQHSVTSQKRSESCSE 867
Cdd:pfam05474  324 GKSQNQVTIHSQD-----------QEHGHKENK----ISYQSSSTEERHLNCGEKGIQK-GVSKGSISIQTEEQIHGKSQ 387
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039772322  868 DKCETDSTNADSSFKPEELPHKERRRTKSLLEDKVVSKSKSKGQSKQTKAAETEAQEGVTRQVTTPKPDKEKNTEDNDTE 947
Cdd:pfam05474  388 NQVRIPSQAQEYGHKENKISYQSSSTEERRLNSGEKDVQKGVSKGSISIQTEEKIHGKSQNQVTIPSQDQEHGHKENKMS 467
                          330
                   ....*....|
gi 1039772322  948 RQRKFKLEDR 957
Cdd:pfam05474  468 YQSSSTEERR 477
SMC_prok_A TIGR02169
chromosome segregation protein SMC, primarily archaeal type; SMC (structural maintenance of ...
489-842 8.83e-04

chromosome segregation protein SMC, primarily archaeal type; SMC (structural maintenance of chromosomes) proteins bind DNA and act in organizing and segregating chromosomes for partition. SMC proteins are found in bacteria, archaea, and eukaryotes. It is found in a single copy and is homodimeric in prokaryotes, but six paralogs (excluded from this family) are found in eukarotes, where SMC proteins are heterodimeric. This family represents the SMC protein of archaea and a few bacteria (Aquifex, Synechocystis, etc); the SMC of other bacteria is described by TIGR02168. The N- and C-terminal domains of this protein are well conserved, but the central hinge region is skewed in composition and highly divergent. [Cellular processes, Cell division, DNA metabolism, Chromosome-associated proteins]


Pssm-ID: 274009 [Multi-domain]  Cd Length: 1164  Bit Score: 45.06  E-value: 8.83e-04
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039772322  489 EQRRQSIAKEKEERLLRRRINREKLEEKRKQKAEKTKsSKVKSQGKSTVDLEDSsaktlepkaprIKEVLKERKVLEKKV 568
Cdd:TIGR02169  186 IERLDLIIDEKRQQLERLRREREKAERYQALLKEKRE-YEGYELLKEKEALERQ-----------KEAIERQLASLEEEL 253
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039772322  569 ALSKRRRKDsrNVDENSKKKPQAEEESKEALKTTEycEKEKASSKDLRHTHGKGEPSRPARRLSES-LHSADENKTESKV 647
Cdd:TIGR02169  254 EKLTEEISE--LEKRLEEIEQLLEELNKKIKDLGE--EEQLRVKEKIGELEAEIASLERSIAEKEReLEDAEERLAKLEA 329
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039772322  648 EREYKRRtstpvilegaqeetdtrdgkkQPERSETNVEETQKQKSTLKNEKYQKKDDPETHGKGLPKKEAKSAKERPEKE 727
Cdd:TIGR02169  330 EIDKLLA---------------------EIEELEREIEEERKRRDKLTEEYAELKEELEDLRAELEEVDKEFAETRDELK 388
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039772322  728 KAQSE-DKPSSKHKHKGDSVHKMSDETELHSSEKGETEESV-RKQGQQTKLSSDDRTERKSKHKSERRLSVLGRDGKPVS 805
Cdd:TIGR02169  389 DYREKlEKLKREINELKRELDRLQEELQRLSEELADLNAAIaGIEAKINELEEEKEDKALEIKKQEWKLEQLAADLSKYE 468
                          330       340       350
                   ....*....|....*....|....*....|....*..
gi 1039772322  806 EYTIKTDEHARKDNKKEKHLSSEKSKAEHKSRRSSDS 842
Cdd:TIGR02169  469 QELYDLKEEYDRVEKELSKLQRELAEAEAQARASEER 505
PspC_subgroup_1 NF033838
pneumococcal surface protein PspC, choline-binding form; The pneumococcal surface protein PspC, ...
678-1031 1.07e-03

pneumococcal surface protein PspC, choline-binding form; The pneumococcal surface protein PspC, as described in Streptococcus pneumoniae, is a repetitive and highly variable protein, recognized by a conserved N-terminal domain and also by genomic location. This form, subgroup 1, has variable numbers of a choline-binding repeat in the C-terminal region, and is also known as choline-binding protein A. The other form, subgroup 2, is anchored covalently after cleavage by sortase at a C-terminal LPXTG site.


Pssm-ID: 468201 [Multi-domain]  Cd Length: 684  Bit Score: 44.62  E-value: 1.07e-03
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039772322  678 ERSETNVEETQKQKSTLKNEKYqKKDDPEthgkglPKKEAKSAKERPEKEKAQSEDKPSSKHKHKGDSVHKmSDETELHS 757
Cdd:NF033838   109 EKSEAELTSKTKKELDAAFEQF-KKDTLE------PGKKVAEATKKVEEAEKKAKDQKEEDRRNYPTNTYK-TLELEIAE 180
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039772322  758 SEKGETEESVRKQGQQTKLSSDDRTERKSKHKSERRLSVLGRDGKpvseytIKTDEHARKDNKKEKHLSSEKSKAEHKSR 837
Cdd:NF033838   181 SDVEVKKAELELVKEEAKEPRDEEKIKQAKAKVESKKAEATRLEK------IKTDREKAEEEAKRRADAKLKEAVEKNVA 254
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039772322  838 RSSDSKLQKDAlsskqhsvtsqKRS---ESCSEDKCETDSTNADSSFKPEELPhkerrrTKSLLEDKvvskskskgqskq 914
Cdd:NF033838   255 TSEQDKPKRRA-----------KRGvlgEPATPDKKENDAKSSDSSVGEETLP------SPSLKPEK------------- 304
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039772322  915 tKAAETEAQegvtrqvttpKPDKEKNTEDNdterqrkfKLEDRTSEETVTDPALENTVSSAHSAQKDsghrAKLASIKEK 994
Cdd:NF033838   305 -KVAEAEKK----------VEEAKKKAKDQ--------KEEDRRNYPTNTYKTLELEIAESDVKVKE----AELELVKEE 361
                          330       340       350       360
                   ....*....|....*....|....*....|....*....|..
gi 1039772322  995 HKTDKD-----STSSKLERKVSDGHRSRSLKhsnKDMKKKEE 1031
Cdd:NF033838   362 AKEPRNeekikQAKAKVESKKAEATRLEKIK---TDRKKAEE 400
COG5022 COG5022
Myosin heavy chain [General function prediction only];
491-733 2.64e-03

Myosin heavy chain [General function prediction only];


Pssm-ID: 227355 [Multi-domain]  Cd Length: 1463  Bit Score: 43.53  E-value: 2.64e-03
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039772322  491 RRQSIAKEKEERLLRRRINREKLEEKRKQKAEKTKSSKVKS-QGKSTVDLEDSSAKTLEPKApriKEVLKERKVLEKKVA 569
Cdd:COG5022    814 SYLACIIKLQKTIKREKKLRETEEVEFSLKAEVLIQKFGRSlKAKKRFSLLKKETIYLQSAQ---RVELAERQLQELKID 890
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039772322  570 LSKRR-------RKDSRNVDENS--KKKPQAEEESKEALKTTEYCEKEKASSKDlrhthgkgEPSRPARRLSE--SLHSA 638
Cdd:COG5022    891 VKSISslklvnlELESEIIELKKslSSDLIENLEFKTELIARLKKLLNNIDLEE--------GPSIEYVKLPElnKLHEV 962
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039772322  639 DENKTESKVERE--YKRRTSTPVILEGAQEE--------TDTRDGKKQPERSETNVEETQKQKSTLKNEkyQKKDDPETH 708
Cdd:COG5022    963 ESKLKETSEEYEdlLKKSTILVREGNKANSElknfkkelAELSKQYGALQESTKQLKELPVEVAELQSA--SKIISSEST 1040
                          250       260
                   ....*....|....*....|....*
gi 1039772322  709 GKGLPKKEAKSAKERPEKEKAQSED 733
Cdd:COG5022   1041 ELSILKPLQKLKGLLLLENNQLQAR 1065
 
Name Accession Description Interval E-value
COMPASS-Shg1 pfam05205
COMPASS (Complex proteins associated with Set1p) component shg1; The Shg1 subunit is one of ...
55-148 2.24e-28

COMPASS (Complex proteins associated with Set1p) component shg1; The Shg1 subunit is one of the eight subunits of the COMPASS complex, complex associated with SET1, conserved in yeasts and in other eukaryotes up to humans. It is associated with the region of the Set1 protein that is N-terminal to the C-terminus, ie Set1-560-900. The function of Shg1 seems to be to slightly inhibit histone 3 lysine 4 (H3K4) di- and tri-methylation, and it is a pioneer protein. The COMPASS complex functions to methylate the fourth lysine of Histone 3 and for silencing of genes close to the telomeres of chromosomes.


Pssm-ID: 461586 [Multi-domain]  Cd Length: 100  Bit Score: 111.13  E-value: 2.24e-28
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039772322   55 IVNHLKSQGLFDQFRRDCLADVDTKPAYQNLRQRVDNFVANHLATHTWSPHLNKNQLRNNIRQQVLKSGML---ESGIDR 131
Cdd:pfam05205    2 LVHEFKKKGGFDKLRKDILADFDTSDAYQNLLQRLEEIVESEVERDPSLLSKNRGKAAALIEGAIDRSDVYkkaEAGVDR 81
                           90
                   ....*....|....*....
gi 1039772322  132 IISQVV--DPKINHTFRPQ 148
Cdd:pfam05205   82 LIDQVLdiEPKIREIRRPE 100
PTZ00121 PTZ00121
MAEBL; Provisional
303-1041 4.46e-18

MAEBL; Provisional


Pssm-ID: 173412 [Multi-domain]  Cd Length: 2084  Bit Score: 92.13  E-value: 4.46e-18
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039772322  303 KDAQQESTDPKIKSMDKGEKKPDGNEKGERKKEKKEKTEKK---IDHSKRNEDTQKVKDERQAKD-KEVESTKLPSE--K 376
Cdd:PTZ00121  1086 DNRADEATEEAFGKAEEAKKTETGKAEEARKAEEAKKKAEDarkAEEARKAEDARKAEEARKAEDaKRVEIARKAEDarK 1165
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039772322  377 SNSRARAAEGTKEDCSLLDSDVDGLTDI----TVSSVHTSDLSSFEEDTEEEVVVSESMEEGEITSEDEEKNKQNKAKVQ 452
Cdd:PTZ00121  1166 AEEARKAEDAKKAEAARKAEEVRKAEELrkaeDARKAEAARKAEEERKAEEARKAEDAKKAEAVKKAEEAKKDAEEAKKA 1245
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039772322  453 PGDSSDGKARGVRHAYVHKPYLYSKYYSDSDDELTVEQRRQSIAKEKEErlLRRRINREKLEEKRKQKAEKTKSSKVKSQ 532
Cdd:PTZ00121  1246 EEERNNEEIRKFEEARMAHFARRQAAIKAEEARKADELKKAEEKKKADE--AKKAEEKKKADEAKKKAEEAKKADEAKKK 1323
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039772322  533 GKSTVDLEDSSAKTLEPKAPRIKEVLKERKVLEKKVALSKRRRKDSRNVDENSKKKPQA----EEESKEALKTTEYCEKE 608
Cdd:PTZ00121  1324 AEEAKKKADAAKKKAEEAKKAAEAAKAEAEAAADEAEAAEEKAEAAEKKKEEAKKKADAakkkAEEKKKADEAKKKAEED 1403
                          330       340       350       360       370       380       390       400
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039772322  609 KASSKDLRHTHGKGEPSRPARRLSESLHSADENKTESKVEREYKRRTSTPVILEGAQEETDTRDGKKQPERSETNVEETQ 688
Cdd:PTZ00121  1404 KKKADELKKAAAAKKKADEAKKKAEEKKKADEAKKKAEEAKKADEAKKKAEEAKKAEEAKKKAEEAKKADEAKKKAEEAK 1483
                          410       420       430       440       450       460       470       480
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039772322  689 KQKSTLKNEKYQKKDDPETHGKGLPKKEAKSAKERPEKEKAQSEDKpsSKHKHKGDSVHKMSdetELHSSEKGETEESVR 768
Cdd:PTZ00121  1484 KADEAKKKAEEAKKKADEAKKAAEAKKKADEAKKAEEAKKADEAKK--AEEAKKADEAKKAE---EKKKADELKKAEELK 1558
                          490       500       510       520       530       540       550       560
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039772322  769 KqGQQTKLSSDDRTERKSKHKSERRLSVLGRDGKPVSEYTIKTDEHARKDNKKE-KHLSSEKSKAEHKSRrssDSKLQKD 847
Cdd:PTZ00121  1559 K-AEEKKKAEEAKKAEEDKNMALRKAEEAKKAEEARIEEVMKLYEEEKKMKAEEaKKAEEAKIKAEELKK---AEEEKKK 1634
                          570       580       590       600       610       620       630       640
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039772322  848 ALSSKQHSVTSQKRSESCSEDKCETDSTNADSSFKPEELPHKERRRTKSLLEDKVVSKSKSKGQSKQTKAAETEAQEgvt 927
Cdd:PTZ00121  1635 VEQLKKKEAEEKKKAEELKKAEEENKIKAAEEAKKAEEDKKKAEEAKKAEEDEKKAAEALKKEAEEAKKAEELKKKE--- 1711
                          650       660       670       680       690       700       710       720
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039772322  928 rqvttpKPDKEKNTEDNDTERQRKFKLEDRTSEEtvtdpalENTVSSAHSAQKDSGHRAKLASIKEKHKTDKDSTSSKLE 1007
Cdd:PTZ00121  1712 ------AEEKKKAEELKKAEEENKIKAEEAKKEA-------EEDKKKAEEAKKDEEEKKKIAHLKKEEEKKAEEIRKEKE 1778
                          730       740       750
                   ....*....|....*....|....*....|....
gi 1039772322 1008 RKVSDGHRSRSLKHSNKDMKKKEENKPDDKNGKE 1041
Cdd:PTZ00121  1779 AVIEEELDEEDEKRRMEVDKKIKDIFDNFANIIE 1812
PTZ00121 PTZ00121
MAEBL; Provisional
496-1052 9.18e-18

MAEBL; Provisional


Pssm-ID: 173412 [Multi-domain]  Cd Length: 2084  Bit Score: 91.36  E-value: 9.18e-18
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039772322  496 AKEKEERLLRRRINREKLEEKRKQKAEKTKSSKVKSQGKSTVD-LEDSSAKTLEPKAPRIKEVLKERKVLEKKVALSKRR 574
Cdd:PTZ00121  1327 AKKKADAAKKKAEEAKKAAEAAKAEAEAAADEAEAAEEKAEAAeKKKEEAKKKADAAKKKAEEKKKADEAKKKAEEDKKK 1406
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039772322  575 RKDSRNVDENSKKKPQAEEESKEALKTTEYceKEKASSKdlRHTHGKGEPSRPARRLSESLHSADENKTESKVEREYKRR 654
Cdd:PTZ00121  1407 ADELKKAAAAKKKADEAKKKAEEKKKADEA--KKKAEEA--KKADEAKKKAEEAKKAEEAKKKAEEAKKADEAKKKAEEA 1482
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039772322  655 TSTPVILEGAQEETDTRDGKKQPERSETNVEETQKQKSTLKNEKYQKKDD---PETHGKGLPKKEAKSAKERPEKEKAQS 731
Cdd:PTZ00121  1483 KKADEAKKKAEEAKKKADEAKKAAEAKKKADEAKKAEEAKKADEAKKAEEakkADEAKKAEEKKKADELKKAEELKKAEE 1562
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039772322  732 EDKPSSKHKHKGDSVHKMSDETELHSSEKGETEESVRKQGQQTKLSSDD-RTERKSKHKSErRLSVLGRDGKPVSEYTIK 810
Cdd:PTZ00121  1563 KKKAEEAKKAEEDKNMALRKAEEAKKAEEARIEEVMKLYEEEKKMKAEEaKKAEEAKIKAE-ELKKAEEEKKKVEQLKKK 1641
                          330       340       350       360       370       380       390       400
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039772322  811 TDEHARKDNKKEKHLSSEKSKAEHKSRRSSDSKlqKDALSSKQHSVTSQKRSESCSEDKCETDSTNADSSFKPEELPHKE 890
Cdd:PTZ00121  1642 EAEEKKKAEELKKAEEENKIKAAEEAKKAEEDK--KKAEEAKKAEEDEKKAAEALKKEAEEAKKAEELKKKEAEEKKKAE 1719
                          410       420       430       440       450       460       470       480
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039772322  891 RRRTKSLLEDKVVSKSKSKGQSKQTKAAETEAQEGVTRQVTTPKPDKEKNTEDNDTERQRKFKLE------------DRT 958
Cdd:PTZ00121  1720 ELKKAEEENKIKAEEAKKEAEEDKKKAEEAKKDEEEKKKIAHLKKEEEKKAEEIRKEKEAVIEEEldeedekrrmevDKK 1799
                          490       500       510       520       530       540       550       560
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039772322  959 SEET----------------VTDPALENTVSSAHSAQKDSGHRAKLASIKEKHKTDKDSTSSKLERKVSDGHRSRSLKHS 1022
Cdd:PTZ00121  1800 IKDIfdnfaniieggkegnlVINDSKEMEDSAIKEVADSKNMQLEEADAFEKHKFNKNNENGEDGNKEADFNKEKDLKED 1879
                          570       580       590
                   ....*....|....*....|....*....|.
gi 1039772322 1023 N-KDMKKKEENKPDDKNGKEVDSSHEKGRGN 1052
Cdd:PTZ00121  1880 DeEEIEEADEIEKIDKDDIEREIPNNNMAGK 1910
PTZ00121 PTZ00121
MAEBL; Provisional
493-1378 2.28e-17

MAEBL; Provisional


Pssm-ID: 173412 [Multi-domain]  Cd Length: 2084  Bit Score: 89.82  E-value: 2.28e-17
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039772322  493 QSIAKEKEERLLRRRINREKLEEKRKQKAEKTKSSKVKSQGKSTVDLEDSSAKTLEPKAPRIKEVLKERKVLEKKVALSK 572
Cdd:PTZ00121  1022 QNFNIEKIEELTEYGNNDDVLKEKDIIDEDIDGNHEGKAEAKAHVGQDEGLKPSYKDFDFDAKEDNRADEATEEAFGKAE 1101
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039772322  573 RRRKDSRNVDENSKKKPQAEEESKEALKTTEYCEKEKA-------SSKDLRHTHgKGEPSRPARRLSESLHSADENKTES 645
Cdd:PTZ00121  1102 EAKKTETGKAEEARKAEEAKKKAEDARKAEEARKAEDArkaeearKAEDAKRVE-IARKAEDARKAEEARKAEDAKKAEA 1180
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039772322  646 KVEREYKRRTSTPVILEGAQEETDTRDGKKQPERSETNVEETQKQKSTLKNEKYQKKDDPETHG--KGLPKKEAKSAKER 723
Cdd:PTZ00121  1181 ARKAEEVRKAEELRKAEDARKAEAARKAEEERKAEEARKAEDAKKAEAVKKAEEAKKDAEEAKKaeEERNNEEIRKFEEA 1260
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039772322  724 PEKEKAQSEDKPSSKHKHKGDSVHKMSdetELHSSEKGETEESVRKQGQQTKLSSDDRTERKSKHKSERRLSVLGRDGKP 803
Cdd:PTZ00121  1261 RMAHFARRQAAIKAEEARKADELKKAE---EKKKADEAKKAEEKKKADEAKKKAEEAKKADEAKKKAEEAKKKADAAKKK 1337
                          330       340       350       360       370       380       390       400
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039772322  804 VSEYTIKTDEHARKDNKKEKHLSSEKSKAEHKSRRSSDSKLQKDALSSKqhsvTSQKRSESCSEDKCETDSTNADSSFKP 883
Cdd:PTZ00121  1338 AEEAKKAAEAAKAEAEAAADEAEAAEEKAEAAEKKKEEAKKKADAAKKK----AEEKKKADEAKKKAEEDKKKADELKKA 1413
                          410       420       430       440       450       460       470       480
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039772322  884 EElPHKERRRTKSLLEDKVVSKSKSKGQSKQTKAAETEAQEGVTRQVTTPKpdkeKNTEDNDTERQRKFKLEDRTSEETV 963
Cdd:PTZ00121  1414 AA-AKKKADEAKKKAEEKKKADEAKKKAEEAKKADEAKKKAEEAKKAEEAK----KKAEEAKKADEAKKKAEEAKKADEA 1488
                          490       500       510       520       530       540       550       560
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039772322  964 TDPALENTVSSAHSAQKDSGHRAKLASIKEKHKTDKDSTSSKLERKVSDGHRSRSLKHSNKDMKKKEENKPDDKNGKEVD 1043
Cdd:PTZ00121  1489 KKKAEEAKKKADEAKKAAEAKKKADEAKKAEEAKKADEAKKAEEAKKADEAKKAEEKKKADELKKAEELKKAEEKKKAEE 1568
                          570       580       590       600       610       620       630       640
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039772322 1044 SSHEKGRGNGPVTEKKLSRRLCENRRGSTSQEMAKEDKLVANMSGTTSSSSLQRPKKSTETTSIPEQEPMEIDSEAAVEN 1123
Cdd:PTZ00121  1569 AKKAEEDKNMALRKAEEAKKAEEARIEEVMKLYEEEKKMKAEEAKKAEEAKIKAEELKKAEEEKKKVEQLKKKEAEEKKK 1648
                          650       660       670       680       690       700       710       720
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039772322 1124 VSELSKTEDISSNSSQQDTDFENVTKHKAtagvlkDEFRTSMVDSKPAA-AVTCKSGRGLAVTSISERHADHK---STLT 1199
Cdd:PTZ00121  1649 AEELKKAEEENKIKAAEEAKKAEEDKKKA------EEAKKAEEDEKKAAeALKKEAEEAKKAEELKKKEAEEKkkaEELK 1722
                          730       740       750       760       770       780       790       800
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039772322 1200 KKVHSQGNPSKAAPREREPIQRGAQEVSVDSEVSRKALSRAPSENEKGQKNLKGMSKTTEECGTHRNASLEYSTDSDLLS 1279
Cdd:PTZ00121  1723 KAEEENKIKAEEAKKEAEEDKKKAEEAKKDEEEKKKIAHLKKEEEKKAEEIRKEKEAVIEEELDEEDEKRRMEVDKKIKD 1802
                          810       820       830       840       850       860       870       880
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039772322 1280 SSGSVTVVpqKESHNSNTIPVIDRE-----AISEGGRASTSLANHSDVPNQYSTVKKSEvhktnGSKEGNdgftvdmpTK 1354
Cdd:PTZ00121  1803 IFDNFANI--IEGGKEGNLVINDSKemedsAIKEVADSKNMQLEEADAFEKHKFNKNNE-----NGEDGN--------KE 1867
                          890       900
                   ....*....|....*....|....
gi 1039772322 1355 ANGGSKRHLSEDSQATLLYSKESK 1378
Cdd:PTZ00121  1868 ADFNKEKDLKEDDEEEIEEADEIE 1891
PspC_subgroup_1 NF033838
pneumococcal surface protein PspC, choline-binding form; The pneumococcal surface protein PspC, ...
527-943 6.95e-05

pneumococcal surface protein PspC, choline-binding form; The pneumococcal surface protein PspC, as described in Streptococcus pneumoniae, is a repetitive and highly variable protein, recognized by a conserved N-terminal domain and also by genomic location. This form, subgroup 1, has variable numbers of a choline-binding repeat in the C-terminal region, and is also known as choline-binding protein A. The other form, subgroup 2, is anchored covalently after cleavage by sortase at a C-terminal LPXTG site.


Pssm-ID: 468201 [Multi-domain]  Cd Length: 684  Bit Score: 48.47  E-value: 6.95e-05
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039772322  527 SKVKSQGKSTVDLEDSSAKTLEPKAPRIKEVLKERKVLEKKVALSKRRRKDSRNVDENSK---------KKPQAEEESKE 597
Cdd:NF033838    33 GVVHAEEVRGGNNPTVTSSGNESQKEHAKEVESHLEKILSEIQKSLDKRKHTQNVALNKKlsdikteylYELNVLKEKSE 112
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039772322  598 ALKTTEYCEKEKASSKDLrhthgKGEPSRPARRLSESLHSADENKTESKVEREYKRR---TSTPVILEGAQEETDTRDGK 674
Cdd:NF033838   113 AELTSKTKKELDAAFEQF-----KKDTLEPGKKVAEATKKVEEAEKKAKDQKEEDRRnypTNTYKTLELEIAESDVEVKK 187
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039772322  675 KQPERSETNVEETQkqkstlkNEKYQKKDDPETHGKglpKKEAKSAKE-RPEKEKAQSEDkpsskhKHKGDSVHKMSDET 753
Cdd:NF033838   188 AELELVKEEAKEPR-------DEEKIKQAKAKVESK---KAEATRLEKiKTDREKAEEEA------KRRADAKLKEAVEK 251
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039772322  754 ELHSSEKGETEESVrKQGQQTKLSSDDRTERKSKHKS-----ERRLSVLGRDGKPVSEYTIKTDEHARK--DNKKEKHLS 826
Cdd:NF033838   252 NVATSEQDKPKRRA-KRGVLGEPATPDKKENDAKSSDssvgeETLPSPSLKPEKKVAEAEKKVEEAKKKakDQKEEDRRN 330
                          330       340       350       360       370       380       390       400
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039772322  827 --SEKSKAEHKSRRSSDSKLQKDALSSKQHSVTSQKRSESCSEDKCETDSTNADSS----FKPEELPHKERRRTKSLLED 900
Cdd:NF033838   331 ypTNTYKTLELEIAESDVKVKEAELELVKEEAKEPRNEEKIKQAKAKVESKKAEATrlekIKTDRKKAEEEAKRKAAEED 410
                          410       420       430       440
                   ....*....|....*....|....*....|....*....|....*
gi 1039772322  901 KVVSKSKSKGQSKQTKAAETEA--QEGVTRQVTTPKPDKEKNTED 943
Cdd:NF033838   411 KVKEKPAEQPQPAPAPQPEKPApkPEKPAEQPKAEKPADQQAEED 455
Semenogelin pfam05474
Semenogelin; This family consists of several mammalian secreted seminal proteins including ...
634-957 3.61e-04

Semenogelin; This family consists of several mammalian secreted seminal proteins including semenogelin I and II. Freshly ejaculated human semen has the appearance of a loose gel in which the predominant structural protein components are the seminal vesicle secreted semenogelins (Sg). This family also includes seminal vesicle secretory protein 3A from mouse, which has been shown to be involved in the coagulation of semen resulting in the formation of the copulatory plug.


Pssm-ID: 368458 [Multi-domain]  Cd Length: 582  Bit Score: 46.02  E-value: 3.61e-04
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039772322  634 SLHSADENKTESKVEREYKRRTSTPVIlegAQEETDTRDGKKQPERSETNVEETQKQKSTLKNEKYQKKDDPETHGkglP 713
Cdd:pfam05474  177 SASGAQKGRTQGGSQSSYVLQTEELVV---NKQQRETKNSHQNKGHYQNVVDVREEHSSKLQTSLHPAHQDRLQHG---P 250
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039772322  714 KKEAKSAKERPEKEKAQSEDKPSSKHKHKGDSVHKMS------DETELHSSEKgeteeSVRKQGQQTKLSSddRTERKSK 787
Cdd:pfam05474  251 KDIFTTQDELLVYNKNQHQTKNLSQDQEHGRKAHKISypssrtEERQLHHGEK-----SVQKDVSKGSISI--QTEEKIH 323
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039772322  788 HKSERRLSVLGRDgkpvseytiktDEHARKDNKkekhLSSEKSKAEHKSRRSSDSKLQKdALSSKQHSVTSQKRSESCSE 867
Cdd:pfam05474  324 GKSQNQVTIHSQD-----------QEHGHKENK----ISYQSSSTEERHLNCGEKGIQK-GVSKGSISIQTEEQIHGKSQ 387
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039772322  868 DKCETDSTNADSSFKPEELPHKERRRTKSLLEDKVVSKSKSKGQSKQTKAAETEAQEGVTRQVTTPKPDKEKNTEDNDTE 947
Cdd:pfam05474  388 NQVRIPSQAQEYGHKENKISYQSSSTEERRLNSGEKDVQKGVSKGSISIQTEEKIHGKSQNQVTIPSQDQEHGHKENKMS 467
                          330
                   ....*....|
gi 1039772322  948 RQRKFKLEDR 957
Cdd:pfam05474  468 YQSSSTEERR 477
DUF5401 pfam17380
Family of unknown function (DUF5401); This is a family of unknown function found in ...
487-811 6.26e-04

Family of unknown function (DUF5401); This is a family of unknown function found in Chromadorea.


Pssm-ID: 375164 [Multi-domain]  Cd Length: 722  Bit Score: 45.50  E-value: 6.26e-04
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039772322  487 TVEQRRQsiaKEKEERLLRRRINREKLE-----EKRKQKAEKTKSSKVKSQGKSTVDLE-DSSAKTLEPKAPRIKEVLKE 560
Cdd:pfam17380  283 AVSERQQ---QEKFEKMEQERLRQEKEEkarevERRRKLEEAEKARQAEMDRQAAIYAEqERMAMERERELERIRQEERK 359
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039772322  561 R---KVLEKKVALSKRRRKDSRNVDENSKKKPQAEEESKEALKTTEYCEKEKAsskdlrhthgkgEPSRPARRLSESLHS 637
Cdd:pfam17380  360 ReleRIRQEEIAMEISRMRELERLQMERQQKNERVRQELEAARKVKILEEERQ------------RKIQQQKVEMEQIRA 427
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039772322  638 ADENKTESKVEREYKRRTSTpviLEGAQEETDTRdgKKQPERSETNVEETQKQKSTLKNEKYQKKDDPETHGKGLPKKEA 717
Cdd:pfam17380  428 EQEEARQREVRRLEEERARE---MERVRLEEQER--QQQVERLRQQEEERKRKKLELEKEKRDRKRAEEQRRKILEKELE 502
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039772322  718 KSAKERPEKEKAQSEDKPSSKHKHKGDSVHKMSDETELHSSEKGETEESVRKQgQQTKLSSDDRTERKSKHKSERRLSVL 797
Cdd:pfam17380  503 ERKQAMIEEERKRKLLEKEMEERQKAIYEEERRREAEEERRKQQEMEERRRIQ-EQMRKATEERSRLEAMEREREMMRQI 581
                          330
                   ....*....|....
gi 1039772322  798 GRDGKPVSEYTIKT 811
Cdd:pfam17380  582 VESEKARAEYEATT 595
SMC_prok_A TIGR02169
chromosome segregation protein SMC, primarily archaeal type; SMC (structural maintenance of ...
489-842 8.83e-04

chromosome segregation protein SMC, primarily archaeal type; SMC (structural maintenance of chromosomes) proteins bind DNA and act in organizing and segregating chromosomes for partition. SMC proteins are found in bacteria, archaea, and eukaryotes. It is found in a single copy and is homodimeric in prokaryotes, but six paralogs (excluded from this family) are found in eukarotes, where SMC proteins are heterodimeric. This family represents the SMC protein of archaea and a few bacteria (Aquifex, Synechocystis, etc); the SMC of other bacteria is described by TIGR02168. The N- and C-terminal domains of this protein are well conserved, but the central hinge region is skewed in composition and highly divergent. [Cellular processes, Cell division, DNA metabolism, Chromosome-associated proteins]


Pssm-ID: 274009 [Multi-domain]  Cd Length: 1164  Bit Score: 45.06  E-value: 8.83e-04
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039772322  489 EQRRQSIAKEKEERLLRRRINREKLEEKRKQKAEKTKsSKVKSQGKSTVDLEDSsaktlepkaprIKEVLKERKVLEKKV 568
Cdd:TIGR02169  186 IERLDLIIDEKRQQLERLRREREKAERYQALLKEKRE-YEGYELLKEKEALERQ-----------KEAIERQLASLEEEL 253
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039772322  569 ALSKRRRKDsrNVDENSKKKPQAEEESKEALKTTEycEKEKASSKDLRHTHGKGEPSRPARRLSES-LHSADENKTESKV 647
Cdd:TIGR02169  254 EKLTEEISE--LEKRLEEIEQLLEELNKKIKDLGE--EEQLRVKEKIGELEAEIASLERSIAEKEReLEDAEERLAKLEA 329
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039772322  648 EREYKRRtstpvilegaqeetdtrdgkkQPERSETNVEETQKQKSTLKNEKYQKKDDPETHGKGLPKKEAKSAKERPEKE 727
Cdd:TIGR02169  330 EIDKLLA---------------------EIEELEREIEEERKRRDKLTEEYAELKEELEDLRAELEEVDKEFAETRDELK 388
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039772322  728 KAQSE-DKPSSKHKHKGDSVHKMSDETELHSSEKGETEESV-RKQGQQTKLSSDDRTERKSKHKSERRLSVLGRDGKPVS 805
Cdd:TIGR02169  389 DYREKlEKLKREINELKRELDRLQEELQRLSEELADLNAAIaGIEAKINELEEEKEDKALEIKKQEWKLEQLAADLSKYE 468
                          330       340       350
                   ....*....|....*....|....*....|....*..
gi 1039772322  806 EYTIKTDEHARKDNKKEKHLSSEKSKAEHKSRRSSDS 842
Cdd:TIGR02169  469 QELYDLKEEYDRVEKELSKLQRELAEAEAQARASEER 505
PspC_subgroup_1 NF033838
pneumococcal surface protein PspC, choline-binding form; The pneumococcal surface protein PspC, ...
678-1031 1.07e-03

pneumococcal surface protein PspC, choline-binding form; The pneumococcal surface protein PspC, as described in Streptococcus pneumoniae, is a repetitive and highly variable protein, recognized by a conserved N-terminal domain and also by genomic location. This form, subgroup 1, has variable numbers of a choline-binding repeat in the C-terminal region, and is also known as choline-binding protein A. The other form, subgroup 2, is anchored covalently after cleavage by sortase at a C-terminal LPXTG site.


Pssm-ID: 468201 [Multi-domain]  Cd Length: 684  Bit Score: 44.62  E-value: 1.07e-03
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039772322  678 ERSETNVEETQKQKSTLKNEKYqKKDDPEthgkglPKKEAKSAKERPEKEKAQSEDKPSSKHKHKGDSVHKmSDETELHS 757
Cdd:NF033838   109 EKSEAELTSKTKKELDAAFEQF-KKDTLE------PGKKVAEATKKVEEAEKKAKDQKEEDRRNYPTNTYK-TLELEIAE 180
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039772322  758 SEKGETEESVRKQGQQTKLSSDDRTERKSKHKSERRLSVLGRDGKpvseytIKTDEHARKDNKKEKHLSSEKSKAEHKSR 837
Cdd:NF033838   181 SDVEVKKAELELVKEEAKEPRDEEKIKQAKAKVESKKAEATRLEK------IKTDREKAEEEAKRRADAKLKEAVEKNVA 254
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039772322  838 RSSDSKLQKDAlsskqhsvtsqKRS---ESCSEDKCETDSTNADSSFKPEELPhkerrrTKSLLEDKvvskskskgqskq 914
Cdd:NF033838   255 TSEQDKPKRRA-----------KRGvlgEPATPDKKENDAKSSDSSVGEETLP------SPSLKPEK------------- 304
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039772322  915 tKAAETEAQegvtrqvttpKPDKEKNTEDNdterqrkfKLEDRTSEETVTDPALENTVSSAHSAQKDsghrAKLASIKEK 994
Cdd:NF033838   305 -KVAEAEKK----------VEEAKKKAKDQ--------KEEDRRNYPTNTYKTLELEIAESDVKVKE----AELELVKEE 361
                          330       340       350       360
                   ....*....|....*....|....*....|....*....|..
gi 1039772322  995 HKTDKD-----STSSKLERKVSDGHRSRSLKhsnKDMKKKEE 1031
Cdd:NF033838   362 AKEPRNeekikQAKAKVESKKAEATRLEKIK---TDRKKAEE 400
PTZ00108 PTZ00108
DNA topoisomerase 2-like protein; Provisional
494-780 1.40e-03

DNA topoisomerase 2-like protein; Provisional


Pssm-ID: 240271 [Multi-domain]  Cd Length: 1388  Bit Score: 44.27  E-value: 1.40e-03
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039772322  494 SIAKEKEERLLrrrinrEKLEEKRKQkAEKTKSSKVKSQGKStvDLEDSSAKTLEPKAPRIKEVLKERKVLEKKVA-LSK 572
Cdd:PTZ00108  1098 SLTKEKVEKLN------AELEKKEKE-LEKLKNTTPKDMWLE--DLDKFEEALEEQEEVEEKEIAKEQRLKSKTKGkASK 1168
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039772322  573 RRRKDSRNVDENSKKKPQAEEESKEALKTTEYCEKEKASSKDLRHTHGKGEPsrparrlSESLHSADENKTESKVEREYK 652
Cdd:PTZ00108  1169 LRKPKLKKKEKKKKKSSADKSKKASVVGNSKRVDSDEKRKLDDKPDNKKSNS-------SGSDQEDDEEQKTKPKKSSVK 1241
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039772322  653 RRTSTPVILEGAQEETDTRDGKKQPERSETNVEETQKQKSTLKNEKYQKKDDPETHGKGLPKKEAKSAKERPEKEKAQSE 732
Cdd:PTZ00108  1242 RLKSKKNNSSKSSEDNDEFSSDDLSKEGKPKNAPKRVSAVQYSPPPPSKRPDGESNGGSKPSSPTKKKVKKRLEGSLAAL 1321
                          250       260       270       280
                   ....*....|....*....|....*....|....*....|....*...
gi 1039772322  733 DKPSSKHKHKGDSVhKMSDETELHSSEKGETEESVRKQGQQTKLSSDD 780
Cdd:PTZ00108  1322 KKKKKSEKKTARKK-KSKTRVKQASASQSSRLLRRPRKKKSDSSSEDD 1368
SMC_N pfam02463
RecF/RecN/SMC N terminal domain; This domain is found at the N terminus of SMC proteins. The ...
425-1169 2.07e-03

RecF/RecN/SMC N terminal domain; This domain is found at the N terminus of SMC proteins. The SMC (structural maintenance of chromosomes) superfamily proteins have ATP-binding domains at the N- and C-termini, and two extended coiled-coil domains separated by a hinge in the middle. The eukaryotic SMC proteins form two kind of heterodimers: the SMC1/SMC3 and the SMC2/SMC4 types. These heterodimers constitute an essential part of higher order complexes, which are involved in chromatin and DNA dynamics. This family also includes the RecF and RecN proteins that are involved in DNA metabolism and recombination.


Pssm-ID: 426784 [Multi-domain]  Cd Length: 1161  Bit Score: 43.81  E-value: 2.07e-03
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039772322  425 VVVSESMEEGEITSEDEEKNKQNKAKVQpgdssdGKARGVRHAYVhkpylysKYYSDSDDELTVEQRRQSIAKEKEERLL 504
Cdd:pfam02463  162 AAGSRLKRKKKEALKKLIEETENLAELI------IDLEELKLQEL-------KLKEQAKKALEYYQLKEKLELEEEYLLY 228
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039772322  505 RRRINREKLEEKRKQKAEKTKSSKVKSQGKSTVDLEDSSAKTLE--PKAPRIKEVLKERKVLEKKVALSKRRRKDSRNVD 582
Cdd:pfam02463  229 LDYLKLNEERIDLLQELLRDEQEEIESSKQEIEKEEEKLAQVLKenKEEEKEKKLQEEELKLLAKEEEELKSELLKLERR 308
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039772322  583 ENSKKKPQAEEESKEALKTTEYcEKEKASSKDLRHTHGKGEPSRPARRLSESLHSADENKTESK---VEREYKRRTSTPV 659
Cdd:pfam02463  309 KVDDEEKLKESEKEKKKAEKEL-KKEKEEIEELEKELKELEIKREAEEEEEEELEKLQEKLEQLeeeLLAKKKLESERLS 387
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039772322  660 ILEGAQEETDTRDGKKQPERSEtNVEETQKQKSTLKNEKYQKKDDPEThgkglpKKEAKSAKERPEKEKAQSEDKPSSKH 739
Cdd:pfam02463  388 SAAKLKEEELELKSEEEKEAQL-LLELARQLEDLLKEEKKEELEILEE------EEESIELKQGKLTEEKEELEKQELKL 460
                          330       340       350       360       370       380       390       400
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039772322  740 KHKGDSVHKMSDETELHSSEKGETEESVRKQGQQTKLSSDDRTERKSKHKSERRLSVLGRDGKPVSEYTIKTDEHARKDN 819
Cdd:pfam02463  461 LKDELELKKSEDLLKETQLVKLQEQLELLLSRQKLEERSQKESKARSGLKVLLALIKDGVGGRIISAHGRLGDLGVAVEN 540
                          410       420       430       440       450       460       470       480
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039772322  820 KKEKHLSSEkskaehkSRRSSDSKLQKDALSSKQHSVTSQKRSESCSEDKCETDSTNADSSFKPEELP----HKERRRTK 895
Cdd:pfam02463  541 YKVAISTAV-------IVEVSATADEVEERQKLVRALTELPLGARKLRLLIPKLKLPLKSIAVLEIDPilnlAQLDKATL 613
                          490       500       510       520       530       540       550       560
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039772322  896 SLLEDKVVSKSKSKGQSKQTKAAETEAQEGVTRQVTTPKPDKEKNTEDNDTERQRKFKLEDRTSEETVTDPALENTVSSA 975
Cdd:pfam02463  614 EADEDDKRAKVVEGILKDTELTKLKESAKAKESGLRKGVSLEEGLAEKSEVKASLSELTKELLEIQELQEKAESELAKEE 693
                          570       580       590       600       610       620       630       640
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039772322  976 HSAQKDSGHRAKLASIKEKHKTDKDSTSSKLERKVSDGHRS----RSLKHSNKDMKKKEENKPDDKNGKEVDSSHEKGRG 1051
Cdd:pfam02463  694 ILRRQLEIKKKEQREKEELKKLKLEAEELLADRVQEAQDKIneelKLLKQKIDEEEEEEEKSRLKKEEKEEEKSELSLKE 773
                          650       660       670       680       690       700       710       720
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039772322 1052 NGPVTEKKLSRRLcENRRGSTSQEMAKEDKLVANMSgtTSSSSLQRPKKSTETTSIPEQ-EPMEIDSEAAVENVSELSKT 1130
Cdd:pfam02463  774 KELAEEREKTEKL-KVEEEKEEKLKAQEEELRALEE--ELKEEAELLEEEQLLIEQEEKiKEEELEELALELKEEQKLEK 850
                          730       740       750
                   ....*....|....*....|....*....|....*....
gi 1039772322 1131 EDISSNSSQQDTDFENVTKHKATAGVLKDEFRTSMVDSK 1169
Cdd:pfam02463  851 LAEEELERLEEEITKEELLQELLLKEEELEEQKLKDELE 889
COG5022 COG5022
Myosin heavy chain [General function prediction only];
491-733 2.64e-03

Myosin heavy chain [General function prediction only];


Pssm-ID: 227355 [Multi-domain]  Cd Length: 1463  Bit Score: 43.53  E-value: 2.64e-03
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039772322  491 RRQSIAKEKEERLLRRRINREKLEEKRKQKAEKTKSSKVKS-QGKSTVDLEDSSAKTLEPKApriKEVLKERKVLEKKVA 569
Cdd:COG5022    814 SYLACIIKLQKTIKREKKLRETEEVEFSLKAEVLIQKFGRSlKAKKRFSLLKKETIYLQSAQ---RVELAERQLQELKID 890
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039772322  570 LSKRR-------RKDSRNVDENS--KKKPQAEEESKEALKTTEYCEKEKASSKDlrhthgkgEPSRPARRLSE--SLHSA 638
Cdd:COG5022    891 VKSISslklvnlELESEIIELKKslSSDLIENLEFKTELIARLKKLLNNIDLEE--------GPSIEYVKLPElnKLHEV 962
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039772322  639 DENKTESKVERE--YKRRTSTPVILEGAQEE--------TDTRDGKKQPERSETNVEETQKQKSTLKNEkyQKKDDPETH 708
Cdd:COG5022    963 ESKLKETSEEYEdlLKKSTILVREGNKANSElknfkkelAELSKQYGALQESTKQLKELPVEVAELQSA--SKIISSEST 1040
                          250       260
                   ....*....|....*....|....*
gi 1039772322  709 GKGLPKKEAKSAKERPEKEKAQSED 733
Cdd:COG5022   1041 ELSILKPLQKLKGLLLLENNQLQAR 1065
PTZ00108 PTZ00108
DNA topoisomerase 2-like protein; Provisional
678-966 4.34e-03

DNA topoisomerase 2-like protein; Provisional


Pssm-ID: 240271 [Multi-domain]  Cd Length: 1388  Bit Score: 42.73  E-value: 4.34e-03
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039772322  678 ERSETNVEETQKQKSTLKNEKYQK--KDDPETHGKGLPKKEAKSAKERPEKEKAQSedKPSSKHKHKGDSVHKMSDETEL 755
Cdd:PTZ00108  1105 EKLNAELEKKEKELEKLKNTTPKDmwLEDLDKFEEALEEQEEVEEKEIAKEQRLKS--KTKGKASKLRKPKLKKKEKKKK 1182
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039772322  756 HSSEKGETEESVRKQgqqtklSSDDRTERKSKHKSERRLSVLGRDGKPVSEYTIKTDEHARKDNKKEKHLSSEKSKAEHK 835
Cdd:PTZ00108  1183 KSSADKSKKASVVGN------SKRVDSDEKRKLDDKPDNKKSNSSGSDQEDDEEQKTKPKKSSVKRLKSKKNNSSKSSED 1256
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039772322  836 SRRSSDSKLQKDALSSKQHSVTSQKRSESCSEDKCETDSTNADSSFKPEELPHKERRRTKSLLedKVVSKSKSKGQSKQT 915
Cdd:PTZ00108  1257 NDEFSSDDLSKEGKPKNAPKRVSAVQYSPPPPSKRPDGESNGGSKPSSPTKKKVKKRLEGSLA--ALKKKKKSEKKTARK 1334
                          250       260       270       280       290
                   ....*....|....*....|....*....|....*....|....*....|.
gi 1039772322  916 KAAETEAQEGVTRQVTTPKPdkekntedndteRQRKFKLEDRTSEETVTDP 966
Cdd:PTZ00108  1335 KKSKTRVKQASASQSSRLLR------------RPRKKKSDSSSEDDDDSEV 1373
PTZ00108 PTZ00108
DNA topoisomerase 2-like protein; Provisional
774-1048 9.81e-03

DNA topoisomerase 2-like protein; Provisional


Pssm-ID: 240271 [Multi-domain]  Cd Length: 1388  Bit Score: 41.57  E-value: 9.81e-03
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039772322  774 TKLSSDDRTERKSKHKsERRLSVLGRDGKPVSEYTIKTDEhARKDNKKEKHLSSEKSKAEHKSRRSSDSKLQKDALSSKq 853
Cdd:PTZ00108  1139 EALEEQEEVEEKEIAK-EQRLKSKTKGKASKLRKPKLKKK-EKKKKKSSADKSKKASVVGNSKRVDSDEKRKLDDKPDN- 1215
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039772322  854 hsvTSQKRSESCSEDKCETDSTNADSSFKPEELPHKErrrTKSLLEDKVVSKSKSKGQSKQTKAAETEAQegvTRQVTTP 933
Cdd:PTZ00108  1216 ---KKSNSSGSDQEDDEEQKTKPKKSSVKRLKSKKNN---SSKSSEDNDEFSSDDLSKEGKPKNAPKRVS---AVQYSPP 1286
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039772322  934 KPDKEkntedndtERQRKFKLEDRTSEETVTDPAlentvssahsaQKDSGHRAKLASIKEKHKTDKDSTSSKLERKVSDG 1013
Cdd:PTZ00108  1287 PPSKR--------PDGESNGGSKPSSPTKKKVKK-----------RLEGSLAALKKKKKSEKKTARKKKSKTRVKQASAS 1347
                          250       260       270
                   ....*....|....*....|....*....|....*
gi 1039772322 1014 HRSRSLKHSNKDMKKKEEnkpDDKNGKEVDSSHEK 1048
Cdd:PTZ00108  1348 QSSRLLRRPRKKKSDSSS---EDDDDSEVDDSEDE 1379
 
Blast search parameters
Data Source: Precalculated data, version = cdd.v.3.21
Preset Options:Database: CDSEARCH/cdd   Low complexity filter: no  Composition Based Adjustment: yes   E-value threshold: 0.01

References:

  • Wang J et al. (2023), "The conserved domain database in 2023", Nucleic Acids Res.51(D)384-8.
  • Lu S et al. (2020), "The conserved domain database in 2020", Nucleic Acids Res.48(D)265-8.
  • Marchler-Bauer A et al. (2017), "CDD/SPARCLE: functional classification of proteins via subfamily domain architectures.", Nucleic Acids Res.45(D)200-3.
Help | Disclaimer | Write to the Help Desk
NCBI | NLM | NIH