NCBI Home Page NCBI Site Search page NCBI Guide that lists and describes the NCBI resources
Conserved domains on  [gi|1720384125|ref|XP_030104393|]
View 

caspase recruitment domain-containing protein 6 isoform X2 [Mus musculus]

Protein Classification

CARD domain-containing protein( domain architecture ID 10109095)

CARD domain-containing protein

Graphical summary

 Zoom to residue level

show extra options »

Show site features     Horizontal zoom: ×

List of domain hits

Name Accession Description Interval E-value
PHA03247 super family cl33720
large tegument protein UL36; Provisional
789-1166 5.21e-13

large tegument protein UL36; Provisional


The actual alignment was detected with superfamily member PHA03247:

Pssm-ID: 223021 [Multi-domain]  Cd Length: 3151  Bit Score: 74.20  E-value: 5.21e-13
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720384125  789 SATPQQYHPNGP--FGRSQRQASPVQTHPKSRQmSRTLERSGTVVSRVGHGRSLGSQARRAAGKPQPEKAcaqglqLTKA 866
Cdd:PHA03247  2615 SPLPPDTHAPDPppPSPSPAANEPDPHPPPTVP-PPERPRDDPAPGRVSRPRRARRLGRAAQASSPPQRP------RRRA 2687
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720384125  867 AGKSIRTLPHIKYPHPQPCQPAGASQERI----MPVSHQGAQQTTQGRPADFA--------FKPGSQSTSGSKLSSTSQS 934
Cdd:PHA03247  2688 ARPTVGSLTSLADPPPPPPTPEPAPHALVsatpLPPGPAAARQASPALPAAPAppavpagpATPGGPARPARPPTTAGPP 2767
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720384125  935 SAHQPKFQSKhfQPQPFQPVPSQKKPSHSRPSQAKPPHLDPSHANLTQGQPSQATPTHSQASQAKPTHSQANSHHPHPSH 1014
Cdd:PHA03247  2768 APAPPAAPAA--GPPRRLTRPAVASLSESRESLPSPWDPADPPAAVLAPAAALPPAASPAGPLPPPTSAQPTAPPPPPGP 2845
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720384125 1015 AKPSHQNPSHANPTHPQSSHAKPSHPQSSHAKPSHPQSSH-AKPSHPQSSHAKPsHPQSSQAKPSHPQSSQAKPTHPQSS 1093
Cdd:PHA03247  2846 PPPSLPLGGSVAPGGDVRRRPPSRSPAAKPAAPARPPVRRlARPAVSRSTESFA-LPPDQPERPPQPQAPPPPQPQPQPP 2924
                          330       340       350       360       370       380       390
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....
gi 1720384125 1094 QANSHHPQ-ASQAKPSHPQSSHAKPS-HPHPSHAKPSPSQST----QCKAHKAHQSQPKPFQPRPTQPKSSKTKPSQAR 1166
Cdd:PHA03247  2925 PPPQPQPPpPPPPRPQPPLAPTTDPAgAGEPSGAVPQPWLGAlvpgRVAVPRFRVPQPAPSREAPASSTPPLTGHSLSR 3003
CARD cd01671
Caspase activation and recruitment domain: a protein-protein interaction domain; Caspase ...
11-79 1.94e-11

Caspase activation and recruitment domain: a protein-protein interaction domain; Caspase activation and recruitment domains (CARDs) are death domains (DDs) found associated with caspases. Caspases are aspartate-specific cysteine proteases with functions in apoptosis, immune signaling, inflammation, and host-defense mechanisms. In addition to caspases, proteins containing CARDs include adaptor proteins such as RAIDD, CARD9, and RIG-I-like helicases, which can form multiprotein complexes and play important roles in mediating the signals to induce immune and inflammatory responses. In general, DDs are protein-protein interaction domains found in a variety of domain architectures. Their common feature is that they form homodimers by self-association or heterodimers by associating with other members of the DD superfamily including PYRIN and DED (Death Effector Domain). They serve as adaptors in signaling pathways and can recruit other proteins into signaling complexes.


:

Pssm-ID: 260018 [Multi-domain]  Cd Length: 79  Bit Score: 60.99  E-value: 1.94e-11
                           10        20        30        40        50        60
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....
gi 1720384125   11 IEKKRTKLLSVLqqDPDSILDTLTSRSLISEKEYETLEEITDPLKKSRKLLILIQKKGEDSCRRFLRCL 79
Cdd:cd01671      1 LRKNRVELVEDL--DVEDILDHLIQKGVLTEEDKEEILSEKTRQDKARKLLDILPRRGPKAFEVFCEAL 67
 
Name Accession Description Interval E-value
PHA03247 PHA03247
large tegument protein UL36; Provisional
789-1166 5.21e-13

large tegument protein UL36; Provisional


Pssm-ID: 223021 [Multi-domain]  Cd Length: 3151  Bit Score: 74.20  E-value: 5.21e-13
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720384125  789 SATPQQYHPNGP--FGRSQRQASPVQTHPKSRQmSRTLERSGTVVSRVGHGRSLGSQARRAAGKPQPEKAcaqglqLTKA 866
Cdd:PHA03247  2615 SPLPPDTHAPDPppPSPSPAANEPDPHPPPTVP-PPERPRDDPAPGRVSRPRRARRLGRAAQASSPPQRP------RRRA 2687
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720384125  867 AGKSIRTLPHIKYPHPQPCQPAGASQERI----MPVSHQGAQQTTQGRPADFA--------FKPGSQSTSGSKLSSTSQS 934
Cdd:PHA03247  2688 ARPTVGSLTSLADPPPPPPTPEPAPHALVsatpLPPGPAAARQASPALPAAPAppavpagpATPGGPARPARPPTTAGPP 2767
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720384125  935 SAHQPKFQSKhfQPQPFQPVPSQKKPSHSRPSQAKPPHLDPSHANLTQGQPSQATPTHSQASQAKPTHSQANSHHPHPSH 1014
Cdd:PHA03247  2768 APAPPAAPAA--GPPRRLTRPAVASLSESRESLPSPWDPADPPAAVLAPAAALPPAASPAGPLPPPTSAQPTAPPPPPGP 2845
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720384125 1015 AKPSHQNPSHANPTHPQSSHAKPSHPQSSHAKPSHPQSSH-AKPSHPQSSHAKPsHPQSSQAKPSHPQSSQAKPTHPQSS 1093
Cdd:PHA03247  2846 PPPSLPLGGSVAPGGDVRRRPPSRSPAAKPAAPARPPVRRlARPAVSRSTESFA-LPPDQPERPPQPQAPPPPQPQPQPP 2924
                          330       340       350       360       370       380       390
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....
gi 1720384125 1094 QANSHHPQ-ASQAKPSHPQSSHAKPS-HPHPSHAKPSPSQST----QCKAHKAHQSQPKPFQPRPTQPKSSKTKPSQAR 1166
Cdd:PHA03247  2925 PPPQPQPPpPPPPRPQPPLAPTTDPAgAGEPSGAVPQPWLGAlvpgRVAVPRFRVPQPAPSREAPASSTPPLTGHSLSR 3003
Atrophin-1 pfam03154
Atrophin-1 family; Atrophin-1 is the protein product of the dentatorubral-pallidoluysian ...
936-1170 1.46e-11

Atrophin-1 family; Atrophin-1 is the protein product of the dentatorubral-pallidoluysian atrophy (DRPLA) gene. DRPLA OMIM:125370 is a progressive neurodegenerative disorder. It is caused by the expansion of a CAG repeat in the DRPLA gene on chromosome 12p. This results in an extended polyglutamine region in atrophin-1, that is thought to confer toxicity to the protein, possibly through altering its interactions with other proteins. The expansion of a CAG repeat is also the underlying defect in six other neurodegenerative disorders, including Huntington's disease. One interaction of expanded polyglutamine repeats that is thought to be pathogenic is that with the short glutamine repeat in the transcriptional coactivator CREB binding protein, CBP. This interaction draws CBP away from its usual nuclear location to the expanded polyglutamine repeat protein aggregates that are characteriztic of the polyglutamine neurodegenerative disorders. This interferes with CBP-mediated transcription and causes cytotoxicity.


Pssm-ID: 460830 [Multi-domain]  Cd Length: 991  Bit Score: 69.03  E-value: 1.46e-11
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720384125  936 AHQPKFQSKHFQPQPFQPVPSQKKPSHSRPSQAKPPHLDPSHANLT-QGQPSQATPTHSQASQAKPTHSQANSHHPHPSH 1014
Cdd:pfam03154  162 AQQQILQTQPPVLQAQSGAASPPSPPPPGTTQAATAGPTPSAPSVPpQGSPATSQPPNQTQSTAAPHTLIQQTPTLHPQR 241
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720384125 1015 AKPSHQNPSHANPTHPQSSHAKPSHPQSSH--AKPSHPQSSHAKPSHPQsshakpsHPQSSQAKPSHPQSSQAK------ 1086
Cdd:pfam03154  242 LPSPHPPLQPMTQPPPPSQVSPQPLPQPSLhgQMPPMPHSLQTGPSHMQ-------HPVPPQPFPLTPQSSQSQvppgps 314
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720384125 1087 PTHPQSSQANSHHPqASQAKPSHPQSSHAKPSHPHP---SHAKPSPSQ------STQCKAHKAHQSQPKPFQPRPTQPKS 1157
Cdd:pfam03154  315 PAAPGQSQQRIHTP-PSQSQLQSQQPPREQPLPPAPlsmPHIKPPPTTpipqlpNPQSHKHPPHLSGPSPFQMNSNLPPP 393
                          250
                   ....*....|....
gi 1720384125 1158 SKTKP-SQARAFHP 1170
Cdd:pfam03154  394 PALKPlSSLSTHHP 407
CARD cd01671
Caspase activation and recruitment domain: a protein-protein interaction domain; Caspase ...
11-79 1.94e-11

Caspase activation and recruitment domain: a protein-protein interaction domain; Caspase activation and recruitment domains (CARDs) are death domains (DDs) found associated with caspases. Caspases are aspartate-specific cysteine proteases with functions in apoptosis, immune signaling, inflammation, and host-defense mechanisms. In addition to caspases, proteins containing CARDs include adaptor proteins such as RAIDD, CARD9, and RIG-I-like helicases, which can form multiprotein complexes and play important roles in mediating the signals to induce immune and inflammatory responses. In general, DDs are protein-protein interaction domains found in a variety of domain architectures. Their common feature is that they form homodimers by self-association or heterodimers by associating with other members of the DD superfamily including PYRIN and DED (Death Effector Domain). They serve as adaptors in signaling pathways and can recruit other proteins into signaling complexes.


Pssm-ID: 260018 [Multi-domain]  Cd Length: 79  Bit Score: 60.99  E-value: 1.94e-11
                           10        20        30        40        50        60
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....
gi 1720384125   11 IEKKRTKLLSVLqqDPDSILDTLTSRSLISEKEYETLEEITDPLKKSRKLLILIQKKGEDSCRRFLRCL 79
Cdd:cd01671      1 LRKNRVELVEDL--DVEDILDHLIQKGVLTEEDKEEILSEKTRQDKARKLLDILPRRGPKAFEVFCEAL 67
CARD pfam00619
Caspase recruitment domain; Motif contained in proteins involved in apoptotic signaling. ...
9-88 4.99e-10

Caspase recruitment domain; Motif contained in proteins involved in apoptotic signaling. Predicted to possess a DEATH (pfam00531) domain-like fold.


Pssm-ID: 459874 [Multi-domain]  Cd Length: 85  Bit Score: 57.18  E-value: 4.99e-10
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720384125    9 ELIEKKRTKLLSVLQqDPDSILDTLTSRSLISEKEYETLEEITDPLKKSRKLLILIQKKGEDSCRRFLRCLSNAFPESAS 88
Cdd:pfam00619    2 KLLKKNRVALVERLG-TLDGLLDYLLEKNVLTEEEEEKIKANPTRLDKARELLDLVLKKGPKACQIFLEALKEGDPDLAS 80
CARD smart00114
Caspase recruitment domain; Motif contained in proteins involved in apoptotic signalling. ...
8-82 1.18e-08

Caspase recruitment domain; Motif contained in proteins involved in apoptotic signalling. Mediates homodimerisation. Structure consists of six antiparallel helices arranged in a topology homologue to the DEATH and the DED domain.


Pssm-ID: 128424  Cd Length: 88  Bit Score: 53.50  E-value: 1.18e-08
                            10        20        30        40        50        60        70
                    ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*
gi 1720384125     8 SELIEKKRTKLLSVLQqdPDSILDTLTSRSLISEKEYETLEEITDPLKKSRKLLILIQKKGEDSCRRFLRCLSNA 82
Cdd:smart00114    6 KRLLRRNRVRLGEELG--VDGLLDYLVEKNVLTEKEIEAIKAATTKLRDKRELVDSLQKRGSQAFDTFLDSLQET 78
ROM1 COG5422
RhoGEF, Guanine nucleotide exchange factor for Rho/Rac/Cdc42-like GTPases [Signal transduction ...
939-1127 3.47e-04

RhoGEF, Guanine nucleotide exchange factor for Rho/Rac/Cdc42-like GTPases [Signal transduction mechanisms];


Pssm-ID: 227709 [Multi-domain]  Cd Length: 1175  Bit Score: 44.88  E-value: 3.47e-04
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720384125  939 PKFQSKHFQPQPFQPVPSQKKpSHSRPSQAKPPHLDPSHANLTQGQPSQATPTHSQASQAKPTHSQANSHHPHPS-HAKP 1017
Cdd:COG5422     80 PKLFQRRNSAGPITHSPSATS-STSSLNSNDGDQFSPASDSLSFNPSSTQSRKDSGPGDGSPVQKRKNPLLPSSStHGTH 158
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720384125 1018 SHQNPSHANPTHPQSSHAK-PSHPQSSHAKPSHPQSSHAKPSHPQSSHAKPSHPQSSQAKPSHPQSSQAKPTHPQSSQAN 1096
Cdd:COG5422    159 PPIVFTDNNGSHAGAPNARsRKEIPSLGSQSMQLPSPHFRQKFSSSDTSNGFSYPSIRKNSRHSSNSMPSFPHSSTAVLL 238
                          170       180       190
                   ....*....|....*....|....*....|....*.
gi 1720384125 1097 SHHPQASQAkpsHPQSSHAKPSHPH-----PSHAKP 1127
Cdd:COG5422    239 KRHSGSSGA---SLISSNITPSSSNseamsTSSKRP 271
PspC_subgroup_2 NF033839
pneumococcal surface protein PspC, LPXTG-anchored form; The pneumococcal surface protein PspC, ...
947-1167 4.46e-04

pneumococcal surface protein PspC, LPXTG-anchored form; The pneumococcal surface protein PspC, as described in Streptococcus pneumoniae, is a repetitive and highly variable protein, recognized by a conserved N-terminal domain and also by genomic location. This form, subgroup 2, is anchored covalently after cleavage by sortase at a C-terminal LPXTG site. The other form, subgroup 1, has variable numbers of a choline-binding repeat in the C-terminal region, and is also known as choline-binding protein A.


Pssm-ID: 468202 [Multi-domain]  Cd Length: 557  Bit Score: 44.37  E-value: 4.46e-04
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720384125  947 QPQPFQPVPSQKKPSHSRPSQAKPPHLDPShanlTQGQPSQATPTHSQASQAKPTHSQANshhPHPSHAKPSHQnpshan 1026
Cdd:NF033839   300 QPSPQPEKKEVKPEPETPKPEVKPQLEKPK----PEVKPQPEKPKPEVKPQLETPKPEVK---PQPEKPKPEVK------ 366
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720384125 1027 pthPQSSHAKPS-HPQSSHAKPS-HPQSSHAKPS-HPQSSHAKPS-HPQSSQAKPS-HPQSSQAKP-THPQSSQANSH-H 1099
Cdd:NF033839   367 ---PQPEKPKPEvKPQPETPKPEvKPQPEKPKPEvKPQPEKPKPEvKPQPEKPKPEvKPQPEKPKPeVKPQPEKPKPEvK 443
                          170       180       190       200       210       220
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....
gi 1720384125 1100 PQASQAKPSHPQSSHAKPSHPHPSHAKPSPSQSTQCKAHKAHQSQPKPFQPRPTQPKS-SKTKPSQARA 1167
Cdd:NF033839   444 PQPEKPKPEVKPQPETPKPEVKPQPEKPKPEVKPQPEKPKPDNSKPQADDKKPSTPNNlSKDKQPSNQA 512
KLF1_2_4_N-like cd22056
N-terminal domain of Kruppel-like factors with similarity to the N-terminal domains of ...
1011-1124 1.95e-03

N-terminal domain of Kruppel-like factors with similarity to the N-terminal domains of Kruppel-like factor (KLF)1, KLF2, and KLF4; Kruppel/Krueppel-like transcription factors (KLFs) belong to a family of proteins called the Specificity Protein (SP)/KLF family, characterized by a C-terminal DNA-binding domain of 81 amino acids consisting of three Kruppel-like C2H2 zinc fingers. These factors bind to a loose consensus motif, namely NNRCRCCYY (where N is any nucleotide; R is A/G, and Y is C/T), such as the recurring motifs in GC and GT boxes (5'-GGGGCGGGG-3' and 5-GGTGTGGGG-3') that are present in promoters and more distal regulatory elements of mammalian genes. Members of the KLF family can act as activators or repressors of transcription depending on cell and promoter context. KLFs regulate various cellular functions, such as proliferation, differentiation, and apoptosis, as well as the development and homeostasis of several types of tissue. In addition to the C-terminal DNA-binding domain, each KLF also has a unique N-terminal activation/repression domain that confers specifity and allows it to bind specifically to a certain partner, leading to distinct activities in vivo. This model represents the N-terminal domains of an unknown subfamily of KLFs, predominantly found in fish, related to the N-terminal domains of KLF1, KLF2, and KLF4.


Pssm-ID: 409231 [Multi-domain]  Cd Length: 339  Bit Score: 41.95  E-value: 1.95e-03
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720384125 1011 HPSHAKPSHQnpshANPTHPQSSHAKPSHPQSSHAKPSHPQSSHAkPSHPQSSHAKPSHPQSSQAKPSHPQSSQAKPTHP 1090
Cdd:cd22056    203 FMGQQKPKHQ----MHSVHPQAFTHHQAAGPGALQGRGGRGGPDC-HLLHSSHHHHHHHHLQYQYMNAPYPPHYAHQGAP 277
                           90       100       110
                   ....*....|....*....|....*....|....
gi 1720384125 1091 QSSQANSHHPQASQAKPSHPQSSHAKPShPHPSH 1124
Cdd:cd22056    278 QFHGQYSVFREPMRVHHQGHPGSMLTPP-SSPPL 310
 
Name Accession Description Interval E-value
PHA03247 PHA03247
large tegument protein UL36; Provisional
789-1166 5.21e-13

large tegument protein UL36; Provisional


Pssm-ID: 223021 [Multi-domain]  Cd Length: 3151  Bit Score: 74.20  E-value: 5.21e-13
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720384125  789 SATPQQYHPNGP--FGRSQRQASPVQTHPKSRQmSRTLERSGTVVSRVGHGRSLGSQARRAAGKPQPEKAcaqglqLTKA 866
Cdd:PHA03247  2615 SPLPPDTHAPDPppPSPSPAANEPDPHPPPTVP-PPERPRDDPAPGRVSRPRRARRLGRAAQASSPPQRP------RRRA 2687
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720384125  867 AGKSIRTLPHIKYPHPQPCQPAGASQERI----MPVSHQGAQQTTQGRPADFA--------FKPGSQSTSGSKLSSTSQS 934
Cdd:PHA03247  2688 ARPTVGSLTSLADPPPPPPTPEPAPHALVsatpLPPGPAAARQASPALPAAPAppavpagpATPGGPARPARPPTTAGPP 2767
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720384125  935 SAHQPKFQSKhfQPQPFQPVPSQKKPSHSRPSQAKPPHLDPSHANLTQGQPSQATPTHSQASQAKPTHSQANSHHPHPSH 1014
Cdd:PHA03247  2768 APAPPAAPAA--GPPRRLTRPAVASLSESRESLPSPWDPADPPAAVLAPAAALPPAASPAGPLPPPTSAQPTAPPPPPGP 2845
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720384125 1015 AKPSHQNPSHANPTHPQSSHAKPSHPQSSHAKPSHPQSSH-AKPSHPQSSHAKPsHPQSSQAKPSHPQSSQAKPTHPQSS 1093
Cdd:PHA03247  2846 PPPSLPLGGSVAPGGDVRRRPPSRSPAAKPAAPARPPVRRlARPAVSRSTESFA-LPPDQPERPPQPQAPPPPQPQPQPP 2924
                          330       340       350       360       370       380       390
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....
gi 1720384125 1094 QANSHHPQ-ASQAKPSHPQSSHAKPS-HPHPSHAKPSPSQST----QCKAHKAHQSQPKPFQPRPTQPKSSKTKPSQAR 1166
Cdd:PHA03247  2925 PPPQPQPPpPPPPRPQPPLAPTTDPAgAGEPSGAVPQPWLGAlvpgRVAVPRFRVPQPAPSREAPASSTPPLTGHSLSR 3003
Atrophin-1 pfam03154
Atrophin-1 family; Atrophin-1 is the protein product of the dentatorubral-pallidoluysian ...
936-1170 1.46e-11

Atrophin-1 family; Atrophin-1 is the protein product of the dentatorubral-pallidoluysian atrophy (DRPLA) gene. DRPLA OMIM:125370 is a progressive neurodegenerative disorder. It is caused by the expansion of a CAG repeat in the DRPLA gene on chromosome 12p. This results in an extended polyglutamine region in atrophin-1, that is thought to confer toxicity to the protein, possibly through altering its interactions with other proteins. The expansion of a CAG repeat is also the underlying defect in six other neurodegenerative disorders, including Huntington's disease. One interaction of expanded polyglutamine repeats that is thought to be pathogenic is that with the short glutamine repeat in the transcriptional coactivator CREB binding protein, CBP. This interaction draws CBP away from its usual nuclear location to the expanded polyglutamine repeat protein aggregates that are characteriztic of the polyglutamine neurodegenerative disorders. This interferes with CBP-mediated transcription and causes cytotoxicity.


Pssm-ID: 460830 [Multi-domain]  Cd Length: 991  Bit Score: 69.03  E-value: 1.46e-11
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720384125  936 AHQPKFQSKHFQPQPFQPVPSQKKPSHSRPSQAKPPHLDPSHANLT-QGQPSQATPTHSQASQAKPTHSQANSHHPHPSH 1014
Cdd:pfam03154  162 AQQQILQTQPPVLQAQSGAASPPSPPPPGTTQAATAGPTPSAPSVPpQGSPATSQPPNQTQSTAAPHTLIQQTPTLHPQR 241
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720384125 1015 AKPSHQNPSHANPTHPQSSHAKPSHPQSSH--AKPSHPQSSHAKPSHPQsshakpsHPQSSQAKPSHPQSSQAK------ 1086
Cdd:pfam03154  242 LPSPHPPLQPMTQPPPPSQVSPQPLPQPSLhgQMPPMPHSLQTGPSHMQ-------HPVPPQPFPLTPQSSQSQvppgps 314
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720384125 1087 PTHPQSSQANSHHPqASQAKPSHPQSSHAKPSHPHP---SHAKPSPSQ------STQCKAHKAHQSQPKPFQPRPTQPKS 1157
Cdd:pfam03154  315 PAAPGQSQQRIHTP-PSQSQLQSQQPPREQPLPPAPlsmPHIKPPPTTpipqlpNPQSHKHPPHLSGPSPFQMNSNLPPP 393
                          250
                   ....*....|....
gi 1720384125 1158 SKTKP-SQARAFHP 1170
Cdd:pfam03154  394 PALKPlSSLSTHHP 407
CARD cd01671
Caspase activation and recruitment domain: a protein-protein interaction domain; Caspase ...
11-79 1.94e-11

Caspase activation and recruitment domain: a protein-protein interaction domain; Caspase activation and recruitment domains (CARDs) are death domains (DDs) found associated with caspases. Caspases are aspartate-specific cysteine proteases with functions in apoptosis, immune signaling, inflammation, and host-defense mechanisms. In addition to caspases, proteins containing CARDs include adaptor proteins such as RAIDD, CARD9, and RIG-I-like helicases, which can form multiprotein complexes and play important roles in mediating the signals to induce immune and inflammatory responses. In general, DDs are protein-protein interaction domains found in a variety of domain architectures. Their common feature is that they form homodimers by self-association or heterodimers by associating with other members of the DD superfamily including PYRIN and DED (Death Effector Domain). They serve as adaptors in signaling pathways and can recruit other proteins into signaling complexes.


Pssm-ID: 260018 [Multi-domain]  Cd Length: 79  Bit Score: 60.99  E-value: 1.94e-11
                           10        20        30        40        50        60
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....
gi 1720384125   11 IEKKRTKLLSVLqqDPDSILDTLTSRSLISEKEYETLEEITDPLKKSRKLLILIQKKGEDSCRRFLRCL 79
Cdd:cd01671      1 LRKNRVELVEDL--DVEDILDHLIQKGVLTEEDKEEILSEKTRQDKARKLLDILPRRGPKAFEVFCEAL 67
CARD pfam00619
Caspase recruitment domain; Motif contained in proteins involved in apoptotic signaling. ...
9-88 4.99e-10

Caspase recruitment domain; Motif contained in proteins involved in apoptotic signaling. Predicted to possess a DEATH (pfam00531) domain-like fold.


Pssm-ID: 459874 [Multi-domain]  Cd Length: 85  Bit Score: 57.18  E-value: 4.99e-10
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720384125    9 ELIEKKRTKLLSVLQqDPDSILDTLTSRSLISEKEYETLEEITDPLKKSRKLLILIQKKGEDSCRRFLRCLSNAFPESAS 88
Cdd:pfam00619    2 KLLKKNRVALVERLG-TLDGLLDYLLEKNVLTEEEEEKIKANPTRLDKARELLDLVLKKGPKACQIFLEALKEGDPDLAS 80
PHA03247 PHA03247
large tegument protein UL36; Provisional
838-1170 8.41e-10

large tegument protein UL36; Provisional


Pssm-ID: 223021 [Multi-domain]  Cd Length: 3151  Bit Score: 63.42  E-value: 8.41e-10
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720384125  838 RSLGSQARRAAGKPQPEKACAQglqLTKAAGKSIRTlphiKYPHPQPCQPAGASQERIMPVSHQGAQQTTQGRPADfafk 917
Cdd:PHA03247  2538 RGLEELASDDAGDPPPPLPPAA---PPAAPDRSVPP----PRPAPRPSEPAVTSRARRPDAPPQSARPRAPVDDRG---- 2606
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720384125  918 pgsqSTSGSKLSSTSQSSAHQPKfqskhfqPQPFQPVPSQKKPSHSRPSQAKPPHLDPSHAnltqgQPSQATPTHSQASQ 997
Cdd:PHA03247  2607 ----DPRGPAPPSPLPPDTHAPD-------PPPPSPSPAANEPDPHPPPTVPPPERPRDDP-----APGRVSRPRRARRL 2670
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720384125  998 AKPTHSQANSHHPHPSHAKPshqnpshanPTHPQSSHAKPSHPQSSHAKPSHPQSShAKPSHPQSSHAKPSHPQS----- 1072
Cdd:PHA03247  2671 GRAAQASSPPQRPRRRAARP---------TVGSLTSLADPPPPPPTPEPAPHALVS-ATPLPPGPAAARQASPALpaapa 2740
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720384125 1073 ------------SQAKPSHPQSSqAKPTHPQSSQANSHHPQASQAKPSHPQSSHAKPSHPHPSHAKPSPSQSTQCKAHKA 1140
Cdd:PHA03247  2741 ppavpagpatpgGPARPARPPTT-AGPPAPAPPAAPAAGPPRRLTRPAVASLSESRESLPSPWDPADPPAAVLAPAAALP 2819
                          330       340       350
                   ....*....|....*....|....*....|
gi 1720384125 1141 HQSQPKPFQPRPTQPksSKTKPSQARAFHP 1170
Cdd:PHA03247  2820 PAASPAGPLPPPTSA--QPTAPPPPPGPPP 2847
PAT1 pfam09770
Topoisomerase II-associated protein PAT1; Members of this family are necessary for accurate ...
938-1164 2.24e-09

Topoisomerase II-associated protein PAT1; Members of this family are necessary for accurate chromosome transmission during cell division.


Pssm-ID: 401645 [Multi-domain]  Cd Length: 846  Bit Score: 61.98  E-value: 2.24e-09
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720384125  938 QPKFQSKHFQPQPFQPVPSQKKPSHSRPSQAKPP-----------------HLDPSHANLTQGQPSQATPTHSQASQAKP 1000
Cdd:pfam09770  106 QPAARAAQSSAQPPASSLPQYQYASQQSQQPSKPvrtgyekykepepipdlQVDASLWGVAPKKAAAPAPAPQPAAQPAS 185
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720384125 1001 THSQ---------------ANSHHPHPSHAKPSHQNPSHANPTHPQSSHAKPSHPQsshakpsHPQSSHAKPSHPQSsHA 1065
Cdd:pfam09770  186 LPAPsrkmmsleeveaamrAQAKKPAQQPAPAPAQPPAAPPAQQAQQQQQFPPQIQ-------QQQQPQQQPQQPQQ-HP 257
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720384125 1066 KPSHPQSSQakpSHPQSSQAKPTHPQSSQANSHHPQasQAKPSHPQSSHAKPSHPHPSHAKPSPSQSTQCKAHKAHQSQP 1145
Cdd:pfam09770  258 GQGHPVTIL---QRPQSPQPDPAQPSIQPQAQQFHQ--QPPPVPVQPTQILQNPNRLSAARVGYPQNPQPGVQPAPAHQA 332
                          250
                   ....*....|....*....
gi 1720384125 1146 KPFQPRPTQPKSSKTKPSQ 1164
Cdd:pfam09770  333 HRQQGSFGRQAPIITHPQQ 351
CARD smart00114
Caspase recruitment domain; Motif contained in proteins involved in apoptotic signalling. ...
8-82 1.18e-08

Caspase recruitment domain; Motif contained in proteins involved in apoptotic signalling. Mediates homodimerisation. Structure consists of six antiparallel helices arranged in a topology homologue to the DEATH and the DED domain.


Pssm-ID: 128424  Cd Length: 88  Bit Score: 53.50  E-value: 1.18e-08
                            10        20        30        40        50        60        70
                    ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*
gi 1720384125     8 SELIEKKRTKLLSVLQqdPDSILDTLTSRSLISEKEYETLEEITDPLKKSRKLLILIQKKGEDSCRRFLRCLSNA 82
Cdd:smart00114    6 KRLLRRNRVRLGEELG--VDGLLDYLVEKNVLTEKEIEAIKAATTKLRDKRELVDSLQKRGSQAFDTFLDSLQET 78
PHA03307 PHA03307
transcriptional regulator ICP4; Provisional
882-1132 1.07e-07

transcriptional regulator ICP4; Provisional


Pssm-ID: 223039 [Multi-domain]  Cd Length: 1352  Bit Score: 56.72  E-value: 1.07e-07
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720384125  882 PQPCQPAGASQERIMPVSHQGAQQTTQ-GRPADFAFKPGSQSTSGSKLSSTSQSSAHQPKFQSKHFQPQPFQPVPSQKKP 960
Cdd:PHA03307   126 PPPSPAPDLSEMLRPVGSPGPPPAASPpAAGASPAAVASDAASSRQAALPLSSPEETARAPSSPPAEPPPSTPPAAASPR 205
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720384125  961 SHSRPSQAKPPHLDPSHANL-TQGQPSQATPTHSQASQAKPTHSQANSHHPHPSHA----KPSHQNPSHANPTHPQSSHA 1035
Cdd:PHA03307   206 PPRRSSPISASASSPAPAPGrSAADDAGASSSDSSSSESSGCGWGPENECPLPRPApitlPTRIWEASGWNGPSSRPGPA 285
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720384125 1036 KPSHP---QSSHAKPSHPQSSHAKPSHPQSSHAKPSHPQSSQAKPSHPQSSQAKPTHPQSsqanSHHPQASQAKPSHPQS 1112
Cdd:PHA03307   286 SSSSSpreRSPSPSPSSPGSGPAPSSPRASSSSSSSRESSSSSTSSSSESSRGAAVSPGP----SPSRSPSPSRPPPPAD 361
                          250       260
                   ....*....|....*....|
gi 1720384125 1113 SHAKPSHPHPSHAKPSPSQS 1132
Cdd:PHA03307   362 PSSPRKRPRPSRAPSSPAAS 381
PHA03307 PHA03307
transcriptional regulator ICP4; Provisional
947-1175 1.57e-07

transcriptional regulator ICP4; Provisional


Pssm-ID: 223039 [Multi-domain]  Cd Length: 1352  Bit Score: 55.95  E-value: 1.57e-07
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720384125  947 QPQPFQPVPSQKKPSHSRPSQAKPPHLDPSHANLTQGQPSQATPTHSQASQAKPTHSQANSHHPHPSHAKPSHQNPSHAN 1026
Cdd:PHA03307   171 QAALPLSSPEETARAPSSPPAEPPPSTPPAAASPRPPRRSSPISASASSPAPAPGRSAADDAGASSSDSSSSESSGCGWG 250
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720384125 1027 PThpqsSHAKPSHPQSSHAKPSHPQSSHAKPSHPQSSHAKPSHP---QSSQAKPSHPQSSQAKPTHPQSSQANSHHPQAS 1103
Cdd:PHA03307   251 PE----NECPLPRPAPITLPTRIWEASGWNGPSSRPGPASSSSSpreRSPSPSPSSPGSGPAPSSPRASSSSSSSRESSS 326
                          170       180       190       200       210       220       230
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*.
gi 1720384125 1104 QAKPSHPQSSHAKPSHPHPSHAKP----SPSQSTQCKAhKAHQSQPKPFQPRPTQPKSSKTKPSQARAFHPRAGRR 1175
Cdd:PHA03307   327 SSTSSSSESSRGAAVSPGPSPSRSpspsRPPPPADPSS-PRKRPRPSRAPSSPAASAGRPTRRRARAAVAGRARRR 401
PHA03378 PHA03378
EBNA-3B; Provisional
948-1173 1.98e-07

EBNA-3B; Provisional


Pssm-ID: 223065 [Multi-domain]  Cd Length: 991  Bit Score: 55.46  E-value: 1.98e-07
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720384125  948 PQPFQPV--PSQKKPSHSRPSQAKPPHLDPSHANLTQGQPSQATPTHSQASQAKPTHSQANSHHPHPSH-------AKPS 1018
Cdd:PHA03378   571 PLQIQPLtsPTTSQLASSAPSYAQTPWPVPHPSQTPEPPTTQSHIPETSAPRQWPMPLRPIPMRPLRMQpitfnvlVFPT 650
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720384125 1019 HQNPSHANPTHPQSSHAKPSHPqsshakPSHPQSSHAKPSHPQSshAKPSHPQSSQAKPSHPQSSQAKPTHPQSSQANSH 1098
Cdd:PHA03378   651 PHQPPQVEITPYKPTWTQIGHI------PYQPSPTGANTMLPIQ--WAPGTMQPPPRAPTPMRPPAAPPGRAQRPAAATG 722
                          170       180       190       200       210       220       230
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*
gi 1720384125 1099 HPQASQAKPSHPQSSHAKPSHPHPSHAKPSPSQSTQCKAHKAHQSQPKPFQPRPTQPKSSKTKPSQarafHPRAG 1173
Cdd:PHA03378   723 RARPPAAAPGRARPPAAAPGRARPPAAAPGRARPPAAAPGRARPPAAAPGAPTPQPPPQAPPAPQQ----RPRGA 793
PTZ00395 PTZ00395
Sec24-related protein; Provisional
956-1132 2.07e-07

Sec24-related protein; Provisional


Pssm-ID: 185594 [Multi-domain]  Cd Length: 1560  Bit Score: 55.47  E-value: 2.07e-07
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720384125  956 SQKKPSHSRPSQAKPPHLDPSHANLT---QGQPsQATPTHSQASQAKPTHS---QANSHHPHPSHAKPSHQNPSHANPTH 1029
Cdd:PTZ00395   351 SAGAPFNGLGNQADGGHINQVHPDARgawAGGP-HSNASYNCAAYSNAAQSnaaQSNAGFSNAGYSNPGNSNPGYNNAPN 429
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720384125 1030 PQSSHAKPSHPQSSHAKPSHPQSSHAKPSHPQSSHAKPSH---PQSSQAKpsHPQSSQAKPTHPQSSQANSHHPQASQAK 1106
Cdd:PTZ00395   430 SNTPYNNPPNSNTPYSNPPNSNPPYSNLPYSNTPYSNAPLsnaPPSSAKD--HHSAYHAAYQHRAANQPAANLPTANQPA 507
                          170       180
                   ....*....|....*....|....*.
gi 1720384125 1107 PSHPQSSHAKpSHPHPSHAKPSPSQS 1132
Cdd:PTZ00395   508 ANNFHGAAGN-SVGNPFASRPFGSAP 532
PAT1 pfam09770
Topoisomerase II-associated protein PAT1; Members of this family are necessary for accurate ...
880-1072 5.78e-07

Topoisomerase II-associated protein PAT1; Members of this family are necessary for accurate chromosome transmission during cell division.


Pssm-ID: 401645 [Multi-domain]  Cd Length: 846  Bit Score: 53.89  E-value: 5.78e-07
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720384125  880 PHPQPCQPAGASQER-IMPVSHQGAQQTTQgrpadfAFKPGSQSTSGSKLSSTSQSSAHQPKFQSKHFQPQPFQPVPSQK 958
Cdd:pfam09770  177 PQPAAQPASLPAPSRkMMSLEEVEAAMRAQ------AKKPAQQPAPAPAQPPAAPPAQQAQQQQQFPPQIQQQQQPQQQP 250
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720384125  959 KPSHSRPSQAKPPHLdpshanLTQGQPSQATPTHSQASQAKPTHSQanshhphpsHAKPSHQNPSH--ANPTHPQSSHAK 1036
Cdd:pfam09770  251 QQPQQHPGQGHPVTI------LQRPQSPQPDPAQPSIQPQAQQFHQ---------QPPPVPVQPTQilQNPNRLSAARVG 315
                          170       180       190
                   ....*....|....*....|....*....|....*.
gi 1720384125 1037 PSHPQSSHAKPSHPQSSHAKPSHPQSSHAKPSHPQS 1072
Cdd:pfam09770  316 YPQNPQPGVQPAPAHQAHRQQGSFGRQAPIITHPQQ 351
PHA03378 PHA03378
EBNA-3B; Provisional
881-1172 7.24e-07

EBNA-3B; Provisional


Pssm-ID: 223065 [Multi-domain]  Cd Length: 991  Bit Score: 53.53  E-value: 7.24e-07
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720384125  881 HPQPCQPAGASQERIMPVSHQGAQQTTQGRPADFA---FKPGSQSTSGSKLSSTSQSSAHQPKFQSKHFQPQPFQPVPSQ 957
Cdd:PHA03378   560 HDQLLPAPGLGPLQIQPLTSPTTSQLASSAPSYAQtpwPVPHPSQTPEPPTTQSHIPETSAPRQWPMPLRPIPMRPLRMQ 639
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720384125  958 KKPSH--SRPSQAKPPHLDPSHANLTQGQ----PSQATPTHSQAS---QAKPTHSQANSHHP-------------HPSHA 1015
Cdd:PHA03378   640 PITFNvlVFPTPHQPPQVEITPYKPTWTQighiPYQPSPTGANTMlpiQWAPGTMQPPPRAPtpmrppaappgraQRPAA 719
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720384125 1016 KPSHQNPSHANPTHPQSSHAKPSH---PQSSHAKPSHPQSSHAKPSHPQSSHAKPSHPQSSQAKPSHPQSSQAKPTHPQS 1092
Cdd:PHA03378   720 ATGRARPPAAAPGRARPPAAAPGRarpPAAAPGRARPPAAAPGRARPPAAAPGAPTPQPPPQAPPAPQQRPRGAPTPQPP 799
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720384125 1093 SQANshhPQASQAKPSHPQSSHAKPSHPHPSHAKPSPSQSTQCKAHKAHQSQPKPFQPRPTQPKSSKTKPSQARAFHPRA 1172
Cdd:PHA03378   800 PQAG---PTSMQLMPRAAPGQQGPTKQILRQLLTGGVKRGRPSLKKPAALERQAAAGPTPSPGSGTSDKIVQAPVFYPPV 876
PHA03377 PHA03377
EBNA-3C; Provisional
802-1129 9.30e-07

EBNA-3C; Provisional


Pssm-ID: 177614 [Multi-domain]  Cd Length: 1000  Bit Score: 53.52  E-value: 9.30e-07
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720384125  802 GRSQRQASPVQTHPKSRQMSRTLERSGTVVSRVGHGRSLGSQARRAAGKPQP-EKACAQGLQLTKAAGKSIRTLPHIKYP 880
Cdd:PHA03377   544 GRRQKRATPPKVSPSDRGPPKASPPVMAPPSTGPRVMATPSTGPRDMAPPSTgPRQQAKCKDGPPASGPHEKQPPSSAPR 623
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720384125  881 HPQPCQPAGASQERIMPvshqgaqQTTQGRPADF-AFKPGSQSTSGSKLSSTSQSSAHQPKFQSKHFQPQPF-------- 951
Cdd:PHA03377   624 DMAPSVVRMFLRERLLE-------QSTGPKPKSFwEMRAGRDGSGIQQEPSSRRQPATQSTPPRPSWLPSVFvlpsvdag 696
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720384125  952 QPVPSQKKPSHS----RPS--QAKPPHLDPSHANLTQGQPSQATPTHSQA---SQAKPTHSQAnshhPHPSHAKP----- 1017
Cdd:PHA03377   697 RAQPSEESHLSSmsptQPIshEEQPRYEDPDDPLDLSLHPDQAPPPSHQApysGHEEPQAQQA----PYPGYWEPrppqa 772
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720384125 1018 --------------SHQNPSHANPTHPQSSHAKPSHP---QSSHAKPSHPQSSHA-KPSHPQ---SSHAKPSHPQSSQAK 1076
Cdd:PHA03377   773 pylgyqepqaqgvqVSSYPGYAGPWGLRAQHPRYRHSwayWSQYPGHGHPQGPWApRPPHLPpqwDGSAGHGQDQVSQFP 852
                          330       340       350       360       370
                   ....*....|....*....|....*....|....*....|....*....|...
gi 1720384125 1077 PSHPQSSQAKPTHPQSSQANSHHPQASQAKPSHPQSSHAKPSHPHPSHAKPSP 1129
Cdd:PHA03377   853 HLQSETGPPRLQLSQVPQLPYSQTLVSSSAPSWSSPQPRAPIRPIPTRFPPPP 905
PTZ00449 PTZ00449
104 kDa microneme/rhoptry antigen; Provisional
982-1151 1.20e-06

104 kDa microneme/rhoptry antigen; Provisional


Pssm-ID: 185628 [Multi-domain]  Cd Length: 943  Bit Score: 53.15  E-value: 1.20e-06
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720384125  982 QGQPSQAtPTHSQASQAKPTHSQANSHHPHPShaKPSHQNPSHANP-THPQSSHAKPSHPQSSHaKPSHPQSSHAKPSHP 1060
Cdd:PTZ00449   507 HDEPPEG-PEASGLPPKAPGDKEGEEGEHEDS--KESDEPKEGGKPgETKEGEVGKKPGPAKEH-KPSKIPTLSKKPEFP 582
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720384125 1061 QsshaKPSHPQ--SSQAKPSHPQSSQaKPTHPQSSQ--ANSHHPQaSQAKPSHPQSSHAKPSHPHPSHAKpSPSQSTQCK 1136
Cdd:PTZ00449   583 K----DPKHPKdpEEPKKPKRPRSAQ-RPTRPKSPKlpELLDIPK-SPKRPESPKSPKRPPPPQRPSSPE-RPEGPKIIK 655
                          170
                   ....*....|....*
gi 1720384125 1137 AHKAHQSQPKPFQPR 1151
Cdd:PTZ00449   656 SPKPPKSPKPPFDPK 670
PHA03247 PHA03247
large tegument protein UL36; Provisional
712-1169 1.28e-06

large tegument protein UL36; Provisional


Pssm-ID: 223021 [Multi-domain]  Cd Length: 3151  Bit Score: 53.02  E-value: 1.28e-06
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720384125  712 PYPAHPWPLPIEAGSNFYHVPLRAPRAISSHFRSQQKAewffPFPHQNTSVHSRGqnfaiKYLQPWRFYSRERFTRCSAT 791
Cdd:PHA03247  2609 RGPAPPSPLPPDTHAPDPPPPSPSPAANEPDPHPPPTV----PPPERPRDDPAPG-----RVSRPRRARRLGRAAQASSP 2679
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720384125  792 PQQYHPN------GPFGRSQRQASPVQThPKSRQMSRTLERSGTVVSRVGHGRSLGSQARRAAgKPQPEKACAQGLQLTK 865
Cdd:PHA03247  2680 PQRPRRRaarptvGSLTSLADPPPPPPT-PEPAPHALVSATPLPPGPAAARQASPALPAAPAP-PAVPAGPATPGGPARP 2757
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720384125  866 AAGKSIRTLPHikyPHPqPCQPAGASQERIMPVSHQGAQQTTQGRPADFAFKPGSQSTSGSKLSSTSQSSAHQPKFQSKH 945
Cdd:PHA03247  2758 ARPPTTAGPPA---PAP-PAAPAAGPPRRLTRPAVASLSESRESLPSPWDPADPPAAVLAPAAALPPAASPAGPLPPPTS 2833
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720384125  946 FQPQPfQPVPSQKKPSHSRPSQAKPPHLDPSHANLTQGQPSQ-ATPTHSQASQ-AKPTHSQANSHHPHPSHAKPSHQNPS 1023
Cdd:PHA03247  2834 AQPTA-PPPPPGPPPPSLPLGGSVAPGGDVRRRPPSRSPAAKpAAPARPPVRRlARPAVSRSTESFALPPDQPERPPQPQ 2912
                          330       340       350       360       370       380       390       400
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720384125 1024 HANPTHPQSSHAKPSHPQSSHAKPSHPQSshakPSHPQSSHAKPSHPQSSQAKPSH---------------PQSSQAKPT 1088
Cdd:PHA03247  2913 APPPPQPQPQPPPPPQPQPPPPPPPRPQP----PLAPTTDPAGAGEPSGAVPQPWLgalvpgrvavprfrvPQPAPSREA 2988
                          410       420       430       440       450       460       470       480
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720384125 1089 HPQSSQANSHHPQ----------ASQAKPSHPQSSHAKPSHPHPSHAKPSPSQSTQCKAHKAHQSQPKPFQPRPTQPKSS 1158
Cdd:PHA03247  2989 PASSTPPLTGHSLsrvsswasslALHEETDPPPVSLKQTLWPPDDTEDSDADSLFDSDSERSDLEALDPLPPEPHDPFAH 3068
                          490
                   ....*....|.
gi 1720384125 1159 KTKPSQARAFH 1169
Cdd:PHA03247  3069 EPDPATPEAGA 3079
Atrophin-1 pfam03154
Atrophin-1 family; Atrophin-1 is the protein product of the dentatorubral-pallidoluysian ...
880-1165 1.38e-06

Atrophin-1 family; Atrophin-1 is the protein product of the dentatorubral-pallidoluysian atrophy (DRPLA) gene. DRPLA OMIM:125370 is a progressive neurodegenerative disorder. It is caused by the expansion of a CAG repeat in the DRPLA gene on chromosome 12p. This results in an extended polyglutamine region in atrophin-1, that is thought to confer toxicity to the protein, possibly through altering its interactions with other proteins. The expansion of a CAG repeat is also the underlying defect in six other neurodegenerative disorders, including Huntington's disease. One interaction of expanded polyglutamine repeats that is thought to be pathogenic is that with the short glutamine repeat in the transcriptional coactivator CREB binding protein, CBP. This interaction draws CBP away from its usual nuclear location to the expanded polyglutamine repeat protein aggregates that are characteriztic of the polyglutamine neurodegenerative disorders. This interferes with CBP-mediated transcription and causes cytotoxicity.


Pssm-ID: 460830 [Multi-domain]  Cd Length: 991  Bit Score: 52.85  E-value: 1.38e-06
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720384125  880 PHPQPCQPAGASQERIMPVSHQGAQQ----TTQGRPADF-------AFKPGSQSTSGSKLSSTSQSSAHQPkfQSKHFQP 948
Cdd:pfam03154  251 PMTQPPPPSQVSPQPLPQPSLHGQMPpmphSLQTGPSHMqhpvppqPFPLTPQSSQSQVPPGPSPAAPGQS--QQRIHTP 328
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720384125  949 qPFQPVPSQKKPSHSRPSQAKP---PHLDPSHANLTQGQPSQATPTHSqASQAKPTHSQANSHHPHPSHAKPSHQNPSHA 1025
Cdd:pfam03154  329 -PSQSQLQSQQPPREQPLPPAPlsmPHIKPPPTTPIPQLPNPQSHKHP-PHLSGPSPFQMNSNLPPPPALKPLSSLSTHH 406
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720384125 1026 NPT-HP---------QSSHAKPSHPQSSHAKPSHPQSShakpshpqSSHAKPSHPQSSQAKPSHPQSSQAKPTHPQSSQA 1095
Cdd:pfam03154  407 PPSaHPpplqlmpqsQQLPPPPAQPPVLTQSQSLPPPA--------ASHPPTSGLHQVPSQSPFPQHPFVPGGPPPITPP 478
                          250       260       270       280       290       300       310
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*
gi 1720384125 1096 NSHHPQASQAKPSHPQSSHAKPSHPHPSHAKPS-PSQSTQCKAHKAHQSQ----PKPFQPRPTQPKSSKTKPSQA 1165
Cdd:pfam03154  479 SGPPTSTSSAMPGIQPPSSASVSSSGPVPAAVScPLPPVQIKEEALDEAEepesPPPPPRSPSPEPTVVNTPSHA 553
AF-4 pfam05110
AF-4 proto-oncoprotein N-terminal region; This family consists of AF4 (Proto-oncogene AF4) and ...
936-1131 1.43e-06

AF-4 proto-oncoprotein N-terminal region; This family consists of AF4 (Proto-oncogene AF4) and FMR2 (Fragile X E mental retardation syndrome) nuclear proteins. These proteins have been linked to human diseases such as acute lymphoblastic leukaemia and mental retardation. The family also contains a Drosophila AF4 protein homolog Lilliputian which contains an AT-hook domain. Lilliputian represents a novel pair-rule gene that acts in cytoskeleton regulation, segmentation and morphogenesis in Drosophila.


Pssm-ID: 461550 [Multi-domain]  Cd Length: 514  Bit Score: 52.43  E-value: 1.43e-06
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720384125  936 AHQPKFQSKHFQPQPfQPVPSQKKPSHSRPSQAKPPHLDPSHA--NLTQGQPSQATPT-----HSQASQAKPTHSQANSH 1008
Cdd:pfam05110  100 LPPSFHTSSHSQPMG-PPSSSSPSVSSSQSQKKSQARTEPAHGghSSSGSQSSQRSQGqsrskGGQESHSSSHHKRQERR 178
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720384125 1009 HPHPSHAKPSHQNPSHANPTHPQSSHAKPSHPQSShakpSHPQSSHAKPShpqsshAKPSHPQSSQAKPshpqssqakpt 1088
Cdd:pfam05110  179 EDLFSCASLSHSLEELSPLLSSLSSPVKPLSPSHS----RQHTGSKAQNS------SDHHGKEYSHSKS----------- 237
                          170       180       190       200
                   ....*....|....*....|....*....|....*....|...
gi 1720384125 1089 hPQSSQANSHHPQASQAKPSHPQSSHAKPSHPHPSHAKPSPSQ 1131
Cdd:pfam05110  238 -PRDSEAGSHGPESPSTSLLASSSQLSSQTFPPSLPSKTSAMQ 279
CARD_NOD2_1_CARD15 cd08787
Caspase activation and recruitment domain of NOD2, repeat 1; Caspase activation and ...
11-88 1.89e-06

Caspase activation and recruitment domain of NOD2, repeat 1; Caspase activation and recruitment domain (CARD) similar to that found in human NOD2 (CARD15), repeat 1. NOD2 is a member of the Nod-like receptor (NLR) family, which plays a central role in the innate immune response. NLRs typically contain an N-terminal effector domain, a central nucleotide-binding domain and a C-terminal ligand-binding region of several leucine-rich repeats (LRRs). In NOD2, as well as NOD1, the N-terminal effector domain is a CARD. NOD2 contains two N-terminal CARD repeats. Mutations in NOD2 have been associated with Crohns disease and Blau syndrome. Nod2-CARDs have been shown to interact with the CARD domain of the downstream effector RICK (RIP2, CARDIAK), a serine/threonine kinase. In general, CARDs are death domains (DDs) found associated with caspases. They are known to be important in the signaling pathways for apoptosis, inflammation, and host-defense mechanisms. DDs are protein-protein interaction domains found in a variety of domain architectures. Their common feature is that they form homodimers by self-association or heterodimers by associating with other members of the DD superfamily including PYRIN and DED (Death Effector Domain). They serve as adaptors in signaling pathways and can recruit other proteins into signaling complexes.


Pssm-ID: 176765  Cd Length: 87  Bit Score: 47.22  E-value: 1.89e-06
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720384125   11 IEKKRTKLLSVL----QQDP-DSILDTLTSRSLISEKEYETLEEITDPL-KKSRKLLILIQKKGEDSCRRFLRCLSNAFP 84
Cdd:cd08787      1 FLAQRSELLEVLcsggSLEPfESVLDWLLSQEVLSWEDYEGFHVLGQPLsHNARQLLDTVYNKGEWACQKFLAAAQQALA 80

                   ....
gi 1720384125   85 ESAS 88
Cdd:cd08787     81 EEQS 84
PRK10263 PRK10263
DNA translocase FtsK; Provisional
947-1137 2.57e-06

DNA translocase FtsK; Provisional


Pssm-ID: 236669 [Multi-domain]  Cd Length: 1355  Bit Score: 52.01  E-value: 2.57e-06
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720384125  947 QPQPFQPVPSQKKPS--HSRPSQaKPPHLDPSHANLTQGQPSQATPTHSQASQAkpthsqANSHHPHPSHAKPSHQNPSH 1024
Cdd:PRK10263   376 APEGYPQQSQYAQPAvqYNEPLQ-QPVQPQQPYYAPAAEQPAQQPYYAPAPEQP------AQQPYYAPAPEQPVAGNAWQ 448
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720384125 1025 ANPTHPQSSHAKPSHPQSSHAKPSHPQSSHAKPSHPQSSHAKPSHPQSSQAKPSHP-----QSSQAKPTHPQSSQANSHH 1099
Cdd:PRK10263   449 AEEQQSTFAPQSTYQTEQTYQQPAAQEPLYQQPQPVEQQPVVEPEPVVEETKPARPplyyfEEVEEKRAREREQLAAWYQ 528
                          170       180       190
                   ....*....|....*....|....*....|....*...
gi 1720384125 1100 PQASQAKPSHPqsshAKPSHPHPSHAKPSPSQSTQCKA 1137
Cdd:PRK10263   529 PIPEPVKEPEP----IKSSLKAPSVAAVPPVEAAAAVS 562
PTZ00395 PTZ00395
Sec24-related protein; Provisional
966-1173 6.60e-06

Sec24-related protein; Provisional


Pssm-ID: 185594 [Multi-domain]  Cd Length: 1560  Bit Score: 50.84  E-value: 6.60e-06
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720384125  966 SQAKPPHLDPSHANLTQGQPSQaTPTHSQASQaKPTHSQANSHHPHPSHAKPSHQNPS--------HANPTHPQSSHAKP 1037
Cdd:PTZ00395   303 NNTNDAQRNAIQGDLVRGAPND-KNSFDRGNE-KTYQIYGGFHDGSPNAASAGAPFNGlgnqadggHINQVHPDARGAWA 380
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720384125 1038 SHPQS----SHAKPSHPQSSHAKPSHPQSSHAKPSHPQSSQAKPSHPQSSQAkpthPQSSQANSHHPQASQAKPSHPQS- 1112
Cdd:PTZ00395   381 GGPHSnasyNCAAYSNAAQSNAAQSNAGFSNAGYSNPGNSNPGYNNAPNSNT----PYNNPPNSNTPYSNPPNSNPPYSn 456
                          170       180       190       200       210       220
                   ....*....|....*....|....*....|....*....|....*....|....*....|....
gi 1720384125 1113 ---SHAKPSHPHPSHAKPSPSQSTQCKAHKAHQSQPKPfQPRPTQPKSSKtkpSQARAFHPRAG 1173
Cdd:PTZ00395   457 lpySNTPYSNAPLSNAPPSSAKDHHSAYHAAYQHRAAN-QPAANLPTANQ---PAANNFHGAAG 516
PAT1 pfam09770
Topoisomerase II-associated protein PAT1; Members of this family are necessary for accurate ...
1014-1170 7.77e-06

Topoisomerase II-associated protein PAT1; Members of this family are necessary for accurate chromosome transmission during cell division.


Pssm-ID: 401645 [Multi-domain]  Cd Length: 846  Bit Score: 50.42  E-value: 7.77e-06
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720384125 1014 HAKPSHQNP----SHANPTHPQSSHAKPSHPQSSHAKPSHPQSS--------------HAKPS----HPQSSHAKPSHPQ 1071
Cdd:pfam09770   99 QVRFNRQQPaaraAQSSAQPPASSLPQYQYASQQSQQPSKPVRTgyekykepepipdlQVDASlwgvAPKKAAAPAPAPQ 178
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720384125 1072 SSQAKPSHPQSS---------------QAKPTHPQSSQANSHHPQASQAKPSHPQSSHAKP-SHPHPSHAKPSPSQSTQC 1135
Cdd:pfam09770  179 PAAQPASLPAPSrkmmsleeveaamraQAKKPAQQPAPAPAQPPAAPPAQQAQQQQQFPPQiQQQQQPQQQPQQPQQHPG 258
                          170       180       190
                   ....*....|....*....|....*....|....*.
gi 1720384125 1136 KAHKAHQSQ-PKPFQPRPTQPKSSKTKPSQARAFHP 1170
Cdd:pfam09770  259 QGHPVTILQrPQSPQPDPAQPSIQPQAQQFHQQPPP 294
PHA03247 PHA03247
large tegument protein UL36; Provisional
870-1117 8.13e-06

large tegument protein UL36; Provisional


Pssm-ID: 223021 [Multi-domain]  Cd Length: 3151  Bit Score: 50.32  E-value: 8.13e-06
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720384125  870 SIRTLPHIKYPHPQPCQPAGASQERIMPVSHQGAQQTTQGRPA--------DFAFKPGSQSTSGSKLSSTSQSSAHQPKF 941
Cdd:PHA03247  2834 AQPTAPPPPPGPPPPSLPLGGSVAPGGDVRRRPPSRSPAAKPAaparppvrRLARPAVSRSTESFALPPDQPERPPQPQA 2913
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720384125  942 QSKHfQPQPFQPVPSQKKPSHSRPSQAKPPhLDPSHANLTQGQPSQATPTHSQASQAKPTHSQANSHHPHPSHAKPSHQN 1021
Cdd:PHA03247  2914 PPPP-QPQPQPPPPPQPQPPPPPPPRPQPP-LAPTTDPAGAGEPSGAVPQPWLGALVPGRVAVPRFRVPQPAPSREAPAS 2991
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720384125 1022 PSHANPTHPQSSHAKPSHPQSSHAKPSHPQSSHAKPSHPQS--SHAKPSHPQSSQAKPSHPQSSQAKPTHPQSSQANSHH 1099
Cdd:PHA03247  2992 STPPLTGHSLSRVSSWASSLALHEETDPPPVSLKQTLWPPDdtEDSDADSLFDSDSERSDLEALDPLPPEPHDPFAHEPD 3071
                          250
                   ....*....|....*...
gi 1720384125 1100 PQASQAKPSHPQSSHAKP 1117
Cdd:PHA03247  3072 PATPEAGARESPSSQFGP 3089
PRK10263 PRK10263
DNA translocase FtsK; Provisional
1040-1158 2.59e-05

DNA translocase FtsK; Provisional


Pssm-ID: 236669 [Multi-domain]  Cd Length: 1355  Bit Score: 48.54  E-value: 2.59e-05
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720384125 1040 PQSSHAKPSHPQSSHAKPSHPQSSHAKPSHPQSSQAKPSHPQSSQaKPTHPQSSQANSHHPQASQAKPSHPQsshaKPSH 1119
Cdd:PRK10263   751 PVQQPQQPVAPQQQYQQPQQPVAPQPQYQQPQQPVAPQPQYQQPQ-QPVAPQPQYQQPQQPVAPQPQYQQPQ----QPVA 825
                           90       100       110       120
                   ....*....|....*....|....*....|....*....|....
gi 1720384125 1120 PHPSHAKPSPSQSTQCKAHKAH-----QSQPKPFQpRPTQPKSS 1158
Cdd:PRK10263   826 PQPQYQQPQQPVAPQPQDTLLHpllmrNGDSRPLH-KPTTPLPS 868
PRK10263 PRK10263
DNA translocase FtsK; Provisional
957-1130 2.94e-05

DNA translocase FtsK; Provisional


Pssm-ID: 236669 [Multi-domain]  Cd Length: 1355  Bit Score: 48.54  E-value: 2.94e-05
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720384125  957 QKKPSHSRPSQAKPPHLD-----PSHANLTQGqPSQATPTHSQASQAKPTHSQAnshhphpshakPSHQNPSHANPTHPQ 1031
Cdd:PRK10263   708 QQRYSGEQPAGANPFSLDdfefsPMKALLDDG-PHEPLFTPIVEPVQQPQQPVA-----------PQQQYQQPQQPVAPQ 775
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720384125 1032 SSHAKPSHPqsshAKPSHPQSSHAKPSHPQSSHAKPSHPQSSQAKPSHPQSSQAKPthPQSSQANSHHPQASQAKPSHPQ 1111
Cdd:PRK10263   776 PQYQQPQQP----VAPQPQYQQPQQPVAPQPQYQQPQQPVAPQPQYQQPQQPVAPQ--PQYQQPQQPVAPQPQDTLLHPL 849
                          170
                   ....*....|....*....
gi 1720384125 1112 SSHAKPSHPHPSHAKPSPS 1130
Cdd:PRK10263   850 LMRNGDSRPLHKPTTPLPS 868
AF-4 pfam05110
AF-4 proto-oncoprotein N-terminal region; This family consists of AF4 (Proto-oncogene AF4) and ...
1036-1162 3.56e-05

AF-4 proto-oncoprotein N-terminal region; This family consists of AF4 (Proto-oncogene AF4) and FMR2 (Fragile X E mental retardation syndrome) nuclear proteins. These proteins have been linked to human diseases such as acute lymphoblastic leukaemia and mental retardation. The family also contains a Drosophila AF4 protein homolog Lilliputian which contains an AT-hook domain. Lilliputian represents a novel pair-rule gene that acts in cytoskeleton regulation, segmentation and morphogenesis in Drosophila.


Pssm-ID: 461550 [Multi-domain]  Cd Length: 514  Bit Score: 47.81  E-value: 3.56e-05
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720384125 1036 KPSHPQSSHAKPSHP----QSSHAKPSHPQSSHAKPSHPQSSQAKPSHPQSSQAKPTHPQSSQANSHHPQASQAKPSHPQ 1111
Cdd:pfam05110   77 KNSVPQTPQEKPDQPffpdKTSGLPPSFHTSSHSQPMGPPSSSSPSVSSSQSQKKSQARTEPAHGGHSSSGSQSSQRSQG 156
                           90       100       110       120       130
                   ....*....|....*....|....*....|....*....|....*....|.
gi 1720384125 1112 SSHAKPSHPHPSHAKPSPSQSTQCKAHKAHQSQPKPFQPRPTQPKSSKTKP 1162
Cdd:pfam05110  157 QSRSKGGQESHSSSHHKRQERREDLFSCASLSHSLEELSPLLSSLSSPVKP 207
PTZ00449 PTZ00449
104 kDa microneme/rhoptry antigen; Provisional
980-1171 5.15e-05

104 kDa microneme/rhoptry antigen; Provisional


Pssm-ID: 185628 [Multi-domain]  Cd Length: 943  Bit Score: 47.76  E-value: 5.15e-05
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720384125  980 LTQGQPSQATPTHSQASQAK--PTHSQANSHHPHPshakPSHQNPSHANPTHPQSSHAKPSHPQSShaKPSHPQSSHAKP 1057
Cdd:PTZ00449   476 ISKIQFTQEIKKLIKKSKKKlaPIEEEDSDKHDEP----PEGPEASGLPPKAPGDKEGEEGEHEDS--KESDEPKEGGKP 549
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720384125 1058 SHPQSSHAKPSHPQSSQAKPSHPQSSQAKPTHPQssqanshhpqasqaKPSHPQsshaKPSHPHPSHAKPSPSQSTQCKA 1137
Cdd:PTZ00449   550 GETKEGEVGKKPGPAKEHKPSKIPTLSKKPEFPK--------------DPKHPK----DPEEPKKPKRPRSAQRPTRPKS 611
                          170       180       190
                   ....*....|....*....|....*....|....*
gi 1720384125 1138 HKAHQSQPKPFQP-RPTQPKSSKTKPSQARAFHPR 1171
Cdd:PTZ00449   612 PKLPELLDIPKSPkRPESPKSPKRPPPPQRPSSPE 646
PHA03378 PHA03378
EBNA-3B; Provisional
955-1175 5.15e-05

EBNA-3B; Provisional


Pssm-ID: 223065 [Multi-domain]  Cd Length: 991  Bit Score: 47.75  E-value: 5.15e-05
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720384125  955 PSQKKPSHSRPSQAkpPHLDPshanlTQGQPsQATPTHSQASQAKPTHSQANSHHPHPSHAK--PSHQN--PSHANPTH- 1029
Cdd:PHA03378   553 PASTEPVHDQLLPA--PGLGP-----LQIQP-LTSPTTSQLASSAPSYAQTPWPVPHPSQTPepPTTQShiPETSAPRQw 624
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720384125 1030 PQSSHAKPSHP-----------------QSSHAKPSHPQSSHAKPSHP--QSSHAKPSHPQSSQAKPSHPQSSQAKPTHP 1090
Cdd:PHA03378   625 PMPLRPIPMRPlrmqpitfnvlvfptphQPPQVEITPYKPTWTQIGHIpyQPSPTGANTMLPIQWAPGTMQPPPRAPTPM 704
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720384125 1091 QSSQANSHHPQASQAKPSHPQSSHAKPSHPHPSHAKPSPSQSTQCKAHKAHQSQPKPFQPRPTQPKSSKTKPSQARAFHP 1170
Cdd:PHA03378   705 RPPAAPPGRAQRPAAATGRARPPAAAPGRARPPAAAPGRARPPAAAPGRARPPAAAPGRARPPAAAPGAPTPQPPPQAPP 784

                   ....*
gi 1720384125 1171 RAGRR 1175
Cdd:PHA03378   785 APQQR 789
Atrophin-1 pfam03154
Atrophin-1 family; Atrophin-1 is the protein product of the dentatorubral-pallidoluysian ...
792-1083 5.20e-05

Atrophin-1 family; Atrophin-1 is the protein product of the dentatorubral-pallidoluysian atrophy (DRPLA) gene. DRPLA OMIM:125370 is a progressive neurodegenerative disorder. It is caused by the expansion of a CAG repeat in the DRPLA gene on chromosome 12p. This results in an extended polyglutamine region in atrophin-1, that is thought to confer toxicity to the protein, possibly through altering its interactions with other proteins. The expansion of a CAG repeat is also the underlying defect in six other neurodegenerative disorders, including Huntington's disease. One interaction of expanded polyglutamine repeats that is thought to be pathogenic is that with the short glutamine repeat in the transcriptional coactivator CREB binding protein, CBP. This interaction draws CBP away from its usual nuclear location to the expanded polyglutamine repeat protein aggregates that are characteriztic of the polyglutamine neurodegenerative disorders. This interferes with CBP-mediated transcription and causes cytotoxicity.


Pssm-ID: 460830 [Multi-domain]  Cd Length: 991  Bit Score: 47.45  E-value: 5.20e-05
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720384125  792 PQQYHPNGPFGRSQRQASPVQTHPKSRQMSRTLERSGTVVSRVGHGRSLGSQARRAAGKPQPEKACAQGLQ-LTKAAGKS 870
Cdd:pfam03154  269 PSLHGQMPPMPHSLQTGPSHMQHPVPPQPFPLTPQSSQSQVPPGPSPAAPGQSQQRIHTPPSQSQLQSQQPpREQPLPPA 348
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720384125  871 IRTLPHIKYPHPQPCQPAGASQERIMP--VSHQGAQQTTQGRPADFAFKPGSQSTSgsklssTSQSSAHQPKFQSKHfQP 948
Cdd:pfam03154  349 PLSMPHIKPPPTTPIPQLPNPQSHKHPphLSGPSPFQMNSNLPPPPALKPLSSLST------HHPPSAHPPPLQLMP-QS 421
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720384125  949 QPFQPVPSQKKPSHSRPSQAKPPHLDPSHANLTQGQPSQATPTHSQASQAKPTHSQANSHHPHPSHAKPSHQNPSHANPT 1028
Cdd:pfam03154  422 QQLPPPPAQPPVLTQSQSLPPPAASHPPTSGLHQVPSQSPFPQHPFVPGGPPPITPPSGPPTSTSSAMPGIQPPSSASVS 501
                          250       260       270       280       290
                   ....*....|....*....|....*....|....*....|....*....|....*
gi 1720384125 1029 HPQSSHAKPSHPQSSHAKPSHPQSSHAKPSHPQSSHAKPSHPQSSQAKPSHPQSS 1083
Cdd:pfam03154  502 SSGPVPAAVSCPLPPVQIKEEALDEAEEPESPPPPPRSPSPEPTVVNTPSHASQS 556
AF-4 pfam05110
AF-4 proto-oncoprotein N-terminal region; This family consists of AF4 (Proto-oncogene AF4) and ...
974-1170 6.11e-05

AF-4 proto-oncoprotein N-terminal region; This family consists of AF4 (Proto-oncogene AF4) and FMR2 (Fragile X E mental retardation syndrome) nuclear proteins. These proteins have been linked to human diseases such as acute lymphoblastic leukaemia and mental retardation. The family also contains a Drosophila AF4 protein homolog Lilliputian which contains an AT-hook domain. Lilliputian represents a novel pair-rule gene that acts in cytoskeleton regulation, segmentation and morphogenesis in Drosophila.


Pssm-ID: 461550 [Multi-domain]  Cd Length: 514  Bit Score: 47.04  E-value: 6.11e-05
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720384125  974 DPSHANLTqGQPSQATPTHSQASQAKPTHSQANSHHPHPSHAKPSHQ--NPSHANPTHPQSSHakpsHPQSSHAKPSHPQ 1051
Cdd:pfam05110   66 NKSNQHLV-GIPKNSVPQTPQEKPDQPFFPDKTSGLPPSFHTSSHSQpmGPPSSSSPSVSSSQ----SQKKSQARTEPAH 140
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720384125 1052 SSHAKP-SHPQSSHAKPSHPQSSQAK-PSHPQSSQAKPTHPQSSQANSHhpQASQAKPSHP-QSSHAKPSHPHPS----H 1124
Cdd:pfam05110  141 GGHSSSgSQSSQRSQGQSRSKGGQEShSSSHHKRQERREDLFSCASLSH--SLEELSPLLSsLSSPVKPLSPSHSrqhtG 218
                          170       180       190       200
                   ....*....|....*....|....*....|....*....|....*.
gi 1720384125 1125 AKPSPSQSTQCKAHKAHQSqPKPFQPRPTQPKSSKTKPSQARAFHP 1170
Cdd:pfam05110  219 SKAQNSSDHHGKEYSHSKS-PRDSEAGSHGPESPSTSLLASSSQLS 263
dnaA PRK14086
chromosomal replication initiator protein DnaA;
981-1163 1.21e-04

chromosomal replication initiator protein DnaA;


Pssm-ID: 237605 [Multi-domain]  Cd Length: 617  Bit Score: 46.36  E-value: 1.21e-04
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720384125  981 TQGQPSQATPTHSQASQAKPTHSQANSHHPHPSHAKPSHQnpsHANPTHPQSSHAKPSHPQ-SSHAKP-SHPQSSHAKPS 1058
Cdd:PRK14086    91 SAGEPAPPPPHARRTSEPELPRPGRRPYEGYGGPRADDRP---PGLPRQDQLPTARPAYPAyQQRPEPgAWPRAADDYGW 167
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720384125 1059 HPQSSHAKPSHPQSSQAKPSHPQSSQAKPTHPQSSQANSHHPQASQAKPSHPQSSHAKPSHP-------HPSHAKPSPSQ 1131
Cdd:PRK14086   168 QQQRLGFPPRAPYASPASYAPEQERDREPYDAGRPEYDQRRRDYDHPRPDWDRPRRDRTDRPepppgagHVHRGGPGPPE 247
                          170       180       190
                   ....*....|....*....|....*....|..
gi 1720384125 1132 STQCKAHKAHQSQPKPFQPRPtQPKSSKTKPS 1163
Cdd:PRK14086   248 RDDAPVVPIRPSAPGPLAAQP-APAPGPGEPT 278
PAT1 pfam09770
Topoisomerase II-associated protein PAT1; Members of this family are necessary for accurate ...
875-1012 1.41e-04

Topoisomerase II-associated protein PAT1; Members of this family are necessary for accurate chromosome transmission during cell division.


Pssm-ID: 401645 [Multi-domain]  Cd Length: 846  Bit Score: 46.18  E-value: 1.41e-04
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720384125  875 PHIKYPHPQPCQPAGASQERIMPVSHQGAQQTTQGRPADFAFK--PGSQSTSGSKLSSTSQSSAHQPKFQSKHFQPQPFQ 952
Cdd:pfam09770  217 APAQPPAAPPAQQAQQQQQFPPQIQQQQQPQQQPQQPQQHPGQghPVTILQRPQSPQPDPAQPSIQPQAQQFHQQPPPVP 296
                           90       100       110       120       130       140
                   ....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720384125  953 PVPSQKKPSHSRPSQAKPPHLDPSHANlTQGQPSQATPtHSQASQAKPTHSQAnshHPHP 1012
Cdd:pfam09770  297 VQPTQILQNPNRLSAARVGYPQNPQPG-VQPAPAHQAH-RQQGSFGRQAPIIT---HPQQ 351
PRK10263 PRK10263
DNA translocase FtsK; Provisional
961-1150 1.79e-04

DNA translocase FtsK; Provisional


Pssm-ID: 236669 [Multi-domain]  Cd Length: 1355  Bit Score: 45.85  E-value: 1.79e-04
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720384125  961 SHSRPSQAKPPHLDPSHANLTQGQPSQATPTHSQASQAKPTHSQANSHHPHPSHAKPSHQNPSHANPTHPQSSHAKPS-- 1038
Cdd:PRK10263   295 SGNRATQPEYDEYDPLLNGAPITEPVAVAAAATTATQSWAAPVEPVTQTPPVASVDVPPAQPTVAWQPVPGPQTGEPVia 374
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720384125 1039 ------HPQSSHAKPSHPQSSH-AKPSHPQSSHAKPSHPQSSQAKPSHPQSSQA--------KPTHPQSSQANSHHPQAS 1103
Cdd:PRK10263   375 papegyPQQSQYAQPAVQYNEPlQQPVQPQQPYYAPAAEQPAQQPYYAPAPEQPaqqpyyapAPEQPVAGNAWQAEEQQS 454
                          170       180       190       200       210
                   ....*....|....*....|....*....|....*....|....*....|
gi 1720384125 1104 --QAKPSH-PQSSHAKPSHPHPSHAKPSPSQSTQCKAHKAHQSQPKPFQP 1150
Cdd:PRK10263   455 tfAPQSTYqTEQTYQQPAAQEPLYQQPQPVEQQPVVEPEPVVEETKPARP 504
dnaA PRK14086
chromosomal replication initiator protein DnaA;
1032-1171 2.70e-04

chromosomal replication initiator protein DnaA;


Pssm-ID: 237605 [Multi-domain]  Cd Length: 617  Bit Score: 45.20  E-value: 2.70e-04
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720384125 1032 SSHAKPSHPQSSHAKPSHPQSSHAKPShPQSSHAKPSHPQSSQAKPSHPQSSQAKPTHPQSSQAN--SHHPQASQAKPSH 1109
Cdd:PRK14086    90 PSAGEPAPPPPHARRTSEPELPRPGRR-PYEGYGGPRADDRPPGLPRQDQLPTARPAYPAYQQRPepGAWPRAADDYGWQ 168
                           90       100       110       120       130       140
                   ....*....|....*....|....*....|....*....|....*....|....*....|..
gi 1720384125 1110 PQSSHAKPSHPHPSHAKPSPSQSTQCKAHKAHQSQPKPFQPRPTQPKSSKTKPSQARAFHPR 1171
Cdd:PRK14086   169 QQRLGFPPRAPYASPASYAPEQERDREPYDAGRPEYDQRRRDYDHPRPDWDRPRRDRTDRPE 230
ROM1 COG5422
RhoGEF, Guanine nucleotide exchange factor for Rho/Rac/Cdc42-like GTPases [Signal transduction ...
939-1127 3.47e-04

RhoGEF, Guanine nucleotide exchange factor for Rho/Rac/Cdc42-like GTPases [Signal transduction mechanisms];


Pssm-ID: 227709 [Multi-domain]  Cd Length: 1175  Bit Score: 44.88  E-value: 3.47e-04
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720384125  939 PKFQSKHFQPQPFQPVPSQKKpSHSRPSQAKPPHLDPSHANLTQGQPSQATPTHSQASQAKPTHSQANSHHPHPS-HAKP 1017
Cdd:COG5422     80 PKLFQRRNSAGPITHSPSATS-STSSLNSNDGDQFSPASDSLSFNPSSTQSRKDSGPGDGSPVQKRKNPLLPSSStHGTH 158
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720384125 1018 SHQNPSHANPTHPQSSHAK-PSHPQSSHAKPSHPQSSHAKPSHPQSSHAKPSHPQSSQAKPSHPQSSQAKPTHPQSSQAN 1096
Cdd:COG5422    159 PPIVFTDNNGSHAGAPNARsRKEIPSLGSQSMQLPSPHFRQKFSSSDTSNGFSYPSIRKNSRHSSNSMPSFPHSSTAVLL 238
                          170       180       190
                   ....*....|....*....|....*....|....*.
gi 1720384125 1097 SHHPQASQAkpsHPQSSHAKPSHPH-----PSHAKP 1127
Cdd:COG5422    239 KRHSGSSGA---SLISSNITPSSSNseamsTSSKRP 271
ARG80 COG5068
Regulator of arginine metabolism and related MADS box-containing transcription factors ...
951-1155 4.20e-04

Regulator of arginine metabolism and related MADS box-containing transcription factors [Transcription];


Pssm-ID: 227400 [Multi-domain]  Cd Length: 412  Bit Score: 44.24  E-value: 4.20e-04
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720384125  951 FQPVPSQKKPSHSRPSQAKPPHLDPSHANLTQGQPSQAT-PTHSQASQAKPTHSQANSHHPHP-SHakpSHQNPShanPT 1028
Cdd:COG5068    161 NAPSDSSEEPSSSASFSVDPNDNNPMGSFQHNGSPQTNFiPLQNPQTQQYQQHSSRKDHPTVPhSN---TNNGRP---PA 234
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720384125 1029 HPQSSHAKPSHPQSSHakPSHPQSSHAKPSHPQSSHAKPSHPQSSQAKPSHPQSSQAKPTHpqssQANSHHPQASQAKPS 1108
Cdd:COG5068    235 KFMIPELHSSHSTLDL--PSDFISDSGFPNQSSTSIFPLDSAIIQITPPHLPNNPPQENRH----ELYSNDSSMVSETPP 308
                          170       180       190       200
                   ....*....|....*....|....*....|....*....|....*..
gi 1720384125 1109 HPQSSHAKPSHPHPSHAKPSPSQSTQCKAHKAHQSQPKPFQPRPTQP 1155
Cdd:COG5068    309 PKNLPNGSPNQSPLNNLSRGNPASPNSIIRENNQVEDESFNGRQGSA 355
PRK10263 PRK10263
DNA translocase FtsK; Provisional
1050-1170 4.28e-04

DNA translocase FtsK; Provisional


Pssm-ID: 236669 [Multi-domain]  Cd Length: 1355  Bit Score: 44.69  E-value: 4.28e-04
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720384125 1050 PQSSHAKPSHPQSSHAKPSHPQSSQAKPSHPQssqaKPTHPQSSQANSHHPQASQAKPSHPQsshaKPSHPHPSHAKPSp 1129
Cdd:PRK10263   751 PVQQPQQPVAPQQQYQQPQQPVAPQPQYQQPQ----QPVAPQPQYQQPQQPVAPQPQYQQPQ----QPVAPQPQYQQPQ- 821
                           90       100       110       120
                   ....*....|....*....|....*....|....*....|.
gi 1720384125 1130 sqstqckahkahqsQPKPFQPRPTQPKSSKTKPSQARAFHP 1170
Cdd:PRK10263   822 --------------QPVAPQPQYQQPQQPVAPQPQDTLLHP 848
PspC_subgroup_2 NF033839
pneumococcal surface protein PspC, LPXTG-anchored form; The pneumococcal surface protein PspC, ...
947-1167 4.46e-04

pneumococcal surface protein PspC, LPXTG-anchored form; The pneumococcal surface protein PspC, as described in Streptococcus pneumoniae, is a repetitive and highly variable protein, recognized by a conserved N-terminal domain and also by genomic location. This form, subgroup 2, is anchored covalently after cleavage by sortase at a C-terminal LPXTG site. The other form, subgroup 1, has variable numbers of a choline-binding repeat in the C-terminal region, and is also known as choline-binding protein A.


Pssm-ID: 468202 [Multi-domain]  Cd Length: 557  Bit Score: 44.37  E-value: 4.46e-04
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720384125  947 QPQPFQPVPSQKKPSHSRPSQAKPPHLDPShanlTQGQPSQATPTHSQASQAKPTHSQANshhPHPSHAKPSHQnpshan 1026
Cdd:NF033839   300 QPSPQPEKKEVKPEPETPKPEVKPQLEKPK----PEVKPQPEKPKPEVKPQLETPKPEVK---PQPEKPKPEVK------ 366
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720384125 1027 pthPQSSHAKPS-HPQSSHAKPS-HPQSSHAKPS-HPQSSHAKPS-HPQSSQAKPS-HPQSSQAKP-THPQSSQANSH-H 1099
Cdd:NF033839   367 ---PQPEKPKPEvKPQPETPKPEvKPQPEKPKPEvKPQPEKPKPEvKPQPEKPKPEvKPQPEKPKPeVKPQPEKPKPEvK 443
                          170       180       190       200       210       220
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....
gi 1720384125 1100 PQASQAKPSHPQSSHAKPSHPHPSHAKPSPSQSTQCKAHKAHQSQPKPFQPRPTQPKS-SKTKPSQARA 1167
Cdd:NF033839   444 PQPEKPKPEVKPQPETPKPEVKPQPEKPKPEVKPQPEKPKPDNSKPQADDKKPSTPNNlSKDKQPSNQA 512
PRK11901 PRK11901
hypothetical protein; Reviewed
981-1163 4.51e-04

hypothetical protein; Reviewed


Pssm-ID: 237015 [Multi-domain]  Cd Length: 327  Bit Score: 43.90  E-value: 4.51e-04
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720384125  981 TQGQPSQATPTHSQA-----SQAKPTHSQANSHHPHPSHAKPSHQNPSHANPTHPQSSHAKPSHPQSSHAKPSHPQSSHA 1055
Cdd:PRK11901    62 TEHESQQSSNNAGAEknidlSGSSSLSSGNQSSPSAANNTSDGHDASGVKNTAPPQDISAPPISPTPTQAAPPQTPNGQQ 141
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720384125 1056 KPSHPQSSHAKPSHPQS--SQAKPSHPQSSQAKPTHPQSsQANSHHPQASQAKPSHPQSshakpsHPHPSHAKPSPSQST 1133
Cdd:PRK11901   142 RIELPGNISDALSQQQGqvNAASQNAQGNTSTLPTAPAT-VAPSKGAKVPATAETHPTP------PQKPATKKPAVNHHK 214
                          170       180       190
                   ....*....|....*....|....*....|
gi 1720384125 1134 QCKAHKAHQSQPKPFQPRPTQPKSSKTKPS 1163
Cdd:PRK11901   215 TATVAVPPATSGKPKSGAASARALSSAPAS 244
PRK10263 PRK10263
DNA translocase FtsK; Provisional
1008-1170 5.33e-04

DNA translocase FtsK; Provisional


Pssm-ID: 236669 [Multi-domain]  Cd Length: 1355  Bit Score: 44.31  E-value: 5.33e-04
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720384125 1008 HHPHPSHAKPSHQNPSHANPTHPQSSHAKPSHPQSSHAKPSHPQSSHAKPSHPQSSHAKPSHPQSSQAKPSHPQSSQAKP 1087
Cdd:PRK10263   367 QTGEPVIAPAPEGYPQQSQYAQPAVQYNEPLQQPVQPQQPYYAPAAEQPAQQPYYAPAPEQPAQQPYYAPAPEQPVAGNA 446
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720384125 1088 THPQSSQANSHHPQASQAKPSHPQSSHAKPSHPHPSHAKPSPSQSTQCKAHKAHQSQPKPFQPRPTQPKSSKTKPSQARA 1167
Cdd:PRK10263   447 WQAEEQQSTFAPQSTYQTEQTYQQPAAQEPLYQQPQPVEQQPVVEPEPVVEETKPARPPLYYFEEVEEKRAREREQLAAW 526

                   ...
gi 1720384125 1168 FHP 1170
Cdd:PRK10263   527 YQP 529
PRK07994 PRK07994
DNA polymerase III subunits gamma and tau; Validated
1029-1167 5.59e-04

DNA polymerase III subunits gamma and tau; Validated


Pssm-ID: 236138 [Multi-domain]  Cd Length: 647  Bit Score: 44.09  E-value: 5.59e-04
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720384125 1029 HPQSSHAKPSHPQSSHAKPSHPQSSHAKPSHPQSSHAKPSHPQSSQAKPSHPQSSQAKPTHPQSSQANSHHPQASQAKPS 1108
Cdd:PRK07994   360 HPAAPLPEPEVPPQSAAPAASAQATAAPTAAVAPPQAPAVPPPPASAPQQAPAVPLPETTSQLLAARQQLQRAQGATKAK 439
                           90       100       110       120       130       140
                   ....*....|....*....|....*....|....*....|....*....|....*....|...
gi 1720384125 1109 HPQSSHAKPSHPHPSHAKPSPSQSTQCKAHKAHQSQPKPFQPRPTQP----KSSKTKPSQARA 1167
Cdd:PRK07994   440 KSEPAAASRARPVNSALERLASVRPAPSALEKAPAKKEAYRWKATNPvevkKEPVATPKALKK 502
PHA03269 PHA03269
envelope glycoprotein C; Provisional
961-1094 5.96e-04

envelope glycoprotein C; Provisional


Pssm-ID: 165527 [Multi-domain]  Cd Length: 566  Bit Score: 43.95  E-value: 5.96e-04
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720384125  961 SHSRPSQAKPPHLDPSHANLTQGQPSQA-TPTHSQASQAKPTHSQANSHHPHPSHAKPSHQNPShanpthpqsshakpsh 1039
Cdd:PHA03269    46 PHQAASRAPDPAVAPTSAASRKPDLAQApTPAASEKFDPAPAPHQAASRAPDPAVAPQLAAAPK---------------- 109
                           90       100       110       120       130
                   ....*....|....*....|....*....|....*....|....*....|....*
gi 1720384125 1040 PQSSHAKPSHPQSSHAKPSHPQSSHAKPSHPqssqakPSHPQSSQAKPTHPQSSQ 1094
Cdd:PHA03269   110 PDAAEAFTSAAQAHEAPADAGTSAASKKPDP------AAHTQHSPPPFAYTRSME 158
PRK14971 PRK14971
DNA polymerase III subunit gamma/tau;
1025-1131 6.84e-04

DNA polymerase III subunit gamma/tau;


Pssm-ID: 237874 [Multi-domain]  Cd Length: 614  Bit Score: 43.61  E-value: 6.84e-04
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720384125 1025 ANPTHPQSSHAKPSHPQSsHAKPSHPQSshAKPSHPQSSHAKPSHPQSSQAKPSHPQSSQAKPTHPQSSQANSHHPQASQ 1104
Cdd:PRK14971   360 AQLTQKGDDASGGRGPKQ-HIKPVFTQP--AAAPQPSAAAAASPSPSQSSAAAQPSAPQSATQPAGTPPTVSVDPPAAVP 436
                           90       100
                   ....*....|....*....|....*..
gi 1720384125 1105 AKPSHPQSSHAKPShPHPSHAKPSPSQ 1131
Cdd:PRK14971   437 VNPPSTAPQAVRPA-QFKEEKKIPVSK 462
PTZ00449 PTZ00449
104 kDa microneme/rhoptry antigen; Provisional
956-1170 8.10e-04

104 kDa microneme/rhoptry antigen; Provisional


Pssm-ID: 185628 [Multi-domain]  Cd Length: 943  Bit Score: 43.91  E-value: 8.10e-04
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720384125  956 SQKKPSHSRPSQAKPPHLDPSHANLTQGQPSQaTPTHSQasqaKPTHSQANSHHPHPSHAKPSHQNPSHANPTHPQSsha 1035
Cdd:PTZ00449   540 SDEPKEGGKPGETKEGEVGKKPGPAKEHKPSK-IPTLSK----KPEFPKDPKHPKDPEEPKKPKRPRSAQRPTRPKS--- 611
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720384125 1036 kPSHPQS-----SHAKPSHPQSSHAKPShPQSShAKPSHPQSSQA-KPSHPQSSQAKPTHPQSSQA--NSHHPQASQAKP 1107
Cdd:PTZ00449   612 -PKLPELldipkSPKRPESPKSPKRPPP-PQRP-SSPERPEGPKIiKSPKPPKSPKPPFDPKFKEKfyDDYLDAAAKSKE 688
                          170       180       190       200       210       220
                   ....*....|....*....|....*....|....*....|....*....|....*....|....
gi 1720384125 1108 ShpqsshakPSHPHPSHAKPSPSQSTQCKAHKAHQSQPKPFQP-RPTQPKSSKTKPSQARAFHP 1170
Cdd:PTZ00449   689 T--------KTTVVLDESFESILKETLPETPGTPFTTPRPLPPkLPRDEEFPFEPIGDPDAEQP 744
PLN03209 PLN03209
translocon at the inner envelope of chloroplast subunit 62; Provisional
965-1167 9.70e-04

translocon at the inner envelope of chloroplast subunit 62; Provisional


Pssm-ID: 178748 [Multi-domain]  Cd Length: 576  Bit Score: 43.38  E-value: 9.70e-04
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720384125  965 PSQAKPPHlDPSHANLTQGQPSQATPTHSQASQAKPTHSQANSHHPHPSHAKPSHQN---PSHANPTHPQSSHAKPShPQ 1041
Cdd:PLN03209   324 PSQRVPPK-ESDAADGPKPVPTKPVTPEAPSPPIEEEPPQPKAVVPRPLSPYTAYEDlkpPTSPIPTPPSSSPASSK-SV 401
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720384125 1042 SSHAKPSHPQSSHAKPSHPQSSHAKPSHPQSSQAKPSHPQSSQAK---PTHPQSSQANSHHPQASQAkPSHPQSSHAKPS 1118
Cdd:PLN03209   402 DAVAKPAEPDVVPSPGSASNVPEVEPAQVEAKKTRPLSPYARYEDlkpPTSPSPTAPTGVSPSVSST-SSVPAVPDTAPA 480
                          170       180       190       200
                   ....*....|....*....|....*....|....*....|....*....
gi 1720384125 1119 HPHPSHAKPSPSQSTQCKAHKAHQSQPKPFQPRPTQPKSSKTKPSQARA 1167
Cdd:PLN03209   481 TAATDAAAPPPANMRPLSPYAVYDDLKPPTSPSPAAPVGKVAPSSTNEV 529
PTZ00449 PTZ00449
104 kDa microneme/rhoptry antigen; Provisional
1047-1175 1.04e-03

104 kDa microneme/rhoptry antigen; Provisional


Pssm-ID: 185628 [Multi-domain]  Cd Length: 943  Bit Score: 43.52  E-value: 1.04e-03
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720384125 1047 PSHPQSSHAKPSHPQSSHAKPSHPQSSqaKPSHPQSSQAKPTHPQSSQANSHHPQASQAKPSHPQSSHAKPSHPHPSHAK 1126
Cdd:PTZ00449   511 PEGPEASGLPPKAPGDKEGEEGEHEDS--KESDEPKEGGKPGETKEGEVGKKPGPAKEHKPSKIPTLSKKPEFPKDPKHP 588
                           90       100       110       120
                   ....*....|....*....|....*....|....*....|....*....
gi 1720384125 1127 PSPSQSTQCKAHKAHQSQPKPfqPRPTQPKSSKTKPSQARAFHPRAGRR 1175
Cdd:PTZ00449   589 KDPEEPKKPKRPRSAQRPTRP--KSPKLPELLDIPKSPKRPESPKSPKR 635
PRK07764 PRK07764
DNA polymerase III subunits gamma and tau; Validated
880-1094 1.19e-03

DNA polymerase III subunits gamma and tau; Validated


Pssm-ID: 236090 [Multi-domain]  Cd Length: 824  Bit Score: 43.05  E-value: 1.19e-03
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720384125  880 PHPQPCQPAGASQErimpvshqgaqqtTQGRPADFAFKPGSQSTSGSKLSSTSQSSAHQPKFQSKHFQPQPFQPVPSQKK 959
Cdd:PRK07764   590 PAPGAAGGEGPPAP-------------ASSGPPEEAARPAAPAAPAAPAAPAPAGAAAAPAEASAAPAPGVAAPEHHPKH 656
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720384125  960 PSHSRPSQAKpphlDPSHANLTQGQPSQATPTHSQASQAKPTHSQANSHHPHPSHAKPSHQNPSHANPTHPQSSHAKPSH 1039
Cdd:PRK07764   657 VAVPDASDGG----DGWPAKAGGAAPAAPPPAPAPAAPAAPAGAAPAQPAPAPAATPPAGQADDPAAQPPQAAQGASAPS 732
                          170       180       190       200       210
                   ....*....|....*....|....*....|....*....|....*....|....*
gi 1720384125 1040 PQSSHAKPSHPqsshAKPSHPQSSHAKPSHPQSSQAKPSHPQSSQAKPTHPQSSQ 1094
Cdd:PRK07764   733 PAADDPVPLPP----EPDDPPDPAGAPAQPPPPPAPAPAAAPAAAPPPSPPSEEE 783
Atrophin-1 pfam03154
Atrophin-1 family; Atrophin-1 is the protein product of the dentatorubral-pallidoluysian ...
1020-1174 1.55e-03

Atrophin-1 family; Atrophin-1 is the protein product of the dentatorubral-pallidoluysian atrophy (DRPLA) gene. DRPLA OMIM:125370 is a progressive neurodegenerative disorder. It is caused by the expansion of a CAG repeat in the DRPLA gene on chromosome 12p. This results in an extended polyglutamine region in atrophin-1, that is thought to confer toxicity to the protein, possibly through altering its interactions with other proteins. The expansion of a CAG repeat is also the underlying defect in six other neurodegenerative disorders, including Huntington's disease. One interaction of expanded polyglutamine repeats that is thought to be pathogenic is that with the short glutamine repeat in the transcriptional coactivator CREB binding protein, CBP. This interaction draws CBP away from its usual nuclear location to the expanded polyglutamine repeat protein aggregates that are characteriztic of the polyglutamine neurodegenerative disorders. This interferes with CBP-mediated transcription and causes cytotoxicity.


Pssm-ID: 460830 [Multi-domain]  Cd Length: 991  Bit Score: 42.83  E-value: 1.55e-03
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720384125 1020 QNPSHANPTHPQSSHAKPSHPQSSHAKPSHPQSSHAKPSHPQSSHAKPSHPQSSQAKPsHPQSSQAKPTHPQssQANSHH 1099
Cdd:pfam03154  170 QPPVLQAQSGAASPPSPPPPGTTQAATAGPTPSAPSVPPQGSPATSQPPNQTQSTAAP-HTLIQQTPTLHPQ--RLPSPH 246
                           90       100       110       120       130       140       150
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*...
gi 1720384125 1100 PQASQAKPSHPQSSHAKPSHPHPSHAKPSPSQSTQCKA---HKAHQSQPKPFQPRPTQPKSSKTKPSQARAFHPRAGR 1174
Cdd:pfam03154  247 PPLQPMTQPPPPSQVSPQPLPQPSLHGQMPPMPHSLQTgpsHMQHPVPPQPFPLTPQSSQSQVPPGPSPAAPGQSQQR 324
ROM1 COG5422
RhoGEF, Guanine nucleotide exchange factor for Rho/Rac/Cdc42-like GTPases [Signal transduction ...
939-1170 1.95e-03

RhoGEF, Guanine nucleotide exchange factor for Rho/Rac/Cdc42-like GTPases [Signal transduction mechanisms];


Pssm-ID: 227709 [Multi-domain]  Cd Length: 1175  Bit Score: 42.57  E-value: 1.95e-03
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720384125  939 PKFQSKhfQPQPFQPVPSQKKPSHSRPSQAKPPHLDPSHANLTQGQPSQATPTHSQASQAKP---THSQANSHHPHPSHA 1015
Cdd:COG5422     24 DAFVSK--QLLPPRRLQRKLNPISIRNGADNDIINSESKESFGKYALGHQIFSSFSSSPKLFqrrNSAGPITHSPSATSS 101
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720384125 1016 KPS-------HQNPSHA----NPTHPQSS------HAKPSHPQSSHAKPSHPQSSHAKP---SHPQSSHAKPSHPQSSQA 1075
Cdd:COG5422    102 TSSlnsndgdQFSPASDslsfNPSSTQSRkdsgpgDGSPVQKRKNPLLPSSSTHGTHPPivfTDNNGSHAGAPNARSRKE 181
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720384125 1076 KPSH-PQSSQAKPTHPQSSQANSHHPQASQAKPSHPQSSHAKPSHPHPSHAKPSPSQSTQCKAHKAHQSQPKPFQPRPTQ 1154
Cdd:COG5422    182 IPSLgSQSMQLPSPHFRQKFSSSDTSNGFSYPSIRKNSRHSSNSMPSFPHSSTAVLLKRHSGSSGASLISSNITPSSSNS 261
                          250
                   ....*....|....*.
gi 1720384125 1155 PKSSKTkpSQARAFHP 1170
Cdd:COG5422    262 EAMSTS--SKRPYIYP 275
KLF1_2_4_N-like cd22056
N-terminal domain of Kruppel-like factors with similarity to the N-terminal domains of ...
1011-1124 1.95e-03

N-terminal domain of Kruppel-like factors with similarity to the N-terminal domains of Kruppel-like factor (KLF)1, KLF2, and KLF4; Kruppel/Krueppel-like transcription factors (KLFs) belong to a family of proteins called the Specificity Protein (SP)/KLF family, characterized by a C-terminal DNA-binding domain of 81 amino acids consisting of three Kruppel-like C2H2 zinc fingers. These factors bind to a loose consensus motif, namely NNRCRCCYY (where N is any nucleotide; R is A/G, and Y is C/T), such as the recurring motifs in GC and GT boxes (5'-GGGGCGGGG-3' and 5-GGTGTGGGG-3') that are present in promoters and more distal regulatory elements of mammalian genes. Members of the KLF family can act as activators or repressors of transcription depending on cell and promoter context. KLFs regulate various cellular functions, such as proliferation, differentiation, and apoptosis, as well as the development and homeostasis of several types of tissue. In addition to the C-terminal DNA-binding domain, each KLF also has a unique N-terminal activation/repression domain that confers specifity and allows it to bind specifically to a certain partner, leading to distinct activities in vivo. This model represents the N-terminal domains of an unknown subfamily of KLFs, predominantly found in fish, related to the N-terminal domains of KLF1, KLF2, and KLF4.


Pssm-ID: 409231 [Multi-domain]  Cd Length: 339  Bit Score: 41.95  E-value: 1.95e-03
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720384125 1011 HPSHAKPSHQnpshANPTHPQSSHAKPSHPQSSHAKPSHPQSSHAkPSHPQSSHAKPSHPQSSQAKPSHPQSSQAKPTHP 1090
Cdd:cd22056    203 FMGQQKPKHQ----MHSVHPQAFTHHQAAGPGALQGRGGRGGPDC-HLLHSSHHHHHHHHLQYQYMNAPYPPHYAHQGAP 277
                           90       100       110
                   ....*....|....*....|....*....|....
gi 1720384125 1091 QSSQANSHHPQASQAKPSHPQSSHAKPShPHPSH 1124
Cdd:cd22056    278 QFHGQYSVFREPMRVHHQGHPGSMLTPP-SSPPL 310
PLN03209 PLN03209
translocon at the inner envelope of chloroplast subunit 62; Provisional
936-1155 2.15e-03

translocon at the inner envelope of chloroplast subunit 62; Provisional


Pssm-ID: 178748 [Multi-domain]  Cd Length: 576  Bit Score: 42.22  E-value: 2.15e-03
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720384125  936 AHQPKFQSKHFQPQPFQPVPSQKKPShsRPSQAKPPHLDPSHANLTQGQPSQATPTHSQASQAKPTHSQANSHhPHPSHA 1015
Cdd:PLN03209   336 ADGPKPVPTKPVTPEAPSPPIEEEPP--QPKAVVPRPLSPYTAYEDLKPPTSPIPTPPSSSPASSKSVDAVAK-PAEPDV 412
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720384125 1016 KPSHQNPSHANPTHPQSSHAKPSHPQSSHA-----KPshpqsshakPSHPQSSHAKPSHPQSSQAkPSHPQSSQAKPthP 1090
Cdd:PLN03209   413 VPSPGSASNVPEVEPAQVEAKKTRPLSPYAryedlKP---------PTSPSPTAPTGVSPSVSST-SSVPAVPDTAP--A 480
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720384125 1091 QSSQANSHHPQA------------SQAKPSHPQSSHAKPSHPHPSHAKPSPSQST---QCKAHKAHQSQPKpfqPRPTQP 1155
Cdd:PLN03209   481 TAATDAAAPPPAnmrplspyavydDLKPPTSPSPAAPVGKVAPSSTNEVVKVGNSappTALADEQHHAQPK---PRPLSP 557
KLF1_2_4_N-like cd22056
N-terminal domain of Kruppel-like factors with similarity to the N-terminal domains of ...
967-1073 2.17e-03

N-terminal domain of Kruppel-like factors with similarity to the N-terminal domains of Kruppel-like factor (KLF)1, KLF2, and KLF4; Kruppel/Krueppel-like transcription factors (KLFs) belong to a family of proteins called the Specificity Protein (SP)/KLF family, characterized by a C-terminal DNA-binding domain of 81 amino acids consisting of three Kruppel-like C2H2 zinc fingers. These factors bind to a loose consensus motif, namely NNRCRCCYY (where N is any nucleotide; R is A/G, and Y is C/T), such as the recurring motifs in GC and GT boxes (5'-GGGGCGGGG-3' and 5-GGTGTGGGG-3') that are present in promoters and more distal regulatory elements of mammalian genes. Members of the KLF family can act as activators or repressors of transcription depending on cell and promoter context. KLFs regulate various cellular functions, such as proliferation, differentiation, and apoptosis, as well as the development and homeostasis of several types of tissue. In addition to the C-terminal DNA-binding domain, each KLF also has a unique N-terminal activation/repression domain that confers specifity and allows it to bind specifically to a certain partner, leading to distinct activities in vivo. This model represents the N-terminal domains of an unknown subfamily of KLFs, predominantly found in fish, related to the N-terminal domains of KLF1, KLF2, and KLF4.


Pssm-ID: 409231 [Multi-domain]  Cd Length: 339  Bit Score: 41.57  E-value: 2.17e-03
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720384125  967 QAKPPHLDPSHANLTQGQPSQATPTHSQASQA--KPTHSQANSHHPHPSHakPSHQNPSHANPTHPQssHAKPSHPQ--- 1041
Cdd:cd22056    206 QQKPKHQMHSVHPQAFTHHQAAGPGALQGRGGrgGPDCHLLHSSHHHHHH--HHLQYQYMNAPYPPH--YAHQGAPQfhg 281
                           90       100       110
                   ....*....|....*....|....*....|....
gi 1720384125 1042 --SSHAKPSHPQSSHakpsHPQSSHAKPSHPQSS 1073
Cdd:cd22056    282 qySVFREPMRVHHQG----HPGSMLTPPSSPPLL 311
PRK14951 PRK14951
DNA polymerase III subunits gamma and tau; Provisional
1060-1175 2.42e-03

DNA polymerase III subunits gamma and tau; Provisional


Pssm-ID: 237865 [Multi-domain]  Cd Length: 618  Bit Score: 42.01  E-value: 2.42e-03
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720384125 1060 PQSSHAKPSHPQSSQAKPSHPQSSQAKPTHPQSSQANSHHPQASQAKPSHPQSSHAKPSHPHPSHAKPSPSqSTQCKAHK 1139
Cdd:PRK14951   373 AAPAEKKTPARPEAAAPAAAPVAQAAAAPAPAAAPAAAASAPAAPPAAAPPAPVAAPAAAAPAAAPAAAPA-AVALAPAP 451
                           90       100       110
                   ....*....|....*....|....*....|....*..
gi 1720384125 1140 AHQSQPKPFQ-PRPTQPKSSKTKPSQARAFHPRAGRR 1175
Cdd:PRK14951   452 PAQAAPETVAiPVRVAPEPAVASAAPAPAAAPAAARL 488
PRK07764 PRK07764
DNA polymerase III subunits gamma and tau; Validated
865-1051 2.51e-03

DNA polymerase III subunits gamma and tau; Validated


Pssm-ID: 236090 [Multi-domain]  Cd Length: 824  Bit Score: 41.90  E-value: 2.51e-03
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720384125  865 KAAGKSIRTLPHIKYPHPQPCQPAGASQERIMPVSHQGAQQTTQGRPADFAFKPGSQSTSGSKLSSTSQSSAHQPKFQSK 944
Cdd:PRK07764   593 GAAGGEGPPAPASSGPPEEAARPAAPAAPAAPAAPAPAGAAAAPAEASAAPAPGVAAPEHHPKHVAVPDASDGGDGWPAK 672
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720384125  945 HFQPQPFQPVPSQKKPSHSRPSQAKPPHLDPSHANLTQGQPSQATPTHSQASQAKPTHSQANSHHPHPSHAKPS-HQNPS 1023
Cdd:PRK07764   673 AGGAAPAAPPPAPAPAAPAAPAGAAPAQPAPAPAATPPAGQADDPAAQPPQAAQGASAPSPAADDPVPLPPEPDdPPDPA 752
                          170       180
                   ....*....|....*....|....*...
gi 1720384125 1024 HANPTHPQSSHAKPSHPQSSHAKPSHPQ 1051
Cdd:PRK07764   753 GAPAQPPPPPAPAPAAAPAAAPPPSPPS 780
PRK11901 PRK11901
hypothetical protein; Reviewed
1031-1170 2.88e-03

hypothetical protein; Reviewed


Pssm-ID: 237015 [Multi-domain]  Cd Length: 327  Bit Score: 41.21  E-value: 2.88e-03
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720384125 1031 QSSHAKPSHPQSSHAKPSHPQSSHAKPSHPQSSHAKPSHPQSSQAKPSHPQSSQAKPTHP---------QSSQANsHHPQ 1101
Cdd:PRK11901    87 LSSGNQSSPSAANNTSDGHDASGVKNTAPPQDISAPPISPTPTQAAPPQTPNGQQRIELPgnisdalsqQQGQVN-AASQ 165
                           90       100       110       120       130       140       150
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*..
gi 1720384125 1102 ASQAKPSH--------PQSSHAKPSHPHPSHAKPSPSQSTQCKAHKAHQSQPKPFQPRPTQPKSSKTKPSQARAFHP 1170
Cdd:PRK11901   166 NAQGNTSTlptapatvAPSKGAKVPATAETHPTPPQKPATKKPAVNHHKTATVAVPPATSGKPKSGAASARALSSAP 242
CARD_NOD1_CARD4 cd08324
Caspase activation and recruitment domain similar to that found in NOD1; Caspase activation ...
29-79 3.47e-03

Caspase activation and recruitment domain similar to that found in NOD1; Caspase activation and recruitment domain (CARD) found in human NOD1 (CARD4) and similar proteins. NOD1 is a member of the Nod-like receptor (NLR) family, which plays a central role in the innate immune response. NLRs typically contain an N-terminal effector domain, a central nucleotide-binding domain and a C-terminal ligand-binding region of several leucine-rich repeats (LRRs). In NOD1, as well as NOD2, the N-terminal effector domain is a CARD. Nod1-CARD has been shown to interact with the CARD domain of the downstream effector RICK (RIP2, CARDIAK), a serine/threonine kinase. In general, CARDs are death domains (DDs) found associated with caspases. They are known to be important in the signaling pathways for apoptosis, inflammation, and host-defense mechanisms. DDs are protein-protein interaction domains found in a variety of domain architectures. Their common feature is that they form homodimers by self-association or heterodimers by associating with other members of the DD superfamily including PYRIN and DED (Death Effector Domain). They serve as adaptors in signaling pathways and can recruit other proteins into signaling complexes.


Pssm-ID: 260035  Cd Length: 85  Bit Score: 37.84  E-value: 3.47e-03
                           10        20        30        40        50
                   ....*....|....*....|....*....|....*....|....*....|.
gi 1720384125   29 ILDTLTSRSLISEKEYETLEEITDPLKKSRKLLILIQKKGEDSCRRFLRCL 79
Cdd:cd08324     20 LLDNLLKNGYFSTEDAEIVQRCPTQTDKVRKILDLVQSKGEEVSEFFIYIL 70
Pneumo_att_G pfam05539
Pneumovirinae attachment membrane glycoprotein G;
993-1163 3.59e-03

Pneumovirinae attachment membrane glycoprotein G;


Pssm-ID: 114270 [Multi-domain]  Cd Length: 408  Bit Score: 41.19  E-value: 3.59e-03
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720384125  993 SQASQAKPTHSQANSHHPhpshAKPSHQNP-SHANPTHPQSSHAKpshpQSSHAKPSHPQSSHAKPSHPQSSHAKPSHPQ 1071
Cdd:pfam05539  170 TAVTTSKTTSWPTEVSHP----TYPSQVTPqSQPATQGHQTATAN----QRLSSTEPVGTQGTTTSSNPEPQTEPPPSQR 241
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720384125 1072 SSQAKPSHPQSsqakpTHPQSSQANSHHPQASQAKPSHPQSSHAKPSHPHPSHAKPSPSQSTQCKAHKAHQSQPKPFQPR 1151
Cdd:pfam05539  242 GPSGSPQHPPS-----TTSQDQSTTGDGQEHTQRRKTPPATSNRRSPHSTATPPPTTKRQETGRPTPRPTATTQSGSSPP 316
                          170
                   ....*....|..
gi 1720384125 1152 PTQPKSSKTKPS 1163
Cdd:pfam05539  317 HSSPPGVQANPT 328
KLF1_2_4_N-like cd22056
N-terminal domain of Kruppel-like factors with similarity to the N-terminal domains of ...
1054-1152 3.79e-03

N-terminal domain of Kruppel-like factors with similarity to the N-terminal domains of Kruppel-like factor (KLF)1, KLF2, and KLF4; Kruppel/Krueppel-like transcription factors (KLFs) belong to a family of proteins called the Specificity Protein (SP)/KLF family, characterized by a C-terminal DNA-binding domain of 81 amino acids consisting of three Kruppel-like C2H2 zinc fingers. These factors bind to a loose consensus motif, namely NNRCRCCYY (where N is any nucleotide; R is A/G, and Y is C/T), such as the recurring motifs in GC and GT boxes (5'-GGGGCGGGG-3' and 5-GGTGTGGGG-3') that are present in promoters and more distal regulatory elements of mammalian genes. Members of the KLF family can act as activators or repressors of transcription depending on cell and promoter context. KLFs regulate various cellular functions, such as proliferation, differentiation, and apoptosis, as well as the development and homeostasis of several types of tissue. In addition to the C-terminal DNA-binding domain, each KLF also has a unique N-terminal activation/repression domain that confers specifity and allows it to bind specifically to a certain partner, leading to distinct activities in vivo. This model represents the N-terminal domains of an unknown subfamily of KLFs, predominantly found in fish, related to the N-terminal domains of KLF1, KLF2, and KLF4.


Pssm-ID: 409231 [Multi-domain]  Cd Length: 339  Bit Score: 40.80  E-value: 3.79e-03
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720384125 1054 HAKPSHPQSSHAKPSHPQSSQAKPSHPQSSQAKPTHPQSSQANSHHPQasqakPSHPQSSHAKPSHPHPSHAKPSPSQ-- 1131
Cdd:cd22056    206 QQKPKHQMHSVHPQAFTHHQAAGPGALQGRGGRGGPDCHLLHSSHHHH-----HHHHLQYQYMNAPYPPHYAHQGAPQfh 280
                           90       100
                   ....*....|....*....|....
gi 1720384125 1132 ---STQCKAHKAHQSQPKPFQPRP 1152
Cdd:cd22056    281 gqySVFREPMRVHHQGHPGSMLTP 304
PRK11901 PRK11901
hypothetical protein; Reviewed
955-1145 4.74e-03

hypothetical protein; Reviewed


Pssm-ID: 237015 [Multi-domain]  Cd Length: 327  Bit Score: 40.44  E-value: 4.74e-03
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720384125  955 PSQKKPSHSRPSQ--AKPPHLDPSHANLTQGQPSQATPTHSQASQAKPTHSQANShhPHPSHAKPSHQNPSHANPTHPQS 1032
Cdd:PRK11901    61 PTEHESQQSSNNAgaEKNIDLSGSSSLSSGNQSSPSAANNTSDGHDASGVKNTAP--PQDISAPPISPTPTQAAPPQTPN 138
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720384125 1033 SHAKPSHPQSSHAKPSHPQSshakpshpQSSHAKPSHPQSSQAKPSHPQSSQAKPTHPQSSQANSH---HPQASQAKPsh 1109
Cdd:PRK11901   139 GQQRIELPGNISDALSQQQG--------QVNAASQNAQGNTSTLPTAPATVAPSKGAKVPATAETHptpPQKPATKKP-- 208
                          170       180       190
                   ....*....|....*....|....*....|....*.
gi 1720384125 1110 PQSSHAKPSHPHPSHAKPSPSQSTqcKAHKAHQSQP 1145
Cdd:PRK11901   209 AVNHHKTATVAVPPATSGKPKSGA--ASARALSSAP 242
PHA03307 PHA03307
transcriptional regulator ICP4; Provisional
696-1132 4.78e-03

transcriptional regulator ICP4; Provisional


Pssm-ID: 223039 [Multi-domain]  Cd Length: 1352  Bit Score: 41.31  E-value: 4.78e-03
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720384125  696 EDSQNAVIFHQTPVFMPYPAHPWPLpieAGSNFYHVPLRAPRAISSHFRSQQKAEWFFPFPHQNTSVHSRGQNFAIKYLQ 775
Cdd:PHA03307    12 EAAAEGGEFFPRPPATPGDAADDLL---SGSQGQLVSDSAELAAVTVVAGAAACDRFEPPTGPPPGPGTEAPANESRSTP 88
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720384125  776 PWRFYSRERFT---RCSATPQQYHPNGPFGRSQRQASPVQTHPKSRQMSRtleRSGTVVSRVGHGRSLGSQARRAAGkPQ 852
Cdd:PHA03307    89 TWSLSTLAPASparEGSPTPPGPSSPDPPPPTPPPASPPPSPAPDLSEML---RPVGSPGPPPAASPPAAGASPAAV-AS 164
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720384125  853 PEKACAQGLQLTKAAGKSIRTLPHIKYPHPQPCQPAGAS---QERIMPVSHqGAQQTTQGRPADFAFKPGSQSTSGSKLS 929
Cdd:PHA03307   165 DAASSRQAALPLSSPEETARAPSSPPAEPPPSTPPAAASprpPRRSSPISA-SASSPAPAPGRSAADDAGASSSDSSSSE 243
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720384125  930 STSQSSAHQPKFQSKHFQPQPFQPVPSQKKPSHSRP-------SQAKPPHLDPSHANLTQGQPSQATPTHSQASQAKPTH 1002
Cdd:PHA03307   244 SSGCGWGPENECPLPRPAPITLPTRIWEASGWNGPSsrpgpasSSSSPRERSPSPSPSSPGSGPAPSSPRASSSSSSSRE 323
                          330       340       350       360       370       380       390       400
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720384125 1003 S--QANSHHPHPSHAKPSHQNPSHANPthpqsshakpshpqsshAKPSHPQSSHAKPSHPQSSHAKPSHPQSSQAKPSHP 1080
Cdd:PHA03307   324 SssSSTSSSSESSRGAAVSPGPSPSRS-----------------PSPSRPPPPADPSSPRKRPRPSRAPSSPAASAGRPT 386
                          410       420       430       440       450
                   ....*....|....*....|....*....|....*....|....*....|..
gi 1720384125 1081 QSSQAKPTHPQSSQANSHHPQASQAKPSHPQSSHAKPSHPHPSHAKPSPSQS 1132
Cdd:PHA03307   387 RRRARAAVAGRARRRDATGRFPAGRPRPSPLDAGAASGAFYARYPLLTPSGE 438
PRK10263 PRK10263
DNA translocase FtsK; Provisional
878-1063 4.92e-03

DNA translocase FtsK; Provisional


Pssm-ID: 236669 [Multi-domain]  Cd Length: 1355  Bit Score: 41.22  E-value: 4.92e-03
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720384125  878 KYPHPQPCQPAGASQERIMPVSHQGA---QQTTQG-RPA--------DFAFKP-------GSQSTSGSKLSSTSQSSAHQ 938
Cdd:PRK10263   679 QYQHDVPVNAEDADAAAEAELARQFAqtqQQRYSGeQPAganpfsldDFEFSPmkallddGPHEPLFTPIVEPVQQPQQP 758
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720384125  939 PKFQSKHFQPQpfQPVPSQKKpsHSRPSQAKPPhldpSHANLTQGQPSQATPTHSQASQakPTHSQANSHHPH-PSHAKP 1017
Cdd:PRK10263   759 VAPQQQYQQPQ--QPVAPQPQ--YQQPQQPVAP----QPQYQQPQQPVAPQPQYQQPQQ--PVAPQPQYQQPQqPVAPQP 828
                          170       180       190       200
                   ....*....|....*....|....*....|....*....|....*.
gi 1720384125 1018 SHQNPSHANPTHPQSSHAKPSHPQSSHAKPSHpqsshaKPSHPQSS 1063
Cdd:PRK10263   829 QYQQPQQPVAPQPQDTLLHPLLMRNGDSRPLH------KPTTPLPS 868
PHA02666 PHA02666
hypothetical protein; Provisional
957-1128 5.72e-03

hypothetical protein; Provisional


Pssm-ID: 222914 [Multi-domain]  Cd Length: 287  Bit Score: 40.30  E-value: 5.72e-03
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720384125  957 QKKPSHSRPSQAKPPHLDPSHANLTQGQPSQATPTHSQASQAKPTHSQANSHHPHPSHAKPSHQNPSHANPTHPQSSHAK 1036
Cdd:PHA02666    43 KSRPSRQHRSAERTPTTASSLTHENNTAPSRHGKQHSCKASSRSSHNRGSTSSSHNHHAHRGPHQSAHRRSKHDAVRDTY 122
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720384125 1037 PSHPQSshakPSHPQSSHAKPSHPQSSHAKPSH----PQSSQAKPSHPQSSQAKPthpqssqaNSHHPQASQAKPSHPQS 1112
Cdd:PHA02666   123 QPCPQS----PETDLYKGRLPGETERHYETPDHiydvPEDVRCAAVEPRRDLALP--------PLHIPSSKPARRMRPGS 190
                          170
                   ....*....|....*.
gi 1720384125 1113 SHAKPSHpHPSHAKPS 1128
Cdd:PHA02666   191 MGDFPMK-HTSAGKPN 205
PRK10263 PRK10263
DNA translocase FtsK; Provisional
966-1164 5.73e-03

DNA translocase FtsK; Provisional


Pssm-ID: 236669 [Multi-domain]  Cd Length: 1355  Bit Score: 40.84  E-value: 5.73e-03
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720384125  966 SQAKPPHLDpSHANLTQGQPSQA---------TPTHSQASQAKPTHSQANSHHPHPSHAKPS--------HQNPSHANPT 1028
Cdd:PRK10263   297 NRATQPEYD-EYDPLLNGAPITEpvavaaaatTATQSWAAPVEPVTQTPPVASVDVPPAQPTvawqpvpgPQTGEPVIAP 375
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720384125 1029 HPQS--SHAKPSHPQSSHAKPSHPQSSHAKPSHPQSSHAKPSHPQSSQAKPSHPQSSQAKPTHPQSSQANSHHPQaSQAK 1106
Cdd:PRK10263   376 APEGypQQSQYAQPAVQYNEPLQQPVQPQQPYYAPAAEQPAQQPYYAPAPEQPAQQPYYAPAPEQPVAGNAWQAE-EQQS 454
                          170       180       190       200       210
                   ....*....|....*....|....*....|....*....|....*....|....*...
gi 1720384125 1107 PSHPQSSHaKPSHPHPSHAKPSPSqstqckaHKAHQSQPKPFQPRPtQPKSSKTKPSQ 1164
Cdd:PRK10263   455 TFAPQSTY-QTEQTYQQPAAQEPL-------YQQPQPVEQQPVVEP-EPVVEETKPAR 503
PRK10263 PRK10263
DNA translocase FtsK; Provisional
844-1096 5.78e-03

DNA translocase FtsK; Provisional


Pssm-ID: 236669 [Multi-domain]  Cd Length: 1355  Bit Score: 40.84  E-value: 5.78e-03
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720384125  844 ARRAAGKPQPEKACAQGLQLTKAagkSIRTLPHIKYPHPQPCQPAGASQE--------RIMPVSHQGAQQTTQGRPADFA 915
Cdd:PRK10263   349 VDVPPAQPTVAWQPVPGPQTGEP---VIAPAPEGYPQQSQYAQPAVQYNEplqqpvqpQQPYYAPAAEQPAQQPYYAPAP 425
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720384125  916 FKPGSQSTSGSKLSSTSQSSAHQPKFQSKHFQPQP-FQPVPSQKKPSHSRPSQAKPPHLDPSHANLTQGQPSQATPTHS- 993
Cdd:PRK10263   426 EQPAQQPYYAPAPEQPVAGNAWQAEEQQSTFAPQStYQTEQTYQQPAAQEPLYQQPQPVEQQPVVEPEPVVEETKPARPp 505
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720384125  994 -------QASQAKPTHSQANSHHPHPSHAK-PSHQNPSHANPTHPQSSHAKPSHPQSSHAKPSHPQSSHAkpshpqSSHA 1065
Cdd:PRK10263   506 lyyfeevEEKRAREREQLAAWYQPIPEPVKePEPIKSSLKAPSVAAVPPVEAAAAVSPLASGVKKATLAT------GAAA 579
                          250       260       270
                   ....*....|....*....|....*....|.
gi 1720384125 1066 KPSHPQSSQAKPSHPQSSQAKPTHPQSSQAN 1096
Cdd:PRK10263   580 TVAAPVFSLANSGGPRPQVKEGIGPQLPRPK 610
PTZ00395 PTZ00395
Sec24-related protein; Provisional
912-1094 6.17e-03

Sec24-related protein; Provisional


Pssm-ID: 185594 [Multi-domain]  Cd Length: 1560  Bit Score: 40.83  E-value: 6.17e-03
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720384125  912 ADFAFKPGSQSTSGSKLSSTSQSSAHQPKFQSKHFQPQPFQPVPSQKKPsHSRPSQAKPPhldpsHANLTQGQPSQATPT 991
Cdd:PTZ00395   395 SNAAQSNAAQSNAGFSNAGYSNPGNSNPGYNNAPNSNTPYNNPPNSNTP-YSNPPNSNPP-----YSNLPYSNTPYSNAP 468
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720384125  992 HSQASQAKPThsqansHHPHPSHAKPSHQN---PSHANPTHPQSShAKPSHPQSSHAKPSHPQSSHAKPSHPQSSHAKPS 1068
Cdd:PTZ00395   469 LSNAPPSSAK------DHHSAYHAAYQHRAanqPAANLPTANQPA-ANNFHGAAGNSVGNPFASRPFGSAPYGGNAATTA 541
                          170       180
                   ....*....|....*....|....*.
gi 1720384125 1069 HPQSSQAKPSHPQSSQAKPTHPQSSQ 1094
Cdd:PTZ00395   542 DPNGIAKREDHPEGGTNRQKYEQSDE 567
PRK10263 PRK10263
DNA translocase FtsK; Provisional
963-1170 6.62e-03

DNA translocase FtsK; Provisional


Pssm-ID: 236669 [Multi-domain]  Cd Length: 1355  Bit Score: 40.84  E-value: 6.62e-03
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720384125  963 SRPSQAKPPHLDPSHANLTQGQPSQATPTHSQASQAKPTHSQANSHHPHPSHAKPShqNPSHANPTHPqSSHAKPSHPQS 1042
Cdd:PRK10263   282 ARGVAADPDDVLFSGNRATQPEYDEYDPLLNGAPITEPVAVAAAATTATQSWAAPV--EPVTQTPPVA-SVDVPPAQPTV 358
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720384125 1043 SHAKPSHPQSshAKPSHPQSSHAKPSHPQSSQakpshPQSSQAKPTHPQSSQANSHHPQASQAKPSHPQSSHAKPSHPHP 1122
Cdd:PRK10263   359 AWQPVPGPQT--GEPVIAPAPEGYPQQSQYAQ-----PAVQYNEPLQQPVQPQQPYYAPAAEQPAQQPYYAPAPEQPAQQ 431
                          170       180       190       200       210
                   ....*....|....*....|....*....|....*....|....*....|
gi 1720384125 1123 SHAKPSPSQSTQCKAHKAHQSQPkPFQPRPT-QPKSSKTKP-SQARAFHP 1170
Cdd:PRK10263   432 PYYAPAPEQPVAGNAWQAEEQQS-TFAPQSTyQTEQTYQQPaAQEPLYQQ 480
KLF1_2_4_N-like cd22056
N-terminal domain of Kruppel-like factors with similarity to the N-terminal domains of ...
1008-1114 6.79e-03

N-terminal domain of Kruppel-like factors with similarity to the N-terminal domains of Kruppel-like factor (KLF)1, KLF2, and KLF4; Kruppel/Krueppel-like transcription factors (KLFs) belong to a family of proteins called the Specificity Protein (SP)/KLF family, characterized by a C-terminal DNA-binding domain of 81 amino acids consisting of three Kruppel-like C2H2 zinc fingers. These factors bind to a loose consensus motif, namely NNRCRCCYY (where N is any nucleotide; R is A/G, and Y is C/T), such as the recurring motifs in GC and GT boxes (5'-GGGGCGGGG-3' and 5-GGTGTGGGG-3') that are present in promoters and more distal regulatory elements of mammalian genes. Members of the KLF family can act as activators or repressors of transcription depending on cell and promoter context. KLFs regulate various cellular functions, such as proliferation, differentiation, and apoptosis, as well as the development and homeostasis of several types of tissue. In addition to the C-terminal DNA-binding domain, each KLF also has a unique N-terminal activation/repression domain that confers specifity and allows it to bind specifically to a certain partner, leading to distinct activities in vivo. This model represents the N-terminal domains of an unknown subfamily of KLFs, predominantly found in fish, related to the N-terminal domains of KLF1, KLF2, and KLF4.


Pssm-ID: 409231 [Multi-domain]  Cd Length: 339  Bit Score: 40.03  E-value: 6.79e-03
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720384125 1008 HHPHPSHAKPSHQNPSHANPTHPQSSHAKPSHPQSSHAKPSHPQSSHakpsHPQSSHAKPSHPQSSQAKPSHPQ-----S 1082
Cdd:cd22056    209 PKHQMHSVHPQAFTHHQAAGPGALQGRGGRGGPDCHLLHSSHHHHHH----HHLQYQYMNAPYPPHYAHQGAPQfhgqyS 284
                           90       100       110
                   ....*....|....*....|....*....|..
gi 1720384125 1083 SQAKPTHPQssqaNSHHPQASQAKPSHPQSSH 1114
Cdd:cd22056    285 VFREPMRVH----HQGHPGSMLTPPSSPPLLE 312
CARD_BIRC2_BIRC3 cd08329
Caspase activation and recruitment domain found in Baculoviral IAP repeat-containing proteins, ...
1-79 7.70e-03

Caspase activation and recruitment domain found in Baculoviral IAP repeat-containing proteins, BIRC2 (c-IAP1) and BIRC3 (c-IAP2); Caspase activation and recruitment domain (CARD) similar to those found in Baculoviral IAP repeat (BIR)-containing protein 2 (BIRC2) or cellular Inhibitor of Apoptosis Protein 1 (c-IAP1), and BIRC3 (or c-IAP2). IAPs are anti-apoptotic proteins that contain at least one BIR domain. Most IAPs also contain a C-terminal RING domain. In addition, both BIRC2 and BIRC3 contain a CARD. BIRC2 and BIRC3, through their binding with TRAF (TNF receptor-associated factor) 2, are recruited to TNFR-1/2 signaling complexes, where they regulate caspase-8 activity. They also play important roles in pro-survival NF-kB signaling pathways. In general, CARDs are death domains (DDs) found associated with caspases. They are known to be important in the signaling pathways for apoptosis, inflammation and host-defense mechanisms. DDs are protein-protein interaction domains found in a variety of domain architectures. Their common feature is that they form homodimers by self-association or heterodimers by associating with other members of the DD superfamily including PYRIN and DED (Death Effector Domain). They serve as adaptors in signaling pathways and can recruit other proteins into signaling complexes.


Pssm-ID: 260038  Cd Length: 94  Bit Score: 37.04  E-value: 7.70e-03
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720384125    1 MATEGASseLIEKKRTKLL----SVLqqdpdSILDTLTSRSLISEKEYETLEEITDPLKKSRKLLILIQKKGEDSCRRFL 76
Cdd:cd08329      3 MASDDLS--LIRKNRMALFqhltCVL-----PILDHLLSANVITEQEYDVIKQKTQTPLQARELIDTILVKGNAAAEVFR 75

                   ...
gi 1720384125   77 RCL 79
Cdd:cd08329     76 NCL 78
PRK14949 PRK14949
DNA polymerase III subunits gamma and tau; Provisional
956-1114 8.24e-03

DNA polymerase III subunits gamma and tau; Provisional


Pssm-ID: 237863 [Multi-domain]  Cd Length: 944  Bit Score: 40.48  E-value: 8.24e-03
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720384125  956 SQKKPSHSR----PSQAKPPhLDPSHANltqgqpSQATPTHSQASQAKPTHSQANSHHPHPSHAKPSHQNPSHANPTHP- 1030
Cdd:PRK14949   635 DGKKSSADRkpktPPSRAPP-ASLSKPA------SSPDASQTSASFDLDPDFELATHQSVPEAALASGSAPAPPPVPDPy 707
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720384125 1031 -------QSSHAKPSHPQSSHAKPSHPQSSH-AKPSHPQSSHAKPSHPQSSQAKPshpQSSQAKPTHPQSSQANSHHPQA 1102
Cdd:PRK14949   708 drppweeAPEVASANDGPNNAAEGNLSESVEdASNSELQAVEQQATHQPQVQAEA---QSPASTTALTQTSSEVQDTELN 784
                          170
                   ....*....|..
gi 1720384125 1103 SQAKPSHPQSSH 1114
Cdd:PRK14949   785 LVLLSSGSITGH 796
PRK10905 PRK10905
cell division protein DamX; Validated
952-1132 8.29e-03

cell division protein DamX; Validated


Pssm-ID: 236792 [Multi-domain]  Cd Length: 328  Bit Score: 39.92  E-value: 8.29e-03
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720384125  952 QPVPSQKKPSHSRPSQAKPPHLDPSHANLTQGQpsqaTPTHSQASQAKPTHSQANShhphpshAKPSHQNPSHAN----- 1026
Cdd:PRK10905    52 QPAPGTTSAEQTAGNTQQDVSLPPISSTPTQGQ----TPVATDGQQRVEVQGDLNN-------ALTQPQNQQQLNnvavn 120
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720384125 1027 ---PTHPqSSHAKPSHPQSSHAKPSHPQSSHAKPSHPQSSHA--KPSHPQSSQAKPSHPQSSQAKPTHPQSSQANSHHPQ 1101
Cdd:PRK10905   121 stlPTEP-ATVAPVRNGNASRQTAKTQTAERPATTRPARKQAviEPKKPQATAKTEPKPVAQTPKRTEPAAPVASTKAPA 199
                          170       180       190
                   ....*....|....*....|....*....|.
gi 1720384125 1102 ASQAKPSHPQSSHAKPSHPHPSHAKPSPSQS 1132
Cdd:PRK10905   200 ATSTPAPKETATTAPVQTASPAQTTATPAAG 230
PRK14951 PRK14951
DNA polymerase III subunits gamma and tau; Provisional
1022-1155 9.56e-03

DNA polymerase III subunits gamma and tau; Provisional


Pssm-ID: 237865 [Multi-domain]  Cd Length: 618  Bit Score: 40.08  E-value: 9.56e-03
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720384125 1022 PSHANPThPQSSHAKPSHPQSSHAKPSHPQSSHAKPSHPQSSHAKPSHPQSSQAKPSHPQSSQAKPTHPQSSQAnsHHPQ 1101
Cdd:PRK14951   366 PAAAAEA-AAPAEKKTPARPEAAAPAAAPVAQAAAAPAPAAAPAAAASAPAAPPAAAPPAPVAAPAAAAPAAAP--AAAP 442
                           90       100       110       120       130
                   ....*....|....*....|....*....|....*....|....*....|....
gi 1720384125 1102 ASQAKPSHPQSSHAKPSHPHPSHAKPSPSQSTqckAHKAHQSQPKPFQPRPTQP 1155
Cdd:PRK14951   443 AAVALAPAPPAQAAPETVAIPVRVAPEPAVAS---AAPAPAAAPAAARLTPTEE 493
ARG80 COG5068
Regulator of arginine metabolism and related MADS box-containing transcription factors ...
882-1130 9.88e-03

Regulator of arginine metabolism and related MADS box-containing transcription factors [Transcription];


Pssm-ID: 227400 [Multi-domain]  Cd Length: 412  Bit Score: 39.61  E-value: 9.88e-03
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720384125  882 PQPCQPAGASQERIMPVSHQGAQQTTQGRPADFAFKPGSQSTSGSKLSSTSQSSAHQPKFQSKHFQPQPFQPVPSQKKPS 961
Cdd:COG5068    144 SVVKSLEGKSLIQSPCSNAPSDSSEEPSSSASFSVDPNDNNPMGSFQHNGSPQTNFIPLQNPQTQQYQQHSSRKDHPTVP 223
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720384125  962 HSRPSQAKPPHLdPSHANLTQGQPSQATPTHSQASQAKPTHSQAnshhphpSHAKPSHQNPSHANPTHPQSSHAKPSHPQ 1041
Cdd:COG5068    224 HSNTNNGRPPAK-FMIPELHSSHSTLDLPSDFISDSGFPNQSST-------SIFPLDSAIIQITPPHLPNNPPQENRHEL 295
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720384125 1042 SSHakPSHPQSSHAKPshPQSSHAKPSHPQSSQAKPSHPQSSQAKPTHPQSSQANSHHPQASQAKPSHPQSSHAKPSHPH 1121
Cdd:COG5068    296 YSN--DSSMVSETPPP--KNLPNGSPNQSPLNNLSRGNPASPNSIIRENNQVEDESFNGRQGSAIWNALISTTQPNSGLH 371

                   ....*....
gi 1720384125 1122 PSHAKPSPS 1130
Cdd:COG5068    372 TEASTAPSS 380
CARD_RIP2_CARD3 cd08786
Caspase activation and recruitment domain of Receptor Interacting Protein 2; Caspase ...
9-79 9.99e-03

Caspase activation and recruitment domain of Receptor Interacting Protein 2; Caspase activation and recruitment domain (CARD) of Receptor Interacting Protein 2 (RIP2/RIPK2/RICK/CARDIAK/CARD3). RIP kinases serve as essential sensors of cellular stress. Vertebrates contain several types containing a homologous N-terminal kinase domain and varying C-terminal domains. RIP2 harbors a C-terminal CARD domain and functions as an effector kinase downstream of the pattern recognition receptors from the Nod-like (NLR)-family, NOD1 and NOD2, which recognizes bacterial peptidoglycans released upon infection. This cascade is implicated in inflammatory immune responses and the clearance of intracellular pathogens. RIP2 associates with NOD1 and NOD2 via CARD-CARD interactions. In general, CARDs are death domains (DDs) found associated with caspases. They are known to be important in the signaling pathways for apoptosis, inflammation, and host-defense mechanisms. DDs are protein-protein interaction domains found in a variety of domain architectures. Their common feature is that they form homodimers by self-association or heterodimers by associating with other members of the DD superfamily including PYRIN and DED (Death Effector Domain). They serve as adaptors in signaling pathways and can recruit other proteins into signaling complexes.


Pssm-ID: 176764  Cd Length: 87  Bit Score: 36.44  E-value: 9.99e-03
                           10        20        30        40        50        60        70
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|..
gi 1720384125    9 ELIEKKRTKLLSVLQQDP-DSILDTLTSRSLISEKEYETLEEITDPLKKSRKLLILIQKKGEDSCRRFLRCL 79
Cdd:cd08786      1 QWIASKREEIVSQMTEAClNQSLDALLSRQLLMREDYELISTKPTRTSKVRQLLDTCDCQGEEFARVVVQKL 72
 
Blast search parameters
Data Source: Precalculated data, version = cdd.v.3.21
Preset Options:Database: CDSEARCH/cdd   Low complexity filter: no  Composition Based Adjustment: yes   E-value threshold: 0.01

References:

  • Wang J et al. (2023), "The conserved domain database in 2023", Nucleic Acids Res.51(D)384-8.
  • Lu S et al. (2020), "The conserved domain database in 2020", Nucleic Acids Res.48(D)265-8.
  • Marchler-Bauer A et al. (2017), "CDD/SPARCLE: functional classification of proteins via subfamily domain architectures.", Nucleic Acids Res.45(D)200-3.
Help | Disclaimer | Write to the Help Desk
NCBI | NLM | NIH