NCBI Home Page NCBI Site Search page NCBI Guide that lists and describes the NCBI resources
Conserved domains on  [gi|2022791524|ref|XP_040404264|]
View 

protein piccolo isoform X8 [Cygnus olor]

Protein Classification

Graphical summary

 Zoom to residue level

show extra options »

Show site features     Horizontal zoom: ×

List of domain hits

Name Accession Description Interval E-value
PDZ_RIM-like cd06714
PDZ domain of Rab3-interacting molecule 1 (RIM), RIM2, piccolo and related domains; PDZ ...
4525-4618 1.11e-47

PDZ domain of Rab3-interacting molecule 1 (RIM), RIM2, piccolo and related domains; PDZ (PSD-95 (Postsynaptic density protein 95), Dlg (Discs large protein), and ZO-1 (Zonula occludens-1)) domain of RIM, RIM2, piccolo and related domains. RIM proteins and Gallus gallus protein piccolo (also called aczonin) are involved in neurotransmitter release at presynaptic active zones, the site of vesicle fusion. A protein complex containing RIM proteins positions synaptic vesicles containing synaptotagmin at the active zone. RIM proteins simultaneously activate docking and priming of synaptic vesicles and recruit Ca2+-channels to active zones, thereby connecting primed synaptic vesicles to Ca2+-channels. RIM binding to vesicular Rab proteins (Rab3 and Rab27 isoforms) mediates vesicle docking; RIM binding to Munc13 activates vesicle priming; RIM binding to the Ca2+-channel, both directly and indirectly via RIM-BP, recruits the Ca2+-channels. The RIM PDZ domain interacts with the C-termini of N- and P/Q-type voltage-gated Ca2+-channels. RIM1, RIM2 and piccolo also participate in regulated exocytosis through binding cAMP-GEFII (cAMP-binding protein-guanidine nucleotide exchange factor II). The piccolo PDZ domain binds cAMP-GEFII. RIM2 also plays a role in dendrite formation by melanocytes. Caenorhabditis elegans RIM (also known as unc-10) may be involved in the regulation of defecation and daumone response. PDZ domains usually bind in a sequence-specific manner to short peptide sequences located at the C-terminal end of their partner proteins (known as PDZ binding motifs). The PDZ superfamily includes canonical PDZ domains as well as those with circular permutations and domain swapping mediated by beta-strands. This RIM-like family domain is a canonical PDZ domain containing six beta-strands A-F and two alpha-helices (alpha-helix 1 and 2), arranged in the order: beta-strands A, B, C, alpha-helix 1, beta-strands D, E, alpha-helix 2 and beta-strand F.


:

Pssm-ID: 467198 [Multi-domain]  Cd Length: 95  Bit Score: 166.57  E-value: 1.11e-47
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2022791524 4525 FPHARLKLLRDPKDHTVSGNGLGIRVVGGKEIPgsSGETGAYIAKILPGGNAEQTGKLIEGMQVLEWNGIPLTGKTYEEV 4604
Cdd:cd06714      2 FLIGRIILQRDPKDGSVSGNGLGLKVVGGKMTE--SGRLGAYVTKVKPGSVADTVGHLREGDEVLEWNGISLQGKTFEEV 79
                           90
                   ....*....|....
gi 2022791524 4605 QNIIIQQSGEAEIC 4618
Cdd:cd06714     80 QDIISQSKGEVELV 93
FYVE1_PCLO cd15774
FYVE-related domain 1 found in protein piccolo; Protein piccolo, also termed aczonin, is a ...
454-515 2.31e-43

FYVE-related domain 1 found in protein piccolo; Protein piccolo, also termed aczonin, is a neuron-specific presynaptic active zone scaffolding protein that mainly interacts with a detergent-resistant cytoskeletal-like subcellular fraction and is involved in the organization of the interplay between neurotransmitter vesicles, the cytoskeleton, and the plasma membrane at synaptic active zones. It binds profilin, an actin-binding protein implicated in actin cytoskeletal dynamics. It also functions as a presynaptic low-affinity Ca2+ sensor and has been implicated in Ca2+ regulation of neurotransmitter release. Piccolo is a multi-domain protein containing two N-terminal FYVE zinc fingers, a polyproline tract, and a PDZ domain and two C-terminal C2 domains. This family corresponds to the first FYVE domain, which resembles a FYVE-related domain that is structurally similar to the canonical FYVE domains but lacks the three signature sequences: an N-terminal WxxD motif (x for any residue), the central basic R(R/K)HHCRxCG patch, and a C-terminal RVC motif.


:

Pssm-ID: 277313 [Multi-domain]  Cd Length: 62  Bit Score: 152.88  E-value: 2.31e-43
                           10        20        30        40        50        60
                   ....*....|....*....|....*....|....*....|....*....|....*....|..
gi 2022791524  454 TFCPLCTTTELLLHTPEKANYNTCTQCHTVVCSLCGFNPNPHITEIKEWLCLNCQMQRALGG 515
Cdd:cd15774      1 TICPLCKTTELLLHTPEKANYNTCTQCQTTVCSLCGFNPNPHITEKKEWLCLNCQMQRALGG 62
FYVE2_PCLO cd15776
FYVE-related domain 2 found in protein piccolo; Protein piccolo, also termed aczonin, is a ...
924-987 3.03e-42

FYVE-related domain 2 found in protein piccolo; Protein piccolo, also termed aczonin, is a neuron-specific presynaptic active zone scaffolding protein that mainly interacts with a detergent-resistant cytoskeletal-like subcellular fraction and is involved in the organization of the interplay between neurotransmitter vesicles, the cytoskeleton, and the plasma membrane at synaptic active zones. It binds profilin, an actin-binding protein implicated in actin cytoskeletal dynamics. It also functions as a presynaptic low-affinity Ca2+ sensor and has been implicated in Ca2+ regulation of neurotransmitter release. Piccolo is a multi-domain protein containing two N-terminal FYVE zinc fingers, a polyproline tract, and a PDZ domain and two C-terminal C2 domains. This family corresponds to the second FYVE domain, which resembles a FYVE-related domain that is structurally similar to the canonical FYVE domains but lacks the three signature sequences: an N-terminal WxxD motif (x for any residue), the central basic R(R/K)HHCRxCG patch, and a C-terminal RVC motif.


:

Pssm-ID: 277315 [Multi-domain]  Cd Length: 64  Bit Score: 149.83  E-value: 3.03e-42
                           10        20        30        40        50        60
                   ....*....|....*....|....*....|....*....|....*....|....*....|....
gi 2022791524  924 VSCPLCKTGLNVGSKDPPNFNTCTECKKVVCNLCGFNPMPHIAEVQEWLCLNCQTQRAMSGQLG 987
Cdd:cd15776      1 LLCPLCKTELNIGSKDPPNFNTCTECKKTVCNLCGFNPTPHLTEVKEWLCLNCQTQRAMSGQLG 64
PHA03247 super family cl33720
large tegument protein UL36; Provisional
63-454 8.42e-20

large tegument protein UL36; Provisional


The actual alignment was detected with superfamily member PHA03247:

Pssm-ID: 223021 [Multi-domain]  Cd Length: 3151  Bit Score: 98.86  E-value: 8.42e-20
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2022791524   63 RAQGLPRGNLAGAEPPpmqrHPELDTSHHPRQPskPPDPGPGLSKSRTVDVLKTEQRAPGR---SPSSISLREsKSRTDF 139
Cdd:PHA03247  2599 RAPVDDRGDPRGPAPP----SPLPPDTHAPDPP--PPSPSPAANEPDPHPPPTVPPPERPRddpAPGRVSRPR-RARRLG 2671
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2022791524  140 REDQKSSMMPSFLSEANPlSAVTSVVNKFNPfdlisdSDTAHEEAGRKQKVIQKEQGKPEEQRGSAKLPAQQQSP-KPVQ 218
Cdd:PHA03247  2672 RAAQASSPPQRPRRRAAR-PTVGSLTSLADP------PPPPPTPEPAPHALVSATPLPPGPAAARQASPALPAAPaPPAV 2744
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2022791524  219 PQGHVKPTPQQSESSKAVPQQqqPGEPKQIQKPGPGHPADSKLEQVKQPPQPRGPQKSQPQQSEPTKPVQQQTSAKP-SA 297
Cdd:PHA03247  2745 PAGPATPGGPARPARPPTTAG--PPAPAPPAAPAAGPPRRLTRPAVASLSESRESLPSPWDPADPPAAVLAPAAALPpAA 2822
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2022791524  298 GPAKPLPQQPDSARTSSQAPPPAKPSSQQLG-------------PVKQPSQQPARQGGPvkPSSQQAGPPKQPSQQPGPE 364
Cdd:PHA03247  2823 SPAGPLPPPTSAQPTAPPPPPGPPPPSLPLGgsvapggdvrrrpPSRSPAAKPAAPARP--PVRRLARPAVSRSTESFAL 2900
                          330       340       350       360       370       380       390       400
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2022791524  365 KPSAQQMGPAKQA-SQPGPGKPPLQQTGPVKQAPPQSGPTKPPPQTAGATKAPVPQPGLTKP------PGQQPEPEKPSQ 437
Cdd:PHA03247  2901 PPDQPERPPQPQApPPPQPQPQPPPPPQPQPPPPPPPRPQPPLAPTTDPAGAGEPSGAVPQPwlgalvPGRVAVPRFRVP 2980
                          410
                   ....*....|....*..
gi 2022791524  438 QKQASATQPAESAPKKT 454
Cdd:PHA03247  2981 QPAPSREAPASSTPPLT 2997
PRK07764 super family cl35613
DNA polymerase III subunits gamma and tau; Validated
328-720 1.46e-10

DNA polymerase III subunits gamma and tau; Validated


The actual alignment was detected with superfamily member PRK07764:

Pssm-ID: 236090 [Multi-domain]  Cd Length: 824  Bit Score: 67.70  E-value: 1.46e-10
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2022791524  328 GPVKQPSQQPARQGGPVkPSSQQAGPPKQPSQQPGPEKPSAQQMGPAKQASQPGPGKPPLQQTGPVKQAPPQSGPTKPPP 407
Cdd:PRK07764   389 GGAGAPAAAAPSAAAAA-PAAAPAPAAAAPAAAAAPAPAAAPQPAPAPAPAPAPPSPAGNAPAGGAPSPPPAAAPSAQPA 467
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2022791524  408 QTAGATKAPVPQPGLTKPPGQQPEPEkPSQQKQASATQPA--------------ESAPKKTFcplcTTTELLLHTPEKAN 473
Cdd:PRK07764   468 PAPAAAPEPTAAPAPAPPAAPAPAAA-PAAPAAPAAPAGAddaatlrerwpeilAAVPKRSR----KTWAILLPEATVLG 542
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2022791524  474 Y--NTCTQCHTVVCSLCGFNpNPHITE-----IKEWLCLNCQMQRALGGDLAPGHGPGPQLPPPKQKTPTPPSTAKPPPQ 546
Cdd:PRK07764   543 VrgDTLVLGFSTGGLARRFA-SPGNAEvlvtaLAEELGGDWQVEAVVGPAPGAAGGEGPPAPASSGPPEEAARPAAPAAP 621
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2022791524  547 PQPGQKKDASPKPDPAQLAEPKKPPPQKKQPSmPGSPPVKAKTSRAEPADTSQQiDPAPKSDRAKPSQAEDKQKQPsiqK 626
Cdd:PRK07764   622 AAPAAPAPAGAAAAPAEASAAPAPGVAAPEHH-PKHVAVPDASDGGDGWPAKAG-GAAPAAPPPAPAPAAPAAPAG---A 696
                          330       340       350       360       370       380       390       400
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2022791524  627 PAADTVLTPAAPGLKQDSAGPRPPPTQQKVTDSPKPELAKPSQDTHPA-GEKPDSKPLPQVSRQKSDPKLSAQPGARSEA 705
Cdd:PRK07764   697 APAQPAPAPAATPPAGQADDPAAQPPQAAQGASAPSPAADDPVPLPPEpDDPPDPAGAPAQPPPPPAPAPAAAPAAAPPP 776
                          410
                   ....*....|....*
gi 2022791524  706 KTQKPVEPAQVKDDP 720
Cdd:PRK07764   777 SPPSEEEEMAEDDAP 791
PTZ00121 super family cl31754
MAEBL; Provisional
1023-1629 7.50e-08

MAEBL; Provisional


The actual alignment was detected with superfamily member PTZ00121:

Pssm-ID: 173412 [Multi-domain]  Cd Length: 2084  Bit Score: 59.38  E-value: 7.50e-08
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2022791524 1023 QKTSTPPTPAAAKPKEEPGVQKEVPKLQQGRleKTLSADKIQQGVQKEDAKLKQGKLfKTPSADKIRVSQK-EDPR---- 1097
Cdd:PTZ00121  1104 KKTETGKAEEARKAEEAKKKAEDARKAEEAR--KAEDARKAEEARKAEDAKRVEIAR-KAEDARKAEEARKaEDAKkaea 1180
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2022791524 1098 -------LQQTKLTKTPSSDKILHGVQKEDA-KFQEAKLAKTSSADKILHGVQKEDIKPQEAKLAKLpsadkilhgiQKE 1169
Cdd:PTZ00121  1181 arkaeevRKAEELRKAEDARKAEAARKAEEErKAEEARKAEDAKKAEAVKKAEEAKKDAEEAKKAEE----------ERN 1250
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2022791524 1170 DLKLQQMKMAKALSADKIQPSAQKEDAQLQEvKLSKAVSADKIQHGIQKEDpnlqhvkIAKASLVEKIQQEAQK-EESKL 1248
Cdd:PTZ00121  1251 NEEIRKFEEARMAHFARRQAAIKAEEARKAD-ELKKAEEKKKADEAKKAEE-------KKKADEAKKKAEEAKKaDEAKK 1322
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2022791524 1249 QQEKLSKTLSADKIPATVSSDQKKTLSKFEEDKKPLL--PEKKAPHSEDKKEQiiAETKDHVAERKGEVEVPCEELHAKK 1326
Cdd:PTZ00121  1323 KAEEAKKKADAAKKKAEEAKKAAEAAKAEAEAAADEAeaAEEKAEAAEKKKEE--AKKKADAAKKKAEEKKKADEAKKKA 1400
                          330       340       350       360       370       380       390       400
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2022791524 1327 QEDAKKEGLTTGIPQMVLKAE----KAQE----EETPVQMSRVPRSDHV-ETVREKTEKEDDKSDTSSSQQQKSPQGLSD 1397
Cdd:PTZ00121  1401 EEDKKKADELKKAAAAKKKADeakkKAEEkkkaDEAKKKAEEAKKADEAkKKAEEAKKAEEAKKKAEEAKKADEAKKKAE 1480
                          410       420       430       440       450       460       470       480
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2022791524 1398 TGYSSDGISSSLGEIPSHipSDEKDLLRESSKK--DTVSQESPPSPSDLAKLESTVLSILEAQASTLSDEKSVKKKELYE 1475
Cdd:PTZ00121  1481 EAKKADEAKKKAEEAKKK--ADEAKKAAEAKKKadEAKKAEEAKKADEAKKAEEAKKADEAKKAEEKKKADELKKAEELK 1558
                          490       500       510       520       530       540       550       560
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2022791524 1476 TYSEQTK-------DQHKTKPLPVTPESYSSDEEDLEGEGTIVEDGKGSASSQAEYKEgdgEDDIPARRQRydsvedSSE 1548
Cdd:PTZ00121  1559 KAEEKKKaeeakkaEEDKNMALRKAEEAKKAEEARIEEVMKLYEEEKKMKAEEAKKAE---EAKIKAEELK------KAE 1629
                          570       580       590       600       610       620       630       640
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2022791524 1549 SENSPVPRRKRRGSVGSSSSDEYKRDDSQGSGDEEDFIRKQIIEMSADEDASGSEDDEFIRNQLKEISVTESQKKEDVKS 1628
Cdd:PTZ00121  1630 EEKKKVEQLKKKEAEEKKKAEELKKAEEENKIKAAEEAKKAEEDKKKAEEAKKAEEDEKKAAEALKKEAEEAKKAEELKK 1709

                   .
gi 2022791524 1629 K 1629
Cdd:PTZ00121  1710 K 1710
PHA03247 super family cl33720
large tegument protein UL36; Provisional
580-1054 1.98e-07

large tegument protein UL36; Provisional


The actual alignment was detected with superfamily member PHA03247:

Pssm-ID: 223021 [Multi-domain]  Cd Length: 3151  Bit Score: 58.03  E-value: 1.98e-07
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2022791524  580 PGSPPVKAKTSRAEPADTSQQIDPAPKSDRAKPSQAEDKQKQPSIQKPAADTVLTPAAPGLKQDSAGPRPPPTqqkvtdS 659
Cdd:PHA03247  2557 PAAPPAAPDRSVPPPRPAPRPSEPAVTSRARRPDAPPQSARPRAPVDDRGDPRGPAPPSPLPPDTHAPDPPPP------S 2630
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2022791524  660 PKP---ELAKPSQDTHPAGEKPDSKPLP-QVSRQKSDPKLSAQPGARSEAKTQKPVEPAQVKDDPKKLQTKPSPKPETKP 735
Cdd:PHA03247  2631 PSPaanEPDPHPPPTVPPPERPRDDPAPgRVSRPRRARRLGRAAQASSPPQRPRRRAARPTVGSLTSLADPPPPPPTPEP 2710
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2022791524  736 APKGLQPGagpkpvpaqpapqpqQPQKTPEQPRRFSLNLGGITDTPKPQPTTPQETVTGRLFGFGASIFSQASNLISTAG 815
Cdd:PHA03247  2711 APHALVSA---------------TPLPPGPAAARQASPALPAAPAPPAVPAGPATPGGPARPARPPTTAGPPAPAPPAAP 2775
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2022791524  816 QPGSQTSGPPPAGPAAKQPQAP----------AQPPASQAPPKDTAQTQSSPKAVP--VKKEAKPLTSEKMEPTKVDSVL 883
Cdd:PHA03247  2776 AAGPPRRLTRPAVASLSESRESlpspwdpadpPAAVLAPAAALPPAASPAGPLPPPtsAQPTAPPPPPGPPPPSLPLGGS 2855
                          330       340       350       360       370       380       390       400
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2022791524  884 TTKGSDLEKKPSLAKDSKPQAAEAKKPAGVLEPEKASHPKVSCPLcktglnvgskdPPNfntcteckkvvcnlcgfnpmp 963
Cdd:PHA03247  2856 VAPGGDVRRRPPSRSPAAKPAAPARPPVRRLARPAVSRSTESFAL-----------PPD--------------------- 2903
                          410       420       430       440       450       460       470       480
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2022791524  964 hiaevqewlclncqtqramsgqlgdmGKVPLPKPGPSQPvSKPPATPQKQPVPAVSHFPQKTSTPPTPAAAKPKEEPGVQ 1043
Cdd:PHA03247  2904 --------------------------QPERPPQPQAPPP-PQPQPQPPPPPQPQPPPPPPPRPQPPLAPTTDPAGAGEPS 2956
                          490
                   ....*....|.
gi 2022791524 1044 KEVPKLQQGRL 1054
Cdd:PHA03247  2957 GAVPQPWLGAL 2967
CCDC47 super family cl46382
PAT complex subunit CCDC47; This family represents CCDC47 proteins which are a component of ...
3782-3845 4.23e-03

PAT complex subunit CCDC47; This family represents CCDC47 proteins which are a component of the PAT complex, an endoplasmic reticulum (ER)-resident membrane multiprotein complex that facilitates multi-pass membrane proteins insertion into membranes. The PAT complex, formed by CCDC47 and Asterix proteins, acts as an intramembrane chaperone by directly interacting with nascent transmembrane domains (TMDs), releasing its substrates upon correct folding, and is needed for optimal biogenesis of multi-pass membrane proteins. CCDC47 is required to maintain the stability of Asterix. CCDC47 is associated with various membrane-associated processes and is component of a ribosome-associated ER translocon complex involved in multi-pass membrane protein transport into the ER membrane and biogenesis. It is also involved in the regulation of calcium ion homeostasis in the ER, being also required for proper protein degradation via the ERAD (ER-associated degradation) pathway.


The actual alignment was detected with superfamily member pfam07946:

Pssm-ID: 480722  Cd Length: 323  Bit Score: 42.55  E-value: 4.23e-03
                           10        20        30        40        50        60
                   ....*....|....*....|....*....|....*....|....*....|....*....|....
gi 2022791524 3782 ARAKILQDIDRELDLVERESAKLRKKQAELDEEEKEIDaklrylEMGINRRKEALLKEREKRER 3845
Cdd:pfam07946  265 TREEEIEKIKKAAEEERAEEAQEKKEEAKKKEREEKLA------KLSPEEQRKYEEKERKKEQR 322
 
Name Accession Description Interval E-value
PDZ_RIM-like cd06714
PDZ domain of Rab3-interacting molecule 1 (RIM), RIM2, piccolo and related domains; PDZ ...
4525-4618 1.11e-47

PDZ domain of Rab3-interacting molecule 1 (RIM), RIM2, piccolo and related domains; PDZ (PSD-95 (Postsynaptic density protein 95), Dlg (Discs large protein), and ZO-1 (Zonula occludens-1)) domain of RIM, RIM2, piccolo and related domains. RIM proteins and Gallus gallus protein piccolo (also called aczonin) are involved in neurotransmitter release at presynaptic active zones, the site of vesicle fusion. A protein complex containing RIM proteins positions synaptic vesicles containing synaptotagmin at the active zone. RIM proteins simultaneously activate docking and priming of synaptic vesicles and recruit Ca2+-channels to active zones, thereby connecting primed synaptic vesicles to Ca2+-channels. RIM binding to vesicular Rab proteins (Rab3 and Rab27 isoforms) mediates vesicle docking; RIM binding to Munc13 activates vesicle priming; RIM binding to the Ca2+-channel, both directly and indirectly via RIM-BP, recruits the Ca2+-channels. The RIM PDZ domain interacts with the C-termini of N- and P/Q-type voltage-gated Ca2+-channels. RIM1, RIM2 and piccolo also participate in regulated exocytosis through binding cAMP-GEFII (cAMP-binding protein-guanidine nucleotide exchange factor II). The piccolo PDZ domain binds cAMP-GEFII. RIM2 also plays a role in dendrite formation by melanocytes. Caenorhabditis elegans RIM (also known as unc-10) may be involved in the regulation of defecation and daumone response. PDZ domains usually bind in a sequence-specific manner to short peptide sequences located at the C-terminal end of their partner proteins (known as PDZ binding motifs). The PDZ superfamily includes canonical PDZ domains as well as those with circular permutations and domain swapping mediated by beta-strands. This RIM-like family domain is a canonical PDZ domain containing six beta-strands A-F and two alpha-helices (alpha-helix 1 and 2), arranged in the order: beta-strands A, B, C, alpha-helix 1, beta-strands D, E, alpha-helix 2 and beta-strand F.


Pssm-ID: 467198 [Multi-domain]  Cd Length: 95  Bit Score: 166.57  E-value: 1.11e-47
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2022791524 4525 FPHARLKLLRDPKDHTVSGNGLGIRVVGGKEIPgsSGETGAYIAKILPGGNAEQTGKLIEGMQVLEWNGIPLTGKTYEEV 4604
Cdd:cd06714      2 FLIGRIILQRDPKDGSVSGNGLGLKVVGGKMTE--SGRLGAYVTKVKPGSVADTVGHLREGDEVLEWNGISLQGKTFEEV 79
                           90
                   ....*....|....
gi 2022791524 4605 QNIIIQQSGEAEIC 4618
Cdd:cd06714     80 QDIISQSKGEVELV 93
FYVE1_PCLO cd15774
FYVE-related domain 1 found in protein piccolo; Protein piccolo, also termed aczonin, is a ...
454-515 2.31e-43

FYVE-related domain 1 found in protein piccolo; Protein piccolo, also termed aczonin, is a neuron-specific presynaptic active zone scaffolding protein that mainly interacts with a detergent-resistant cytoskeletal-like subcellular fraction and is involved in the organization of the interplay between neurotransmitter vesicles, the cytoskeleton, and the plasma membrane at synaptic active zones. It binds profilin, an actin-binding protein implicated in actin cytoskeletal dynamics. It also functions as a presynaptic low-affinity Ca2+ sensor and has been implicated in Ca2+ regulation of neurotransmitter release. Piccolo is a multi-domain protein containing two N-terminal FYVE zinc fingers, a polyproline tract, and a PDZ domain and two C-terminal C2 domains. This family corresponds to the first FYVE domain, which resembles a FYVE-related domain that is structurally similar to the canonical FYVE domains but lacks the three signature sequences: an N-terminal WxxD motif (x for any residue), the central basic R(R/K)HHCRxCG patch, and a C-terminal RVC motif.


Pssm-ID: 277313 [Multi-domain]  Cd Length: 62  Bit Score: 152.88  E-value: 2.31e-43
                           10        20        30        40        50        60
                   ....*....|....*....|....*....|....*....|....*....|....*....|..
gi 2022791524  454 TFCPLCTTTELLLHTPEKANYNTCTQCHTVVCSLCGFNPNPHITEIKEWLCLNCQMQRALGG 515
Cdd:cd15774      1 TICPLCKTTELLLHTPEKANYNTCTQCQTTVCSLCGFNPNPHITEKKEWLCLNCQMQRALGG 62
FYVE2_PCLO cd15776
FYVE-related domain 2 found in protein piccolo; Protein piccolo, also termed aczonin, is a ...
924-987 3.03e-42

FYVE-related domain 2 found in protein piccolo; Protein piccolo, also termed aczonin, is a neuron-specific presynaptic active zone scaffolding protein that mainly interacts with a detergent-resistant cytoskeletal-like subcellular fraction and is involved in the organization of the interplay between neurotransmitter vesicles, the cytoskeleton, and the plasma membrane at synaptic active zones. It binds profilin, an actin-binding protein implicated in actin cytoskeletal dynamics. It also functions as a presynaptic low-affinity Ca2+ sensor and has been implicated in Ca2+ regulation of neurotransmitter release. Piccolo is a multi-domain protein containing two N-terminal FYVE zinc fingers, a polyproline tract, and a PDZ domain and two C-terminal C2 domains. This family corresponds to the second FYVE domain, which resembles a FYVE-related domain that is structurally similar to the canonical FYVE domains but lacks the three signature sequences: an N-terminal WxxD motif (x for any residue), the central basic R(R/K)HHCRxCG patch, and a C-terminal RVC motif.


Pssm-ID: 277315 [Multi-domain]  Cd Length: 64  Bit Score: 149.83  E-value: 3.03e-42
                           10        20        30        40        50        60
                   ....*....|....*....|....*....|....*....|....*....|....*....|....
gi 2022791524  924 VSCPLCKTGLNVGSKDPPNFNTCTECKKVVCNLCGFNPMPHIAEVQEWLCLNCQTQRAMSGQLG 987
Cdd:cd15776      1 LLCPLCKTELNIGSKDPPNFNTCTECKKTVCNLCGFNPTPHLTEVKEWLCLNCQTQRAMSGQLG 64
zf-piccolo pfam05715
Piccolo Zn-finger; This (predicted) Zinc finger is found in the bassoon and piccolo proteins. ...
924-982 1.36e-37

Piccolo Zn-finger; This (predicted) Zinc finger is found in the bassoon and piccolo proteins. There are eight conserved cysteines, suggesting that it coordinates two zinc ligands.


Pssm-ID: 461722 [Multi-domain]  Cd Length: 60  Bit Score: 136.39  E-value: 1.36e-37
                           10        20        30        40        50        60
                   ....*....|....*....|....*....|....*....|....*....|....*....|
gi 2022791524  924 VSCPLCK-TGLNVGSKDPPNFNTCTECKKVVCNLCGFNPMPHIAEVQEWLCLNCQTQRAM 982
Cdd:pfam05715    1 TLCPLCKtTELNVGSKEPPNYNTCTECKSQVCNLCGFNPTPHLTEKKEWLCLNCQTQRAL 60
zf-piccolo pfam05715
Piccolo Zn-finger; This (predicted) Zinc finger is found in the bassoon and piccolo proteins. ...
454-513 1.45e-37

Piccolo Zn-finger; This (predicted) Zinc finger is found in the bassoon and piccolo proteins. There are eight conserved cysteines, suggesting that it coordinates two zinc ligands.


Pssm-ID: 461722 [Multi-domain]  Cd Length: 60  Bit Score: 136.39  E-value: 1.45e-37
                           10        20        30        40        50        60
                   ....*....|....*....|....*....|....*....|....*....|....*....|
gi 2022791524  454 TFCPLCTTTELLLHTPEKANYNTCTQCHTVVCSLCGFNPNPHITEIKEWLCLNCQMQRAL 513
Cdd:pfam05715    1 TLCPLCKTTELNVGSKEPPNYNTCTECKSQVCNLCGFNPTPHLTEKKEWLCLNCQTQRAL 60
PHA03247 PHA03247
large tegument protein UL36; Provisional
63-454 8.42e-20

large tegument protein UL36; Provisional


Pssm-ID: 223021 [Multi-domain]  Cd Length: 3151  Bit Score: 98.86  E-value: 8.42e-20
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2022791524   63 RAQGLPRGNLAGAEPPpmqrHPELDTSHHPRQPskPPDPGPGLSKSRTVDVLKTEQRAPGR---SPSSISLREsKSRTDF 139
Cdd:PHA03247  2599 RAPVDDRGDPRGPAPP----SPLPPDTHAPDPP--PPSPSPAANEPDPHPPPTVPPPERPRddpAPGRVSRPR-RARRLG 2671
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2022791524  140 REDQKSSMMPSFLSEANPlSAVTSVVNKFNPfdlisdSDTAHEEAGRKQKVIQKEQGKPEEQRGSAKLPAQQQSP-KPVQ 218
Cdd:PHA03247  2672 RAAQASSPPQRPRRRAAR-PTVGSLTSLADP------PPPPPTPEPAPHALVSATPLPPGPAAARQASPALPAAPaPPAV 2744
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2022791524  219 PQGHVKPTPQQSESSKAVPQQqqPGEPKQIQKPGPGHPADSKLEQVKQPPQPRGPQKSQPQQSEPTKPVQQQTSAKP-SA 297
Cdd:PHA03247  2745 PAGPATPGGPARPARPPTTAG--PPAPAPPAAPAAGPPRRLTRPAVASLSESRESLPSPWDPADPPAAVLAPAAALPpAA 2822
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2022791524  298 GPAKPLPQQPDSARTSSQAPPPAKPSSQQLG-------------PVKQPSQQPARQGGPvkPSSQQAGPPKQPSQQPGPE 364
Cdd:PHA03247  2823 SPAGPLPPPTSAQPTAPPPPPGPPPPSLPLGgsvapggdvrrrpPSRSPAAKPAAPARP--PVRRLARPAVSRSTESFAL 2900
                          330       340       350       360       370       380       390       400
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2022791524  365 KPSAQQMGPAKQA-SQPGPGKPPLQQTGPVKQAPPQSGPTKPPPQTAGATKAPVPQPGLTKP------PGQQPEPEKPSQ 437
Cdd:PHA03247  2901 PPDQPERPPQPQApPPPQPQPQPPPPPQPQPPPPPPPRPQPPLAPTTDPAGAGEPSGAVPQPwlgalvPGRVAVPRFRVP 2980
                          410
                   ....*....|....*..
gi 2022791524  438 QKQASATQPAESAPKKT 454
Cdd:PHA03247  2981 QPAPSREAPASSTPPLT 2997
Glutenin_hmw pfam03157
High molecular weight glutenin subunit; Members of this family include high molecular weight ...
179-450 6.71e-14

High molecular weight glutenin subunit; Members of this family include high molecular weight subunits of glutenin. This group of gluten proteins is thought to be largely responsible for the elastic properties of gluten, and hence, doughs. Indeed, glutenin high molecular weight subunits are classified as elastomeric proteins, because the glutenin network can withstand significant deformations without breaking, and return to the original conformation when the stress is removed. Elastomeric proteins differ considerably in amino acid sequence, but they are all polymers whose subunits consist of elastomeric domains, composed of repeated motifs, and non-elastic domains that mediate cross-linking between the subunits. The elastomeric domain motifs are all rich in glycine residues in addition to other hydrophobic residues. High molecular weight glutenin subunits have an extensive central elastomeric domain, flanked by two terminal non-elastic domains that form disulphide cross-links. The central elastomeric domain is characterized by the following three repeated motifs: PGQGQQ, GYYPTS[P/L]QQ, GQQ. It possesses overlapping beta-turns within and between the repeated motifs, and assumes a regular helical secondary structure with a diameter of approx. 1.9 nm and a pitch of approx. 1.5 nm.


Pssm-ID: 367362 [Multi-domain]  Cd Length: 786  Bit Score: 78.84  E-value: 6.71e-14
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2022791524  179 TAHEEAGRKQKVIQKEQGKPEEQRGSAKLPAQQQSPKPVQP--------------QGHVKPTPQQSESSKAVPQQQQPGE 244
Cdd:pfam03157  372 TSQQQPQQGQQPEQGQQGQQQGQGQQGQQPGQGQQPGQGQPgyyptspqqsgqgqPGYYPTSPQQSGQGQQPGQGQQPGQ 451
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2022791524  245 --PKQIQKPGPGHPADS--KLEQVKQPPQ------PRGPQksQPQQSEPTKPVQQQTSAKPSAGPAKPL----------- 303
Cdd:pfam03157  452 eqPGQGQQPGQGQQGQQpgQPEQGQQPGQgqpgyyPTSPQ--QSGQGQQLGQWQQQGQGQPGYYPTSPLqpgqgqpgyyp 529
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2022791524  304 --PQQPDSARTSSQAPPPAKPSSQQLGPVKQPSQQPARQGGPVKPSSQQAGPPKQPSQQPGPEKP-----SAQQMGPAKQ 376
Cdd:pfam03157  530 tsPQQPGQGQQLGQLQQPTQGQQGQQSGQGQQGQQPGQGQQGQQPGQGQQGQQPGQGQQPGQGQPgyyptSPQQSGQGQQ 609
                          250       260       270       280       290       300       310
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*..
gi 2022791524  377 A---SQPGPGKPPLQQTGPVKQAPPQSGPTKPPPQtagatkapvpQPGLTKPPGQQPEPEKPSQQKQASATQPAESA 450
Cdd:pfam03157  610 PgqwQQPGQGQPGYYPTSSLQLGQGQQGYYPTSPQ----------QPGQGQQPGQWQQSGQGQQGYYPTSPQQSGQA 676
PRK07764 PRK07764
DNA polymerase III subunits gamma and tau; Validated
328-720 1.46e-10

DNA polymerase III subunits gamma and tau; Validated


Pssm-ID: 236090 [Multi-domain]  Cd Length: 824  Bit Score: 67.70  E-value: 1.46e-10
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2022791524  328 GPVKQPSQQPARQGGPVkPSSQQAGPPKQPSQQPGPEKPSAQQMGPAKQASQPGPGKPPLQQTGPVKQAPPQSGPTKPPP 407
Cdd:PRK07764   389 GGAGAPAAAAPSAAAAA-PAAAPAPAAAAPAAAAAPAPAAAPQPAPAPAPAPAPPSPAGNAPAGGAPSPPPAAAPSAQPA 467
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2022791524  408 QTAGATKAPVPQPGLTKPPGQQPEPEkPSQQKQASATQPA--------------ESAPKKTFcplcTTTELLLHTPEKAN 473
Cdd:PRK07764   468 PAPAAAPEPTAAPAPAPPAAPAPAAA-PAAPAAPAAPAGAddaatlrerwpeilAAVPKRSR----KTWAILLPEATVLG 542
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2022791524  474 Y--NTCTQCHTVVCSLCGFNpNPHITE-----IKEWLCLNCQMQRALGGDLAPGHGPGPQLPPPKQKTPTPPSTAKPPPQ 546
Cdd:PRK07764   543 VrgDTLVLGFSTGGLARRFA-SPGNAEvlvtaLAEELGGDWQVEAVVGPAPGAAGGEGPPAPASSGPPEEAARPAAPAAP 621
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2022791524  547 PQPGQKKDASPKPDPAQLAEPKKPPPQKKQPSmPGSPPVKAKTSRAEPADTSQQiDPAPKSDRAKPSQAEDKQKQPsiqK 626
Cdd:PRK07764   622 AAPAAPAPAGAAAAPAEASAAPAPGVAAPEHH-PKHVAVPDASDGGDGWPAKAG-GAAPAAPPPAPAPAAPAAPAG---A 696
                          330       340       350       360       370       380       390       400
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2022791524  627 PAADTVLTPAAPGLKQDSAGPRPPPTQQKVTDSPKPELAKPSQDTHPA-GEKPDSKPLPQVSRQKSDPKLSAQPGARSEA 705
Cdd:PRK07764   697 APAQPAPAPAATPPAGQADDPAAQPPQAAQGASAPSPAADDPVPLPPEpDDPPDPAGAPAQPPPPPAPAPAAAPAAAPPP 776
                          410
                   ....*....|....*
gi 2022791524  706 KTQKPVEPAQVKDDP 720
Cdd:PRK07764   777 SPPSEEEEMAEDDAP 791
PspC_subgroup_2 NF033839
pneumococcal surface protein PspC, LPXTG-anchored form; The pneumococcal surface protein PspC, ...
208-454 2.54e-10

pneumococcal surface protein PspC, LPXTG-anchored form; The pneumococcal surface protein PspC, as described in Streptococcus pneumoniae, is a repetitive and highly variable protein, recognized by a conserved N-terminal domain and also by genomic location. This form, subgroup 2, is anchored covalently after cleavage by sortase at a C-terminal LPXTG site. The other form, subgroup 1, has variable numbers of a choline-binding repeat in the C-terminal region, and is also known as choline-binding protein A.


Pssm-ID: 468202 [Multi-domain]  Cd Length: 557  Bit Score: 66.72  E-value: 2.54e-10
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2022791524  208 PAQQQSPKPVQPQGHVKPTPQQSEssKAVPQQQQPGEPK---QIQKPGPGHPADSKLEQVKQPPQPRGPQ-KSQPQQSEP 283
Cdd:NF033839   284 PKEPGNKKPSAPKPGMQPSPQPEK--KEVKPEPETPKPEvkpQLEKPKPEVKPQPEKPKPEVKPQLETPKpEVKPQPEKP 361
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2022791524  284 TKPVQQQTSAKPSAGPAKPLPQQPDSARTSSQAPPPAKPSSQQLGPVKQPsqQPARQGGPVKPSSQqagpPKQPSQQPGP 363
Cdd:NF033839   362 KPEVKPQPEKPKPEVKPQPETPKPEVKPQPEKPKPEVKPQPEKPKPEVKP--QPEKPKPEVKPQPE----KPKPEVKPQP 435
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2022791524  364 EKPSAQqmgPAKQASQPGPGKPPLQQTgPVKQAPPQsgPTKPPPQTAGATKAPVPQPGLTKPPGQQPE-PEKPSQQKQ-- 440
Cdd:NF033839   436 EKPKPE---VKPQPEKPKPEVKPQPET-PKPEVKPQ--PEKPKPEVKPQPEKPKPDNSKPQADDKKPStPNNLSKDKQps 509
                          250
                   ....*....|....*.
gi 2022791524  441 --ASATQPAESAPKKT 454
Cdd:NF033839   510 nqASTNEKATNKPKKS 525
PBP1 COG5180
PAB1-binding protein, interacts with poly(A)-binding protein [RNA processing and modification]; ...
176-454 4.49e-08

PAB1-binding protein, interacts with poly(A)-binding protein [RNA processing and modification];


Pssm-ID: 444064 [Multi-domain]  Cd Length: 548  Bit Score: 59.31  E-value: 4.49e-08
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2022791524  176 DSDTAHEEAGRKQKVIQKE-QGKPEEQRGSAKLP-AQQQSPKPVqpqghvkPTPQQSESSKAVPQQQQPGEPKQIQKPGP 253
Cdd:COG5180    191 ALKDSPEKLDRPKVEVKDEaQEEPPDLTGGADHPrPEAASSPKV-------DPPSTSEARSRPATVDAQPEMRPPADAKE 263
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2022791524  254 GHPADSKLEQVKQPP-QPRGPQKSQPQQSEPTKPVQQQTSAKPSAG-PAKPLPQQPDSARTSSQAPPPAKPSSQqlgPVK 331
Cdd:COG5180    264 RRRAAIGDTPAAEPPgLPVLEAGSEPQSDAPEAETARPIDVKGVASaPPATRPVRPPGGARDPGTPRPGQPTER---PAG 340
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2022791524  332 QPsqQPARQGGPvKPSSQQAGPPKQPSQQPGPEKPSAQQMGPAKQASQPGPGKPPLQQTGPVKQAPPQS--GPTKPPPQT 409
Cdd:COG5180    341 VP--EAASDAGQ-PPSAYPPAEEAVPGKPLEQGAPRPGSSGGDGAPFQPPNGAPQPGLGRRGAPGPPMGagDLVQAALDG 417
                          250       260       270       280       290
                   ....*....|....*....|....*....|....*....|....*....|
gi 2022791524  410 AGATKAPVPQPG-----LTKPPGQQPEPEKPSQQKQASATQPAESAPKKT 454
Cdd:COG5180    418 GGRETASLGGAAggagqGPKADFVPGDAESVSGPAGLADQAGAAASTAMA 467
PTZ00121 PTZ00121
MAEBL; Provisional
1023-1629 7.50e-08

MAEBL; Provisional


Pssm-ID: 173412 [Multi-domain]  Cd Length: 2084  Bit Score: 59.38  E-value: 7.50e-08
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2022791524 1023 QKTSTPPTPAAAKPKEEPGVQKEVPKLQQGRleKTLSADKIQQGVQKEDAKLKQGKLfKTPSADKIRVSQK-EDPR---- 1097
Cdd:PTZ00121  1104 KKTETGKAEEARKAEEAKKKAEDARKAEEAR--KAEDARKAEEARKAEDAKRVEIAR-KAEDARKAEEARKaEDAKkaea 1180
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2022791524 1098 -------LQQTKLTKTPSSDKILHGVQKEDA-KFQEAKLAKTSSADKILHGVQKEDIKPQEAKLAKLpsadkilhgiQKE 1169
Cdd:PTZ00121  1181 arkaeevRKAEELRKAEDARKAEAARKAEEErKAEEARKAEDAKKAEAVKKAEEAKKDAEEAKKAEE----------ERN 1250
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2022791524 1170 DLKLQQMKMAKALSADKIQPSAQKEDAQLQEvKLSKAVSADKIQHGIQKEDpnlqhvkIAKASLVEKIQQEAQK-EESKL 1248
Cdd:PTZ00121  1251 NEEIRKFEEARMAHFARRQAAIKAEEARKAD-ELKKAEEKKKADEAKKAEE-------KKKADEAKKKAEEAKKaDEAKK 1322
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2022791524 1249 QQEKLSKTLSADKIPATVSSDQKKTLSKFEEDKKPLL--PEKKAPHSEDKKEQiiAETKDHVAERKGEVEVPCEELHAKK 1326
Cdd:PTZ00121  1323 KAEEAKKKADAAKKKAEEAKKAAEAAKAEAEAAADEAeaAEEKAEAAEKKKEE--AKKKADAAKKKAEEKKKADEAKKKA 1400
                          330       340       350       360       370       380       390       400
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2022791524 1327 QEDAKKEGLTTGIPQMVLKAE----KAQE----EETPVQMSRVPRSDHV-ETVREKTEKEDDKSDTSSSQQQKSPQGLSD 1397
Cdd:PTZ00121  1401 EEDKKKADELKKAAAAKKKADeakkKAEEkkkaDEAKKKAEEAKKADEAkKKAEEAKKAEEAKKKAEEAKKADEAKKKAE 1480
                          410       420       430       440       450       460       470       480
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2022791524 1398 TGYSSDGISSSLGEIPSHipSDEKDLLRESSKK--DTVSQESPPSPSDLAKLESTVLSILEAQASTLSDEKSVKKKELYE 1475
Cdd:PTZ00121  1481 EAKKADEAKKKAEEAKKK--ADEAKKAAEAKKKadEAKKAEEAKKADEAKKAEEAKKADEAKKAEEKKKADELKKAEELK 1558
                          490       500       510       520       530       540       550       560
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2022791524 1476 TYSEQTK-------DQHKTKPLPVTPESYSSDEEDLEGEGTIVEDGKGSASSQAEYKEgdgEDDIPARRQRydsvedSSE 1548
Cdd:PTZ00121  1559 KAEEKKKaeeakkaEEDKNMALRKAEEAKKAEEARIEEVMKLYEEEKKMKAEEAKKAE---EAKIKAEELK------KAE 1629
                          570       580       590       600       610       620       630       640
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2022791524 1549 SENSPVPRRKRRGSVGSSSSDEYKRDDSQGSGDEEDFIRKQIIEMSADEDASGSEDDEFIRNQLKEISVTESQKKEDVKS 1628
Cdd:PTZ00121  1630 EEKKKVEQLKKKEAEEKKKAEELKKAEEENKIKAAEEAKKAEEDKKKAEEAKKAEEDEKKAAEALKKEAEEAKKAEELKK 1709

                   .
gi 2022791524 1629 K 1629
Cdd:PTZ00121  1710 K 1710
PHA03247 PHA03247
large tegument protein UL36; Provisional
580-1054 1.98e-07

large tegument protein UL36; Provisional


Pssm-ID: 223021 [Multi-domain]  Cd Length: 3151  Bit Score: 58.03  E-value: 1.98e-07
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2022791524  580 PGSPPVKAKTSRAEPADTSQQIDPAPKSDRAKPSQAEDKQKQPSIQKPAADTVLTPAAPGLKQDSAGPRPPPTqqkvtdS 659
Cdd:PHA03247  2557 PAAPPAAPDRSVPPPRPAPRPSEPAVTSRARRPDAPPQSARPRAPVDDRGDPRGPAPPSPLPPDTHAPDPPPP------S 2630
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2022791524  660 PKP---ELAKPSQDTHPAGEKPDSKPLP-QVSRQKSDPKLSAQPGARSEAKTQKPVEPAQVKDDPKKLQTKPSPKPETKP 735
Cdd:PHA03247  2631 PSPaanEPDPHPPPTVPPPERPRDDPAPgRVSRPRRARRLGRAAQASSPPQRPRRRAARPTVGSLTSLADPPPPPPTPEP 2710
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2022791524  736 APKGLQPGagpkpvpaqpapqpqQPQKTPEQPRRFSLNLGGITDTPKPQPTTPQETVTGRLFGFGASIFSQASNLISTAG 815
Cdd:PHA03247  2711 APHALVSA---------------TPLPPGPAAARQASPALPAAPAPPAVPAGPATPGGPARPARPPTTAGPPAPAPPAAP 2775
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2022791524  816 QPGSQTSGPPPAGPAAKQPQAP----------AQPPASQAPPKDTAQTQSSPKAVP--VKKEAKPLTSEKMEPTKVDSVL 883
Cdd:PHA03247  2776 AAGPPRRLTRPAVASLSESRESlpspwdpadpPAAVLAPAAALPPAASPAGPLPPPtsAQPTAPPPPPGPPPPSLPLGGS 2855
                          330       340       350       360       370       380       390       400
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2022791524  884 TTKGSDLEKKPSLAKDSKPQAAEAKKPAGVLEPEKASHPKVSCPLcktglnvgskdPPNfntcteckkvvcnlcgfnpmp 963
Cdd:PHA03247  2856 VAPGGDVRRRPPSRSPAAKPAAPARPPVRRLARPAVSRSTESFAL-----------PPD--------------------- 2903
                          410       420       430       440       450       460       470       480
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2022791524  964 hiaevqewlclncqtqramsgqlgdmGKVPLPKPGPSQPvSKPPATPQKQPVPAVSHFPQKTSTPPTPAAAKPKEEPGVQ 1043
Cdd:PHA03247  2904 --------------------------QPERPPQPQAPPP-PQPQPQPPPPPQPQPPPPPPPRPQPPLAPTTDPAGAGEPS 2956
                          490
                   ....*....|.
gi 2022791524 1044 KEVPKLQQGRL 1054
Cdd:PHA03247  2957 GAVPQPWLGAL 2967
PspC_subgroup_2 NF033839
pneumococcal surface protein PspC, LPXTG-anchored form; The pneumococcal surface protein PspC, ...
183-413 2.73e-07

pneumococcal surface protein PspC, LPXTG-anchored form; The pneumococcal surface protein PspC, as described in Streptococcus pneumoniae, is a repetitive and highly variable protein, recognized by a conserved N-terminal domain and also by genomic location. This form, subgroup 2, is anchored covalently after cleavage by sortase at a C-terminal LPXTG site. The other form, subgroup 1, has variable numbers of a choline-binding repeat in the C-terminal region, and is also known as choline-binding protein A.


Pssm-ID: 468202 [Multi-domain]  Cd Length: 557  Bit Score: 56.70  E-value: 2.73e-07
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2022791524  183 EAGRKQKVIQKEQGKPEeQRGSAKLPAQQQSPKPVQPQGHVKPTPQQSESS-KAVPQQQQPGEPKQIQKPGPGHPADSKL 261
Cdd:NF033839   304 QPEKKEVKPEPETPKPE-VKPQLEKPKPEVKPQPEKPKPEVKPQLETPKPEvKPQPEKPKPEVKPQPEKPKPEVKPQPET 382
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2022791524  262 EQVKQPPQPRGPQKSQPQQSEPTKPVQQQTSAKPSAgPAKPLPQQPDSARTSSQAPPPAKPSSQQLGPVKQPSQQPARQG 341
Cdd:NF033839   383 PKPEVKPQPEKPKPEVKPQPEKPKPEVKPQPEKPKP-EVKPQPEKPKPEVKPQPEKPKPEVKPQPEKPKPEVKPQPETPK 461
                          170       180       190       200       210       220       230
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|..
gi 2022791524  342 GPVKPssqQAGPPKqPSQQPGPEKPSAQQMGPAKQASQPGPGKPPLQQTGPVKQAPPQSGPTKPPPQTAGAT 413
Cdd:NF033839   462 PEVKP---QPEKPK-PEVKPQPEKPKPDNSKPQADDKKPSTPNNLSKDKQPSNQASTNEKATNKPKKSLPST 529
PspC_subgroup_2 NF033839
pneumococcal surface protein PspC, LPXTG-anchored form; The pneumococcal surface protein PspC, ...
559-742 1.35e-06

pneumococcal surface protein PspC, LPXTG-anchored form; The pneumococcal surface protein PspC, as described in Streptococcus pneumoniae, is a repetitive and highly variable protein, recognized by a conserved N-terminal domain and also by genomic location. This form, subgroup 2, is anchored covalently after cleavage by sortase at a C-terminal LPXTG site. The other form, subgroup 1, has variable numbers of a choline-binding repeat in the C-terminal region, and is also known as choline-binding protein A.


Pssm-ID: 468202 [Multi-domain]  Cd Length: 557  Bit Score: 54.77  E-value: 1.35e-06
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2022791524  559 PDPAQLAEPKKPPPQKKQPSMPGSPPVKAKTSRAEPADTSQQIDPAPKSDRAKPSQAEDKQKQPSIQKPAADTVLTPAAP 638
Cdd:NF033839   295 PKPGMQPSPQPEKKEVKPEPETPKPEVKPQLEKPKPEVKPQPEKPKPEVKPQLETPKPEVKPQPEKPKPEVKPQPEKPKP 374
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2022791524  639 GLKQDSAGPRPP----PTQQKVTDSPKPELAKPSQDTHPAGEKPDSKPLPQVSRQKSDPKL-SAQPGARSEAKTQKP-VE 712
Cdd:NF033839   375 EVKPQPETPKPEvkpqPEKPKPEVKPQPEKPKPEVKPQPEKPKPEVKPQPEKPKPEVKPQPeKPKPEVKPQPEKPKPeVK 454
                          170       180       190
                   ....*....|....*....|....*....|
gi 2022791524  713 PAQVKDDPKKLQTKPSPKPETKPAPKGLQP 742
Cdd:NF033839   455 PQPETPKPEVKPQPEKPKPEVKPQPEKPKP 484
PspC_subgroup_2 NF033839
pneumococcal surface protein PspC, LPXTG-anchored form; The pneumococcal surface protein PspC, ...
579-747 2.75e-06

pneumococcal surface protein PspC, LPXTG-anchored form; The pneumococcal surface protein PspC, as described in Streptococcus pneumoniae, is a repetitive and highly variable protein, recognized by a conserved N-terminal domain and also by genomic location. This form, subgroup 2, is anchored covalently after cleavage by sortase at a C-terminal LPXTG site. The other form, subgroup 1, has variable numbers of a choline-binding repeat in the C-terminal region, and is also known as choline-binding protein A.


Pssm-ID: 468202 [Multi-domain]  Cd Length: 557  Bit Score: 53.62  E-value: 2.75e-06
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2022791524  579 MPGSPPVKAKTSRAEPADTSQQIDPAPKSDRAKPSQAEDKQK-----QPSIQKPAADTVLTPAAPGLKQDSAGPRP--PP 651
Cdd:NF033839   299 MQPSPQPEKKEVKPEPETPKPEVKPQLEKPKPEVKPQPEKPKpevkpQLETPKPEVKPQPEKPKPEVKPQPEKPKPevKP 378
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2022791524  652 TQQKVTDSPKPELAKPSQDTHPAGEKPDSKPLPQVSRQKSDPKLSAQPGaRSEAKTQKPVEPAQVKDDPKKlqtkpsPKP 731
Cdd:NF033839   379 QPETPKPEVKPQPEKPKPEVKPQPEKPKPEVKPQPEKPKPEVKPQPEKP-KPEVKPQPEKPKPEVKPQPEK------PKP 451
                          170
                   ....*....|....*.
gi 2022791524  732 ETKPAPKGLQPGAGPK 747
Cdd:NF033839   452 EVKPQPETPKPEVKPQ 467
PDZ pfam00595
PDZ domain; PDZ domains are found in diverse signaling proteins.
4537-4608 3.92e-06

PDZ domain; PDZ domains are found in diverse signaling proteins.


Pssm-ID: 395476 [Multi-domain]  Cd Length: 81  Bit Score: 47.66  E-value: 3.92e-06
                           10        20        30        40        50        60        70
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|..
gi 2022791524 4537 KDHTVSGNGLGIRVVGGkeipGSSGETGAYIAKILPGGNAEQTGkLIEGMQVLEWNGIPLTGKTYEEVQNII 4608
Cdd:pfam00595    3 TLEKDGRGGLGFSLKGG----SDQGDPGIFVSEVLPGGAAEAGG-LKVGDRILSINGQDVENMTHEEAVLAL 69
Amelin smart00817
Ameloblastin precursor (Amelin); This family consists of several mammalian Ameloblastin ...
273-407 5.22e-06

Ameloblastin precursor (Amelin); This family consists of several mammalian Ameloblastin precursor (Amelin) proteins. Matrix proteins of tooth enamel consist mainly of amelogenin but also of non-amelogenin proteins, which, although their volumetric percentage is low, have an important role in enamel mineralisation. One of the non-amelogenin proteins is ameloblastin, also known as amelin and sheathlin. Ameloblastin (AMBN) is one of the enamel sheath proteins which is though to have a role in determining the prismatic structure of growing enamel crystals.


Pssm-ID: 214832 [Multi-domain]  Cd Length: 411  Bit Score: 52.20  E-value: 5.22e-06
                            10        20        30        40        50        60        70        80
                    ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2022791524   273 PQKSQPQQSEPTKPVQQQTSAKPSAGPakPLPQQPdsartssqappPAKPSSQQLGPVKQPSQQPARQGGPVKPSSQQAG 352
Cdd:smart00817   72 PHSSFPWLRPREHETQQYEYSLPVHPP--PLPSQP-----------SLQPQQPGLKPFLQPTALPTNQATPQKNGPQPPM 138
                            90       100       110       120       130       140
                    ....*....|....*....|....*....|....*....|....*....|....*....|....*
gi 2022791524   353 PPKQPSQQPGPEKPSAQQMGPAKQASQP--------GPGKPPLQQTGP--VKQAPPQSGPTKPPP 407
Cdd:smart00817  139 HLGQPPLQQAELPMIPPQVAPSDKPPQTelplydfaDPQNPLLFQIAHlmSRGPMPQNKQQHLYP 203
PspC_subgroup_1 NF033838
pneumococcal surface protein PspC, choline-binding form; The pneumococcal surface protein PspC, ...
176-405 3.24e-05

pneumococcal surface protein PspC, choline-binding form; The pneumococcal surface protein PspC, as described in Streptococcus pneumoniae, is a repetitive and highly variable protein, recognized by a conserved N-terminal domain and also by genomic location. This form, subgroup 1, has variable numbers of a choline-binding repeat in the C-terminal region, and is also known as choline-binding protein A. The other form, subgroup 2, is anchored covalently after cleavage by sortase at a C-terminal LPXTG site.


Pssm-ID: 468201 [Multi-domain]  Cd Length: 684  Bit Score: 50.40  E-value: 3.24e-05
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2022791524  176 DSDTAHEEAGRKQKVIQKEQGKPEEQRGSAKLPAQQQSPKPVqPQGHVKPTPQQSESSKAVPQQQQPGEpKQIQKPGPGH 255
Cdd:NF033838   254 ATSEQDKPKRRAKRGVLGEPATPDKKENDAKSSDSSVGEETL-PSPSLKPEKKVAEAEKKVEEAKKKAK-DQKEEDRRNY 331
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2022791524  256 PA------------------DSKLEQVKQ-PPQPRGPQK-SQPQ------QSEPTKPVQQQTSAKPSAGPAKPLPQQPDS 309
Cdd:NF033838   332 PTntyktleleiaesdvkvkEAELELVKEeAKEPRNEEKiKQAKakveskKAEATRLEKIKTDRKKAEEEAKRKAAEEDK 411
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2022791524  310 ARTSsqapppakpssqqlgPVKQPSQQPARQggPVKPSSQqagpPKQPSQQPGPEKPSAQQMGPAKQASQPGPGKPPLQQ 389
Cdd:NF033838   412 VKEK---------------PAEQPQPAPAPQ--PEKPAPK----PEKPAEQPKAEKPADQQAEEDYARRSEEEYNRLTQQ 470
                          250
                   ....*....|....*.
gi 2022791524  390 TGPVKQAPPQsgPTKP 405
Cdd:NF033838   471 QPPKTEKPAQ--PSTP 484
SP2_N cd22540
N-terminal domain of transcription factor Specificity Protein (SP) 2; Specificity Proteins ...
204-451 3.34e-05

N-terminal domain of transcription factor Specificity Protein (SP) 2; Specificity Proteins (SPs) are transcription factors that are involved in many cellular processes, including cell differentiation, cell growth, apoptosis, immune responses, response to DNA damage, and chromatin remodeling. SP2 contains the least conserved DNA-binding domain within the SP subfamily of proteins, and its DNA sequence specificity differs from the other SP proteins. It localizes primarily within subnuclear foci associated with the nuclear matrix, and can activate, or in some cases, repress expression from different promoters. The transcription factor SP2 serves as a paradigm for indirect genomic binding. It does not require its DNA-binding domain for genomic DNA binding and occupies target promoters independently of whether they contain a cognate DNA-binding motif. SP2 belongs to a family of proteins, called the SP/Kruppel or Krueppel-like Factor (KLF) family, characterized by a C-terminal DNA-binding domain of 81 amino acids consisting of three Kruppel-like C2H2 zinc fingers. These factors bind to a loose consensus motif, namely NNRCRCCYY (where N is any nucleotide; R is A/G, and Y is C/T), such as the recurring motifs in GC and GT boxes (5'-GGGGCGGGG-3' and 5-GGTGTGGGG-3') that are present in promoters and more distal regulatory elements of mammalian genes. SP factors preferentially bind GC boxes, while KLFs bind CACCC boxes. Another characteristic hallmark of SP factors is the presence of the Buttonhead (BTD) box CXCPXC, just N-terminal to the zinc fingers. The function of the BTD box is unknown, but it is thought to play an important physiological role. Another feature of most SP factors is the presence of a conserved amino acid stretch, the so-called SP box, located close to the N-terminus. SP factors may be separated into three groups based on their domain architecture and the similarity of their N-terminal transactivation domains: SP1-4, SP5, and SP6-9. The transactivation domains between the three groups are not homologous to one another. SP1-4 have similar N-terminal transactivation domains characterized by glutamine-rich regions, which, in most cases, have adjacent serine/threonine-rich regions. This model represents the N-terminal domain of SP2.


Pssm-ID: 411776 [Multi-domain]  Cd Length: 511  Bit Score: 49.92  E-value: 3.34e-05
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2022791524  204 SAKLPAQQQSPKPVQPQghvKPTPQQSESSKAVPQQQQPGepkqIQKPGPGHPADSKLEQVKQPPQPRGPQK-SQPQQSE 282
Cdd:cd22540      9 SEYLQPAASTTQDSQPS---PLALLAATCSKIGPPAVEAA----VTPPAPPQPTPRKLVPIKPAPLPLGPGKnSIGFLSA 81
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2022791524  283 PTKPVQQQTSAKPSAGPAKPLP---QQPDSARTSSQAPPPAKPS---SQQLGPVKQPSQQPARQGGPVKPSSQQAGPPKQ 356
Cdd:cd22540     82 KGNIIQLQGSQLSSSAPGGQQVfaiQNPTMIIKGSQTRSSTNQQyqiSPQIQAAGQINNSGQIQIIPGTNQAIITPVQVL 161
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2022791524  357 PSQQpgpekpsaqqmgpakQASQPGPGKPPLQQTGPVKQAPPQSGPTKPPPQTAGATKAPVPQPGLT---KPPGQQPEPE 433
Cdd:cd22540    162 QQPQ---------------QAHKPVPIKPAPLQTSNTNSASLQVPGNVIKLQSGGNVALTLPVNNLVgtqDGATQLQLAA 226
                          250
                   ....*....|....*...
gi 2022791524  434 KPSQQKQASATQPAESAP 451
Cdd:cd22540    227 APSKPSKKIRKKSAQAAQ 244
PspC_subgroup_2 NF033839
pneumococcal surface protein PspC, LPXTG-anchored form; The pneumococcal surface protein PspC, ...
604-747 4.96e-05

pneumococcal surface protein PspC, LPXTG-anchored form; The pneumococcal surface protein PspC, as described in Streptococcus pneumoniae, is a repetitive and highly variable protein, recognized by a conserved N-terminal domain and also by genomic location. This form, subgroup 2, is anchored covalently after cleavage by sortase at a C-terminal LPXTG site. The other form, subgroup 1, has variable numbers of a choline-binding repeat in the C-terminal region, and is also known as choline-binding protein A.


Pssm-ID: 468202 [Multi-domain]  Cd Length: 557  Bit Score: 49.38  E-value: 4.96e-05
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2022791524  604 APKSDRAKPSQAEDKQKQPSIQKPAADTVLTPAAPG--LKQDSAGPRPP----PTQQKVTDSPKPELAKPSQDTHPAGEK 677
Cdd:NF033839   294 APKPGMQPSPQPEKKEVKPEPETPKPEVKPQLEKPKpeVKPQPEKPKPEvkpqLETPKPEVKPQPEKPKPEVKPQPEKPK 373
                           90       100       110       120       130       140       150
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|.
gi 2022791524  678 PDSKPLPQVSRQKSDPKLSA-QPGARSEAKTQKPvepaQVKDDPKKlqtkpsPKPETKPAPKGLQPGAGPK 747
Cdd:NF033839   374 PEVKPQPETPKPEVKPQPEKpKPEVKPQPEKPKP----EVKPQPEK------PKPEVKPQPEKPKPEVKPQ 434
PspC_subgroup_2 NF033839
pneumococcal surface protein PspC, LPXTG-anchored form; The pneumococcal surface protein PspC, ...
259-457 1.59e-04

pneumococcal surface protein PspC, LPXTG-anchored form; The pneumococcal surface protein PspC, as described in Streptococcus pneumoniae, is a repetitive and highly variable protein, recognized by a conserved N-terminal domain and also by genomic location. This form, subgroup 2, is anchored covalently after cleavage by sortase at a C-terminal LPXTG site. The other form, subgroup 1, has variable numbers of a choline-binding repeat in the C-terminal region, and is also known as choline-binding protein A.


Pssm-ID: 468202 [Multi-domain]  Cd Length: 557  Bit Score: 47.84  E-value: 1.59e-04
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2022791524  259 SKLEQVKQPPQPRGPQKSQPQQSEPtkpvQQQTSAKPSAGPAKPLPQQPDSARTSSQAPPPAKPSSQQLGPVKQPSQQPA 338
Cdd:NF033839   273 TKFKKGLTQDTPKEPGNKKPSAPKP----GMQPSPQPEKKEVKPEPETPKPEVKPQLEKPKPEVKPQPEKPKPEVKPQLE 348
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2022791524  339 RQGGPVKPSSQQAGPPKQPsqQPGPEKPSAQ-QMGPAKQASQPGPGKP-PLQQTGPVKQAPP-QSGPTKPPPQTAGATKA 415
Cdd:NF033839   349 TPKPEVKPQPEKPKPEVKP--QPEKPKPEVKpQPETPKPEVKPQPEKPkPEVKPQPEKPKPEvKPQPEKPKPEVKPQPEK 426
                          170       180       190       200
                   ....*....|....*....|....*....|....*....|...
gi 2022791524  416 PVPQpglTKPPGQQPEPE-KPSQQKQASATQPAESAPKKTFCP 457
Cdd:NF033839   427 PKPE---VKPQPEKPKPEvKPQPEKPKPEVKPQPETPKPEVKP 466
PspC_subgroup_2 NF033839
pneumococcal surface protein PspC, LPXTG-anchored form; The pneumococcal surface protein PspC, ...
208-448 1.73e-04

pneumococcal surface protein PspC, LPXTG-anchored form; The pneumococcal surface protein PspC, as described in Streptococcus pneumoniae, is a repetitive and highly variable protein, recognized by a conserved N-terminal domain and also by genomic location. This form, subgroup 2, is anchored covalently after cleavage by sortase at a C-terminal LPXTG site. The other form, subgroup 1, has variable numbers of a choline-binding repeat in the C-terminal region, and is also known as choline-binding protein A.


Pssm-ID: 468202 [Multi-domain]  Cd Length: 557  Bit Score: 47.84  E-value: 1.73e-04
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2022791524  208 PAQQQSPKPVQPQGHVKPTPQQSESSKAVPQQQQPGE----------PKQIQKPGPGHPADSKLEQVKQPPQPRGPQKSQ 277
Cdd:NF033839   164 PENPEHQKPTTPAPDTKPSPQPEGKKPSVPDINQEKEkaklavatymSKILDDIQKHHLQKEKHRQIVALIKELDELKKQ 243
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2022791524  278 --PQQSEPTKPVQQQTSAKPSAGPAKPLPQQPDSARTSSQAPPPAKPSSQQLGPVKQPSQQParQGGPVKPSSQQAGPPK 355
Cdd:NF033839   244 alSEIDNVNTKVEIENTVHKIFADMDAVVTKFKKGLTQDTPKEPGNKKPSAPKPGMQPSPQP--EKKEVKPEPETPKPEV 321
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2022791524  356 QPSQQ-------PGPEKPSAQ---QMGPAKQASQPGPGKPP----LQQTGPVKQAPPQSGPTKPPPQTAGATKAPVPQPG 421
Cdd:NF033839   322 KPQLEkpkpevkPQPEKPKPEvkpQLETPKPEVKPQPEKPKpevkPQPEKPKPEVKPQPETPKPEVKPQPEKPKPEVKPQ 401
                          250       260
                   ....*....|....*....|....*...
gi 2022791524  422 LTKP-PGQQPEPEKPSQQKQASATQPAE 448
Cdd:NF033839   402 PEKPkPEVKPQPEKPKPEVKPQPEKPKP 429
Streccoc_I_II NF033804
antigen I/II family LPXTG-anchored adhesin; Members of the antigen I/II family are adhesins ...
192-323 2.29e-04

antigen I/II family LPXTG-anchored adhesin; Members of the antigen I/II family are adhesins with a glucan-binding domain, two types of repetitive regions, an isopeptide bond-forming domain associated with shear resistance, and a C-terminal LPXTG motif for anchoring to the cell wall. They occur in oral Streptococci, and tend to be major cell surface adhesins. Members of this family include SspA and SspB from Streptococcus gordonii, antigen I/II from S. mutans, etc.


Pssm-ID: 468188 [Multi-domain]  Cd Length: 1552  Bit Score: 47.63  E-value: 2.29e-04
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2022791524  192 QKEQGKPEEQRGSAKLPAQqqsPKPVQPQGHVKPTPQQSESSKAVPQQqqPGEPK-QIQKPGPGHPADSKLEQVKQPPQP 270
Cdd:NF033804   874 QPEPSKPEEPTYETEKPLE---PAPVAPTYENEPTPPVKTPDQPEPSK--PEEPTyETEKPLEPAPVAPSYENEPTPPVK 948
                           90       100       110       120       130
                   ....*....|....*....|....*....|....*....|....*....|....
gi 2022791524  271 rgpqksQPQQSEPTKPVQqqtsakPSagpAKPLPQQPdSARTSSQAP-PPAKPS 323
Cdd:NF033804   949 ------TPDQPEPSKPVE------PT---YDPLPTPP-VAPTPKQLPtPPAVPT 986
PspC_subgroup_1 NF033838
pneumococcal surface protein PspC, choline-binding form; The pneumococcal surface protein PspC, ...
175-271 4.05e-04

pneumococcal surface protein PspC, choline-binding form; The pneumococcal surface protein PspC, as described in Streptococcus pneumoniae, is a repetitive and highly variable protein, recognized by a conserved N-terminal domain and also by genomic location. This form, subgroup 1, has variable numbers of a choline-binding repeat in the C-terminal region, and is also known as choline-binding protein A. The other form, subgroup 2, is anchored covalently after cleavage by sortase at a C-terminal LPXTG site.


Pssm-ID: 468201 [Multi-domain]  Cd Length: 684  Bit Score: 46.54  E-value: 4.05e-04
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2022791524  175 SDSDTAHEEAGRKQKVIQKEQGKPEEQRGSAKLPA-QQQSPKPVQPQGH---VKPTPQQSESSKAVPQQQQPGEPKQIQK 250
Cdd:NF033838   393 TDRKKAEEEAKRKAAEEDKVKEKPAEQPQPAPAPQpEKPAPKPEKPAEQpkaEKPADQQAEEDYARRSEEEYNRLTQQQP 472
                           90       100
                   ....*....|....*....|.
gi 2022791524  251 PGPGHPAdskleqvkQPPQPR 271
Cdd:NF033838   473 PKTEKPA--------QPSTPK 485
CCDC47 pfam07946
PAT complex subunit CCDC47; This family represents CCDC47 proteins which are a component of ...
3782-3845 4.23e-03

PAT complex subunit CCDC47; This family represents CCDC47 proteins which are a component of the PAT complex, an endoplasmic reticulum (ER)-resident membrane multiprotein complex that facilitates multi-pass membrane proteins insertion into membranes. The PAT complex, formed by CCDC47 and Asterix proteins, acts as an intramembrane chaperone by directly interacting with nascent transmembrane domains (TMDs), releasing its substrates upon correct folding, and is needed for optimal biogenesis of multi-pass membrane proteins. CCDC47 is required to maintain the stability of Asterix. CCDC47 is associated with various membrane-associated processes and is component of a ribosome-associated ER translocon complex involved in multi-pass membrane protein transport into the ER membrane and biogenesis. It is also involved in the regulation of calcium ion homeostasis in the ER, being also required for proper protein degradation via the ERAD (ER-associated degradation) pathway.


Pssm-ID: 462322  Cd Length: 323  Bit Score: 42.55  E-value: 4.23e-03
                           10        20        30        40        50        60
                   ....*....|....*....|....*....|....*....|....*....|....*....|....
gi 2022791524 3782 ARAKILQDIDRELDLVERESAKLRKKQAELDEEEKEIDaklrylEMGINRRKEALLKEREKRER 3845
Cdd:pfam07946  265 TREEEIEKIKKAAEEERAEEAQEKKEEAKKKEREEKLA------KLSPEEQRKYEEKERKKEQR 322
PABP-1234 TIGR01628
polyadenylate binding protein, human types 1, 2, 3, 4 family; These eukaryotic proteins ...
186-296 4.62e-03

polyadenylate binding protein, human types 1, 2, 3, 4 family; These eukaryotic proteins recognize the poly-A of mRNA and consists of four tandem RNA recognition domains at the N-terminus (rrm: pfam00076) followed by a PABP-specific domain (pfam00658) at the C-terminus. The protein is involved in the transport of mRNA's from the nucleus to the cytoplasm. There are four paralogs in Homo sapiens which are expressed in testis, platelets, broadly expressed and of unknown tissue range.


Pssm-ID: 130689 [Multi-domain]  Cd Length: 562  Bit Score: 43.26  E-value: 4.62e-03
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2022791524  186 RKQKVIQKEQGKPEEQRGSAKLPAQQQSPKPVQPQGHVKPTPQQSESSKAVPQQQQPgepkqiqkPGPGHPADSKLEQVK 265
Cdd:TIGR01628  401 YGQGPQQQFNGQPLGWPRMSMMPTPMGPGGPLRPNGLAPMNAVRAPSRNAQNAAQKP--------PMQPVMYPPNYQSLP 472
                           90       100       110
                   ....*....|....*....|....*....|.
gi 2022791524  266 QPPQPRGPQKSQPQQSEPTKPVQQQTSAKPS 296
Cdd:TIGR01628  473 LSQDLPQPQSTASQGGQNKKLAQVLASATPQ 503
 
Name Accession Description Interval E-value
PDZ_RIM-like cd06714
PDZ domain of Rab3-interacting molecule 1 (RIM), RIM2, piccolo and related domains; PDZ ...
4525-4618 1.11e-47

PDZ domain of Rab3-interacting molecule 1 (RIM), RIM2, piccolo and related domains; PDZ (PSD-95 (Postsynaptic density protein 95), Dlg (Discs large protein), and ZO-1 (Zonula occludens-1)) domain of RIM, RIM2, piccolo and related domains. RIM proteins and Gallus gallus protein piccolo (also called aczonin) are involved in neurotransmitter release at presynaptic active zones, the site of vesicle fusion. A protein complex containing RIM proteins positions synaptic vesicles containing synaptotagmin at the active zone. RIM proteins simultaneously activate docking and priming of synaptic vesicles and recruit Ca2+-channels to active zones, thereby connecting primed synaptic vesicles to Ca2+-channels. RIM binding to vesicular Rab proteins (Rab3 and Rab27 isoforms) mediates vesicle docking; RIM binding to Munc13 activates vesicle priming; RIM binding to the Ca2+-channel, both directly and indirectly via RIM-BP, recruits the Ca2+-channels. The RIM PDZ domain interacts with the C-termini of N- and P/Q-type voltage-gated Ca2+-channels. RIM1, RIM2 and piccolo also participate in regulated exocytosis through binding cAMP-GEFII (cAMP-binding protein-guanidine nucleotide exchange factor II). The piccolo PDZ domain binds cAMP-GEFII. RIM2 also plays a role in dendrite formation by melanocytes. Caenorhabditis elegans RIM (also known as unc-10) may be involved in the regulation of defecation and daumone response. PDZ domains usually bind in a sequence-specific manner to short peptide sequences located at the C-terminal end of their partner proteins (known as PDZ binding motifs). The PDZ superfamily includes canonical PDZ domains as well as those with circular permutations and domain swapping mediated by beta-strands. This RIM-like family domain is a canonical PDZ domain containing six beta-strands A-F and two alpha-helices (alpha-helix 1 and 2), arranged in the order: beta-strands A, B, C, alpha-helix 1, beta-strands D, E, alpha-helix 2 and beta-strand F.


Pssm-ID: 467198 [Multi-domain]  Cd Length: 95  Bit Score: 166.57  E-value: 1.11e-47
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2022791524 4525 FPHARLKLLRDPKDHTVSGNGLGIRVVGGKEIPgsSGETGAYIAKILPGGNAEQTGKLIEGMQVLEWNGIPLTGKTYEEV 4604
Cdd:cd06714      2 FLIGRIILQRDPKDGSVSGNGLGLKVVGGKMTE--SGRLGAYVTKVKPGSVADTVGHLREGDEVLEWNGISLQGKTFEEV 79
                           90
                   ....*....|....
gi 2022791524 4605 QNIIIQQSGEAEIC 4618
Cdd:cd06714     80 QDIISQSKGEVELV 93
FYVE1_PCLO cd15774
FYVE-related domain 1 found in protein piccolo; Protein piccolo, also termed aczonin, is a ...
454-515 2.31e-43

FYVE-related domain 1 found in protein piccolo; Protein piccolo, also termed aczonin, is a neuron-specific presynaptic active zone scaffolding protein that mainly interacts with a detergent-resistant cytoskeletal-like subcellular fraction and is involved in the organization of the interplay between neurotransmitter vesicles, the cytoskeleton, and the plasma membrane at synaptic active zones. It binds profilin, an actin-binding protein implicated in actin cytoskeletal dynamics. It also functions as a presynaptic low-affinity Ca2+ sensor and has been implicated in Ca2+ regulation of neurotransmitter release. Piccolo is a multi-domain protein containing two N-terminal FYVE zinc fingers, a polyproline tract, and a PDZ domain and two C-terminal C2 domains. This family corresponds to the first FYVE domain, which resembles a FYVE-related domain that is structurally similar to the canonical FYVE domains but lacks the three signature sequences: an N-terminal WxxD motif (x for any residue), the central basic R(R/K)HHCRxCG patch, and a C-terminal RVC motif.


Pssm-ID: 277313 [Multi-domain]  Cd Length: 62  Bit Score: 152.88  E-value: 2.31e-43
                           10        20        30        40        50        60
                   ....*....|....*....|....*....|....*....|....*....|....*....|..
gi 2022791524  454 TFCPLCTTTELLLHTPEKANYNTCTQCHTVVCSLCGFNPNPHITEIKEWLCLNCQMQRALGG 515
Cdd:cd15774      1 TICPLCKTTELLLHTPEKANYNTCTQCQTTVCSLCGFNPNPHITEKKEWLCLNCQMQRALGG 62
FYVE2_PCLO cd15776
FYVE-related domain 2 found in protein piccolo; Protein piccolo, also termed aczonin, is a ...
924-987 3.03e-42

FYVE-related domain 2 found in protein piccolo; Protein piccolo, also termed aczonin, is a neuron-specific presynaptic active zone scaffolding protein that mainly interacts with a detergent-resistant cytoskeletal-like subcellular fraction and is involved in the organization of the interplay between neurotransmitter vesicles, the cytoskeleton, and the plasma membrane at synaptic active zones. It binds profilin, an actin-binding protein implicated in actin cytoskeletal dynamics. It also functions as a presynaptic low-affinity Ca2+ sensor and has been implicated in Ca2+ regulation of neurotransmitter release. Piccolo is a multi-domain protein containing two N-terminal FYVE zinc fingers, a polyproline tract, and a PDZ domain and two C-terminal C2 domains. This family corresponds to the second FYVE domain, which resembles a FYVE-related domain that is structurally similar to the canonical FYVE domains but lacks the three signature sequences: an N-terminal WxxD motif (x for any residue), the central basic R(R/K)HHCRxCG patch, and a C-terminal RVC motif.


Pssm-ID: 277315 [Multi-domain]  Cd Length: 64  Bit Score: 149.83  E-value: 3.03e-42
                           10        20        30        40        50        60
                   ....*....|....*....|....*....|....*....|....*....|....*....|....
gi 2022791524  924 VSCPLCKTGLNVGSKDPPNFNTCTECKKVVCNLCGFNPMPHIAEVQEWLCLNCQTQRAMSGQLG 987
Cdd:cd15776      1 LLCPLCKTELNIGSKDPPNFNTCTECKKTVCNLCGFNPTPHLTEVKEWLCLNCQTQRAMSGQLG 64
zf-piccolo pfam05715
Piccolo Zn-finger; This (predicted) Zinc finger is found in the bassoon and piccolo proteins. ...
924-982 1.36e-37

Piccolo Zn-finger; This (predicted) Zinc finger is found in the bassoon and piccolo proteins. There are eight conserved cysteines, suggesting that it coordinates two zinc ligands.


Pssm-ID: 461722 [Multi-domain]  Cd Length: 60  Bit Score: 136.39  E-value: 1.36e-37
                           10        20        30        40        50        60
                   ....*....|....*....|....*....|....*....|....*....|....*....|
gi 2022791524  924 VSCPLCK-TGLNVGSKDPPNFNTCTECKKVVCNLCGFNPMPHIAEVQEWLCLNCQTQRAM 982
Cdd:pfam05715    1 TLCPLCKtTELNVGSKEPPNYNTCTECKSQVCNLCGFNPTPHLTEKKEWLCLNCQTQRAL 60
zf-piccolo pfam05715
Piccolo Zn-finger; This (predicted) Zinc finger is found in the bassoon and piccolo proteins. ...
454-513 1.45e-37

Piccolo Zn-finger; This (predicted) Zinc finger is found in the bassoon and piccolo proteins. There are eight conserved cysteines, suggesting that it coordinates two zinc ligands.


Pssm-ID: 461722 [Multi-domain]  Cd Length: 60  Bit Score: 136.39  E-value: 1.45e-37
                           10        20        30        40        50        60
                   ....*....|....*....|....*....|....*....|....*....|....*....|
gi 2022791524  454 TFCPLCTTTELLLHTPEKANYNTCTQCHTVVCSLCGFNPNPHITEIKEWLCLNCQMQRAL 513
Cdd:pfam05715    1 TLCPLCKTTELNVGSKEPPNYNTCTECKSQVCNLCGFNPTPHLTEKKEWLCLNCQTQRAL 60
FYVE2_BSN_PCLO cd15772
FYVE-related domain 2 found in protein bassoon and piccolo; This family includes protein ...
924-987 5.81e-37

FYVE-related domain 2 found in protein bassoon and piccolo; This family includes protein bassoon and piccolo. Protein bassoon, also termed zinc finger protein 231, is a core component of the presynaptic cytomatrix. It is a vertebrate-specific active zone scaffolding protein that plays a key role in structural organization and functional regulation of presynaptic release sites. Bassoon may modulate synaptic transmission efficiency by binding to presynaptic P/Q-type voltage-dependent calcium channel (VDCC) complexes and modify the channel function. As one of the most highly phosphorylated synaptic proteins, bassoon can interact with the small ubiquitous adaptor protein 14-3-3 in a phosphorylation-dependent manner, which modulates its anchoring to the presynaptic cytomatrix. Protein piccolo, also termed aczonin, is a neuron-specific presynaptic active zone scaffolding protein that mainly interacts with a detergent-resistant cytoskeletal-like subcellular fraction and is involved in the organization of the interplay between neurotransmitter vesicles, the cytoskeleton, and the plasma membrane at synaptic active zones. It binds profilin, an actin-binding protein implicated in actin cytoskeletal dynamics. It also functions as a presynaptic low-affinity Ca2+ sensor and has been implicated in Ca2+ regulation of neurotransmitter release. Both bassoon and piccolo contain two N-terminal FYVE zinc fingers, a PDZ domain and two C-terminal C2 domains. Their FYVE domain resembles a FYVE-related domain that is structurally similar to the canonical FYVE domains but lacks the three signature sequences: an N-terminal WxxD motif (x for any residue), the central basic R(R/K)HHCRxCG patch, and a C-terminal RVC motif. This model corresponds to the second FYVE-related domain.


Pssm-ID: 277311 [Multi-domain]  Cd Length: 64  Bit Score: 134.77  E-value: 5.81e-37
                           10        20        30        40        50        60
                   ....*....|....*....|....*....|....*....|....*....|....*....|....
gi 2022791524  924 VSCPLCKTGLNVGSKDPPNFNTCTECKKVVCNLCGFNPMPHIAEVQEWLCLNCQTQRAMSGQLG 987
Cdd:cd15772      1 VTCPLCKTELNVGSKEPPNYNTCTQCHTQVCNLCGFNPTPHLVEKKEWLCLNCQTQRLMSGGLG 64
FYVE1_BSN_PCLO cd15771
FYVE-related domain 1 found in protein bassoon and piccolo; This family includes protein ...
454-515 9.47e-36

FYVE-related domain 1 found in protein bassoon and piccolo; This family includes protein bassoon and piccolo. Protein bassoon, also termed zinc finger protein 231, is a core component of the presynaptic cytomatrix. It is a vertebrate-specific active zone scaffolding protein that plays a key role in structural organization and functional regulation of presynaptic release sites. Bassoon may modulate synaptic transmission efficiency by binding to presynaptic P/Q-type voltage-dependent calcium channel (VDCC) complexes and modify the channel function. As one of the most highly phosphorylated synaptic proteins, bassoon can interact with the small ubiquitous adaptor protein 14-3-3 in a phosphorylation-dependent manner, which modulates its anchoring to the presynaptic cytomatrix. Protein piccolo, also termed aczonin, is a neuron-specific presynaptic active zone scaffolding protein that mainly interacts with a detergent-resistant cytoskeletal-like subcellular fraction and is involved in the organization of the interplay between neurotransmitter vesicles, the cytoskeleton, and the plasma membrane at synaptic active zones. It binds profilin, an actin-binding protein implicated in actin cytoskeletal dynamics. It also functions as a presynaptic low-affinity Ca2+ sensor and has been implicated in Ca2+ regulation of neurotransmitter release. Both bassoon and piccolo contain two N-terminal FYVE zinc fingers, a PDZ domain and two C-terminal C2 domains. Their FYVE domain resembles a FYVE-related domain that is structurally similar to the canonical FYVE domains but lacks the three signature sequences: an N-terminal WxxD motif (x for any residue), the central basic R(R/K)HHCRxCG patch, and a C-terminal RVC motif. This model corresponds to the first FYVE-related domain.


Pssm-ID: 277310 [Multi-domain]  Cd Length: 61  Bit Score: 131.28  E-value: 9.47e-36
                           10        20        30        40        50        60
                   ....*....|....*....|....*....|....*....|....*....|....*....|..
gi 2022791524  454 TFCPLCTTTELLLHTPeKANYNTCTQCHTVVCSLCGFNPNPHITEIKEWLCLNCQMQRALGG 515
Cdd:cd15771      1 TLCPLCNTTELTLHVP-KPNFNTCTQCHTTVCNQCGFNPNPHLTEVKEWLCLNCQMQRALGM 61
FYVE2_BSN cd15775
FYVE-related domain 2 found in protein bassoon; Protein bassoon, also termed zinc finger ...
923-987 3.75e-34

FYVE-related domain 2 found in protein bassoon; Protein bassoon, also termed zinc finger protein 231, is a core component of the presynaptic cytomatrix. It is a vertebrate-specific active zone scaffolding protein that plays a key role in structural organization and functional regulation of presynaptic release sites. Bassoon may modulate synaptic transmission efficiency by binding to presynaptic P/Q-type voltage-dependent calcium channel (VDCC) complexes and modify the channel function. As one of the most highly phosphorylated synaptic proteins, bassoon can interact with the small ubiquitous adaptor protein 14-3-3 in a phosphorylation-dependent manner, which modulates its anchoring to the presynaptic cytomatrix. Bassoon contains two N-terminal FYVE zinc fingers, a PDZ domain and two C-terminal C2 domains. This family corresponds to the second FYVE domain, which resembles a FYVE-related domain that is structurally similar to the canonical FYVE domains but lacks the three signature sequences: an N-terminal WxxD motif (x for any residue), the central basic R(R/K)HHCRxCG patch, and a C-terminal RVC motif.


Pssm-ID: 277314 [Multi-domain]  Cd Length: 65  Bit Score: 126.96  E-value: 3.75e-34
                           10        20        30        40        50        60
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*
gi 2022791524  923 KVSCPLCKTGLNVGSKDPPNFNTCTECKKVVCNLCGFNPMPHIAEVQEWLCLNCQTQRAMSGQLG 987
Cdd:cd15775      1 RVTCPLCKTELNVGSTEPPNYNTCTSCRTQVCNLCGFNPTPHLVEKNEWLCLNCQTQRLLEGSLG 65
FYVE1_BSN cd15773
FYVE-related domain 1 found in protein bassoon; Protein bassoon, also termed zinc finger ...
451-514 1.51e-32

FYVE-related domain 1 found in protein bassoon; Protein bassoon, also termed zinc finger protein 231, is a core component of the presynaptic cytomatrix. It is a vertebrate-specific active zone scaffolding protein that plays a key role in structural organization and functional regulation of presynaptic release sites. Bassoon may modulate synaptic transmission efficiency by binding to presynaptic P/Q-type voltage-dependent calcium channel (VDCC) complexes and modify the channel function. As one of the most highly phosphorylated synaptic proteins, bassoon can interact with the small ubiquitous adaptor protein 14-3-3 in a phosphorylation-dependent manner, which modulates its anchoring to the presynaptic cytomatrix. Bassoon contains two N-terminal FYVE zinc fingers, a PDZ domain and two C-terminal C2 domains. This family corresponds to the first FYVE domain, which resembles a FYVE-related domain that is structurally similar to the canonical FYVE domains but lacks the three signature sequences: an N-terminal WxxD motif (x for any residue), the central basic R(R/K)HHCRxCG patch, and a C-terminal RVC motif.


Pssm-ID: 277312 [Multi-domain]  Cd Length: 64  Bit Score: 122.11  E-value: 1.51e-32
                           10        20        30        40        50        60
                   ....*....|....*....|....*....|....*....|....*....|....*....|....
gi 2022791524  451 PKKTFCPLCTTTELLlHTPEKANYNTCTQCHTVVCSLCGFNPNPHITEIKEWLCLNCQMQRALG 514
Cdd:cd15773      1 PSSTLCPICNTTELT-SFPSQPNFNTCTQCHNKVCNQCGFNPNPHLTEVKEWLCLNCQMQRALG 63
FYVE_BSN_PCLO cd15751
FYVE-related domain found in protein bassoon and piccolo; This family includes protein bassoon ...
926-982 2.08e-27

FYVE-related domain found in protein bassoon and piccolo; This family includes protein bassoon and piccolo. Protein bassoon, also termed zinc finger protein 231, is a core component of the presynaptic cytomatrix. It is a vertebrate-specific active zone scaffolding protein that plays a key role in structural organization and functional regulation of presynaptic release sites. Bassoon may modulate synaptic transmission efficiency by binding to presynaptic P/Q-type voltage-dependent calcium channel (VDCC) complexes and modify the channel function. As one of the most highly phosphorylated synaptic proteins, bassoon can interact with the small ubiquitous adaptor protein 14-3-3 in a phosphorylation-dependent manner, which modulates its anchoring to the presynaptic cytomatrix. Protein piccolo, also termed aczonin, is a neuron-specific presynaptic active zone scaffolding protein that mainly interacts with a detergent-resistant cytoskeletal-like subcellular fraction and is involved in the organization of the interplay between neurotransmitter vesicles, the cytoskeleton, and the plasma membrane at synaptic active zones. It binds profilin, an actin-binding protein implicated in actin cytoskeletal dynamics. It also functions as a presynaptic low-affinity Ca2+ sensor and has been implicated in Ca2+ regulation of neurotransmitter release. Both bassoon and piccolo contain two N-terminal FYVE zinc fingers, a PDZ domain and two C-terminal C2 domains. Their FYVE domain resembles a FYVE-related domain that is structurally similar to the canonical FYVE domains but lacks the three signature sequences: an N-terminal WxxD motif (x for any residue), the central basic R(R/K)HHCRxCG patch, and a C-terminal RVC motif.


Pssm-ID: 277290 [Multi-domain]  Cd Length: 62  Bit Score: 107.54  E-value: 2.08e-27
                           10        20        30        40        50
                   ....*....|....*....|....*....|....*....|....*....|....*...
gi 2022791524  926 CPLCK-TGLNVGSKDPPNFNTCTECKKVVCNLCGFNPMPHIAEVQEWLCLNCQTQRAM 982
Cdd:cd15751      3 CPLCGtSELPLGSKSPPNYNTCTDCKNRVCNQCGFNSTPPVTKVKEWLCLNCQKKRAL 60
FYVE2_BSN_PCLO cd15772
FYVE-related domain 2 found in protein bassoon and piccolo; This family includes protein ...
456-517 3.79e-26

FYVE-related domain 2 found in protein bassoon and piccolo; This family includes protein bassoon and piccolo. Protein bassoon, also termed zinc finger protein 231, is a core component of the presynaptic cytomatrix. It is a vertebrate-specific active zone scaffolding protein that plays a key role in structural organization and functional regulation of presynaptic release sites. Bassoon may modulate synaptic transmission efficiency by binding to presynaptic P/Q-type voltage-dependent calcium channel (VDCC) complexes and modify the channel function. As one of the most highly phosphorylated synaptic proteins, bassoon can interact with the small ubiquitous adaptor protein 14-3-3 in a phosphorylation-dependent manner, which modulates its anchoring to the presynaptic cytomatrix. Protein piccolo, also termed aczonin, is a neuron-specific presynaptic active zone scaffolding protein that mainly interacts with a detergent-resistant cytoskeletal-like subcellular fraction and is involved in the organization of the interplay between neurotransmitter vesicles, the cytoskeleton, and the plasma membrane at synaptic active zones. It binds profilin, an actin-binding protein implicated in actin cytoskeletal dynamics. It also functions as a presynaptic low-affinity Ca2+ sensor and has been implicated in Ca2+ regulation of neurotransmitter release. Both bassoon and piccolo contain two N-terminal FYVE zinc fingers, a PDZ domain and two C-terminal C2 domains. Their FYVE domain resembles a FYVE-related domain that is structurally similar to the canonical FYVE domains but lacks the three signature sequences: an N-terminal WxxD motif (x for any residue), the central basic R(R/K)HHCRxCG patch, and a C-terminal RVC motif. This model corresponds to the second FYVE-related domain.


Pssm-ID: 277311 [Multi-domain]  Cd Length: 64  Bit Score: 103.96  E-value: 3.79e-26
                           10        20        30        40        50        60
                   ....*....|....*....|....*....|....*....|....*....|....*....|..
gi 2022791524  456 CPLCTTtELLLHTPEKANYNTCTQCHTVVCSLCGFNPNPHITEIKEWLCLNCQMQRALGGDL 517
Cdd:cd15772      3 CPLCKT-ELNVGSKEPPNYNTCTQCHTQVCNLCGFNPTPHLVEKKEWLCLNCQTQRLMSGGL 63
FYVE1_PCLO cd15774
FYVE-related domain 1 found in protein piccolo; Protein piccolo, also termed aczonin, is a ...
926-984 6.36e-26

FYVE-related domain 1 found in protein piccolo; Protein piccolo, also termed aczonin, is a neuron-specific presynaptic active zone scaffolding protein that mainly interacts with a detergent-resistant cytoskeletal-like subcellular fraction and is involved in the organization of the interplay between neurotransmitter vesicles, the cytoskeleton, and the plasma membrane at synaptic active zones. It binds profilin, an actin-binding protein implicated in actin cytoskeletal dynamics. It also functions as a presynaptic low-affinity Ca2+ sensor and has been implicated in Ca2+ regulation of neurotransmitter release. Piccolo is a multi-domain protein containing two N-terminal FYVE zinc fingers, a polyproline tract, and a PDZ domain and two C-terminal C2 domains. This family corresponds to the first FYVE domain, which resembles a FYVE-related domain that is structurally similar to the canonical FYVE domains but lacks the three signature sequences: an N-terminal WxxD motif (x for any residue), the central basic R(R/K)HHCRxCG patch, and a C-terminal RVC motif.


Pssm-ID: 277313 [Multi-domain]  Cd Length: 62  Bit Score: 103.19  E-value: 6.36e-26
                           10        20        30        40        50        60
                   ....*....|....*....|....*....|....*....|....*....|....*....|
gi 2022791524  926 CPLCK-TGLNVGSKDPPNFNTCTECKKVVCNLCGFNPMPHIAEVQEWLCLNCQTQRAMSG 984
Cdd:cd15774      3 CPLCKtTELLLHTPEKANYNTCTQCQTTVCSLCGFNPNPHITEKKEWLCLNCQMQRALGG 62
FYVE2_BSN cd15775
FYVE-related domain 2 found in protein bassoon; Protein bassoon, also termed zinc finger ...
453-518 6.86e-26

FYVE-related domain 2 found in protein bassoon; Protein bassoon, also termed zinc finger protein 231, is a core component of the presynaptic cytomatrix. It is a vertebrate-specific active zone scaffolding protein that plays a key role in structural organization and functional regulation of presynaptic release sites. Bassoon may modulate synaptic transmission efficiency by binding to presynaptic P/Q-type voltage-dependent calcium channel (VDCC) complexes and modify the channel function. As one of the most highly phosphorylated synaptic proteins, bassoon can interact with the small ubiquitous adaptor protein 14-3-3 in a phosphorylation-dependent manner, which modulates its anchoring to the presynaptic cytomatrix. Bassoon contains two N-terminal FYVE zinc fingers, a PDZ domain and two C-terminal C2 domains. This family corresponds to the second FYVE domain, which resembles a FYVE-related domain that is structurally similar to the canonical FYVE domains but lacks the three signature sequences: an N-terminal WxxD motif (x for any residue), the central basic R(R/K)HHCRxCG patch, and a C-terminal RVC motif.


Pssm-ID: 277314 [Multi-domain]  Cd Length: 65  Bit Score: 103.46  E-value: 6.86e-26
                           10        20        30        40        50        60
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*.
gi 2022791524  453 KTFCPLCTTtELLLHTPEKANYNTCTQCHTVVCSLCGFNPNPHITEIKEWLCLNCQMQRALGGDLA 518
Cdd:cd15775      1 RVTCPLCKT-ELNVGSTEPPNYNTCTSCRTQVCNLCGFNPTPHLVEKNEWLCLNCQTQRLLEGSLG 65
FYVE1_BSN_PCLO cd15771
FYVE-related domain 1 found in protein bassoon and piccolo; This family includes protein ...
926-984 2.07e-25

FYVE-related domain 1 found in protein bassoon and piccolo; This family includes protein bassoon and piccolo. Protein bassoon, also termed zinc finger protein 231, is a core component of the presynaptic cytomatrix. It is a vertebrate-specific active zone scaffolding protein that plays a key role in structural organization and functional regulation of presynaptic release sites. Bassoon may modulate synaptic transmission efficiency by binding to presynaptic P/Q-type voltage-dependent calcium channel (VDCC) complexes and modify the channel function. As one of the most highly phosphorylated synaptic proteins, bassoon can interact with the small ubiquitous adaptor protein 14-3-3 in a phosphorylation-dependent manner, which modulates its anchoring to the presynaptic cytomatrix. Protein piccolo, also termed aczonin, is a neuron-specific presynaptic active zone scaffolding protein that mainly interacts with a detergent-resistant cytoskeletal-like subcellular fraction and is involved in the organization of the interplay between neurotransmitter vesicles, the cytoskeleton, and the plasma membrane at synaptic active zones. It binds profilin, an actin-binding protein implicated in actin cytoskeletal dynamics. It also functions as a presynaptic low-affinity Ca2+ sensor and has been implicated in Ca2+ regulation of neurotransmitter release. Both bassoon and piccolo contain two N-terminal FYVE zinc fingers, a PDZ domain and two C-terminal C2 domains. Their FYVE domain resembles a FYVE-related domain that is structurally similar to the canonical FYVE domains but lacks the three signature sequences: an N-terminal WxxD motif (x for any residue), the central basic R(R/K)HHCRxCG patch, and a C-terminal RVC motif. This model corresponds to the first FYVE-related domain.


Pssm-ID: 277310 [Multi-domain]  Cd Length: 61  Bit Score: 101.62  E-value: 2.07e-25
                           10        20        30        40        50
                   ....*....|....*....|....*....|....*....|....*....|....*....
gi 2022791524  926 CPLCKTGLNVGSKDPPNFNTCTECKKVVCNLCGFNPMPHIAEVQEWLCLNCQTQRAMSG 984
Cdd:cd15771      3 CPLCNTTELTLHVPKPNFNTCTQCHTTVCNQCGFNPNPHLTEVKEWLCLNCQMQRALGM 61
FYVE_BSN_PCLO cd15751
FYVE-related domain found in protein bassoon and piccolo; This family includes protein bassoon ...
456-514 1.03e-24

FYVE-related domain found in protein bassoon and piccolo; This family includes protein bassoon and piccolo. Protein bassoon, also termed zinc finger protein 231, is a core component of the presynaptic cytomatrix. It is a vertebrate-specific active zone scaffolding protein that plays a key role in structural organization and functional regulation of presynaptic release sites. Bassoon may modulate synaptic transmission efficiency by binding to presynaptic P/Q-type voltage-dependent calcium channel (VDCC) complexes and modify the channel function. As one of the most highly phosphorylated synaptic proteins, bassoon can interact with the small ubiquitous adaptor protein 14-3-3 in a phosphorylation-dependent manner, which modulates its anchoring to the presynaptic cytomatrix. Protein piccolo, also termed aczonin, is a neuron-specific presynaptic active zone scaffolding protein that mainly interacts with a detergent-resistant cytoskeletal-like subcellular fraction and is involved in the organization of the interplay between neurotransmitter vesicles, the cytoskeleton, and the plasma membrane at synaptic active zones. It binds profilin, an actin-binding protein implicated in actin cytoskeletal dynamics. It also functions as a presynaptic low-affinity Ca2+ sensor and has been implicated in Ca2+ regulation of neurotransmitter release. Both bassoon and piccolo contain two N-terminal FYVE zinc fingers, a PDZ domain and two C-terminal C2 domains. Their FYVE domain resembles a FYVE-related domain that is structurally similar to the canonical FYVE domains but lacks the three signature sequences: an N-terminal WxxD motif (x for any residue), the central basic R(R/K)HHCRxCG patch, and a C-terminal RVC motif.


Pssm-ID: 277290 [Multi-domain]  Cd Length: 62  Bit Score: 99.83  E-value: 1.03e-24
                           10        20        30        40        50
                   ....*....|....*....|....*....|....*....|....*....|....*....
gi 2022791524  456 CPLCTTTELLLHTPEKANYNTCTQCHTVVCSLCGFNPNPHITEIKEWLCLNCQMQRALG 514
Cdd:cd15751      3 CPLCGTSELPLGSKSPPNYNTCTDCKNRVCNQCGFNSTPPVTKVKEWLCLNCQKKRALG 61
FYVE1_BSN cd15773
FYVE-related domain 1 found in protein bassoon; Protein bassoon, also termed zinc finger ...
926-982 1.95e-24

FYVE-related domain 1 found in protein bassoon; Protein bassoon, also termed zinc finger protein 231, is a core component of the presynaptic cytomatrix. It is a vertebrate-specific active zone scaffolding protein that plays a key role in structural organization and functional regulation of presynaptic release sites. Bassoon may modulate synaptic transmission efficiency by binding to presynaptic P/Q-type voltage-dependent calcium channel (VDCC) complexes and modify the channel function. As one of the most highly phosphorylated synaptic proteins, bassoon can interact with the small ubiquitous adaptor protein 14-3-3 in a phosphorylation-dependent manner, which modulates its anchoring to the presynaptic cytomatrix. Bassoon contains two N-terminal FYVE zinc fingers, a PDZ domain and two C-terminal C2 domains. This family corresponds to the first FYVE domain, which resembles a FYVE-related domain that is structurally similar to the canonical FYVE domains but lacks the three signature sequences: an N-terminal WxxD motif (x for any residue), the central basic R(R/K)HHCRxCG patch, and a C-terminal RVC motif.


Pssm-ID: 277312 [Multi-domain]  Cd Length: 64  Bit Score: 99.00  E-value: 1.95e-24
                           10        20        30        40        50
                   ....*....|....*....|....*....|....*....|....*....|....*..
gi 2022791524  926 CPLCKTGLNVGSKDPPNFNTCTECKKVVCNLCGFNPMPHIAEVQEWLCLNCQTQRAM 982
Cdd:cd15773      6 CPICNTTELTSFPSQPNFNTCTQCHNKVCNQCGFNPNPHLTEVKEWLCLNCQMQRAL 62
FYVE2_PCLO cd15776
FYVE-related domain 2 found in protein piccolo; Protein piccolo, also termed aczonin, is a ...
456-517 4.26e-24

FYVE-related domain 2 found in protein piccolo; Protein piccolo, also termed aczonin, is a neuron-specific presynaptic active zone scaffolding protein that mainly interacts with a detergent-resistant cytoskeletal-like subcellular fraction and is involved in the organization of the interplay between neurotransmitter vesicles, the cytoskeleton, and the plasma membrane at synaptic active zones. It binds profilin, an actin-binding protein implicated in actin cytoskeletal dynamics. It also functions as a presynaptic low-affinity Ca2+ sensor and has been implicated in Ca2+ regulation of neurotransmitter release. Piccolo is a multi-domain protein containing two N-terminal FYVE zinc fingers, a polyproline tract, and a PDZ domain and two C-terminal C2 domains. This family corresponds to the second FYVE domain, which resembles a FYVE-related domain that is structurally similar to the canonical FYVE domains but lacks the three signature sequences: an N-terminal WxxD motif (x for any residue), the central basic R(R/K)HHCRxCG patch, and a C-terminal RVC motif.


Pssm-ID: 277315 [Multi-domain]  Cd Length: 64  Bit Score: 98.22  E-value: 4.26e-24
                           10        20        30        40        50        60
                   ....*....|....*....|....*....|....*....|....*....|....*....|..
gi 2022791524  456 CPLCTTtELLLHTPEKANYNTCTQCHTVVCSLCGFNPNPHITEIKEWLCLNCQMQRALGGDL 517
Cdd:cd15776      3 CPLCKT-ELNIGSKDPPNFNTCTECKKTVCNLCGFNPTPHLTEVKEWLCLNCQTQRAMSGQL 63
PHA03247 PHA03247
large tegument protein UL36; Provisional
63-454 8.42e-20

large tegument protein UL36; Provisional


Pssm-ID: 223021 [Multi-domain]  Cd Length: 3151  Bit Score: 98.86  E-value: 8.42e-20
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2022791524   63 RAQGLPRGNLAGAEPPpmqrHPELDTSHHPRQPskPPDPGPGLSKSRTVDVLKTEQRAPGR---SPSSISLREsKSRTDF 139
Cdd:PHA03247  2599 RAPVDDRGDPRGPAPP----SPLPPDTHAPDPP--PPSPSPAANEPDPHPPPTVPPPERPRddpAPGRVSRPR-RARRLG 2671
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2022791524  140 REDQKSSMMPSFLSEANPlSAVTSVVNKFNPfdlisdSDTAHEEAGRKQKVIQKEQGKPEEQRGSAKLPAQQQSP-KPVQ 218
Cdd:PHA03247  2672 RAAQASSPPQRPRRRAAR-PTVGSLTSLADP------PPPPPTPEPAPHALVSATPLPPGPAAARQASPALPAAPaPPAV 2744
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2022791524  219 PQGHVKPTPQQSESSKAVPQQqqPGEPKQIQKPGPGHPADSKLEQVKQPPQPRGPQKSQPQQSEPTKPVQQQTSAKP-SA 297
Cdd:PHA03247  2745 PAGPATPGGPARPARPPTTAG--PPAPAPPAAPAAGPPRRLTRPAVASLSESRESLPSPWDPADPPAAVLAPAAALPpAA 2822
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2022791524  298 GPAKPLPQQPDSARTSSQAPPPAKPSSQQLG-------------PVKQPSQQPARQGGPvkPSSQQAGPPKQPSQQPGPE 364
Cdd:PHA03247  2823 SPAGPLPPPTSAQPTAPPPPPGPPPPSLPLGgsvapggdvrrrpPSRSPAAKPAAPARP--PVRRLARPAVSRSTESFAL 2900
                          330       340       350       360       370       380       390       400
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2022791524  365 KPSAQQMGPAKQA-SQPGPGKPPLQQTGPVKQAPPQSGPTKPPPQTAGATKAPVPQPGLTKP------PGQQPEPEKPSQ 437
Cdd:PHA03247  2901 PPDQPERPPQPQApPPPQPQPQPPPPPQPQPPPPPPPRPQPPLAPTTDPAGAGEPSGAVPQPwlgalvPGRVAVPRFRVP 2980
                          410
                   ....*....|....*..
gi 2022791524  438 QKQASATQPAESAPKKT 454
Cdd:PHA03247  2981 QPAPSREAPASSTPPLT 2997
PHA03247 PHA03247
large tegument protein UL36; Provisional
207-744 7.57e-18

large tegument protein UL36; Provisional


Pssm-ID: 223021 [Multi-domain]  Cd Length: 3151  Bit Score: 92.31  E-value: 7.57e-18
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2022791524  207 LPAQQQSPKPVQPQGHVKPTPQQSESSkAVPQQQQPGEPKQIQKP-GPGHPADSkleqvkqPPQPRGPQKSQPQQSEPTK 285
Cdd:PHA03247  2555 LPPAAPPAAPDRSVPPPRPAPRPSEPA-VTSRARRPDAPPQSARPrAPVDDRGD-------PRGPAPPSPLPPDTHAPDP 2626
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2022791524  286 PVQQQTSAKPSAG----PAKPLPQQPDSARTSSQAPPPAKPSSQQLGP-VKQPSQQPARQG-----GPVKPSSQQAGPPK 355
Cdd:PHA03247  2627 PPPSPSPAANEPDphppPTVPPPERPRDDPAPGRVSRPRRARRLGRAAqASSPPQRPRRRAarptvGSLTSLADPPPPPP 2706
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2022791524  356 QPSQQPGPEKP---------SAQQMGPAKQASqPGPGKPPLQQTGPVKQAPPQSGPTKPPPQTAGATKAPV--PQPGLTK 424
Cdd:PHA03247  2707 TPEPAPHALVSatplppgpaAARQASPALPAA-PAPPAVPAGPATPGGPARPARPPTTAGPPAPAPPAAPAagPPRRLTR 2785
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2022791524  425 PPGQQPEPEKPsqqkqaSATQPAESAPKKTFCPLCTTTELLLHTPEKANYNTCTQCHTVVCSLCGFNPNPhiteikewlc 504
Cdd:PHA03247  2786 PAVASLSESRE------SLPSPWDPADPPAAVLAPAAALPPAASPAGPLPPPTSAQPTAPPPPPGPPPPS---------- 2849
                          330       340       350       360       370       380       390       400
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2022791524  505 lncqmqRALGGDLAPGhGPgpqlpppkqktptppsTAKPPPQPQPGQKKDASPKPDPAQLAEPKKPPPQKKQPSMPGSPP 584
Cdd:PHA03247  2850 ------LPLGGSVAPG-GD----------------VRRRPPSRSPAAKPAAPARPPVRRLARPAVSRSTESFALPPDQPE 2906
                          410       420       430       440       450       460       470       480
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2022791524  585 VKAKTSRAEPADTSQQIDPAPKSDRAKPSQAEDKQKQPSIQKPAADTVLTPAAPGLKQDSAGPRPPPTQQKVTDSPKPEL 664
Cdd:PHA03247  2907 RPPQPQAPPPPQPQPQPPPPPQPQPPPPPPPRPQPPLAPTTDPAGAGEPSGAVPQPWLGALVPGRVAVPRFRVPQPAPSR 2986
                          490       500       510       520       530       540       550       560
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2022791524  665 AKPSQDTHPagekPDSKPLPQVSRQKSDPKLSAQPGARSEAKTQKPVEPAQVKD---------DPKKLQTK---PSPKPE 732
Cdd:PHA03247  2987 EAPASSTPP----LTGHSLSRVSSWASSLALHEETDPPPVSLKQTLWPPDDTEDsdadslfdsDSERSDLEaldPLPPEP 3062
                          570
                   ....*....|..
gi 2022791524  733 TKPAPKGLQPGA 744
Cdd:PHA03247  3063 HDPFAHEPDPAT 3074
PHA03247 PHA03247
large tegument protein UL36; Provisional
214-738 3.85e-16

large tegument protein UL36; Provisional


Pssm-ID: 223021 [Multi-domain]  Cd Length: 3151  Bit Score: 86.92  E-value: 3.85e-16
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2022791524  214 PKPVQPQGHVKPTPQQS-ESSKAVPQQQQPGEPKQIQKPG-PGHPAdskleqvkQPPQPRGPQKSQPQQSEPTKPVQQQT 291
Cdd:PHA03247  2551 PPPPLPPAAPPAAPDRSvPPPRPAPRPSEPAVTSRARRPDaPPQSA--------RPRAPVDDRGDPRGPAPPSPLPPDTH 2622
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2022791524  292 SAKPSAGPAKPLPQQPDSARTSSqAPPPAKPSSQQLGPVKQPSQQPARQGGPVKPSSQQAGPpkQPSQQPGPEKPSAQQM 371
Cdd:PHA03247  2623 APDPPPPSPSPAANEPDPHPPPT-VPPPERPRDDPAPGRVSRPRRARRLGRAAQASSPPQRP--RRRAARPTVGSLTSLA 2699
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2022791524  372 GPAKQASQPGPGKPPLQQTGPVKQAPPQSGPTKPPPQTAGATKAPVPQPGLTKPPGQQPEPEKPSQQKQAS-----ATQP 446
Cdd:PHA03247  2700 DPPPPPPTPEPAPHALVSATPLPPGPAAARQASPALPAAPAPPAVPAGPATPGGPARPARPPTTAGPPAPAppaapAAGP 2779
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2022791524  447 AESAPKKTFCPLCTTTELLLHTPEKANyntctqcHTVVCSlcgfNPNPHITeikewlclncqmqralggdlaPGHGPGPQ 526
Cdd:PHA03247  2780 PRRLTRPAVASLSESRESLPSPWDPAD-------PPAAVL----APAAALP---------------------PAASPAGP 2827
                          330       340       350       360       370       380       390       400
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2022791524  527 LPPPKQKTPTPPSTAKPPPQPQPGQKKDASPKPDPAQLAEpkkpppqkkqpsmPGSPPVKAKTSRAEPADTSqqidPAPK 606
Cdd:PHA03247  2828 LPPPTSAQPTAPPPPPGPPPPSLPLGGSVAPGGDVRRRPP-------------SRSPAAKPAAPARPPVRRL----ARPA 2890
                          410       420       430       440       450       460       470       480
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2022791524  607 SDRAKPSQAedkQKQPSIQKPAadtvlTPAAPglkqdsAGPRPPPTQQkVTDSPKPELAKPSQDTHPAGEKPDSKPLPQV 686
Cdd:PHA03247  2891 VSRSTESFA---LPPDQPERPP-----QPQAP------PPPQPQPQPP-PPPQPQPPPPPPPRPQPPLAPTTDPAGAGEP 2955
                          490       500       510       520       530
                   ....*....|....*....|....*....|....*....|....*....|..
gi 2022791524  687 SRQKSDPKLSAQPGARSEAKTQKPVEPAqvkdDPKKLQTKPSPKPETKPAPK 738
Cdd:PHA03247  2956 SGAVPQPWLGALVPGRVAVPRFRVPQPA----PSREAPASSTPPLTGHSLSR 3003
PHA03378 PHA03378
EBNA-3B; Provisional
198-444 1.22e-15

EBNA-3B; Provisional


Pssm-ID: 223065 [Multi-domain]  Cd Length: 991  Bit Score: 84.73  E-value: 1.22e-15
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2022791524  198 PEEQRGSAKLPAQQQSPKPVqPQGHVKPTPQQSESSkaVPQQQQPGEPKQIQKPGPGHPADSkleQVKQPPQPRGPQKSQ 277
Cdd:PHA03378   580 PTTSQLASSAPSYAQTPWPV-PHPSQTPEPPTTQSH--IPETSAPRQWPMPLRPIPMRPLRM---QPITFNVLVFPTPHQ 653
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2022791524  278 PQQSEPTKpvQQQTSAKPSAGPAKPLPQQPDSARTSSQAPPPAKPssqqlgpvkqpsqqPARQGGPVKPSSQQAGPPKQP 357
Cdd:PHA03378   654 PPQVEITP--YKPTWTQIGHIPYQPSPTGANTMLPIQWAPGTMQP--------------PPRAPTPMRPPAAPPGRAQRP 717
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2022791524  358 SQQPGPEKPSAQQMGPAKQ-ASQPGPGKPPlqQTGPVKQAPPQSGPTK-PPPQTAGATKAPVPQPGLTKPPGQQPE-PEK 434
Cdd:PHA03378   718 AAATGRARPPAAAPGRARPpAAAPGRARPP--AAAPGRARPPAAAPGRaRPPAAAPGAPTPQPPPQAPPAPQQRPRgAPT 795
                          250
                   ....*....|
gi 2022791524  435 PSQQKQASAT 444
Cdd:PHA03378   796 PQPPPQAGPT 805
PHA03247 PHA03247
large tegument protein UL36; Provisional
237-744 1.64e-15

large tegument protein UL36; Provisional


Pssm-ID: 223021 [Multi-domain]  Cd Length: 3151  Bit Score: 84.60  E-value: 1.64e-15
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2022791524  237 PQQQQPGEPKQIQKPGPGHPADSKleqvkQPPQPRGPQKSQPQQSEPTKPVQ----------QQTSAKPSAGPAKPLPQQ 306
Cdd:PHA03247  2484 AEARFPFAAGAAPDPGGGGPPDPD-----APPAPSRLAPAILPDEPVGEPVHprmltwirglEELASDDAGDPPPPLPPA 2558
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2022791524  307 PDSARTSSQAPPPaKPSSQQLGPVKQPSQ-------QPARQGGPVKPSSQQAGPPKQPSQQPGPEKPSAQQMGPAKQASQ 379
Cdd:PHA03247  2559 APPAAPDRSVPPP-RPAPRPSEPAVTSRArrpdappQSARPRAPVDDRGDPRGPAPPSPLPPDTHAPDPPPPSPSPAANE 2637
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2022791524  380 PGPG--------------------KPPLQQTGPVKQAPPQSGPTKPPPQTAGATKAPVPQpgLTKPPGQQPEPEKPSQQK 439
Cdd:PHA03247  2638 PDPHppptvppperprddpapgrvSRPRRARRLGRAAQASSPPQRPRRRAARPTVGSLTS--LADPPPPPPTPEPAPHAL 2715
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2022791524  440 QASATQPAESAPKKTFCPLCTTTELLLHTPEKAnyntctqchtvvcslcgfnpnphiteikewlclncqmqRALGGDLAP 519
Cdd:PHA03247  2716 VSATPLPPGPAAARQASPALPAAPAPPAVPAGP--------------------------------------ATPGGPARP 2757
                          330       340       350       360       370       380       390       400
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2022791524  520 GHGPGPQLPPPKQKTPTPPSTAKPPPQPQPGQKkdASPKPDPAQLAEPKKPPPQKKQPSMPGSPPVKAKTSRAEPADTSQ 599
Cdd:PHA03247  2758 ARPPTTAGPPAPAPPAAPAAGPPRRLTRPAVAS--LSESRESLPSPWDPADPPAAVLAPAAALPPAASPAGPLPPPTSAQ 2835
                          410       420       430       440       450       460       470       480
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2022791524  600 QIDPAPKSDRAKPSQAEDKQKQP----SIQKPAADTVLTPAAPGLKQDSAGPRPPPTQQkvTDS-PKPELAKPSQDTHPA 674
Cdd:PHA03247  2836 PTAPPPPPGPPPPSLPLGGSVAPggdvRRRPPSRSPAAKPAAPARPPVRRLARPAVSRS--TESfALPPDQPERPPQPQA 2913
                          490       500       510       520       530       540       550
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2022791524  675 GEKPDSKPLPQVSRQKsdpklsaQPGARSEAKTQKPVEPaqvkddpkklQTKPSPKPETKPAPKGLQPGA 744
Cdd:PHA03247  2914 PPPPQPQPQPPPPPQP-------QPPPPPPPRPQPPLAP----------TTDPAGAGEPSGAVPQPWLGA 2966
PHA03247 PHA03247
large tegument protein UL36; Provisional
253-791 4.06e-15

large tegument protein UL36; Provisional


Pssm-ID: 223021 [Multi-domain]  Cd Length: 3151  Bit Score: 83.45  E-value: 4.06e-15
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2022791524  253 PGHPAdskleqVKQPPQPRGPqksqpqqseptkpvqqqTSAKPSAGPAKPLPQQPDSARTSSQAPP---PAKPSSQQLGP 329
Cdd:PHA03247  2475 PGAPV------YRRPAEARFP-----------------FAAGAAPDPGGGGPPDPDAPPAPSRLAPailPDEPVGEPVHP 2531
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2022791524  330 -----VKQPSQQPARQGGPVKPSSQQAGPPKQPSQQPGPEKPSAQQMGPAKQASQPGPGKPPLQQTGPVKQAPPQSGPTK 404
Cdd:PHA03247  2532 rmltwIRGLEELASDDAGDPPPPLPPAAPPAAPDRSVPPPRPAPRPSEPAVTSRARRPDAPPQSARPRAPVDDRGDPRGP 2611
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2022791524  405 PPPQ------TAGATKAPVPQPGLTKPPGQQPEPEKPSQQKQASATQPAESAPKKTFCPLCTTTEllLHTPEKAnynTCT 478
Cdd:PHA03247  2612 APPSplppdtHAPDPPPPSPSPAANEPDPHPPPTVPPPERPRDDPAPGRVSRPRRARRLGRAAQA--SSPPQRP---RRR 2686
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2022791524  479 QCHTVVCSLCGFNPNPhiteikewlclncqmqralggdlAPGHGPGPQLPPPKQKTPTPPSTAKPPPQPQPGQKKDASPK 558
Cdd:PHA03247  2687 AARPTVGSLTSLADPP-----------------------PPPPTPEPAPHALVSATPLPPGPAAARQASPALPAAPAPPA 2743
                          330       340       350       360       370       380       390       400
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2022791524  559 P--DPAQLAEPKKPPPQKKQPSMPGSPPVKAKTSRAEPADTSQQIDPAPKSDRAKPSqAEDKQKQPSIQKPAADTVLTPA 636
Cdd:PHA03247  2744 VpaGPATPGGPARPARPPTTAGPPAPAPPAAPAAGPPRRLTRPAVASLSESRESLPS-PWDPADPPAAVLAPAAALPPAA 2822
                          410       420       430       440       450       460       470       480
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2022791524  637 APglkqdsAGPRPPPTQQKVTDSPKPELAKPSQD-----------------THPAGEKPDSKPLPQVSRQkSDPKLSAQP 699
Cdd:PHA03247  2823 SP------AGPLPPPTSAQPTAPPPPPGPPPPSLplggsvapggdvrrrppSRSPAAKPAAPARPPVRRL-ARPAVSRST 2895
                          490       500       510       520       530       540       550       560
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2022791524  700 gaRSEAKTQKPVEPAQVKDDPKKLQTKPSPKPETKPAPKGLQPGAGPKPVPAQPAPQPQQPQKTPEQPRRFSLNLGGITD 779
Cdd:PHA03247  2896 --ESFALPPDQPERPPQPQAPPPPQPQPQPPPPPQPQPPPPPPPRPQPPLAPTTDPAGAGEPSGAVPQPWLGALVPGRVA 2973
                          570
                   ....*....|....*
gi 2022791524  780 TPK---PQPTTPQET 791
Cdd:PHA03247  2974 VPRfrvPQPAPSREA 2988
PDZ_canonical cd00136
canonical PDZ domain; Canonical PDZ (PSD-95 (Postsynaptic density protein 95), Dlg (Discs ...
4542-4617 1.01e-14

canonical PDZ domain; Canonical PDZ (PSD-95 (Postsynaptic density protein 95), Dlg (Discs large protein), and ZO-1 (Zonula occludens-1)) domain. PDZ domains usually bind to short specific peptide sequences located at the C-terminal end of their partner proteins known as PDZ binding motifs. These domains can also interact with internal peptide motifs and certain lipids, and can take part in a head-to-tail oligomerization with other PDZ domains. The PDZ superfamily includes canonical PDZ domains as well as those with circular permutations and domain swapping mediated by beta-strands. The canonical PDZ domain contains six beta-strands A-F and two alpha-helices (alpha-helix 1 and 2), arranged in the order: beta-strands A, B, C, alpha-helix 1, beta-strands D, E, alpha-helix 2 and beta-strand F.


Pssm-ID: 467153 [Multi-domain]  Cd Length: 81  Bit Score: 71.80  E-value: 1.01e-14
                           10        20        30        40        50        60        70
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*.
gi 2022791524 4542 SGNGLGIRVVGGKEipgssGETGAYIAKILPGGNAEQTGKLIEGMQVLEWNGIPLTGKTYEEVQNIIIQQSGEAEI 4617
Cdd:cd00136      8 PGGGLGFSIRGGKD-----GGGGIFVSRVEPGGPAARDGRLRVGDRILEVNGVSLEGLTHEEAVELLKSAGGEVTL 78
Glutenin_hmw pfam03157
High molecular weight glutenin subunit; Members of this family include high molecular weight ...
179-450 6.71e-14

High molecular weight glutenin subunit; Members of this family include high molecular weight subunits of glutenin. This group of gluten proteins is thought to be largely responsible for the elastic properties of gluten, and hence, doughs. Indeed, glutenin high molecular weight subunits are classified as elastomeric proteins, because the glutenin network can withstand significant deformations without breaking, and return to the original conformation when the stress is removed. Elastomeric proteins differ considerably in amino acid sequence, but they are all polymers whose subunits consist of elastomeric domains, composed of repeated motifs, and non-elastic domains that mediate cross-linking between the subunits. The elastomeric domain motifs are all rich in glycine residues in addition to other hydrophobic residues. High molecular weight glutenin subunits have an extensive central elastomeric domain, flanked by two terminal non-elastic domains that form disulphide cross-links. The central elastomeric domain is characterized by the following three repeated motifs: PGQGQQ, GYYPTS[P/L]QQ, GQQ. It possesses overlapping beta-turns within and between the repeated motifs, and assumes a regular helical secondary structure with a diameter of approx. 1.9 nm and a pitch of approx. 1.5 nm.


Pssm-ID: 367362 [Multi-domain]  Cd Length: 786  Bit Score: 78.84  E-value: 6.71e-14
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2022791524  179 TAHEEAGRKQKVIQKEQGKPEEQRGSAKLPAQQQSPKPVQP--------------QGHVKPTPQQSESSKAVPQQQQPGE 244
Cdd:pfam03157  372 TSQQQPQQGQQPEQGQQGQQQGQGQQGQQPGQGQQPGQGQPgyyptspqqsgqgqPGYYPTSPQQSGQGQQPGQGQQPGQ 451
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2022791524  245 --PKQIQKPGPGHPADS--KLEQVKQPPQ------PRGPQksQPQQSEPTKPVQQQTSAKPSAGPAKPL----------- 303
Cdd:pfam03157  452 eqPGQGQQPGQGQQGQQpgQPEQGQQPGQgqpgyyPTSPQ--QSGQGQQLGQWQQQGQGQPGYYPTSPLqpgqgqpgyyp 529
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2022791524  304 --PQQPDSARTSSQAPPPAKPSSQQLGPVKQPSQQPARQGGPVKPSSQQAGPPKQPSQQPGPEKP-----SAQQMGPAKQ 376
Cdd:pfam03157  530 tsPQQPGQGQQLGQLQQPTQGQQGQQSGQGQQGQQPGQGQQGQQPGQGQQGQQPGQGQQPGQGQPgyyptSPQQSGQGQQ 609
                          250       260       270       280       290       300       310
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*..
gi 2022791524  377 A---SQPGPGKPPLQQTGPVKQAPPQSGPTKPPPQtagatkapvpQPGLTKPPGQQPEPEKPSQQKQASATQPAESA 450
Cdd:pfam03157  610 PgqwQQPGQGQPGYYPTSSLQLGQGQQGYYPTSPQ----------QPGQGQQPGQWQQSGQGQQGYYPTSPQQSGQA 676
Med15 pfam09606
ARC105 or Med15 subunit of Mediator complex non-fungal; The approx. 70 residue Med15 domain of ...
184-449 1.39e-13

ARC105 or Med15 subunit of Mediator complex non-fungal; The approx. 70 residue Med15 domain of the ARC-Mediator co-activator is a three-helix bundle with marked similarity to the KIX domain. The sterol regulatory element binding protein (SREBP) family of transcription activators use the ARC105 subunit to activate target genes in the regulation of cholesterol and fatty acid homeostasis. In addition, Med15 is a critical transducer of gene activation signals that control early metazoan development.


Pssm-ID: 312941 [Multi-domain]  Cd Length: 732  Bit Score: 77.74  E-value: 1.39e-13
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2022791524  184 AGRKQKVIQKEQGKPEEQRGSAKLPAQQQSPKPVQPQGhvkptpqqsesskAVPQQQQPGepkQIQKPGPGHPADSKLEQ 263
Cdd:pfam09606  156 GGMMQPSSGQPGSGTPNQMGPNGGPGQGQAGGMNGGQQ-------------GPMGGQMPP---QMGVPGMPGPADAGAQM 219
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2022791524  264 VKQPPQPRGPQKSQPQQSEPTKPVQQ-QTSAKPSAGPAKPLPQQPDSARTSSQAPPPAKPSSQQLGPVKQPSQQPARQGG 342
Cdd:pfam09606  220 GQQAQANGGMNPQQMGGAPNQVAMQQqQPQQQGQQSQLGMGINQMQQMPQGVGGGAGQGGPGQPMGPPGQQPGAMPNVMS 299
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2022791524  343 PVKPSSQQAGPPKQPSQQPGPEKPSAQQ------MGPAKQ----------------------ASQPGPGKP--------- 385
Cdd:pfam09606  300 IGDQNNYQQQQTRQQQQQQGGNHPAAHQqqmnqsVGQGGQvvalgglnhletwnpgnfgglgANPMQRGQPgmmsspspv 379
                          250       260       270       280       290       300
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*...
gi 2022791524  386 ---PLQQTGPVKQAPPQSGPTKPPPQTAGATKAPVPQPGLTKPPGQQPEPE-KPSQQKQASATQPAES 449
Cdd:pfam09606  380 pgqQVRQVTPNQFMRQSPQPSVPSPQGPGSQPPQSHPGGMIPSPALIPSPSpQMSQQPAQQRTIGQDS 447
PHA03378 PHA03378
EBNA-3B; Provisional
184-453 2.64e-13

EBNA-3B; Provisional


Pssm-ID: 223065 [Multi-domain]  Cd Length: 991  Bit Score: 77.03  E-value: 2.64e-13
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2022791524  184 AGRKQKVIQKEQ--GKPEEQRGSAKLPAQQQSPKPVQPQGhVKPTPQQSESSKaVPQQQQPGEPKQIQkPGPGHPADSKL 261
Cdd:PHA03378   593 AQTPWPVPHPSQtpEPPTTQSHIPETSAPRQWPMPLRPIP-MRPLRMQPITFN-VLVFPTPHQPPQVE-ITPYKPTWTQI 669
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2022791524  262 EQVKQPPQPRGPQKSQPQQSEPTK---PVQQQTSAKPSAGPakPLPQQPDSARTSSQAPPPAKPSSqqlgpvkqpSQQPA 338
Cdd:PHA03378   670 GHIPYQPSPTGANTMLPIQWAPGTmqpPPRAPTPMRPPAAP--PGRAQRPAAATGRARPPAAAPGR---------ARPPA 738
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2022791524  339 RQGGPVKPSSQQAGPPKQPSQQPGPEKPsaqqmgPAKQASQPGPGKPPlqQTGPVKQAPPQSGPTKPPPQTAGATKAPVP 418
Cdd:PHA03378   739 AAPGRARPPAAAPGRARPPAAAPGRARP------PAAAPGAPTPQPPP--QAPPAPQQRPRGAPTPQPPPQAGPTSMQLM 810
                          250       260       270
                   ....*....|....*....|....*....|....*
gi 2022791524  419 QPgltKPPGQQPEPEKPSQQKQASATQPAESAPKK 453
Cdd:PHA03378   811 PR---AAPGQQGPTKQILRQLLTGGVKRGRPSLKK 842
Atrophin-1 pfam03154
Atrophin-1 family; Atrophin-1 is the protein product of the dentatorubral-pallidoluysian ...
169-445 2.64e-13

Atrophin-1 family; Atrophin-1 is the protein product of the dentatorubral-pallidoluysian atrophy (DRPLA) gene. DRPLA OMIM:125370 is a progressive neurodegenerative disorder. It is caused by the expansion of a CAG repeat in the DRPLA gene on chromosome 12p. This results in an extended polyglutamine region in atrophin-1, that is thought to confer toxicity to the protein, possibly through altering its interactions with other proteins. The expansion of a CAG repeat is also the underlying defect in six other neurodegenerative disorders, including Huntington's disease. One interaction of expanded polyglutamine repeats that is thought to be pathogenic is that with the short glutamine repeat in the transcriptional coactivator CREB binding protein, CBP. This interaction draws CBP away from its usual nuclear location to the expanded polyglutamine repeat protein aggregates that are characteriztic of the polyglutamine neurodegenerative disorders. This interferes with CBP-mediated transcription and causes cytotoxicity.


Pssm-ID: 460830 [Multi-domain]  Cd Length: 991  Bit Score: 77.11  E-value: 2.64e-13
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2022791524  169 NPFDLISDSDTAHEEAGRKQK--VIQKEQGKPEEqrGSAKLPAQQQSPKPVQPQGHVKPTPQQSESSKAVPQQQQPGEPK 246
Cdd:pfam03154  150 SPQDNESDSDSSAQQQILQTQppVLQAQSGAASP--PSPPPPGTTQAATAGPTPSAPSVPPQGSPATSQPPNQTQSTAAP 227
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2022791524  247 Q--IQKPGPGHPA--DSKLEQVKQPPQPRGPQKSQPQQSEPTK----------PVQQQTSAKPSAGPAKPLPQQPDSARt 312
Cdd:pfam03154  228 HtlIQQTPTLHPQrlPSPHPPLQPMTQPPPPSQVSPQPLPQPSlhgqmppmphSLQTGPSHMQHPVPPQPFPLTPQSSQ- 306
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2022791524  313 sSQAPPPakPSSQQLGPVKQPSQQPARQGGPV--KPSSQQAGPP--------KQPSQQPGPEKPSAQQMGPAKQASQPGP 382
Cdd:pfam03154  307 -SQVPPG--PSPAAPGQSQQRIHTPPSQSQLQsqQPPREQPLPPaplsmphiKPPPTTPIPQLPNPQSHKHPPHLSGPSP 383
                          250       260       270       280       290       300
                   ....*....|....*....|....*....|....*....|....*....|....*....|...
gi 2022791524  383 GKPPLQQTGPVKQAPPQSGPTKPPPQTAGATKAPVPQPGLTKPPGQQPEPEKPSQQKQASATQ 445
Cdd:pfam03154  384 FQMNSNLPPPPALKPLSSLSTHHPPSAHPPPLQLMPQSQQLPPPPAQPPVLTQSQSLPPPAAS 446
Glutenin_hmw pfam03157
High molecular weight glutenin subunit; Members of this family include high molecular weight ...
182-446 3.63e-13

High molecular weight glutenin subunit; Members of this family include high molecular weight subunits of glutenin. This group of gluten proteins is thought to be largely responsible for the elastic properties of gluten, and hence, doughs. Indeed, glutenin high molecular weight subunits are classified as elastomeric proteins, because the glutenin network can withstand significant deformations without breaking, and return to the original conformation when the stress is removed. Elastomeric proteins differ considerably in amino acid sequence, but they are all polymers whose subunits consist of elastomeric domains, composed of repeated motifs, and non-elastic domains that mediate cross-linking between the subunits. The elastomeric domain motifs are all rich in glycine residues in addition to other hydrophobic residues. High molecular weight glutenin subunits have an extensive central elastomeric domain, flanked by two terminal non-elastic domains that form disulphide cross-links. The central elastomeric domain is characterized by the following three repeated motifs: PGQGQQ, GYYPTS[P/L]QQ, GQQ. It possesses overlapping beta-turns within and between the repeated motifs, and assumes a regular helical secondary structure with a diameter of approx. 1.9 nm and a pitch of approx. 1.5 nm.


Pssm-ID: 367362 [Multi-domain]  Cd Length: 786  Bit Score: 76.52  E-value: 3.63e-13
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2022791524  182 EEAGRKQKVIQKEQGKPEEQRGSAKLPAQQQSPKPVQPqGHVKPTPQQSESSKAVPQQQQPGEPKQIQKPGPGHPADS-- 259
Cdd:pfam03157  315 GQEQQDQQPGQGRQGQQPGQGQQGQQPAQGQQPGQGQP-GYYPTSPQQPGQGQPGYYPTSQQQPQQGQQPEQGQQGQQqg 393
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2022791524  260 KLEQVKQPPQPRGPQKSQPQQ--SEPTKPVQQQTSAKPSAGPAKPLPQQPDSARTSSQAPP-----PAKPSSQQLGPVKQ 332
Cdd:pfam03157  394 QGQQGQQPGQGQQPGQGQPGYypTSPQQSGQGQPGYYPTSPQQSGQGQQPGQGQQPGQEQPgqgqqPGQGQQGQQPGQPE 473
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2022791524  333 PSQQPAR-QGGPVKPSSQQAGPPKQPS--QQPGPEKPSAQQMGPAKQAS-QPGPGKPPLQQTGPVKQAPPQSGPTKPPPQ 408
Cdd:pfam03157  474 QGQQPGQgQPGYYPTSPQQSGQGQQLGqwQQQGQGQPGYYPTSPLQPGQgQPGYYPTSPQQPGQGQQLGQLQQPTQGQQG 553
                          250       260       270       280
                   ....*....|....*....|....*....|....*....|.
gi 2022791524  409 TAGATKAPVPQPGLTKPPGQQPEP---EKPSQQKQASATQP 446
Cdd:pfam03157  554 QQSGQGQQGQQPGQGQQGQQPGQGqqgQQPGQGQQPGQGQP 594
PAT1 pfam09770
Topoisomerase II-associated protein PAT1; Members of this family are necessary for accurate ...
177-408 5.97e-13

Topoisomerase II-associated protein PAT1; Members of this family are necessary for accurate chromosome transmission during cell division.


Pssm-ID: 401645 [Multi-domain]  Cd Length: 846  Bit Score: 75.84  E-value: 5.97e-13
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2022791524  177 SDTAHEEAGR--KQKVIQKEQGKPEEQRGSAKLPAQQQSPKPVQPQGHVK----------PTPQ-QSESS----KAVPQQ 239
Cdd:pfam09770   92 SDAIEEEQVRfnRQQPAARAAQSSAQPPASSLPQYQYASQQSQQPSKPVRtgyekykepePIPDlQVDASlwgvAPKKAA 171
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2022791524  240 QQPGEPKQIQKPGPGHPADSK---LEQV--------KQPPQPRGPQKSQPQQSEPTKPVQQQTSAKPSAGPAKPLPQQPD 308
Cdd:pfam09770  172 APAPAPQPAAQPASLPAPSRKmmsLEEVeaamraqaKKPAQQPAPAPAQPPAAPPAQQAQQQQQFPPQIQQQQQPQQQPQ 251
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2022791524  309 SartSSQAPPPAKPSSQQLGPVKQPSQQPARQGGPVKPSSQQAGPPK--QPSQQ-PGPEKPSAQQMgPAKQASQPGPGKP 385
Cdd:pfam09770  252 Q---PQQHPGQGHPVTILQRPQSPQPDPAQPSIQPQAQQFHQQPPPVpvQPTQIlQNPNRLSAARV-GYPQNPQPGVQPA 327
                          250       260
                   ....*....|....*....|...
gi 2022791524  386 PLQQTGPVKQAPPQSGPTKPPPQ 408
Cdd:pfam09770  328 PAHQAHRQQGSFGRQAPIITHPQ 350
PDZ1_GgSTXBP4-like cd06692
PDZ1 domain of Gallus gallus uncharacterized syntaxin-binding protein 4 (STXBP4) isoform X1, ...
4544-4612 3.13e-12

PDZ1 domain of Gallus gallus uncharacterized syntaxin-binding protein 4 (STXBP4) isoform X1, and related domains; PDZ (PSD-95 (Postsynaptic density protein 95), Dlg (Discs large protein), and ZO-1 (Zonula occludens-1)) domain 1 of Gallus gallus uncharacterized syntaxin-binding protein 4 (STXBP4) isoform X1, and related domains. Gallus gallus STXBP4 isoform X1 contains 2 PDZ domains (PDZ1 and PDZ2). PDZ domains usually bind in a sequence-specific manner to short peptide sequences located at the C-terminal end of their partner proteins (known as PDZ binding motifs). The PDZ superfamily includes canonical PDZ domains as well as those with circular permutations and domain swapping mediated by beta-strands. This STXBP4-like family domain is a canonical PDZ domain containing six beta-strands A-F and two alpha-helices (alpha-helix 1 and 2), arranged in the order: beta-strands A, B, C, alpha-helix 1, beta-strands D, E, alpha-helix 2 and beta-strand F.


Pssm-ID: 467179 [Multi-domain]  Cd Length: 88  Bit Score: 65.32  E-value: 3.13e-12
                           10        20        30        40        50        60
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....
gi 2022791524 4544 NGLGIRVVGGKEiPGSSGETGAYIAKILPGGNAEQTGKLIEGMQVLEWNGIPLTGKTYEEVQNIIIQQS 4612
Cdd:cd06692      8 KGLGIKIIGGYR-ENTGEEFGIFIKRILPGGLAATDGRLKEGDLILEVNGESLQGVTNERAVSILRSAS 75
Glutenin_hmw pfam03157
High molecular weight glutenin subunit; Members of this family include high molecular weight ...
192-445 4.59e-12

High molecular weight glutenin subunit; Members of this family include high molecular weight subunits of glutenin. This group of gluten proteins is thought to be largely responsible for the elastic properties of gluten, and hence, doughs. Indeed, glutenin high molecular weight subunits are classified as elastomeric proteins, because the glutenin network can withstand significant deformations without breaking, and return to the original conformation when the stress is removed. Elastomeric proteins differ considerably in amino acid sequence, but they are all polymers whose subunits consist of elastomeric domains, composed of repeated motifs, and non-elastic domains that mediate cross-linking between the subunits. The elastomeric domain motifs are all rich in glycine residues in addition to other hydrophobic residues. High molecular weight glutenin subunits have an extensive central elastomeric domain, flanked by two terminal non-elastic domains that form disulphide cross-links. The central elastomeric domain is characterized by the following three repeated motifs: PGQGQQ, GYYPTS[P/L]QQ, GQQ. It possesses overlapping beta-turns within and between the repeated motifs, and assumes a regular helical secondary structure with a diameter of approx. 1.9 nm and a pitch of approx. 1.5 nm.


Pssm-ID: 367362 [Multi-domain]  Cd Length: 786  Bit Score: 72.67  E-value: 4.59e-12
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2022791524  192 QKEQGKPEEQRGSAKLPAQQQSPKPVQPqGHVKPTPQQSESSKAVPQQQQPGEPKQIQKPGPGHPA-----------DSK 260
Cdd:pfam03157  136 QPGQGQQWYYPTSPQQPGQWQQPGQGQQ-GYYPTSPQQSGQRQQPGQGQQLRQGQQGQQSGQGQPGyyptssqqpgqLQQ 214
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2022791524  261 LEQVKQPPQP-RGPQKSQPQQSEP-------TKPVQQQTSAKPSAGPAKPLPQQPDSARTSSQAPPPAKPSSQQLGPVKQ 332
Cdd:pfam03157  215 TGQGQQGQQPeRGQQGQQPGQGQQpgqgqqgQQPGQPQQLGQGQQGYYPISPQQPRQWQQSGQGQQGYYPTSLQQPGQGQ 294
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2022791524  333 PSQQPARQGGPVKPSSQQAGPPKQPSQQPGpEKPSAQQMGPAKQASQPGPGKPPLQ-QTGPVKQAPPQSGPTkpppqtag 411
Cdd:pfam03157  295 SGYYPTSQQQAGQLQQEQQLGQEQQDQQPG-QGRQGQQPGQGQQGQQPAQGQQPGQgQPGYYPTSPQQPGQG-------- 365
                          250       260       270
                   ....*....|....*....|....*....|....
gi 2022791524  412 atkapvpQPGLTKPPGQQPEPEKPSQQKQASATQ 445
Cdd:pfam03157  366 -------QPGYYPTSQQQPQQGQQPEQGQQGQQQ 392
PAT1 pfam09770
Topoisomerase II-associated protein PAT1; Members of this family are necessary for accurate ...
199-447 8.53e-12

Topoisomerase II-associated protein PAT1; Members of this family are necessary for accurate chromosome transmission during cell division.


Pssm-ID: 401645 [Multi-domain]  Cd Length: 846  Bit Score: 71.99  E-value: 8.53e-12
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2022791524  199 EEQ-RGSAKLPAQQQSPKPVQPQGHVKPTPQQSEsskavPQQQQPGEPKQI-----QKPGPghpaDSKLeQVKQPPQPRG 272
Cdd:pfam09770   97 EEQvRFNRQQPAARAAQSSAQPPASSLPQYQYAS-----QQSQQPSKPVRTgyekyKEPEP----IPDL-QVDASLWGVA 166
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2022791524  273 PQKSQPQQSEPtkpvqqQTSAKPSAGPA---KPLPQQPDSARTSSQAPPPAkpssqqlgpvkqPSQQPARQGGPVKPSSQ 349
Cdd:pfam09770  167 PKKAAAPAPAP------QPAAQPASLPApsrKMMSLEEVEAAMRAQAKKPA------------QQPAPAPAQPPAAPPAQ 228
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2022791524  350 QAGPPKQPSQQPGPEKPSAQQMGPAKQASQPG--------PGKPPLQQTGPVKQAPPQSGPTKPPPQTA----------- 410
Cdd:pfam09770  229 QAQQQQQFPPQIQQQQQPQQQPQQPQQHPGQGhpvtilqrPQSPQPDPAQPSIQPQAQQFHQQPPPVPVqptqilqnpnr 308
                          250       260       270
                   ....*....|....*....|....*....|....*...
gi 2022791524  411 -GATKAPVPQPGltkPPGQQPEPEKPSQQKQASATQPA 447
Cdd:pfam09770  309 lSAARVGYPQNP---QPGVQPAPAHQAHRQQGSFGRQA 343
PRK07764 PRK07764
DNA polymerase III subunits gamma and tau; Validated
221-450 3.43e-11

DNA polymerase III subunits gamma and tau; Validated


Pssm-ID: 236090 [Multi-domain]  Cd Length: 824  Bit Score: 70.01  E-value: 3.43e-11
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2022791524  221 GHVKPTPQQSESSKAVPQQQQPGEPKQIQKPGPGHPADSklEQVKQPPQPRGPQKSQPQQSEPTKPVqqqTSAKPSAGPA 300
Cdd:PRK07764   580 GDWQVEAVVGPAPGAAGGEGPPAPASSGPPEEAARPAAP--AAPAAPAAPAPAGAAAAPAEASAAPA---PGVAAPEHHP 654
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2022791524  301 KPLPQQPDSARTSSQAPPPAKPSSQQLGPVKQPSQQPARQGGPvkPSSQQAGPPKQPSQQPGPEKPSAQQMGPAKQASQP 380
Cdd:PRK07764   655 KHVAVPDASDGGDGWPAKAGGAAPAAPPPAPAPAAPAAPAGAA--PAQPAPAPAATPPAGQADDPAAQPPQAAQGASAPS 732
                          170       180       190       200       210       220       230
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|..
gi 2022791524  381 --GPGKPPLQQTGPVKQAPPQSGPTKPPPQTAGATKAPVPQPGltkPPGQQPEPEKPSQQKQASATQPAESA 450
Cdd:PRK07764   733 paADDPVPLPPEPDDPPDPAGAPAQPPPPPAPAPAAAPAAAPP---PSPPSEEEEMAEDDAPSMDDEDRRDA 801
Glutenin_hmw pfam03157
High molecular weight glutenin subunit; Members of this family include high molecular weight ...
182-446 3.57e-11

High molecular weight glutenin subunit; Members of this family include high molecular weight subunits of glutenin. This group of gluten proteins is thought to be largely responsible for the elastic properties of gluten, and hence, doughs. Indeed, glutenin high molecular weight subunits are classified as elastomeric proteins, because the glutenin network can withstand significant deformations without breaking, and return to the original conformation when the stress is removed. Elastomeric proteins differ considerably in amino acid sequence, but they are all polymers whose subunits consist of elastomeric domains, composed of repeated motifs, and non-elastic domains that mediate cross-linking between the subunits. The elastomeric domain motifs are all rich in glycine residues in addition to other hydrophobic residues. High molecular weight glutenin subunits have an extensive central elastomeric domain, flanked by two terminal non-elastic domains that form disulphide cross-links. The central elastomeric domain is characterized by the following three repeated motifs: PGQGQQ, GYYPTS[P/L]QQ, GQQ. It possesses overlapping beta-turns within and between the repeated motifs, and assumes a regular helical secondary structure with a diameter of approx. 1.9 nm and a pitch of approx. 1.5 nm.


Pssm-ID: 367362 [Multi-domain]  Cd Length: 786  Bit Score: 69.98  E-value: 3.57e-11
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2022791524  182 EEAGRKQKVIQKEQGKPEEQRGSAKLPAQ-QQSPKPVQPQ-------GHVKPTPQQSESSKAVPQQQQPGEPKQIQKPGP 253
Cdd:pfam03157  213 QQTGQGQQGQQPERGQQGQQPGQGQQPGQgQQGQQPGQPQqlgqgqqGYYPISPQQPRQWQQSGQGQQGYYPTSLQQPGQ 292
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2022791524  254 GH--------------PADSKLEQVKQPPQP----RGPQKSQPQQSEPTKPVQQQTSAKPSAGPAKPL------------ 303
Cdd:pfam03157  293 GQsgyyptsqqqagqlQQEQQLGQEQQDQQPgqgrQGQQPGQGQQGQQPAQGQQPGQGQPGYYPTSPQqpgqgqpgyypt 372
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2022791524  304 -PQQPDSARTSSQAPPPAKPSSQQLGPVKQPSQQPARQGGPVKPSSqqagpPKQPSQ-QPGPEKPSAQQMGpakQASQPG 381
Cdd:pfam03157  373 sQQQPQQGQQPEQGQQGQQQGQGQQGQQPGQGQQPGQGQPGYYPTS-----PQQSGQgQPGYYPTSPQQSG---QGQQPG 444
                          250       260       270       280       290       300
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*.
gi 2022791524  382 PGKPPLQQTGPVKQAPPQSGPTKPPPQTAGATKAPVPQPGLTKPPGQQP-EPEKPSQQKQASATQP 446
Cdd:pfam03157  445 QGQQPGQEQPGQGQQPGQGQQGQQPGQPEQGQQPGQGQPGYYPTSPQQSgQGQQLGQWQQQGQGQP 510
PRK07764 PRK07764
DNA polymerase III subunits gamma and tau; Validated
251-451 4.94e-11

DNA polymerase III subunits gamma and tau; Validated


Pssm-ID: 236090 [Multi-domain]  Cd Length: 824  Bit Score: 69.24  E-value: 4.94e-11
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2022791524  251 PGPGHPADSKLEQVKQPPQPRGPQKSQPQQSEPTKPVQQQTSAKPSAGPAKPLPQQPDSARTSSQAPPPAKPSSQQLGPV 330
Cdd:PRK07764   592 PGAAGGEGPPAPASSGPPEEAARPAAPAAPAAPAAPAPAGAAAAPAEASAAPAPGVAAPEHHPKHVAVPDASDGGDGWPA 671
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2022791524  331 KQPSQQPARQGGPVKPSSQQAG---PPKQPSQQPGPEKPSAQQMGPAKQASQPGPGKPPlQQTGPVKQAPPQSGPTKPP- 406
Cdd:PRK07764   672 KAGGAAPAAPPPAPAPAAPAAPagaAPAQPAPAPAATPPAGQADDPAAQPPQAAQGASA-PSPAADDPVPLPPEPDDPPd 750
                          170       180       190       200
                   ....*....|....*....|....*....|....*....|....*
gi 2022791524  407 PQTAGATKAPVPQPGLTKPPGQQPEPEKPSQQkQASATQPAESAP 451
Cdd:PRK07764   751 PAGAPAQPPPPPAPAPAAAPAAAPPPSPPSEE-EEMAEDDAPSMD 794
PRK07764 PRK07764
DNA polymerase III subunits gamma and tau; Validated
251-451 5.28e-11

DNA polymerase III subunits gamma and tau; Validated


Pssm-ID: 236090 [Multi-domain]  Cd Length: 824  Bit Score: 69.24  E-value: 5.28e-11
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2022791524  251 PGPGHPADSKLEQVKQPPQPRGPQKSQPQQSEPTKPVQQQTSAKPSAGPAKPlpqqPDSARTSSQAPPPAKPSSQQLGPV 330
Cdd:PRK07764   588 VGPAPGAAGGEGPPAPASSGPPEEAARPAAPAAPAAPAAPAPAGAAAAPAEA----SAAPAPGVAAPEHHPKHVAVPDAS 663
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2022791524  331 KQPSQQPARQGGPVKPSSQQAGPPKQPSQQPGPEKPSAQQMGPAKQASQPGPGKPPLQQTGPVKQAPPQSG--PTKPPPQ 408
Cdd:PRK07764   664 DGGDGWPAKAGGAAPAAPPPAPAPAAPAAPAGAAPAQPAPAPAATPPAGQADDPAAQPPQAAQGASAPSPAadDPVPLPP 743
                          170       180       190       200
                   ....*....|....*....|....*....|....*....|...
gi 2022791524  409 TAGATKAPVPQPGLTKPPGQQPEPEKPSQQKQASATQPAESAP 451
Cdd:PRK07764   744 EPDDPPDPAGAPAQPPPPPAPAPAAAPAAAPPPSPPSEEEEMA 786
PRK07764 PRK07764
DNA polymerase III subunits gamma and tau; Validated
213-399 6.58e-11

DNA polymerase III subunits gamma and tau; Validated


Pssm-ID: 236090 [Multi-domain]  Cd Length: 824  Bit Score: 68.86  E-value: 6.58e-11
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2022791524  213 SPKPVQPQGHVKPTPQQSESSKAVPQQQQPGEPKQIQKPG--PGHPADSKLEQVKQPPQPRGPQKSQPQQSEPTKP---- 286
Cdd:PRK07764   591 APGAAGGEGPPAPASSGPPEEAARPAAPAAPAAPAAPAPAgaAAAPAEASAAPAPGVAAPEHHPKHVAVPDASDGGdgwp 670
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2022791524  287 ----VQQQTSAKPSAGPAKPLPQQPDSARTSSQAPPPAKPSSQQLGPVKQPSQQPARQGGPVKPSSQQAGPPKQPSQQPG 362
Cdd:PRK07764   671 akagGAAPAAPPPAPAPAAPAAPAGAAPAQPAPAPAATPPAGQADDPAAQPPQAAQGASAPSPAADDPVPLPPEPDDPPD 750
                          170       180       190
                   ....*....|....*....|....*....|....*..
gi 2022791524  363 PEKPSaqqmGPAKQASQPGPGKPPLQQTGPVKQAPPQ 399
Cdd:PRK07764   751 PAGAP----AQPPPPPAPAPAAAPAAAPPPSPPSEEE 783
PHA03307 PHA03307
transcriptional regulator ICP4; Provisional
203-450 9.78e-11

transcriptional regulator ICP4; Provisional


Pssm-ID: 223039 [Multi-domain]  Cd Length: 1352  Bit Score: 68.66  E-value: 9.78e-11
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2022791524  203 GSAKLPAQQQSPKPVQPQGHVKP-TPQQSESSKAVPQQQQPGEPKQIQKPGPGHPADSKLEQVKQPPQPRGPQKSQPQQS 281
Cdd:PHA03307    75 PGTEAPANESRSTPTWSLSTLAPaSPAREGSPTPPGPSSPDPPPPTPPPASPPPSPAPDLSEMLRPVGSPGPPPAASPPA 154
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2022791524  282 EPTKPVQQQTSAKPSAGPAKPLPQQPDSARTSSQAPPPAKPSSQQLG------PVKQPSQQPARQGGPVKPSSQQAGPPK 355
Cdd:PHA03307   155 AGASPAAVASDAASSRQAALPLSSPEETARAPSSPPAEPPPSTPPAAasprppRRSSPISASASSPAPAPGRSAADDAGA 234
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2022791524  356 QPSQQPGPEKPSAqqmGPAKQASQPGPGKPPLQQTGPVKQA----PPQSGPTKPPPQTAGATKAPVPQPGltkPPGQQPE 431
Cdd:PHA03307   235 SSSDSSSSESSGC---GWGPENECPLPRPAPITLPTRIWEAsgwnGPSSRPGPASSSSSPRERSPSPSPS---SPGSGPA 308
                          250
                   ....*....|....*....
gi 2022791524  432 PEKPSQQKQASATQPAESA 450
Cdd:PHA03307   309 PSSPRASSSSSSSRESSSS 327
PHA03378 PHA03378
EBNA-3B; Provisional
180-447 1.36e-10

EBNA-3B; Provisional


Pssm-ID: 223065 [Multi-domain]  Cd Length: 991  Bit Score: 68.17  E-value: 1.36e-10
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2022791524  180 AHEEAGRKQKVIQKEQ--GKPEEQRGSAKL---PAQQQSPKPVQPQGHVKPTPQQSESSKAVPQQQQPGEPKQ-IQKPGP 253
Cdd:PHA03378   425 AIEEEHRKKKAARTEQprATPHSQAPTVVLhrpPTQPLEGPTGPLSVQAPLEPWQPLPHPQVTPVILHQPPAQgVQAHGS 504
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2022791524  254 G----HPADSKLEQ-------VKQPPQPRGPQKS--------QPQQSEPTKPVQQQTSAKPSAGPAkPLPQQPDSARTSS 314
Cdd:PHA03378   505 MldllEKDDEDMEQrvmatllPPSPPQPRAGRRApcvytedlDIESDEPASTEPVHDQLLPAPGLG-PLQIQPLTSPTTS 583
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2022791524  315 QAPPPAKPSSQQLGPVKQPSQQParqggpvKPSSQQAGPPKQ--PSQQPGPEKPSAQQMGPAKQASQPGPGKPPLQQTGP 392
Cdd:PHA03378   584 QLASSAPSYAQTPWPVPHPSQTP-------EPPTTQSHIPETsaPRQWPMPLRPIPMRPLRMQPITFNVLVFPTPHQPPQ 656
                          250       260       270       280       290       300
                   ....*....|....*....|....*....|....*....|....*....|....*....|..
gi 2022791524  393 VKQAPPQSGPTKPP-----PQTAGATKAPVPQ--PGLTKPPGQQPEPEKPSQQKQASATQPA 447
Cdd:PHA03378   657 VEITPYKPTWTQIGhipyqPSPTGANTMLPIQwaPGTMQPPPRAPTPMRPPAAPPGRAQRPA 718
PRK07764 PRK07764
DNA polymerase III subunits gamma and tau; Validated
328-720 1.46e-10

DNA polymerase III subunits gamma and tau; Validated


Pssm-ID: 236090 [Multi-domain]  Cd Length: 824  Bit Score: 67.70  E-value: 1.46e-10
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2022791524  328 GPVKQPSQQPARQGGPVkPSSQQAGPPKQPSQQPGPEKPSAQQMGPAKQASQPGPGKPPLQQTGPVKQAPPQSGPTKPPP 407
Cdd:PRK07764   389 GGAGAPAAAAPSAAAAA-PAAAPAPAAAAPAAAAAPAPAAAPQPAPAPAPAPAPPSPAGNAPAGGAPSPPPAAAPSAQPA 467
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2022791524  408 QTAGATKAPVPQPGLTKPPGQQPEPEkPSQQKQASATQPA--------------ESAPKKTFcplcTTTELLLHTPEKAN 473
Cdd:PRK07764   468 PAPAAAPEPTAAPAPAPPAAPAPAAA-PAAPAAPAAPAGAddaatlrerwpeilAAVPKRSR----KTWAILLPEATVLG 542
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2022791524  474 Y--NTCTQCHTVVCSLCGFNpNPHITE-----IKEWLCLNCQMQRALGGDLAPGHGPGPQLPPPKQKTPTPPSTAKPPPQ 546
Cdd:PRK07764   543 VrgDTLVLGFSTGGLARRFA-SPGNAEvlvtaLAEELGGDWQVEAVVGPAPGAAGGEGPPAPASSGPPEEAARPAAPAAP 621
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2022791524  547 PQPGQKKDASPKPDPAQLAEPKKPPPQKKQPSmPGSPPVKAKTSRAEPADTSQQiDPAPKSDRAKPSQAEDKQKQPsiqK 626
Cdd:PRK07764   622 AAPAAPAPAGAAAAPAEASAAPAPGVAAPEHH-PKHVAVPDASDGGDGWPAKAG-GAAPAAPPPAPAPAAPAAPAG---A 696
                          330       340       350       360       370       380       390       400
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2022791524  627 PAADTVLTPAAPGLKQDSAGPRPPPTQQKVTDSPKPELAKPSQDTHPA-GEKPDSKPLPQVSRQKSDPKLSAQPGARSEA 705
Cdd:PRK07764   697 APAQPAPAPAATPPAGQADDPAAQPPQAAQGASAPSPAADDPVPLPPEpDDPPDPAGAPAQPPPPPAPAPAAAPAAAPPP 776
                          410
                   ....*....|....*
gi 2022791524  706 KTQKPVEPAQVKDDP 720
Cdd:PRK07764   777 SPPSEEEEMAEDDAP 791
PspC_subgroup_2 NF033839
pneumococcal surface protein PspC, LPXTG-anchored form; The pneumococcal surface protein PspC, ...
208-454 2.54e-10

pneumococcal surface protein PspC, LPXTG-anchored form; The pneumococcal surface protein PspC, as described in Streptococcus pneumoniae, is a repetitive and highly variable protein, recognized by a conserved N-terminal domain and also by genomic location. This form, subgroup 2, is anchored covalently after cleavage by sortase at a C-terminal LPXTG site. The other form, subgroup 1, has variable numbers of a choline-binding repeat in the C-terminal region, and is also known as choline-binding protein A.


Pssm-ID: 468202 [Multi-domain]  Cd Length: 557  Bit Score: 66.72  E-value: 2.54e-10
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2022791524  208 PAQQQSPKPVQPQGHVKPTPQQSEssKAVPQQQQPGEPK---QIQKPGPGHPADSKLEQVKQPPQPRGPQ-KSQPQQSEP 283
Cdd:NF033839   284 PKEPGNKKPSAPKPGMQPSPQPEK--KEVKPEPETPKPEvkpQLEKPKPEVKPQPEKPKPEVKPQLETPKpEVKPQPEKP 361
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2022791524  284 TKPVQQQTSAKPSAGPAKPLPQQPDSARTSSQAPPPAKPSSQQLGPVKQPsqQPARQGGPVKPSSQqagpPKQPSQQPGP 363
Cdd:NF033839   362 KPEVKPQPEKPKPEVKPQPETPKPEVKPQPEKPKPEVKPQPEKPKPEVKP--QPEKPKPEVKPQPE----KPKPEVKPQP 435
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2022791524  364 EKPSAQqmgPAKQASQPGPGKPPLQQTgPVKQAPPQsgPTKPPPQTAGATKAPVPQPGLTKPPGQQPE-PEKPSQQKQ-- 440
Cdd:NF033839   436 EKPKPE---VKPQPEKPKPEVKPQPET-PKPEVKPQ--PEKPKPEVKPQPEKPKPDNSKPQADDKKPStPNNLSKDKQps 509
                          250
                   ....*....|....*.
gi 2022791524  441 --ASATQPAESAPKKT 454
Cdd:NF033839   510 nqASTNEKATNKPKKS 525
Atrophin-1 pfam03154
Atrophin-1 family; Atrophin-1 is the protein product of the dentatorubral-pallidoluysian ...
208-447 6.45e-10

Atrophin-1 family; Atrophin-1 is the protein product of the dentatorubral-pallidoluysian atrophy (DRPLA) gene. DRPLA OMIM:125370 is a progressive neurodegenerative disorder. It is caused by the expansion of a CAG repeat in the DRPLA gene on chromosome 12p. This results in an extended polyglutamine region in atrophin-1, that is thought to confer toxicity to the protein, possibly through altering its interactions with other proteins. The expansion of a CAG repeat is also the underlying defect in six other neurodegenerative disorders, including Huntington's disease. One interaction of expanded polyglutamine repeats that is thought to be pathogenic is that with the short glutamine repeat in the transcriptional coactivator CREB binding protein, CBP. This interaction draws CBP away from its usual nuclear location to the expanded polyglutamine repeat protein aggregates that are characteriztic of the polyglutamine neurodegenerative disorders. This interferes with CBP-mediated transcription and causes cytotoxicity.


Pssm-ID: 460830 [Multi-domain]  Cd Length: 991  Bit Score: 65.94  E-value: 6.45e-10
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2022791524  208 PAQQQSPKPVQP--------QGHVKPTPQQSESSKAVPQQQQPGEPKQIQKPGPGH-----PADSKLEQVKQPPQPRGPQ 274
Cdd:pfam03154  286 PSHMQHPVPPQPfpltpqssQSQVPPGPSPAAPGQSQQRIHTPPSQSQLQSQQPPReqplpPAPLSMPHIKPPPTTPIPQ 365
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2022791524  275 KSQPQ-QSEPTKPVQQQTSAKPSAGPakPLPQQPDSARTSSQAPPPAKPSSQQLGPVKQPSQQPARQGgPVKPSSQQAGP 353
Cdd:pfam03154  366 LPNPQsHKHPPHLSGPSPFQMNSNLP--PPPALKPLSSLSTHHPPSAHPPPLQLMPQSQQLPPPPAQP-PVLTQSQSLPP 442
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2022791524  354 PKQPSQQPGPEKPSAQQMGPAKQASQPGPGKPPLQQTGPVKQAPPQSGPTKPPPQTAGATKAPVP--------------- 418
Cdd:pfam03154  443 PAASHPPTSGLHQVPSQSPFPQHPFVPGGPPPITPPSGPPTSTSSAMPGIQPPSSASVSSSGPVPaavscplppvqikee 522
                          250       260       270
                   ....*....|....*....|....*....|....*
gi 2022791524  419 ------QPGLTKPPGQQPEPEKPSQQKQASATQPA 447
Cdd:pfam03154  523 aldeaeEPESPPPPPRSPSPEPTVVNTPSHASQSA 557
Glutenin_hmw pfam03157
High molecular weight glutenin subunit; Members of this family include high molecular weight ...
208-446 9.58e-10

High molecular weight glutenin subunit; Members of this family include high molecular weight subunits of glutenin. This group of gluten proteins is thought to be largely responsible for the elastic properties of gluten, and hence, doughs. Indeed, glutenin high molecular weight subunits are classified as elastomeric proteins, because the glutenin network can withstand significant deformations without breaking, and return to the original conformation when the stress is removed. Elastomeric proteins differ considerably in amino acid sequence, but they are all polymers whose subunits consist of elastomeric domains, composed of repeated motifs, and non-elastic domains that mediate cross-linking between the subunits. The elastomeric domain motifs are all rich in glycine residues in addition to other hydrophobic residues. High molecular weight glutenin subunits have an extensive central elastomeric domain, flanked by two terminal non-elastic domains that form disulphide cross-links. The central elastomeric domain is characterized by the following three repeated motifs: PGQGQQ, GYYPTS[P/L]QQ, GQQ. It possesses overlapping beta-turns within and between the repeated motifs, and assumes a regular helical secondary structure with a diameter of approx. 1.9 nm and a pitch of approx. 1.5 nm.


Pssm-ID: 367362 [Multi-domain]  Cd Length: 786  Bit Score: 65.35  E-value: 9.58e-10
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2022791524  208 PAQQQSPKPVQPQghvkpTPQQSESSKAVPQQQQPGEPKQIQKPGPGHPADSkleqvkqpPQPRGPQKSQPQQSEPTKPV 287
Cdd:pfam03157  123 PGQASPQRPGQGQ-----QPGQGQQWYYPTSPQQPGQWQQPGQGQQGYYPTS--------PQQSGQRQQPGQGQQLRQGQ 189
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2022791524  288 QQQTSAKPSAGPAKPLPQQPDSARTSSQAPPPAKPSSQQLGPVKQPSQQPAR-QGGPVKPSSQQAGPPKQPSQQPGPEKP 366
Cdd:pfam03157  190 QGQQSGQGQPGYYPTSSQQPGQLQQTGQGQQGQQPERGQQGQQPGQGQQPGQgQQGQQPGQPQQLGQGQQGYYPISPQQP 269
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2022791524  367 -SAQQMGPAKQASQP-GPGKPPLQQTGPVKQAPPQSGPTKPPPQTAGATKAPVPQPGLT-KPPGQQPEPEKPSQQKQASA 443
Cdd:pfam03157  270 rQWQQSGQGQQGYYPtSLQQPGQGQSGYYPTSQQQAGQLQQEQQLGQEQQDQQPGQGRQgQQPGQGQQGQQPAQGQQPGQ 349

                   ...
gi 2022791524  444 TQP 446
Cdd:pfam03157  350 GQP 352
PRK10263 PRK10263
DNA translocase FtsK; Provisional
204-451 1.07e-09

DNA translocase FtsK; Provisional


Pssm-ID: 236669 [Multi-domain]  Cd Length: 1355  Bit Score: 65.49  E-value: 1.07e-09
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2022791524  204 SAKLPAQQQSPKPVQPqghVKPTPQQSESSKAVPQQQQPGEPKqiqkPGPGHPADSKLEQVKQ-PPQPRGPQKSQPQQSE 282
Cdd:PRK10263   324 AAATTATQSWAAPVEP---VTQTPPVASVDVPPAQPTVAWQPV----PGPQTGEPVIAPAPEGyPQQSQYAQPAVQYNEP 396
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2022791524  283 PTKPVQQQTSakPSAGPAKPLPQQPDSARTSSQAPPPAKPSSQQLGPVKQPSQQPARQGGPVKPSSQQAgpPKQPSQQPG 362
Cdd:PRK10263   397 LQQPVQPQQP--YYAPAAEQPAQQPYYAPAPEQPAQQPYYAPAPEQPVAGNAWQAEEQQSTFAPQSTYQ--TEQTYQQPA 472
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2022791524  363 PEKPSAQQ---------MGPAKQASQPGPGKPPLQQTGPVKQAPPQSgptkppPQTAGATKAPVPQpgltkpPGQQPEPE 433
Cdd:PRK10263   473 AQEPLYQQpqpveqqpvVEPEPVVEETKPARPPLYYFEEVEEKRARE------REQLAAWYQPIPE------PVKEPEPI 540
                          250
                   ....*....|....*....
gi 2022791524  434 KPS-QQKQASATQPAESAP 451
Cdd:PRK10263   541 KSSlKAPSVAAVPPVEAAA 559
PDZ2-PTPN13_FRMPD2-like cd06792
PDZ domain 2 of tyrosine kinase PTPN13, FERM and PDZ domain-containing protein 2 (FRMPD2), and ...
4544-4614 1.28e-09

PDZ domain 2 of tyrosine kinase PTPN13, FERM and PDZ domain-containing protein 2 (FRMPD2), and similar domains; PDZ (PSD-95 (Postsynaptic density protein 95), Dlg (Discs large protein), and ZO-1 (Zonula occludens-1)) domain 2 of human PTPN13, and related domains. PTPN13, also known as Fas-associated protein-tyrosine phosphatase 1 (FAP-1), protein-tyrosine phosphatase 1E (PTP-E1), and protein-tyrosine phosphatase (PTPL1), negatively regulates FAS-mediated apoptosis and NGFR-mediated pro-apoptotic signaling, and may also regulate phosphoinositide 3-kinase (PI3K) signaling. It contains 5 PDZ domains; interaction partners of its second PDZ domain (PDZ2) include the Fas receptor (TNFRSF6) and thyroid receptor-interacting protein 6 (TRIP6). The second PDZ (PDZ2) domain, but not PDZ1 or PDZ3, of FRMPD2 binds to GluN2A and GluN2B, two subunits of N-methyl-d-aspartic acid (NMDA) receptors. Other binding partners of the FRMPDZ2 PDZ2 domain include NOD2, and catenin family members, delta catenin (CTNND2), armadillo repeat gene deleted in velo-cardio-facial syndrome (ARVCF) and p0071 (also known as plakophilin 4; PKP4). PDZ domains usually bind in a sequence-specific manner to short peptide sequences located at the C-terminal end of their partner proteins (known as PDZ binding motifs). The PDZ superfamily includes canonical PDZ domains as well as those with circular permutations and domain swapping mediated by beta-strands. This PTPN13-like family PDZ2 domain is a canonical PDZ domain containing six beta-strands A-F and two alpha-helices (alpha-helix 1 and 2), arranged in the order: beta-strands A, B, C, alpha-helix 1, beta-strands D, E, alpha-helix 2 and beta-strand F.


Pssm-ID: 467254 [Multi-domain]  Cd Length: 87  Bit Score: 57.61  E-value: 1.28e-09
                           10        20        30        40        50        60        70
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|.
gi 2022791524 4544 NGLGIRVVGGKEIPGSSGetGAYIAKILPGGNAEQTGKLIEGMQVLEWNGIPLTGKTYEEVQNiIIQQSGE 4614
Cdd:cd06792     12 GSLGISVTGGINTSVRHG--GIYVKSLVPGGAAEQDGRIQKGDRLLEVNGVSLEGVTHKQAVE-CLKNAGQ 79
PRK07764 PRK07764
DNA polymerase III subunits gamma and tau; Validated
196-450 1.71e-09

DNA polymerase III subunits gamma and tau; Validated


Pssm-ID: 236090 [Multi-domain]  Cd Length: 824  Bit Score: 64.24  E-value: 1.71e-09
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2022791524  196 GKPEEQRGSAKLPAQQQSPKPVQPQGHVKPTPQQSESSKAVPQQQQPGEPKQIQKPGPGHPADSKLEQVKQPPQPR---- 271
Cdd:PRK07764   392 GAPAAAAPSAAAAAPAAAPAPAAAAPAAAAAPAPAAAPQPAPAPAPAPAPPSPAGNAPAGGAPSPPPAAAPSAQPApapa 471
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2022791524  272 -GPQKSQPQQSEPTKPVQQQTSAKPSAGPAKPLPQ--------------------------------------------- 305
Cdd:PRK07764   472 aAPEPTAAPAPAPPAAPAPAAAPAAPAAPAAPAGAddaatlrerwpeilaavpkrsrktwaillpeatvlgvrgdtlvlg 551
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2022791524  306 -----------QPDSAR---------------------TSSQAPPPAKPSSQQLGPVKQPSQQPARQGGPVKPSSQQAGP 353
Cdd:PRK07764   552 fstgglarrfaSPGNAEvlvtalaeelggdwqveavvgPAPGAAGGEGPPAPASSGPPEEAARPAAPAAPAAPAAPAPAG 631
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2022791524  354 PKQPSQQPGPEKPSAQQMGPAKQ-------ASQPGPGKPPLQQTGPVKQAPPQSGPTKPPPQTAGATKAPVPQPGLTKPP 426
Cdd:PRK07764   632 AAAAPAEASAAPAPGVAAPEHHPkhvavpdASDGGDGWPAKAGGAAPAAPPPAPAPAAPAAPAGAAPAQPAPAPAATPPA 711
                          330       340
                   ....*....|....*....|....
gi 2022791524  427 GQQPEPEKPSQQKQASATQPAESA 450
Cdd:PRK07764   712 GQADDPAAQPPQAAQGASAPSPAA 735
PLN03209 PLN03209
translocon at the inner envelope of chloroplast subunit 62; Provisional
205-452 2.23e-09

translocon at the inner envelope of chloroplast subunit 62; Provisional


Pssm-ID: 178748 [Multi-domain]  Cd Length: 576  Bit Score: 63.79  E-value: 2.23e-09
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2022791524  205 AKLPAQQQSPKPVQPQGHVKPTPQQSESSKA--VPQQQQPGEPKQiqkpgpghpadskleqvkQPPQPRGPQKSQPQQSE 282
Cdd:PLN03209   321 AKIPSQRVPPKESDAADGPKPVPTKPVTPEApsPPIEEEPPQPKA------------------VVPRPLSPYTAYEDLKP 382
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2022791524  283 PTKPVQQQTSAKPSAGPAKPLPQQPDSARTSsqaPPPAKPSSQqlgPVKQPSQQPARQGGPVKPSSqqAGPPKQPSQQPG 362
Cdd:PLN03209   383 PTSPIPTPPSSSPASSKSVDAVAKPAEPDVV---PSPGSASNV---PEVEPAQVEAKKTRPLSPYA--RYEDLKPPTSPS 454
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2022791524  363 PEKPSAQQMGPAKQASQPGPGKPPLQQTGPVKQAPPQSGPTKPPPQTAGATKAPVPQPGLTKPPGQQPEPEKPSQQKQAS 442
Cdd:PLN03209   455 PTAPTGVSPSVSSTSSVPAVPDTAPATAATDAAAPPPANMRPLSPYAVYDDLKPPTSPSPAAPVGKVAPSSTNEVVKVGN 534
                          250
                   ....*....|
gi 2022791524  443 ATQPAESAPK 452
Cdd:PLN03209   535 SAPPTALADE 544
PHA03307 PHA03307
transcriptional regulator ICP4; Provisional
294-684 2.58e-09

transcriptional regulator ICP4; Provisional


Pssm-ID: 223039 [Multi-domain]  Cd Length: 1352  Bit Score: 64.04  E-value: 2.58e-09
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2022791524  294 KPSAGPAKPLPQQPDSARTSSQAPPPAKPSSQQLGPVKQPSQQParqGGPVKPSSQQAGPPKQPSQQPGPEKPSAQQMGP 373
Cdd:PHA03307    70 GPPPGPGTEAPANESRSTPTWSLSTLAPASPAREGSPTPPGPSS---PDPPPPTPPPASPPPSPAPDLSEMLRPVGSPGP 146
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2022791524  374 AKQASQPGPGKPPLQQTGPVKQAPPQSGPTKPPPQTAGATKAP-VPQPGLTKPPGQQPEPEKPSQQKQASATQPAESAPK 452
Cdd:PHA03307   147 PPAASPPAAGASPAAVASDAASSRQAALPLSSPEETARAPSSPpAEPPPSTPPAAASPRPPRRSSPISASASSPAPAPGR 226
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2022791524  453 KTFCPLCTTTELLLHTPEKanyntctQCHTVVCSLCgfnPNPHITEIKEWLCLncqmqRALGGDLAPGHGPGPQLPPPKQ 532
Cdd:PHA03307   227 SAADDAGASSSDSSSSESS-------GCGWGPENEC---PLPRPAPITLPTRI-----WEASGWNGPSSRPGPASSSSSP 291
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2022791524  533 KTPTPPSTAKPPPQPQPGQKKDASPKPDPAQLAEPKKPPPQKKQPSMPGSPPVKAKTSRAEPADTSQQIDPAPKSDRAKP 612
Cdd:PHA03307   292 RERSPSPSPSSPGSGPAPSSPRASSSSSSSRESSSSSTSSSSESSRGAAVSPGPSPSRSPSPSRPPPPADPSSPRKRPRP 371
                          330       340       350       360       370       380       390
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*.
gi 2022791524  613 SQAEDKQKQPSIQKPAADTVLTPAAPGLKQDSAGPRP---PPTQQKVTDSPKPELAKPSQDTHPAGEK-PDSKPLP 684
Cdd:PHA03307   372 SRAPSSPAASAGRPTRRRARAAVAGRARRRDATGRFPagrPRPSPLDAGAASGAFYARYPLLTPSGEPwPGSPPPP 447
Pro-rich pfam15240
Proline-rich protein; This family includes several eukaryotic proline-rich proteins.
238-366 3.47e-09

Proline-rich protein; This family includes several eukaryotic proline-rich proteins.


Pssm-ID: 464580 [Multi-domain]  Cd Length: 167  Bit Score: 58.90  E-value: 3.47e-09
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2022791524  238 QQQQPGEPKQIQKPGPGHPADSKLEQVKQPPQPRGPQKSQPQ--QSEPTKPVQQQTSAKPSAGPAKPLPQQPDSARTSSQ 315
Cdd:pfam15240   38 QSQQGGQGPQGPPPGGFPPQPPASDDPPGPPPPGGPQQPPPQggKQKPQGPPPQGGPRPPPGKPQGPPPQGGNQQQGPPP 117
                           90       100       110       120       130
                   ....*....|....*....|....*....|....*....|....*....|.
gi 2022791524  316 APPPAKPSSQQLGPVKQPSQQparQGGPVKPSSQQAGPPKQPSQQPGPEKP 366
Cdd:pfam15240  118 PGKPQGPPPQGGGPPPQGGNQ---QGPPPPPPGNPQGPPQRPPQPGNPQGP 165
PTZ00449 PTZ00449
104 kDa microneme/rhoptry antigen; Provisional
187-464 7.34e-09

104 kDa microneme/rhoptry antigen; Provisional


Pssm-ID: 185628 [Multi-domain]  Cd Length: 943  Bit Score: 62.40  E-value: 7.34e-09
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2022791524  187 KQKVIQKEQGKPEEQRGSAKLPAQQQSPkPVQPqGHVKPTPQQSESSKAVPQQQQPGEPKQI------QKPGPG--HPAD 258
Cdd:PTZ00449   493 KKKLAPIEEEDSDKHDEPPEGPEASGLP-PKAP-GDKEGEEGEHEDSKESDEPKEGGKPGETkegevgKKPGPAkeHKPS 570
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2022791524  259 SKLEQVKQPPQPRGPqKSQPQQSEPTKPVQQQTSAKPSAGPAkplPQQPDSARTSSQAPPPAKPSSQQLGPVKQPSQQPA 338
Cdd:PTZ00449   571 KIPTLSKKPEFPKDP-KHPKDPEEPKKPKRPRSAQRPTRPKS---PKLPELLDIPKSPKRPESPKSPKRPPPPQRPSSPE 646
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2022791524  339 RQGGPVKPSSQQagPPKQPSQQPGPE----------KPSAQQMGPAKQASQPGPGKPPLQQTGPVKQAPPQSGPTKPPPQ 408
Cdd:PTZ00449   647 RPEGPKIIKSPK--PPKSPKPPFDPKfkekfyddylDAAAKSKETKTTVVLDESFESILKETLPETPGTPFTTPRPLPPK 724
                          250       260       270       280       290
                   ....*....|....*....|....*....|....*....|....*....|....*.
gi 2022791524  409 TAGATKAPVPQPGltKPPGQQPEPEKPSQQKQASATQPAESaPKKTFCPLCTTTEL 464
Cdd:PTZ00449   725 LPRDEEFPFEPIG--DPDAEQPDDIEFFTPPEEERTFFHET-PADTPLPDILAEEF 777
Androgen_recep pfam02166
Androgen receptor;
221-450 8.61e-09

Androgen receptor;


Pssm-ID: 426632 [Multi-domain]  Cd Length: 501  Bit Score: 61.48  E-value: 8.61e-09
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2022791524  221 GHVKPTPQqSESSKAVPQQQQPGEPKQIQKPGPGHPADSKleqVKQPPQPRGPQKSQPQQSEPTKPVQQQTSakpsagpa 300
Cdd:pfam02166    3 GRVYPRPP-AKTFRGAFQNLFQSVREVIQNPGPRHPEAAG---GAAPPGARLQHQQQQQQQVPQQPQQQESS-------- 70
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2022791524  301 kplPQQPDSARTSSQAPPPAKPSSQQLGPVKQPSQQPARQGGPVKPSSQQAGPPKQPSqQPGPEKPSAQQMGPAKQASQP 380
Cdd:pfam02166   71 ---PRQPQASVQPQQAGDDGSPPAHNRGPAGYLALEDDEQPQPSQAQPAAECCPENGC-VPEPGAAAAAGKGLPQQAVAP 146
                          170       180       190       200       210       220       230
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2022791524  381 GpgkPPLQQTGPVKQAPPQSGPTKPPPQTAGATKAPVPQPGLTKPPGQQPEPEKPSQQKQASATQPAESA 450
Cdd:pfam02166  147 A---APDDDDSAAPSTLSLLGPSFPGLSGCSADLKDILAEAGTMQLLQQQQQEHHHQQHQHQQQQQQEAA 213
PDZ2_PDZD2-like cd06758
PDZ domain 2 of PDZ domain containing 2 (PDZD2), and related domains; PDZ (PSD-95 ...
4537-4608 1.02e-08

PDZ domain 2 of PDZ domain containing 2 (PDZD2), and related domains; PDZ (PSD-95 (Postsynaptic density protein 95), Dlg (Discs large protein), and ZO-1 (Zonula occludens-1)) domain 2 of PDZD2, also known as KIAA0300, PIN-1, activated in prostate cancer (AIPC) and PDZ domain-containing protein 3 (PDZK3). PDZD2 has seven PDZ domains, and is expressed at exceptionally high levels in the pancreas and certain cancer tissues such as prostate cancer. It promotes the proliferation of insulinoma cells and is upregulated during prostate tumorigenesis. In osteosarcoma (OS), the microRNA miR-363 acts as a tumor suppressor by inhibiting PDZD2. PDZ domains usually bind in a sequence-specific manner to short peptide sequences located at the C-terminal end of their partner proteins (known as PDZ binding motifs). The PDZ superfamily includes canonical PDZ domains as well as those with circular permutations and domain swapping mediated by beta-strands. This PDZD2-like family PDZ2 domain is a canonical PDZ domain containing six beta-strands A-F and two alpha-helices (alpha-helix 1 and 2), arranged in the order: beta-strands A, B, C, alpha-helix 1, beta-strands D, E, alpha-helix 2 and beta-strand F.


Pssm-ID: 467239 [Multi-domain]  Cd Length: 88  Bit Score: 55.05  E-value: 1.02e-08
                           10        20        30        40        50        60        70
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....
gi 2022791524 4537 KDHTVSGN-GLGIRVVGGKeipGSS-GETGAYIAKILPGGNAEQTGKLIEGMQVLEWNGIPLTGKTYEEVQNII 4608
Cdd:cd06758      4 KMHLLKEKgGLGIQITGGK---GSKrGDIGIFVAGVEEGGSADRDGRLKKGDELLMINGQSLIGLSHQEAVAIL 74
PDZ7_MUPP1-PD6_PATJ-like cd06671
PDZ domain 7 of multi-PDZ-domain protein 1 (MUPP1), PDZ domain 6 of PATJ (protein-associated ...
4526-4608 1.07e-08

PDZ domain 7 of multi-PDZ-domain protein 1 (MUPP1), PDZ domain 6 of PATJ (protein-associated tight junction) and related domains; PDZ (PSD-95 (Postsynaptic density protein 95), Dlg (Discs large protein), and ZO-1 (Zonula occludens-1)) domain 7 of MUPP1 and PDZ domain 6 of PATJ, and related domains. MUPP1 and PATJ serve as scaffolding proteins linking different proteins and protein complexes involved in the organization of tight junctions and epithelial polarity. MUPP1 contains an L27 (Lin-2 and Lin-7 binding) domain and 13 PDZ domains. PATJ (also known as INAD-like) contains an L27 domain and ten PDZ domains. MUPP1 and PATJ share several binding partners, including junctional adhesion molecules (JAM), zonula occludens (ZO)-3, Pals1 (protein associated with Lin-7), Par (partitioning defective)-6 proteins, and nectins (adherence junction adhesion molecules). PATJ lacks 3 PDZ domains seen in MUPP1: PDZ6, 9, and 13; consequently, MUPP1 PDZ7 and 8 align with PATJ PDZ6 and 7; and MUPP1 PDZ domains 10-12 align with PATJ PDZ domains 8-10. PDZ domains usually bind in a sequence-specific manner to short peptide sequences located at the C-terminal end of their partner proteins (known as PDZ binding motifs). The PDZ superfamily includes canonical PDZ domains as well as those with circular permutations and domain swapping mediated by beta-strands. This MUPP1-like family PDZ7 domain is a canonical PDZ domain containing six beta-strands A-F and two alpha-helices (alpha-helix 1 and 2), arranged in the order: beta-strands A, B, C, alpha-helix 1, beta-strands D, E, alpha-helix 2 and beta-strand F.


Pssm-ID: 467159 [Multi-domain]  Cd Length: 96  Bit Score: 55.40  E-value: 1.07e-08
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2022791524 4526 PHARLKLLRDPkdhtvsGNGLGIRVVGGKEIPG--SSGET--GAYIAKILPGGNAEQTGKLIEGMQVLEWNGIPLTGKTY 4601
Cdd:cd06671      1 PPRRVELWREP------GKSLGISIVGGRVMGSrlSNGEEirGIFIKHVLEDSPAGRNGTLKTGDRILEVNGVDLRNATH 74

                   ....*..
gi 2022791524 4602 EEVQNII 4608
Cdd:cd06671     75 EEAVEAI 81
PRK12323 PRK12323
DNA polymerase III subunit gamma/tau;
192-417 1.17e-08

DNA polymerase III subunit gamma/tau;


Pssm-ID: 237057 [Multi-domain]  Cd Length: 700  Bit Score: 61.43  E-value: 1.17e-08
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2022791524  192 QKEQGKPEEQRGSAKLPAQQQSPKPVQPQGHVKPTPQQSESSKAVPQQQQPGEPKQIQKPGPGHPADSKLEQVKQPPQPR 271
Cdd:PRK12323   364 RPGQSGGGAGPATAAAAPVAQPAPAAAAPAAAAPAPAAPPAAPAAAPAAAAAARAVAAAPARRSPAPEALAAARQASARG 443
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2022791524  272 GPQKSQPQQSEPTKPVQQQTSAKPSAGPAkplpqqpdsARTSSQAPPPAKPSSQqlgPVKQPSQQPARQGGPVK---PSS 348
Cdd:PRK12323   444 PGGAPAPAPAPAAAPAAAARPAAAGPRPV---------AAAAAAAPARAAPAAA---PAPADDDPPPWEELPPEfasPAP 511
                          170       180       190       200       210       220
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....
gi 2022791524  349 QQAGPPKQPSQQPGPEKPSAQQMGPAKQASQPGPGKPPLqqtgPVKQAPPQSGPTKPPPQTAGATKAPV 417
Cdd:PRK12323   512 AQPDAAPAGWVAESIPDPATADPDDAFETLAPAPAAAPA----PRAAAATEPVVAPRPPRASASGLPDM 576
PDZ3_PDZD2-PDZ1_hPro-IL-16-like cd06759
PDZ domain 3 of PDZ domain containing 2 (PDZD2), PDZ domain 1 of human pro-interleukin-16 ...
4543-4603 1.63e-08

PDZ domain 3 of PDZ domain containing 2 (PDZD2), PDZ domain 1 of human pro-interleukin-16 (isoform 1, 1332 AA), and related domains; PDZ (PSD-95 (Postsynaptic density protein 95), Dlg (Discs large protein), and ZO-1 (Zonula occludens-1)) domain 3 of PDZD2, also known as KIAA0300, PIN-1, activated in prostate cancer (AIPC) and PDZ domain-containing protein 3 (PDZK3). PDZD2 has seven PDZ domains. PDZD2 is expressed at exceptionally high levels in the pancreas and certain cancer tissues, such as prostate cancer. It promotes the proliferation of insulinoma cells and is upregulated during prostate tumorigenesis. In osteosarcoma (OS), the microRNA miR-363 acts as a tumor suppressor by inhibiting PDZD2. This family also includes the first PDZ domain (PDZ1) of human pro-interleukin-16 (isoform 1, also known as nPro-Il-16; 1332 amino-acid protein). Precursor IL-16 is cleaved to produce pro-IL-16 and mature IL-16 (derived from the C-terminal 121 AA). Pro-IL-16 functions as a regulator of T cell growth; mature IL-16 is a CD4 ligand that induces chemotaxis and CD25 expression in CD4+ T cells. IL-16 bioactivity has been closely associated with the progression of several different cancers. PDZ domains usually bind in a sequence-specific manner to short peptide sequences located at the C-terminal end of their partner proteins (known as PDZ binding motifs). The PDZ superfamily includes canonical PDZ domains as well as those with circular permutations and domain swapping mediated by beta-strands. This PDZD2-like family PDZ3 domain is a canonical PDZ domain containing six beta-strands A-F and two alpha-helices (alpha-helix 1 and 2), arranged in the order: beta-strands A, B, C, alpha-helix 1, beta-strands D, E, alpha-helix 2 and beta-strand F.


Pssm-ID: 467240 [Multi-domain]  Cd Length: 87  Bit Score: 54.59  E-value: 1.63e-08
                           10        20        30        40        50        60
                   ....*....|....*....|....*....|....*....|....*....|....*....|.
gi 2022791524 4543 GNGLGIRVVGGKEIPgsSGETGAYIAKILPGGNAEQTGKLIEGMQVLEWNGIPLTGKTYEE 4603
Cdd:cd06759     11 GKGLGFSIVGGRDSP--RGPMGIYVKTIFPGGAAAEDGRLKEGDEILEVNGESLQGLTHQE 69
PHA03379 PHA03379
EBNA-3A; Provisional
185-446 3.11e-08

EBNA-3A; Provisional


Pssm-ID: 223066 [Multi-domain]  Cd Length: 935  Bit Score: 60.46  E-value: 3.11e-08
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2022791524  185 GRKQKVIQKEQGKPEEQRGSAKLPAQQqsPKPVQPQGHvkptpQQSESSKAVpQQQQPGePKQIQKPGPGHPADSkLEQV 264
Cdd:PHA03379   396 KLTERAREALEKASEPTYGTPRPPVEK--PRPEVPQSL-----ETATSHGSA-QVPEPP-PVHDLEPGPLHDQHS-MAPC 465
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2022791524  265 KQPPQPRGP-QKSQPQQSEPTKPVQQQTSAKPSAGPAKPL--PQQPDSARTSSQAPPPAKPSSQQLGPVKQPSQQPARqg 341
Cdd:PHA03379   466 PVAQLPPGPlQDLEPGDQLPGVVQDGRPACAPVPAPAGPIvrPWEASLSQVPGVAFAPVMPQPMPVEPVPVPTVALER-- 543
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2022791524  342 gPVKPSsqqagPPKQPSQQPGPEKPSAQ-QMGPAKQASQPGPGKPPLQ---QTGPVK---QAPPQSGP--TKPPPQTAGA 412
Cdd:PHA03379   544 -PVCPA-----PPLIAMQGPGETSGIVRvRERWRPAPWTPNPPRSPSQmsvRDRLARlraEAQPYQASveVQPPQLTQVS 617
                          250       260       270
                   ....*....|....*....|....*....|....
gi 2022791524  413 TKAPVPQPgltKPPGQQPEPEKPSQQKQASATQP 446
Cdd:PHA03379   618 PQQPMEYP---LEPEQQMFPGSPFSQVADVMRAG 648
PBP1 COG5180
PAB1-binding protein, interacts with poly(A)-binding protein [RNA processing and modification]; ...
176-454 4.49e-08

PAB1-binding protein, interacts with poly(A)-binding protein [RNA processing and modification];


Pssm-ID: 444064 [Multi-domain]  Cd Length: 548  Bit Score: 59.31  E-value: 4.49e-08
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2022791524  176 DSDTAHEEAGRKQKVIQKE-QGKPEEQRGSAKLP-AQQQSPKPVqpqghvkPTPQQSESSKAVPQQQQPGEPKQIQKPGP 253
Cdd:COG5180    191 ALKDSPEKLDRPKVEVKDEaQEEPPDLTGGADHPrPEAASSPKV-------DPPSTSEARSRPATVDAQPEMRPPADAKE 263
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2022791524  254 GHPADSKLEQVKQPP-QPRGPQKSQPQQSEPTKPVQQQTSAKPSAG-PAKPLPQQPDSARTSSQAPPPAKPSSQqlgPVK 331
Cdd:COG5180    264 RRRAAIGDTPAAEPPgLPVLEAGSEPQSDAPEAETARPIDVKGVASaPPATRPVRPPGGARDPGTPRPGQPTER---PAG 340
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2022791524  332 QPsqQPARQGGPvKPSSQQAGPPKQPSQQPGPEKPSAQQMGPAKQASQPGPGKPPLQQTGPVKQAPPQS--GPTKPPPQT 409
Cdd:COG5180    341 VP--EAASDAGQ-PPSAYPPAEEAVPGKPLEQGAPRPGSSGGDGAPFQPPNGAPQPGLGRRGAPGPPMGagDLVQAALDG 417
                          250       260       270       280       290
                   ....*....|....*....|....*....|....*....|....*....|
gi 2022791524  410 AGATKAPVPQPG-----LTKPPGQQPEPEKPSQQKQASATQPAESAPKKT 454
Cdd:COG5180    418 GGRETASLGGAAggagqGPKADFVPGDAESVSGPAGLADQAGAAASTAMA 467
PRK10263 PRK10263
DNA translocase FtsK; Provisional
192-434 4.60e-08

DNA translocase FtsK; Provisional


Pssm-ID: 236669 [Multi-domain]  Cd Length: 1355  Bit Score: 60.10  E-value: 4.60e-08
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2022791524  192 QKEQGKPEEQRGSAKLPAQQQSPKPVQPQGHVKPTPQQSESSKAVPQQQQPGEPKQIQKPGPGH-PADSKLEQVKQPPQP 270
Cdd:PRK10263   399 QPVQPQQPYYAPAAEQPAQQPYYAPAPEQPAQQPYYAPAPEQPVAGNAWQAEEQQSTFAPQSTYqTEQTYQQPAAQEPLY 478
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2022791524  271 RGPQKSQPQQSEPTKPVQQQTsaKPSAGPAKPLPQ-QPDSARtssqapppakpSSQQLGPVKQPSQQPARQGGPVKPS-- 347
Cdd:PRK10263   479 QQPQPVEQQPVVEPEPVVEET--KPARPPLYYFEEvEEKRAR-----------EREQLAAWYQPIPEPVKEPEPIKSSlk 545
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2022791524  348 --SQQAGPPKQPSQQPGPEKPSAQQ--MGPAKQASQPGPGKPPLQQTGPVKQAPPQSGPTKPPPQTAGA-TKAPVPQPGL 422
Cdd:PRK10263   546 apSVAAVPPVEAAAAVSPLASGVKKatLATGAAATVAAPVFSLANSGGPRPQVKEGIGPQLPRPKRIRVpTRRELASYGI 625
                          250
                   ....*....|..
gi 2022791524  423 tKPPGQQPEPEK 434
Cdd:PRK10263   626 -KLPSQRAAEEK 636
Pro-rich pfam15240
Proline-rich protein; This family includes several eukaryotic proline-rich proteins.
258-408 4.89e-08

Proline-rich protein; This family includes several eukaryotic proline-rich proteins.


Pssm-ID: 464580 [Multi-domain]  Cd Length: 167  Bit Score: 55.81  E-value: 4.89e-08
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2022791524  258 DSKLEQVKQPPQPRGPQKSQPQQSE-PTKPVQQQTSAKPSAGPAKPLPQQPDSartsSQAPPPAKPSSQQLGPVKQPSQQ 336
Cdd:pfam15240   20 SEDVSQEDSPSLISEEEGQSQQGGQgPQGPPPGGFPPQPPASDDPPGPPPPGG----PQQPPPQGGKQKPQGPPPQGGPR 95
                           90       100       110       120       130       140       150
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*..
gi 2022791524  337 PA---RQGGPVKPSSQQAGPPKQPSQQ--PGPEKPSAQQMGPAKQASQPGPGKPplqqTGPvKQAPPQSGPTKPPPQ 408
Cdd:pfam15240   96 PPpgkPQGPPPQGGNQQQGPPPPGKPQgpPPQGGGPPPQGGNQQGPPPPPPGNP----QGP-PQRPPQPGNPQGPPQ 167
PRK12323 PRK12323
DNA polymerase III subunit gamma/tau;
239-443 7.00e-08

DNA polymerase III subunit gamma/tau;


Pssm-ID: 237057 [Multi-domain]  Cd Length: 700  Bit Score: 59.12  E-value: 7.00e-08
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2022791524  239 QQQPGEPKQIQKPGPGHPADSKLEQVKQPPQPRGPQKSQPQQSEPTKPVQQQTSAKPSAGPAKPLPQQPDSARTSSQAPP 318
Cdd:PRK12323   371 GAGPATAAAAPVAQPAPAAAAPAAAAPAPAAPPAAPAAAPAAAAAARAVAAAPARRSPAPEALAAARQASARGPGGAPAP 450
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2022791524  319 PAKPSSQQLGPVKQPSQQPARQGGPVKPSSQQAGPPKQPSqqPGPEKPSAQQMGPAKQASqPGPGKPPLQQTGPVKQAPP 398
Cdd:PRK12323   451 APAPAAAPAAAARPAAAGPRPVAAAAAAAPARAAPAAAPA--PADDDPPPWEELPPEFAS-PAPAQPDAAPAGWVAESIP 527
                          170       180       190       200
                   ....*....|....*....|....*....|....*....|....*
gi 2022791524  399 QSGPTKPPPQTAGATKAPVPQPglTKPPGQQPEPEKPSQQKQASA 443
Cdd:PRK12323   528 DPATADPDDAFETLAPAPAAAP--APRAAAATEPVVAPRPPRASA 570
PRK07003 PRK07003
DNA polymerase III subunit gamma/tau;
184-458 7.28e-08

DNA polymerase III subunit gamma/tau;


Pssm-ID: 235906 [Multi-domain]  Cd Length: 830  Bit Score: 59.09  E-value: 7.28e-08
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2022791524  184 AGRKQKVIQKEQGKPEEQRGSAKLPAQQQSPKPVQPQGHVKPTPQQSESSKAVPQQQQPgepkqiQKPGPGHPADSKLEQ 263
Cdd:PRK07003   369 GGGVPARVAGAVPAPGARAAAAVGASAVPAVTAVTGAAGAALAPKAAAAAAATRAEAPP------AAPAPPATADRGDDA 442
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2022791524  264 VKQPP--QPRGPQKSQPQQSEPTKPVQQQTSAKPSAGPAKPLPqqPDSARTSSqapPPAKPSSQQLGPVKQPSQQPARQG 341
Cdd:PRK07003   443 ADGDApvPAKANARASADSRCDERDAQPPADSGSASAPASDAP--PDAAFEPA---PRAAAPSAATPAAVPDARAPAAAS 517
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2022791524  342 GPVKPSSQQAGPPKQPSQQPGPEKPSAQQMGPA-----------KQASQPGPGKPplqqtGPVKQAPPQSGPTKPPPQTA 410
Cdd:PRK07003   518 REDAPAAAAPPAPEARPPTPAAAAPAARAGGAAaaldvlrnagmRVSSDRGARAA-----AAAKPAAAPAAAPKPAAPRV 592
                          250       260       270       280       290
                   ....*....|....*....|....*....|....*....|....*....|.
gi 2022791524  411 GatkAPVPQPGLTKPPGQQ-PEPEKPSQQKQAS--ATQPAESAPKKTFCPL 458
Cdd:PRK07003   593 A---VQVPTPRARAATGDApPNGAARAEQAAESrgAPPPWEDIPPDDYVPL 640
PTZ00121 PTZ00121
MAEBL; Provisional
1023-1629 7.50e-08

MAEBL; Provisional


Pssm-ID: 173412 [Multi-domain]  Cd Length: 2084  Bit Score: 59.38  E-value: 7.50e-08
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2022791524 1023 QKTSTPPTPAAAKPKEEPGVQKEVPKLQQGRleKTLSADKIQQGVQKEDAKLKQGKLfKTPSADKIRVSQK-EDPR---- 1097
Cdd:PTZ00121  1104 KKTETGKAEEARKAEEAKKKAEDARKAEEAR--KAEDARKAEEARKAEDAKRVEIAR-KAEDARKAEEARKaEDAKkaea 1180
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2022791524 1098 -------LQQTKLTKTPSSDKILHGVQKEDA-KFQEAKLAKTSSADKILHGVQKEDIKPQEAKLAKLpsadkilhgiQKE 1169
Cdd:PTZ00121  1181 arkaeevRKAEELRKAEDARKAEAARKAEEErKAEEARKAEDAKKAEAVKKAEEAKKDAEEAKKAEE----------ERN 1250
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2022791524 1170 DLKLQQMKMAKALSADKIQPSAQKEDAQLQEvKLSKAVSADKIQHGIQKEDpnlqhvkIAKASLVEKIQQEAQK-EESKL 1248
Cdd:PTZ00121  1251 NEEIRKFEEARMAHFARRQAAIKAEEARKAD-ELKKAEEKKKADEAKKAEE-------KKKADEAKKKAEEAKKaDEAKK 1322
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2022791524 1249 QQEKLSKTLSADKIPATVSSDQKKTLSKFEEDKKPLL--PEKKAPHSEDKKEQiiAETKDHVAERKGEVEVPCEELHAKK 1326
Cdd:PTZ00121  1323 KAEEAKKKADAAKKKAEEAKKAAEAAKAEAEAAADEAeaAEEKAEAAEKKKEE--AKKKADAAKKKAEEKKKADEAKKKA 1400
                          330       340       350       360       370       380       390       400
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2022791524 1327 QEDAKKEGLTTGIPQMVLKAE----KAQE----EETPVQMSRVPRSDHV-ETVREKTEKEDDKSDTSSSQQQKSPQGLSD 1397
Cdd:PTZ00121  1401 EEDKKKADELKKAAAAKKKADeakkKAEEkkkaDEAKKKAEEAKKADEAkKKAEEAKKAEEAKKKAEEAKKADEAKKKAE 1480
                          410       420       430       440       450       460       470       480
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2022791524 1398 TGYSSDGISSSLGEIPSHipSDEKDLLRESSKK--DTVSQESPPSPSDLAKLESTVLSILEAQASTLSDEKSVKKKELYE 1475
Cdd:PTZ00121  1481 EAKKADEAKKKAEEAKKK--ADEAKKAAEAKKKadEAKKAEEAKKADEAKKAEEAKKADEAKKAEEKKKADELKKAEELK 1558
                          490       500       510       520       530       540       550       560
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2022791524 1476 TYSEQTK-------DQHKTKPLPVTPESYSSDEEDLEGEGTIVEDGKGSASSQAEYKEgdgEDDIPARRQRydsvedSSE 1548
Cdd:PTZ00121  1559 KAEEKKKaeeakkaEEDKNMALRKAEEAKKAEEARIEEVMKLYEEEKKMKAEEAKKAE---EAKIKAEELK------KAE 1629
                          570       580       590       600       610       620       630       640
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2022791524 1549 SENSPVPRRKRRGSVGSSSSDEYKRDDSQGSGDEEDFIRKQIIEMSADEDASGSEDDEFIRNQLKEISVTESQKKEDVKS 1628
Cdd:PTZ00121  1630 EEKKKVEQLKKKEAEEKKKAEELKKAEEENKIKAAEEAKKAEEDKKKAEEAKKAEEDEKKAAEALKKEAEEAKKAEELKK 1709

                   .
gi 2022791524 1629 K 1629
Cdd:PTZ00121  1710 K 1710
PRK10263 PRK10263
DNA translocase FtsK; Provisional
228-455 1.28e-07

DNA translocase FtsK; Provisional


Pssm-ID: 236669 [Multi-domain]  Cd Length: 1355  Bit Score: 58.56  E-value: 1.28e-07
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2022791524  228 QQSESSKAVPQQQQPGEPKQIQKPGPGHPADSklEQVKQPPQPRGPQKSQPQQSEPTKPvqqqTSAKPSA------GPAK 301
Cdd:PRK10263   660 QQDELARQFAQTQQQRYGEQYQHDVPVNAEDA--DAAAEAELARQFAQTQQQRYSGEQP----AGANPFSlddfefSPMK 733
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2022791524  302 PLPQQPDSArtssqapPPAKPSSQqlgPVKQPSQQPARQggPVKPSSQQAGPPKQPSQQPGPEKPSAQQMGPAKQASQPG 381
Cdd:PRK10263   734 ALLDDGPHE-------PLFTPIVE---PVQQPQQPVAPQ--QQYQQPQQPVAPQPQYQQPQQPVAPQPQYQQPQQPVAPQ 801
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2022791524  382 PG-KPPLQQTGPVKQAPPQSGPTKPPPQTA--GATKAPVPQPGLTKP---------PGQQPEPEKPSQQKQASAtqPAES 449
Cdd:PRK10263   802 PQyQQPQQPVAPQPQYQQPQQPVAPQPQYQqpQQPVAPQPQDTLLHPllmrngdsrPLHKPTTPLPSLDLLTPP--PSEV 879

                   ....*.
gi 2022791524  450 APKKTF 455
Cdd:PRK10263   880 EPVDTF 885
PRK07003 PRK07003
DNA polymerase III subunit gamma/tau;
244-426 1.32e-07

DNA polymerase III subunit gamma/tau;


Pssm-ID: 235906 [Multi-domain]  Cd Length: 830  Bit Score: 58.32  E-value: 1.32e-07
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2022791524  244 EPKQIQKPGPGHPADSKLEQVKQPPQPRGPQKSQPQQSEPTKPVQQQTSAKPSAGPAKPLPQQPDSARTSSQAPPPAKPS 323
Cdd:PRK07003   359 EPAVTGGGAPGGGVPARVAGAVPAPGARAAAAVGASAVPAVTAVTGAAGAALAPKAAAAAAATRAEAPPAAPAPPATADR 438
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2022791524  324 SQqlGPVKQPSQQPARQGGPVKPSSQQAGPPKQPSQQPGPEKPSAQQMGPA--KQASQPGPGKPPLQQTGPVKQAPPQSG 401
Cdd:PRK07003   439 GD--DAADGDAPVPAKANARASADSRCDERDAQPPADSGSASAPASDAPPDaaFEPAPRAAAPSAATPAAVPDARAPAAA 516
                          170       180
                   ....*....|....*....|....*..
gi 2022791524  402 PT--KPPPQTAGATKAPVPQPGLTKPP 426
Cdd:PRK07003   517 SRedAPAAAAPPAPEARPPTPAAAAPA 543
PDZ_SYNJ2BP-like cd06709
PDZ domain of synaptojanin-2-binding protein (SYNJ2BP), and related domains; PDZ (PSD-95 ...
4543-4607 1.50e-07

PDZ domain of synaptojanin-2-binding protein (SYNJ2BP), and related domains; PDZ (PSD-95 (Postsynaptic density protein 95), Dlg (Discs large protein), and ZO-1 (Zonula occludens-1)) domain of SYNJ2BP, and related domains. SYNJ2BP (also known as mitochondrial outer membrane protein 25, OMP25) regulates endocytosis of activin type 2 receptor kinases through the Ral/RALBP1-dependent pathway and may be involved in suppression of activin-induced signal transduction. Binding partners of the SYNJ2BP PDZ domain include activin type II receptors (ActR-II), and SYNJ2. SYNJ2BP interacts with the PDZ binding motif of the Notch Delta-like ligand 1 (DLL1) and DLL4, promoting Delta-Notch signaling, and inhibiting sprouting angiogenesis. PDZ domains usually bind in a sequence-specific manner to short peptide sequences located at the C-terminal end of their partner proteins (known as PDZ binding motifs). The PDZ superfamily includes canonical PDZ domains as well as those with circular permutations and domain swapping mediated by beta-strands. This SYNJ2BP-like family domain is a canonical PDZ domain containing six beta-strands A-F and two alpha-helices (alpha-helix 1 and 2), arranged in the order: beta-strands A, B, C, alpha-helix 1, beta-strands D, E, alpha-helix 2 and beta-strand F.


Pssm-ID: 467193 [Multi-domain]  Cd Length: 86  Bit Score: 51.91  E-value: 1.50e-07
                           10        20        30        40        50        60
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*
gi 2022791524 4543 GNGLGIRVVGGKEIPGSSGETGAYIAKILPGGNAEQTGKLIEGMQVLEWNGIPLTGKTYEEVQNI 4607
Cdd:cd06709      9 PSGLGFNIVGGTDQPYIPNDSGIYVAKIKEDGAAAIDGRLQEGDKILEINGQSLENLTHQDAVEL 73
PDZ_MPP-like cd06726
PDZ domain of membrane palmitoylated proteins (MPPs), and related domains; PDZ (PSD-95 ...
4545-4613 1.70e-07

PDZ domain of membrane palmitoylated proteins (MPPs), and related domains; PDZ (PSD-95 (Postsynaptic density protein 95), Dlg (Discs large protein), and ZO-1 (Zonula occludens-1)) domain of MPP1-7 (also known as MAGUK p55 subfamily members 1-7), and related domains. MPPs comprise a subfamily of a larger group of multidomain proteins, namely, membrane-associated guanylate kinases (MAGUKs). MPPs form diverse protein complexes at the cell membranes, which are involved in a wide range of cellular processes, including establishing proper cell structure, polarity and cell adhesion. MPPs have only one PDZ domain. PDZ domains usually bind in a sequence-specific manner to short peptide sequences located at the C-terminal end of their partner proteins (known as PDZ binding motifs). The PDZ superfamily includes canonical PDZ domains as well as those with circular permutations and domain swapping mediated by beta-strands. This MPP1-like family domain is a canonical PDZ domain containing six beta-strands A-F and two alpha-helices (alpha-helix 1 and 2), arranged in the order: beta-strands A, B, C, alpha-helix 1, beta-strands D, E, alpha-helix 2 and beta-strand F.


Pssm-ID: 467208 [Multi-domain]  Cd Length: 80  Bit Score: 51.50  E-value: 1.70e-07
                           10        20        30        40        50        60
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....
gi 2022791524 4545 GLGIRVVGGKEIpgssgetgayIAKILPGGNAEQTGKLIEGMQVLEWNGIPLTGKTYEEVQNIIIQQSG 4613
Cdd:cd06726     14 GATIKMEEDSVI----------VARILHGGMAHRSGLLHVGDEILEINGIPVSGKTVDELQKLLSSLSG 72
PHA03247 PHA03247
large tegument protein UL36; Provisional
580-1054 1.98e-07

large tegument protein UL36; Provisional


Pssm-ID: 223021 [Multi-domain]  Cd Length: 3151  Bit Score: 58.03  E-value: 1.98e-07
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2022791524  580 PGSPPVKAKTSRAEPADTSQQIDPAPKSDRAKPSQAEDKQKQPSIQKPAADTVLTPAAPGLKQDSAGPRPPPTqqkvtdS 659
Cdd:PHA03247  2557 PAAPPAAPDRSVPPPRPAPRPSEPAVTSRARRPDAPPQSARPRAPVDDRGDPRGPAPPSPLPPDTHAPDPPPP------S 2630
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2022791524  660 PKP---ELAKPSQDTHPAGEKPDSKPLP-QVSRQKSDPKLSAQPGARSEAKTQKPVEPAQVKDDPKKLQTKPSPKPETKP 735
Cdd:PHA03247  2631 PSPaanEPDPHPPPTVPPPERPRDDPAPgRVSRPRRARRLGRAAQASSPPQRPRRRAARPTVGSLTSLADPPPPPPTPEP 2710
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2022791524  736 APKGLQPGagpkpvpaqpapqpqQPQKTPEQPRRFSLNLGGITDTPKPQPTTPQETVTGRLFGFGASIFSQASNLISTAG 815
Cdd:PHA03247  2711 APHALVSA---------------TPLPPGPAAARQASPALPAAPAPPAVPAGPATPGGPARPARPPTTAGPPAPAPPAAP 2775
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2022791524  816 QPGSQTSGPPPAGPAAKQPQAP----------AQPPASQAPPKDTAQTQSSPKAVP--VKKEAKPLTSEKMEPTKVDSVL 883
Cdd:PHA03247  2776 AAGPPRRLTRPAVASLSESRESlpspwdpadpPAAVLAPAAALPPAASPAGPLPPPtsAQPTAPPPPPGPPPPSLPLGGS 2855
                          330       340       350       360       370       380       390       400
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2022791524  884 TTKGSDLEKKPSLAKDSKPQAAEAKKPAGVLEPEKASHPKVSCPLcktglnvgskdPPNfntcteckkvvcnlcgfnpmp 963
Cdd:PHA03247  2856 VAPGGDVRRRPPSRSPAAKPAAPARPPVRRLARPAVSRSTESFAL-----------PPD--------------------- 2903
                          410       420       430       440       450       460       470       480
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2022791524  964 hiaevqewlclncqtqramsgqlgdmGKVPLPKPGPSQPvSKPPATPQKQPVPAVSHFPQKTSTPPTPAAAKPKEEPGVQ 1043
Cdd:PHA03247  2904 --------------------------QPERPPQPQAPPP-PQPQPQPPPPPQPQPPPPPPPRPQPPLAPTTDPAGAGEPS 2956
                          490
                   ....*....|.
gi 2022791524 1044 KEVPKLQQGRL 1054
Cdd:PHA03247  2957 GAVPQPWLGAL 2967
dnaA PRK14086
chromosomal replication initiator protein DnaA;
273-430 2.23e-07

chromosomal replication initiator protein DnaA;


Pssm-ID: 237605 [Multi-domain]  Cd Length: 617  Bit Score: 57.14  E-value: 2.23e-07
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2022791524  273 PQKSQPQQSEPTKPVQQQTSAKPSAGPAKPLPQQPDSArTSSQAPPPAKPSSQQLGPVKQPSQQPARQGGPVKPSSQQAG 352
Cdd:PRK14086    68 PIISETLSRELGRPIRIAITVDPSAGEPAPPPPHARRT-SEPELPRPGRRPYEGYGGPRADDRPPGLPRQDQLPTARPAY 146
                           90       100       110       120       130       140       150
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*...
gi 2022791524  353 PPKQPSQQPGPEKPSAQQMGPAKQASQPGPGKPPlqqTGPVKQAPPQSGPTKPPPQTAGATKAPVPQPGLTKPPGQQP 430
Cdd:PRK14086   147 PAYQQRPEPGAWPRAADDYGWQQQRLGFPPRAPY---ASPASYAPEQERDREPYDAGRPEYDQRRRDYDHPRPDWDRP 221
PHA03247 PHA03247
large tegument protein UL36; Provisional
75-442 2.61e-07

large tegument protein UL36; Provisional


Pssm-ID: 223021 [Multi-domain]  Cd Length: 3151  Bit Score: 57.64  E-value: 2.61e-07
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2022791524   75 AEPPPMQRHPEldTSHHPRQPSKPPDPGPglSKSRTVDVLKTEQRAPGRSPSSISLRESKSRTDFRedQKSSMMPSFLSE 154
Cdd:PHA03247  2699 ADPPPPPPTPE--PAPHALVSATPLPPGP--AAARQASPALPAAPAPPAVPAGPATPGGPARPARP--PTTAGPPAPAPP 2772
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2022791524  155 ANPLS---------AVTSVVNKFNPFDLISD-SDTAHEEAGRKQKVIQKEQGKPEEQRGSAKLPAQQQSPKP-------- 216
Cdd:PHA03247  2773 AAPAAgpprrltrpAVASLSESRESLPSPWDpADPPAAVLAPAAALPPAASPAGPLPPPTSAQPTAPPPPPGppppslpl 2852
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2022791524  217 ---VQPQGHV--KPTPQQSESSKAVPQQQQPgepKQIQKPGPGHPADSKLEQVKQPPQPRGPQKSQPQQSEPTKPVQQQt 291
Cdd:PHA03247  2853 ggsVAPGGDVrrRPPSRSPAAKPAAPARPPV---RRLARPAVSRSTESFALPPDQPERPPQPQAPPPPQPQPQPPPPPQ- 2928
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2022791524  292 sakpsagPAKPLPQQPdsartssQAPPPAKPSSQQLGPVKQPSQQPARQGGPVKPSS-----QQAGPPKQPSQQPGPEKP 366
Cdd:PHA03247  2929 -------PQPPPPPPP-------RPQPPLAPTTDPAGAGEPSGAVPQPWLGALVPGRvavprFRVPQPAPSREAPASSTP 2994
                          330       340       350       360       370       380       390       400
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2022791524  367 SAQQMGPAKQAS---------QPGPGKPPLQQTGPVKQAPPQSGPTKPPPQTAGA--TKAPVPQPG-LTKPPGQQPEPEK 434
Cdd:PHA03247  2995 PLTGHSLSRVSSwasslalheETDPPPVSLKQTLWPPDDTEDSDADSLFDSDSERsdLEALDPLPPePHDPFAHEPDPAT 3074

                   ....*...
gi 2022791524  435 PSQQKQAS 442
Cdd:PHA03247  3075 PEAGARES 3082
PspC_subgroup_2 NF033839
pneumococcal surface protein PspC, LPXTG-anchored form; The pneumococcal surface protein PspC, ...
183-413 2.73e-07

pneumococcal surface protein PspC, LPXTG-anchored form; The pneumococcal surface protein PspC, as described in Streptococcus pneumoniae, is a repetitive and highly variable protein, recognized by a conserved N-terminal domain and also by genomic location. This form, subgroup 2, is anchored covalently after cleavage by sortase at a C-terminal LPXTG site. The other form, subgroup 1, has variable numbers of a choline-binding repeat in the C-terminal region, and is also known as choline-binding protein A.


Pssm-ID: 468202 [Multi-domain]  Cd Length: 557  Bit Score: 56.70  E-value: 2.73e-07
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2022791524  183 EAGRKQKVIQKEQGKPEeQRGSAKLPAQQQSPKPVQPQGHVKPTPQQSESS-KAVPQQQQPGEPKQIQKPGPGHPADSKL 261
Cdd:NF033839   304 QPEKKEVKPEPETPKPE-VKPQLEKPKPEVKPQPEKPKPEVKPQLETPKPEvKPQPEKPKPEVKPQPEKPKPEVKPQPET 382
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2022791524  262 EQVKQPPQPRGPQKSQPQQSEPTKPVQQQTSAKPSAgPAKPLPQQPDSARTSSQAPPPAKPSSQQLGPVKQPSQQPARQG 341
Cdd:NF033839   383 PKPEVKPQPEKPKPEVKPQPEKPKPEVKPQPEKPKP-EVKPQPEKPKPEVKPQPEKPKPEVKPQPEKPKPEVKPQPETPK 461
                          170       180       190       200       210       220       230
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|..
gi 2022791524  342 GPVKPssqQAGPPKqPSQQPGPEKPSAQQMGPAKQASQPGPGKPPLQQTGPVKQAPPQSGPTKPPPQTAGAT 413
Cdd:NF033839   462 PEVKP---QPEKPK-PEVKPQPEKPKPDNSKPQADDKKPSTPNNLSKDKQPSNQASTNEKATNKPKKSLPST 529
PRK11901 PRK11901
hypothetical protein; Reviewed
283-454 2.95e-07

hypothetical protein; Reviewed


Pssm-ID: 237015 [Multi-domain]  Cd Length: 327  Bit Score: 55.84  E-value: 2.95e-07
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2022791524  283 PTKPVQQQTSakPSAGPAKPLPQQPDSARTSSQAPPPAKPSSQQLGPVKQPSQQPAR----QGGPVKPSSQQAGPPKQPS 358
Cdd:PRK11901    61 PTEHESQQSS--NNAGAEKNIDLSGSSSLSSGNQSSPSAANNTSDGHDASGVKNTAPpqdiSAPPISPTPTQAAPPQTPN 138
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2022791524  359 QQPGPEKPS------AQQMGPAKQASQPGPGKPplqQTGPVKQAPPQSGPTKPPPQTAGATKAPVPQPGLTKPPGQQPeP 432
Cdd:PRK11901   139 GQQRIELPGnisdalSQQQGQVNAASQNAQGNT---STLPTAPATVAPSKGAKVPATAETHPTPPQKPATKKPAVNHH-K 214
                          170       180
                   ....*....|....*....|..
gi 2022791524  433 EKPSQQKQASATQPAESAPKKT 454
Cdd:PRK11901   215 TATVAVPPATSGKPKSGAASAR 236
Pro-rich pfam15240
Proline-rich protein; This family includes several eukaryotic proline-rich proteins.
306-437 3.20e-07

Proline-rich protein; This family includes several eukaryotic proline-rich proteins.


Pssm-ID: 464580 [Multi-domain]  Cd Length: 167  Bit Score: 53.12  E-value: 3.20e-07
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2022791524  306 QPDSARTSSQAPPPAKPSSQQLGPVKQPSQQPARQGGPVKPssqQAGPPKQPSQQPGPEKPsaqQMGPAKQASQPGPGKP 385
Cdd:pfam15240   28 SPSLISEEEGQSQQGGQGPQGPPPGGFPPQPPASDDPPGPP---PPGGPQQPPPQGGKQKP---QGPPPQGGPRPPPGKP 101
                           90       100       110       120       130
                   ....*....|....*....|....*....|....*....|....*....|....*..
gi 2022791524  386 --PLQQTGPVKQAPPQSG-PTKPPPQTAGATKAPVPQPGLTKPPGQQPE--PEKPSQ 437
Cdd:pfam15240  102 qgPPPQGGNQQQGPPPPGkPQGPPPQGGGPPPQGGNQQGPPPPPPGNPQgpPQRPPQ 158
PDZ2_MUPP1-like cd06667
PDZ domain 2 of multi-PDZ-domain protein 1 (MUPP1) and PATJ (protein-associated tight junction) ...
4543-4613 3.24e-07

PDZ domain 2 of multi-PDZ-domain protein 1 (MUPP1) and PATJ (protein-associated tight junction) and similar domains; PDZ (PSD-95 (Postsynaptic density protein 95), Dlg (Discs large protein), and ZO-1 (Zonula occludens-1)) domain 2 of MUPP1 and PATJ, and related domains. MUPP1 and PATJ serve as scaffolding proteins linking different proteins and protein complexes involved in the organization of tight junctions and epithelial polarity. MUPP1 contains an L27 (Lin-2 and Lin-7 binding) domain and 13 PDZ domains. PATJ (also known as INAD-like) contains an L27 domain and ten PDZ domains. MUPP1 and PATJ share several binding partners, including junctional adhesion molecules (JAM), zonula occludens (ZO)-3, Pals1 (protein associated with Lin-7), Par (partitioning defective)-6 proteins, and nectins (adherence junction adhesion molecules). PATJ lacks 3 PDZ domains seen in MUPP1: PDZ6, 9, and 13; consequently, MUPP1 PDZ7 and 8 align with PATJ PDZ6 and 7; and MUPP1 PDZ domains 10-12 align with PATJ PDZ domains 8-10. PDZ domains usually bind in a sequence-specific manner to short peptide sequences located at the C-terminal end of their partner proteins (known as PDZ binding motifs). The PDZ superfamily includes canonical PDZ domains as well as those with circular permutations and domain swapping mediated by beta-strands. This MUPP1-like family PDZ2 domain is a canonical PDZ domain containing six beta-strands A-F and two alpha-helices (alpha-helix 1 and 2), arranged in the order: beta-strands A, B, C, alpha-helix 1, beta-strands D, E, alpha-helix 2 and beta-strand F


Pssm-ID: 467155 [Multi-domain]  Cd Length: 80  Bit Score: 50.74  E-value: 3.24e-07
                           10        20        30        40        50        60        70
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|.
gi 2022791524 4543 GNGLGIRVVGGKEipgssgeTGAYIAKILPGGNAEQTGKLIEGMQVLEWNGIPLTGKTYEEVQNiIIQQSG 4613
Cdd:cd06667      9 GSGLGFGIVGGKS-------TGVVVKTILPGGVADRDGRLRSGDHILQIGDTNLRGMGSEQVAQ-VLRQCG 71
PDZ3_MAGI-1_3-like cd06733
PDZ domain 3 of membrane-associated guanylate kinase inverted 1 (MAGI-1), MAGI-2, and MAGI-3, ...
4544-4604 3.57e-07

PDZ domain 3 of membrane-associated guanylate kinase inverted 1 (MAGI-1), MAGI-2, and MAGI-3, and related domains; PDZ (PSD-95 (Postsynaptic density protein 95), Dlg (Discs large protein), and ZO-1 (Zonula occludens-1)) domain 3 of MAGI1, 2, 3 (MAGI is also known as Membrane-associated guanylate kinase, WW and PDZ domain-containing protein) and related domains. MAGI proteins have been implicated in the control of cell migration and invasion through altering the activity of phosphatase and tensin homolog (PTEN) and modulating Akt signaling. Four MAGI proteins have been identified (MAGI1-3 and MAGIX). MAGI1-3 have 6 PDZ domains and bind to the C-terminus of PTEN via their PDZ2 domain. MAGIX has a single PDZ domain that is related to MAGI1-3 PDZ domain 5. Other binding partners for MAGI1 include JAM4, C-terminal tail of high risk HPV-18 E6, megalin, TRAF6, Kir4.1 (basolateral K+ channel subunit), and cadherin 23; for MAGI2, include DASM1, dendrin, axin, beta- and delta-catenin, neuroligin, hyperpolarization-activated cation channels, beta1-adrenergic receptors, NMDA receptor, and TARPs; and for MAGI3 includes LPA2. PDZ domains usually bind in a sequence-specific manner to short peptide sequences located at the C-terminal end of their partner proteins (known as PDZ binding motifs). The PDZ superfamily includes canonical PDZ domains as well as those with circular permutations and domain swapping mediated by beta-strands. This MAGI family PDZ3 domain is a canonical PDZ domain containing six beta-strands A-F and two alpha-helices (alpha-helix 1 and 2); arranged as beta-strands A, -B, C, alpha-helix 1, beta-strands D, E, alpha-helix 2 and beta-strand F.


Pssm-ID: 467215 [Multi-domain]  Cd Length: 85  Bit Score: 50.69  E-value: 3.57e-07
                           10        20        30        40        50        60
                   ....*....|....*....|....*....|....*....|....*....|....*....|.
gi 2022791524 4544 NGLGIRVVGGKEiPGSSgetgAYIAKILPGGNAEQTGKLIEGMQVLEWNGIPLTGKTYEEV 4604
Cdd:cd06733     11 TGFGFRILGGTE-EGSQ----VSIGAIVPGGAADLDGRLRTGDELLSVDGVNVVGASHHKV 66
PDZ1_FRMPD2-like cd23071
PDZ domain 1 of FERM and PDZ domain-containing protein 2 (FRMPD2), and related domains; PDZ ...
4529-4614 4.88e-07

PDZ domain 1 of FERM and PDZ domain-containing protein 2 (FRMPD2), and related domains; PDZ (PSD-95 (Postsynaptic density protein 95), Dlg (Discs large protein), and ZO-1 (Zonula occludens-1)) domain 1 of FRMPD2 (also known as PDZ domain-containing protein 4, and related domains. FRMPD2 is localized in the basolateral membranes of polarized epithelial cells and is associated with tight junction formation and immune response; it contains 3 PDZ domains. PDZ domains usually bind in a sequence-specific manner to short peptide sequences located at the C-terminal end of their partner proteins (known as PDZ binding motifs). The PDZ superfamily includes canonical PDZ domains as well as those with circular permutations and domain swapping mediated by beta-strands. This PTPN13 family PDZ1 domain is a canonical PDZ domain containing six beta-strands A-F and two alpha-helices (alpha-helix 1 and 2), arranged in the order: beta-strands A, B, C, alpha-helix 1, beta-strands D, E, alpha-helix 2 and beta-strand F.


Pssm-ID: 467284 [Multi-domain]  Cd Length: 92  Bit Score: 50.57  E-value: 4.88e-07
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2022791524 4529 RLKLLRDPKdhtvsgNGLGIRVVGGKEIpgSSGETGAYIAKILPGGNAEQTGKLIEGMQVLEWNGIPLTGKTYEEVQNiI 4608
Cdd:cd23071      4 CVTLKRDPK------RGFGFVIVGGENT--GKLDLGIFIASIIPGGPAEKDGRIKPGGRLISLNNISLEGVTFNTAVK-I 74

                   ....*.
gi 2022791524 4609 IQQSGE 4614
Cdd:cd23071     75 LQNSPD 80
PDZ4_Scribble-like cd06701
PDZ domain 4 of Drosophila Scribble, human Scribble homolog, and related domains; PDZ (PSD-95 ...
4543-4607 4.89e-07

PDZ domain 4 of Drosophila Scribble, human Scribble homolog, and related domains; PDZ (PSD-95 (Postsynaptic density protein 95), Dlg (Discs large protein), and ZO-1 (Zonula occludens-1)) domain 4 of Drosophila Scribble (also known as LAP4), human Scribble homolog (also known as hScrib, LAP4, CriB1, ScrB1 and Vartul), and related domains. They belong to the LAP family, which describes proteins that contain either one or four PDZ domains and 16 LRRs (leucine-rich repeats) and function in controlling cell shape, size and subcellular protein localization. In Drosophila, the Scribble complex, comprising Scribble, discs large, and lethal giant larvae, plays a role in apico-basal cell polarity, in other forms of polarity, including regulation of the actin cytoskeleton, cell signaling and vesicular trafficking, and in tumor development. Mammalian Scribble is important in many aspects of cancer development. Scribble and its homologs can be downregulated or overexpressed in cancer; they have a role in cancer beyond their function in loss of cell polarity. PDZ domains usually bind in a sequence-specific manner to short peptide sequences located at the C-terminal end of their partner proteins (known as PDZ binding motifs). The PDZ superfamily includes canonical PDZ domains as well as those with circular permutations and domain swapping mediated by beta-strands. This Scribble-like family PDZ4 domain is a canonical PDZ domain containing six beta-strands A-F and two alpha-helices (alpha-helix 1 and 2), arranged in the order: beta-strands A, B, C, alpha-helix 1, beta-strands D, E, alpha-helix 2 and beta-strand F.


Pssm-ID: 467185 [Multi-domain]  Cd Length: 98  Bit Score: 50.69  E-value: 4.89e-07
                           10        20        30        40        50        60        70
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|.
gi 2022791524 4543 GNGLGIRVVGGkeIPGSSGET------GAYIAKILPGGNAEQTGKLIEGMQVLEWNGIPLTGKTYEEVQNI 4607
Cdd:cd06701     14 GEKLGISIRGG--AKGHAGNPldptdeGIFISKINPDGAAARDGRLKVGQRILEVNGQSLLGATHQEAVRI 82
PDZ3_Dlg1-2-4-like cd06795
PDZ domain 3 of human discs large homolog 1 (Dlg1), Dlg2, and Dlg4, Drosophila disc large (Dlg) ...
4543-4603 5.34e-07

PDZ domain 3 of human discs large homolog 1 (Dlg1), Dlg2, and Dlg4, Drosophila disc large (Dlg), and related domains; PDZ (PSD-95 (Postsynaptic density protein 95), Dlg (Discs large protein), and ZO-1 (Zonula occludens-1)) domain 3 of Drosophila Dlg1, human Dlg1, 2, and 4 and related domains. Dlg1 (also known as synapse-associated protein Dlg197; SAP-97), Dlg2 (also known as channel-associated protein of synapse-110; postsynaptic density protein 93, PSD-93), Dlg4 (also known as postsynaptic density protein 95, PSD-95; synapse-associated protein 90, SAP-90) each have 3 PDZ domains and belong to the membrane-associated guanylate kinase family. Dlg1 regulates antigen receptor signaling and cell polarity in lymphocytes, B-cell proliferation and antibody production, and TGFalpha bioavailability; its PDZ3 domain binds pro-TGFalpha, and its PDZ2 domain binds the TACE metalloprotease responsible for cleaving pro-TGFalpha to a soluble form. Dlg2 is involved in N-methyl-D-aspartate (NMDA) receptor signaling, regulating surface expression of NMDA receptors in dorsal horn neurons of the spinal cord; it interacts with NMDA receptor subunits and with Shaker-type K+ channel subunits to cluster into a channel complex. The Dlg4 PDZ1 domain binds NMDA receptors, and its PDZ2 domain binds neuronal nitric oxide synthase (nNOS), forming a complex in neurons. The Drosophila Scribble complex (Scribble, Dlg, and lethal giant larvae) plays a role in apico-basal cell polarity, and in other forms of polarity, including regulation of the actin cytoskeleton, cell signaling and vesicular trafficking, and in tumor development; postsynaptic targeting of Drosophila DLG requires interactions mediated by the first two PDZ domains. PDZ domains usually bind in a sequence-specific manner to short peptide sequences located at the C-terminal end of their partner proteins (known as PDZ binding motifs). The PDZ superfamily includes canonical PDZ domains as well as those with circular permutations and domain swapping mediated by beta-strands. This Dlg-like family PDZ3 domain is a canonical PDZ domain containing six beta-strands A-F and two alpha-helices (alpha-helix 1 and 2), arranged in the order: beta-strands A, B, C, alpha-helix 1, beta-strands D, E, alpha-helix 2 and beta-strand F.


Pssm-ID: 467257 [Multi-domain]  Cd Length: 91  Bit Score: 50.43  E-value: 5.34e-07
                           10        20        30        40        50        60
                   ....*....|....*....|....*....|....*....|....*....|....*....|.
gi 2022791524 4543 GNGLGIRVVGGKEipgssGEtGAYIAKILPGGNAEQTGKLIEGMQVLEWNGIPLTGKTYEE 4603
Cdd:cd06795     11 STGLGFNIVGGED-----GE-GIFISFILAGGPADLSGELRRGDQILSVNGVDLRNATHEQ 65
PBP1 COG5180
PAB1-binding protein, interacts with poly(A)-binding protein [RNA processing and modification]; ...
256-445 5.45e-07

PAB1-binding protein, interacts with poly(A)-binding protein [RNA processing and modification];


Pssm-ID: 444064 [Multi-domain]  Cd Length: 548  Bit Score: 55.84  E-value: 5.45e-07
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2022791524  256 PADSKLEQVKQPPQPRGPQKsQPQQSEPTKPVQQQTSAKPSAGPAKPLPQQPDSARTSSQAPPPAKPSSQQLGPVKQPSQ 335
Cdd:COG5180    188 PRDALKDSPEKLDRPKVEVK-DEAQEEPPDLTGGADHPRPEAASSPKVDPPSTSEARSRPATVDAQPEMRPPADAKERRR 266
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2022791524  336 QPARQGGPVKPSSQQAGPPKQPSQQPGPEKPSAQQM---GPAKQASQPGPGKPPLQQTGPVKQAPPQSGPTkppPQTAGA 412
Cdd:COG5180    267 AAIGDTPAAEPPGLPVLEAGSEPQSDAPEAETARPIdvkGVASAPPATRPVRPPGGARDPGTPRPGQPTER---PAGVPE 343
                          170       180       190
                   ....*....|....*....|....*....|...
gi 2022791524  413 TKAPVPQPGLTKPPGQQPEPEKPSQQKQASATQ 445
Cdd:COG5180    344 AASDAGQPPSAYPPAEEAVPGKPLEQGAPRPGS 376
PDZ1_PTPN13_FRMPD2-like cd06694
PDZ domain 1 of protein tyrosine phosphatase non-receptor type 13 (PTPN13),FERM and PDZ ...
4529-4614 9.12e-07

PDZ domain 1 of protein tyrosine phosphatase non-receptor type 13 (PTPN13),FERM and PDZ domain-containing protein 2 (FRMPD2), and related domains; PDZ (PSD-95 (Postsynaptic density protein 95), Dlg (Discs large protein), and ZO-1 (Zonula occludens-1)) domain 1 of PTPN13 [also known as Fas-associated protein-tyrosine phosphatase 1 (FAP-1), protein-tyrosine phosphatase 1E (PTP-E1), and protein-tyrosine phosphatase (PTPL1)], FRMPD2 (also known as PDZ domain-containing protein 4; PDZ domain-containing protein 5C), and related domains. PTPN13 regulates negative apoptotic signaling and mediates phosphoinositide 3-kinase (PI3K) signaling. PTPN13 has five PDZ domains. Proteins known to interact with PTPN13 PDZ domains include: PLEKHA1 and PLEKHA2 via PTPN13-PDZ domain 1, Fas receptor and thyroid receptor-interacting protein 6 via PTPN13-PDZ domain 2, nerve growth factor receptor and protein kinase N2 via PTPN13-PDZ domain 3, PDZ and LIM domain 4 (PDLIM4) via PTPN13-PDZ domains 2 and 4, and brain calpain-2 via PTPN13-PDZ domains 3, 4 and 5. Calpain-2-mediated PTPN13 fragments may be involved in abnormal tau aggregation and increased risk for Alzheimer's disease. FRMPD2 is localized in the basolateral membranes of polarized epithelial cells and is associated with tight junction formation and immune response; it contains 3 PDZ domains. PDZ domains usually bind in a sequence-specific manner to short peptide sequences located at the C-terminal end of their partner proteins (known as PDZ binding motifs). The PDZ superfamily includes canonical PDZ domains as well as those with circular permutations and domain swapping mediated by beta-strands. This PTPN13 family PDZ1 domain is a canonical PDZ domain containing six beta-strands A-F and two alpha-helices (alpha-helix 1 and 2), arranged in the order: beta-strands A, B, C, alpha-helix 1, beta-strands D, E, alpha-helix 2 and beta-strand F.


Pssm-ID: 467180 [Multi-domain]  Cd Length: 92  Bit Score: 49.70  E-value: 9.12e-07
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2022791524 4529 RLKLLRDPKdhtvsgNGLGIRVVGGkEIPGSSGEtGAYIAKILPGGNAEQTGKLIEGMQVLEWNGIPLTGKTYEEVQNiI 4608
Cdd:cd06694      4 IVTLKKDPQ------KGLGFTIVGG-ENSGSLDL-GIFVKSIIPGGPADKDGRIKPGDRIIAINGQSLEGKTHHAAVE-I 74

                   ....*.
gi 2022791524 4609 IQQSGE 4614
Cdd:cd06694     75 IQNAPD 80
PDZ12_MUPP1-like cd06675
PDZ domain 12 of multi-PDZ-domain protein 1 (MUPP1), PDZ domain 10 of protein-associated tight ...
4532-4613 9.85e-07

PDZ domain 12 of multi-PDZ-domain protein 1 (MUPP1), PDZ domain 10 of protein-associated tight junction (PATJ, and related domains; PDZ (PSD-95 (Postsynaptic density protein 95), Dlg (Discs large protein), and ZO-1 (Zonula occludens-1)) domain 12 of MUPP1, PDZ domain 10 of PATJ, and related domains. MUPP1 and PATJ serve as scaffolding proteins linking different proteins and protein complexes involved in the organization of tight junctions and epithelial polarity. MUPP1 contains an L27 (Lin-2 and Lin-7 binding) domain and 13 PDZ domains. PATJ (also known as INAD-like) contains an L27 domain and ten PDZ domains. MUPP1 and PATJ share several binding partners, including junctional adhesion molecules (JAM), zonula occludens (ZO)-3, Pals1 (protein associated with Lin-7), Par (partitioning defective)-6 proteins, and nectins (adherence junction adhesion molecules). PATJ lacks 3 PDZ domains seen in MUPP1: PDZ6, 9, and 13; consequently, MUPP1 PDZ7 and 8 align with PATJ PDZ6 and 7; and MUPP1 PDZ domains 10-12 align with PATJ PDZ domains 8-10. PDZ domains usually bind in a sequence-specific manner to short peptide sequences located at the C-terminal end of their partner proteins (known as PDZ binding motifs). The PDZ superfamily includes canonical PDZ domains as well as those with circular permutations and domain swapping mediated by beta-strands. This MUPP1-like PDZ12 family domain is a canonical PDZ domain containing six beta-strands A-F and two alpha-helices (alpha-helix 1 and 2), arranged in the order: beta-strands A, B, C, alpha-helix 1, beta-strands D, E, alpha-helix 2 and beta-strand F


Pssm-ID: 467163 [Multi-domain]  Cd Length: 86  Bit Score: 49.67  E-value: 9.85e-07
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2022791524 4532 LLRDPKDhtvsgnGLGIRVVGGKeipGSS-GETGAYIAKILPGGNAEQTGKLIEGMQVLEWNGIPLTGKTYEEVQNIIIQ 4610
Cdd:cd06675      5 IKRGPQD------SLGISIAGGV---GSPlGDVPVFIAMIQPNGVAAQTGKLKVGDRIVSINGQSTDGLTHSEAVNLLKN 75

                   ...
gi 2022791524 4611 QSG 4613
Cdd:cd06675     76 ASG 78
dnaA PRK14086
chromosomal replication initiator protein DnaA;
216-421 1.12e-06

chromosomal replication initiator protein DnaA;


Pssm-ID: 237605 [Multi-domain]  Cd Length: 617  Bit Score: 54.83  E-value: 1.12e-06
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2022791524  216 PVQPQGHVKPTPQQSESSKAVPQQQQPGEPKQiqkPGPGHPADSKleqvKQPPQPRGPQKSQPQqsePTKPVQQQtSAKP 295
Cdd:PRK14086    87 TVDPSAGEPAPPPPHARRTSEPELPRPGRRPY---EGYGGPRADD----RPPGLPRQDQLPTAR---PAYPAYQQ-RPEP 155
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2022791524  296 SAGPAKPLPQQPDSARTssqAPPPAKP--SSQQLGPVKQPSQQPARQGGPvkpssqQAGPPKQPSQQPGPEKPSAQQMGP 373
Cdd:PRK14086   156 GAWPRAADDYGWQQQRL---GFPPRAPyaSPASYAPEQERDREPYDAGRP------EYDQRRRDYDHPRPDWDRPRRDRT 226
                          170       180       190       200
                   ....*....|....*....|....*....|....*....|....*....
gi 2022791524  374 AKQASQPGPGKPPLQ-QTGPVKQAPPQSGPTKPPPQTAGATKAPVPQPG 421
Cdd:PRK14086   227 DRPEPPPGAGHVHRGgPGPPERDDAPVVPIRPSAPGPLAAQPAPAPGPG 275
PRK14971 PRK14971
DNA polymerase III subunit gamma/tau;
303-446 1.19e-06

DNA polymerase III subunit gamma/tau;


Pssm-ID: 237874 [Multi-domain]  Cd Length: 614  Bit Score: 54.78  E-value: 1.19e-06
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2022791524  303 LPQQPDSARTSSQAPPPAKPSSQQLGPVKQPSQQPARQggpvkPSSQQAGPPKQPSQqpGPEKPSAQQMGPAKQASQPGP 382
Cdd:PRK14971   362 LTQKGDDASGGRGPKQHIKPVFTQPAAAPQPSAAAAAS-----PSPSQSSAAAQPSA--PQSATQPAGTPPTVSVDPPAA 434
                           90       100       110       120       130       140
                   ....*....|....*....|....*....|....*....|....*....|....*....|....
gi 2022791524  383 GKPPLQQTGPVKQAPPQSGPTKPPPqtagATKAPVPQPGLTKPpgQQPEPEKPSQQKQASATQP 446
Cdd:PRK14971   435 VPVNPPSTAPQAVRPAQFKEEKKIP----VSKVSSLGPSTLRP--IQEKAEQATGNIKEAPTGT 492
PRK07764 PRK07764
DNA polymerase III subunits gamma and tau; Validated
39-383 1.20e-06

DNA polymerase III subunits gamma and tau; Validated


Pssm-ID: 236090 [Multi-domain]  Cd Length: 824  Bit Score: 54.99  E-value: 1.20e-06
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2022791524   39 PAGVEADLSQLSEEERRQIAAVMSRAQGLPRGNLAGAEPPPMQRHPELDTSHHPRQPSKPPDPGPGLSKSRTVDVLKTE- 117
Cdd:PRK07764   437 PAPAPPSPAGNAPAGGAPSPPPAAAPSAQPAPAPAAAPEPTAAPAPAPPAAPAPAAAPAAPAAPAAPAGADDAATLRERw 516
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2022791524  118 ----QRAPGRSPSSISLRESKSR-TDFREDQ------KSSMMPSFLSEANPLSAVTSVVNKFN----PFDLISDSDTAHE 182
Cdd:PRK07764   517 peilAAVPKRSRKTWAILLPEATvLGVRGDTlvlgfsTGGLARRFASPGNAEVLVTALAEELGgdwqVEAVVGPAPGAAG 596
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2022791524  183 EAGRK---QKVIQKEQGKPEEQRGSAKLPAQ---QQSPKPVQPQGHVKPTPQQSESSKAVPQQQQPGEPKQIQKPGPGHP 256
Cdd:PRK07764   597 GEGPPapaSSGPPEEAARPAAPAAPAAPAAPapaGAAAAPAEASAAPAPGVAAPEHHPKHVAVPDASDGGDGWPAKAGGA 676
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2022791524  257 ADSKLEQVKQPPQPRGPQ---KSQPQQSEPTKPVQQQTSAKPSAGPAKPLPQQPDSARTSSQAPPPAKPSSQQLGPVKQP 333
Cdd:PRK07764   677 APAAPPPAPAPAAPAAPAgaaPAQPAPAPAATPPAGQADDPAAQPPQAAQGASAPSPAADDPVPLPPEPDDPPDPAGAPA 756
                          330       340       350       360       370
                   ....*....|....*....|....*....|....*....|....*....|
gi 2022791524  334 SQQPArqggpvkpssQQAGPPKQPSQQPGPEKPSAQQMGPAKQASQPGPG 383
Cdd:PRK07764   757 QPPPP----------PAPAPAAAPAAAPPPSPPSEEEEMAEDDAPSMDDE 796
MISS pfam15822
MAPK-interacting and spindle-stabilising protein-like; MISS is a family of eukaryotic ...
227-433 1.31e-06

MAPK-interacting and spindle-stabilising protein-like; MISS is a family of eukaryotic MAPK-interacting and spindle-stabilising protein-like proteins. MISS is rich in prolines and has four potential MAPK-phosphorylation sites, a MAPK-docking site, a PEST sequence (PEST motif) and a bipartite nuclear localization signal. The endogenous protein accumulates during mouse meiotic maturation and is found as discrete dots on the MII spindle. MISS is the first example of a physiological MAPK-substrate that is stabilized in MII that specifically regulates MII spindle integrity during the CSF arrest.


Pssm-ID: 318115 [Multi-domain]  Cd Length: 238  Bit Score: 53.07  E-value: 1.31e-06
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2022791524  227 PQQSESSKAVPQQQQPGEPKQiqkpgpGHPADSKLEQVKQPPQ-PRGPQKSQPQQSEPTKPVQqqTSAKPSAGPAKPLPQ 305
Cdd:pfam15822    8 PEQSPAKTSAVSNPKPGQPPQ------GWPGSNPWNNPSAPPAvPSGLPPSTAPSTVPFGPAP--TGMYPSIPLTGPSPG 79
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2022791524  306 qpdsartssqAPPPAKPSSQQLGPVKQPSQQPARQG-GPVKPSSQQAGP-PKQPSQQPGPEKPSAQQMGPAKQASQPGPG 383
Cdd:pfam15822   80 ----------PPAPFPPSGPSCPPPGGPYPAPTVPGpGPIGPYPTPNMPfPELPRPYGAPTDPAAAAPSGPWGSMSSGPW 149
                          170       180       190       200       210
                   ....*....|....*....|....*....|....*....|....*....|....*..
gi 2022791524  384 KPPLQQTGPVKQAP---PQSGPTKPPPQTAGATkAPVP----QPGLTKPPGQQPEPE 433
Cdd:pfam15822  150 APGMGGQYPAPNMPypsPGPYPAVPPPQSPGAA-PPVPwgtvPPGPWGPPAPYPDPT 205
PspC_subgroup_2 NF033839
pneumococcal surface protein PspC, LPXTG-anchored form; The pneumococcal surface protein PspC, ...
559-742 1.35e-06

pneumococcal surface protein PspC, LPXTG-anchored form; The pneumococcal surface protein PspC, as described in Streptococcus pneumoniae, is a repetitive and highly variable protein, recognized by a conserved N-terminal domain and also by genomic location. This form, subgroup 2, is anchored covalently after cleavage by sortase at a C-terminal LPXTG site. The other form, subgroup 1, has variable numbers of a choline-binding repeat in the C-terminal region, and is also known as choline-binding protein A.


Pssm-ID: 468202 [Multi-domain]  Cd Length: 557  Bit Score: 54.77  E-value: 1.35e-06
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2022791524  559 PDPAQLAEPKKPPPQKKQPSMPGSPPVKAKTSRAEPADTSQQIDPAPKSDRAKPSQAEDKQKQPSIQKPAADTVLTPAAP 638
Cdd:NF033839   295 PKPGMQPSPQPEKKEVKPEPETPKPEVKPQLEKPKPEVKPQPEKPKPEVKPQLETPKPEVKPQPEKPKPEVKPQPEKPKP 374
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2022791524  639 GLKQDSAGPRPP----PTQQKVTDSPKPELAKPSQDTHPAGEKPDSKPLPQVSRQKSDPKL-SAQPGARSEAKTQKP-VE 712
Cdd:NF033839   375 EVKPQPETPKPEvkpqPEKPKPEVKPQPEKPKPEVKPQPEKPKPEVKPQPEKPKPEVKPQPeKPKPEVKPQPEKPKPeVK 454
                          170       180       190
                   ....*....|....*....|....*....|
gi 2022791524  713 PAQVKDDPKKLQTKPSPKPETKPAPKGLQP 742
Cdd:NF033839   455 PQPETPKPEVKPQPEKPKPEVKPQPEKPKP 484
PDZ3_PTPN13_FRMPD2-like cd06695
PDZ domain 3 of protein tyrosine phosphatase non-receptor type 13 (PTPN13), FERM and PDZ ...
4542-4608 1.36e-06

PDZ domain 3 of protein tyrosine phosphatase non-receptor type 13 (PTPN13), FERM and PDZ domain-containing protein 2 (FRMPD2), and related domains; PDZ (PSD-95 (Postsynaptic density protein 95), Dlg (Discs large protein), and ZO-1 (Zonula occludens-1)) domain 3 of PTPN13 [also known as Fas-associated protein-tyrosine phosphatase 1 (FAP-1), protein-tyrosine phosphatase 1E (PTP-E1), and protein-tyrosine phosphatase (PTPL1)], FRMPD2 (also known as PDZ domain-containing protein 4; PDZ domain-containing protein 5C), and related domains. PTPN13 regulates negative apoptotic signaling and mediates phosphoinositide 3-kinase (PI3K) signaling. PTPN13 has five PDZ domains. Proteins known to interact with PTPN13 PDZ domains include: PLEKHA1 and PLEKHA2 via PTPN13-PDZ domain 1, Fas receptor and thyroid receptor-interacting protein 6 via PTPN13-PDZ domain 2, nerve growth factor receptor and protein kinase N2 via PTPN13-PDZ domain 3, PDZ and LIM domain 4 (PDLIM4) via PTPN13-PDZ domains 2 and 4, and brain calpain-2 via PTPN13-PDZ domains 3, 4 and 5. Calpain-2-mediated PTPN13 fragments may be involved in abnormal tau aggregation and increased risk for Alzheimer's disease. PDZ domains usually bind in a sequence-specific manner to short peptide sequences located at the C-terminal end of their partner proteins (known as PDZ binding motifs). FRMPD2 is localized in the basolateral membranes of polarized epithelial cells and is associated with tight junction formation and immune response; it contains 3 PDZ domains). The PDZ superfamily includes canonical PDZ domains as well as those with circular permutations and domain swapping mediated by beta-strands. This PTPN13 family PDZ3 domain is a canonical PDZ domain containing six beta-strands A-F and two alpha-helices (alpha-helix 1 and 2), arranged in the order: beta-strands A, B, C, alpha-helix 1, beta-strands D, E, alpha-helix 2 and beta-strand F.


Pssm-ID: 467181 [Multi-domain]  Cd Length: 90  Bit Score: 49.18  E-value: 1.36e-06
                           10        20        30        40        50        60
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*...
gi 2022791524 4542 SGNGLGIRVVGGK-EIPGSSGETGAYIAKILPGGNAEQTGKLIEGMQVLEWNGIPLTGKTYEEVQNII 4608
Cdd:cd06695      9 GSSGLGFSFLGGEnNSPEDPFSGLVRIKKLFPGQPAAESGLIQEGDVILAVNGEPLKGLSYQEVLSLL 76
PDZ_Lin-7-like cd06796
PDZ domain of protein Lin-7 and related domains; PDZ (PSD-95 (Postsynaptic density protein 95), ...
4545-4613 1.41e-06

PDZ domain of protein Lin-7 and related domains; PDZ (PSD-95 (Postsynaptic density protein 95), Dlg (Discs large protein), and ZO-1 (Zonula occludens-1)) domain of Lin-7 (also known as LIN-7 or LIN7), and related domains. Lin-7 targets and organize protein complexes to epithelial and synaptic plasma membranes. There are three mammalian Lin-7 homologs: Lin-7A (protein lin-7 homolog A, also known as mammalian lin-seven protein 1 (MALS-1), vertebrate lin-7 homolog 1 (Veli-1), tax interaction protein 33); Lin-7B (also known as MALS-2, Veli-2); and Lin-7C (also known as MALS-3, Veli-3). Lin-7 is involved in localization of the Let-23 growth factor receptor to the basolateral membrane of epithelial cells, in tight junction localization of insulin receptor substrate p53 (IRSp53), in retaining gamma-aminobutyric (GABA) transporter (BGT-1) at the basolateral surface of epithelial cells, and in regulating recruitment of neurotransmitter receptors to the postsynaptic density (PSD). The Lin7 PDZ domain binds Let-23, BGT and beta-catenin, and NMDA (N-methyl-D-aspartate) receptor NR2B. Lin-7 also binds to the PDZ binding motif located in the C-terminal tail of Rhotekin, an effector protein for small GTPase Rho. PDZ domains usually bind in a sequence-specific manner to short peptide sequences located at the C-terminal end of their partner proteins (known as PDZ binding motifs). The PDZ superfamily includes canonical PDZ domains as well as those with circular permutations and domain swapping mediated by beta-strands. This Lin-7-like family domain is a canonical PDZ domain containing six beta-strands A-F and two alpha-helices (alpha-helix 1 and 2), arranged in the order: beta-strands A, B, C, alpha-helix 1, beta-strands D, E, alpha-helix 2 and beta-strand F.


Pssm-ID: 467258 [Multi-domain]  Cd Length: 86  Bit Score: 48.97  E-value: 1.41e-06
                           10        20        30        40        50        60
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....
gi 2022791524 4545 GLGIRVVGGKEipgssGETGAYIAKILPGGNAEQTGKLIEGMQVLEWNGIPLTGKTYEEVQNIIIQQSG 4613
Cdd:cd06796     13 GLGFNVMGGKE-----QNSPIYISRIIPGGVADRHGGLKRGDQLLSVNGVSVEGEHHEKAVELLKAAQG 76
PHA03379 PHA03379
EBNA-3A; Provisional
204-451 1.46e-06

EBNA-3A; Provisional


Pssm-ID: 223066 [Multi-domain]  Cd Length: 935  Bit Score: 54.68  E-value: 1.46e-06
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2022791524  204 SAKLPAQQQSPKPVQPQ---GHVKPTPQQSESSKAVPQ-----QQQPGEPKQIQKPGPGHPADSKLEQVKQPPQPR---- 271
Cdd:PHA03379   511 ASLSQVPGVAFAPVMPQpmpVEPVPVPTVALERPVCPAppliaMQGPGETSGIVRVRERWRPAPWTPNPPRSPSQMsvrd 590
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2022791524  272 GPQKSQPQQSEPTKPVQQQTSAKPSAGPAKPL--PQQPDSARTSSQAPPPAKPSSQQLG-PVKQPS------QQPARQGG 342
Cdd:PHA03379   591 RLARLRAEAQPYQASVEVQPPQLTQVSPQQPMeyPLEPEQQMFPGSPFSQVADVMRAGGvPAMQPQyfdlplQQPISQGA 670
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2022791524  343 PVKPSSQQAGP-PKQPSQQPGP-EKPSAQQMGPAKQASQ---PGPGKPPLQQTGPVKQAPPQSGPTKPPPQTAGATKAPV 417
Cdd:PHA03379   671 PLAPLRASMGPvPPVPATQPQYfDIPLTEPINQGASAAHflpQQPMEGPLVPERWMFQGATLSQSVRPGVAQSQYFDLPL 750
                          250       260       270
                   ....*....|....*....|....*....|....
gi 2022791524  418 PQPGLTKPPGQQPEPEKPSQQKQASATQPAESAP 451
Cdd:PHA03379   751 TQPINHGAPAAHFLHQPPMEGPWVPEQWMFQGAP 784
PDZ_MPP3-MPP4-MPP7-like cd06799
PDZ domain of membrane palmitoylated proteins 3 (MPP3), MPP4, and MPP7, and related domains; ...
4548-4613 1.70e-06

PDZ domain of membrane palmitoylated proteins 3 (MPP3), MPP4, and MPP7, and related domains; PDZ (PSD-95 (Postsynaptic density protein 95), Dlg (Discs large protein), and ZO-1 (Zonula occludens-1)) domain of MPP3, MPP4, and MPP7, and related domains. MPP3 (also known as MAGUK p55 subfamily member 3, erythrocyte membrane protein p55, or EMP55), MPP4 (also known as MAGUK p55 subfamily member 4 or Discs large homolog 6), and MPP7 (also known as MAGUK p55 subfamily member 7) are membrane-associated guanylate kinase (MAGUK)-like proteins. MPP3 is part of a cell adhesion protein complex including tumor suppressor CADM1 and actin-binding protein 4.1B. Participation in the Crumbs cell polarity complex has also been demonstrated for MPP7 in epithelial cells, and for MPP3 and MPP4 in the retina. MPP4 is needed for proper localization of plasma membrane calcium ATPases and maintenance of calcium homeostasis at the rod photoreceptor synaptic terminals. Binding partners of the MPP3 PDZ domain include nectin-3, serotonin 5-hydroxytryptamine, 5-HT(2C) receptor, and a cell adhesion protein, TSLC1 (tumor suppressor in lung cancer 1); fragments of MPP4 having the PDZ domain bind CRB (PDZ-SH3-GUK) and GABA transporter GAT1 (PDZ-SH3). PDZ domains usually bind in a sequence-specific manner to short peptide sequences located at the C-terminal end of their partner proteins (known as PDZ binding motifs). The PDZ superfamily includes canonical PDZ domains as well as those with circular permutations and domain swapping mediated by beta-strands. This MPP1-like family domain is a canonical PDZ domain containing six beta-strands A-F and two alpha-helices (alpha-helix 1 and 2), arranged in the order: beta-strands A, B, C, alpha-helix 1, beta-strands D, E, alpha-helix 2 and beta-strand F.


Pssm-ID: 467260 [Multi-domain]  Cd Length: 81  Bit Score: 48.78  E-value: 1.70e-06
                           10        20        30        40        50        60        70
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|.
gi 2022791524 4548 IRVVGGKEIPGSS----GETGA-YIAKILPGGNAEQTGKLIEGMQVLEWNGIPLTGKTYEEVQNIIIQQSG 4613
Cdd:cd06799      3 VRLVKNNEPLGATikrdEKTGAiVVARIMRGGAADRSGLIHVGDELREVNGISVEGKDPEEVIQILANSQG 73
PDZ2_Scribble-like cd06703
PDZ domain 2 of Drosophila Scribble, human Scribble homolog, and related domains; PDZ (PSD-95 ...
4531-4612 1.76e-06

PDZ domain 2 of Drosophila Scribble, human Scribble homolog, and related domains; PDZ (PSD-95 (Postsynaptic density protein 95), Dlg (Discs large protein), and ZO-1 (Zonula occludens-1)) domain 2 of Drosophila Scribble (also known as LAP4), human Scribble homolog (also known as hScrib, LAP4, CriB1, ScrB1 and Vartul), and related domains. They belong to the LAP family, which describes proteins that contain either one or four PDZ domains and 16 LRRs (leucine-rich repeats) and function in controlling cell shape, size and subcellular protein localization. In Drosophila, the Scribble complex, comprising Scribble, discs large, and lethal giant larvae, plays a role in apico-basal cell polarity, in other forms of polarity, including regulation of the actin cytoskeleton, cell signaling and vesicular trafficking, and in tumor development. Mammalian Scribble is important in many aspects of cancer development. Scribble and its homologs can be downregulated or overexpressed in cancer; they have a role in cancer beyond their function in loss of cell polarity. PDZ domains usually bind in a sequence-specific manner to short peptide sequences located at the C-terminal end of their partner proteins (known as PDZ binding motifs). The PDZ superfamily includes canonical PDZ domains as well as those with circular permutations and domain swapping mediated by beta-strands. This Scribble-like family PDZ2 domain is a canonical PDZ domain containing six beta-strands A-F and two alpha-helices (alpha-helix 1 and 2), arranged in the order: beta-strands A, B, C, alpha-helix 1, beta-strands D, E, alpha-helix 2 and beta-strand F.


Pssm-ID: 467187 [Multi-domain]  Cd Length: 92  Bit Score: 48.80  E-value: 1.76e-06
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2022791524 4531 KLLRDpkdhtvsGNGLGIRVVGGKE-IPGSSGETGAYIAKILPGGNAEQTGKLIEGMQVLEWNGIPLTGKTYEEVQNIII 4609
Cdd:cd06703      6 TLIRD-------GKGLGFSIAGGKGsTPFRDGDEGIFISRITEGGAADRDGKLQVGDRVLSINGVDVTEARHDQAVALLT 78

                   ...
gi 2022791524 4610 QQS 4612
Cdd:cd06703     79 SSS 81
PRK07764 PRK07764
DNA polymerase III subunits gamma and tau; Validated
208-451 1.81e-06

DNA polymerase III subunits gamma and tau; Validated


Pssm-ID: 236090 [Multi-domain]  Cd Length: 824  Bit Score: 54.61  E-value: 1.81e-06
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2022791524  208 PAQQQSPKPVQPQGHVKPTPQQSESSKAVPQQQQPGEPKQIQKPGPGHPADSKLEQVKQPPQPRGPQKSQPQQSEPTkpv 287
Cdd:PRK07764   400 SAAAAAPAAAPAPAAAAPAAAAAPAPAAAPQPAPAPAPAPAPPSPAGNAPAGGAPSPPPAAAPSAQPAPAPAAAPEP--- 476
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2022791524  288 QQQTSAKPSAGPAKPLPQQPDSARTSSQAPPPA-------------------------KPSSQ---------QLGPVKQP 333
Cdd:PRK07764   477 TAAPAPAPPAAPAPAAAPAAPAAPAAPAGADDAatlrerwpeilaavpkrsrktwailLPEATvlgvrgdtlVLGFSTGG 556
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2022791524  334 SQQ-------------------------PARQGGPVKPSSQQAGPPKQPSQQPGPEKPSAQQMGPAKQASQPGPGKPPLQ 388
Cdd:PRK07764   557 LARrfaspgnaevlvtalaeelggdwqvEAVVGPAPGAAGGEGPPAPASSGPPEEAARPAAPAAPAAPAAPAPAGAAAAP 636
                          250       260       270       280       290       300
                   ....*....|....*....|....*....|....*....|....*....|....*....|...
gi 2022791524  389 QTGPVKQAPPQSGPTKPPPQTAGATKAPVPQPGLTKPPGQQPEPEKPSQQKQASATQPAESAP 451
Cdd:PRK07764   637 AEASAAPAPGVAAPEHHPKHVAVPDASDGGDGWPAKAGGAAPAAPPPAPAPAAPAAPAGAAPA 699
PDZ2_DLG5-like cd06765
PDZ domain 2 of Discs Large 5 (Dlg5) and related domains; PDZ (PSD-95 (Postsynaptic density ...
4548-4608 2.30e-06

PDZ domain 2 of Discs Large 5 (Dlg5) and related domains; PDZ (PSD-95 (Postsynaptic density protein 95), Dlg (Discs large protein), and ZO-1 (Zonula occludens-1)) domain 2 of Drosophila and mammalian Dlg5, and related domains. Dlg5 is a scaffold protein with multiple conserved functions that are independent of each other in regulating growth, cell polarity, and cell adhesion. It has a coiled-coil domain, 4 PDZ domains and a MAGUK domain (an SH3 domain next to a non-catalytically active guanylate kinase domain). Deregulation of Dlg5 has been implicated in the malignancy of several cancer types. PDZ domains usually bind in a sequence-specific manner to short peptide sequences located at the C-terminal end of their partner proteins (known as PDZ binding motifs). The PDZ superfamily includes canonical PDZ domains as well as those with circular permutations and domain swapping mediated by beta-strands. This Dlg5-like family PSZ2 domain is a canonical PDZ domain containing six beta-strands A-F and two alpha-helices (alpha-helix 1 and 2), arranged in the order: beta-strands A, B, C, alpha-helix 1, beta-strands D, E, alpha-helix 2 and beta-strand F.


Pssm-ID: 467246 [Multi-domain]  Cd Length: 77  Bit Score: 48.11  E-value: 2.30e-06
                           10        20        30        40        50        60
                   ....*....|....*....|....*....|....*....|....*....|....*....|.
gi 2022791524 4548 IRVVGGKEiPGSSGETGAYIAKILPGGNAEQTGKLIEGMQVLEWNGIPLTGKTYEEVQNII 4608
Cdd:cd06765      2 INLSGQKD-SGISLENGVFISRIVPGSPAAKEGSLTVGDRIIAINGIALDNKSLSECEALL 61
PDZ2_LNX1_2-like cd06678
PDZ domain 2 of human Ligand of Numb protein X 1 (LNX1) and LNX2, and related domains; PDZ ...
4539-4614 2.63e-06

PDZ domain 2 of human Ligand of Numb protein X 1 (LNX1) and LNX2, and related domains; PDZ (PSD-95 (Postsynaptic density protein 95), Dlg (Discs large protein), and ZO-1 (Zonula occludens-1)) domain 2 of LNX1 (also known as PDZ domain-containing RING finger protein 2, PDZRN2) and LNX2 (also known as PDZ domain-containing RING finger protein 1, PDZRN1), and related domains. LNX1 and LNX2 are Ring (Really Interesting New Gene) finger and PDZ domain-containing E3 ubiquitin ligases that bind to the cell fate determinant protein NUMB and mediate its ubiquitination. LNX1 can ubiquitinate a number of other ligands including PPFIA1, KLHL11, KIF7 and ERC2. LNX1 and LNX2 each have four PDZ domains. PDZ domains usually bind in a sequence-specific manner to short peptide sequences located at the C-terminal end of their partner proteins (known as PDZ binding motifs). The PDZ superfamily includes canonical PDZ domains as well as those with circular permutations and domain swapping mediated by beta-strands. This LNX family PDZ2 domain is a canonical PDZ domain containing six beta-strands A-F and two alpha-helices (alpha-helix 1 and 2), arranged in the order: beta-strands A, B, C, alpha-helix 1, beta-strands D, E, alpha-helix 2 and beta-strand F.


Pssm-ID: 467166 [Multi-domain]  Cd Length: 82  Bit Score: 48.01  E-value: 2.63e-06
                           10        20        30        40        50        60        70
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*.
gi 2022791524 4539 HTVSGNGLGIRVVGGKEipgssgETGAYIAKILPGGNAEQTGKLIEGMQVLEWNGIPLTGKTYEEVQNiIIQQSGE 4614
Cdd:cd06678      6 NKRDGEQLGIKLVRKKD------EPGVFILDLLEGGLAARDGRLKSDDRVLAINGQDLRHGTPEQAAQ-IIQASGE 74
PspC_subgroup_2 NF033839
pneumococcal surface protein PspC, LPXTG-anchored form; The pneumococcal surface protein PspC, ...
579-747 2.75e-06

pneumococcal surface protein PspC, LPXTG-anchored form; The pneumococcal surface protein PspC, as described in Streptococcus pneumoniae, is a repetitive and highly variable protein, recognized by a conserved N-terminal domain and also by genomic location. This form, subgroup 2, is anchored covalently after cleavage by sortase at a C-terminal LPXTG site. The other form, subgroup 1, has variable numbers of a choline-binding repeat in the C-terminal region, and is also known as choline-binding protein A.


Pssm-ID: 468202 [Multi-domain]  Cd Length: 557  Bit Score: 53.62  E-value: 2.75e-06
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2022791524  579 MPGSPPVKAKTSRAEPADTSQQIDPAPKSDRAKPSQAEDKQK-----QPSIQKPAADTVLTPAAPGLKQDSAGPRP--PP 651
Cdd:NF033839   299 MQPSPQPEKKEVKPEPETPKPEVKPQLEKPKPEVKPQPEKPKpevkpQLETPKPEVKPQPEKPKPEVKPQPEKPKPevKP 378
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2022791524  652 TQQKVTDSPKPELAKPSQDTHPAGEKPDSKPLPQVSRQKSDPKLSAQPGaRSEAKTQKPVEPAQVKDDPKKlqtkpsPKP 731
Cdd:NF033839   379 QPETPKPEVKPQPEKPKPEVKPQPEKPKPEVKPQPEKPKPEVKPQPEKP-KPEVKPQPEKPKPEVKPQPEK------PKP 451
                          170
                   ....*....|....*.
gi 2022791524  732 ETKPAPKGLQPGAGPK 747
Cdd:NF033839   452 EVKPQPETPKPEVKPQ 467
PDZ2_Dlg1-2-4-like cd06724
PDZ domain 2 of human discs large homolog 1 (Dlg1), Dlg2, and Dlg4, Drosophila disc large (Dlg) ...
4530-4603 2.78e-06

PDZ domain 2 of human discs large homolog 1 (Dlg1), Dlg2, and Dlg4, Drosophila disc large (Dlg), and related domains; PDZ (PSD-95 (Postsynaptic density protein 95), Dlg (Discs large protein), and ZO-1 (Zonula occludens-1)) domain 1 of Drosophila Dlg1, human Dlg1,2, and 4 and related domains. Dlg1 (also known as synapse-associated protein Dlg197 or SAP-97), Dlg2 (also known as channel-associated protein of synapse-110, postsynaptic density protein 93, or PSD-93), Dlg4 (also known as postsynaptic density protein 95, PSD-95, synapse-associated protein 90, or SAP-90) each have 3 PDZ domains and belong to the membrane-associated guanylate kinase family. Dlg1 regulates antigen receptor signaling and cell polarity in lymphocytes, B-cell proliferation and antibody production, and TGFalpha bioavailability; its PDZ3 domain binds pro-TGFalpha, and its PDZ2 domain binds the TACE metalloprotease responsible for cleaving pro-TGFalpha to a soluble form. Dlg2 is involved in N-methyl-D-aspartate (NMDA) receptor signaling. It regulates surface expression of NMDA receptors in dorsal horn neurons of the spinal cord, and it also interacts with NMDA receptor subunits and with Shaker-type K+ channel subunits to cluster into a channel complex. Dlg4 PDZ1 domain binds NMDA receptors, and its PDZ2 domain binds neuronal nitric oxide synthase (nNOS), forming a complex in neurons. The Drosophila Scribble complex (Scribble, Dlg, and lethal giant larvae) plays a role in apico-basal cell polarity, and in other forms of polarity, including regulation of the actin cytoskeleton, cell signaling and vesicular trafficking, and in tumor development. Postsynaptic targeting of Drosophila DLG requires interactions mediated by the first two PDZ domains. PDZ domains usually bind in a sequence-specific manner to short peptide sequences located at the C-terminal end of their partner proteins (known as PDZ binding motifs). The PDZ superfamily includes canonical PDZ domains as well as those with circular permutations and domain swapping mediated by beta-strands. This Dlg-like family PDZ2 domain is a canonical PDZ domain containing six beta-strands A-F and two alpha-helices (alpha-helix 1 and 2), arranged in the order: beta-strands A, B, C, alpha-helix 1, beta-strands D, E, alpha-helix 2 and beta-strand F.


Pssm-ID: 467207 [Multi-domain]  Cd Length: 85  Bit Score: 48.03  E-value: 2.78e-06
                           10        20        30        40        50        60        70
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*..
gi 2022791524 4530 LKLLRDPKdhtvsgnGLGIRVVGGK---EIPGssgETGAYIAKILPGGNAEQTGKLIEGMQVLEWNGIPLTGKTYEE 4603
Cdd:cd06724      2 IKLVKGPK-------GLGFSIAGGVgnqHIPG---DNGIYVTKIIEGGAAQKDGRLQVGDKLLAVNDVSLEEVTHEE 68
PRK07003 PRK07003
DNA polymerase III subunit gamma/tau;
235-451 2.88e-06

DNA polymerase III subunit gamma/tau;


Pssm-ID: 235906 [Multi-domain]  Cd Length: 830  Bit Score: 53.70  E-value: 2.88e-06
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2022791524  235 AVPQQQQPGEPKQIQKPGPGHPADSkleqVKQPPQPRGPQKSQPQQSEPTKPVQQQTSAKPSAGPAKPLPQQ-------- 306
Cdd:PRK07003   367 APGGGVPARVAGAVPAPGARAAAAV----GASAVPAVTAVTGAAGAALAPKAAAAAAATRAEAPPAAPAPPAtadrgdda 442
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2022791524  307 -PDSARTSSQAPPPAKPSSQQLGPVKQPSQQPARQGGPvkPSSQQAGPPKQPSQQPGPEKPSAQQMG--PAKQASQPGPG 383
Cdd:PRK07003   443 aDGDAPVPAKANARASADSRCDERDAQPPADSGSASAP--ASDAPPDAAFEPAPRAAAPSAATPAAVpdARAPAAASRED 520
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2022791524  384 KPPLQQTGPVKQAPPQSGPTKPPPQ-------------------------TAGATKAPVPQPGLTKPPG-----QQPEPE 433
Cdd:PRK07003   521 APAAAAPPAPEARPPTPAAAAPAARaggaaaaldvlrnagmrvssdrgarAAAAAKPAAAPAAAPKPAAprvavQVPTPR 600
                          250
                   ....*....|....*...
gi 2022791524  434 KPSQQKQASATQPAESAP 451
Cdd:PRK07003   601 ARAATGDAPPNGAARAEQ 618
PRK12323 PRK12323
DNA polymerase III subunit gamma/tau;
266-457 3.16e-06

DNA polymerase III subunit gamma/tau;


Pssm-ID: 237057 [Multi-domain]  Cd Length: 700  Bit Score: 53.73  E-value: 3.16e-06
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2022791524  266 QPPQPRGPQKSQPQQSEPTKPVQQQTSAKPSAGPAKPLPQQPDSARTSSQAPPPAKPSSQQLGPVKQPSQQPARQGGPVK 345
Cdd:PRK12323   364 RPGQSGGGAGPATAAAAPVAQPAPAAAAPAAAAPAPAAPPAAPAAAPAAAAAARAVAAAPARRSPAPEALAAARQASARG 443
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2022791524  346 PSSQQAgPPKQPSQQPGPEKPsaqqmgPAKQASQPGPGKPPLQQTGPVKQAPPQSGPTKPPPQTAGATKAPVPQP----- 420
Cdd:PRK12323   444 PGGAPA-PAPAPAAAPAAAAR------PAAAGPRPVAAAAAAAPARAAPAAAPAPADDDPPPWEELPPEFASPAPaqpda 516
                          170       180       190       200
                   ....*....|....*....|....*....|....*....|....*.
gi 2022791524  421 --------GLTKPPGQQPEPEKPSQQKQASATQ-PAESAPKKTFCP 457
Cdd:PRK12323   517 apagwvaeSIPDPATADPDDAFETLAPAPAAAPaPRAAAATEPVVA 562
PHA03377 PHA03377
EBNA-3C; Provisional
191-467 3.22e-06

EBNA-3C; Provisional


Pssm-ID: 177614 [Multi-domain]  Cd Length: 1000  Bit Score: 53.90  E-value: 3.22e-06
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2022791524  191 IQKEQGKPEEQRGSAKLPAQ----QQSPKPVQPqghvkPTPQQSESSKAVPQQQQPGEPKQIQKPGP---GHPADSKL-- 261
Cdd:PHA03377   433 LVKTSGRSDEAEQAQSTPERpgpsDQPSVPVEP-----AHLTPVEHTTVILHQPPQSPPTVAIKPAPppsRRRRGACVvy 507
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2022791524  262 ----------------EQVKQPPQPR-----GPQKSQPQQSEPTKP-VQQQTSAKPSAGP--AKPLPQQPDSARTSSQAP 317
Cdd:PHA03377   508 dddiievidvetteeeESVTQPAKPHrkvqdGFQRSGRRQKRATPPkVSPSDRGPPKASPpvMAPPSTGPRVMATPSTGP 587
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2022791524  318 PPAKPssqqlgPVKQPSQQPARQGGPVKPSSQQAGPPKQPSQQPGP--------EKPSAQQMGPAKQ------ASQPGPG 383
Cdd:PHA03377   588 RDMAP------PSTGPRQQAKCKDGPPASGPHEKQPPSSAPRDMAPsvvrmflrERLLEQSTGPKPKsfwemrAGRDGSG 661
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2022791524  384 --KPPLQQTGPVKQappqsgPTKPPPQTAGATKApvpqpgLTKPPGQQPEPEKPSQQKQASATQPAESAPKKTFCPLCTT 461
Cdd:PHA03377   662 iqQEPSSRRQPATQ------STPPRPSWLPSVFV------LPSVDAGRAQPSEESHLSSMSPTQPISHEEQPRYEDPDDP 729

                   ....*.
gi 2022791524  462 TELLLH 467
Cdd:PHA03377   730 LDLSLH 735
PDZ pfam00595
PDZ domain; PDZ domains are found in diverse signaling proteins.
4537-4608 3.92e-06

PDZ domain; PDZ domains are found in diverse signaling proteins.


Pssm-ID: 395476 [Multi-domain]  Cd Length: 81  Bit Score: 47.66  E-value: 3.92e-06
                           10        20        30        40        50        60        70
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|..
gi 2022791524 4537 KDHTVSGNGLGIRVVGGkeipGSSGETGAYIAKILPGGNAEQTGkLIEGMQVLEWNGIPLTGKTYEEVQNII 4608
Cdd:pfam00595    3 TLEKDGRGGLGFSLKGG----SDQGDPGIFVSEVLPGGAAEAGG-LKVGDRILSINGQDVENMTHEEAVLAL 69
PHA03377 PHA03377
EBNA-3C; Provisional
184-455 4.33e-06

EBNA-3C; Provisional


Pssm-ID: 177614 [Multi-domain]  Cd Length: 1000  Bit Score: 53.52  E-value: 4.33e-06
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2022791524  184 AGRKQKVIQKEqgkPEEQRGSA---KLPAQQQSPK----PVQPQGhvKPTPQQSESSKAVPQQQQPGEPKQIQKPGPGHP 256
Cdd:PHA03377   655 AGRDGSGIQQE---PSSRRQPAtqsTPPRPSWLPSvfvlPSVDAG--RAQPSEESHLSSMSPTQPISHEEQPRYEDPDDP 729
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2022791524  257 ADSKLE-QVKQPPQPRGPQKSQPQQSEPTKPVQQQTSAKPSAGPAKPLpQQPDSARTSSQAPP------PAKPSSQQLGP 329
Cdd:PHA03377   730 LDLSLHpDQAPPPSHQAPYSGHEEPQAQQAPYPGYWEPRPPQAPYLGY-QEPQAQGVQVSSYPgyagpwGLRAQHPRYRH 808
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2022791524  330 VKQPSQQPARQGGPVKPSSQQAG-PPKQPSQQPGPEKPSAQQMGPAkqasQPGPGKPPLQQTgpvkQAPPQSGPTKPPPQ 408
Cdd:PHA03377   809 SWAYWSQYPGHGHPQGPWAPRPPhLPPQWDGSAGHGQDQVSQFPHL----QSETGPPRLQLS----QVPQLPYSQTLVSS 880
                          250       260       270       280
                   ....*....|....*....|....*....|....*....|....*...
gi 2022791524  409 TAGATKAPVPQPGLTKPPGQQPEPEKPSQQKQAS-ATQPAESAPKKTF 455
Cdd:PHA03377   881 SAPSWSSPQPRAPIRPIPTRFPPPPMPLQDSMAVgCDSSGTACPSMPF 928
PDZ13_MUPP1-like cd06676
PDZ domain 13 of multi-PDZ-domain protein 1 (MUPP1) and related domains; PDZ (PSD-95 ...
4545-4613 4.59e-06

PDZ domain 13 of multi-PDZ-domain protein 1 (MUPP1) and related domains; PDZ (PSD-95 (Postsynaptic density protein 95), Dlg (Discs large protein), and ZO-1 (Zonula occludens-1)) domain 13 of MUPP1. MUPP1 and PATJ serve as scaffolding proteins linking different proteins and protein complexes involved in the organization of tight junctions and epithelial polarity. MUPP1 contains an L27 (Lin-2 and Lin-7 binding) domain and 13 PDZ domains. PATJ (also known as INAD-like) contains an L27 domain and ten PDZ domains. PATJ lacks 3 PDZ domains seen in MUPP1: PDZ6, PDZ9, and PDZ13. This MuPP1-like PDZ13 domain is therefore absent from PATJ. MUPP1 and PATJ share several binding partners, including junctional adhesion molecules (JAM), zonula occludens (ZO)-3, Pals1 (protein associated with Lin-7), Par (partitioning defective)-6 proteins, and nectins (adherence junction adhesion molecules). PDZ domains usually bind in a sequence-specific manner to short peptide sequences located at the C-terminal end of their partner proteins (known as PDZ binding motifs). The PDZ superfamily includes canonical PDZ domains as well as those with circular permutations and domain swapping mediated by beta-strands. This MUPP1-like family PDZ13 domain is a canonical PDZ domain containing six beta-strands A-F and two alpha-helices (alpha-helix 1 and 2), arranged in the order: beta-strands A, B, C, alpha-helix 1, beta-strands D, E, alpha-helix 2 and beta-strand F.


Pssm-ID: 467164 [Multi-domain]  Cd Length: 83  Bit Score: 47.34  E-value: 4.59e-06
                           10        20        30        40        50        60
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....
gi 2022791524 4545 GLGIRVVGGKEIPgsSGETGAYIAKILPGGNAEQTGKLIEGMQVLEWNGIPLTGKTYEEVQNIIIQQSG 4613
Cdd:cd06676     10 GLGFSIVGGFGSP--HGDLPIYVKTVFEKGAAAEDGRLKRGDQILAVNGESLEGVTHEEAVNILKKTKG 76
Amelin smart00817
Ameloblastin precursor (Amelin); This family consists of several mammalian Ameloblastin ...
273-407 5.22e-06

Ameloblastin precursor (Amelin); This family consists of several mammalian Ameloblastin precursor (Amelin) proteins. Matrix proteins of tooth enamel consist mainly of amelogenin but also of non-amelogenin proteins, which, although their volumetric percentage is low, have an important role in enamel mineralisation. One of the non-amelogenin proteins is ameloblastin, also known as amelin and sheathlin. Ameloblastin (AMBN) is one of the enamel sheath proteins which is though to have a role in determining the prismatic structure of growing enamel crystals.


Pssm-ID: 214832 [Multi-domain]  Cd Length: 411  Bit Score: 52.20  E-value: 5.22e-06
                            10        20        30        40        50        60        70        80
                    ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2022791524   273 PQKSQPQQSEPTKPVQQQTSAKPSAGPakPLPQQPdsartssqappPAKPSSQQLGPVKQPSQQPARQGGPVKPSSQQAG 352
Cdd:smart00817   72 PHSSFPWLRPREHETQQYEYSLPVHPP--PLPSQP-----------SLQPQQPGLKPFLQPTALPTNQATPQKNGPQPPM 138
                            90       100       110       120       130       140
                    ....*....|....*....|....*....|....*....|....*....|....*....|....*
gi 2022791524   353 PPKQPSQQPGPEKPSAQQMGPAKQASQP--------GPGKPPLQQTGP--VKQAPPQSGPTKPPP 407
Cdd:smart00817  139 HLGQPPLQQAELPMIPPQVAPSDKPPQTelplydfaDPQNPLLFQIAHlmSRGPMPQNKQQHLYP 203
PDZ1_MUPP1-like cd06689
PDZ domain 1 of multi-PDZ-domain protein 1 (MUPP1) and PATJ (protein-associated tight junction) ...
4542-4617 5.92e-06

PDZ domain 1 of multi-PDZ-domain protein 1 (MUPP1) and PATJ (protein-associated tight junction), and related domains; PDZ (PSD-95 (Postsynaptic density protein 95), Dlg (Discs large protein), and ZO-1 (Zonula occludens-1)) domain 1 of MUPP1 and PATJ, and related domains. MUPP1 and PATJ serve as scaffolding proteins linking different proteins and protein complexes involved in the organization of tight junctions and epithelial polarity. MUPP1 contains an L27 (Lin-2 and Lin-7 binding) domain and 13 PDZ domains. PATJ (also known as INAD-like) contains an L27 domain and ten PDZ domains. MUPP1 and PATJ share several binding partners, including junctional adhesion molecules (JAM), zonula occludens (ZO)-3, Pals1 (protein associated with Lin-7), Par (partitioning defective)-6 proteins, and nectins (adherence junction adhesion molecules). PATJ lacks 3 PDZ domains seen in MUPP1: PDZ6, 9, and 13; consequently, MUPP1 PDZ7 and 8 align with PATJ PDZ6 and 7; and MUPP1 PDZ domains 10-12 align with PATJ PDZ domains 8-10. PDZ domains usually bind in a sequence-specific manner to short peptide sequences located at the C-terminal end of their partner proteins (known as PDZ binding motifs). The PDZ superfamily includes canonical PDZ domains as well as those with circular permutations and domain swapping mediated by beta-strands. This MUPP1-like family PDZ1 domain is a canonical PDZ domain containing six beta-strands A-F and two alpha-helices (alpha-helix 1 and 2), arranged in the order: beta-strands A, B, C, alpha-helix 1, beta-strands D, E, alpha-helix 2 and beta-strand F.


Pssm-ID: 467176 [Multi-domain]  Cd Length: 102  Bit Score: 47.63  E-value: 5.92e-06
                           10        20        30        40        50        60        70
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*..
gi 2022791524 4542 SGNGLGIRVVGGKEipGSSGETGAYIAKILPGGNAEQTGKLIEGMQVLEWNGIPL-TGKTYEEVQNIIIQQSGEAEI 4617
Cdd:cd06689     24 ESGGLGFSVVGLKS--ENRGELGIFVQEIQPGSVAARDGRLKENDQILAINGQPLdQSISHQQAIAILQQAKGSVEL 98
SOBP pfam15279
Sine oculis-binding protein; SOBP is associated with syndromic and nonsyndromic intellectual ...
256-430 5.97e-06

Sine oculis-binding protein; SOBP is associated with syndromic and nonsyndromic intellectual disability. It carries a zinc-finger of the zf-C2H2 type at the N-terminus, and a highly characteriztic C-terminal PhPhPhPhPhPh motif. The deduced 873-amino acid protein contains an N-terminal nuclear localization signal (NLS), followed by 2 FCS-type zinc finger motifs, a proline-rich region (PR1), a putative RNA-binding motif region, and a C-terminal NLS embedded in a second proline-rich motif. SOBP is expressed in various human tissues, including developing mouse brain at embryonic day 14. In postnatal and adult mouse brain SOBP is expressed in all neurons, with intense staining in the limbic system. Highest expression is in layer V cortical neurons, hippocampus, pyriform cortex, dorsomedial nucleus of thalamus, amygdala, and hypothalamus. Postnatal expression of SOBP in the limbic system corresponds to a time of active synaptogenesis. the family is also referred to as Jackson circler, JXC1. In seven affected siblings from a consanguineous Israeli Arab family with mental retardation, anterior maxillary protrusion, and strabismus mutations were found in this protein.


Pssm-ID: 464609 [Multi-domain]  Cd Length: 325  Bit Score: 51.74  E-value: 5.97e-06
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2022791524  256 PADSKLEQVKQPPQPRGPQKSQPQQSEPTKPVQ---------QQTSAKPSAGPAKPLPQQPDSARTSSQAPPPAKPSSQQ 326
Cdd:pfam15279   85 PASTRSESVSPGPSSSASPSSSPTSSNSSKPLIsvassskllAPKPHEPPSLPPPPLPPKKGRRHRPGLHPPLGRPPGSP 164
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2022791524  327 LGPVK----QPSQQPARQGGPV----KPSSQQAGPPKQPSQQPGPEKPSAQQMGPAkQASQPGPG-----------KPPL 387
Cdd:pfam15279  165 PMSMTprglLGKPQQHPPPSPLpafmEPSSMPPPFLRPPPSIPQPNSPLSNPMLPG-IGPPPKPPrnlgppsnpmhRPPF 243
                          170       180       190       200
                   ....*....|....*....|....*....|....*....|...
gi 2022791524  388 qqtGPVKQAPPQSGPTKPPPQTAGATKAPVPQPGLTKPPGQQP 430
Cdd:pfam15279  244 ---SPHHPPPPPTPPGPPPGLPPPPPRGFTPPFGPPFPPVNMM 283
PRK14948 PRK14948
DNA polymerase III subunit gamma/tau;
256-450 6.04e-06

DNA polymerase III subunit gamma/tau;


Pssm-ID: 237862 [Multi-domain]  Cd Length: 620  Bit Score: 52.66  E-value: 6.04e-06
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2022791524  256 PADSKLEQVKQPPQPRGPQKSQPQQSEPTKPVQQ-QTSAKPSAGPAKPLPQQPDSARTSSQAPPPAKPSSQQ-------- 326
Cdd:PRK14948   367 EIANASAPANPTPAPNPSPPPAPIQPSAPKTKQAaTTPSPPPAKASPPIPVPAEPTEPSPTPPANAANAPPSlnleelwq 446
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2022791524  327 --LGPVKQPSQQP-ARQ---------------------------------------GGPVKP--SSQQAGPPKQPSQQPG 362
Cdd:PRK14948   447 qiLAKLELPSTRMlLSQqaelvsldsnraviavspnwlgmvqsrkplleqafakvlGRSIKLnlESQSGSASNTAKTPPP 526
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2022791524  363 PEKPSAQQMGPakqasqpgpgkPPLQQtgPVKQAPPQSGPTKPPPQTAGATKAPVPQPGLTKPPGQQPEPEKPSQQKQAS 442
Cdd:PRK14948   527 PQKSPPPPAPT-----------PPLPQ--PTATAPPPTPPPPPPTATQASSNAPAQIPADSSPPPPIPEEPTPSPTKDSS 593

                   ....*...
gi 2022791524  443 ATQPAESA 450
Cdd:PRK14948   594 PEEIDKAA 601
PRK07994 PRK07994
DNA polymerase III subunits gamma and tau; Validated
285-434 6.17e-06

DNA polymerase III subunits gamma and tau; Validated


Pssm-ID: 236138 [Multi-domain]  Cd Length: 647  Bit Score: 52.56  E-value: 6.17e-06
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2022791524  285 KPVQQQTSAKPSAGPAKPLPQQPDSARTSSQAPPPAkPSSQQLGPVKQPSQQPARQGGPVKPSSQQAGPPKQPSQ-QPGP 363
Cdd:PRK07994   360 HPAAPLPEPEVPPQSAAPAASAQATAAPTAAVAPPQ-APAVPPPPASAPQQAPAVPLPETTSQLLAARQQLQRAQgATKA 438
                           90       100       110       120       130       140       150
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|...
gi 2022791524  364 EKPSAQQMGPAKQASQPGPGKPPLQQTGPVKQAPPQSGPTKP--PPQTAGATKAPVPQPgltKPPGQQPEPEK 434
Cdd:PRK07994   439 KKSEPAAASRARPVNSALERLASVRPAPSALEKAPAKKEAYRwkATNPVEVKKEPVATP---KALKKALEHEK 508
PHA03307 PHA03307
transcriptional regulator ICP4; Provisional
225-451 6.54e-06

transcriptional regulator ICP4; Provisional


Pssm-ID: 223039 [Multi-domain]  Cd Length: 1352  Bit Score: 52.87  E-value: 6.54e-06
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2022791524  225 PTPQQSESSKAVPQQQQPGEPKQIQKPGPGHPADSKLEQVKQPPQPRGPQK-----------SQPQQSEPTKPVQQQTSA 293
Cdd:PHA03307   177 SSPEETARAPSSPPAEPPPSTPPAAASPRPPRRSSPISASASSPAPAPGRSaaddagasssdSSSSESSGCGWGPENECP 256
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2022791524  294 KPSAGPAkPLPQQPDSARTSSQAPPPAKPSSQQLGP-VKQPSQQPARQGGPVKPSSQQAGPPKQPSQQPGPEKPSAQQMG 372
Cdd:PHA03307   257 LPRPAPI-TLPTRIWEASGWNGPSSRPGPASSSSSPrERSPSPSPSSPGSGPAPSSPRASSSSSSSRESSSSSTSSSSES 335
                          170       180       190       200       210       220       230
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....
gi 2022791524  373 PAKQASQPGPGKPPlqqtgpvkqaPPQSGPTKPPPQTAGATKAPVPQPGLTKPPGQQPEPEKPSQQKQASATQPAESAP 451
Cdd:PHA03307   336 SRGAAVSPGPSPSR----------SPSPSRPPPPADPSSPRKRPRPSRAPSSPAASAGRPTRRRARAAVAGRARRRDAT 404
PBP1 COG5180
PAB1-binding protein, interacts with poly(A)-binding protein [RNA processing and modification]; ...
198-432 7.42e-06

PAB1-binding protein, interacts with poly(A)-binding protein [RNA processing and modification];


Pssm-ID: 444064 [Multi-domain]  Cd Length: 548  Bit Score: 52.37  E-value: 7.42e-06
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2022791524  198 PEEQRGSAKLPAQQQSPKPVQPQGHVKPTPQQSESSKaVPQQQQPGEPKQIQKPGPGHPADskleQVKQPPQPRGPQKSQ 277
Cdd:COG5180    167 DGDSASTLPPPAEKLDKVLTEPRDALKDSPEKLDRPK-VEVKDEAQEEPPDLTGGADHPRP----EAASSPKVDPPSTSE 241
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2022791524  278 PQQSEPTKPVQQQTSAKPSAG----------PAKPLPQQPDSA-----RTSSQAPPPAKPSSQQLGPVKQPSQQPAR-QG 341
Cdd:COG5180    242 ARSRPATVDAQPEMRPPADAKerrraaigdtPAAEPPGLPVLEagsepQSDAPEAETARPIDVKGVASAPPATRPVRpPG 321
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2022791524  342 GPVKPSSQQagpPKQPSQQPGPEKPSAQQMGPAKQASQPGPGKPPLQQTGPVKQAPPQSGPTKPPPQTAGATKAPVPQPG 421
Cdd:COG5180    322 GARDPGTPR---PGQPTERPAGVPEAASDAGQPPSAYPPAEEAVPGKPLEQGAPRPGSSGGDGAPFQPPNGAPQPGLGRR 398
                          250
                   ....*....|.
gi 2022791524  422 LTKPPGQQPEP 432
Cdd:COG5180    399 GAPGPPMGAGD 409
PRK07994 PRK07994
DNA polymerase III subunits gamma and tau; Validated
255-431 7.63e-06

DNA polymerase III subunits gamma and tau; Validated


Pssm-ID: 236138 [Multi-domain]  Cd Length: 647  Bit Score: 52.18  E-value: 7.63e-06
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2022791524  255 HPADSkleqVKQPPQPRGPQKSQPQQSEPTKPVQQQTSAkpsagPAKPLPQQPDSARTSSQAPPPAKPSSQQLGPVKQPS 334
Cdd:PRK07994   360 HPAAP----LPEPEVPPQSAAPAASAQATAAPTAAVAPP-----QAPAVPPPPASAPQQAPAVPLPETTSQLLAARQQLQ 430
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2022791524  335 QQPARQGGPV-KPSSQQAGPPKQPSQQPGPEKPSAQQMGPAKQASQPGPGKPPLQqtgPVKQAPPQSGPTKPPPQTAGAT 413
Cdd:PRK07994   431 RAQGATKAKKsEPAAASRARPVNSALERLASVRPAPSALEKAPAKKEAYRWKATN---PVEVKKEPVATPKALKKALEHE 507
                          170
                   ....*....|....*...
gi 2022791524  414 KAPVPQPGLTKPPGQQPE 431
Cdd:PRK07994   508 KTPELAAKLAAEAIERDP 525
PRK12323 PRK12323
DNA polymerase III subunit gamma/tau;
279-458 7.95e-06

DNA polymerase III subunit gamma/tau;


Pssm-ID: 237057 [Multi-domain]  Cd Length: 700  Bit Score: 52.19  E-value: 7.95e-06
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2022791524  279 QQSEPTKPVQQQTSAKPSAGPAKPLPQQPDSARTSSQAPPPAKPssqqlgPVKQPSQQPARQGGPVKPSSQQAGPPKQ-P 357
Cdd:PRK12323   367 QSGGGAGPATAAAAPVAQPAPAAAAPAAAAPAPAAPPAAPAAAP------AAAAAARAVAAAPARRSPAPEALAAARQaS 440
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2022791524  358 SQQPGPEKPSAQQMGPAKQASQPGPGKPPLQQTGPVKQAPPQSGPTKPPPQTAGATKAPVPQPGLTKPPGQQPEPEKPSQ 437
Cdd:PRK12323   441 ARGPGGAPAPAPAPAAAPAAAARPAAAGPRPVAAAAAAAPARAAPAAAPAPADDDPPPWEELPPEFASPAPAQPDAAPAG 520
                          170       180
                   ....*....|....*....|.
gi 2022791524  438 QKQASATQPAESAPKKTFCPL 458
Cdd:PRK12323   521 WVAESIPDPATADPDDAFETL 541
PDZ2_Par3-like cd23058
PDZ domain 2 of partitioning defective 3 (Par3), and related domains; PDZ (PSD-95 ...
4544-4604 9.44e-06

PDZ domain 2 of partitioning defective 3 (Par3), and related domains; PDZ (PSD-95 (Postsynaptic density protein 95), Dlg (Discs large protein), and ZO-1 (Zonula occludens-1)) domain 2 of Par3 (or PAR3 or Par-3, also known as Atypical PKC isotype-specific-interacting protein, ASIP, Drosophila Bazooka) and related domains. Par3 is a scaffold protein involved in organizing cell polarity across animals. Par3 binds numerous molecules both for its recruitment to one pole of the cell and for downstream contributions to polarized cell function. It regulates cell polarity by targeting the Par complex proteins Par6 and atypical protein kinase C (aPKC) to specific cortical sites. Physical interactions between Par3 and the Par complex include Par3 PDZ domain 1 binding to the Par6 PDZ domain, Par3 PDZ domain 1 and PDZ domain 3 binding the Par6's PDZ-binding motif, and an interaction with an undefined region of aPKC that requires both Par3 PDZ2 and PDZ3. The PDZ domains of Par3 have also been implicated as potential phosphoinositide signaling integrators, since its second PDZ domain binds to phosphoinositides, and the third PDZ interacts with phosphoinositide phosphatase PTEN. PDZ domains usually bind in a sequence-specific manner to short peptide sequences located at the C-terminal end of their partner proteins (known as PDZ binding motifs). The PDZ superfamily includes canonical PDZ domains as well as those with circular permutations and domain swapping mediated by beta-strands. This Par3 family PDZ2 domain is a canonical PDZ domain containing six beta-strands A-F and two alpha-helices (alpha-helix 1 and 2), arranged in the order: beta-strands A, B, C, alpha-helix 1, beta-strands D, E, alpha-helix 2 and beta-strand F.


Pssm-ID: 467271 [Multi-domain]  Cd Length: 93  Bit Score: 46.87  E-value: 9.44e-06
                           10        20        30        40        50        60
                   ....*....|....*....|....*....|....*....|....*....|....*....|.
gi 2022791524 4544 NGLGIRVVGGKEIPGSSGETgaYIAKILPGGNAEQTGKLIEGMQVLEWNGIPLTGKTYEEV 4604
Cdd:cd23058     15 EGLGFSITSRDNPTGGSGPI--YIKNILPKGAAIQDGRLKAGDRLLEVNGVDVTGKTQEEV 73
PDZ1_ZO1-like cd06727
PDZ domain 1 of Zonula Occludens-1 (ZO-1), homologs ZO-2 and ZO-3, and related domains; PDZ ...
4539-4617 1.00e-05

PDZ domain 1 of Zonula Occludens-1 (ZO-1), homologs ZO-2 and ZO-3, and related domains; PDZ (PSD-95 (Postsynaptic density protein 95), Dlg (Discs large protein), and ZO-1 (Zonula occludens-1)) domain 1 of ZO-1, -2, -3 and related domains. Zonula occludens proteins (ZO-1, ZO-2, ZO-3) are multi-PDZ domain proteins involved in the maintenance and biogenesis of multi-protein networks at the cytoplasmic surface of intercellular contacts in epithelial and endothelial cells. They have three N-terminal PDZ domains, PDZ1-3, followed by a Src homology-3 (SH3) domain and a guanylate kinase (GuK)-like domain. Among protein-protein interactions for all ZO proteins is the binding of the first PDZ domain (PDZ1) to the C-termini of claudins, and the homo- and hetero-dimerization of ZO-proteins via their second PDZ domain (PDZ2), which takes place by symmetrical domain swapping of the first two beta-strands of PDZ2. At the cell level, ZO-1 and ZO-2 are involved in polarity maintenance, gene transcription, cell proliferation, and tumor cell metastasis. PDZ domains usually bind in a sequence-specific manner to short peptide sequences located at the C-terminal end of their partner proteins (known as PDZ binding motifs). The PDZ superfamily includes canonical PDZ domains as well as those with circular permutations and domain swapping mediated by beta-strands. This ZO family PDZ1 domain is a canonical PDZ domain containing six beta-strands A-F and two alpha-helices (alpha-helix 1 and 2), arranged in the order: beta-strands A, B, C, alpha-helix 1, beta-strands D, E, alpha-helix 2 and beta-strand F.


Pssm-ID: 467209 [Multi-domain]  Cd Length: 87  Bit Score: 46.50  E-value: 1.00e-05
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2022791524 4539 HTV-----SGNGLGIRVVGGKEIPG-SSGETGAYIAKILPGGNAEqtGKLIEGMQVLEWNGIPLTGKTYEE-VQniIIQQ 4611
Cdd:cd06727      1 HTVtlhraPGFGFGIAVSGGRDNPHfQSGDTSIVISDVLKGGPAE--GKLQENDRVVSVNGVSMENVEHSFaVQ--ILRK 76

                   ....*..
gi 2022791524 4612 SG-EAEI 4617
Cdd:cd06727     77 CGkTANI 83
Med15 pfam09606
ARC105 or Med15 subunit of Mediator complex non-fungal; The approx. 70 residue Med15 domain of ...
191-689 1.07e-05

ARC105 or Med15 subunit of Mediator complex non-fungal; The approx. 70 residue Med15 domain of the ARC-Mediator co-activator is a three-helix bundle with marked similarity to the KIX domain. The sterol regulatory element binding protein (SREBP) family of transcription activators use the ARC105 subunit to activate target genes in the regulation of cholesterol and fatty acid homeostasis. In addition, Med15 is a critical transducer of gene activation signals that control early metazoan development.


Pssm-ID: 312941 [Multi-domain]  Cd Length: 732  Bit Score: 51.93  E-value: 1.07e-05
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2022791524  191 IQKEQGKPEEQRGSAKLPAQQQSPKPVQPQGHVKPTPQQSESSKAVPQQQQP---GEPKQIQKPGPgHPADSKLEQVKQP 267
Cdd:pfam09606  123 LLASLGRPQMPMGGAGFPSQMSRVGRMQPGGQAGGMMQPSSGQPGSGTPNQMgpnGGPGQGQAGGM-NGGQQGPMGGQMP 201
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2022791524  268 PQPRGPQKSQPQQSEPTKPVQQQTSAKPSAGPAKPLPQQpdsaRTSSQAPPPAKPSSQQLGPVKQPSQQ-------PARQ 340
Cdd:pfam09606  202 PQMGVPGMPGPADAGAQMGQQAQANGGMNPQQMGGAPNQ----VAMQQQQPQQQGQQSQLGMGINQMQQmpqgvggGAGQ 277
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2022791524  341 GGPVKP----SSQQAGPPKQPSQQPGPEKPSAQQMGPAKQASQPGPGKPPLQQTGPVKQAppqSGPTKPPPQTAGATKAP 416
Cdd:pfam09606  278 GGPGQPmgppGQQPGAMPNVMSIGDQNNYQQQQTRQQQQQQGGNHPAAHQQQMNQSVGQG---GQVVALGGLNHLETWNP 354
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2022791524  417 VPQPGLTKPPGQQPEPEKPSQQKQASATQPAESAPKKTfcplctttelLLHTPekanyntctQCHTVVCSLCGFNPNPHI 496
Cdd:pfam09606  355 GNFGGLGANPMQRGQPGMMSSPSPVPGQQVRQVTPNQF----------MRQSP---------QPSVPSPQGPGSQPPQSH 415
                          330       340       350       360       370       380       390       400
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2022791524  497 TEikewlclncqmqralGGDLAPGHGPGPQlpppkqktptppstakpppqPQPGQKKDAS----------PKPDPAQLAE 566
Cdd:pfam09606  416 PG---------------GMIPSPALIPSPS--------------------PQMSQQPAQQrtigqdspggSLNTPGQSAV 460
                          410       420       430       440       450       460       470       480
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2022791524  567 PKKPPPQKKQPSMpgsPPVKAKTSRAEPAD---TSQQIDPAPKSDRAKPSQAEDKQKQPSIQKPAADTVLTPAApgLKQD 643
Cdd:pfam09606  461 NSPLNPQEEQLYR---EKYRQLTKYIEPLKrmiAKMENDPGDIDKMNKMKRLLEILSNPSSRIPLETLQKCEAA--LENQ 535
                          490       500       510       520       530
                   ....*....|....*....|....*....|....*....|....*....|....*
gi 2022791524  644 SAGPRPPPTQQKVTDSPKPELAKPS---------QDTHPAGEKPDSKPLPQVSRQ 689
Cdd:pfam09606  536 MGTPREPPLNNPLLEAVLSNLQSPTgnhtlyrtfRPTLEALFGPDIKGLPIPAKQ 590
PRK10263 PRK10263
DNA translocase FtsK; Provisional
200-306 1.11e-05

DNA translocase FtsK; Provisional


Pssm-ID: 236669 [Multi-domain]  Cd Length: 1355  Bit Score: 52.01  E-value: 1.11e-05
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2022791524  200 EQRGSAKLPAQQQSPKPVQPqghVKPTPQQSESSKAVPQQQQPGEPKQIQKPGPghpadskleqvkQPPQPRGPQKSQPQ 279
Cdd:PRK10263   752 VQQPQQPVAPQQQYQQPQQP---VAPQPQYQQPQQPVAPQPQYQQPQQPVAPQP------------QYQQPQQPVAPQPQ 816
                           90       100
                   ....*....|....*....|....*..
gi 2022791524  280 QSEPTKPVQQQTSAKPSAGPAKPLPQQ 306
Cdd:PRK10263   817 YQQPQQPVAPQPQYQQPQQPVAPQPQD 843
PRK14971 PRK14971
DNA polymerase III subunit gamma/tau;
268-402 1.18e-05

DNA polymerase III subunit gamma/tau;


Pssm-ID: 237874 [Multi-domain]  Cd Length: 614  Bit Score: 51.70  E-value: 1.18e-05
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2022791524  268 PQPRGPQKS-QPQQSEPTKPVQQQTSAKPSagpakplpQQPDSARTSSQAPPPAKPSSQQLGPVkqpsQQPARQGGPVKP 346
Cdd:PRK14971   370 SGGRGPKQHiKPVFTQPAAAPQPSAAAAAS--------PSPSQSSAAAQPSAPQSATQPAGTPP----TVSVDPPAAVPV 437
                           90       100       110       120       130
                   ....*....|....*....|....*....|....*....|....*....|....*..
gi 2022791524  347 SSQQAGPPKQPSQQPGPEKPSAQQMGPAKQASQPGPGKPPLQQ-TGPVKQAPPQSGP 402
Cdd:PRK14971   438 NPPSTAPQAVRPAQFKEEKKIPVSKVSSLGPSTLRPIQEKAEQaTGNIKEAPTGTQK 494
PTZ00449 PTZ00449
104 kDa microneme/rhoptry antigen; Provisional
180-405 1.31e-05

104 kDa microneme/rhoptry antigen; Provisional


Pssm-ID: 185628 [Multi-domain]  Cd Length: 943  Bit Score: 51.61  E-value: 1.31e-05
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2022791524  180 AHEEAGRKQKVIQKEQGKPEEQRgSAKLPAQQQSPK-PVQPQGHVK-PTPQQSESSKAVPQQQQPGEPKQIQKP-GPGHP 256
Cdd:PTZ00449   564 AKEHKPSKIPTLSKKPEFPKDPK-HPKDPEEPKKPKrPRSAQRPTRpKSPKLPELLDIPKSPKRPESPKSPKRPpPPQRP 642
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2022791524  257 AD-SKLEQVKQPPQPRGPQKSQP------------------QQSEPTKPVQQQTSAKPSAgPAKPLPQQPDSARTSSQAP 317
Cdd:PTZ00449   643 SSpERPEGPKIIKSPKPPKSPKPpfdpkfkekfyddyldaaAKSKETKTTVVLDESFESI-LKETLPETPGTPFTTPRPL 721
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2022791524  318 PPAKPSSQQL--GPVKQP-SQQPARQGGPVKPSSQQAGPPKQPSQQPGPEKPSAQQMGPAKQASQPGPGKPPLQQTGPVK 394
Cdd:PTZ00449   722 PPKLPRDEEFpfEPIGDPdAEQPDDIEFFTPPEEERTFFHETPADTPLPDILAEEFKEEDIHAETGEPDEAMKRPDSPSE 801
                          250
                   ....*....|...
gi 2022791524  395 QAPPQSG--PTKP 405
Cdd:PTZ00449   802 HEDKPPGdhPSLP 814
Herpes_TAF50 pfam03326
Herpesvirus transcription activation factor (transactivator); This family includes EBV BRLF1 ...
199-451 1.43e-05

Herpesvirus transcription activation factor (transactivator); This family includes EBV BRLF1 and similar ORF 50 proteins from other herpesviruses.


Pssm-ID: 308764 [Multi-domain]  Cd Length: 568  Bit Score: 51.24  E-value: 1.43e-05
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2022791524  199 EEQRGSAKLPAQQQSPKPVQPQGHVKPTPQQSESS--KAVPQ---QQQPGEPKQIQKPG--------------------- 252
Cdd:pfam03326  218 DELESLIRDPEPRYQPSPTMPQQALKPGAQSSDTSgvSDTEQsarQTATAEPQRLQAPGfsrstsrpargqtaqyflaaq 297
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2022791524  253 --------PGHPADSKLEQVKQPPQ----------------PRGPQKSQPQQSEPTKP----VQQQTSAKPSA-GPAKPL 303
Cdd:pfam03326  298 qhgvvslfPSTATLVPIAGSTGVTEvvsyghnstspsstpcPSTAVTEADHQTEPEVPwiatAHQESDQRPIGpGPEKPT 377
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2022791524  304 PQQPDSARTSSQAPPPAKPSSQQLGPVKQPSQQPARQGgpvkpsSQQAGPPKQPSQQPgPEKPSAQQMGPAKQASQP-GP 382
Cdd:pfam03326  378 FLPPVGGKQFFQGLRDSRSTSFLTAPEATSAISDVFQG------TEVCQPKRIRALHP-PGSPSANRPLPSSLAPTPtGP 450
                          250       260       270       280       290       300
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....
gi 2022791524  383 GKPPLQQTGPVkqappqsgptkPPPQTAGATKAPVPQPGLTKPPGQQPEPEkPSQQKQASATQPAESAP 451
Cdd:pfam03326  451 VHEPGSSLTPA-----------TVPQPLDAAPVATPEASHELQPPDEETPQ-PLDEDQALCGQQDASHP 507
PRK10263 PRK10263
DNA translocase FtsK; Provisional
218-380 1.79e-05

DNA translocase FtsK; Provisional


Pssm-ID: 236669 [Multi-domain]  Cd Length: 1355  Bit Score: 51.24  E-value: 1.79e-05
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2022791524  218 QPQGHVKPTPQQSESSKAVPQQQQPGEPKQIQKPGPghpadskleqvkQPPQPRGPQKSQPQQSEPTKPVQQQtsakpsa 297
Cdd:PRK10263   754 QPQQPVAPQQQYQQPQQPVAPQPQYQQPQQPVAPQP------------QYQQPQQPVAPQPQYQQPQQPVAPQ------- 814
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2022791524  298 gpakPLPQQPdsartssQAPPPAKPSSQQlgPVKQPSQQParQGGPVKPSSQQAGPPKqPSQQPGPEKPSAQQMGPAKQA 377
Cdd:PRK10263   815 ----PQYQQP-------QQPVAPQPQYQQ--PQQPVAPQP--QDTLLHPLLMRNGDSR-PLHKPTTPLPSLDLLTPPPSE 878

                   ...
gi 2022791524  378 SQP 380
Cdd:PRK10263   879 VEP 881
PRK10819 PRK10819
transport protein TonB; Provisional
208-328 1.95e-05

transport protein TonB; Provisional


Pssm-ID: 236768 [Multi-domain]  Cd Length: 246  Bit Score: 49.30  E-value: 1.95e-05
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2022791524  208 PAQQQSPKPVQ--PQGHVKPTPQ---QSESSKAVPQQQQPGEPKQIQKPGPG-HPADSKLEQVKQPPQPRGPQksqPQQS 281
Cdd:PRK10819    55 PADLEPPQAVQppPEPVVEPEPEpepIPEPPKEAPVVIPKPEPKPKPKPKPKpKPVKKVEEQPKREVKPVEPR---PASP 131
                           90       100       110       120
                   ....*....|....*....|....*....|....*....|....*..
gi 2022791524  282 EPTKPVQQQTSAKPSAGPAKPLPQQPDSARTSSQAPPPAKPSSQQLG 328
Cdd:PRK10819   132 FENTAPARPTSSTATAAASKPVTSVSSGPRALSRNQPQYPARAQALR 178
PDZ_MPP5-like cd06798
PDZ domain of membrane palmitoylated protein 5 (MPP5), Drosophila Stardust, and related ...
4567-4613 1.95e-05

PDZ domain of membrane palmitoylated protein 5 (MPP5), Drosophila Stardust, and related domains; PDZ (PSD-95 (Postsynaptic density protein 95), Dlg (Discs large protein), and ZO-1 (Zonula occludens-1)) domain of MPP5, Drosophila Stardust, and related domains. MPP5 (also known as MAGUK p55 subfamily member 1, protein associated with Lin-7 1 or PALS1) and Drosophila Stardust are membrane-associated guanylate kinase (MAGUK)-like proteins that serve as signaling and scaffolding proteins, linking different proteins critical to the formation and maintenance of tight junctions (TJ) and apical-basal polarity. Apical-basal polarity determinants cluster in complexes; in particular, the Crumbs complex (Crb, MPP5, and PATJ) and the PAR/aPKC-complex (PAR-3, PAR-6, aPKC) determine the apical plasma membrane domain. Within the Crumbs complex, Crb is stabilized in the plasma membrane by MPP5, which in turn recruits PATJ and Lin-7 to the complex. MPP5 also links the Crumbs complex with the PAR/aPKC-complex. The Drosophila homolog of the Crumbs complex is the (CRB)-Stardust (Sdt)-Discs Lost (Dlt) complex. MPP5 also acts as an interaction partner for SARS-CoV envelope protein E, which results in delayed formation of TJs and dysregulation of cell polarity. PDZ domains usually bind in a sequence-specific manner to short peptide sequences located at the C-terminal end of their partner proteins (known as PDZ binding motifs). The PDZ superfamily includes canonical PDZ domains as well as those with circular permutations and domain swapping mediated by beta-strands. This MPP5-like family domain is a canonical PDZ domain containing six beta-strands A-F and two alpha-helices (alpha-helix 1 and 2), arranged in the order: beta-strands A, B, C, alpha-helix 1, beta-strands D, E, alpha-helix 2 and beta-strand F.


Pssm-ID: 467259 [Multi-domain]  Cd Length: 79  Bit Score: 45.41  E-value: 1.95e-05
                           10        20        30        40
                   ....*....|....*....|....*....|....*....|....*..
gi 2022791524 4567 IAKILPGGNAEQTGKLIEGMQVLEWNGIPLTGKTYEEVQNIIIQQSG 4613
Cdd:cd06798     25 ISRIVKGGAAEKSGLLHEGDEILEINGIEIRGKDVNEVCDLLADMHG 71
PDZ4_MUPP1-like cd06668
PDZ domain 4 of multi-PDZ-domain protein 1 (MUPP1) and PATJ (protein-associated tight junction) ...
4543-4608 2.06e-05

PDZ domain 4 of multi-PDZ-domain protein 1 (MUPP1) and PATJ (protein-associated tight junction) and related domains; PDZ (PSD-95 (Postsynaptic density protein 95), Dlg (Discs large protein), and ZO-1 (Zonula occludens-1)) domain 4 of MUPP1 and PATJ, and related domains. MUPP1 and PATJ serve as scaffolding proteins linking different proteins and protein complexes involved in the organization of tight junctions and epithelial polarity. MUPP1 contains an L27 (Lin-2 and Lin-7 binding) domain and 13 PDZ domains. PATJ (also known as INAD-like) contains an L27 domain and ten PDZ domains. MUPP1 and PATJ share several binding partners, including junctional adhesion molecules (JAM), zonula occludens (ZO)-3, Pals1 (protein associated with Lin-7), Par (partitioning defective)-6 proteins, and nectins (adherence junction adhesion molecules). PATJ lacks 3 PDZ domains seen in MUPP1: PDZ6, 9, and 13; consequently, MUPP1 PDZ7 and 8 align with PATJ PDZ6 and 7; and MUPP1 PDZ domains 10-12 align with PATJ PDZ domains 8-10. PDZ domains usually bind in a sequence-specific manner to short peptide sequences located at the C-terminal end of their partner proteins (known as PDZ binding motifs). The PDZ superfamily includes canonical PDZ domains as well as those with circular permutations and domain swapping mediated by beta-strands. This MUPP1-like family PDZ4 domain is a canonical PDZ domain containing six beta-strands A-F and two alpha-helices (alpha-helix 1 and 2), arranged in the order: beta-strands A, B, C, alpha-helix 1, beta-strands D, E, alpha-helix 2 and beta-strand F


Pssm-ID: 467156 [Multi-domain]  Cd Length: 88  Bit Score: 45.75  E-value: 2.06e-05
                           10        20        30        40        50        60
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*.
gi 2022791524 4543 GNGLGIRVVGGKEIpGSSGEtgAYIAKILPGGNAEQTGKLIEGMQVLEWNGIPLTGKTYEEVQNII 4608
Cdd:cd06668     13 SSGLGISLEGTVDV-EVRGH--HYIRSILPEGPVGRNGKLFSGDELLEVNGIQLLGLSHKEVVSIL 75
Atrophin-1 pfam03154
Atrophin-1 family; Atrophin-1 is the protein product of the dentatorubral-pallidoluysian ...
197-382 2.09e-05

Atrophin-1 family; Atrophin-1 is the protein product of the dentatorubral-pallidoluysian atrophy (DRPLA) gene. DRPLA OMIM:125370 is a progressive neurodegenerative disorder. It is caused by the expansion of a CAG repeat in the DRPLA gene on chromosome 12p. This results in an extended polyglutamine region in atrophin-1, that is thought to confer toxicity to the protein, possibly through altering its interactions with other proteins. The expansion of a CAG repeat is also the underlying defect in six other neurodegenerative disorders, including Huntington's disease. One interaction of expanded polyglutamine repeats that is thought to be pathogenic is that with the short glutamine repeat in the transcriptional coactivator CREB binding protein, CBP. This interaction draws CBP away from its usual nuclear location to the expanded polyglutamine repeat protein aggregates that are characteriztic of the polyglutamine neurodegenerative disorders. This interferes with CBP-mediated transcription and causes cytotoxicity.


Pssm-ID: 460830 [Multi-domain]  Cd Length: 991  Bit Score: 50.92  E-value: 2.09e-05
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2022791524  197 KPEEQRGSAKLPAQQQSPKPVQPQGhvkPTPQQSESSKAVPQQQQPGEPKQIQKPGPGHPADSKL---EQVKQPPQPRGP 273
Cdd:pfam03154  356 KPPPTTPIPQLPNPQSHKHPPHLSG---PSPFQMNSNLPPPPALKPLSSLSTHHPPSAHPPPLQLmpqSQQLPPPPAQPP 432
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2022791524  274 QKSQPQQSEPTKPVQQQTSAKPSAGPAKPLPQQPDSARTSSQAPPPAKP---SSQQLGPVKQPSQQPARQGGPVKPSSQQ 350
Cdd:pfam03154  433 VLTQSQSLPPPAASHPPTSGLHQVPSQSPFPQHPFVPGGPPPITPPSGPptsTSSAMPGIQPPSSASVSSSGPVPAAVSC 512
                          170       180       190
                   ....*....|....*....|....*....|..
gi 2022791524  351 AGPPKQPSQQPGPEKPSAQQMGPAKQASQPGP 382
Cdd:pfam03154  513 PLPPVQIKEEALDEAEEPESPPPPPRSPSPEP 544
PRK07994 PRK07994
DNA polymerase III subunits gamma and tau; Validated
299-452 2.09e-05

DNA polymerase III subunits gamma and tau; Validated


Pssm-ID: 236138 [Multi-domain]  Cd Length: 647  Bit Score: 51.02  E-value: 2.09e-05
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2022791524  299 PAKPLPQqPDSARTSSQAPPPAKPSSQQLGPVKQPSQQPArqggpvkPSSQQAGPPKQPSQQPGPEKPSAQQMGPAKQAS 378
Cdd:PRK07994   361 PAAPLPE-PEVPPQSAAPAASAQATAAPTAAVAPPQAPAV-------PPPPASAPQQAPAVPLPETTSQLLAARQQLQRA 432
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2022791524  379 QpgpgkpplQQTGPVKQAPPQSGPTKP--PPQTAGATKAPVPQPGLTKPP--------GQQPEPEK------PSQQKQAS 442
Cdd:PRK07994   433 Q--------GATKAKKSEPAAASRARPvnSALERLASVRPAPSALEKAPAkkeayrwkATNPVEVKkepvatPKALKKAL 504
                          170
                   ....*....|.
gi 2022791524  443 ATQPA-ESAPK 452
Cdd:PRK07994   505 EHEKTpELAAK 515
PDZ4_PDZD2-PDZ2_hPro-IL-16-like cd06760
PDZ domain 4 of PDZ domain containing 2 (PDZD2), PDZ domain 2 of human pro-interleukin-16 ...
4539-4617 2.21e-05

PDZ domain 4 of PDZ domain containing 2 (PDZD2), PDZ domain 2 of human pro-interleukin-16 (isoform 1, 1332 AA), and related domains; PDZ (PSD-95 (Postsynaptic density protein 95), Dlg (Discs large protein), and ZO-1 (Zonula occludens-1)) domain 4 of PDZD2, also known as KIAA0300, PIN-1, activated in prostate cancer (AIPC) and PDZ domain-containing protein 3 (PDZK3). PDZD2 has seven PDZ domains. PDZD2 is expressed at exceptionally high levels in the pancreas and certain cancer tissues, such as prostate cancer. It promotes the proliferation of insulinoma cells and is upregulated during prostate tumorigenesis. In osteosarcoma (OS), the microRNA miR-363 acts as a tumor suppressor by inhibiting PDZD2. This family also includes the second PDZ domain (PDZ2) of human pro-interleukin-16 (isoform 1, also known as nPro-Il-16; 1332 amino-acid protein). Precursor IL-16 is cleaved to produce pro-IL-16 and mature IL-16 (derived from the C-terminal 121 AA). Pro-IL-16 functions as a regulator of T cell growth; mature IL-16 is a CD4 ligand that induces chemotaxis and CD25 expression in CD4+ T cells. IL-16 bioactivity has been closely associated with the progression of several different cancers PDZ domains usually bind in a sequence-specific manner to short peptide sequences located at the C-terminal end of their partner proteins (known as PDZ binding motifs). The PDZ superfamily includes canonical PDZ domains as well as those with circular permutations and domain swapping mediated by beta-strands. This PDZD2-like family PDZ4 domain is a canonical PDZ domain containing six beta-strands A-F and two alpha-helices (alpha-helix 1 and 2), arranged in the order: beta-strands A, B, C, alpha-helix 1, beta-strands D, E, alpha-helix 2 and beta-strand F.


Pssm-ID: 467241 [Multi-domain]  Cd Length: 90  Bit Score: 45.72  E-value: 2.21e-05
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2022791524 4539 HTVSGNGLGIRVVGgkeIPGSSGETGAYIAKILPGGNAEQTGKLIEGMQVLEWNGIPLTGKTYEEVQNIIIQ-QSGEAEI 4617
Cdd:cd06760     10 NKEPGVGLGIGLCC---LPLENDIPGIFIHHLSPGSVAHMDGRLRRGDQILEINGTSLRNVTLNEAYAILSQcKPGPVTL 86
PDZ7_PDZD2-PDZ4_hPro-IL-16-like cd06763
PDZ domain 7 of PDZ domain containing 2 (PDZD2), PDZ domain 4 of human pro-interleukin-16 ...
4545-4608 2.24e-05

PDZ domain 7 of PDZ domain containing 2 (PDZD2), PDZ domain 4 of human pro-interleukin-16 (isoform 1, 1332 AA), and related domains; PDZ (PSD-95 (Postsynaptic density protein 95), Dlg (Discs large protein), and ZO-1 (Zonula occludens-1)) domain 7 of PDZD2, also known as KIAA0300, PIN-1, PAPIN, activated in prostate cancer (AIPC) and PDZ domain-containing protein 3 (PDZK3). PDZD2 has seven PDZ domains. PDZD2 is expressed at exceptionally high levels in the pancreas and certain cancer tissues, such as prostate cancer. It promotes the proliferation of insulinoma cells and is upregulated during prostate tumorigenesis. In osteosarcoma (OS), the microRNA miR-363 acts as a tumor suppressor by inhibiting PDZD2. This family include the PDZ domain of the secreted mature form of human interleukin-16 (IL-16); this is the fourth PDZ domain (PDZ4) of human pro-interleukin-16 (isoform 1, also known as nPro-Il-16). Precursor IL-16 is cleaved to produce pro-IL-16 and C-terminal mature IL-16. Pro-IL-16 functions as a regulator of T cell growth; mature IL-16 is a CD4 ligand that induces chemotaxis and CD25 expression in CD4+ T cells. IL-16 bioactivity has been closely associated with the progression of several different cancers PDZ domains usually bind in a sequence-specific manner to short peptide sequences located at the C-terminal end of their partner proteins (known as PDZ binding motifs). The PDZ superfamily includes canonical PDZ domains as well as those with circular permutations and domain swapping mediated by beta-strands. This PDZD2-like family PDZ7 domain is a canonical PDZ domain containing six beta-strands A-F and two alpha-helices (alpha-helix 1 and 2), arranged in the order: beta-strands A, B, C, alpha-helix 1, beta-strands D, E, alpha-helix 2 and beta-strand F.


Pssm-ID: 467244 [Multi-domain]  Cd Length: 86  Bit Score: 45.68  E-value: 2.24e-05
                           10        20        30        40        50        60
                   ....*....|....*....|....*....|....*....|....*....|....*....|....
gi 2022791524 4545 GLGIRVVGGKEIPgsSGETGAYIAKILPGGNAEQTGKLIEGMQVLEWNGIPLTGKTYEEVQNII 4608
Cdd:cd06763     12 GLGFSLEGGKGSP--LGDRPLTIKRIFKGGAAEQSGVLQVGDEILQINGTSLQGLTRFEAWNII 73
PDZ_syntrophin-like cd06801
PDZ domain of syntrophins, and related domains; PDZ (PSD-95 (Postsynaptic density protein 95), ...
4531-4604 2.39e-05

PDZ domain of syntrophins, and related domains; PDZ (PSD-95 (Postsynaptic density protein 95), Dlg (Discs large protein), and ZO-1 (Zonula occludens-1)) domain of syntrophins (including alpha-1-syntrophin, beta-1-syntrophin, beta-2-syntrophin, gamma-1-syntrophin, and gamma-2-syntrophin), and related domains. Syntrophins play a role in recruiting various signaling molecules into signaling complexes and help provide appropriate spatiotemporal regulation of signaling pathways. They function in cytoskeletal organization and maintenance; as components of the dystrophin-glycoprotein complex (DGC), they help maintain structural integrity of skeletal muscle fibers. They link voltage-gated sodium channels to the actin cytoskeleton and the extracellular matrix, and control the localization and activity of the actin reorganizing proteins such as PI3K, PI(3,4)P2 and TAPP1. Through association with various cytoskeletal proteins within the cells, they are involved in processes such as regulation of focal adhesions, myogenesis, calcium homeostasis, and cell migration. They also have roles in synapse formation and in the organization of utrophin, acetylcholine receptor, and acetylcholinesterase at the neuromuscular synapse. PDZ domains usually bind in a sequence-specific manner to short peptide sequences located at the C-terminal end of their partner proteins (known as PDZ binding motifs). The PDZ superfamily includes canonical PDZ domains as well as those with circular permutations and domain swapping mediated by beta-strands. This syntrophin-like family domain is a canonical PDZ domain containing six beta-strands A-F and two alpha-helices (alpha-helix 1 and 2), arranged in the order: beta-strands A, B, C, alpha-helix 1, beta-strands D, E, alpha-helix 2 and beta-strand F.


Pssm-ID: 467262 [Multi-domain]  Cd Length: 83  Bit Score: 45.64  E-value: 2.39e-05
                           10        20        30        40        50        60        70
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*.
gi 2022791524 4531 KLLRDPkdhtvsGNGLGIRVVGGKE--IPgssgetgAYIAKILPGGNAEQTGKLIEGMQVLEWNGIPLTGKTYEEV 4604
Cdd:cd06801      4 RVVKQD------VGGLGISIKGGAEhkMP-------ILISKIFKGQAADQTGQLFVGDAILSVNGENLEDATHDEA 66
PHA03378 PHA03378
EBNA-3B; Provisional
198-406 2.53e-05

EBNA-3B; Provisional


Pssm-ID: 223065 [Multi-domain]  Cd Length: 991  Bit Score: 50.84  E-value: 2.53e-05
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2022791524  198 PEEQRGSAKLPAQQQSPKPVQPQGHVKPTPQQSESSKAVPQQQQPGEPKQIQKPGPGhPADSKLeQVKQPPQPRGPQKSQ 277
Cdd:PHA03378   747 PAAAPGRARPPAAAPGRARPPAAAPGAPTPQPPPQAPPAPQQRPRGAPTPQPPPQAG-PTSMQL-MPRAAPGQQGPTKQI 824
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2022791524  278 PQqseptkpvQQQTSAKPSAGPAKPLPQQPDSARTSSQAPPPAKPSSQQL-------GPVKQPSQQPaRQGGPVKPSSQQ 350
Cdd:PHA03378   825 LR--------QLLTGGVKRGRPSLKKPAALERQAAAGPTPSPGSGTSDKIvqapvfyPPVLQPIQVM-RQLGSVRAAAAS 895
                          170       180       190       200       210
                   ....*....|....*....|....*....|....*....|....*....|....*.
gi 2022791524  351 AGpPKQPSQQPGPEKpsaqQMGPAKQASQPGPGKPPLQQTgpVKQAPPQSGPTKPP 406
Cdd:PHA03378   896 TV-TQAPTEYTGERR----GVGPMHPTDIPPSKRAKTDAY--VESQPPHGGQSHSF 944
PRK07994 PRK07994
DNA polymerase III subunits gamma and tau; Validated
315-453 2.55e-05

DNA polymerase III subunits gamma and tau; Validated


Pssm-ID: 236138 [Multi-domain]  Cd Length: 647  Bit Score: 50.63  E-value: 2.55e-05
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2022791524  315 QAPPPAKPSSQQLGPVKQPSQQPARQGGPVKPSSQQAgPPKQPSQQPGPEkPSAQQMGPAKQASQPGPGKPPLQ-QTGPV 393
Cdd:PRK07994   362 AAPLPEPEVPPQSAAPAASAQATAAPTAAVAPPQAPA-VPPPPASAPQQA-PAVPLPETTSQLLAARQQLQRAQgATKAK 439
                           90       100       110       120       130       140
                   ....*....|....*....|....*....|....*....|....*....|....*....|..
gi 2022791524  394 KQAPPQSGPTKP--PPQTAGATKAPVPQPGLTKPPGQQPEPEKPSQqkQASATQPAESAPKK 453
Cdd:PRK07994   440 KSEPAAASRARPvnSALERLASVRPAPSALEKAPAKKEAYRWKATN--PVEVKKEPVATPKA 499
PRK14951 PRK14951
DNA polymerase III subunits gamma and tau; Provisional
294-448 2.58e-05

DNA polymerase III subunits gamma and tau; Provisional


Pssm-ID: 237865 [Multi-domain]  Cd Length: 618  Bit Score: 50.48  E-value: 2.58e-05
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2022791524  294 KPSAGPAKPLPqqpdsARTSSQAPPPAKPSSqqlgPVKQPSQQPARQGGPVKPSSQQAGPPKQPSQQPGPEKPSAQQMGP 373
Cdd:PRK14951   365 KPAAAAEAAAP-----AEKKTPARPEAAAPA----AAPVAQAAAAPAPAAAPAAAASAPAAPPAAAPPAPVAAPAAAAPA 435
                           90       100       110       120       130       140       150
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*
gi 2022791524  374 AKQASQPGPGKPPlqqtgPVKQAPPQSGPTKPPPQTagatkapvpQPGltkpPGQQPEPEKPSQQKQASATQPAE 448
Cdd:PRK14951   436 AAPAAAPAAVALA-----PAPPAQAAPETVAIPVRV---------APE----PAVASAAPAPAAAPAAARLTPTE 492
PHA03247 PHA03247
large tegument protein UL36; Provisional
555-1047 2.73e-05

large tegument protein UL36; Provisional


Pssm-ID: 223021 [Multi-domain]  Cd Length: 3151  Bit Score: 51.09  E-value: 2.73e-05
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2022791524  555 ASPKPDPAQLAEPKKPPPQKKQPSMPGS---PPVKAKTSRAEPADTSQQIDPAPKSDRAKPSQAEDKQKQPSIQKPaadt 631
Cdd:PHA03247  2500 GGGPPDPDAPPAPSRLAPAILPDEPVGEpvhPRMLTWIRGLEELASDDAGDPPPPLPPAAPPAAPDRSVPPPRPAP---- 2575
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2022791524  632 vlTPAAPGLKQDSAGPRPPPtQQKVTDSPKPELAKPSQDTHPAGEKPDSKPLPqvsrqksdpklsAQPGARSEAKTQKPv 711
Cdd:PHA03247  2576 --RPSEPAVTSRARRPDAPP-QSARPRAPVDDRGDPRGPAPPSPLPPDTHAPD------------PPPPSPSPAANEPD- 2639
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2022791524  712 epaqvkdDPKKLQTKPSPKPETKPAPKGLQPgagPKPVPAQPAPQPQQPQKTPEQPRRFSLNLGGITDT--PKPQPTTPQ 789
Cdd:PHA03247  2640 -------PHPPPTVPPPERPRDDPAPGRVSR---PRRARRLGRAAQASSPPQRPRRRAARPTVGSLTSLadPPPPPPTPE 2709
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2022791524  790 ETVTGRLFGFGASIFSQASNLISTAgQPGSQTSGPPPAGPAAKQPQAPAQPPASQappkdTAQTQSSPKAVPVKKEAKPL 869
Cdd:PHA03247  2710 PAPHALVSATPLPPGPAAARQASPA-LPAAPAPPAVPAGPATPGGPARPARPPTT-----AGPPAPAPPAAPAAGPPRRL 2783
                          330       340       350       360       370       380       390       400
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2022791524  870 TSEKMEPTKVDSVLTTKGSDLEKKPSLAKDSKPQAAEAKKPAGVLEPEKASHPKVSCPlcktglnvgsKDPPNFNTCTEC 949
Cdd:PHA03247  2784 TRPAVASLSESRESLPSPWDPADPPAAVLAPAAALPPAASPAGPLPPPTSAQPTAPPP----------PPGPPPPSLPLG 2853
                          410       420       430       440       450       460       470       480
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2022791524  950 KKVVcnlcgfnPMPHIAevqewlclncqtQRAMSGQLGDMGKVP-------LPKPGPS---QPVSKPPATPQKQPVPAVS 1019
Cdd:PHA03247  2854 GSVA-------PGGDVR------------RRPPSRSPAAKPAAParppvrrLARPAVSrstESFALPPDQPERPPQPQAP 2914
                          490       500
                   ....*....|....*....|....*...
gi 2022791524 1020 HFPQKTSTPPTPAAAKPKEEPGVQKEVP 1047
Cdd:PHA03247  2915 PPPQPQPQPPPPPQPQPPPPPPPRPQPP 2942
PDZ3_Scribble-like cd06702
PDZ domain 3 of Drosophila Scribble, human Scribble homolog, and related domains; PDZ (PSD-95 ...
4539-4614 3.00e-05

PDZ domain 3 of Drosophila Scribble, human Scribble homolog, and related domains; PDZ (PSD-95 (Postsynaptic density protein 95), Dlg (Discs large protein), and ZO-1 (Zonula occludens-1)) domain 2 of Drosophila Scribble (also known as LAP4), human Scribble homolog (also known as hScrib, LAP4, CriB1, ScrB1 and Vartul), and related domains. They belong to the LAP family, which describes proteins that contain either one or four PDZ domains and 16 LRRs (leucine-rich repeats) and function in controlling cell shape, size and subcellular protein localization. In Drosophila, the Scribble complex, comprising Scribble, discs large, and lethal giant larvae, plays a role in apico-basal cell polarity, in other forms of polarity, including regulation of the actin cytoskeleton, cell signaling and vesicular trafficking, and in tumor development. Mammalian Scribble is important in many aspects of cancer development. Scribble and its homologs can be downregulated or overexpressed in cancer; they have a role in cancer beyond their function in loss of cell polarity. PDZ domains usually bind in a sequence-specific manner to short peptide sequences located at the C-terminal end of their partner proteins (known as PDZ binding motifs). The PDZ superfamily includes canonical PDZ domains as well as those with circular permutations and domain swapping mediated by beta-strands. This Scribble-like family PDZ3 domain is a canonical PDZ domain containing six beta-strands A-F and two alpha-helices (alpha-helix 1 and 2), arranged in the order: beta-strands A, B, C, alpha-helix 1, beta-strands D, E, alpha-helix 2 and beta-strand F.


Pssm-ID: 467186 [Multi-domain]  Cd Length: 89  Bit Score: 45.33  E-value: 3.00e-05
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2022791524 4539 HTVSGNG-LGIRVVGGKE---IPGSSGETGAYIAKILPGGNAEQTGKLIeGMQVLEWNGIPLTGKTYEEVQNIIIQQSGE 4614
Cdd:cd06702      4 HLVKAGGpLGLSIVGGSDhssHPFGVDEPGIFISKVIPDGAAAKSGLRI-GDRILSVNGKDLRHATHQEAVSALLSPGQE 82
PRK10263 PRK10263
DNA translocase FtsK; Provisional
187-321 3.01e-05

DNA translocase FtsK; Provisional


Pssm-ID: 236669 [Multi-domain]  Cd Length: 1355  Bit Score: 50.47  E-value: 3.01e-05
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2022791524  187 KQKVIQKEQGKPEEQRGSAKLPAQQQSPKPVQPQGHVKPTPQQSESSKAVPQQQQPGEPKQIQkpgpghpadsklEQVKQ 266
Cdd:PRK10263   752 VQQPQQPVAPQQQYQQPQQPVAPQPQYQQPQQPVAPQPQYQQPQQPVAPQPQYQQPQQPVAPQ------------PQYQQ 819
                           90       100       110       120       130       140
                   ....*....|....*....|....*....|....*....|....*....|....*....|
gi 2022791524  267 PPQPRGPQKSQPQQSEPTKPVQQQTSAKP---SAGPAKPLpQQPDSARTSSQ--APPPAK 321
Cdd:PRK10263   820 PQQPVAPQPQYQQPQQPVAPQPQDTLLHPllmRNGDSRPL-HKPTTPLPSLDllTPPPSE 878
PRK14971 PRK14971
DNA polymerase III subunit gamma/tau;
250-377 3.07e-05

DNA polymerase III subunit gamma/tau;


Pssm-ID: 237874 [Multi-domain]  Cd Length: 614  Bit Score: 50.16  E-value: 3.07e-05
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2022791524  250 KPGPGHPADSKLEQVKQPPQPRGPQ-KSQPQQSEPTKPVQQQTSAKPSAGPAKPLPQQPDSARTSSQAPPPAKPSSQQLG 328
Cdd:PRK14971   369 ASGGRGPKQHIKPVFTQPAAAPQPSaAAAASPSPSQSSAAAQPSAPQSATQPAGTPPTVSVDPPAAVPVNPPSTAPQAVR 448
                           90       100       110       120       130
                   ....*....|....*....|....*....|....*....|....*....|.
gi 2022791524  329 PVKQPSQQPAR--QGGPVKPSSQQagpPKQPSQQPGPEKPSAQQMGPAKQA 377
Cdd:PRK14971   449 PAQFKEEKKIPvsKVSSLGPSTLR---PIQEKAEQATGNIKEAPTGTQKEI 496
PDZ3_ZO1-like_domain cd06729
PDZ domain 3 of Zonula Occludens-1 (ZO-1), homologs ZO-2 and ZO-3, and related domains; PDZ ...
4543-4604 3.08e-05

PDZ domain 3 of Zonula Occludens-1 (ZO-1), homologs ZO-2 and ZO-3, and related domains; PDZ (PSD-95 (Postsynaptic density protein 95), Dlg (Discs large protein), and ZO-1 (Zonula occludens-1)) domain 3 of ZO-1, -2, -3 and related domains. Zonula occludens proteins (ZO-1, ZO-2, ZO-3) are multi-PDZ domain proteins involved in the maintenance and biogenesis of multi-protein networks at the cytoplasmic surface of intercellular contacts in epithelial and endothelial cells. They have three N-terminal PDZ domains, PDZ1-3, followed by a Src homology-3 (SH3) domain and a guanylate kinase (GuK)-like domain. Among protein-protein interactions for all ZO proteins is the binding of the first PDZ domain (PDZ1) to the C-termini of claudins , and the homo- and hetero-dimerization of ZO-proteins via their second PDZ domain (PDZ2), which takes place by symmetrical domain swapping of the first two beta-strands of PDZ2. At the cell level, ZO-1 and ZO-2 are involved in polarity maintenance, gene transcription, cell proliferation, and tumor cell metastasis. PDZ domains usually bind in a sequence-specific manner to short peptide sequences located at the C-terminal end of their partner proteins (known as PDZ binding motifs). The PDZ superfamily includes canonical PDZ domains as well as those with circular permutations and domain swapping mediated by beta-strands. This ZO family PDZ3 domain is a canonical PDZ domain containing six beta-strands A-F and two alpha-helices (alpha-helix 1 and 2), arranged in the order: beta-strands A, B, C, alpha-helix 1, beta-strands D, E, alpha-helix 2 and beta-strand F.


Pssm-ID: 467211 [Multi-domain]  Cd Length: 82  Bit Score: 45.25  E-value: 3.08e-05
                           10        20        30        40        50        60
                   ....*....|....*....|....*....|....*....|....*....|....*....|..
gi 2022791524 4543 GNGLGIRVVGGKEipgssgeTGAYIAKILPGGNAEQTGkLIEGMQVLEWNGIPLTGKTYEEV 4604
Cdd:cd06729     10 GGSVGLRLAGGND-------VGIFVAGVQEGSPAEKQG-LQEGDQILKVNGVDFRNLTREEA 63
PDZ_Radil-like cd06690
PDZ domain of Ras-associating and dilute domain-containing protein (Radil) and related domains; ...
4543-4614 3.21e-05

PDZ domain of Ras-associating and dilute domain-containing protein (Radil) and related domains; PDZ (PSD-95 (Postsynaptic density protein 95), Dlg (Discs large protein), and ZO-1 (Zonula occludens-1)) domain 1 of Radil (also known as protein KIAA1849) and related domains. Radil is required for cell adhesion and migration of neural crest precursors during development. Radil is a component of a Rasip1-Radil-ARHGAP29 complex at endothelial cell-cell junctions. Rap1, via its effectors Radil and Rasip1 and their binding partner ArhGAP29, controls the endothelial barrier by decreasing Rho-mediated radial tension on cell-cell junctions. ArhGAP29 binds the Radil PDZ domain. The Radil PDZ domain also binds kinesin family protein 14 (KIF14); KIF14 negatively regulates Rap1-mediated inside-out integrin activation by tethering Radil on microtubules. PDZ domains usually bind in a sequence-specific manner to short peptide sequences located at the C-terminal end of their partner proteins (known as PDZ binding motifs). The PDZ superfamily includes canonical PDZ domains as well as those with circular permutations and domain swapping mediated by beta-strands. This Radil-like family domain is a canonical PDZ domain containing six beta-strands A-F and two alpha-helices (alpha-helix 1 and 2), arranged in the order: beta-strands A, B, C, alpha-helix 1, beta-strands D, E, alpha-helix 2 and beta-strand F.


Pssm-ID: 467177 [Multi-domain]  Cd Length: 88  Bit Score: 45.36  E-value: 3.21e-05
                           10        20        30        40        50        60        70
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|..
gi 2022791524 4543 GNGLGIRVVGGKEIPGssGETGAYIAKILPGGNAEQTGKLIEGMQVLEWNGIPLTGKTYEEVQNiIIQQSGE 4614
Cdd:cd06690     12 PKGLGLGLIDGLHTPL--RSPGIYIRTLVPDSPAARDGRLRLGDRILAVNGTSLVGADYQSAMD-LIRTSGD 80
PspC_subgroup_1 NF033838
pneumococcal surface protein PspC, choline-binding form; The pneumococcal surface protein PspC, ...
176-405 3.24e-05

pneumococcal surface protein PspC, choline-binding form; The pneumococcal surface protein PspC, as described in Streptococcus pneumoniae, is a repetitive and highly variable protein, recognized by a conserved N-terminal domain and also by genomic location. This form, subgroup 1, has variable numbers of a choline-binding repeat in the C-terminal region, and is also known as choline-binding protein A. The other form, subgroup 2, is anchored covalently after cleavage by sortase at a C-terminal LPXTG site.


Pssm-ID: 468201 [Multi-domain]  Cd Length: 684  Bit Score: 50.40  E-value: 3.24e-05
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2022791524  176 DSDTAHEEAGRKQKVIQKEQGKPEEQRGSAKLPAQQQSPKPVqPQGHVKPTPQQSESSKAVPQQQQPGEpKQIQKPGPGH 255
Cdd:NF033838   254 ATSEQDKPKRRAKRGVLGEPATPDKKENDAKSSDSSVGEETL-PSPSLKPEKKVAEAEKKVEEAKKKAK-DQKEEDRRNY 331
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2022791524  256 PA------------------DSKLEQVKQ-PPQPRGPQK-SQPQ------QSEPTKPVQQQTSAKPSAGPAKPLPQQPDS 309
Cdd:NF033838   332 PTntyktleleiaesdvkvkEAELELVKEeAKEPRNEEKiKQAKakveskKAEATRLEKIKTDRKKAEEEAKRKAAEEDK 411
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2022791524  310 ARTSsqapppakpssqqlgPVKQPSQQPARQggPVKPSSQqagpPKQPSQQPGPEKPSAQQMGPAKQASQPGPGKPPLQQ 389
Cdd:NF033838   412 VKEK---------------PAEQPQPAPAPQ--PEKPAPK----PEKPAEQPKAEKPADQQAEEDYARRSEEEYNRLTQQ 470
                          250
                   ....*....|....*.
gi 2022791524  390 TGPVKQAPPQsgPTKP 405
Cdd:NF033838   471 QPPKTEKPAQ--PSTP 484
PDZ_GOPC-like cd06800
PDZ domain of Golgi-associated PDZ and coiled-coil motif-containing protein (GOPC), and ...
4545-4617 3.28e-05

PDZ domain of Golgi-associated PDZ and coiled-coil motif-containing protein (GOPC), and related domains; PDZ (PSD-95 (Postsynaptic density protein 95), Dlg (Discs large protein), and ZO-1 (Zonula occludens-1)) domain of GOPC and related domains. GOPC, also known as PIST (PDZ domain protein interacting specifically with TC10), FIG (fused in glioblastoma), and CAL (CFTR-associated ligand), regulates the trafficking of a wide array of proteins, including small GTPases, receptors, and cell surface molecules such as cadherin 23 and CFTR. It may regulate CFTR chloride currents and acid-sensing ASIC3 currents by modulating cell surface expression of both channels, and may play a role in autophagy. Interaction partners of the GOPC PDZ domains include: FZD5, FZD8, ASIC3, CFTR, MUC3, ARFRP1, Ggamma13, neuroligin, and Stargazin. PDZ domains usually bind in a sequence-specific manner to short peptide sequences located at the C-terminal end of their partner proteins (known as PDZ binding motifs). The PDZ superfamily includes canonical PDZ domains as well as those with circular permutations and domain swapping mediated by beta-strands. This GOPC-like family domain is a canonical PDZ domain containing six beta-strands A-F and two alpha-helices (alpha-helix 1 and 2), arranged in the order: beta-strands A, B, C, alpha-helix 1, beta-strands D, E, alpha-helix 2 and beta-strand F.


Pssm-ID: 467261 [Multi-domain]  Cd Length: 83  Bit Score: 45.05  E-value: 3.28e-05
                           10        20        30        40        50        60        70
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*
gi 2022791524 4545 GLGIRVVGGKE--IPgssgetgAYIAKILPGGNAEQTGKLIEGMQVLEWNGIPLTGKTYEEVQNIIIQQSGEAEI 4617
Cdd:cd06800     12 GLGISITGGKEhgVP-------ILISEIHEGQPADRCGGLYVGDAILSVNGIDLRDAKHKEAVTILSQQRGEITL 79
SP2_N cd22540
N-terminal domain of transcription factor Specificity Protein (SP) 2; Specificity Proteins ...
204-451 3.34e-05

N-terminal domain of transcription factor Specificity Protein (SP) 2; Specificity Proteins (SPs) are transcription factors that are involved in many cellular processes, including cell differentiation, cell growth, apoptosis, immune responses, response to DNA damage, and chromatin remodeling. SP2 contains the least conserved DNA-binding domain within the SP subfamily of proteins, and its DNA sequence specificity differs from the other SP proteins. It localizes primarily within subnuclear foci associated with the nuclear matrix, and can activate, or in some cases, repress expression from different promoters. The transcription factor SP2 serves as a paradigm for indirect genomic binding. It does not require its DNA-binding domain for genomic DNA binding and occupies target promoters independently of whether they contain a cognate DNA-binding motif. SP2 belongs to a family of proteins, called the SP/Kruppel or Krueppel-like Factor (KLF) family, characterized by a C-terminal DNA-binding domain of 81 amino acids consisting of three Kruppel-like C2H2 zinc fingers. These factors bind to a loose consensus motif, namely NNRCRCCYY (where N is any nucleotide; R is A/G, and Y is C/T), such as the recurring motifs in GC and GT boxes (5'-GGGGCGGGG-3' and 5-GGTGTGGGG-3') that are present in promoters and more distal regulatory elements of mammalian genes. SP factors preferentially bind GC boxes, while KLFs bind CACCC boxes. Another characteristic hallmark of SP factors is the presence of the Buttonhead (BTD) box CXCPXC, just N-terminal to the zinc fingers. The function of the BTD box is unknown, but it is thought to play an important physiological role. Another feature of most SP factors is the presence of a conserved amino acid stretch, the so-called SP box, located close to the N-terminus. SP factors may be separated into three groups based on their domain architecture and the similarity of their N-terminal transactivation domains: SP1-4, SP5, and SP6-9. The transactivation domains between the three groups are not homologous to one another. SP1-4 have similar N-terminal transactivation domains characterized by glutamine-rich regions, which, in most cases, have adjacent serine/threonine-rich regions. This model represents the N-terminal domain of SP2.


Pssm-ID: 411776 [Multi-domain]  Cd Length: 511  Bit Score: 49.92  E-value: 3.34e-05
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2022791524  204 SAKLPAQQQSPKPVQPQghvKPTPQQSESSKAVPQQQQPGepkqIQKPGPGHPADSKLEQVKQPPQPRGPQK-SQPQQSE 282
Cdd:cd22540      9 SEYLQPAASTTQDSQPS---PLALLAATCSKIGPPAVEAA----VTPPAPPQPTPRKLVPIKPAPLPLGPGKnSIGFLSA 81
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2022791524  283 PTKPVQQQTSAKPSAGPAKPLP---QQPDSARTSSQAPPPAKPS---SQQLGPVKQPSQQPARQGGPVKPSSQQAGPPKQ 356
Cdd:cd22540     82 KGNIIQLQGSQLSSSAPGGQQVfaiQNPTMIIKGSQTRSSTNQQyqiSPQIQAAGQINNSGQIQIIPGTNQAIITPVQVL 161
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2022791524  357 PSQQpgpekpsaqqmgpakQASQPGPGKPPLQQTGPVKQAPPQSGPTKPPPQTAGATKAPVPQPGLT---KPPGQQPEPE 433
Cdd:cd22540    162 QQPQ---------------QAHKPVPIKPAPLQTSNTNSASLQVPGNVIKLQSGGNVALTLPVNNLVgtqDGATQLQLAA 226
                          250
                   ....*....|....*...
gi 2022791524  434 KPSQQKQASATQPAESAP 451
Cdd:cd22540    227 APSKPSKKIRKKSAQAAQ 244
Med15 pfam09606
ARC105 or Med15 subunit of Mediator complex non-fungal; The approx. 70 residue Med15 domain of ...
308-433 3.37e-05

ARC105 or Med15 subunit of Mediator complex non-fungal; The approx. 70 residue Med15 domain of the ARC-Mediator co-activator is a three-helix bundle with marked similarity to the KIX domain. The sterol regulatory element binding protein (SREBP) family of transcription activators use the ARC105 subunit to activate target genes in the regulation of cholesterol and fatty acid homeostasis. In addition, Med15 is a critical transducer of gene activation signals that control early metazoan development.


Pssm-ID: 312941 [Multi-domain]  Cd Length: 732  Bit Score: 50.39  E-value: 3.37e-05
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2022791524  308 DSARTSSQAPPPAKPSSQQLGPVKQPSQQP---ARQGGPvkpsSQQAGPPKQPSQQPGPEKPSAQQMGPAKQAS------ 378
Cdd:pfam09606   52 DMSKKAAQQQQPQGGQGNGGMGGGQQGMPDpinALQNLA----GQGTRPQMMGPMGPGPGGPMGQQMGGPGTASnllasl 127
                           90       100       110       120       130       140       150
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*.
gi 2022791524  379 -----------------QPGPGKPPLQQTGPVKQAPPQSGPTKP----PPQTAGATKAPVPQPGLTKPPGQQPEPE 433
Cdd:pfam09606  128 grpqmpmggagfpsqmsRVGRMQPGGQAGGMMQPSSGQPGSGTPnqmgPNGGPGQGQAGGMNGGQQGPMGGQMPPQ 203
PHA03247 PHA03247
large tegument protein UL36; Provisional
522-1019 3.75e-05

large tegument protein UL36; Provisional


Pssm-ID: 223021 [Multi-domain]  Cd Length: 3151  Bit Score: 50.32  E-value: 3.75e-05
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2022791524  522 GPGPQLPPPKQKTPTPPSTakPPPQPQPGQKKDA----SPKPD-PAQLAEPKKPPPQkkqpsmPGSPPVKAKTSRAEPAD 596
Cdd:PHA03247  2550 DPPPPLPPAAPPAAPDRSV--PPPRPAPRPSEPAvtsrARRPDaPPQSARPRAPVDD------RGDPRGPAPPSPLPPDT 2621
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2022791524  597 TSqqIDPAPKSDRAKPSQAEDKQKQPSIQKPAADTVLTPAAPGLKQDSAGPRPPPTQQKVTDSPKPELAKP-------SQ 669
Cdd:PHA03247  2622 HA--PDPPPPSPSPAANEPDPHPPPTVPPPERPRDDPAPGRVSRPRRARRLGRAAQASSPPQRPRRRAARPtvgsltsLA 2699
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2022791524  670 DTHPAGEKPDSKPLPQVSRQKSdPKLSAQPGARSEAKTQKPVEPAQVKDDPKKLQTKPSPKPETKPAPKGLQPGAGPKPV 749
Cdd:PHA03247  2700 DPPPPPPTPEPAPHALVSATPL-PPGPAAARQASPALPAAPAPPAVPAGPATPGGPARPARPPTTAGPPAPAPPAAPAAG 2778
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2022791524  750 PAQPAPQPQQPqktpeqprrfSLNLGGITDTPKPQPTTPQETVTGRLFGFGASIFSQASNLISTAGQPGS--QTSGPPPA 827
Cdd:PHA03247  2779 PPRRLTRPAVA----------SLSESRESLPSPWDPADPPAAVLAPAAALPPAASPAGPLPPPTSAQPTAppPPPGPPPP 2848
                          330       340       350       360       370       380       390       400
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2022791524  828 GPAAKQPQAPAQPPASQAPPKDTAQTQSSPKAVPVKKEAKPLTSEKMEPTKvdsvlttkgsdlEKKPSLAKDSKPQAAEA 907
Cdd:PHA03247  2849 SLPLGGSVAPGGDVRRRPPSRSPAAKPAAPARPPVRRLARPAVSRSTESFA------------LPPDQPERPPQPQAPPP 2916
                          410       420       430       440       450       460       470       480
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2022791524  908 KKPAGVLEPEKASHPKVSCPlcktglnvGSKDPPnfntcteckkvvcnlcgFNPMPHIAEVQEwlclncQTQRAMSGQLG 987
Cdd:PHA03247  2917 PQPQPQPPPPPQPQPPPPPP--------PRPQPP-----------------LAPTTDPAGAGE------PSGAVPQPWLG 2965
                          490       500       510       520
                   ....*....|....*....|....*....|....*....|
gi 2022791524  988 DM--GKVPLPK-----PGPSQPVSKPPA-TPQKQPVPAVS 1019
Cdd:PHA03247  2966 ALvpGRVAVPRfrvpqPAPSREAPASSTpPLTGHSLSRVS 3005
PHA03307 PHA03307
transcriptional regulator ICP4; Provisional
198-451 3.99e-05

transcriptional regulator ICP4; Provisional


Pssm-ID: 223039 [Multi-domain]  Cd Length: 1352  Bit Score: 50.17  E-value: 3.99e-05
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2022791524  198 PEEQRGSAKLPAQQQSPKPVQPQGHVKPTPQQSESSKAVPQQQQPGEPKQIQKPGPGHPADSKLEQVKQPPQPRGPQKSQ 277
Cdd:PHA03307   136 EMLRPVGSPGPPPAASPPAAGASPAAVASDAASSRQAALPLSSPEETARAPSSPPAEPPPSTPPAAASPRPPRRSSPISA 215
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2022791524  278 PQQSeptkpvqqqTSAKPSAGPAKPLPQQPDSARTSSQAPPPAKPSSQQLGPVKQPSQQPARQGGPVKPSSQQAGPPKQP 357
Cdd:PHA03307   216 SASS---------PAPAPGRSAADDAGASSSDSSSSESSGCGWGPENECPLPRPAPITLPTRIWEASGWNGPSSRPGPAS 286
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2022791524  358 SQQPGPEKPSAQQMGPAKQASQPGPGKPPLQQTGPVKQAPPQSGPTKPPPQTAGATKAPVPQPGltKPPGQQPEPEKPSQ 437
Cdd:PHA03307   287 SSSSPRERSPSPSPSSPGSGPAPSSPRASSSSSSSRESSSSSTSSSSESSRGAAVSPGPSPSRS--PSPSRPPPPADPSS 364
                          250
                   ....*....|....
gi 2022791524  438 QKQASATQPAESAP 451
Cdd:PHA03307   365 PRKRPRPSRAPSSP 378
PDZ4_INAD-like cd23065
PDZ domain 4 of inactivation-no-after-potential D (INAD), and related domains; PDZ (PSD-95 ...
4541-4617 4.11e-05

PDZ domain 4 of inactivation-no-after-potential D (INAD), and related domains; PDZ (PSD-95 (Postsynaptic density protein 95), Dlg (Discs large protein), and ZO-1 (Zonula occludens-1)) domain 4 of INAD, and related domains. INAD assembles key enzymes of the Drosophila compound eye photo-transduction pathway into a supramolecular complex, supporting efficient and fast light signaling. It contains 5 PDZ domains arranged in tandem (PDZ1-PDZ5) which independently bind various proteins. INAD PDZ2 binds eye-specific protein kinase C, INAD PDZ3 binds transient receptor potential (TRP) channel, and INAD PDZ4,5 tandem binds NORPA (phospholipase Cbeta, PLCbeta). Mutations of the inaD gene that lead to disruption of each of these interactions impair fly photo signal transduction. PDZ domains usually bind in a sequence-specific manner to short peptide sequences located at the C-terminal end of their partner proteins (known as PDZ binding motifs). The PDZ superfamily includes canonical PDZ domains as well as those with circular permutations and domain swapping mediated by beta-strands. This INAD-like family PDZ4 domain is a canonical PDZ domain containing six beta-strands A-F and two alpha-helices (alpha-helix 1 and 2), arranged in the order: beta-strands A, B, C, alpha-helix 1, beta-strands D, E, alpha-helix 2 and beta-strand F.


Pssm-ID: 467278 [Multi-domain]  Cd Length: 82  Bit Score: 44.81  E-value: 4.11e-05
                           10        20        30        40        50        60        70
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*..
gi 2022791524 4541 VSGNGLGIRVVGGKEipgsSGETGAYIAKILPGGNAEQTGKLIEGMQVLEWNGIPLTGKTYEEVQNIIIQQSGEAEI 4617
Cdd:cd23065      6 TDKSPLGVSVVGGKN----HVTTGCIITHIYPNSIVAADKRLKVFDQILDINGTKVHVMTTLKVHQLFHKTYEKAVT 78
Cytadhesin_P30 pfam07271
Cytadhesin P30/P32; This family consists of several Mycoplasma species specific Cytadhesin P32 ...
276-437 4.25e-05

Cytadhesin P30/P32; This family consists of several Mycoplasma species specific Cytadhesin P32 and P30 proteins. P30 has been found to be membrane associated and localized on the tip organelle. It is thought that it is important in cytadherence and virulence. The N-terminus contains two predicted transmembrane helices followed by a long region of a short 6 residue proline rich repeat.


Pssm-ID: 429374 [Multi-domain]  Cd Length: 275  Bit Score: 48.81  E-value: 4.25e-05
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2022791524  276 SQPQQSEPTKPVQQQTSAKPSAGPAKPLPQQPDSARTSSQAPPPAKPSSQQ-LGPVKQPSQQPaRQGGP----VKPSSQQ 350
Cdd:pfam07271  117 SEQQETQAIEPTEQQNTAEPTQPAGVNVANQPQMGINQPQINPQFGPNPQQrIGFPMQPNMGM-RPGFNqmpgMPPNQMR 195
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2022791524  351 AGPPKQPS-------QQPGPekpsaqQMgpakqasQPGPGKPPlqQTGPVKQAPpqsGPTKPPPQTAGATKAPvPQPGLT 423
Cdd:pfam07271  196 PGFNQMPGmpprpgfPNPMP------NM-------QPRPGFRP--QPGPMGNRP---GGGFPHPGTPMGPNRM-PNPGMN 256
                          170
                   ....*....|....*
gi 2022791524  424 KPPGQQ-PEPEKPSQ 437
Cdd:pfam07271  257 QRPGMApPRPGFPPQ 271
PRK07764 PRK07764
DNA polymerase III subunits gamma and tau; Validated
235-453 4.54e-05

DNA polymerase III subunits gamma and tau; Validated


Pssm-ID: 236090 [Multi-domain]  Cd Length: 824  Bit Score: 49.98  E-value: 4.54e-05
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2022791524  235 AVPQQQQPGepkqiQKPGPGHPADSKleqvkQPPQPRGPQKSQPQQSEPTKPVQQQTSAKPSAGPAKPLPQQPDSARTSS 314
Cdd:PRK07764   392 GAPAAAAPS-----AAAAAPAAAPAP-----AAAAPAAAAAPAPAAAPQPAPAPAPAPAPPSPAGNAPAGGAPSPPPAAA 461
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2022791524  315 QAPPPAKPSsqqlGPVKQPSQQPARQGGPVKPSSQQAGPPKQPSQQPGPE---------------------------KPS 367
Cdd:PRK07764   462 PSAQPAPAP----AAAPEPTAAPAPAPPAAPAPAAAPAAPAAPAAPAGADdaatlrerwpeilaavpkrsrktwailLPE 537
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2022791524  368 AQQMGPAKQASQPGPGKPPLQQ-------------------------------TGPVKQAPPQSGPTKPPPQTAGATKAP 416
Cdd:PRK07764   538 ATVLGVRGDTLVLGFSTGGLARrfaspgnaevlvtalaeelggdwqveavvgpAPGAAGGEGPPAPASSGPPEEAARPAA 617
                          250       260       270
                   ....*....|....*....|....*....|....*..
gi 2022791524  417 VPQPGLTKPPGQQPEPEKPSQQKQASATQPAESAPKK 453
Cdd:PRK07764   618 PAAPAAPAAPAPAGAAAAPAEASAAPAPGVAAPEHHP 654
PHA03379 PHA03379
EBNA-3A; Provisional
289-451 4.57e-05

EBNA-3A; Provisional


Pssm-ID: 223066 [Multi-domain]  Cd Length: 935  Bit Score: 50.06  E-value: 4.57e-05
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2022791524  289 QQTSAKPSAGPAKPLPQQPDSARTSSQAPPPAKPSSqqlGPVKQPSQQPARQGGPVKPSS-QQAGPPKQPSQQPGPEKPS 367
Cdd:PHA03379   410 EPTYGTPRPPVEKPRPEVPQSLETATSHGSAQVPEP---PPVHDLEPGPLHDQHSMAPCPvAQLPPGPLQDLEPGDQLPG 486
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2022791524  368 AQQMGPAKQASQPGPGKPPLQQTGPVKQAPPQSGPTKPPPQTAGATKAPVPQPGLTKPPGQQPE------PEKPSQQKQA 441
Cdd:PHA03379   487 VVQDGRPACAPVPAPAGPIVRPWEASLSQVPGVAFAPVMPQPMPVEPVPVPTVALERPVCPAPPliamqgPGETSGIVRV 566
                          170
                   ....*....|.
gi 2022791524  442 SAT-QPAESAP 451
Cdd:PHA03379   567 RERwRPAPWTP 577
PHA03418 PHA03418
hypothetical E4 protein; Provisional
251-413 4.64e-05

hypothetical E4 protein; Provisional


Pssm-ID: 177646 [Multi-domain]  Cd Length: 230  Bit Score: 48.20  E-value: 4.64e-05
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2022791524  251 PGPGHPA--DSKLEQVKQPPQPRGPQKSQPQQSEPTKPVQQQTSAKPSAGPAKPLPQQPDS-------ARTSSQAPPPAk 321
Cdd:PHA03418    37 PAPHHPNpqEDPDKNPSPPPDPPLTPRPPAQPNGHNKPPVTKQPGGEGTEEDHQAPLAADAdddprpgKRSKADEHGPA- 115
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2022791524  322 PSSQQLGPVKQPSQQPARQGGPVKPSSQQAGPPKQPsqqpgPEkpsaqqmgpAKQASQPGPGKPPLQQTGpvkQAPPQSG 401
Cdd:PHA03418   116 PGRAALAPFKLDLDQDPLHGDPDPPPGATGGQGEEP-----PE---------GGEESQPPLGEGEGAVEG---HPPPLPP 178
                          170
                   ....*....|..
gi 2022791524  402 PTKPPPQTAGAT 413
Cdd:PHA03418   179 APEPKPHNGDAT 190
PHA03247 PHA03247
large tegument protein UL36; Provisional
247-452 4.79e-05

large tegument protein UL36; Provisional


Pssm-ID: 223021 [Multi-domain]  Cd Length: 3151  Bit Score: 49.94  E-value: 4.79e-05
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2022791524  247 QIQKPGPGHPADSKLEQVKQPPQPRGPQKSQPQQSEPTKPVQQQT-------SAKPSAGPAKPLPQQPDSArTSSQAPPP 319
Cdd:PHA03247   251 DIAAPAPPPVVGEGADRAPETARGATGPPPPPEAAAPNGAAAPPDgvwgaalAGAPLALPAPPDPPPPAPA-GDAEEEDD 329
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2022791524  320 AKPSSQQLGPVKQPSQQ-----PARQGGPVKPSSQ----QAGPPKQPSQQPGPEKPSAQQMGPAKQASQPGPG-----KP 385
Cdd:PHA03247   330 EDGAMEVVSPLPRPRQHyplgfPKRRRPTWTPPSSledlSAGRHHPKRASLPTRKRRSARHAATPFARGPGGDdqtrpAA 409
                          170       180       190       200       210       220
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*..
gi 2022791524  386 PLQQTGPVKQAPPQSGPTKPPPqtagATKAPVPQPGLTKPPGQQPEPEKPSQQKQASATQPAESAPK 452
Cdd:PHA03247   410 PVPASVPTPAPTPVPASAPPPP----ATPLPSAEPGSDDGPAPPPERQPPAPATEPAPDDPDDATRK 472
PHA03247 PHA03247
large tegument protein UL36; Provisional
202-472 4.83e-05

large tegument protein UL36; Provisional


Pssm-ID: 223021 [Multi-domain]  Cd Length: 3151  Bit Score: 49.94  E-value: 4.83e-05
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2022791524  202 RGSAKLPAqqqsPKPVQPQGHVKPtPQQSESSKAVPQQQQPGEPKQIQKPGPG--HPADSKLEQVKQPPQPRGPQKSQPQ 279
Cdd:PHA03247   249 RGDIAAPA----PPPVVGEGADRA-PETARGATGPPPPPEAAAPNGAAAPPDGvwGAALAGAPLALPAPPDPPPPAPAGD 323
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2022791524  280 QSEPTKPVQQQTSAKPSAGPAKPLPQQPDSARTSSQAPPPAKP--SSQQLGPVKQPSQQPARQGGPVKPSSQQAGPPKQP 357
Cdd:PHA03247   324 AEEEDDEDGAMEVVSPLPRPRQHYPLGFPKRRRPTWTPPSSLEdlSAGRHHPKRASLPTRKRRSARHAATPFARGPGGDD 403
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2022791524  358 SQQPGPEKPsaqqmgpakqASQPGPGKPPlqqtGPVKQAPPQSGPTKPPPQTAGATKAPVPQPGLTKPPGQQPEPEKPSQ 437
Cdd:PHA03247   404 QTRPAAPVP----------ASVPTPAPTP----VPASAPPPPATPLPSAEPGSDDGPAPPPERQPPAPATEPAPDDPDDA 469
                          250       260       270
                   ....*....|....*....|....*....|....*
gi 2022791524  438 QKQASATQPAESAPKKtfcPLCTTTELLLHTPEKA 472
Cdd:PHA03247   470 TRKALDALRERRPPEP---PGADLAELLGRHPDTA 501
PspC_subgroup_2 NF033839
pneumococcal surface protein PspC, LPXTG-anchored form; The pneumococcal surface protein PspC, ...
604-747 4.96e-05

pneumococcal surface protein PspC, LPXTG-anchored form; The pneumococcal surface protein PspC, as described in Streptococcus pneumoniae, is a repetitive and highly variable protein, recognized by a conserved N-terminal domain and also by genomic location. This form, subgroup 2, is anchored covalently after cleavage by sortase at a C-terminal LPXTG site. The other form, subgroup 1, has variable numbers of a choline-binding repeat in the C-terminal region, and is also known as choline-binding protein A.


Pssm-ID: 468202 [Multi-domain]  Cd Length: 557  Bit Score: 49.38  E-value: 4.96e-05
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2022791524  604 APKSDRAKPSQAEDKQKQPSIQKPAADTVLTPAAPG--LKQDSAGPRPP----PTQQKVTDSPKPELAKPSQDTHPAGEK 677
Cdd:NF033839   294 APKPGMQPSPQPEKKEVKPEPETPKPEVKPQLEKPKpeVKPQPEKPKPEvkpqLETPKPEVKPQPEKPKPEVKPQPEKPK 373
                           90       100       110       120       130       140       150
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|.
gi 2022791524  678 PDSKPLPQVSRQKSDPKLSA-QPGARSEAKTQKPvepaQVKDDPKKlqtkpsPKPETKPAPKGLQPGAGPK 747
Cdd:NF033839   374 PEVKPQPETPKPEVKPQPEKpKPEVKPQPEKPKP----EVKPQPEK------PKPEVKPQPEKPKPEVKPQ 434
PRK10927 PRK10927
cell division protein FtsN;
112-370 5.81e-05

cell division protein FtsN;


Pssm-ID: 236797 [Multi-domain]  Cd Length: 319  Bit Score: 48.52  E-value: 5.81e-05
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2022791524  112 DVLKTEQRAPGRSpssislRESKSRtdfredQKSSMMPSfLSEANPLSAVTSVVNKFNPFDLISDSDTAHEEAGRKQKVI 191
Cdd:PRK10927     5 DYVRRSQPAPSRR------KKSTSR------KKQRNLPA-VSPAMVAIAAAVLVTFIGGLYFITHHKKEESETLQSQKVT 71
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2022791524  192 QKE-QGKPEEQ-RGSAKLPAQQQSPK-PVQPQ--GHVKPTPQQSESSKAVPQQQQPG---EPKQIQKPgpghPADsklEQ 263
Cdd:PRK10927    72 GNGlPPKPEERwRYIKELESRQPGVRaPTEPSagGEVKTPEQLTPEQRQLLEQMQADmrqQPTQLVEV----PWN---EQ 144
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2022791524  264 VKQPPQPRGPQKSQPQQSEPTKPVQQQTSAkpsagPAKPLPQQpdsARTSSQAPPPAKPSSQQLGPVKQPSQQ----PAR 339
Cdd:PRK10927   145 TPEQRQQTLQRQRQAQQLAEQQRLAQQSRT-----TEQSWQQQ---TRTSQAAPVQAQPRQSKPASTQQPYQDllqtPAH 216
                          250       260       270
                   ....*....|....*....|....*....|.
gi 2022791524  340 QGGpvKPSSQQAGPPKQPSQQPgpeKPSAQQ 370
Cdd:PRK10927   217 TTA--QSKPQQAAPVTRAADAP---KPTAEK 242
PDZ_PDLIM-like cd06753
PDZ domain of PDZ-LIM family proteins, and related domains; PDZ (PSD-95 (Postsynaptic density ...
4547-4608 6.15e-05

PDZ domain of PDZ-LIM family proteins, and related domains; PDZ (PSD-95 (Postsynaptic density protein 95), Dlg (Discs large protein), and ZO-1 (Zonula occludens-1)) domain of PDZ-LIM family proteins including PDLIM1-7, and related domains. PDZ-LIM family proteins (also known as Zasp PDZ domain proteins) are involved in the rearrangement of the actin cytoskeleton; they mediate association with the cytoskeleton through alpha-actinin as well as with other proteins involved in signal transduction pathways. Members of this family include PDLIM1 (also known as C-terminal LIM domain protein 1, elfin, LIM domain protein CLP-36), PDLIM2 (also known as PDZ-LIM protein mystique), PDLIM3 (also known as actinin-associated LIM protein, alpha-actinin-2-associated LIM protein, ALP), PDLIM4 (also known as LIM protein RIL, Reversion-induced LIM protein), PDLIM5 (also known as enigma homolog, ENH, enigma-like PDZ and LIM domains protein), PDLIM6 (also known as LIM domain-binding protein 3, ZASP, Cypher, Oracle), and PDLIM7 (also known as PDZ and LIM domain protein 7, LIM mineralization protein, LMP; protein enigma). PDLIM1 has been shown to negatively regulate NF-kappaB-mediated signaling in the cytoplasm. PDLIM7 negatively regulates p53 through binding murine double minute 2 (MDM2). The PDZ domains of PDZ-LIM family proteins PDLIM1, 2, 3, 5, 6, 7 have been shown to bind actin. Other PDZ-LIM family PDZ domain binding partners include thyroid receptor interacting protein-6 (PDLIM4-PDZ), the LIM domain of PDLIM4 (PDLIM4-PDZ), tropomyosin (PDLIM7-PDZ), myotilin and calsarcin 1 (PDLIM6-PDZ), and proteins from the myotilin and FATZ (calsarcin/myozenin) families (PDLIM1, 3, 4, 6 PDZ domains). PDZ domains usually bind in a sequence-specific manner to short peptide sequences located at the C-terminal end of their partner proteins (known as PDZ binding motifs). The PDZ superfamily includes canonical PDZ domains as well as those with circular permutations and domain swapping mediated by beta-strands. This PDLIM-like family domain is a canonical PDZ domain containing six beta-strands A-F and two alpha-helices (alpha-helix 1 and 2), arranged in the order: beta-strands A, B, C, alpha-helix 1, beta-strands D, E, alpha-helix 2 and beta-strand F.


Pssm-ID: 467235 [Multi-domain]  Cd Length: 79  Bit Score: 44.06  E-value: 6.15e-05
                           10        20        30        40        50        60
                   ....*....|....*....|....*....|....*....|....*....|....*....|..
gi 2022791524 4547 GIRVVGGKEIpgssgETGAYIAKILPGGNAEQTGkLIEGMQVLEWNGIPLTGKTYEEVQNII 4608
Cdd:cd06753     11 GFRLQGGKDF-----NQPLTISRVTPGGKAAQAN-LRPGDVILAINGESTEGMTHLEAQNKI 66
PDZ3_Par3-like cd23059
PDZ domain 3 of partitioning defective 3 (Par3), and related domains; PDZ (PSD-95 ...
4545-4608 6.62e-05

PDZ domain 3 of partitioning defective 3 (Par3), and related domains; PDZ (PSD-95 (Postsynaptic density protein 95), Dlg (Discs large protein), and ZO-1 (Zonula occludens-1)) domain 3 of Par3 (or PAR3 or Par-3, also known as Atypical PKC isotype-specific-interacting protein, ASIP, Drosophila Bazooka) and related domains. Par3 is a scaffold protein involved in organizing cell polarity across animals. Par3 binds numerous molecules both for its recruitment to one pole of the cell and for downstream contributions to polarized cell function. It regulates cell polarity by targeting the Par complex proteins Par6 and atypical protein kinase C (aPKC) to specific cortical sites. Physical interactions between Par-3 and the Par complex include Par3 PDZ domain 1 binding to the Par6 PDZ domain, Par3 PDZ domain 1 and PDZ domain 3 binding the Par6's PDZ-binding motif, and an interaction with an undefined region of aPKC that requires both Par3 PDZ2 and PDZ3. The PDZ domains of Par3 have also been implicated as potential phosphoinositide signaling integrators, since its second PDZ domain binds to phosphoinositides, and the third PDZ interacts with phosphoinositide phosphatase PTEN. PDZ domains usually bind in a sequence-specific manner to short peptide sequences located at the C-terminal end of their partner proteins (known as PDZ binding motifs). The PDZ superfamily includes canonical PDZ domains as well as those with circular permutations and domain swapping mediated by beta-strands. This Par3 family PDZ3 domain is a canonical PDZ domain containing six beta-strands A-F and two alpha-helices (alpha-helix 1 and 2), arranged in the order: beta-strands A, B, C, alpha-helix 1, beta-strands D, E, alpha-helix 2 and beta-strand F.


Pssm-ID: 467272 [Multi-domain]  Cd Length: 103  Bit Score: 44.97  E-value: 6.62e-05
                           10        20        30        40        50        60
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*.
gi 2022791524 4545 GLGIRVVG--GKEIPGSSGETGAYIAKILPGGNAEQTGKLIEGMQVLEWNGIPLTGKTYEEVQNII 4608
Cdd:cd23059     17 GLGVSVKGktSKEDNGGKADLGIFIKSIIHGGAASKDGRLRVNDQLIAVNGESLLGLTNSEAMETL 82
PDZ_AFDN-like cd06789
PDZ domain of afadin (AFDN), and related domains; PDZ (PSD-95 (Postsynaptic density protein 95) ...
4543-4613 6.88e-05

PDZ domain of afadin (AFDN), and related domains; PDZ (PSD-95 (Postsynaptic density protein 95), Dlg (Discs large protein), and ZO-1 (Zonula occludens-1)) domain of afadin (AFDN, also known as ALL1-fused gene from chromosome 6 protein (AF6) and MLLT4), and related domains. AFDN belongs to the adhesion system, probably together with the E-cadherin-catenin system, that plays a role in the organization of homotypic, interneuronal, and heterotypic cell-cell adherens junctions. The AFDN PDZ domain interaction partners include poliovirus receptor-related protein PRR2/nectin, the junctional adhesion molecule (JAM), the breakpoint-cluster-region protein (BCR), connexin36 (Cx36), and a subset of Eph-related receptor tyrosine kinases; it can also bind low molecular weight ligands, in competition with a natural peptide ligand. Other AFDN-binding proteins have been identified. PDZ domains usually bind in a sequence-specific manner to short peptide sequences located at the C-terminal end of their partner proteins (known as PDZ binding motifs). The PDZ superfamily includes canonical PDZ domains as well as those with circular permutations and domain swapping mediated by beta-strands. This AFDN family domain is a canonical PDZ domain containing six beta-strands A-F and two alpha-helices (alpha-helix 1 and 2), arranged in the order: beta-strands A, B, C, alpha-helix 1, beta-strands D, E, alpha-helix 2 and beta-strand F.


Pssm-ID: 467251 [Multi-domain]  Cd Length: 89  Bit Score: 44.20  E-value: 6.88e-05
                           10        20        30        40        50        60        70
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|..
gi 2022791524 4543 GNGLGIRVVGGKeipGS-SGETGAYIAKILPGGNAEQTGKLIEGMQVLEWNGIPLTGKTYEEVQNiIIQQSG 4613
Cdd:cd06789     12 GNGMGLSIVAAK---GAgQDKLGIYIKSVVKGGAADLDGRLQAGDQLLSVDGHSLVGLSQERAAE-LMTKTG 79
PRK14971 PRK14971
DNA polymerase III subunit gamma/tau;
326-454 7.24e-05

DNA polymerase III subunit gamma/tau;


Pssm-ID: 237874 [Multi-domain]  Cd Length: 614  Bit Score: 49.00  E-value: 7.24e-05
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2022791524  326 QLGPVKQPSQQPARqggPVKPSSQQAGPPKQPSQQPGPEkPSAQQMGPAKQASQPGPGKPPLQQTGPVKQAPPQSGPTkP 405
Cdd:PRK14971   364 QKGDDASGGRGPKQ---HIKPVFTQPAAAPQPSAAAAAS-PSPSQSSAAAQPSAPQSATQPAGTPPTVSVDPPAAVPV-N 438
                           90       100       110       120       130
                   ....*....|....*....|....*....|....*....|....*....|....
gi 2022791524  406 PPQTAGATKAPVPQPGLTKPPGQQ-----PEPEKPSQQKQASATQPAESAPKKT 454
Cdd:PRK14971   439 PPSTAPQAVRPAQFKEEKKIPVSKvsslgPSTLRPIQEKAEQATGNIKEAPTGT 492
PDZ10_MUPP1-PDZ8_PATJ-like cd06673
PDZ domain 10 of multi-PDZ-domain protein 1 (MUPP1), domain 8 of PATJ (protein-associated ...
4545-4610 7.95e-05

PDZ domain 10 of multi-PDZ-domain protein 1 (MUPP1), domain 8 of PATJ (protein-associated tight junction) and similar domains; PDZ (PSD-95 (Postsynaptic density protein 95), Dlg (Discs large protein), and ZO-1 (Zonula occludens-1)) domain 10 of MUPP1, PDZ domain 8 of PATJ, and related domains. MUPP1 and PATJ serve as scaffolding proteins linking different proteins and protein complexes involved in the organization of tight junctions and epithelial polarity. MUPP1 contains an L27 (Lin-2 and Lin-7 binding) domain and 13 PDZ domains. PATJ (also known as INAD-like) contains an L27 domain and ten PDZ domains. MUPP1 and PATJ share several binding partners, including junctional adhesion molecules (JAM), zonula occludens (ZO)-3, Pals1 (protein associated with Lin-7), Par (partitioning defective)-6 proteins, and nectins (adherence junction adhesion molecules). PATJ lacks 3 PDZ domains seen in MUPP1: PDZ6, 9, and 13; consequently, MUPP1 PDZ7 and 8 align with PATJ PDZ6 and 7; and MUPP1 PDZ domains 10-12 align with PATJ PDZ domains 8-10. PDZ domains usually bind in a sequence-specific manner to short peptide sequences located at the C-terminal end of their partner proteins (known as PDZ binding motifs). The PDZ superfamily includes canonical PDZ domains as well as those with circular permutations and domain swapping mediated by beta-strands. This MUPP1-like family PDZ10 domain is a canonical PDZ domain containing six beta-strands A-F and two alpha-helices (alpha-helix 1 and 2), arranged in the order: beta-strands A, B, C, alpha-helix 1, beta-strands D, E, alpha-helix 2 and beta-strand F.


Pssm-ID: 467161 [Multi-domain]  Cd Length: 86  Bit Score: 44.20  E-value: 7.95e-05
                           10        20        30        40        50        60
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*..
gi 2022791524 4545 GLGIRVVGGKEIPgssgeTGA-YIAKILPGGNAEQTGKLIEGMQVLEWNGIPLTGKTYEEVQNIIIQ 4610
Cdd:cd06673     14 GLGLSIVGGSDTL-----LGAiIIHEVYEDGAAAKDGRLWAGDQILEVNGEDLRKATHDEAINVLRQ 75
PDZ1_hSTXBP4-PDZ2_GgSTXBP4-like cd06698
PDZ1 domain of human syntaxin-binding protein 4 (STXBP4), PDZ2 domain of Gallus gallus ...
4545-4608 8.02e-05

PDZ1 domain of human syntaxin-binding protein 4 (STXBP4), PDZ2 domain of Gallus gallus uncharacterized STXBP4 isoform X1, and related domains; PDZ (PSD-95 (Postsynaptic density protein 95), Dlg (Discs large protein), and ZO-1 (Zonula occludens-1)) domain 1 of human syntaxin-binding protein 4 (STXBP4), PDZ2 domain of Gallus gallus uncharacterized STXBP4 isoform X1, and related domains. Human STXBP4 (also known as Synip) includes a single PDZ domain, a coiled-coil domain, and a WW domain (named for its two conserved tryptophans); Gallus gallus STXBP4 isoform X1 contains 2 PDZ domains (PDZ1 and PDZ2). Human STXBP4 plays a role in the translocation of transport vesicles from the cytoplasm to the plasma membrane: insulin induces the dissociation of the STXBP4 and STX4 complex liberating STX4 to interact with Vamp2, and to form the SNARE complex thereby promoting vesicle fusion. It may also play a role in the regulation of insulin release by pancreatic beta cells after stimulation by glucose. Human STXBP4 is also known to physically associate with a prominent isoform of TP63 (deltaNp63alpha 9) whose overexpression promotes squamous cell carcinoma development, and in doing so prevents degradation of this isoform by the Cdc20-APC/C complex, Itch, and RACK1. PDZ domains usually bind in a sequence-specific manner to short peptide sequences located at the C-terminal end of their partner proteins (known as PDZ binding motifs). The PDZ superfamily includes canonical PDZ domains as well as those with circular permutations and domain swapping mediated by beta-strands. This STXBP4-like family domain is a canonical PDZ domain containing six beta-strands A-F and two alpha-helices (alpha-helix 1 and 2), arranged in the order: beta-strands A, B, C, alpha-helix 1, beta-strands D, E, alpha-helix 2 and beta-strand F.


Pssm-ID: 467184 [Multi-domain]  Cd Length: 89  Bit Score: 44.22  E-value: 8.02e-05
                           10        20        30        40        50        60
                   ....*....|....*....|....*....|....*....|....*....|....*....|....
gi 2022791524 4545 GLGIRVVGGKEIPGssgETGAYIAKILPGGNAEQTGKLIEGMQVLEWNGIPLTGKTYEEVQNII 4608
Cdd:cd06698     12 GLGLSIVGGINRPE---GPMVFIQEVIPGGDCYKDGRLRPGDQLVSINKESLIGVTLEEAKSIL 72
PHA03418 PHA03418
hypothetical E4 protein; Provisional
278-437 8.03e-05

hypothetical E4 protein; Provisional


Pssm-ID: 177646 [Multi-domain]  Cd Length: 230  Bit Score: 47.43  E-value: 8.03e-05
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2022791524  278 PQQSEPTKPVQQQTSAK-PSAGPAKPLPQQPDSARTSSQAPPPAK-PSSQQLGPVKQPSQQPARQGGPVKPSSQQAGPPK 355
Cdd:PHA03418    34 PLLPAPHHPNPQEDPDKnPSPPPDPPLTPRPPAQPNGHNKPPVTKqPGGEGTEEDHQAPLAADADDDPRPGKRSKADEHG 113
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2022791524  356 QpsqQPGPEKPSAQQMGPAKQASQPGPGKPPlQQTGPVKQAPPQSGPTKPPP--QTAGATKAPVPQPgltkPPGQQPEPE 433
Cdd:PHA03418   114 P---APGRAALAPFKLDLDQDPLHGDPDPPP-GATGGQGEEPPEGGEESQPPlgEGEGAVEGHPPPL----PPAPEPKPH 185

                   ....
gi 2022791524  434 KPSQ 437
Cdd:PHA03418   186 NGDA 189
DamX COG3266
Cell division protein DamX, binds to the septal ring, contains C-terminal SPOR domain [Cell ...
238-455 8.47e-05

Cell division protein DamX, binds to the septal ring, contains C-terminal SPOR domain [Cell cycle control, cell division, chromosome partitioning];


Pssm-ID: 442497 [Multi-domain]  Cd Length: 455  Bit Score: 48.69  E-value: 8.47e-05
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2022791524  238 QQQQPGEPKQIQKPGPGHPADSKLEQVKQPPQpRGPQKSQPQQSEPTKPVQQQTSAKPSAGPAKPLPQQPDSARTSSQAP 317
Cdd:COG3266    160 EQLLLLALQDIQGTLQALGAVAALLGLRKAEE-ALALRAGSAAADALALLLLLLASALGEAVAAAAELAALALLAAGAAE 238
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2022791524  318 PPAKPSSQQLGPVKQPSQQPARQGGPVKPSSQQAGPPKQPSQQPGPE-KPSAQQMGPAKQASQPGPGKPPLQQTGPVKQA 396
Cdd:COG3266    239 VLTARLVLLLLIIGSALKAPSQASSASAPATTSLGEQQEVSLPPAVAaQPAAAAAAQPSAVALPAAPAAAAAAAAPAEAA 318
                          170       180       190       200       210       220
                   ....*....|....*....|....*....|....*....|....*....|....*....|
gi 2022791524  397 PPQSGPTKPPPQTAGATKAPVPQpgltkPPGQQPEPEKPSQQKQASAT-QPAESAPKKTF 455
Cdd:COG3266    319 APQPTAAKPVVTETAAPAAPAPE-----AAAAAAAPAAPAVAKKLAADeQWLASQPASHY 373
PRK07764 PRK07764
DNA polymerase III subunits gamma and tau; Validated
225-451 9.22e-05

DNA polymerase III subunits gamma and tau; Validated


Pssm-ID: 236090 [Multi-domain]  Cd Length: 824  Bit Score: 48.83  E-value: 9.22e-05
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2022791524  225 PTPQQSESSKAVPQQQQPGEPkqiqkpGPGHPADSKLEQVKQPPQPRGPQKSQPQQSEPTKPVQQQTSAKPSAGPAKPLP 304
Cdd:PRK07764   394 PAAAAPSAAAAAPAAAPAPAA------AAPAAAAAPAPAAAPQPAPAPAPAPAPPSPAGNAPAGGAPSPPPAAAPSAQPA 467
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2022791524  305 QQPDSARTSSQAPPPAKPSSQQLGPVKQPSQQPARQGGP---------------------------VKPSSQQAGPPK-- 355
Cdd:PRK07764   468 PAPAAAPEPTAAPAPAPPAAPAPAAAPAAPAAPAAPAGAddaatlrerwpeilaavpkrsrktwaiLLPEATVLGVRGdt 547
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2022791524  356 -----------------------------------QPSQQPGPEKPSAQQMGPAKQASQPGPGKP--PLQQTGPVKQAPP 398
Cdd:PRK07764   548 lvlgfstgglarrfaspgnaevlvtalaeelggdwQVEAVVGPAPGAAGGEGPPAPASSGPPEEAarPAAPAAPAAPAAP 627
                          250       260       270       280       290
                   ....*....|....*....|....*....|....*....|....*....|....
gi 2022791524  399 -QSGPTKPPPQTAGATKAPVPQPGLTKPPGQQPEPEKPSQQKQASATQPAESAP 451
Cdd:PRK07764   628 aPAGAAAAPAEASAAPAPGVAAPEHHPKHVAVPDASDGGDGWPAKAGGAAPAAP 681
PHA03377 PHA03377
EBNA-3C; Provisional
225-435 9.39e-05

EBNA-3C; Provisional


Pssm-ID: 177614 [Multi-domain]  Cd Length: 1000  Bit Score: 48.90  E-value: 9.39e-05
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2022791524  225 PTPQ---QSESSKAVPQQQQPGEPKQIQKPGPGHPADSKLEQVKQPP-----QPRGPQKSQPQQSEPTKPV---QQQTSA 293
Cdd:PHA03377   645 PKPKsfwEMRAGRDGSGIQQEPSSRRQPATQSTPPRPSWLPSVFVLPsvdagRAQPSEESHLSSMSPTQPIsheEQPRYE 724
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2022791524  294 KPSAG---------PAKPLPQQPDSARTSSQAPPPAKPSSQQLGPVKQP---SQQPARQGGPVKPSSQQAGP----PKQP 357
Cdd:PHA03377   725 DPDDPldlslhpdqAPPPSHQAPYSGHEEPQAQQAPYPGYWEPRPPQAPylgYQEPQAQGVQVSSYPGYAGPwglrAQHP 804
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2022791524  358 ---------SQQP------GPEKPSAQQMGPaKQASQPGPGKPPLQQTGPVKqapPQSGPtkPPPQTAGATKAPVPQP-- 420
Cdd:PHA03377   805 ryrhswaywSQYPghghpqGPWAPRPPHLPP-QWDGSAGHGQDQVSQFPHLQ---SETGP--PRLQLSQVPQLPYSQTlv 878
                          250
                   ....*....|....*
gi 2022791524  421 GLTKPPGQQPEPEKP 435
Cdd:PHA03377   879 SSSAPSWSSPQPRAP 893
PDZ11_MUPP1-PDZ9_PATJ-like cd06674
PDZ domain 11 of MUPP1 of multi-PDZ-domain protein 1 (MUPP1), domain 9 of PATJ ...
4542-4604 1.17e-04

PDZ domain 11 of MUPP1 of multi-PDZ-domain protein 1 (MUPP1), domain 9 of PATJ (protein-associated tight junction) and related domains; PDZ (PSD-95 (Postsynaptic density protein 95), Dlg (Discs large protein), and ZO-1 (Zonula occludens-1)) domain 11 of MUPP1, PDZ domain 9 of PATJ, and related domains. MUPP1 and PATJ serve as scaffolding proteins linking different proteins and protein complexes involved in the organization of tight junctions and epithelial polarity. MUPP1 contains an L27 (Lin-2 and Lin-7 binding) domain and 13 PDZ domains. PATJ (also known as INAD-like) contains an L27 domain and ten PDZ domains. MUPP1 and PATJ share several binding partners, including junctional adhesion molecules (JAM), zonula occludens (ZO)-3, Pals1 (protein associated with Lin-7), Par (partitioning defective)-6 proteins, and nectins (adherence junction adhesion molecules). PATJ lacks 3 PDZ domains seen in MUPP1: PDZ6, 9, and 13; consequently, MUPP1 PDZ7 and 8 align with PATJ PDZ6 and 7; and MUPP1 PDZ domains 10-12 align with PATJ PDZ domains 8-10. PDZ domains usually bind in a sequence-specific manner to short peptide sequences located at the C-terminal end of their partner proteins (known as PDZ binding motifs). The PDZ superfamily includes canonical PDZ domains as well as those with circular permutations and domain swapping mediated by beta-strands. This MUPP1-like family PDZ11 domain is a canonical PDZ domain containing six beta-strands A-F and two alpha-helices (alpha-helix 1 and 2), arranged in the order: beta-strands A, B, C, alpha-helix 1, beta-strands D, E, alpha-helix 2 and beta-strand F.


Pssm-ID: 467162 [Multi-domain]  Cd Length: 87  Bit Score: 43.81  E-value: 1.17e-04
                           10        20        30        40        50        60
                   ....*....|....*....|....*....|....*....|....*....|....*....|...
gi 2022791524 4542 SGNGLGIRVVGGKEipgssgETGAYIAKILPGGNAEQTGKLIEGMQVLEWNGIPLTGKTYEEV 4604
Cdd:cd06674     12 PGRGLGLSIVGKRN------DTGVFVSDIVKGGAADADGRLMQGDQILSVNGEDVRNASQEAA 68
ECM1 pfam05782
Extracellular matrix protein 1 (ECM1); This family consists of several eukaryotic ...
267-382 1.40e-04

Extracellular matrix protein 1 (ECM1); This family consists of several eukaryotic extracellular matrix protein 1 (ECM1) sequences. ECM1 has been shown to regulate endochondral bone formation, stimulate the proliferation of endothelial cells and induce angiogenesis. Mutations in the ECM1 gene can cause lipoid proteinosis, a disorder which causes generalized thickening of skin, mucosae and certain viscera. Classical features include beaded eyelid papules and laryngeal infiltration leading to hoarseness.


Pssm-ID: 461739  Cd Length: 518  Bit Score: 47.91  E-value: 1.40e-04
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2022791524  267 PPQPRGPQKSQPQQSEPTKPVQQQTSAKPSAGP--------AKPLPQQPDSARTSSQAPPPAKPSSQQLGPVKQPSQQpa 338
Cdd:pfam05782    9 PPQTRGLPVDHPDTSQHDPPFEGQSEVQPPPSQeaipvqeeELPPPQLPVEKKVDPPLPQEAIPLQEELPPPQLPIEQ-- 86
                           90       100       110       120
                   ....*....|....*....|....*....|....*....|....*...
gi 2022791524  339 RQGGPVKPSSQQAGPPKQPSQQPGP----EKPSAQQMGPAKQASQPGP 382
Cdd:pfam05782   87 KEIDPPFPQQEEITPSKQREEKPAPlvgqGHPEPESWNPAQHCQQGRR 134
PspC_subgroup_2 NF033839
pneumococcal surface protein PspC, LPXTG-anchored form; The pneumococcal surface protein PspC, ...
259-457 1.59e-04

pneumococcal surface protein PspC, LPXTG-anchored form; The pneumococcal surface protein PspC, as described in Streptococcus pneumoniae, is a repetitive and highly variable protein, recognized by a conserved N-terminal domain and also by genomic location. This form, subgroup 2, is anchored covalently after cleavage by sortase at a C-terminal LPXTG site. The other form, subgroup 1, has variable numbers of a choline-binding repeat in the C-terminal region, and is also known as choline-binding protein A.


Pssm-ID: 468202 [Multi-domain]  Cd Length: 557  Bit Score: 47.84  E-value: 1.59e-04
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2022791524  259 SKLEQVKQPPQPRGPQKSQPQQSEPtkpvQQQTSAKPSAGPAKPLPQQPDSARTSSQAPPPAKPSSQQLGPVKQPSQQPA 338
Cdd:NF033839   273 TKFKKGLTQDTPKEPGNKKPSAPKP----GMQPSPQPEKKEVKPEPETPKPEVKPQLEKPKPEVKPQPEKPKPEVKPQLE 348
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2022791524  339 RQGGPVKPSSQQAGPPKQPsqQPGPEKPSAQ-QMGPAKQASQPGPGKP-PLQQTGPVKQAPP-QSGPTKPPPQTAGATKA 415
Cdd:NF033839   349 TPKPEVKPQPEKPKPEVKP--QPEKPKPEVKpQPETPKPEVKPQPEKPkPEVKPQPEKPKPEvKPQPEKPKPEVKPQPEK 426
                          170       180       190       200
                   ....*....|....*....|....*....|....*....|...
gi 2022791524  416 PVPQpglTKPPGQQPEPE-KPSQQKQASATQPAESAPKKTFCP 457
Cdd:NF033839   427 PKPE---VKPQPEKPKPEvKPQPEKPKPEVKPQPETPKPEVKP 466
PRK07994 PRK07994
DNA polymerase III subunits gamma and tau; Validated
239-428 1.59e-04

DNA polymerase III subunits gamma and tau; Validated


Pssm-ID: 236138 [Multi-domain]  Cd Length: 647  Bit Score: 47.94  E-value: 1.59e-04
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2022791524  239 QQQPGEPKQIQKPGPGHPADSKLEQVKQPPQPrgpqKSQPQQSEPTKPVQQQTSAKPSAGPAKPLPQQPDSARTSSQAPP 318
Cdd:PRK07994   365 LPEPEVPPQSAAPAASAQATAAPTAAVAPPQA----PAVPPPPASAPQQAPAVPLPETTSQLLAARQQLQRAQGATKAKK 440
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2022791524  319 PAKPSSQQLGPVKQPSQQPARqggpvKPSSQQAGPPKQPSQQPGPEKPSAQQMGPAKQASQPGPGKPPLQQTgpvkQAPP 398
Cdd:PRK07994   441 SEPAAASRARPVNSALERLAS-----VRPAPSALEKAPAKKEAYRWKATNPVEVKKEPVATPKALKKALEHE----KTPE 511
                          170       180       190
                   ....*....|....*....|....*....|
gi 2022791524  399 QSGPTKPPPQTAGATKAPVPQPGLTKPPGQ 428
Cdd:PRK07994   512 LAAKLAAEAIERDPWAALVSQLGLPGLVEQ 541
PTZ00108 PTZ00108
DNA topoisomerase 2-like protein; Provisional
1223-1584 1.68e-04

DNA topoisomerase 2-like protein; Provisional


Pssm-ID: 240271 [Multi-domain]  Cd Length: 1388  Bit Score: 48.12  E-value: 1.68e-04
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2022791524 1223 LQHVKIAKASLVEKIQQEAQKEESKLQ------QEKL------SKTLSAD--KIPATVSSDQ---KKTLSKFEEDKKPLL 1285
Cdd:PTZ00108   990 LDLYKKRKEYLLGKLERELARLSNKVRfikhviNGELvitnakKKDLVKElkKLGYVRFKDIikkKSEKITAEEEEGAEE 1069
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2022791524 1286 PEKKAPHSEDKKEQIIAE------------TKDHVAERKGEvevpceelHAKKQEDAKKEgLTTGIPQM----------V 1343
Cdd:PTZ00108  1070 DDEADDEDDEEELGAAVSydyllsmpiwslTKEKVEKLNAE--------LEKKEKELEKL-KNTTPKDMwledldkfeeA 1140
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2022791524 1344 LKAEKAQEEETPVQMSRVP---RSDHVETVREKTEKEDDKSDTSSSQQQKSPQGLSDTGYSSDGISSSLGEIPSHIPSDE 1420
Cdd:PTZ00108  1141 LEEQEEVEEKEIAKEQRLKsktKGKASKLRKPKLKKKEKKKKKSSADKSKKASVVGNSKRVDSDEKRKLDDKPDNKKSNS 1220
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2022791524 1421 KDLLRESSKKDTVSQESPPSPSDLAKLESTVLSileAQASTLSDEKSVKKKELYETYSEQTKDQHKTKPLPVTPESYSSD 1500
Cdd:PTZ00108  1221 SGSDQEDDEEQKTKPKKSSVKRLKSKKNNSSKS---SEDNDEFSSDDLSKEGKPKNAPKRVSAVQYSPPPPSKRPDGESN 1297
                          330       340       350       360       370       380       390       400
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2022791524 1501 EEDLEGEGTIVEDGKGSASSQAEYKEGDGEDDIPARRQRYDS-VEDSS-ESENSPVPRRKRRGSVGSSSSDEYKRDDSQG 1578
Cdd:PTZ00108  1298 GGSKPSSPTKKKVKKRLEGSLAALKKKKKSEKKTARKKKSKTrVKQASaSQSSRLLRRPRKKKSDSSSEDDDDSEVDDSE 1377

                   ....*.
gi 2022791524 1579 SGDEED 1584
Cdd:PTZ00108  1378 DEDDED 1383
PspC_subgroup_2 NF033839
pneumococcal surface protein PspC, LPXTG-anchored form; The pneumococcal surface protein PspC, ...
208-448 1.73e-04

pneumococcal surface protein PspC, LPXTG-anchored form; The pneumococcal surface protein PspC, as described in Streptococcus pneumoniae, is a repetitive and highly variable protein, recognized by a conserved N-terminal domain and also by genomic location. This form, subgroup 2, is anchored covalently after cleavage by sortase at a C-terminal LPXTG site. The other form, subgroup 1, has variable numbers of a choline-binding repeat in the C-terminal region, and is also known as choline-binding protein A.


Pssm-ID: 468202 [Multi-domain]  Cd Length: 557  Bit Score: 47.84  E-value: 1.73e-04
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2022791524  208 PAQQQSPKPVQPQGHVKPTPQQSESSKAVPQQQQPGE----------PKQIQKPGPGHPADSKLEQVKQPPQPRGPQKSQ 277
Cdd:NF033839   164 PENPEHQKPTTPAPDTKPSPQPEGKKPSVPDINQEKEkaklavatymSKILDDIQKHHLQKEKHRQIVALIKELDELKKQ 243
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2022791524  278 --PQQSEPTKPVQQQTSAKPSAGPAKPLPQQPDSARTSSQAPPPAKPSSQQLGPVKQPSQQParQGGPVKPSSQQAGPPK 355
Cdd:NF033839   244 alSEIDNVNTKVEIENTVHKIFADMDAVVTKFKKGLTQDTPKEPGNKKPSAPKPGMQPSPQP--EKKEVKPEPETPKPEV 321
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2022791524  356 QPSQQ-------PGPEKPSAQ---QMGPAKQASQPGPGKPP----LQQTGPVKQAPPQSGPTKPPPQTAGATKAPVPQPG 421
Cdd:NF033839   322 KPQLEkpkpevkPQPEKPKPEvkpQLETPKPEVKPQPEKPKpevkPQPEKPKPEVKPQPETPKPEVKPQPEKPKPEVKPQ 401
                          250       260
                   ....*....|....*....|....*...
gi 2022791524  422 LTKP-PGQQPEPEKPSQQKQASATQPAE 448
Cdd:NF033839   402 PEKPkPEVKPQPEKPKPEVKPQPEKPKP 429
FimV COG3170
Type IV pilus assembly protein FimV [Cell motility, Extracellular structures];
198-436 1.74e-04

Type IV pilus assembly protein FimV [Cell motility, Extracellular structures];


Pssm-ID: 442403 [Multi-domain]  Cd Length: 508  Bit Score: 47.87  E-value: 1.74e-04
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2022791524  198 PEEQRGSAKLPAQQQSPKPVQP----QGHVKPTPQQSESSKAVPQQQQPGEP-----KQIQKPGPGHPADSKLEQVKQP- 267
Cdd:COG3170    106 PPAYAAAAAAPAAAPAPAPAAPaaaaAAADQPAAEAAPAASGEYYPVRPGDTlwsiaARPVRPSSGVSLDQMMVALYRAn 185
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2022791524  268 PQ--PRGPQKSQPQQSEPTKPVQQQTSAKPSAGPAKPLPQQPDSARTSSQ-------APPPAKPSSQQLGPVKQPSQQPA 338
Cdd:COG3170    186 PDafIDGNINRLKAGAVLRVPAAEEVAALSPAEARQEVQAQSADWAAYRArlaaavePAPAAAAPAAPPAAAAAAGPVPA 265
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2022791524  339 RQGGPVKPSSQQAGPPKQPSQQPGPEKPSAQQMgpAKQASQPGpgkpPLQQTGPVKQAPPQSGPTKPPPQTAGATKAPVP 418
Cdd:COG3170    266 AAEDTLSPEVTAAAAAEEADALPEAAAELAERL--AALEAQLA----ELQRLLALKNPAPAAAVSAPAAAAAAATVEAAA 339
                          250
                   ....*....|....*...
gi 2022791524  419 QPGLTKPPGQQPEPEKPS 436
Cdd:COG3170    340 PAAAAQPAAAAPAPALDN 357
PRK14951 PRK14951
DNA polymerase III subunits gamma and tau; Provisional
278-405 1.87e-04

DNA polymerase III subunits gamma and tau; Provisional


Pssm-ID: 237865 [Multi-domain]  Cd Length: 618  Bit Score: 47.79  E-value: 1.87e-04
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2022791524  278 PQQSEPTKPV-QQQTSAKPSAGPAKPLPQQPDSARTSSQAPPPAkPSSQQLGPVKQPSQQPARQGGPVKPSSQQAGPPKQ 356
Cdd:PRK14951   366 PAAAAEAAAPaEKKTPARPEAAAPAAAPVAQAAAAPAPAAAPAA-AASAPAAPPAAAPPAPVAAPAAAAPAAAPAAAPAA 444
                           90       100       110       120
                   ....*....|....*....|....*....|....*....|....*....
gi 2022791524  357 PSQQPGPEKPSAQQMGPAKQASQPGPGKPPLQQTGPVKQAPPQSGPTKP 405
Cdd:PRK14951   445 VALAPAPPAQAAPETVAIPVRVAPEPAVASAAPAPAAAPAAARLTPTEE 493
PDZ3_MUPP1-like cd06791
PDZ domain 3 of multi-PDZ-domain protein 1 (MUPP1) and PATJ (protein-associated tight junction) ...
4543-4604 2.26e-04

PDZ domain 3 of multi-PDZ-domain protein 1 (MUPP1) and PATJ (protein-associated tight junction) and related domains; PDZ (PSD-95 (Postsynaptic density protein 95), Dlg (Discs large protein), and ZO-1 (Zonula occludens-1)) domain 3 of MUPP1 and PATJ, and related domains. MUPP1 and PATJ serve as scaffolding proteins linking different proteins and protein complexes involved in the organization of tight junctions and epithelial polarity. MUPP1 contains an L27 (Lin-2 and Lin-7 binding) domain and 13 PDZ domains. PATJ (also known as INAD-like) contains an L27 domain and ten PDZ domains. MUPP1 and PATJ share several binding partners, including junctional adhesion molecules (JAM), zonula occludens (ZO)-3, Pals1 (protein associated with Lin-7), Par (partitioning defective)-6 proteins, and nectins (adherence junction adhesion molecules). PATJ lacks 3 PDZ domains seen in MUPP1: PDZ6, 9, and 13; consequently, MUPP1 PDZ7 and 8 align with PATJ PDZ6 and 7; and MUPP1 PDZ domains 10-12 align with PATJ PDZ domains 8-10. PDZ domains usually bind in a sequence-specific manner to short peptide sequences located at the C-terminal end of their partner proteins (known as PDZ binding motifs). The PDZ superfamily includes canonical PDZ domains as well as those with circular permutations and domain swapping mediated by beta-strands. This MUPP1-like family domain is a canonical PDZ domain containing six beta-strands A-F and two alpha-helices (alpha-helix 1 and 2), arranged in the order: beta-strands A, B, C, alpha-helix 1, beta-strands D, E, alpha-helix 2 and beta-strand F.


Pssm-ID: 467253 [Multi-domain]  Cd Length: 89  Bit Score: 42.99  E-value: 2.26e-04
                           10        20        30        40        50        60
                   ....*....|....*....|....*....|....*....|....*....|....*....|..
gi 2022791524 4543 GNGLGIRVVGGKEIPGSSGETGAYIAKILPGGNAEQTGKLIEGMQVLEWNGIPLTGKTYEEV 4604
Cdd:cd06791     11 EQGLGITIAGYVGEKASGELSGIFVKSIIPGSAADQDGRIQVNDQIIAVDGVNLQGFTNQEA 72
Streccoc_I_II NF033804
antigen I/II family LPXTG-anchored adhesin; Members of the antigen I/II family are adhesins ...
192-323 2.29e-04

antigen I/II family LPXTG-anchored adhesin; Members of the antigen I/II family are adhesins with a glucan-binding domain, two types of repetitive regions, an isopeptide bond-forming domain associated with shear resistance, and a C-terminal LPXTG motif for anchoring to the cell wall. They occur in oral Streptococci, and tend to be major cell surface adhesins. Members of this family include SspA and SspB from Streptococcus gordonii, antigen I/II from S. mutans, etc.


Pssm-ID: 468188 [Multi-domain]  Cd Length: 1552  Bit Score: 47.63  E-value: 2.29e-04
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2022791524  192 QKEQGKPEEQRGSAKLPAQqqsPKPVQPQGHVKPTPQQSESSKAVPQQqqPGEPK-QIQKPGPGHPADSKLEQVKQPPQP 270
Cdd:NF033804   874 QPEPSKPEEPTYETEKPLE---PAPVAPTYENEPTPPVKTPDQPEPSK--PEEPTyETEKPLEPAPVAPSYENEPTPPVK 948
                           90       100       110       120       130
                   ....*....|....*....|....*....|....*....|....*....|....
gi 2022791524  271 rgpqksQPQQSEPTKPVQqqtsakPSagpAKPLPQQPdSARTSSQAP-PPAKPS 323
Cdd:NF033804   949 ------TPDQPEPSKPVE------PT---YDPLPTPP-VAPTPKQLPtPPAVPT 986
PDZ1_harmonin cd06737
PDZ domain 1 of harmonin isoforms a, b, and c, and related domains; PDZ (PSD-95 (Postsynaptic ...
4529-4612 2.42e-04

PDZ domain 1 of harmonin isoforms a, b, and c, and related domains; PDZ (PSD-95 (Postsynaptic density protein 95), Dlg (Discs large protein), and ZO-1 (Zonula occludens-1)) domain 1 of harmonin isoforms a, b, and c, and related domains. Harmonin (also known as Usher Type 1C, PDZ-73 and AIE-75) is a key organizer of the Usher (USH) protein interactome. USH syndrome is the leading cause of hereditary sensory deaf-blindness in humans; three clinically distinct types of USH have been identified, type 1 to 3. The gene encoding harmonin (USH1C) is the causative gene for the USH type 1C phenotype. There are at least 10 alternatively spliced isoforms of harmonin, which are divided into three subclasses (a, b, and c). All isoforms contain the first two PDZ domains and the first coiled-coil domain. The a and b isoforms all have a third PDZ domain. The different PDZ domains are responsible for interactions with all known Usher syndrome type 1 proteins, and most Usher syndrome type 2 proteins. PDZ domains usually bind in a sequence-specific manner to short peptide sequences located at the C-terminal end of their partner proteins (known as PDZ binding motifs). The PDZ superfamily includes canonical PDZ domains as well as those with circular permutations and domain swapping mediated by beta-strands. This harmonin family PDZ1 domain is a canonical PDZ domain containing six beta-strands A-F and two alpha-helices (alpha-helix 1 and 2), arranged in the order: beta-strands A, B, C, alpha-helix 1, beta-strands D, E, alpha-helix 2 and beta-strand F.


Pssm-ID: 467219 [Multi-domain]  Cd Length: 85  Bit Score: 42.63  E-value: 2.42e-04
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2022791524 4529 RLKLLRDPKDHtvsGNGLGIRVVGGKEIpgssgETGAYIAKILPGGNAEQTGkLIEGMQVLEWNGIPLTGKTYEEVQNII 4608
Cdd:cd06737      1 KLRLVRLDRRG---PESLGFSVRGGLEH-----GCGLFVSHVSPGSQADNKG-LRVGDEIVRINGYSISQCTHEEVINLI 71

                   ....
gi 2022791524 4609 IQQS 4612
Cdd:cd06737     72 KTKK 75
Amelogenin smart00818
Amelogenins, cell adhesion proteins, play a role in the biomineralisation of teeth; They seem ...
207-330 2.61e-04

Amelogenins, cell adhesion proteins, play a role in the biomineralisation of teeth; They seem to regulate formation of crystallites during the secretory stage of tooth enamel development and are thought to play a major role in the structural organisation and mineralisation of developing enamel. The extracellular matrix of the developing enamel comprises two major classes of protein: the hydrophobic amelogenins and the acidic enamelins. Circular dichroism studies of porcine amelogenin have shown that the protein consists of 3 discrete folding units: the N-terminal region appears to contain beta-strand structures, while the C-terminal region displays characteristics of a random coil conformation. Subsequent studies on the bovine protein have indicated the amelogenin structure to contain a repetitive beta-turn segment and a "beta-spiral" between Gln112 and Leu138, which sequester a (Pro, Leu, Gln) rich region. The beta-spiral offers a probable site for interactions with Ca2+ ions. Muatations in the human amelogenin gene (AMGX) cause X-linked hypoplastic amelogenesis imperfecta, a disease characterised by defective enamel. A 9bp deletion in exon 2 of AMGX results in the loss of codons for Ile5, Leu6, Phe7 and Ala8, and replacement by a new threonine codon, disrupting the 16-residue (Met1-Ala16) amelogenin signal peptide.


Pssm-ID: 197891 [Multi-domain]  Cd Length: 165  Bit Score: 44.78  E-value: 2.61e-04
                            10        20        30        40        50        60        70        80
                    ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2022791524   207 LPAQQQSP--KPVQPQGHVKPTPQQsesskavpqqqQPGEPKQIQKPGPGHPA--------DSKLEQVKQPPQPRGPQKS 276
Cdd:smart00818   40 IPVSQQHPptHTLQPHHHIPVLPAQ-----------QPVVPQQPLMPVPGQHSmtptqhhqPNLPQPAQQPFQPQPLQPP 108
                            90       100       110       120       130
                    ....*....|....*....|....*....|....*....|....*....|....
gi 2022791524   277 QPQQSEPTKPVQQqtsakpsagPAKPLPQQPdsartssqaPPPAKPSSQQLGPV 330
Cdd:smart00818  109 QPQQPMQPQPPVH---------PIPPLPPQP---------PLPPMFPMQPLPPL 144
PRK10263 PRK10263
DNA translocase FtsK; Provisional
198-303 2.86e-04

DNA translocase FtsK; Provisional


Pssm-ID: 236669 [Multi-domain]  Cd Length: 1355  Bit Score: 47.39  E-value: 2.86e-04
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2022791524  198 PEEQRGSAKLPAQQQSPKPVQPQGHVKPTPQQSESSKAVPQQQQPGEPK-QIQKPGPGHPADSKLEQVKQPPQPRgPQKS 276
Cdd:PRK10263   740 PHEPLFTPIVEPVQQPQQPVAPQQQYQQPQQPVAPQPQYQQPQQPVAPQpQYQQPQQPVAPQPQYQQPQQPVAPQ-PQYQ 818
                           90       100
                   ....*....|....*....|....*..
gi 2022791524  277 QPQQSEPTKPVQQQTSAKPSAGPAKPL 303
Cdd:PRK10263   819 QPQQPVAPQPQYQQPQQPVAPQPQDTL 845
PLN03209 PLN03209
translocon at the inner envelope of chloroplast subunit 62; Provisional
176-386 2.88e-04

translocon at the inner envelope of chloroplast subunit 62; Provisional


Pssm-ID: 178748 [Multi-domain]  Cd Length: 576  Bit Score: 47.23  E-value: 2.88e-04
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2022791524  176 DSDTAHEEAGRKQKVIQKEQGKPEEQRGSAKlpAQQQSPKPVQP-------QGHVKPTPQQSESSKAVPQQ----QQPGE 244
Cdd:PLN03209   332 ESDAADGPKPVPTKPVTPEAPSPPIEEEPPQ--PKAVVPRPLSPytayedlKPPTSPIPTPPSSSPASSKSvdavAKPAE 409
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2022791524  245 PKQIQKPGP--GHPADSKLEQVKQPPQPRGPQKSQPQQSEPTKPV-QQQTSAKPSAGPAKPLPQQPDSA----RTSSQAP 317
Cdd:PLN03209   410 PDVVPSPGSasNVPEVEPAQVEAKKTRPLSPYARYEDLKPPTSPSpTAPTGVSPSVSSTSSVPAVPDTApataATDAAAP 489
                          170       180       190       200       210       220       230
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*...
gi 2022791524  318 PPAKPSSQQLGPV----KQP-SQQPARQGGPVKPSSQQAGPPKQPSQQPGPEKPSAQQMGPAKQASQPGPG----KPP 386
Cdd:PLN03209   490 PPANMRPLSPYAVyddlKPPtSPSPAAPVGKVAPSSTNEVVKVGNSAPPTALADEQHHAQPKPRPLSPYTMyedlKPP 567
PHA03307 PHA03307
transcriptional regulator ICP4; Provisional
259-438 2.98e-04

transcriptional regulator ICP4; Provisional


Pssm-ID: 223039 [Multi-domain]  Cd Length: 1352  Bit Score: 47.47  E-value: 2.98e-04
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2022791524  259 SKLEQVKQPPQPRGPQksqpQQSEPTKPVQQQTSAKPSAGPAKPLPQQPDSARTSSQA---PPPAKPSSQQLGPVKQP-- 333
Cdd:PHA03307   766 AKLAEALALLEPAEPQ----RGAGSSPPVRAEAAFRRPGRLRRSGPAADAASRTASKRksrSHTPDGGSESSGPARPPga 841
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2022791524  334 --SQQPARQGGPVKPSSQQAGPP---KQPSQQPGPEKPSAQQMGPAKQASQPGPGKPPLQQTGPVKQAPPQSGPTKPppq 408
Cdd:PHA03307   842 aaRPPPARSSESSKSKPAAAGGRargKNGRRRPRPPEPRARPGAAAPPKAAAAAPPAGAPAPRPRPAPRVKLGPMPP--- 918
                          170       180       190
                   ....*....|....*....|....*....|
gi 2022791524  409 tagatKAPVPQPGLTKPPGQQPEPEKPSQQ 438
Cdd:PHA03307   919 -----GGPDPRGGFRRVPPGDLHTPAPSAA 943
PRK11633 PRK11633
cell division protein DedD; Provisional
288-396 3.17e-04

cell division protein DedD; Provisional


Pssm-ID: 236940 [Multi-domain]  Cd Length: 226  Bit Score: 45.38  E-value: 3.17e-04
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2022791524  288 QQQTSAKPSAGPakPLPQQP-DSARTSSQAPPPAKPSSQqlgpvkqPSQQPARQGgPVKPSSQQAGPPKQPSQQPGPEKP 366
Cdd:PRK11633    50 RDEPDMMPAATQ--ALPTQPpEGAAEAVRAGDAAAPSLD-------PATVAPPNT-PVEPEPAPVEPPKPKPVEKPKPKP 119
                           90       100       110
                   ....*....|....*....|....*....|
gi 2022791524  367 SAQQMGPAKQASQPGPGKPPLQQTGPVKQA 396
Cdd:PRK11633   120 KPQQKVEAPPAPKPEPKPVVEEKAAPTGKA 149
PRK08691 PRK08691
DNA polymerase III subunits gamma and tau; Validated
201-408 3.39e-04

DNA polymerase III subunits gamma and tau; Validated


Pssm-ID: 236333 [Multi-domain]  Cd Length: 709  Bit Score: 47.01  E-value: 3.39e-04
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2022791524  201 QRGSAKLPAQQQSPKPVqPQGHVKPTPQQSESSKAVPQQQQPGEPKQIQKPGPGHPADSKLEQVKQP-PQPRGPQKSQPQ 279
Cdd:PRK08691   381 SAQTAEKETAAKKPQPR-PEAETAQTPVQTASAAAMPSEGKTAGPVSNQENNDVPPWEDAPDEAQTAaGTAQTSAKSIQT 459
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2022791524  280 QSEPTKPVQQQTSAKPSAGPAKPLPQQPDSARTSSQAPPPAKPSSQQLGPVKQPSQQPARQGGPvkpssQQAGPPKQPSQ 359
Cdd:PRK08691   460 ASEAETPPENQVSKNKAADNETDAPLSEVPSENPIQATPNDEAVETETFAHEAPAEPFYGYGFP-----DNDCPPEDGAE 534
                          170       180       190       200       210
                   ....*....|....*....|....*....|....*....|....*....|....
gi 2022791524  360 QPGPE----KPSAQQMGPAKQASQPGP-GKPPLqqtgpvkqappqsgPTKPPPQ 408
Cdd:PRK08691   535 IPPPDwehaAPADTAGGGADEEAEAGGiGGNNT--------------PSAPPPE 574
PHA03307 PHA03307
transcriptional regulator ICP4; Provisional
235-415 3.86e-04

transcriptional regulator ICP4; Provisional


Pssm-ID: 223039 [Multi-domain]  Cd Length: 1352  Bit Score: 47.09  E-value: 3.86e-04
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2022791524  235 AVPQQQQPGEPKQIQKPGPGHPADSKLEQVKQPPQPRGPQKSQPQQSEPTKPVQQQTSAKPSAGPAKPLPQQPDSARTSS 314
Cdd:PHA03307   778 AEPQRGAGSSPPVRAEAAFRRPGRLRRSGPAADAASRTASKRKSRSHTPDGGSESSGPARPPGAAARPPPARSSESSKSK 857
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2022791524  315 QAPPPAKPSSQQLGPVKQPSQQPARQGGPVKPSSqQAGPPKQPSQQPGPEKPSAQQMGPakqasQPGPGKPPLqqtGPVK 394
Cdd:PHA03307   858 PAAAGGRARGKNGRRRPRPPEPRARPGAAAPPKA-AAAAPPAGAPAPRPRPAPRVKLGP-----MPPGGPDPR---GGFR 928
                          170       180
                   ....*....|....*....|.
gi 2022791524  395 QAPPqsGPTKPPPQTAGATKA 415
Cdd:PHA03307   929 RVPP--GDLHTPAPSAAALAA 947
PHA03307 PHA03307
transcriptional regulator ICP4; Provisional
251-420 3.93e-04

transcriptional regulator ICP4; Provisional


Pssm-ID: 223039 [Multi-domain]  Cd Length: 1352  Bit Score: 47.09  E-value: 3.93e-04
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2022791524  251 PGPGHPADSKLEQVKQPPQPRGPQKSQPQQSEPTKPVQQQTS--AKPSAGPAKPLPQQPDSARTSSQAPPPAKPSSQqlg 328
Cdd:PHA03307   775 LEPAEPQRGAGSSPPVRAEAAFRRPGRLRRSGPAADAASRTAskRKSRSHTPDGGSESSGPARPPGAAARPPPARSS--- 851
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2022791524  329 PVKQPSQQPARQGGPVKPSSQQAGPPKQPSQQPGPEKPSAqqmgPAKQASQPGPGKPPLQQTGPVKQAPPQSGPT----- 403
Cdd:PHA03307   852 ESSKSKPAAAGGRARGKNGRRRPRPPEPRARPGAAAPPKA----AAAAPPAGAPAPRPRPAPRVKLGPMPPGGPDprggf 927
                          170
                   ....*....|....*...
gi 2022791524  404 -KPPPqtaGATKAPVPQP 420
Cdd:PHA03307   928 rRVPP---GDLHTPAPSA 942
PRK07994 PRK07994
DNA polymerase III subunits gamma and tau; Validated
210-365 4.02e-04

DNA polymerase III subunits gamma and tau; Validated


Pssm-ID: 236138 [Multi-domain]  Cd Length: 647  Bit Score: 46.78  E-value: 4.02e-04
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2022791524  210 QQQSPKPVQPQGHVKPTPQQSESSKAVPQQQQPgEPKQIQKPGPGHPADskleqvkQPPQPRGPQKSQPQQSEPTKPVQQ 289
Cdd:PRK07994   362 AAPLPEPEVPPQSAAPAASAQATAAPTAAVAPP-QAPAVPPPPASAPQQ-------APAVPLPETTSQLLAARQQLQRAQ 433
                           90       100       110       120       130       140       150
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*.
gi 2022791524  290 QTSAKPSAGPAKPLPQQPDSARTSSQAPPPAKPSSQQLGPVKQPSqQPARQGGPVKPSSQQAGPPKQPSQQPGPEK 365
Cdd:PRK07994   434 GATKAKKSEPAAASRARPVNSALERLASVRPAPSALEKAPAKKEA-YRWKATNPVEVKKEPVATPKALKKALEHEK 508
PspC_subgroup_1 NF033838
pneumococcal surface protein PspC, choline-binding form; The pneumococcal surface protein PspC, ...
175-271 4.05e-04

pneumococcal surface protein PspC, choline-binding form; The pneumococcal surface protein PspC, as described in Streptococcus pneumoniae, is a repetitive and highly variable protein, recognized by a conserved N-terminal domain and also by genomic location. This form, subgroup 1, has variable numbers of a choline-binding repeat in the C-terminal region, and is also known as choline-binding protein A. The other form, subgroup 2, is anchored covalently after cleavage by sortase at a C-terminal LPXTG site.


Pssm-ID: 468201 [Multi-domain]  Cd Length: 684  Bit Score: 46.54  E-value: 4.05e-04
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2022791524  175 SDSDTAHEEAGRKQKVIQKEQGKPEEQRGSAKLPA-QQQSPKPVQPQGH---VKPTPQQSESSKAVPQQQQPGEPKQIQK 250
Cdd:NF033838   393 TDRKKAEEEAKRKAAEEDKVKEKPAEQPQPAPAPQpEKPAPKPEKPAEQpkaEKPADQQAEEDYARRSEEEYNRLTQQQP 472
                           90       100
                   ....*....|....*....|.
gi 2022791524  251 PGPGHPAdskleqvkQPPQPR 271
Cdd:NF033838   473 PKTEKPA--------QPSTPK 485
PRK14951 PRK14951
DNA polymerase III subunits gamma and tau; Provisional
202-330 4.63e-04

DNA polymerase III subunits gamma and tau; Provisional


Pssm-ID: 237865 [Multi-domain]  Cd Length: 618  Bit Score: 46.63  E-value: 4.63e-04
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2022791524  202 RGSAKLPAQQQSPKPVQPQGhvkPTPQQSESSKAVPQQQQPGEPKQIQKPGPGHPAdskleqvkqPPQPRGPQKSQPQQS 281
Cdd:PRK14951   383 RPEAAAPAAAPVAQAAAAPA---PAAAPAAAASAPAAPPAAAPPAPVAAPAAAAPA---------AAPAAAPAAVALAPA 450
                           90       100       110       120       130
                   ....*....|....*....|....*....|....*....|....*....|.
gi 2022791524  282 EPTkpvqqQTSAKPSAGPAK--PLPQQPDSARTSSQAPPPAKPSSQQLGPV 330
Cdd:PRK14951   451 PPA-----QAAPETVAIPVRvaPEPAVASAAPAPAAAPAAARLTPTEEGDV 496
PHA03247 PHA03247
large tegument protein UL36; Provisional
29-426 4.91e-04

large tegument protein UL36; Provisional


Pssm-ID: 223021 [Multi-domain]  Cd Length: 3151  Bit Score: 46.86  E-value: 4.91e-04
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2022791524   29 GPAAPLQRLVPAGVEADLSQLSEEERRQIAAVMSRAQGLPRGNLAGAEPPPMQRHPELDTSHHPRQPSKPPDPG--PGLS 106
Cdd:PHA03247  2732 SPALPAAPAPPAVPAGPATPGGPARPARPPTTAGPPAPAPPAAPAAGPPRRLTRPAVASLSESRESLPSPWDPAdpPAAV 2811
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2022791524  107 KSRTVDVLKTEQRAPGRSPSSISLRESKSRTdfredqkssmmPSFLSEANPLSAvtSVVnkfnpfdlisdsdtaheeagr 186
Cdd:PHA03247  2812 LAPAAALPPAASPAGPLPPPTSAQPTAPPPP-----------PGPPPPSLPLGG--SVA--------------------- 2857
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2022791524  187 kqkviqkeQGKPEEQRGsaklPAQQQSPKPVQP-----QGHVKPTPQQSESSKAVPQQQQPgEPKQIQKPGPGHPAdskl 261
Cdd:PHA03247  2858 --------PGGDVRRRP----PSRSPAAKPAAParppvRRLARPAVSRSTESFALPPDQPE-RPPQPQAPPPPQPQ---- 2920
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2022791524  262 EQVKQPPQPRGPQKSQPQQSEPTKPvQQQTSAKPSAGPAKPLPQQPDSARTSSQAPPPAKPSSQQLGPVKQPSQQPARQG 341
Cdd:PHA03247  2921 PQPPPPPQPQPPPPPPPRPQPPLAP-TTDPAGAGEPSGAVPQPWLGALVPGRVAVPRFRVPQPAPSREAPASSTPPLTGH 2999
                          330       340       350       360       370       380       390       400
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2022791524  342 GPVKPSSQQAGPPKQPSQQPGPEK-------PSAQQMGPAKQASQPGPGKPPLQQTGPVKQAP--PQSGPTKPPPQTAGA 412
Cdd:PHA03247  3000 SLSRVSSWASSLALHEETDPPPVSlkqtlwpPDDTEDSDADSLFDSDSERSDLEALDPLPPEPhdPFAHEPDPATPEAGA 3079
                          410
                   ....*....|....
gi 2022791524  413 TKAPVPQPGltkPP 426
Cdd:PHA03247  3080 RESPSSQFG---PP 3090
PDZ5_MAGI-1_3-like cd06735
PDZ domain 5 of membrane-associated guanylate kinase inverted 1 (MAGI-1), MAGI-2, and MAGI-3, ...
4532-4613 4.94e-04

PDZ domain 5 of membrane-associated guanylate kinase inverted 1 (MAGI-1), MAGI-2, and MAGI-3, and related domains; PDZ (PSD-95 (Postsynaptic density protein 95), Dlg (Discs large protein), and ZO-1 (Zonula occludens-1)) domain 5 of MAGI1, 2, 3 (MAGI is also known as Membrane-associated guanylate kinase, WW and PDZ domain-containing protein) and related domains. MAGI proteins have been implicated in the control of cell migration and invasion through altering the activity of phosphatase and tensin homolog (PTEN) and modulating Akt signaling. Four MAGI proteins have been identified (MAGI1-3 and MAGIX). MAGI1-3 have 6 PDZ domains and bind to the C-terminus of PTEN via their PDZ2 domain. MAGIX has a single PDZ domain that is related to MAGI1-3 PDZ domain 5, and belongs to this MAGI1,2,3-like family. Other binding partners for MAGI1 include JAM4, C-terminal tail of high risk HPV-18 E6, megalin, TRAF6, Kir4.1 (basolateral K+ channel subunit), and cadherin 23; for MAGI2, include DASM1, dendrin, axin, beta- and delta-catenin, neuroligin, hyperpolarization-activated cation channels, beta1-adrenergic receptors, NMDA receptor, and TARPs; and for MAGI3 includes LPA2. PDZ domains usually bind in a sequence-specific manner to short peptide sequences located at the C-terminal end of their partner proteins (known as PDZ binding motifs). The PDZ superfamily includes canonical PDZ domains as well as those with circular permutations and domain swapping mediated by beta-strands. This MAGI family PDZ5 domain is a canonical PDZ domain containing six beta-strands A-F and two alpha-helices (alpha-helix 1 and 2), arranged as beta-strands A, B, C, alpha-helix 1, beta-strands D, E, alpha-helix 2 and beta-strand F.


Pssm-ID: 467217 [Multi-domain]  Cd Length: 84  Bit Score: 41.80  E-value: 4.94e-04
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2022791524 4532 LLRDPKdhtvsGNGLGIRvvGGKEipgsSGETGAYIAKILPGGNAEQTGKLIEGMQVLEWNGIPLTGKTYEEVQNiIIQQ 4611
Cdd:cd06735      6 LERGPK-----GFGFSIR--GGRE----YNNMPLYVLRLAEDGPAQRDGRLRVGDQILEINGESTQGMTHAQAIE-LIRS 73

                   ..
gi 2022791524 4612 SG 4613
Cdd:cd06735     74 GG 75
PDZ3_LNX1_2-like cd06679
PDZ domain 3 of human Ligand of Numb protein X 1 (LNX1) and LNX2, and related domains; PDZ ...
4546-4617 4.95e-04

PDZ domain 3 of human Ligand of Numb protein X 1 (LNX1) and LNX2, and related domains; PDZ (PSD-95 (Postsynaptic density protein 95), Dlg (Discs large protein), and ZO-1 (Zonula occludens-1)) domain 3 of LNX1 (also known as PDZ domain-containing RING finger protein 2, PDZRN2) and LNX2 (also known as PDZ domain-containing RING finger protein 1, PDZRN1), and related domains. LNX1 and LNX2 are Ring (Really Interesting New Gene) finger and PDZ domain-containing E3 ubiquitin ligases that bind to the cell fate determinant protein NUMB and mediate its ubiquitination. LNX1 can ubiquitinate a number of other ligands including PPFIA1, KLHL11, KIF7 and ERC2. LNX1 and LNX2 each have four PDZ domains. PDZ domains usually bind in a sequence-specific manner to short peptide sequences located at the C-terminal end of their partner proteins (known as PDZ binding motifs). The PDZ superfamily includes canonical PDZ domains as well as those with circular permutations and domain swapping mediated by beta-strands. This LNX family PDZ3 domain is a canonical PDZ domain containing six beta-strands A-F and two alpha-helices (alpha-helix 1 and 2), arranged in the order: beta-strands A, B, C, alpha-helix 1, beta-strands D, E, alpha-helix 2 and beta-strand F.


Pssm-ID: 467167 [Multi-domain]  Cd Length: 88  Bit Score: 41.85  E-value: 4.95e-04
                           10        20        30        40        50        60        70
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|..
gi 2022791524 4546 LGIRVVGGKEIPgsSGETGAYIAKILPGGNAEQTGKLIEGMQVLEWNGIPLTGKTYEEVQNIIIQQSGEAEI 4617
Cdd:cd06679     13 LGISVAGGRGSR--RGDLPIYVTNVQPDGCLGRDGRIKKGDVLLSINGISLTNLSHSEAVAVLKASAASSSI 82
AF-4 pfam05110
AF-4 proto-oncoprotein N-terminal region; This family consists of AF4 (Proto-oncogene AF4) and ...
172-373 5.24e-04

AF-4 proto-oncoprotein N-terminal region; This family consists of AF4 (Proto-oncogene AF4) and FMR2 (Fragile X E mental retardation syndrome) nuclear proteins. These proteins have been linked to human diseases such as acute lymphoblastic leukaemia and mental retardation. The family also contains a Drosophila AF4 protein homolog Lilliputian which contains an AT-hook domain. Lilliputian represents a novel pair-rule gene that acts in cytoskeleton regulation, segmentation and morphogenesis in Drosophila.


Pssm-ID: 461550 [Multi-domain]  Cd Length: 514  Bit Score: 46.27  E-value: 5.24e-04
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2022791524  172 DLISDSDTAHEEAGRKQKVIQKEQGKPE-----EQRGSAKLPAQQQSP-KPVQPQGHVKPTPQQSESSKAVPQQQQPGEP 245
Cdd:pfam05110   62 ELITNKSNQHLVGIPKNSVPQTPQEKPDqpffpDKTSGLPPSFHTSSHsQPMGPPSSSSPSVSSSQSQKKSQARTEPAHG 141
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2022791524  246 KQIQKPGPGHPADSKLEQVKQPPQPRGPQKSQPQQSEPTKPVQQQTSAKPSA---------GPAKPLPQQPDSARTSSQA 316
Cdd:pfam05110  142 GHSSSGSQSSQRSQGQSRSKGGQESHSSSHHKRQERREDLFSCASLSHSLEElspllsslsSPVKPLSPSHSRQHTGSKA 221
                          170       180       190       200       210       220
                   ....*....|....*....|....*....|....*....|....*....|....*....|.
gi 2022791524  317 PPPAKPSSQQLGPVKQPSQQPARQGGPVKPSSQQAGPPKQPSQQ----PGPEKPSAQQMGP 373
Cdd:pfam05110  222 QNSSDHHGKEYSHSKSPRDSEAGSHGPESPSTSLLASSSQLSSQtfppSLPSKTSAMQQKP 282
PDZ1_Dlg1-2-4-like cd06723
PDZ domain 1 of human discs large homolog 1 (Dlg1), Dlg2, and Dlg4, Drosophila disc large (Dlg) ...
4543-4582 5.52e-04

PDZ domain 1 of human discs large homolog 1 (Dlg1), Dlg2, and Dlg4, Drosophila disc large (Dlg), and related domains; PDZ (PSD-95 (Postsynaptic density protein 95), Dlg (Discs large protein), and ZO-1 (Zonula occludens-1)) domain 1 of Drosophila Dlg1, human Dlg1,2, and 4 and related domains. Dlg1 (also known as synapse-associated protein Dlg197 or SAP-97), Dlg2 (also known as channel-associated protein of synapse-110, postsynaptic density protein 93, or PSD-93), Dlg4 (also known as postsynaptic density protein 95, PSD-95, synapse-associated protein 90, or SAP-90) each have 3 PDZ domains and belong to the membrane-associated guanylate kinase family. Dlg1 regulates antigen receptor signaling and cell polarity in lymphocytes, B-cell proliferation and antibody production, and TGFalpha bioavailability; its PDZ3 domain binds pro-TGFalpha, and its PDZ2 domain binds the TACE metalloprotease responsible for cleaving pro-TGFalpha to a soluble form. Dlg2 is involved in N-methyl-D-aspartate (NMDA) receptor signaling. It regulates surface expression of NMDA receptors in dorsal horn neurons of the spinal cord, and it also interacts with NMDA receptor subunits and with Shaker-type K+ channel subunits to cluster into a channel complex. Dlg4 PDZ1 domain binds NMDA receptors, and its PDZ2 domain binds neuronal nitric oxide synthase (nNOS), forming a complex in neurons. The Drosophila Scribble complex (Scribble, Dlg, and lethal giant larvae) plays a role in apico-basal cell polarity, and in other forms of polarity, including regulation of the actin cytoskeleton, cell signaling and vesicular trafficking, and in tumor development. Postsynaptic targeting of Drosophila DLG requires interactions mediated by the first two PDZ domains. PDZ domains usually bind in a sequence-specific manner to short peptide sequences located at the C-terminal end of their partner proteins (known as PDZ binding motifs). The PDZ superfamily includes canonical PDZ domains as well as those with circular permutations and domain swapping mediated by beta-strands. This Dlg-like family PDZ1 domain is a canonical PDZ domain containing six beta-strands A-F and two alpha-helices (alpha-helix 1 and 2), arranged in the order: beta-strands A, B, C, alpha-helix 1, beta-strands D, E, alpha-helix 2 and beta-strand F.


Pssm-ID: 467206 [Multi-domain]  Cd Length: 89  Bit Score: 41.92  E-value: 5.52e-04
                           10        20        30        40
                   ....*....|....*....|....*....|....*....|
gi 2022791524 4543 GNGLGIRVVGGKEIPGSSGETGAYIAKILPGGNAEQTGKL 4582
Cdd:cd06723     10 NSGLGFSIAGGTDNPHIGDDPSIYITKIIPGGAAAADGRL 49
PHA01929 PHA01929
putative scaffolding protein
315-427 5.79e-04

putative scaffolding protein


Pssm-ID: 177328  Cd Length: 306  Bit Score: 45.43  E-value: 5.79e-04
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2022791524  315 QAPPPAKPSSQQLGPVKQPSQQPARQGGPVKPS-SQQAGPPKQ---PSQQPGPEKPSAQQMGPAKQA-SQPGPGKPPLQQ 389
Cdd:PHA01929     7 QLPPGLAGLVANVPPAAAPTPQPNPVIQPQAPVqPGQPGAPQQlaiPTQQPQPVPTSAMTPHVVQQApAQPAPAAPPAAG 86
                           90       100       110
                   ....*....|....*....|....*....|....*...
gi 2022791524  390 TGPVKQAPPQSGPTKPPPQTAGATKAPVPQPGLTKPPG 427
Cdd:PHA01929    87 AALPEALEVPPPPAFTPNGEIVGTLAGNLEGDPQLAPS 124
SPT5 COG5164
Transcription elongation factor SPT5 [Transcription];
312-428 5.86e-04

Transcription elongation factor SPT5 [Transcription];


Pssm-ID: 444063 [Multi-domain]  Cd Length: 495  Bit Score: 45.79  E-value: 5.86e-04
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2022791524  312 TSSQAPPPAKPSSQQLGPVKQPSQQPARQGGpvkpssqqAGPPKQPSQQPGPekpsAQQMGPAKQASQPGPGKPPLQQTG 391
Cdd:COG5164      4 YGPGKTGPSDPGGVTTPAGSQGSTKPAQNQG--------STRPAGNTGGTRP----AQNQGSTTPAGNTGGTRPAGNQGA 71
                           90       100       110
                   ....*....|....*....|....*....|....*..
gi 2022791524  392 pvkQAPPQSGPTKPPPQTAGATKAPVPQPGLTKPPGQ 428
Cdd:COG5164     72 ---TGPAQNQGGTTPAQNQGGTRPAGNTGGTTPAGDG 105
PHA03269 PHA03269
envelope glycoprotein C; Provisional
323-442 5.97e-04

envelope glycoprotein C; Provisional


Pssm-ID: 165527 [Multi-domain]  Cd Length: 566  Bit Score: 45.87  E-value: 5.97e-04
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2022791524  323 SSQQLGPVKQPSQQPARQGGPVKPSSQQAGPPKQPSQQPGPEkpSAQQMGPAKQASQPGPGKPPlQQTGPVKQAPPQSGP 402
Cdd:PHA03269    36 ATQKPDPAPAPHQAASRAPDPAVAPTSAASRKPDLAQAPTPA--ASEKFDPAPAPHQAASRAPD-PAVAPQLAAAPKPDA 112
                           90       100       110       120
                   ....*....|....*....|....*....|....*....|
gi 2022791524  403 TKPPPQTAGATKAPVPQPglTKPPGQQPEPEKPSQQKQAS 442
Cdd:PHA03269   113 AEAFTSAAQAHEAPADAG--TSAASKKPDPAAHTQHSPPP 150
PDZ1_LNX1_2-like cd06677
PDZ domain 1 of human Ligand of Numb protein X 1 (LNX1) and LNX2, and related domains; PDZ ...
4546-4610 6.20e-04

PDZ domain 1 of human Ligand of Numb protein X 1 (LNX1) and LNX2, and related domains; PDZ (PSD-95 (Postsynaptic density protein 95), Dlg (Discs large protein), and ZO-1 (Zonula occludens-1)) domain 1 of LNX1 (also known as PDZ domain-containing RING finger protein 2, PDZRN2) and LNX2 (also known as PDZ domain-containing RING finger protein 1, PDZRN1), and related domains. LNX1 and LNX2 are Ring (Really Interesting New Gene) finger and PDZ domain-containing E3 ubiquitin ligases that bind to the cell fate determinant protein NUMB and mediate its ubiquitination. LNX1 can ubiquitinate a number of other ligands including PPFIA1, KLHL11, KIF7 and ERC2. LNX1 and LNX2 each have four PDZ domains. PDZ domains usually bind in a sequence-specific manner to short peptide sequences located at the C-terminal end of their partner proteins (known as PDZ binding motifs). The PDZ superfamily includes canonical PDZ domains as well as those with circular permutations and domain swapping mediated by beta-strands. This LNX family PDZ1 domain is a canonical PDZ domain containing six beta-strands A-F and two alpha-helices (alpha-helix 1 and 2), arranged in the order: beta-strands A, B, C, alpha-helix 1, beta-strands D, E, alpha-helix 2 and beta-strand F.


Pssm-ID: 467165 [Multi-domain]  Cd Length: 89  Bit Score: 41.46  E-value: 6.20e-04
                           10        20        30        40        50        60
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*
gi 2022791524 4546 LGIRVVGGKEIPgssgETGAYIAKILPGGNAEQTGKLIEGMQVLEWNGIPLTGKTYEEVQNIIIQ 4610
Cdd:cd06677     17 LGISIVGGNDTP----LINIVIQEVYRDGVIARDGRLLPGDQILEVNGVDISNVTHSQARSVLRQ 77
PHA03269 PHA03269
envelope glycoprotein C; Provisional
302-430 6.34e-04

envelope glycoprotein C; Provisional


Pssm-ID: 165527 [Multi-domain]  Cd Length: 566  Bit Score: 45.87  E-value: 6.34e-04
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2022791524  302 PLPQQPDSARTssQAPPPAKPSSQQLGPVKQPSQQPArQGGPVKPSSQQAgPPKQPSQQPGPEKPSAQQmgpakQASQPG 381
Cdd:PHA03269    27 PIPELHTSAAT--QKPDPAPAPHQAASRAPDPAVAPT-SAASRKPDLAQA-PTPAASEKFDPAPAPHQA-----ASRAPD 97
                           90       100       110       120       130
                   ....*....|....*....|....*....|....*....|....*....|...
gi 2022791524  382 PGKPPLQQTGPVKQA--PPQSGPT-KPPPQTAG-ATKAPVPQPGLTKPPGQQP 430
Cdd:PHA03269    98 PAVAPQLAAAPKPDAaeAFTSAAQaHEAPADAGtSAASKKPDPAAHTQHSPPP 150
PRK14971 PRK14971
DNA polymerase III subunit gamma/tau;
194-317 6.49e-04

DNA polymerase III subunit gamma/tau;


Pssm-ID: 237874 [Multi-domain]  Cd Length: 614  Bit Score: 45.92  E-value: 6.49e-04
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2022791524  194 EQGKPEEQRGSAKLPAQQQSPKPVQPQGHVKPTPQQSESSKAV--PQQQQPGEPKQIQKPGPGHPADskleQVKQPPQPR 271
Cdd:PRK14971   367 DDASGGRGPKQHIKPVFTQPAAAPQPSAAAAASPSPSQSSAAAqpSAPQSATQPAGTPPTVSVDPPA----AVPVNPPST 442
                           90       100       110       120
                   ....*....|....*....|....*....|....*....|....*.
gi 2022791524  272 GPQKSQPQQSEPTKPVqqQTSAKPSAGPAKPLPQQPDSARTSSQAP 317
Cdd:PRK14971   443 APQAVRPAQFKEEKKI--PVSKVSSLGPSTLRPIQEKAEQATGNIK 486
DUF4045 pfam13254
Domain of unknown function (DUF4045); This presumed domain is functionally uncharacterized. ...
193-349 6.58e-04

Domain of unknown function (DUF4045); This presumed domain is functionally uncharacterized. This domain family is found in bacteria and eukaryotes, and is typically between 384 and 430 amino acids in length.


Pssm-ID: 433066 [Multi-domain]  Cd Length: 415  Bit Score: 45.54  E-value: 6.58e-04
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2022791524  193 KEQGKPEEQRGSAKLPAQQqSPKPVQPQGHVKPTPQQSESSKAvpqqQQPGEPKQIQKPGPGHPADSKLEQVKQPPQPRG 272
Cdd:pfam13254  180 KELNKIRQSRASVDLGRPN-SFKEVTPVGLMRSPAPGGHSKSP----SVSGISADSSPTKEEPSEEADTLSTDKEQSPAP 254
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2022791524  273 PQKSQPQQSEPTKPVQQQTSAKPSAGP-AKPLPQQPDS-----ARTSSQAPPPAKPSSQQLGPVkqPSQQPARQGGPVKP 346
Cdd:pfam13254  255 TSASEPPPKTKELPKDSEEPAAPSKSAeASTEKKEPDTesspeTSSEKSAPSLLSPVSKASIDK--PLSSPDRDPLSPKP 332

                   ...
gi 2022791524  347 SSQ 349
Cdd:pfam13254  333 KPQ 335
PRK10263 PRK10263
DNA translocase FtsK; Provisional
211-688 6.64e-04

DNA translocase FtsK; Provisional


Pssm-ID: 236669 [Multi-domain]  Cd Length: 1355  Bit Score: 46.23  E-value: 6.64e-04
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2022791524  211 QQSPKPVQPQGHVKPTPqqsESSKAVPQQQQPGEPKQiqkpgpghpadsklEQVKQPPQPRGPQKSQPQQSEPTKPVQQQ 290
Cdd:PRK10263   361 QPVPGPQTGEPVIAPAP---EGYPQQSQYAQPAVQYN--------------EPLQQPVQPQQPYYAPAAEQPAQQPYYAP 423
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2022791524  291 TSAKPSAGPAKPLPQQPDSARTSSQAPPPAKPSSQQlgPVKQPsQQPARQggPVKPSSQQAGPPKQPSQQPGPEKPSAQQ 370
Cdd:PRK10263   424 APEQPAQQPYYAPAPEQPVAGNAWQAEEQQSTFAPQ--STYQT-EQTYQQ--PAAQEPLYQQPQPVEQQPVVEPEPVVEE 498
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2022791524  371 MGPAKQ-------------------ASQPGPGKPPLQQTGPVKQAPPQSGPTKPPPQTAGATKAPV-------------- 417
Cdd:PRK10263   499 TKPARPplyyfeeveekrarereqlAAWYQPIPEPVKEPEPIKSSLKAPSVAAVPPVEAAAAVSPLasgvkkatlatgaa 578
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2022791524  418 ---------------PQPGLTKPPGQQ-PEPEK----------------PSQQKQASATQPAESAPKKTFCPLCTTTELL 465
Cdd:PRK10263   579 atvaapvfslansggPRPQVKEGIGPQlPRPKRirvptrrelasygiklPSQRAAEEKAREAQRNQYDSGDQYNDDEIDA 658
                          330       340       350       360       370       380       390       400
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2022791524  466 LHTPEKANYNTCTQCHTVVCSLCGFNP-----NPHITEIKEWLCLNCQMQRALGGDLAPGHGP----------------- 523
Cdd:PRK10263   659 MQQDELARQFAQTQQQRYGEQYQHDVPvnaedADAAAEAELARQFAQTQQQRYSGEQPAGANPfslddfefspmkalldd 738
                          410       420       430       440       450       460       470       480
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2022791524  524 GPQlPPPKQKTPTPPSTAKPPPQPQPGQKKDASPKPDPAQLAEPKKPPPQKKQPSMPgSPPVKAKTSRAEPadtSQQIDP 603
Cdd:PRK10263   739 GPH-EPLFTPIVEPVQQPQQPVAPQQQYQQPQQPVAPQPQYQQPQQPVAPQPQYQQP-QQPVAPQPQYQQP---QQPVAP 813
                          490       500       510       520       530       540       550       560
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2022791524  604 APKSDRAKP---SQAEDKQKQPSIQKPAADTVLTPaapgLKQDSAGPRppPTQQKVTDSPKPELAKPSqdthPAGEKP-D 679
Cdd:PRK10263   814 QPQYQQPQQpvaPQPQYQQPQQPVAPQPQDTLLHP----LLMRNGDSR--PLHKPTTPLPSLDLLTPP----PSEVEPvD 883

                   ....*....
gi 2022791524  680 SKPLPQVSR 688
Cdd:PRK10263   884 TFALEQMAR 892
PRK14951 PRK14951
DNA polymerase III subunits gamma and tau; Provisional
340-454 6.73e-04

DNA polymerase III subunits gamma and tau; Provisional


Pssm-ID: 237865 [Multi-domain]  Cd Length: 618  Bit Score: 45.86  E-value: 6.73e-04
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2022791524  340 QGGPVKPSSQQAGPPKQPSQQ-PGPEKPSAQQMGPAKQASQPGPGKPPLQQTGPVKQAPPQSGPTKPPPQTAGATKAPVP 418
Cdd:PRK14951   367 AAAAEAAAPAEKKTPARPEAAaPAAAPVAQAAAAPAPAAAPAAAASAPAAPPAAAPPAPVAAPAAAAPAAAPAAAPAAVA 446
                           90       100       110       120
                   ....*....|....*....|....*....|....*....|....*.
gi 2022791524  419 QP----------GLTKPPGQQPEPEKPSQQKqASATQPAESAPKKT 454
Cdd:PRK14951   447 LApappaqaapeTVAIPVRVAPEPAVASAAP-APAAAPAAARLTPT 491
PHA03291 PHA03291
envelope glycoprotein I; Provisional
366-450 6.82e-04

envelope glycoprotein I; Provisional


Pssm-ID: 223033 [Multi-domain]  Cd Length: 401  Bit Score: 45.72  E-value: 6.82e-04
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2022791524  366 PSAQQMGPAkQASQPGPGKPPLQQTGPVKQAPPQSGPTKPPPQT--AGATKAPVPQPGLTKPPGQQPEPEKP--SQQKQA 441
Cdd:PHA03291   192 LSAPRLGPA-DVFVPATPRPTPRTTASPETTPTPSTTTSPPSTTipAPSTTIAAPQAGTTPEAEGTPAPPTPggGEAPPA 270

                   ....*....
gi 2022791524  442 SATQPAESA 450
Cdd:PHA03291   271 NATPAPEAS 279
PRK14951 PRK14951
DNA polymerase III subunits gamma and tau; Provisional
292-425 6.90e-04

DNA polymerase III subunits gamma and tau; Provisional


Pssm-ID: 237865 [Multi-domain]  Cd Length: 618  Bit Score: 45.86  E-value: 6.90e-04
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2022791524  292 SAKPSAGPAKPLPQQPDSARTSSQAPPPAkpssQQLGPVKQPSQQPARQGGPVKPSSQQAGPPKQPSQQPGPEKPSAQQM 371
Cdd:PRK14951   368 AAAEAAAPAEKKTPARPEAAAPAAAPVAQ----AAAAPAPAAAPAAAASAPAAPPAAAPPAPVAAPAAAAPAAAPAAAPA 443
                           90       100       110       120       130
                   ....*....|....*....|....*....|....*....|....*....|....
gi 2022791524  372 gPAKQASQPGPGKPPLQQTGPVKQAPPqsgptkPPPQTAGATKAPVPQPGLTKP 425
Cdd:PRK14951   444 -AVALAPAPPAQAAPETVAIPVRVAPE------PAVASAAPAPAAAPAAARLTP 490
PRK10263 PRK10263
DNA translocase FtsK; Provisional
79-467 7.34e-04

DNA translocase FtsK; Provisional


Pssm-ID: 236669 [Multi-domain]  Cd Length: 1355  Bit Score: 46.23  E-value: 7.34e-04
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2022791524   79 PMQRHPE---LDTSHHPRQPSKPPD---PGPGLsksrtvdvlktEQRAPGRSPSSISLRESKSRTDFREDQKS--SMMPS 150
Cdd:PRK10263   464 TEQTYQQpaaQEPLYQQPQPVEQQPvvePEPVV-----------EETKPARPPLYYFEEVEEKRAREREQLAAwyQPIPE 532
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2022791524  151 FLSEANPLSAVTSVVNKFNPFDLISDSDTAHEEAGRKQKVIQKeqGKPEEQRGSAKLPAQQQSPKPVQPQGhvkPTPQQS 230
Cdd:PRK10263   533 PVKEPEPIKSSLKAPSVAAVPPVEAAAAVSPLASGVKKATLAT--GAAATVAAPVFSLANSGGPRPQVKEG---IGPQLP 607
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2022791524  231 ESSKA-VPQQQQP---GEPKQIQKPGPGHPADSKLEQVKQPPQPRGPQKSQPQQSEPTKPVQQQTSAKPSAGPAKPLPQQ 306
Cdd:PRK10263   608 RPKRIrVPTRRELasyGIKLPSQRAAEEKAREAQRNQYDSGDQYNDDEIDAMQQDELARQFAQTQQQRYGEQYQHDVPVN 687
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2022791524  307 PDSARTSSQAPPPAK-PSSQQLgpvKQPSQQPArQGGPVKPSSQQAGP--------PKQPSQQPG--PEKPSAQQMGPAK 375
Cdd:PRK10263   688 AEDADAAAEAELARQfAQTQQQ---RYSGEQPA-GANPFSLDDFEFSPmkallddgPHEPLFTPIvePVQQPQQPVAPQQ 763
                          330       340       350       360       370       380       390       400
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2022791524  376 QASQPGPGKPPLQQtgpvkQAPPQSgPTKPPPQTAGATKAPVPQPGLTKPPGQQPEPEKPSQQKQASATQPAESAPKKTF 455
Cdd:PRK10263   764 QYQQPQQPVAPQPQ-----YQQPQQ-PVAPQPQYQQPQQPVAPQPQYQQPQQPVAPQPQYQQPQQPVAPQPQYQQPQQPV 837
                          410
                   ....*....|..
gi 2022791524  456 CPlcTTTELLLH 467
Cdd:PRK10263   838 AP--QPQDTLLH 847
Caprin-1_C pfam12287
Cytoplasmic activation/proliferation-associated protein-1 C term; This family of proteins is ...
306-454 7.88e-04

Cytoplasmic activation/proliferation-associated protein-1 C term; This family of proteins is found in eukaryotes. Proteins in this family are typically between 343 and 708 amino acids in length. This family is the C terminal region of caprin-1. Caprin-1 is a protein involved in regulating cellular proliferation. In mutated phenotypes, the G1 phase of the cell cycle is greatly lengthened, impairing normal proliferation. The C terminal region of caprin-1 contains RGG motifs which are characteriztic of RNA binding domains. It is possible that caprin-1 functions through an RNA binding mechanism.


Pssm-ID: 463522 [Multi-domain]  Cd Length: 320  Bit Score: 45.17  E-value: 7.88e-04
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2022791524  306 QPDSARTSSQAPPPAKPSSQQLGPVKQP----------SQQPAR-QGGPVKPSSQQAGPPKQPSQQPGPEKPSAQQMGPA 374
Cdd:pfam12287   25 PSDSAIVSAQPPSQSPDLSQMVCPPASPeqrlsqqsdvLQQPEQtQVSPVSPSSNACASSGSEYQFHTSEPPQPEAIDPI 104
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2022791524  375 ----KQASQPGPGKPPLQQT--GPVKQAPPQSGPT---KPPP----QTAGATKAPVpqpgltkPPGQQPEPEKPSQQkQA 441
Cdd:pfam12287  105 qssmSLPSELAPPSPPLSPAsqPQVFQSKPASSSGinvNAAPfqsmQTVFNVNAPV-------PPRNEQELKESSQY-SS 176
                          170
                   ....*....|...
gi 2022791524  442 SATQPAESAPKKT 454
Cdd:pfam12287  177 GYNQSFSSQSTQT 189
PRK10927 PRK10927
cell division protein FtsN;
258-403 7.88e-04

cell division protein FtsN;


Pssm-ID: 236797 [Multi-domain]  Cd Length: 319  Bit Score: 45.06  E-value: 7.88e-04
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2022791524  258 DSKLEQVKQPPQP------RGPQKSQPQQSEPTKPVQQQTSAKPSAGPAKPLPQQPDSARTSSQAPPPAKPSSQQLGPVK 331
Cdd:PRK10927    90 ESRQPGVRAPTEPsaggevKTPEQLTPEQRQLLEQMQADMRQQPTQLVEVPWNEQTPEQRQQTLQRQRQAQQLAEQQRLA 169
                           90       100       110       120       130       140       150
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*...
gi 2022791524  332 QPSQQPARQGGPVKPSSQQAGPPKQPSQqpgPEKPSAQQ------MGPAKQASQPGPgkpplQQTGPVKQAPPQSGPT 403
Cdd:PRK10927   170 QQSRTTEQSWQQQTRTSQAAPVQAQPRQ---SKPASTQQpyqdllQTPAHTTAQSKP-----QQAAPVTRAADAPKPT 239
motB PRK05996
MotB family protein;
268-450 7.91e-04

MotB family protein;


Pssm-ID: 235665 [Multi-domain]  Cd Length: 423  Bit Score: 45.46  E-value: 7.91e-04
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2022791524  268 PQPRGPQKSQPQQSEPTKP--VQQQTSAKPSAGPAKplpqqpDSARTSSQAPPPAKPSSQQL------------GPVKQP 333
Cdd:PRK05996    78 PSEKGLKDPVDGAEGEQKPgkSKFEEDQRVEGSSAV------TGDDTTRTSGDQTNYSEADLfrnpyavlaeiaQEVGQQ 151
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2022791524  334 SQQPARQGGpvkpSSQQAGP------------PKQP---SQQpgpekpsaQQMGPAKQASQPGPGKPPLQQTGPVKQAPP 398
Cdd:PRK05996   152 ANVSAKGDG----GAAQSGPatgadggeayrdPFDPdfwSKQ--------VEVTTAGDLLPPGQAREQAQGAKSATAAPA 219
                          170       180       190       200       210       220
                   ....*....|....*....|....*....|....*....|....*....|....*....|....
gi 2022791524  399 QSGPTKPPPQT------------AGATKAPVPQPGLTKPPGQQPEPekPSQQKQASATQPAESA 450
Cdd:PRK05996   220 TVPQAAPLPQAqpkkaateeeliADAKKAATGEPAANAAKAAKPEP--MPDDQQKEAEQLQAAI 281
Pro-rich pfam15240
Proline-rich protein; This family includes several eukaryotic proline-rich proteins.
320-446 7.95e-04

Proline-rich protein; This family includes several eukaryotic proline-rich proteins.


Pssm-ID: 464580 [Multi-domain]  Cd Length: 167  Bit Score: 43.49  E-value: 7.95e-04
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2022791524  320 AKPSSQQLGPVKQPSQQPARQGGPVKPSSQQAGPPkqPSQQPGPEKPSAQQMGPAKQAsqpGPGKPPLQQTGPVKQAPPQ 399
Cdd:pfam15240   16 AQSSSEDVSQEDSPSLISEEEGQSQQGGQGPQGPP--PGGFPPQPPASDDPPGPPPPG---GPQQPPPQGGKQKPQGPPP 90
                           90       100       110       120
                   ....*....|....*....|....*....|....*....|....*...
gi 2022791524  400 SGPTKPPPqtaGATKAPVPQPGLTKP-PGQQPEPEKPSQQKQASATQP 446
Cdd:pfam15240   91 QGGPRPPP---GKPQGPPPQGGNQQQgPPPPGKPQGPPPQGGGPPPQG 135
PDZ5_DrPTPN13-like cd23060
PDZ domain 5 of Danio rerio tyrosine-protein phosphatase non-receptor type 13 (Ptpn13) and ...
4545-4613 8.67e-04

PDZ domain 5 of Danio rerio tyrosine-protein phosphatase non-receptor type 13 (Ptpn13) and related domains; PDZ (PSD-95 (Postsynaptic density protein 95), Dlg (Discs large protein), and ZO-1 (Zonula occludens-1)) domain 5 of Danio rerio Ptpn13, and related domains. Protein-tyrosine phosphatases (PTPs) dephosphorylate phosphotyrosyl residues in proteins that are phosphorylated by protein tyrosine kinases (PTKs). Danio rerio Ptpn13 is a classical non-receptor-like PTP. PDZ domains usually bind in a sequence-specific manner to short peptide sequences located at the C-terminal end of their partner proteins (known as PDZ binding motifs). The PDZ superfamily includes canonical PDZ domains as well as those with circular permutations and domain swapping mediated by beta-strands. This PTPN13-like family domain is a canonical PDZ domain containing six beta-strands A-F and two alpha-helices (alpha-helix 1 and 2), arranged in the order: beta-strands A, B, C, alpha-helix 1, beta-strands D, E, alpha-helix 2 and beta-strand F.


Pssm-ID: 467273 [Multi-domain]  Cd Length: 80  Bit Score: 40.80  E-value: 8.67e-04
                           10        20        30        40        50        60
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....
gi 2022791524 4545 GLGIRVVGGKeipgssGETGAYIAKILPGGNAEQTGKLIEGMQVLEWNGIPLTGKTYEEVQNIIIQQSG 4613
Cdd:cd23060     11 GLGFSLVGGE------GGSGIFVKSISPGGVADRDGRLQVGDRLLQVNGESVIGLSHSKAVNILRKAKG 73
PAT1 pfam09770
Topoisomerase II-associated protein PAT1; Members of this family are necessary for accurate ...
182-327 1.06e-03

Topoisomerase II-associated protein PAT1; Members of this family are necessary for accurate chromosome transmission during cell division.


Pssm-ID: 401645 [Multi-domain]  Cd Length: 846  Bit Score: 45.41  E-value: 1.06e-03
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2022791524  182 EEAGRKQKVIQKEQGKPEEQRGSAKLPAQQ---------QSPKPVQPQGHVKPTPQQSESSKAVPQQQQPgEPKQIQKPG 252
Cdd:pfam09770  200 EAAMRAQAKKPAQQPAPAPAQPPAAPPAQQaqqqqqfppQIQQQQQPQQQPQQPQQHPGQGHPVTILQRP-QSPQPDPAQ 278
                           90       100       110       120       130       140       150
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*...
gi 2022791524  253 PGHPADSKLEQVKQPPQPRGP-QKSQ--PQQSEPTKPVQQQTSAKPSAGPAKPLPQQPDSArtSSQAPPPAKPssQQL 327
Cdd:pfam09770  279 PSIQPQAQQFHQQPPPVPVQPtQILQnpNRLSAARVGYPQNPQPGVQPAPAHQAHRQQGSF--GRQAPIITHP--QQL 352
PRK14950 PRK14950
DNA polymerase III subunits gamma and tau; Provisional
267-359 1.13e-03

DNA polymerase III subunits gamma and tau; Provisional


Pssm-ID: 237864 [Multi-domain]  Cd Length: 585  Bit Score: 45.19  E-value: 1.13e-03
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2022791524  267 PPQPRGPQKSQPQQSEPTKpvqqQTSAKPSAGPAKPLPQQPDSARTSSQAPPPAKPSSQQLGPVKQPSQQPARQGGPVKP 346
Cdd:PRK14950   365 APQPAKPTAAAPSPVRPTP----APSTRPKAAAAANIPPKEPVRETATPPPVPPRPVAPPVPHTPESAPKLTRAAIPVDE 440
                           90
                   ....*....|...
gi 2022791524  347 SSQQAGPPKQPSQ 359
Cdd:PRK14950   441 KPKYTPPAPPKEE 453
Pro-rich pfam15240
Proline-rich protein; This family includes several eukaryotic proline-rich proteins.
356-448 1.15e-03

Proline-rich protein; This family includes several eukaryotic proline-rich proteins.


Pssm-ID: 464580 [Multi-domain]  Cd Length: 167  Bit Score: 42.72  E-value: 1.15e-03
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2022791524  356 QPSQQPGPEKPSaQQMGPAKQASQPGPGKPPLQQTGPVKQAPPQSGPTKPPPQTAGAT-KAPVPQPGLTKPPGQQPEPEK 434
Cdd:pfam15240   28 SPSLISEEEGQS-QQGGQGPQGPPPGGFPPQPPASDDPPGPPPPGGPQQPPPQGGKQKpQGPPPQGGPRPPPGKPQGPPP 106
                           90
                   ....*....|....
gi 2022791524  435 PSQQKQASATQPAE 448
Cdd:pfam15240  107 QGGNQQQGPPPPGK 120
PRK10856 PRK10856
cytoskeleton protein RodZ;
274-382 1.20e-03

cytoskeleton protein RodZ;


Pssm-ID: 236776 [Multi-domain]  Cd Length: 331  Bit Score: 44.63  E-value: 1.20e-03
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2022791524  274 QKSQPQQSEPTKPVQQqTSAKPSAGPAKPLP----QQPDSARTSSQAPPPAKPSSQQLGPVKQPSQQPA--RQGGPVKPS 347
Cdd:PRK10856   134 QNHKAQQEEITTMADQ-SSAELSQNSGQSVPldtsTTTDPATTPAPAAPVDTTPTNSQTPAVATAPAPAvdPQQNAVVAP 212
                           90       100       110
                   ....*....|....*....|....*....|....*
gi 2022791524  348 SQQAGPPKQPSQQPGPEKPSAQQMGPAKQASQPGP 382
Cdd:PRK10856   213 SQANVDTAATPAPAAPATPDGAAPLPTDQAGVSTP 247
Androgen_recep pfam02166
Androgen receptor;
199-302 1.23e-03

Androgen receptor;


Pssm-ID: 426632 [Multi-domain]  Cd Length: 501  Bit Score: 44.92  E-value: 1.23e-03
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2022791524  199 EEQRGSAKLPAQQQSPKPVQPQGHVKPTPQQSESSKAVPQQQQPG----------EPKQIQKPGPGHPADSkleQVKQPP 268
Cdd:pfam02166   54 QQQQQQVPQQPQQQESSPRQPQASVQPQQAGDDGSPPAHNRGPAGylaleddeqpQPSQAQPAAECCPENG---CVPEPG 130
                           90       100       110
                   ....*....|....*....|....*....|....*...
gi 2022791524  269 QPRGPQKSQPQQSEPTKPVQQQTSAKPSA----GPAKP 302
Cdd:pfam02166  131 AAAAAGKGLPQQAVAPAAPDDDDSAAPSTlsllGPSFP 168
PRK10927 PRK10927
cell division protein FtsN;
290-454 1.29e-03

cell division protein FtsN;


Pssm-ID: 236797 [Multi-domain]  Cd Length: 319  Bit Score: 44.29  E-value: 1.29e-03
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2022791524  290 QTSAKPSAGPAKPLPQQ--PDSARTSSQAPPPAKPSSQQLGPVKQPSQQPARQggpvkpssQQAGPPKQPSQQPGPEKPS 367
Cdd:PRK10927    97 RAPTEPSAGGEVKTPEQltPEQRQLLEQMQADMRQQPTQLVEVPWNEQTPEQR--------QQTLQRQRQAQQLAEQQRL 168
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2022791524  368 AQQMGPAKQASQpgpgkpplQQTGPVKQAPPQSGPtKPPPQTAGATkapvPQPGLTKPPGQQPEPEKPsqQKQASATQPA 447
Cdd:PRK10927   169 AQQSRTTEQSWQ--------QQTRTSQAAPVQAQP-RQSKPASTQQ----PYQDLLQTPAHTTAQSKP--QQAAPVTRAA 233

                   ....*..
gi 2022791524  448 EsAPKKT 454
Cdd:PRK10927   234 D-APKPT 239
Amelin smart00817
Ameloblastin precursor (Amelin); This family consists of several mammalian Ameloblastin ...
210-433 1.43e-03

Ameloblastin precursor (Amelin); This family consists of several mammalian Ameloblastin precursor (Amelin) proteins. Matrix proteins of tooth enamel consist mainly of amelogenin but also of non-amelogenin proteins, which, although their volumetric percentage is low, have an important role in enamel mineralisation. One of the non-amelogenin proteins is ameloblastin, also known as amelin and sheathlin. Ameloblastin (AMBN) is one of the enamel sheath proteins which is though to have a role in determining the prismatic structure of growing enamel crystals.


Pssm-ID: 214832 [Multi-domain]  Cd Length: 411  Bit Score: 44.50  E-value: 1.43e-03
                            10        20        30        40        50        60        70        80
                    ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2022791524   210 QQQSPKPVQPQGHVKPTPQQSESSKAVPQQQQPGEPK---QIQKPGPGHPAdskleQVKQPPQPRGPQKSQPQQSEPT-K 285
Cdd:smart00817   88 QYEYSLPVHPPPLPSQPSLQPQQPGLKPFLQPTALPTnqaTPQKNGPQPPM-----HLGQPPLQQAELPMIPPQVAPSdK 162
                            90       100       110       120       130       140       150       160
                    ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2022791524   286 PVQQQTSAKPSAGPAKPLPQQpdSARTSSQAPPPakpssqqlgPVKQPSQQPARQGgpVKPSSQQAGPPKQPSQQPGPEK 365
Cdd:smart00817  163 PPQTELPLYDFADPQNPLLFQ--IAHLMSRGPMP---------QNKQQHLYPGLFY--MSYGANQLGAPARLGAMSSEEM 229
                           170       180       190       200       210       220       230       240
                    ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2022791524   366 PSAQQMGPAKQASQPG-----------PGKPPLQQTGPVKQAPPQSGpTKPPPQTAG-ATKAPVPQPGLTKPPGQQPEPE 433
Cdd:smart00817  230 TGGRGAPHAYGALFPGlggmrpglrgmPQNPAMQGDFTLEDDSPVAA-TKGPEKGEGgAQGSPIPEAQGVDPENPALLSE 308
valS PRK14900
valyl-tRNA synthetase; Provisional
306-438 1.53e-03

valyl-tRNA synthetase; Provisional


Pssm-ID: 237855 [Multi-domain]  Cd Length: 1052  Bit Score: 44.98  E-value: 1.53e-03
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2022791524  306 QPDSARTSS-QAPPPAKPSSQQLGPVKQPSQQPARQGGPVKPSSQQAGPPKQ--PSQQPGPEKPSAQQMGPAKQASQPGP 382
Cdd:PRK14900   907 EANSARRDTmEIQNEQKPTQDGPAAEAQPAQENTVVESAEKAVAAVSEAAQQaaTAVASGIEKVAEAVRKTVRRSVKKAA 986
                           90       100       110       120       130
                   ....*....|....*....|....*....|....*....|....*....|....*.
gi 2022791524  383 GKPPLQQTGPVKQAPPQSGPTKPPPQTAGATKAPVPQPGLTKPPGQQPEPEKPSQQ 438
Cdd:PRK14900   987 ATRAAMKKKVAKKAPAKKAAAKKAAAKKAAAKKKVAKKAPAKKVARKPAAKKAAKK 1042
PTZ00449 PTZ00449
104 kDa microneme/rhoptry antigen; Provisional
598-910 1.55e-03

104 kDa microneme/rhoptry antigen; Provisional


Pssm-ID: 185628 [Multi-domain]  Cd Length: 943  Bit Score: 45.07  E-value: 1.55e-03
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2022791524  598 SQQIDPAPKSDRAK--PSQAEDKQKQPSIQKPAADTVLTPAAPGLKQDSAGPRPPptqQKVTDSPKPELAKPSQDTHPAG 675
Cdd:PTZ00449   482 TQEIKKLIKKSKKKlaPIEEEDSDKHDEPPEGPEASGLPPKAPGDKEGEEGEHED---SKESDEPKEGGKPGETKEGEVG 558
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2022791524  676 EKP----DSKP--LPQVSRQKSDPKLSAQPgarseaktQKPVEPAQVKD--DPKKLQTKPSPK-PETKPAPKGLQPGAGP 746
Cdd:PTZ00449   559 KKPgpakEHKPskIPTLSKKPEFPKDPKHP--------KDPEEPKKPKRprSAQRPTRPKSPKlPELLDIPKSPKRPESP 630
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2022791524  747 KPVPAQPAPQPQQPQKTPEQPRrfslnlggITDTPKPqPTTPQ-----------------------ETVTGRLFGFGASI 803
Cdd:PTZ00449   631 KSPKRPPPPQRPSSPERPEGPK--------IIKSPKP-PKSPKppfdpkfkekfyddyldaaakskETKTTVVLDESFES 701
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2022791524  804 FSQASnLISTAGQPGSQTSGPPPagpaakqpQAPAQPPASQAPPKDTAQTQSSPKAVPVKKEAKpltSEKMEPTKVDSVL 883
Cdd:PTZ00449   702 ILKET-LPETPGTPFTTPRPLPP--------KLPRDEEFPFEPIGDPDAEQPDDIEFFTPPEEE---RTFFHETPADTPL 769
                          330       340
                   ....*....|....*....|....*..
gi 2022791524  884 TTKGSDLEKKPSLAKDSKPQAAEAKKP 910
Cdd:PTZ00449   770 PDILAEEFKEEDIHAETGEPDEAMKRP 796
PDZ1_Par3-like cd06691
PDZ domain 1 of partitioning defective 3 (Par3), and related domains; PDZ (PSD-95 ...
4540-4607 1.59e-03

PDZ domain 1 of partitioning defective 3 (Par3), and related domains; PDZ (PSD-95 (Postsynaptic density protein 95), Dlg (Discs large protein), and ZO-1 (Zonula occludens-1)) domain 1 of Par3 (or PAR3 or Par-3, also known as Atypical PKC isotype-specific-interacting protein, ASIP) and related domains; Drosophila bazooka PDZ1 belongs to a different PDZ family. Par3 is a scaffold protein involved in organizing cell polarity across animals. Par3 binds numerous molecules both for its recruitment to one pole of the cell and for downstream contributions to polarized cell function. It regulates cell polarity by targeting the Par complex proteins Par6 and atypical protein kinase C (aPKC) to specific cortical sites. Physical interactions between Par3 and the Par complex include: Par-3 PDZ domain 1 binding to the Par6 PDZ domain, Par3 PDZ domain 1 and PDZ domain 3 binding the Par6's PDZ-binding motif, and an interaction with an undefined region of aPKC that requires both Par3 PDZ2 and PDZ3. The PDZ domains of Par3 have also been implicated as potential phosphoinositide signaling integrators, since its second PDZ domain binds to phosphoinositides, and the third PDZ interacts with phosphoinositide phosphatase PTEN. PDZ domains usually bind in a sequence-specific manner to short peptide sequences located at the C-terminal end of their partner proteins (known as PDZ binding motifs). The PDZ superfamily includes canonical PDZ domains as well as those with circular permutations and domain swapping mediated by beta-strands. This Par3 family PDZ1 domain is a canonical PDZ domain containing six beta-strands A-F and two alpha-helices (alpha-helix 1 and 2), arranged in the order: beta-strands A, B, C, alpha-helix 1, beta-strands D, E, alpha-helix 2 and beta-strand F.


Pssm-ID: 467178 [Multi-domain]  Cd Length: 98  Bit Score: 40.68  E-value: 1.59e-03
                           10        20        30        40        50        60        70
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....
gi 2022791524 4540 TVSGNG--LGIRVVggkeiP----GSSGETGAYIAKILPGGNAEQTGKLIEGMQVLEWNGIPLTGKTYEEVQNI 4607
Cdd:cd06691      9 ELSNDGgpLGIHVV-----PfsssLSGRTLGLLIRGIEEGSRAERDGRFQENDCIVEINGVDLIDKSFEQAQDI 77
dnaA PRK14086
chromosomal replication initiator protein DnaA;
196-369 1.75e-03

chromosomal replication initiator protein DnaA;


Pssm-ID: 237605 [Multi-domain]  Cd Length: 617  Bit Score: 44.43  E-value: 1.75e-03
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2022791524  196 GKPEEQRGSAKLPAQQQSPKPV----QPQGHVKPTPQQSESSKAVPQQQQPGEPKQIQKPGPGHPAdskleqvkqPPQPR 271
Cdd:PRK14086   122 GGPRADDRPPGLPRQDQLPTARpaypAYQQRPEPGAWPRAADDYGWQQQRLGFPPRAPYASPASYA---------PEQER 192
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2022791524  272 GPQKSQPQQSEPTKPVQQQTSAKPSAgpakplpQQPDSARTSSQAPPPAKPSSQQLGPVkqpsqQPARQGGPVKPSSQQA 351
Cdd:PRK14086   193 DREPYDAGRPEYDQRRRDYDHPRPDW-------DRPRRDRTDRPEPPPGAGHVHRGGPG-----PPERDDAPVVPIRPSA 260
                          170       180
                   ....*....|....*....|
gi 2022791524  352 G--PPKQPSQQPGPEKPSAQ 369
Cdd:PRK14086   261 PgpLAAQPAPAPGPGEPTAR 280
PDZ3_FL-whirlin-like cd06742
PDZ domain 3 of the full-length isoform of whirlin, PDZ domain 1 of the short isoform of ...
4546-4608 1.77e-03

PDZ domain 3 of the full-length isoform of whirlin, PDZ domain 1 of the short isoform of whirlin, and related domains; PDZ (PSD-95 (Postsynaptic density protein 95), Dlg (Discs large protein), and ZO-1 (Zonula occludens-1)) domain 3 of the full-length isoform of whirlin, PDZ domain 1 of the short isoform of whirlin, and related domains. Whirlin is an essential protein for developmental pathways in photoreceptor cells of the retina and hair cells of the inner ear. The full-length whirlin isoform has two harmonin N-like domains, three PDZ domains, a proline-rich region, and a PDZ-binding motif. Whirlin isoforms may form different complexes at the periciliary membrane complex (PMC) in photoreceptors, and the stereociliary tip and base in inner ear hair cells. It interacts with ADGRV1 and usherin at the PMC; with SANS and RpgrORF15 at the connecting cilium in photoreceptors; with EPS8, MYO15A, p55, and CASK proteins at the stereociliary tip of inner ear hair cells; and with ADGRV1, usherin, and PDZD7 at the stereociliary base in inner ear hair cells. Mutations in the gene encoding whirlin (WHRN; also known as USH2D and DFNB31), have been found to cause either USH2 subtype (USH2D) or autosomal recessive non-syndromic deafness type 31 (DFNB31). Whirlin is the key protein in the USH2 complex (whirlin, usherin and GPR98) which recruits other USH2 causative proteins at the periciliary membrane in photoreceptors and the ankle link of the stereocilia in hair cells. Whirlin's interaction with espin, another stereociliary protein, may be important for the architecture of the USH2 complex. PDZ domains usually bind in a sequence-specific manner to short peptide sequences located at the C-terminal end of their partner proteins (known as PDZ binding motifs). The PDZ superfamily includes canonical PDZ domains as well as those with circular permutations and domain swapping mediated by beta-strands. This whirlin family PDZ3 domain is a canonical PDZ domain containing six beta-strands A-F and two alpha-helices (alpha-helix 1 and 2), arranged in the order: beta-strands A, B, C, alpha-helix 1, beta-strands D, E, alpha-helix 2 and beta-strand F


Pssm-ID: 467224 [Multi-domain]  Cd Length: 91  Bit Score: 40.42  E-value: 1.77e-03
                           10        20        30        40        50        60
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*..
gi 2022791524 4546 LGIRVVGG----KEIPgssgetgaYIAKILPGGNAEQTGKLIEGMQVLEWNGIPLTGKTYEEVQNII 4608
Cdd:cd06742     13 LGIAIEGGantkQPLP--------RVINIQRGGSAHNCGGLKVGHVILEVNGTSLRGLEHREAARLI 71
PRK14951 PRK14951
DNA polymerase III subunits gamma and tau; Provisional
227-358 1.86e-03

DNA polymerase III subunits gamma and tau; Provisional


Pssm-ID: 237865 [Multi-domain]  Cd Length: 618  Bit Score: 44.32  E-value: 1.86e-03
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2022791524  227 PQQSESSKAVPQQQQPGEPKQiQKPGPGHPADSKLEQVKQPPQP--RGPQKSQPQQSEPTKPVQQQTSAKPSAGPAKPLP 304
Cdd:PRK14951   366 PAAAAEAAAPAEKKTPARPEA-AAPAAAPVAQAAAAPAPAAAPAaaASAPAAPPAAAPPAPVAAPAAAAPAAAPAAAPAA 444
                           90       100       110       120       130
                   ....*....|....*....|....*....|....*....|....*....|....
gi 2022791524  305 QQPDSArtssqapPPAKPSSQQLGPVKQPSQQPARQGGPVKPSSQQAGPPKQPS 358
Cdd:PRK14951   445 VALAPA-------PPAQAAPETVAIPVRVAPEPAVASAAPAPAAAPAAARLTPT 491
Gag_spuma pfam03276
Spumavirus gag protein;
293-426 2.07e-03

Spumavirus gag protein;


Pssm-ID: 460872 [Multi-domain]  Cd Length: 614  Bit Score: 44.35  E-value: 2.07e-03
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2022791524  293 AKPSAGPAKPLPQQPDSARTSSQAPPPAKPSSQqlgpvkqPSQQPARQGGPVKPSSQQAGPPKQP-SQQPGPEKPSAQQM 371
Cdd:pfam03276  193 SGLPSLPAIGGIHLPAIPGIHARAPPGNIARSL-------GDDIMPSLGDAGMPQPRFAFHPGNPfAEAEGHPFAEAEGE 265
                           90       100       110       120       130       140
                   ....*....|....*....|....*....|....*....|....*....|....*....|
gi 2022791524  372 GPAKQASQ-----PGPGKPPLQQTGPVkqAPPQSGPTKPPPQTAGATKAPVPQPGLTKPP 426
Cdd:pfam03276  266 RPRDIPRApridaPSAPAIPAIQPIAP--PMIPPIGAPIPIPHGASIPGEHIRNPREEPI 323
PLN03209 PLN03209
translocon at the inner envelope of chloroplast subunit 62; Provisional
226-451 2.25e-03

translocon at the inner envelope of chloroplast subunit 62; Provisional


Pssm-ID: 178748 [Multi-domain]  Cd Length: 576  Bit Score: 44.15  E-value: 2.25e-03
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2022791524  226 TPQQSESSKAVPQQQQPGEPKQIQKPGPGhPADSKLEQVKQPPQPRGPQKSQPQQSEPTKPVQQQTSAKPsagPAKPLPQ 305
Cdd:PLN03209   314 TPMEELLAKIPSQRVPPKESDAADGPKPV-PTKPVTPEAPSPPIEEEPPQPKAVVPRPLSPYTAYEDLKP---PTSPIPT 389
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2022791524  306 QPDSArtssqapppaKPSSQQLGPVKQPSQQparqggpvKPSSQQAGPPKQPSQQPGP-EKPSAQQMGP------AKQAS 378
Cdd:PLN03209   390 PPSSS----------PASSKSVDAVAKPAEP--------DVVPSPGSASNVPEVEPAQvEAKKTRPLSPyaryedLKPPT 451
                          170       180       190       200       210       220       230
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....
gi 2022791524  379 QPGPgKPPLQQTGPVKQAPPQSG-PTKPPPQTAGATKAPVPQPGLTKPPGQQPEPEKPSQQKQASATQPAESAP 451
Cdd:PLN03209   452 SPSP-TAPTGVSPSVSSTSSVPAvPDTAPATAATDAAAPPPANMRPLSPYAVYDDLKPPTSPSPAAPVGKVAPS 524
PHA03307 PHA03307
transcriptional regulator ICP4; Provisional
518-828 2.28e-03

transcriptional regulator ICP4; Provisional


Pssm-ID: 223039 [Multi-domain]  Cd Length: 1352  Bit Score: 44.39  E-value: 2.28e-03
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2022791524  518 APGHGPGPQLPPPKQKTPTPPSTAKPPPQPQPGQKKD-ASPKPDPAQLAEPKKPPPQKKQPSMPGSPPVKAKTSRAEPAD 596
Cdd:PHA03307   125 SPPPSPAPDLSEMLRPVGSPGPPPAASPPAAGASPAAvASDAASSRQAALPLSSPEETARAPSSPPAEPPPSTPPAAASP 204
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2022791524  597 TSQ-------------QIDPAPKSDRAKPSQAEDKQKQPSIQK------------PAADTVLTPAAPGLKQDSAGPRPPP 651
Cdd:PHA03307   205 RPPrrsspisasasspAPAPGRSAADDAGASSSDSSSSESSGCgwgpenecplprPAPITLPTRIWEASGWNGPSSRPGP 284
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2022791524  652 TQqkvTDSPKPELAKPSQDTHPAGEKPDSKPLPQVSRQKSDPKLSAQPGARSEAKTQKPVEPAQVKDDPKKLQtKPSPKP 731
Cdd:PHA03307   285 AS---SSSSPRERSPSPSPSSPGSGPAPSSPRASSSSSSSRESSSSSTSSSSESSRGAAVSPGPSPSRSPSPS-RPPPPA 360
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2022791524  732 ETKPAPKGLQPGAGPKPVPAQPAPQPQQPQKTPEQPRRFSLNLGGITDTPKPQPTTPQETVTgrlfgfgASIFSQASNLI 811
Cdd:PHA03307   361 DPSSPRKRPRPSRAPSSPAASAGRPTRRRARAAVAGRARRRDATGRFPAGRPRPSPLDAGAA-------SGAFYARYPLL 433
                          330
                   ....*....|....*..
gi 2022791524  812 STAGQPGSQtSGPPPAG 828
Cdd:PHA03307   434 TPSGEPWPG-SPPPPPG 449
Pro-rich pfam15240
Proline-rich protein; This family includes several eukaryotic proline-rich proteins.
199-345 2.30e-03

Proline-rich protein; This family includes several eukaryotic proline-rich proteins.


Pssm-ID: 464580 [Multi-domain]  Cd Length: 167  Bit Score: 41.95  E-value: 2.30e-03
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2022791524  199 EEQRGSAKLPAQQQSPKPVQPQGHVKPTPQQSESSKAVPQQ--QQPGEPKQIQKPGPGHPADSKLEQVKQPPQPRGPQKS 276
Cdd:pfam15240   35 EEGQSQQGGQGPQGPPPGGFPPQPPASDDPPGPPPPGGPQQppPQGGKQKPQGPPPQGGPRPPPGKPQGPPPQGGNQQQG 114
                           90       100       110       120       130       140
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....
gi 2022791524  277 QPQQSEPTKPVQQQTSAKPSAGPakplPQQPdsartssQAPPPAKPSsqqlGPVKQPSqQPARQGGPVK 345
Cdd:pfam15240  115 PPPPGKPQGPPPQGGGPPPQGGN----QQGP-------PPPPPGNPQ----GPPQRPP-QPGNPQGPPQ 167
PRK12323 PRK12323
DNA polymerase III subunit gamma/tau;
517-729 2.30e-03

DNA polymerase III subunit gamma/tau;


Pssm-ID: 237057 [Multi-domain]  Cd Length: 700  Bit Score: 44.10  E-value: 2.30e-03
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2022791524  517 LAPGHGPG---PQLPPPKQKTPTPPSTAKPPPQPQPGQKKDASPKPDPAQLAEPKKPPPQKKQPSMPGSPPVKAKTSRAE 593
Cdd:PRK12323   363 FRPGQSGGgagPATAAAAPVAQPAPAAAAPAAAAPAPAAPPAAPAAAPAAAAAARAVAAAPARRSPAPEALAAARQASAR 442
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2022791524  594 PADTSQQIDPAPKsdrAKPSQAEdkqkQPSIQKPAADTVLTPAAPGLKQDSAGPRPPPtqqkvtDSPKPELAKPSQDTHP 673
Cdd:PRK12323   443 GPGGAPAPAPAPA---AAPAAAA----RPAAAGPRPVAAAAAAAPARAAPAAAPAPAD------DDPPPWEELPPEFASP 509
                          170       180       190       200       210
                   ....*....|....*....|....*....|....*....|....*....|....*.
gi 2022791524  674 AGEKPDSKPLPQVSRQKSDPKLSAQPGARSEAKTQKPVEPAQVKDDPKKLQTKPSP 729
Cdd:PRK12323   510 APAQPDAAPAGWVAESIPDPATADPDDAFETLAPAPAAAPAPRAAAATEPVVAPRP 565
PRK12757 PRK12757
cell division protein FtsN; Provisional
317-449 2.32e-03

cell division protein FtsN; Provisional


Pssm-ID: 237191 [Multi-domain]  Cd Length: 256  Bit Score: 43.11  E-value: 2.32e-03
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2022791524  317 PPPAKPSSQqlGPVKQPSQ-----------------QPARQGGPVkPSSQQAGPPKQPSQQPGPEKPSAQqmgPAKQASQ 379
Cdd:PRK12757    63 PTPTEPSAG--GEVNSPTQltdeqrqlleqmqadmrQQPTQLSEV-PYNEQTPQVPRSTVQIQQQAQQQQ---PPATTAQ 136
                           90       100       110       120       130       140       150
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....
gi 2022791524  380 PGPGKPPLQQTGPVKQAPPQsgPTKPPPQtagatkAPVPQPglTKPPGQQPEPEKPS----QQKQASATQPAES 449
Cdd:PRK12757   137 PQPVTPPRQTTAPVQPQTPA--PVRTQPA------APVTQA--VEAPKVEAEKEKEQrwmvQCGSFKGTEQAES 200
PRK07994 PRK07994
DNA polymerase III subunits gamma and tau; Validated
209-324 2.41e-03

DNA polymerase III subunits gamma and tau; Validated


Pssm-ID: 236138 [Multi-domain]  Cd Length: 647  Bit Score: 44.09  E-value: 2.41e-03
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2022791524  209 AQQQSPKPVQPQGHVKPTPQQSESSKAVPQQQQPGEPKQIQKPGPGHPADSKLEQVKQPPQPRG---PQKSQPQQSEPTK 285
Cdd:PRK07994   371 PPQSAAPAASAQATAAPTAAVAPPQAPAVPPPPASAPQQAPAVPLPETTSQLLAARQQLQRAQGatkAKKSEPAAASRAR 450
                           90       100       110       120
                   ....*....|....*....|....*....|....*....|...
gi 2022791524  286 PVQQ----QTSAKPSAGPAKPLPQQPDSARTSSQAPPPAKPSS 324
Cdd:PRK07994   451 PVNSalerLASVRPAPSALEKAPAKKEAYRWKATNPVEVKKEP 493
PRK08691 PRK08691
DNA polymerase III subunits gamma and tau; Validated
285-448 2.46e-03

DNA polymerase III subunits gamma and tau; Validated


Pssm-ID: 236333 [Multi-domain]  Cd Length: 709  Bit Score: 44.31  E-value: 2.46e-03
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2022791524  285 KPVQQQTSAKPSAGpAKPLPQ-QPDSARTSSQAPPPAKPSSQqlgpvkqpsqqpARQGGPVKPSSQQAGPPKQPSQQPGP 363
Cdd:PRK08691   378 QSPSAQTAEKETAA-KKPQPRpEAETAQTPVQTASAAAMPSE------------GKTAGPVSNQENNDVPPWEDAPDEAQ 444
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2022791524  364 EKPSAQQMGPAKQASQPGPGKPPLQQTGPVKQAPPQS-GPTKPPPQTAGATKAPVPQPGLTKPPGQQPEPEKPSQQKQAS 442
Cdd:PRK08691   445 TAAGTAQTSAKSIQTASEAETPPENQVSKNKAADNETdAPLSEVPSENPIQATPNDEAVETETFAHEAPAEPFYGYGFPD 524

                   ....*.
gi 2022791524  443 ATQPAE 448
Cdd:PRK08691   525 NDCPPE 530
PTZ00436 PTZ00436
60S ribosomal protein L19-like protein; Provisional
309-450 2.57e-03

60S ribosomal protein L19-like protein; Provisional


Pssm-ID: 185616 [Multi-domain]  Cd Length: 357  Bit Score: 43.40  E-value: 2.57e-03
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2022791524  309 SARTSSQAPPPAKPSSQQLGPVKQPSQQPARQGGPVKPSSQQAGPPKQPSQQP--GPEKPSAQQMGPAKQASQPGPGKPP 386
Cdd:PTZ00436   198 AAKQKAAAKKAAAPSGKKSAKAAAPAKAAAAPAKAAAPPAKAAAAPAKAAAAPakAAAPPAKAAAPPAKAAAPPAKAAAP 277
                           90       100       110       120       130       140
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*.
gi 2022791524  387 lqqtgPVKQAPPQSGPTKPPPQ--TAGATKAPVPQPGLTKPPGQQPEPEKPSQQKQASATQPAESA 450
Cdd:PTZ00436   278 -----PAKAAAPPAKAAAPPAKaaAAPAKAAAAPAKAAAAPAKAAAPPAKAAAPPAKAATPPAKAA 338
PRK10905 PRK10905
cell division protein DamX; Validated
238-417 2.67e-03

cell division protein DamX; Validated


Pssm-ID: 236792 [Multi-domain]  Cd Length: 328  Bit Score: 43.39  E-value: 2.67e-03
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2022791524  238 QQQQPGEPKQIQKPgpghPADSKLEQVKQPPQPRGPQKSQPQ---QSEPTKPVQQQTSAKPSAGPAkpLPQQPDSARTSS 314
Cdd:PRK10905    61 EQTAGNTQQDVSLP----PISSTPTQGQTPVATDGQQRVEVQgdlNNALTQPQNQQQLNNVAVNST--LPTEPATVAPVR 134
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2022791524  315 QAPPPAKPSSQQLGPvKQPSQQPARQGG---PVKPSSQQAGPPKQPSQQPGPEKPSAQQMGPAKQASQPGPGKPPLQQTG 391
Cdd:PRK10905   135 NGNASRQTAKTQTAE-RPATTRPARKQAviePKKPQATAKTEPKPVAQTPKRTEPAAPVASTKAPAATSTPAPKETATTA 213
                          170       180
                   ....*....|....*....|....*.
gi 2022791524  392 PVKQAPPQSgpTKPPPQTAGATKAPV 417
Cdd:PRK10905   214 PVQTASPAQ--TTATPAAGGKTAGNV 237
PRK07764 PRK07764
DNA polymerase III subunits gamma and tau; Validated
267-663 2.80e-03

DNA polymerase III subunits gamma and tau; Validated


Pssm-ID: 236090 [Multi-domain]  Cd Length: 824  Bit Score: 44.21  E-value: 2.80e-03
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2022791524  267 PPQPRGPQKSQPQQSEPTKPVQQQTSAKPSAGPAKPLPQQPDSArtsSQAPPPAKPSSQQLGPVKQPSQQPARQGGPVKP 346
Cdd:PRK07764   395 AAAAPSAAAAAPAAAPAPAAAAPAAAAAPAPAAAPQPAPAPAPA---PAPPSPAGNAPAGGAPSPPPAAAPSAQPAPAPA 471
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2022791524  347 SSQQAGPPKQPSQQPGPEKPSAQQMGPAKQASQPGPGKPPLQQ------------------------------------- 389
Cdd:PRK07764   472 AAPEPTAAPAPAPPAAPAPAAAPAAPAAPAAPAGADDAATLRErwpeilaavpkrsrktwaillpeatvlgvrgdtlvlg 551
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2022791524  390 --TGPVKQAPPQSGPTK---------------------PPPQTAGATKAPVPQPGLTKPPGQQPEPEKPSQQKQASATQP 446
Cdd:PRK07764   552 fsTGGLARRFASPGNAEvlvtalaeelggdwqveavvgPAPGAAGGEGPPAPASSGPPEEAARPAAPAAPAAPAAPAPAG 631
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2022791524  447 AESAPkktfcplcttTELLLHTPEKANYNTCTQCHTvvcslcgfnpnphiteikewlclncqmqralgGDLAPGHGPGPQ 526
Cdd:PRK07764   632 AAAAP----------AEASAAPAPGVAAPEHHPKHV--------------------------------AVPDASDGGDGW 669
                          330       340       350       360       370       380       390       400
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2022791524  527 lpppkqktptPPSTAkpppqpqpgqkkDASPKPDPAQLAEPKKPPPQKKQPSMPGSPPvkAKTSRAEPADTSQQIDPAPK 606
Cdd:PRK07764   670 ----------PAKAG------------GAAPAAPPPAPAPAAPAAPAGAAPAQPAPAP--AATPPAGQADDPAAQPPQAA 725
                          410       420       430       440       450
                   ....*....|....*....|....*....|....*....|....*....|....*..
gi 2022791524  607 SDRAKPSQAEDKQkQPSIQKPAADTVLTPAAPGLKQDSAGPRPPPTQQKVTDSPKPE 663
Cdd:PRK07764   726 QGASAPSPAADDP-VPLPPEPDDPPDPAGAPAQPPPPPAPAPAAAPAAAPPPSPPSE 781
PDZ1_APBA1_3-like cd06720
PDZ domain 1 of amyloid-beta A4 precursor protein-binding family A member 1 (APBA1), APBA2, ...
4543-4608 3.06e-03

PDZ domain 1 of amyloid-beta A4 precursor protein-binding family A member 1 (APBA1), APBA2, APBA3, and related domains; PDZ (PSD-95 (Postsynaptic density protein 95), Dlg (Discs large protein), and ZO-1 (Zonula occludens-1)) domain 1 of APBA1, APBA2, APBA3, and related domains. The APBA/X11/Mint protein family includes three members: neuron specific APBA1 (also known as X11alpha and Mint1) and APBA2 (also known as X11beta and Mint2), and the ubiquitously expressed APBA3 (also known as (X12gamma and Mint3). They are involved in regulating neuronal signaling, trafficking and plasticity. They contain two PDZ domains (PDZ1 and PDZ2) which bind a variety of proteins: Arf GTPases (APBA1 and APBA2 PDZ2) and neurexin (APBA1 and APBA2 PDZ1 and 2), which are involved in vesicle docking and exocytosis; alpha1B subunit of N-type Ca2+ channel (APBA1 PDZ1) that is involved in ion channels; KIF17 (APBA1 PDZ1) that is involved in transport and traffic; and Alzheimer's disease related proteins such as APP (APBA3 PDZ2), CCS (APBA1 PDZ2), NF-kappa-B/p65 (APBA2 PDZ2), presenilin-1 (APBA1 and APBA2 PDZ1 and PDZ2). PDZ domains usually bind in a sequence-specific manner to short peptide sequences located at the C-terminal end of their partner proteins (known as PDZ binding motifs). The PDZ superfamily includes canonical PDZ domains as well as those with circular permutations and domain swapping mediated by beta-strands. This APBA1,2,3-like family domain is a canonical PDZ domain containing six beta-strands A-F and two alpha-helices (alpha-helix 1 and 2), arranged as beta-strands A, B, C, alpha-helix 1, beta-strands D, E, alpha-helix 2 and beta- strand F.


Pssm-ID: 467203 [Multi-domain]  Cd Length: 86  Bit Score: 39.55  E-value: 3.06e-03
                           10        20        30        40        50        60        70
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|...
gi 2022791524 4543 GNGLGIRVVggkeipgssgETG-------AYIAKILPGGNAEQTGKLIEGMQVLEWNGIPLTGKTYEEVQNII 4608
Cdd:cd06720     10 GEILGVVIV----------ESGwgsllptVVVANMMPGGPAARSGKLNIGDQIMSINGTSLVGLPLSTCQAII 72
DUF4813 pfam16072
Domain of unknown function (DUF4813); This family of proteins is functionally uncharacterized. ...
318-433 3.15e-03

Domain of unknown function (DUF4813); This family of proteins is functionally uncharacterized. This family of proteins is found in eukaryotes. Proteins in this family are typically between 345 and 672 amino acids in length.


Pssm-ID: 435117 [Multi-domain]  Cd Length: 288  Bit Score: 42.82  E-value: 3.15e-03
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2022791524  318 PPAKPSSQQLGP----VKQPSQQPARQGGPVKPSSQQAGPPKQPSQQPGPeKPSAQQM--------GPAKQASQPGPGKP 385
Cdd:pfam16072  145 PPGSVTTTSAGSgttvINAGGQQPAAPAAPAYPVAPAAYPAQAPAAAPAP-APGAPQTplaplnpvAAAPAAAAGAAAAP 223
                           90       100       110       120
                   ....*....|....*....|....*....|....*....|....*....
gi 2022791524  386 PLQQTGPVKQAPPQSGPTKPPPQtagatkAPVPQP-GLTKPPGQQPEPE 433
Cdd:pfam16072  224 VVAAAAPAAAAPPPPAPAAPPAD------AAPPAPgGIICVPVRVPEPD 266
G_path_suppress pfam15991
G-protein pathway suppressor; This family of proteins inhibits G-protein- and ...
277-422 3.28e-03

G-protein pathway suppressor; This family of proteins inhibits G-protein- and mitogen-activated protein kinase-mediated signal transduction.


Pssm-ID: 464961 [Multi-domain]  Cd Length: 272  Bit Score: 42.99  E-value: 3.28e-03
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2022791524  277 QPQQSEPTKPVQQQTSAKPSAGPAKPLPQQPDSARTSSQAPPPAKPSSQQLGPvkQPSQQPAR-----QGGPVKPSSQQA 351
Cdd:pfam15991  113 LPQLSMQGQPHHQQHPGPQVGVLKRTRSPSPPVQQQAYYKQPAFSPGYAEHGQ--QKHDDGRRgydvaRFGSWNKSTAQY 190
                           90       100       110       120       130       140       150
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*..
gi 2022791524  352 GPPKQ---PSQQPGPEKPSAQQMGPAKQASQPGPGKP-PLQQtgPVKQAPPQSG-PTKPPPQTAGATKA-PVPQPGL 422
Cdd:pfam15991  191 PPSGQlfyPTHQYLPPPQTQGQADARLQTIYPQPGYAlPLQQ--QYEHANQPSPfVSSSPLKQMQSPKAgPGPQPMQ 265
PDZ1_PTPN13-like cd23072
PDZ domain 1 of protein tyrosine phosphatase non-receptor type 13 (PTPN13), and related ...
4530-4608 3.43e-03

PDZ domain 1 of protein tyrosine phosphatase non-receptor type 13 (PTPN13), and related domains; PDZ (PSD-95 (Postsynaptic density protein 95), Dlg (Discs large protein), and ZO-1 (Zonula occludens-1)) domain 1 of PTPN13 [also known as Fas-associated protein-tyrosine phosphatase 1 (FAP-1), protein-tyrosine phosphatase 1E (PTP-E1), and protein-tyrosine phosphatase (PTPL1)], and related domains. PTPN13 regulates negative apoptotic signaling and mediates phosphoinositide 3-kinase (PI3K) signaling. PTPN13 has five PDZ domains. Proteins known to interact with PTPN13 PDZ domains include: PLEKHA1 and PLEKHA2 via PTPN13-PDZ domain 1, Fas receptor and thyroid receptor-interacting protein 6 via PTPN13-PDZ domain 2, nerve growth factor receptor and protein kinase N2 via PTPN13-PDZ domain 3, PDZ and LIM domain 4 (PDLIM4) via PTPN13-PDZ domains 2 and 4, and brain calpain-2 via PTPN13-PDZ domains 3, 4 and 5. Calpain-2-mediated PTPN13 fragments may be involved in abnormal tau aggregation and increased risk for Alzheimer's disease. PDZ domains usually bind in a sequence-specific manner to short peptide sequences located at the C-terminal end of their partner proteins (known as PDZ binding motifs). The PDZ superfamily includes canonical PDZ domains as well as those with circular permutations and domain swapping mediated by beta-strands. This PTPN13 family PDZ1 domain is a canonical PDZ domain containing six beta-strands A-F and two alpha-helices (alpha-helix 1 and 2), arranged in the order: beta-strands A, B, C, alpha-helix 1, beta-strands D, E, alpha-helix 2 and beta-strand F.


Pssm-ID: 467285 [Multi-domain]  Cd Length: 92  Bit Score: 39.78  E-value: 3.43e-03
                           10        20        30        40        50        60        70
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....
gi 2022791524 4530 LKLLRDPKdhtvsgNGLGIRVVGGkEIPGSSgETGAYIAKILPGGNAEQTGKLIEGMQVLEWNGIPLTGKTYEEVQNII 4608
Cdd:cd23072      5 VNLKKDAK------YGLGFQIVGG-EKSGRL-DLGIFISSITPGGPADLDGRLKPGDRLISVNDVSLEGLSHDAAVEIL 75
PTZ00108 PTZ00108
DNA topoisomerase 2-like protein; Provisional
1426-1669 3.52e-03

DNA topoisomerase 2-like protein; Provisional


Pssm-ID: 240271 [Multi-domain]  Cd Length: 1388  Bit Score: 43.88  E-value: 3.52e-03
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2022791524 1426 ESSKKDTVSQESPPSPSDLAKLESTVLSILEAQASTLSDEKSVKKKELYETYSEQTKDQHKTKPLPV--TPESYSSDEED 1503
Cdd:PTZ00108  1148 EEKEIAKEQRLKSKTKGKASKLRKPKLKKKEKKKKKSSADKSKKASVVGNSKRVDSDEKRKLDDKPDnkKSNSSGSDQED 1227
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2022791524 1504 LEGEGTIVEDGKGSASSQAEYKEGDGEDDIPARRQRYDSVEDSSESENSPVPRRKRRGSVGSSSSDEYKRDDSQGSGDEE 1583
Cdd:PTZ00108  1228 DEEQKTKPKKSSVKRLKSKKNNSSKSSEDNDEFSSDDLSKEGKPKNAPKRVSAVQYSPPPPSKRPDGESNGGSKPSSPTK 1307
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2022791524 1584 DFIRKQIIEMSADEDASGSEDDEFIRNQlkeisvtESQKKEDVKSKAKGTAGKHRRMARKSSAGYDEDAGRRHSWHDDDD 1663
Cdd:PTZ00108  1308 KKVKKRLEGSLAALKKKKKSEKKTARKK-------KSKTRVKQASASQSSRLLRRPRKKKSDSSSEDDDDSEVDDSEDED 1380

                   ....*.
gi 2022791524 1664 ETFDES 1669
Cdd:PTZ00108  1381 DEDDED 1386
PHA03379 PHA03379
EBNA-3A; Provisional
208-425 3.93e-03

EBNA-3A; Provisional


Pssm-ID: 223066 [Multi-domain]  Cd Length: 935  Bit Score: 43.51  E-value: 3.93e-03
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2022791524  208 PAQQQSPKPVQPQGHVKPTPQQSESSKA-----VPQQQQPGEPKQIQKPGPGHPADSKLEQVKQPPQPRGPQKSQPQQSE 282
Cdd:PHA03379   617 SPQQPMEYPLEPEQQMFPGSPFSQVADVmraggVPAMQPQYFDLPLQQPISQGAPLAPLRASMGPVPPVPATQPQYFDIP 696
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2022791524  283 PTKPVQQQTSA------KPSAGPAKPlPQQPDSARTSSQAPPPAKPSSQQLGpvkQPSQQPARQGGPVKPSSQQagPPkq 356
Cdd:PHA03379   697 LTEPINQGASAahflpqQPMEGPLVP-ERWMFQGATLSQSVRPGVAQSQYFD---LPLTQPINHGAPAAHFLHQ--PP-- 768
                          170       180       190       200       210       220       230
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2022791524  357 psqQPGPEKPSAQQMgpakQASQPGPGKPPLQ-QTGPVKQAPPQSGPtKPPPQTAGATKAPVPQPGLTKP 425
Cdd:PHA03379   769 ---MEGPWVPEQWMF----QGAPPSQGTDVVQhQLDALGYVLHVLNH-PGVPVSPAVNQYHVSQAAFGLP 830
PDZ2_GRIP1-2-like cd06681
PDZ domain 2 of glutamate receptor-interacting protein 1 (GRIP1) and GRIP2, and related ...
4570-4615 4.00e-03

PDZ domain 2 of glutamate receptor-interacting protein 1 (GRIP1) and GRIP2, and related domains; PDZ (PSD-95 (Postsynaptic density protein 95), Dlg (Discs large protein), and ZO-1 (Zonula occludens-1)) domain of alpha-amino-3-hydroxy-5-methyl-4-isoxazolepropionic acid receptor (AMPAR) binding proteins GRIP1 (ABP/GRIP2) and GRIP2, and related domains. GRIP1 and GRIP2 each have 7 PDZ domains. The interaction of GRIP1 and GRIP2 with GluA2/3 (AMPAR subunit) regulates AMPAR trafficking and synaptic targeting. GRIP1 has an essential role in regulating AMPAR trafficking during synaptic plasticity and learning and memory. GRIP1 and GRIP2 interact with a variety of other proteins associated with protein trafficking and internalization, for example GRIP1 also interacts with KIF5 (also known as kinesin 1), EphB receptors, scaffold protein liprin-alpha, and the rasGEF GRASP-1. PDZ domains usually bind in a sequence-specific manner to short peptide sequences located at the C-terminal end of their partner proteins (known as PDZ binding motifs). The PDZ superfamily includes canonical PDZ domains as well as those with circular permutations and domain swapping mediated by beta-strands. This GRIP family PDZ2 domain is a canonical PDZ domain containing six beta-strands A-F and two alpha-helices (alpha-helix 1 and 2), arranged in the order: beta-strands A, B, C, alpha-helix 1, beta-strands D, E, alpha-helix 2 and beta-strand F.


Pssm-ID: 467169 [Multi-domain]  Cd Length: 89  Bit Score: 39.52  E-value: 4.00e-03
                           10        20        30        40
                   ....*....|....*....|....*....|....*....|....*.
gi 2022791524 4570 ILPGGNAEQTGKLIEGMQVLEWNGIPLTGKTYEEVQNIIIQQSGEA 4615
Cdd:cd06681     37 VRPGGPADREGTIKPGDRLLSVDGISLHGATHAEAMSILKQCGQEA 82
CCDC47 pfam07946
PAT complex subunit CCDC47; This family represents CCDC47 proteins which are a component of ...
3782-3845 4.23e-03

PAT complex subunit CCDC47; This family represents CCDC47 proteins which are a component of the PAT complex, an endoplasmic reticulum (ER)-resident membrane multiprotein complex that facilitates multi-pass membrane proteins insertion into membranes. The PAT complex, formed by CCDC47 and Asterix proteins, acts as an intramembrane chaperone by directly interacting with nascent transmembrane domains (TMDs), releasing its substrates upon correct folding, and is needed for optimal biogenesis of multi-pass membrane proteins. CCDC47 is required to maintain the stability of Asterix. CCDC47 is associated with various membrane-associated processes and is component of a ribosome-associated ER translocon complex involved in multi-pass membrane protein transport into the ER membrane and biogenesis. It is also involved in the regulation of calcium ion homeostasis in the ER, being also required for proper protein degradation via the ERAD (ER-associated degradation) pathway.


Pssm-ID: 462322  Cd Length: 323  Bit Score: 42.55  E-value: 4.23e-03
                           10        20        30        40        50        60
                   ....*....|....*....|....*....|....*....|....*....|....*....|....
gi 2022791524 3782 ARAKILQDIDRELDLVERESAKLRKKQAELDEEEKEIDaklrylEMGINRRKEALLKEREKRER 3845
Cdd:pfam07946  265 TREEEIEKIKKAAEEERAEEAQEKKEEAKKKEREEKLA------KLSPEEQRKYEEKERKKEQR 322
PDZ3_PDZD7-like cd06751
PDZ domain 3 of the canonical isoform 1 of PDZ domain containing 7 (PDZD7), and related ...
4546-4608 4.23e-03

PDZ domain 3 of the canonical isoform 1 of PDZ domain containing 7 (PDZD7), and related domains; PDZ (PSD-95 (Postsynaptic density protein 95), Dlg (Discs large protein), and ZO-1 (Zonula occludens-1)) domain 3 of the long isoform 1 of PDZD7, and related domains. PDZD7 is critical for the organization of the Usher syndrome type 2 (USH2) complex. Usher syndrome is the leading cause of hereditary sensory deaf-blindness in humans; USH2 is the most common sub-type. Formation of the USH2 complex is based upon heterodimerization between PDZD7 and whirlin (another PDZ domain-containing protein) and a subsequent dynamic interplay between USH2 proteins via their multiple PDZ domains. The PDZD7 PDZ2 domain binds GPR98 (also known as VLGR1) and usherin (USH2A). PDZD7 and whirlin form heterodimers through their multiple PDZ domains; whirlin and PDZD7 interact with usherin and GPR98 to form an interdependent ankle link complex. PDZD7 also interacts with myosin VIIa and can also form homodimers through its PDZ2 domain. Various isoforms of PDZD7 produced by alternative splicing have been identified; this subgroup includes the third PDZ domain of the canonical isoform of PDZD7- isoform 1. PDZ domains usually bind in a sequence-specific manner to short peptide sequences located at the C-terminal end of their partner proteins (known as PDZ binding motifs). The PDZ superfamily includes canonical PDZ domains as well as those with circular permutations and domain swapping mediated by beta-strands. This PDZD7-like family PDZ3 domain is a canonical PDZ domain containing six beta-strands A-F and two alpha-helices (alpha-helix 1 and 2), arranged in the order: beta-strands A, B, C, alpha-helix 1, beta-strands D, E, alpha-helix 2 and beta-strand F.


Pssm-ID: 467233 [Multi-domain]  Cd Length: 89  Bit Score: 39.34  E-value: 4.23e-03
                           10        20        30        40        50        60
                   ....*....|....*....|....*....|....*....|....*....|....*....|...
gi 2022791524 4546 LGIRVVGGKEipgSSGETGAYIAKILPGGNAEQTGKLIEGMQVLEWNGIPLTGKTYEEVQNII 4608
Cdd:cd06751     13 LGISISGGIE---SKVQPVVKIEKIFPGGAAALSGNLKAGYELVSVDGESLQQVTHQQAVDII 72
PRK14950 PRK14950
DNA polymerase III subunits gamma and tau; Provisional
344-434 4.42e-03

DNA polymerase III subunits gamma and tau; Provisional


Pssm-ID: 237864 [Multi-domain]  Cd Length: 585  Bit Score: 43.26  E-value: 4.42e-03
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2022791524  344 VKPSSQQAGPPKQPSQQPGPEKPsAQQMGPAKQASQPGPGKPPLQQTG-PVKQAPPQSGPTKPPPQTAGATKAPVPQPGL 422
Cdd:PRK14950   361 VPVPAPQPAKPTAAAPSPVRPTP-APSTRPKAAAAANIPPKEPVRETAtPPPVPPRPVAPPVPHTPESAPKLTRAAIPVD 439
                           90
                   ....*....|..
gi 2022791524  423 TKPPGQQPEPEK 434
Cdd:PRK14950   440 EKPKYTPPAPPK 451
PTZ00436 PTZ00436
60S ribosomal protein L19-like protein; Provisional
274-450 4.58e-03

60S ribosomal protein L19-like protein; Provisional


Pssm-ID: 185616 [Multi-domain]  Cd Length: 357  Bit Score: 42.63  E-value: 4.58e-03
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2022791524  274 QKSQPQQSEPTKPVQQQTSAKPSAGPakplpqqpdSARTSSQAPPPAKPSSQQLGPVKQPSQQPARQGGPVKPSSQQAGP 353
Cdd:PTZ00436   186 ERARREDAAAAAAAKQKAAAKKAAAP---------SGKKSAKAAAPAKAAAAPAKAAAPPAKAAAAPAKAAAAPAKAAAP 256
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2022791524  354 PKQPSQQP--GPEKPSAQQMGPAKQASQPGPGKPPlqqtgPVKQAPPQSGPTKPPPQTAGATKAPVPQPGLTKPPgqqpe 431
Cdd:PTZ00436   257 PAKAAAPPakAAAPPAKAAAPPAKAAAPPAKAAAP-----PAKAAAAPAKAAAAPAKAAAAPAKAAAPPAKAAAP----- 326
                          170
                   ....*....|....*....
gi 2022791524  432 PEKPSQQKQASATQPAESA 450
Cdd:PTZ00436   327 PAKAATPPAKAAAPPAKAA 345
PABP-1234 TIGR01628
polyadenylate binding protein, human types 1, 2, 3, 4 family; These eukaryotic proteins ...
186-296 4.62e-03

polyadenylate binding protein, human types 1, 2, 3, 4 family; These eukaryotic proteins recognize the poly-A of mRNA and consists of four tandem RNA recognition domains at the N-terminus (rrm: pfam00076) followed by a PABP-specific domain (pfam00658) at the C-terminus. The protein is involved in the transport of mRNA's from the nucleus to the cytoplasm. There are four paralogs in Homo sapiens which are expressed in testis, platelets, broadly expressed and of unknown tissue range.


Pssm-ID: 130689 [Multi-domain]  Cd Length: 562  Bit Score: 43.26  E-value: 4.62e-03
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2022791524  186 RKQKVIQKEQGKPEEQRGSAKLPAQQQSPKPVQPQGHVKPTPQQSESSKAVPQQQQPgepkqiqkPGPGHPADSKLEQVK 265
Cdd:TIGR01628  401 YGQGPQQQFNGQPLGWPRMSMMPTPMGPGGPLRPNGLAPMNAVRAPSRNAQNAAQKP--------PMQPVMYPPNYQSLP 472
                           90       100       110
                   ....*....|....*....|....*....|.
gi 2022791524  266 QPPQPRGPQKSQPQQSEPTKPVQQQTSAKPS 296
Cdd:TIGR01628  473 LSQDLPQPQSTASQGGQNKKLAQVLASATPQ 503
DUF6162 pfam19659
Family of unknown function (DUF6162); This family of proteins is functionally uncharacterized. ...
317-445 5.20e-03

Family of unknown function (DUF6162); This family of proteins is functionally uncharacterized. This family of proteins is found in bacteria. Proteins in this family are approximately 190 amino acids in length.


Pssm-ID: 437491  Cd Length: 240  Bit Score: 42.13  E-value: 5.20e-03
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2022791524  317 PPPAKPSSQQLGPVKQPSQQPARQGGPVKPSSQQAGPPKQPSQQPGPEKPSAQQmGPAKQASQPGPGKPPLQQTGPVKQA 396
Cdd:pfam19659   94 PPPTPDNWAAGGWPPFADDLAARQRGALRWALRQAGQDYAYVSQPANAMPAAPT-APANAASMPGPSAATPRAMLWRLLA 172
                           90       100       110       120
                   ....*....|....*....|....*....|....*....|....*....
gi 2022791524  397 PPQSGPTKPPPqtagatkAPVPQPGLTKPPGQqpePEKPSQQKQASATQ 445
Cdd:pfam19659  173 AARSGPNSPPP-------LDVWLRATSAAPGQ---PELPGALDDASLTR 211
PRK12323 PRK12323
DNA polymerase III subunit gamma/tau;
557-746 5.28e-03

DNA polymerase III subunit gamma/tau;


Pssm-ID: 237057 [Multi-domain]  Cd Length: 700  Bit Score: 42.94  E-value: 5.28e-03
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2022791524  557 PKPDPAQLAEPKKPPPQKKQPSMPGSPPVKAKTSRAePADTSQQIDPAPKSDRAkpSQAEDKQKQPSIQKPAADTVLTPA 636
Cdd:PRK12323   383 AQPAPAAAAPAAAAPAPAAPPAAPAAAPAAAAAARA-VAAAPARRSPAPEALAA--ARQASARGPGGAPAPAPAPAAAPA 459
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2022791524  637 aPGLKQDSAGPRPPPtqqkVTDSPKPELAKPSQDTHPAGEKPdsKPLPQVSRQKSDPKLSAQPGARSEAKTQKPVEPAQV 716
Cdd:PRK12323   460 -AAARPAAAGPRPVA----AAAAAAPARAAPAAAPAPADDDP--PPWEELPPEFASPAPAQPDAAPAGWVAESIPDPATA 532
                          170       180       190
                   ....*....|....*....|....*....|
gi 2022791524  717 KDDPKKLQTKPSPKPETKPAPKGLQPGAGP 746
Cdd:PRK12323   533 DPDDAFETLAPAPAAAPAPRAAAATEPVVA 562
PRK14951 PRK14951
DNA polymerase III subunits gamma and tau; Provisional
250-382 5.29e-03

DNA polymerase III subunits gamma and tau; Provisional


Pssm-ID: 237865 [Multi-domain]  Cd Length: 618  Bit Score: 43.16  E-value: 5.29e-03
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2022791524  250 KPGPGHPADSKLEQ--VKQPPQPRGPQKSQPQQSEPTKPVQQQTSAKPSAGPAKPLPQQPDSARTSSQAPPPAKPSSQQl 327
Cdd:PRK14951   365 KPAAAAEAAAPAEKktPARPEAAAPAAAPVAQAAAAPAPAAAPAAAASAPAAPPAAAPPAPVAAPAAAAPAAAPAAAPA- 443
                           90       100       110       120       130
                   ....*....|....*....|....*....|....*....|....*....|....*
gi 2022791524  328 gPVKQPSQQPARQGGPVKPSSQQAGPPKQPSQQPGPekPSAQQMGPAKQASQPGP 382
Cdd:PRK14951   444 -AVALAPAPPAQAAPETVAIPVRVAPEPAVASAAPA--PAAAPAAARLTPTEEGD 495
Amelogenin smart00818
Amelogenins, cell adhesion proteins, play a role in the biomineralisation of teeth; They seem ...
314-407 5.63e-03

Amelogenins, cell adhesion proteins, play a role in the biomineralisation of teeth; They seem to regulate formation of crystallites during the secretory stage of tooth enamel development and are thought to play a major role in the structural organisation and mineralisation of developing enamel. The extracellular matrix of the developing enamel comprises two major classes of protein: the hydrophobic amelogenins and the acidic enamelins. Circular dichroism studies of porcine amelogenin have shown that the protein consists of 3 discrete folding units: the N-terminal region appears to contain beta-strand structures, while the C-terminal region displays characteristics of a random coil conformation. Subsequent studies on the bovine protein have indicated the amelogenin structure to contain a repetitive beta-turn segment and a "beta-spiral" between Gln112 and Leu138, which sequester a (Pro, Leu, Gln) rich region. The beta-spiral offers a probable site for interactions with Ca2+ ions. Muatations in the human amelogenin gene (AMGX) cause X-linked hypoplastic amelogenesis imperfecta, a disease characterised by defective enamel. A 9bp deletion in exon 2 of AMGX results in the loss of codons for Ile5, Leu6, Phe7 and Ala8, and replacement by a new threonine codon, disrupting the 16-residue (Met1-Ala16) amelogenin signal peptide.


Pssm-ID: 197891 [Multi-domain]  Cd Length: 165  Bit Score: 40.93  E-value: 5.63e-03
                            10        20        30        40        50        60        70        80
                    ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2022791524   314 SQAPPPAKPSSQQLGPVKQPSQQPARQGGPVKPSSQQagPPKQPSQQPGPEKPSAQQMGPAKQASQPGPGKPPLQQTGPV 393
Cdd:smart00818   43 SQQHPPTHTLQPHHHIPVLPAQQPVVPQQPLMPVPGQ--HSMTPTQHHQPNLPQPAQQPFQPQPLQPPQPQQPMQPQPPV 120
                            90
                    ....*....|....
gi 2022791524   394 KQAPPQSGPTKPPP 407
Cdd:smart00818  121 HPIPPLPPQPPLPP 134
PHA01929 PHA01929
putative scaffolding protein
322-432 5.98e-03

putative scaffolding protein


Pssm-ID: 177328  Cd Length: 306  Bit Score: 42.35  E-value: 5.98e-03
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2022791524  322 PSSQQLGP-------VKQPSQQPARQGGPVKPSSQqagpPKQPSQQPGPekpsaQQMGPAKQASQPGPGKPPLQQTGPvk 394
Cdd:PHA01929     3 QNEQQLPPglaglvaNVPPAAAPTPQPNPVIQPQA----PVQPGQPGAP-----QQLAIPTQQPQPVPTSAMTPHVVQ-- 71
                           90       100       110
                   ....*....|....*....|....*....|....*...
gi 2022791524  395 QAPPQSGPTKPPPQtagatkAPVPQPGLTKPPGQQPEP 432
Cdd:PHA01929    72 QAPAQPAPAAPPAA------GAALPEALEVPPPPAFTP 103
PRK10905 PRK10905
cell division protein DamX; Validated
256-452 6.11e-03

cell division protein DamX; Validated


Pssm-ID: 236792 [Multi-domain]  Cd Length: 328  Bit Score: 42.23  E-value: 6.11e-03
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2022791524  256 PADSKLEQVKQPPQPRGPQKSQPQQSEPTKPVQQQTSAKPSAGPAKPLPQQPDSARTSSQAPPPAKPSSQQL----GPVK 331
Cdd:PRK10905    23 PSTSSSDQTASGEKSIDLAGNATDQANGVQPAPGTTSAEQTAGNTQQDVSLPPISSTPTQGQTPVATDGQQRvevqGDLN 102
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2022791524  332 QPSQQPARQG---------------GPVKPSSQQAGPPKQPSQQPGPEKPSAQqmgPAKQASQPGPGKPplQQTGPVKQA 396
Cdd:PRK10905   103 NALTQPQNQQqlnnvavnstlptepATVAPVRNGNASRQTAKTQTAERPATTR---PARKQAVIEPKKP--QATAKTEPK 177
                          170       180       190       200       210
                   ....*....|....*....|....*....|....*....|....*....|....*..
gi 2022791524  397 PPQSGPTKPPPQTAGAT-KAPVPQpgltkppgQQPEPEKPSQQKQASATQPAESAPK 452
Cdd:PRK10905   178 PVAQTPKRTEPAAPVAStKAPAAT--------STPAPKETATTAPVQTASPAQTTAT 226
Tymo_45kd_70kd pfam03251
Tymovirus 45/70Kd protein; Tymoviruses are single stranded RNA viruses. This family includes a ...
222-457 6.44e-03

Tymovirus 45/70Kd protein; Tymoviruses are single stranded RNA viruses. This family includes a protein of unknown function that has been named based on its molecular weight. Tymoviruses such as the ononis yellow mosaic tymovirus encode only three proteins. Of these two are overlapping this protein overlaps a larger ORF that is thought to be the polymerase.


Pssm-ID: 281269 [Multi-domain]  Cd Length: 468  Bit Score: 42.47  E-value: 6.44e-03
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2022791524  222 HVKPTPQQSESSKAVPQQQQPG-EPKQIQKPGPGHPADSKLEQVKqpPQPRGPQKSQPQQSEPTKPVQQQT------SAK 294
Cdd:pfam03251  198 HSSLRPRGSRSRQLQPTVRRPLlAPNQFHSPRQPPPLSDDPGILG--PRPLAPHSTRDPPPRPITPGPSNThdlrplSVL 275
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2022791524  295 PSAGPAKPLPQQPDSARTSSQAPPPAKPSSQQLGPVKQPSQQPARQGGPVKPSSQQAGPPKQPSQQPGPekpsaqQMGPA 374
Cdd:pfam03251  276 PRTSPRRGLLPNPRRHRTSTGHIPPTTTSRPTGPPSRLQRPVHLYQSSPHTPNFRPSSIRKDALLQTGP------RLGHL 349
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2022791524  375 KQASQPGPGKPPLQQTGPVKQAPPQSGPTKPPPQTAGATKAPV-----PQPGLTKPPGQQPEPEKPSQQKQASATQPAES 449
Cdd:pfam03251  350 ERLGQPANLRTSERSPPTKRRLPRSSEPNRLPKPLPEATLAPSyrhrrPYPLLPNPPAALPSIAYTSSRGKIHHSLPKGA 429

                   ....*...
gi 2022791524  450 APKKTFCP 457
Cdd:pfam03251  430 LPKEGAPP 437
PHA03269 PHA03269
envelope glycoprotein C; Provisional
268-382 6.68e-03

envelope glycoprotein C; Provisional


Pssm-ID: 165527 [Multi-domain]  Cd Length: 566  Bit Score: 42.79  E-value: 6.68e-03
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2022791524  268 PQPRGPQKSQPQQSEPTKPVQQQTSAKP--SAGPAKPLPQQPDSArtssQAPPPAkpSSQQLGPVKQPSQQPARQGGPVK 345
Cdd:PHA03269    27 PIPELHTSAATQKPDPAPAPHQAASRAPdpAVAPTSAASRKPDLA----QAPTPA--ASEKFDPAPAPHQAASRAPDPAV 100
                           90       100       110       120
                   ....*....|....*....|....*....|....*....|
gi 2022791524  346 PSSQQAGPPKQPSQQP--GPEKPSAQQMGPAKQASQ-PGP 382
Cdd:PHA03269   101 APQLAAAPKPDAAEAFtsAAQAHEAPADAGTSAASKkPDP 140
Herpes_BLLF1 pfam05109
Herpes virus major outer envelope glycoprotein (BLLF1); This family consists of the BLLF1 ...
254-449 7.01e-03

Herpes virus major outer envelope glycoprotein (BLLF1); This family consists of the BLLF1 viral late glycoprotein, also termed gp350/220. It is the most abundantly expressed glycoprotein in the viral envelope of the Herpesviruses and is the major antigen responsible for stimulating the production of neutralising antibodies in vivo.


Pssm-ID: 282904 [Multi-domain]  Cd Length: 886  Bit Score: 42.60  E-value: 7.01e-03
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2022791524  254 GHPADSKLEQVKQPPQPRGPQKSQPQQSEPTKpvqqqTSAKPSAGPAKPLPQQPDSArTSSQAPPPAKPSSQQLGPVKQP 333
Cdd:pfam05109  447 GLPSSTHVPTNLTAPASTGPTVSTADVTSPTP-----AGTTSGASPVTPSPSPRDNG-TESKAPDMTSPTSAVTTPTPNA 520
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2022791524  334 SQQPARQGGPVKPSSQQAGPPKQPSQQPGPEKPSAQQMGPAKQASQPGPGKPPLQQTGPV--------KQAPPQSGPTKP 405
Cdd:pfam05109  521 TSPTPAVTTPTPNATSPTLGKTSPTSAVTTPTPNATSPTPAVTTPTPNATIPTLGKTSPTsavttptpNATSPTVGETSP 600
                          170       180       190       200
                   ....*....|....*....|....*....|....*....|....*
gi 2022791524  406 PPQTAGATKAPVPQ-PGLTKPPGQQPEPEKPSQQKQASATQPAES 449
Cdd:pfam05109  601 QANTTNHTLGGTSStPVVTSPPKNATSAVTTGQHNITSSSTSSMS 645
kgd PRK12270
multifunctional oxoglutarate decarboxylase/oxoglutarate dehydrogenase thiamine ...
372-451 8.34e-03

multifunctional oxoglutarate decarboxylase/oxoglutarate dehydrogenase thiamine pyrophosphate-binding subunit/dihydrolipoyllysine-residue succinyltransferase subunit;


Pssm-ID: 237030 [Multi-domain]  Cd Length: 1228  Bit Score: 42.57  E-value: 8.34e-03
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2022791524  372 GPAKQASQPGPGKPPLQQTGPVKQAPPQSGPTKPPPQTAGATKAPVPQPGLTKPPGQQPEPEKPSQQKQASATQPAESAP 451
Cdd:PRK12270    40 STAAPTAAAAAAAAAASAPAAAPAAKAPAAPAPAPPAAAAPAAPPKPAAAAAAAAAPAAPPAAAAAAAPAAAAVEDEVTP 119
SP2_N cd22540
N-terminal domain of transcription factor Specificity Protein (SP) 2; Specificity Proteins ...
213-469 8.64e-03

N-terminal domain of transcription factor Specificity Protein (SP) 2; Specificity Proteins (SPs) are transcription factors that are involved in many cellular processes, including cell differentiation, cell growth, apoptosis, immune responses, response to DNA damage, and chromatin remodeling. SP2 contains the least conserved DNA-binding domain within the SP subfamily of proteins, and its DNA sequence specificity differs from the other SP proteins. It localizes primarily within subnuclear foci associated with the nuclear matrix, and can activate, or in some cases, repress expression from different promoters. The transcription factor SP2 serves as a paradigm for indirect genomic binding. It does not require its DNA-binding domain for genomic DNA binding and occupies target promoters independently of whether they contain a cognate DNA-binding motif. SP2 belongs to a family of proteins, called the SP/Kruppel or Krueppel-like Factor (KLF) family, characterized by a C-terminal DNA-binding domain of 81 amino acids consisting of three Kruppel-like C2H2 zinc fingers. These factors bind to a loose consensus motif, namely NNRCRCCYY (where N is any nucleotide; R is A/G, and Y is C/T), such as the recurring motifs in GC and GT boxes (5'-GGGGCGGGG-3' and 5-GGTGTGGGG-3') that are present in promoters and more distal regulatory elements of mammalian genes. SP factors preferentially bind GC boxes, while KLFs bind CACCC boxes. Another characteristic hallmark of SP factors is the presence of the Buttonhead (BTD) box CXCPXC, just N-terminal to the zinc fingers. The function of the BTD box is unknown, but it is thought to play an important physiological role. Another feature of most SP factors is the presence of a conserved amino acid stretch, the so-called SP box, located close to the N-terminus. SP factors may be separated into three groups based on their domain architecture and the similarity of their N-terminal transactivation domains: SP1-4, SP5, and SP6-9. The transactivation domains between the three groups are not homologous to one another. SP1-4 have similar N-terminal transactivation domains characterized by glutamine-rich regions, which, in most cases, have adjacent serine/threonine-rich regions. This model represents the N-terminal domain of SP2.


Pssm-ID: 411776 [Multi-domain]  Cd Length: 511  Bit Score: 42.22  E-value: 8.64e-03
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2022791524  213 SPKPVQP----QGHVKPTPQQSESSKAVPQ---------QQQPGepkQIQKPGPGHPadsKLEQVKQPPQPRGPQKSQPQ 279
Cdd:cd22540     48 PPAPPQPtprkLVPIKPAPLPLGPGKNSIGflsakgniiQLQGS---QLSSSAPGGQ---QVFAIQNPTMIIKGSQTRSS 121
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2022791524  280 QSEPTKPVQQ-QTSAKPSAG---------------PAKPLPQQPDSARTSSQAPPPAKPSSQQLGPVKQPSQQPARQGGP 343
Cdd:cd22540    122 TNQQYQISPQiQAAGQINNSgqiqiipgtnqaiitPVQVLQQPQQAHKPVPIKPAPLQTSNTNSASLQVPGNVIKLQSGG 201
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2022791524  344 VK----PSSQQAGPPKQPSQQPGPEKPSAQQMGPAKQASQPGPGKPPLQ---QTGPVKQAPP---QSGPTKPPPQTAGAT 413
Cdd:cd22540    202 NValtlPVNNLVGTQDGATQLQLAAAPSKPSKKIRKKSAQAAQPAVTVAeqvETVLIETTADniiQAGNNLLIVQSPGTG 281
                          250       260       270       280       290       300
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*..
gi 2022791524  414 KAPVPQPGLTKPPGQQPE----PEKPSQQKQ-ASATQPaeSAPKKTFCPL------CTTTELLLHTP 469
Cdd:cd22540    282 QPAVLQQVQVLQPKQEQQvvqiPQQALRVVQaASATLP--TVPQKPLQNIqiqnsePTPTQVYIKTP 346
PRK14949 PRK14949
DNA polymerase III subunits gamma and tau; Provisional
176-323 8.87e-03

DNA polymerase III subunits gamma and tau; Provisional


Pssm-ID: 237863 [Multi-domain]  Cd Length: 944  Bit Score: 42.40  E-value: 8.87e-03
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2022791524  176 DSDTAHEEAGRKQKVIQKEQGKPEEQRGSAKLPAQQQSPKPVQPQGHvKPTPQQSESSKAVPQQQQPGEPKQ-------- 247
Cdd:PRK14949   633 EGDGKKSSADRKPKTPPSRAPPASLSKPASSPDASQTSASFDLDPDF-ELATHQSVPEAALASGSAPAPPPVpdpydrpp 711
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2022791524  248 -----IQKPGPGHPADSKLEQVKQPPQPRGPQKSQPQQSEPTKPVQQQTSAKPSAgpAKPLPQQPDSARTSSQAPPPAKP 322
Cdd:PRK14949   712 weeapEVASANDGPNNAAEGNLSESVEDASNSELQAVEQQATHQPQVQAEAQSPA--STTALTQTSSEVQDTELNLVLLS 789

                   .
gi 2022791524  323 S 323
Cdd:PRK14949   790 S 790
kgd PRK12270
multifunctional oxoglutarate decarboxylase/oxoglutarate dehydrogenase thiamine ...
381-451 9.15e-03

multifunctional oxoglutarate decarboxylase/oxoglutarate dehydrogenase thiamine pyrophosphate-binding subunit/dihydrolipoyllysine-residue succinyltransferase subunit;


Pssm-ID: 237030 [Multi-domain]  Cd Length: 1228  Bit Score: 42.57  E-value: 9.15e-03
                           10        20        30        40        50        60        70
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|.
gi 2022791524  381 GPGKPPLQQTGPVKQAPPQSGPTKPPPQTAGATKAPVPQPGLTKPPGQQPEPEKPSQQKQASATQPAESAP 451
Cdd:PRK12270    37 GPGSTAAPTAAAAAAAAAASAPAAAPAAKAPAAPAPAPPAAAAPAAPPKPAAAAAAAAAPAAPPAAAAAAA 107
PRK14954 PRK14954
DNA polymerase III subunits gamma and tau; Provisional
336-454 9.46e-03

DNA polymerase III subunits gamma and tau; Provisional


Pssm-ID: 184918 [Multi-domain]  Cd Length: 620  Bit Score: 42.24  E-value: 9.46e-03
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2022791524  336 QPARQGGPVKPSSQQAGPPKQPSQQPGPEKPSAQQMGPAKQASqpgpgkpPLQQTGPVKQAP-PQSGPTKPPPQTAGATK 414
Cdd:PRK14954   381 APSPAGSPDVKKKAPEPDLPQPDRHPGPAKPEAPGARPAELPS-------PASAPTPEQQPPvARSAPLPPSPQASAPRN 453
                           90       100       110       120
                   ....*....|....*....|....*....|....*....|
gi 2022791524  415 APVPQPGLTKPPGQQPEPEKPSQQKQASATQPAESAPKKT 454
Cdd:PRK14954   454 VASGKPGVDLGSWQGKFMNFTRNGSRKQPVQASSSDAAQT 493
PRK12757 PRK12757
cell division protein FtsN; Provisional
339-454 9.52e-03

cell division protein FtsN; Provisional


Pssm-ID: 237191 [Multi-domain]  Cd Length: 256  Bit Score: 41.18  E-value: 9.52e-03
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2022791524  339 RQGGPVKPSSQQAGPPKQPSQQPGPE-KPSAQQM-GPAKQASQPGPGKPPLQQTGPVKQA----PPQSGPTKPPPQTagA 412
Cdd:PRK12757    58 RQIGVPTPTEPSAGGEVNSPTQLTDEqRQLLEQMqADMRQQPTQLSEVPYNEQTPQVPRStvqiQQQAQQQQPPATT--A 135
                           90       100       110       120
                   ....*....|....*....|....*....|....*....|..
gi 2022791524  413 TKAPVPQPGLTKPPgQQPEPEKPSQQKQASATQPAESAPKKT 454
Cdd:PRK12757   136 QPQPVTPPRQTTAP-VQPQTPAPVRTQPAAPVTQAVEAPKVE 176
Med25_NR-box pfam11244
Mediator complex subunit 25 C-terminal NR box-containing; The overall function of the ...
333-422 9.63e-03

Mediator complex subunit 25 C-terminal NR box-containing; The overall function of the full-length Med25 is efficiently to coordinate the transcriptional activation of RAR/RXR (retinoic acid receptor/retinoic X receptor) in higher eukaryotic cells. Human Med25 consists of several domains with different binding properties, the N-terminal, VWA, domain, an SD1 - synapsin 1 - domain from residues 229-381, a PTOV(B) or ACID domain from 395-545, an SD2 domain from residues 564-645 and this C-terminal NR box-containing domain (646-650) from C69-747. The NR box of MED25 is critical for its recruitment to the promoter, probably through an interaction with pre bound RAR.


Pssm-ID: 463246 [Multi-domain]  Cd Length: 89  Bit Score: 38.48  E-value: 9.63e-03
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2022791524  333 PSQQPARQggpvkpsSQQAGPPKQPSQQPGPEKPsaqQMGPakqasqpgpgkpPLQQTGPVKQAPPQSGPTKPPPQT--- 409
Cdd:pfam11244    5 QTQQPLHH-------MQQAAQGMLPHQHQAPGQQ---QLGQ------------TLMHQAPAQSWGAQLPPRAPLPGQmll 62
                           90
                   ....*....|...
gi 2022791524  410 AGATKAPVPQPGL 422
Cdd:pfam11244   63 SGGPRGPVPQPGL 75
 
Blast search parameters
Data Source: Precalculated data, version = cdd.v.3.21
Preset Options:Database: CDSEARCH/cdd   Low complexity filter: no  Composition Based Adjustment: yes   E-value threshold: 0.01

References:

  • Wang J et al. (2023), "The conserved domain database in 2023", Nucleic Acids Res.51(D)384-8.
  • Lu S et al. (2020), "The conserved domain database in 2020", Nucleic Acids Res.48(D)265-8.
  • Marchler-Bauer A et al. (2017), "CDD/SPARCLE: functional classification of proteins via subfamily domain architectures.", Nucleic Acids Res.45(D)200-3.
Help | Disclaimer | Write to the Help Desk
NCBI | NLM | NIH