NCBI Home Page NCBI Site Search page NCBI Guide that lists and describes the NCBI resources
Conserved domains on  [gi|334187096|ref|NP_001119101|]
View 

Sec23/Sec24 protein transport family protein [Arabidopsis thaliana]

Protein Classification

SEC24 family transport protein( domain architecture ID 1001573)

SEC24 family transport protein is a component of the coat protein complex II (COPII) which promotes the formation of transport vesicles from the endoplasmic reticulum (ER)

Graphical summary

 Zoom to residue level

show extra options »

Show site features     Horizontal zoom: ×

List of domain hits

Name Accession Description Interval E-value
COG5028 super family cl34873
Vesicle coat complex COPII, subunit SEC24/subunit SFB2/subunit SFB3 [Intracellular trafficking ...
273-1076 1.39e-176

Vesicle coat complex COPII, subunit SEC24/subunit SFB2/subunit SFB3 [Intracellular trafficking and secretion];


The actual alignment was detected with superfamily member COG5028:

Pssm-ID: 227361 [Multi-domain]  Cd Length: 861  Bit Score: 538.99  E-value: 1.39e-176
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 334187096  273 AVSGLPYGPPSAQVAPPLGFPGQMQPPRYGMGPLPNQSMTNIP---TAMGQPGATVPGPSRIDP---------------- 333
Cdd:COG5028    18 TGAASSKKSARPHRAYANFSAGQMGMPPYTTPPLQQQSRRQIDqaaTAMHNTGANNPAPSVMSPafqsqqkfsspyggsm 97
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 334187096  334 --NQIPRPGSSSSPTVFETRQ----SNQANPP----PPATSDYVVRDTGNCSPRYMRCTINQIPCTVDLLSTSGMQLALM 403
Cdd:COG5028    98 adGTAPKPTNPLVPVDLFEDQpppiSDLFLPPppivPPLTTNFVGSEQSNCSPKYVRSTMYAIPETNDLLKKSKIPFGLV 177
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 334187096  404 VQPLALSHPSEEPIQVVDfgEGGPVRCSRCKGYINPFMKFIDQGRKFICNFCGYTDETPRDYHCNLGPDGRRRDVDERPE 483
Cdd:COG5028   178 IRPFLELYPEEDPVPLVE--DGSIVRCRRCRSYINPFVQFIEQGRKWRCNICRSKNDVPEGFDNPSGPNDPRSDRYSRPE 255
                         250       260       270       280       290       300       310       320
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 334187096  484 LCRGTVEFVATKEYMVRDPMPAVYFFLIDVSMNAIQTGATAAACNAIQQVLSDLPE-GPRTFVGIATFDSTIHFYNLKRA 562
Cdd:COG5028   256 LKSGVVDFLAPKEYSLRQPPPPVYVFLIDVSFEAIKNGLVKAAIRAILENLDQIPNfDPRTKIAIICFDSSLHFFKLSPD 335
                         330       340       350       360       370       380       390       400
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 334187096  563 LQQPlMLIVPDVQDVYTPLET-DVVVQLSECRQHLELLLDSIPTMFQESKIPESAFGAAVKAAFLAMKSKGGKLMVFQSI 641
Cdd:COG5028   336 LDEQ-MLIVSDLDEPFLPFPSgLFVLPLKSCKQIIETLLDRVPRIFQDNKSPKNALGPALKAAKSLIGGTGGKIIVFLST 414
                         410       420       430       440       450       460       470       480
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 334187096  642 LCSVGVGALSSReaegranmsagEKEAHKLLQPADKTLKTMAIEFAEYQVCVDIFITTQAYVDMASISVIPRTTGGQVYC 721
Cdd:COG5028   415 LPNMGIGKLQLR-----------EDKESSLLSCKDSFYKEFAIECSKVGISVDLFLTSEDYIDVATLSHLCRYTGGQTYF 483
                         490       500       510       520       530       540       550       560
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 334187096  722 YYPFSA--LSDPPKLYNDLKWNITRPQGFEAVMRVRCSQGIQVQEYSGNFCKRIPTDIDLPA------------HDDKLQ 787
Cdd:COG5028   484 YPNFSAtrPNDATKLANDLVSHLSMEIGYEAVMRVRCSTGLRVSSFYGNFFNRSSDLCAFSTmprdtsllvefsIDEKLM 563
                         570       580       590       600       610       620       630       640
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 334187096  788 dGAECAFQCALLYTTIYGERRIRVTTLSLSCTNMLSNLFRAADLDSQFACMLKQAANEIPSKALPLVKEQATNSCINALY 867
Cdd:COG5028   564 -TSDVYFQVALLYTLNDGERRIRVVNLSLPTSSSIREVYASADQLAIACILAKKASTKALNSSLKEARVLINKSMVDILK 642
                         650       660       670       680       690       700       710       720
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 334187096  868 AYRKFCATVTSSGQLILPEALKLFPLYTLALTKSVGLRTDG-RIDDRSFWINYVSSLSTPLAIPLVYPRMISVHDL---- 942
Cdd:COG5028   643 AYKKELVKSNTSTQLPLPANLKLLPLLMLALLKSSAFRSGStPSDIRISALNRLTSLPLKQLMRNIYPTLYALHDMpiea 722
                         730       740       750       760       770       780       790       800
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 334187096  943 DVKDTEGSVLPPPIPLSSEHISNEGVYFLENGEDGLLFVGESVDSDILQKLFAVSSAAEIPN-QFVLQQYDNQLSKKFND 1021
Cdd:COG5028   723 GLPDEGLLVLPSPINATSSLLESGGLYLIDTGQKIFLWFGKDAVPSLLQDLFGVDSLSDIPSgKFTLPPTGNEFNERVRN 802
                         810       820       830       840       850
                  ....*....|....*....|....*....|....*....|....*....|....*...
gi 334187096 1022 AVNEIRRQ-RCSYLRIKLCKKG-EPSGML-FLSYMVEDRTASGPSYVEFLVQVHRQIQ 1076
Cdd:COG5028   803 IIGELRSVnDDSTLPLVLVRGGgDPSLRLwFFSTLVEDKTLNIPSYLDYLQILHEKIK 860
PHA03247 super family cl33720
large tegument protein UL36; Provisional
65-345 1.12e-18

large tegument protein UL36; Provisional


The actual alignment was detected with superfamily member PHA03247:

Pssm-ID: 223021 [Multi-domain]  Cd Length: 3151  Bit Score: 92.69  E-value: 1.12e-18
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 334187096   65 PQPFPQQSPSYGAPQRGPSPMSRPGPPAGMARPGGPPPVSQPAgfQSNVPLNRPTGPPSRQPsfgsRPSMPGGPVAQPAA 144
Cdd:PHA03247 2725 PAAARQASPALPAAPAPPAVPAGPATPGGPARPARPPTTAGPP--APAPPAAPAAGPPRRLT----RPAVASLSESRESL 2798
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 334187096  145 SSSGFPAFGPSGSVAAGP--PPGSRPmAFGSPPPVGSGMSMP--PSGMIGGPVSNGHQMVGSGGFPR--GTQFPGAAVTT 218
Cdd:PHA03247 2799 PSPWDPADPPAAVLAPAAalPPAASP-AGPLPPPTSAQPTAPppPPGPPPPSLPLGGSVAPGGDVRRrpPSRSPAAKPAA 2877
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 334187096  219 PQAPYVRPPSAPYARTPPQPLGSHSLSGNPPLTPFTAPSMPPPATFPGAPHGRPAVSglPYGPPSAQVAPPLGFPGQMQP 298
Cdd:PHA03247 2878 PARPPVRRLARPAVSRSTESFALPPDQPERPPQPQAPPPPQPQPQPPPPPQPQPPPP--PPPRPQPPLAPTTDPAGAGEP 2955
                         250       260       270       280       290
                  ....*....|....*....|....*....|....*....|....*....|..
gi 334187096  299 PryGMGPLPnqsmtniptamgQPGATVPGP-----SRIDPNQIPRPGSSSSP 345
Cdd:PHA03247 2956 S--GAVPQP------------WLGALVPGRvavprFRVPQPAPSREAPASST 2993
 
Name Accession Description Interval E-value
COG5028 COG5028
Vesicle coat complex COPII, subunit SEC24/subunit SFB2/subunit SFB3 [Intracellular trafficking ...
273-1076 1.39e-176

Vesicle coat complex COPII, subunit SEC24/subunit SFB2/subunit SFB3 [Intracellular trafficking and secretion];


Pssm-ID: 227361 [Multi-domain]  Cd Length: 861  Bit Score: 538.99  E-value: 1.39e-176
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 334187096  273 AVSGLPYGPPSAQVAPPLGFPGQMQPPRYGMGPLPNQSMTNIP---TAMGQPGATVPGPSRIDP---------------- 333
Cdd:COG5028    18 TGAASSKKSARPHRAYANFSAGQMGMPPYTTPPLQQQSRRQIDqaaTAMHNTGANNPAPSVMSPafqsqqkfsspyggsm 97
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 334187096  334 --NQIPRPGSSSSPTVFETRQ----SNQANPP----PPATSDYVVRDTGNCSPRYMRCTINQIPCTVDLLSTSGMQLALM 403
Cdd:COG5028    98 adGTAPKPTNPLVPVDLFEDQpppiSDLFLPPppivPPLTTNFVGSEQSNCSPKYVRSTMYAIPETNDLLKKSKIPFGLV 177
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 334187096  404 VQPLALSHPSEEPIQVVDfgEGGPVRCSRCKGYINPFMKFIDQGRKFICNFCGYTDETPRDYHCNLGPDGRRRDVDERPE 483
Cdd:COG5028   178 IRPFLELYPEEDPVPLVE--DGSIVRCRRCRSYINPFVQFIEQGRKWRCNICRSKNDVPEGFDNPSGPNDPRSDRYSRPE 255
                         250       260       270       280       290       300       310       320
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 334187096  484 LCRGTVEFVATKEYMVRDPMPAVYFFLIDVSMNAIQTGATAAACNAIQQVLSDLPE-GPRTFVGIATFDSTIHFYNLKRA 562
Cdd:COG5028   256 LKSGVVDFLAPKEYSLRQPPPPVYVFLIDVSFEAIKNGLVKAAIRAILENLDQIPNfDPRTKIAIICFDSSLHFFKLSPD 335
                         330       340       350       360       370       380       390       400
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 334187096  563 LQQPlMLIVPDVQDVYTPLET-DVVVQLSECRQHLELLLDSIPTMFQESKIPESAFGAAVKAAFLAMKSKGGKLMVFQSI 641
Cdd:COG5028   336 LDEQ-MLIVSDLDEPFLPFPSgLFVLPLKSCKQIIETLLDRVPRIFQDNKSPKNALGPALKAAKSLIGGTGGKIIVFLST 414
                         410       420       430       440       450       460       470       480
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 334187096  642 LCSVGVGALSSReaegranmsagEKEAHKLLQPADKTLKTMAIEFAEYQVCVDIFITTQAYVDMASISVIPRTTGGQVYC 721
Cdd:COG5028   415 LPNMGIGKLQLR-----------EDKESSLLSCKDSFYKEFAIECSKVGISVDLFLTSEDYIDVATLSHLCRYTGGQTYF 483
                         490       500       510       520       530       540       550       560
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 334187096  722 YYPFSA--LSDPPKLYNDLKWNITRPQGFEAVMRVRCSQGIQVQEYSGNFCKRIPTDIDLPA------------HDDKLQ 787
Cdd:COG5028   484 YPNFSAtrPNDATKLANDLVSHLSMEIGYEAVMRVRCSTGLRVSSFYGNFFNRSSDLCAFSTmprdtsllvefsIDEKLM 563
                         570       580       590       600       610       620       630       640
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 334187096  788 dGAECAFQCALLYTTIYGERRIRVTTLSLSCTNMLSNLFRAADLDSQFACMLKQAANEIPSKALPLVKEQATNSCINALY 867
Cdd:COG5028   564 -TSDVYFQVALLYTLNDGERRIRVVNLSLPTSSSIREVYASADQLAIACILAKKASTKALNSSLKEARVLINKSMVDILK 642
                         650       660       670       680       690       700       710       720
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 334187096  868 AYRKFCATVTSSGQLILPEALKLFPLYTLALTKSVGLRTDG-RIDDRSFWINYVSSLSTPLAIPLVYPRMISVHDL---- 942
Cdd:COG5028   643 AYKKELVKSNTSTQLPLPANLKLLPLLMLALLKSSAFRSGStPSDIRISALNRLTSLPLKQLMRNIYPTLYALHDMpiea 722
                         730       740       750       760       770       780       790       800
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 334187096  943 DVKDTEGSVLPPPIPLSSEHISNEGVYFLENGEDGLLFVGESVDSDILQKLFAVSSAAEIPN-QFVLQQYDNQLSKKFND 1021
Cdd:COG5028   723 GLPDEGLLVLPSPINATSSLLESGGLYLIDTGQKIFLWFGKDAVPSLLQDLFGVDSLSDIPSgKFTLPPTGNEFNERVRN 802
                         810       820       830       840       850
                  ....*....|....*....|....*....|....*....|....*....|....*...
gi 334187096 1022 AVNEIRRQ-RCSYLRIKLCKKG-EPSGML-FLSYMVEDRTASGPSYVEFLVQVHRQIQ 1076
Cdd:COG5028   803 IIGELRSVnDDSTLPLVLVRGGgDPSLRLwFFSTLVEDKTLNIPSYLDYLQILHEKIK 860
Sec24-like cd01479
Sec24-like: Protein and membrane traffic in eukaryotes is mediated by at least in part by the ...
502-746 3.44e-115

Sec24-like: Protein and membrane traffic in eukaryotes is mediated by at least in part by the budding and fusion of intracellular transport vesicles that selectively carry cargo proteins and lipids from donor to acceptor organelles. The two main classes of vesicular carriers within the endocytic and the biosynthetic pathways are COP- and clathrin-coated vesicles. Formation of COPII vesicles requires the ordered assembly of the coat built from several cytosolic components GTPase Sar1, complexes of Sec23-Sec24 and Sec13-Sec31. The process is initiated by the conversion of GDP to GTP by the GTPase Sar1 which then recruits the heterodimeric complex of Sec23 and Sec24. This heterodimeric complex generates the pre-budding complex. The final step leading to membrane deformation and budding of COPII-coated vesicles is carried by the heterodimeric complex Sec13-Sec31. The members of this CD belong to the Sec23-like family. Sec 24 is very similar to Sec23. The Sec23 and Sec24 polypeptides fold into five distinct domains: a beta-barrel, a zinc finger, a vWA or trunk, an all helical region and a carboxy Gelsolin domain. The members of this subgroup carry a partial MIDAS motif and have the overall Para-Rossmann type fold that is characteristic of this superfamily.


Pssm-ID: 238756 [Multi-domain]  Cd Length: 244  Bit Score: 355.81  E-value: 3.44e-115
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 334187096  502 PMPAVYFFLIDVSMNAIQTGATAAACNAIQQVLSDLP-EGPRTFVGIATFDSTIHFYNLKRALQQPLMLIVPDVQDVYTP 580
Cdd:cd01479     1 PQPAVYVFLIDVSYNAIKSGLLATACEALLSNLDNLPgDDPRTRVGFITFDSTLHFFNLKSSLEQPQMMVVSDLDDPFLP 80
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 334187096  581 LETDVVVQLSECRQHLELLLDSIPTMFQESKIPESAFGAAVKAAFLAMKSKGGKLMVFQSILCSVGVGALSSREAEGraN 660
Cdd:cd01479    81 LPDGLLVNLKESRQVIEDLLDQIPEMFQDTKETESALGPALQAAFLLLKETGGKIIVFQSSLPTLGAGKLKSREDPK--L 158
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 334187096  661 MSAGEKEAHklLQPADKTLKTMAIEFAEYQVCVDIFITTQAYVDMASISVIPRTTGGQVYCY--YPFSALSDPPKLYNDL 738
Cdd:cd01479   159 LSTDKEKQL--LQPQTDFYKKLALECVKSQISVDLFLFSNQYVDVATLGCLSRLTGGQVYYYpsFNFSAPNDVEKLVNEL 236

                  ....*...
gi 334187096  739 KWNITRPQ 746
Cdd:cd01479   237 ARYLTRKI 244
Sec23_trunk pfam04811
Sec23/Sec24 trunk domain; COPII-coated vesicles carry proteins from the endoplasmic reticulum ...
502-742 2.30e-104

Sec23/Sec24 trunk domain; COPII-coated vesicles carry proteins from the endoplasmic reticulum to the Golgi complex. This vesicular transport can be reconstituted by using three cytosolic components containing five proteins: the small GTPase Sar1p, the Sec23p/24p complex, and the Sec13p/Sec31p complex. This domain is known as the trunk domain and has an alpha/beta vWA fold and forms the dimer interface.


Pssm-ID: 398467 [Multi-domain]  Cd Length: 241  Bit Score: 326.90  E-value: 2.30e-104
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 334187096   502 PMPAVYFFLIDVSMNAIQTGATAAACNAIQQVLSDLPEGPRTFVGIATFDSTIHFYNLKRALQQPLMLIVPDVQDVYTPL 581
Cdd:pfam04811    1 PQPPVFLFVIDVSYNAIKSGLLAALKESLLQSLDLLPGDPRARVGFITFDSTVHFFNLGSSLRQPQMLVVSDLQDMFLPL 80
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 334187096   582 ETDVVVQLSECRQHLELLLDSIPTMFQESKIPESAFGAAVKAAFLAMKS--KGGKLMVFQSILCSVGV-GALSSREAEgr 658
Cdd:pfam04811   81 PDRFLVPLSECRFVLEDLLEQLPPMFPVTKRPERCLGPALQAAFLLLKAafTGGKIMVFQGGLPTVGPgGKLKSRLDE-- 158
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 334187096   659 aNMSAGEKEAHKLLQPADKTLKTMAIEFAEYQVCVDIFITTQAYVDMASISVIPRTTGGQVYCYYPFSALSDPPKLYNDL 738
Cdd:pfam04811  159 -SHHGTDKEKAKLVKKADKFYKSLAKECVKQGHSVDLFAFSLDYVDVATLGQLSRLTGGQVYLYPSFQADVDGSKFKQDL 237

                   ....
gi 334187096   739 KWNI 742
Cdd:pfam04811  238 QRYF 241
PTZ00395 PTZ00395
Sec24-related protein; Provisional
499-1080 4.57e-43

Sec24-related protein; Provisional


Pssm-ID: 185594 [Multi-domain]  Cd Length: 1560  Bit Score: 171.80  E-value: 4.57e-43
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 334187096  499 VRDPMPAVYFFLIDVSMNAIQTGATAAACNAIQQVLSDLpEGPRTFVGIATFDSTIHFYNLKRALQQPL----------- 567
Cdd:PTZ00395  947 VKNMLPPYFVFVVECSYNAIYNNITYTILEGIRYAVQNV-KCPQTKIAIITFNSSIYFYHCKGGKGVSGeegdggggsgn 1025
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 334187096  568 --MLIVPDVQDVYTPLE-TDVVVQLSECRQHLELLLDSIPTMFQESKIPESAFGAAVKAAFLAMKSKGG--KLMVFQSIL 642
Cdd:PTZ00395 1026 hqVIVMSDVDDPFLPLPlEDLFFGCVEEIDKINTLIDTIKSVSTTMQSYGSCGNSALKIAMDMLKERNGlgSICMFYTTT 1105
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 334187096  643 CSVGVGALssreaegranmsageKEAHKLLQPADKTLK------TMAIEFAEYQVCVDIFI--TTQAYVDMASISVIPRT 714
Cdd:PTZ00395 1106 PNCGIGAI---------------KELKKDLQENFLEVKqkifydSLLLDLYAFNISVDIFIisSNNVRVCVPSLQYVAQN 1170
                         250       260       270       280       290       300       310       320
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 334187096  715 TGGQVYCYYPFSALSDPPKLYNDLKWNITRPQ-GFEAVMRVRCSQGIQVQEY---SGNFCKRIPTD-IDLPA--HD---- 783
Cdd:PTZ00395 1171 TGGKILFVENFLWQKDYKEIYMNIMDTLTSEDiAYCCELKLRYSHHMSVKKLfccNNNFNSIISVDtIKIPKirHDqtfa 1250
                         330       340       350       360       370       380       390       400
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 334187096  784 ------DKLQDGAECAFQCALLYTTIYGERRIRVTTLSLSCTNMLSNLFRAADLDSQFACMLKQAANEIpskalpLVKEQ 857
Cdd:PTZ00395 1251 fllnysDISESKKQIYFQCACIYTNLWGDRFVRLHTTHMNLTSSLSTVFRYTDAEALMNILIKQLCTNI------LHNDN 1324
                         410       420       430       440       450       460       470       480
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 334187096  858 ATNSCINA----LYAYRKFCATVTSSGQLILPEALKLFPLYTLALTKSVGLRTDGRIDDRSFwiNYVSSLSTPLAIPL-- 931
Cdd:PTZ00395 1325 YSKIIIDNlaaiLFSYRINCASSAHSGQLILPDTLKLLPLFTSSLLKHNVTKKEILHDLKVY--SLIKLLSMPIISSLly 1402
                         490       500       510       520       530       540       550       560
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 334187096  932 VYPRMISVH------DLDVKDTEGSV-LPPPIPLSSEHISNEGVYFLENGEDGLLFVGESVDSDilqklFAVSSAAEIPN 1004
Cdd:PTZ00395 1403 VYPVMYVIHikgktnEIDSMDVDDDLfIPKTIPSSAEKIYSNGIYLLDACTHFYLYFGFHSDAN-----FAKEIVGDIPT 1477
                         570       580       590       600       610       620       630       640
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 334187096 1005 QfvlqqyDNQLSKKFNDAVNEIRRQRC--SYLRIKLCKKGEPSGML---------FLSYMVEDRTASGPSYVEFLVQVHR 1073
Cdd:PTZ00395 1478 E------KNAHELNLTDTPNAQKVQRIikNLSRIHHFNKYVPLVMVapksneeehLISLCVEDKADKEYSYVNFLCFIHK 1551

                  ....*..
gi 334187096 1074 QIQLKMN 1080
Cdd:PTZ00395 1552 LVHKRID 1558
PHA03247 PHA03247
large tegument protein UL36; Provisional
65-345 1.12e-18

large tegument protein UL36; Provisional


Pssm-ID: 223021 [Multi-domain]  Cd Length: 3151  Bit Score: 92.69  E-value: 1.12e-18
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 334187096   65 PQPFPQQSPSYGAPQRGPSPMSRPGPPAGMARPGGPPPVSQPAgfQSNVPLNRPTGPPSRQPsfgsRPSMPGGPVAQPAA 144
Cdd:PHA03247 2725 PAAARQASPALPAAPAPPAVPAGPATPGGPARPARPPTTAGPP--APAPPAAPAAGPPRRLT----RPAVASLSESRESL 2798
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 334187096  145 SSSGFPAFGPSGSVAAGP--PPGSRPmAFGSPPPVGSGMSMP--PSGMIGGPVSNGHQMVGSGGFPR--GTQFPGAAVTT 218
Cdd:PHA03247 2799 PSPWDPADPPAAVLAPAAalPPAASP-AGPLPPPTSAQPTAPppPPGPPPPSLPLGGSVAPGGDVRRrpPSRSPAAKPAA 2877
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 334187096  219 PQAPYVRPPSAPYARTPPQPLGSHSLSGNPPLTPFTAPSMPPPATFPGAPHGRPAVSglPYGPPSAQVAPPLGFPGQMQP 298
Cdd:PHA03247 2878 PARPPVRRLARPAVSRSTESFALPPDQPERPPQPQAPPPPQPQPQPPPPPQPQPPPP--PPPRPQPPLAPTTDPAGAGEP 2955
                         250       260       270       280       290
                  ....*....|....*....|....*....|....*....|....*....|..
gi 334187096  299 PryGMGPLPnqsmtniptamgQPGATVPGP-----SRIDPNQIPRPGSSSSP 345
Cdd:PHA03247 2956 S--GAVPQP------------WLGALVPGRvavprFRVPQPAPSREAPASST 2993
Atrophin-1 pfam03154
Atrophin-1 family; Atrophin-1 is the protein product of the dentatorubral-pallidoluysian ...
63-367 2.90e-11

Atrophin-1 family; Atrophin-1 is the protein product of the dentatorubral-pallidoluysian atrophy (DRPLA) gene. DRPLA OMIM:125370 is a progressive neurodegenerative disorder. It is caused by the expansion of a CAG repeat in the DRPLA gene on chromosome 12p. This results in an extended polyglutamine region in atrophin-1, that is thought to confer toxicity to the protein, possibly through altering its interactions with other proteins. The expansion of a CAG repeat is also the underlying defect in six other neurodegenerative disorders, including Huntington's disease. One interaction of expanded polyglutamine repeats that is thought to be pathogenic is that with the short glutamine repeat in the transcriptional coactivator CREB binding protein, CBP. This interaction draws CBP away from its usual nuclear location to the expanded polyglutamine repeat protein aggregates that are characteriztic of the polyglutamine neurodegenerative disorders. This interferes with CBP-mediated transcription and causes cytotoxicity.


Pssm-ID: 460830 [Multi-domain]  Cd Length: 991  Bit Score: 67.87  E-value: 2.90e-11
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 334187096    63 QSPQPFPQQSPSYGAPQRGPSPMSRPGPPAGMarpggPPPVSQPAGFQSNVPLNRPTGPPSRQPSfgsrPSMPGGPVAQP 142
Cdd:pfam03154  185 SPPPPGTTQAATAGPTPSAPSVPPQGSPATSQ-----PPNQTQSTAAPHTLIQQTPTLHPQRLPS----PHPPLQPMTQP 255
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 334187096   143 AASSSGFPAFGPSGSVAAGPPPGSRPMAFGsppPVGSGMSMPPSGMIGGPVSNGHQmvgsGGFPRGTQFPGAAVTTPQAP 222
Cdd:pfam03154  256 PPPSQVSPQPLPQPSLHGQMPPMPHSLQTG---PSHMQHPVPPQPFPLTPQSSQSQ----VPPGPSPAAPGQSQQRIHTP 328
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 334187096   223 yvRPPSAPYARTPP--QPLgshslsgnpPLTPFTAPSMPPPATFPGAPHGRPAVSGlpyGPPSAQVAPPLGFPGQMQPPR 300
Cdd:pfam03154  329 --PSQSQLQSQQPPreQPL---------PPAPLSMPHIKPPPTTPIPQLPNPQSHK---HPPHLSGPSPFQMNSNLPPPP 394
                          250       260       270       280       290       300
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*..
gi 334187096   301 yGMGPLPNQSMTNIPTAMGQPGATVPGPSRIDPNQIPRPGSSSSPTVfetrQSNQANPPPPATSDYV 367
Cdd:pfam03154  395 -ALKPLSSLSTHHPPSAHPPPLQLMPQSQQLPPPPAQPPVLTQSQSL----PPPAASHPPTSGLHQV 456
PPE COG5651
PPE-repeat protein [Function unknown];
90-308 2.37e-05

PPE-repeat protein [Function unknown];


Pssm-ID: 444372 [Multi-domain]  Cd Length: 385  Bit Score: 47.97  E-value: 2.37e-05
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 334187096   90 PPAGMARPGGPPPVSqpagfqsnvplNRPTGPPSRQPSFGSRPSMPGGPVAQPAASSSGFPAFGPSGSVAAGPPPGSRPM 169
Cdd:COG5651   170 PPPTITNPGGLLGAQ-----------NAGSGNTSSNPGFANLGLTGLNQVGIGGLNSGSGPIGLNSGPGNTGFAGTGAAA 238
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 334187096  170 AFGSPPPVGSGMSMPPSGMIGGPVSNGHQMVGSGGFPRGT--QFPGAAVTTPQAP--YVRPPSAPYARTPPQPLGSHSLS 245
Cdd:COG5651   239 GAAAAAAAAAAAAGAGASAALASLAATLLNASSLGLAATAasSAATNLGLAGSPLglAGGGAGAAAATGLGLGAGGAAGA 318
                         170       180       190       200       210       220
                  ....*....|....*....|....*....|....*....|....*....|....*....|...
gi 334187096  246 GNPPLTPFTAPSMPPPATfPGAPHGRPAVSGLPYGPPSAQVAPPLGFPGQMQPPRYGMGPLPN 308
Cdd:COG5651   319 AGATGAGAALGAGAAAAA-AGAAAGAGAAAAAAAGGAGGGGGGALGAGGGGGSAGAAAGAASG 380
PABP-1234 TIGR01628
polyadenylate binding protein, human types 1, 2, 3, 4 family; These eukaryotic proteins ...
63-172 4.02e-04

polyadenylate binding protein, human types 1, 2, 3, 4 family; These eukaryotic proteins recognize the poly-A of mRNA and consists of four tandem RNA recognition domains at the N-terminus (rrm: pfam00076) followed by a PABP-specific domain (pfam00658) at the C-terminus. The protein is involved in the transport of mRNA's from the nucleus to the cytoplasm. There are four paralogs in Homo sapiens which are expressed in testis, platelets, broadly expressed and of unknown tissue range.


Pssm-ID: 130689 [Multi-domain]  Cd Length: 562  Bit Score: 44.41  E-value: 4.02e-04
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 334187096    63 QSPQPFPQQsPSYGAPQ---RGP--------------SPMSRPGPPAGMARPGGPPPVSQ---PAGFQSNVPLNRPTGPP 122
Cdd:TIGR01628  384 QLPMGSPMG-GAMGQPPyygQGPqqqfngqplgwprmSMMPTPMGPGGPLRPNGLAPMNAvraPSRNAQNAAQKPPMQPV 462
                           90       100       110       120       130
                   ....*....|....*....|....*....|....*....|....*....|
gi 334187096   123 SRQPSFGSRPSMPGGPVAQPAASSSGfpAFGPSGSVAAGPPPGSRPMAFG 172
Cdd:TIGR01628  463 MYPPNYQSLPLSQDLPQPQSTASQGG--QNKKLAQVLASATPQMQKQVLG 510
SepH NF040712
septation protein SepH; Septation protein H (SepH) was firstly characterized in Streptomyces ...
126-274 1.14e-03

septation protein SepH; Septation protein H (SepH) was firstly characterized in Streptomyces venezuelae, and homologs were identified in Mycobacterium smegmatis. SepH contains a N-terminal DUF3071 domain and a conserved C-terminal region. It binds directly to cell division protein FtsZ to stimulate the assembly of FtsZ protofilaments.


Pssm-ID: 468676 [Multi-domain]  Cd Length: 346  Bit Score: 42.45  E-value: 1.14e-03
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 334187096  126 PSFGsRPSMPGGPVAQPAASSSGFPAFGPSGSVAAGPPPGSRPmafgSPPPVGSGMSMPPSGmigGPVSNGHQMVGSGGF 205
Cdd:NF040712  190 PDFG-RPLRPLATVPRLAREPADARPEEVEPAPAAEGAPATDS----DPAEAGTPDDLASAR---RRRAGVEQPEDEPVG 261
                          90       100       110       120       130       140       150
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|..
gi 334187096  206 PRGTQFPGAAVTTPQAPYVRPPSAPYARTPPQPLGSHSLSGNPPLTPF---TAPSMPPPATFPGAPHGRPAV 274
Cdd:NF040712  262 PGAAPAAEPDEATRDAGEPPAPGAAETPEAAEPPAPAPAAPAAPAAPEaeePARPEPPPAPKPKRRRRRASV 333
 
Name Accession Description Interval E-value
COG5028 COG5028
Vesicle coat complex COPII, subunit SEC24/subunit SFB2/subunit SFB3 [Intracellular trafficking ...
273-1076 1.39e-176

Vesicle coat complex COPII, subunit SEC24/subunit SFB2/subunit SFB3 [Intracellular trafficking and secretion];


Pssm-ID: 227361 [Multi-domain]  Cd Length: 861  Bit Score: 538.99  E-value: 1.39e-176
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 334187096  273 AVSGLPYGPPSAQVAPPLGFPGQMQPPRYGMGPLPNQSMTNIP---TAMGQPGATVPGPSRIDP---------------- 333
Cdd:COG5028    18 TGAASSKKSARPHRAYANFSAGQMGMPPYTTPPLQQQSRRQIDqaaTAMHNTGANNPAPSVMSPafqsqqkfsspyggsm 97
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 334187096  334 --NQIPRPGSSSSPTVFETRQ----SNQANPP----PPATSDYVVRDTGNCSPRYMRCTINQIPCTVDLLSTSGMQLALM 403
Cdd:COG5028    98 adGTAPKPTNPLVPVDLFEDQpppiSDLFLPPppivPPLTTNFVGSEQSNCSPKYVRSTMYAIPETNDLLKKSKIPFGLV 177
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 334187096  404 VQPLALSHPSEEPIQVVDfgEGGPVRCSRCKGYINPFMKFIDQGRKFICNFCGYTDETPRDYHCNLGPDGRRRDVDERPE 483
Cdd:COG5028   178 IRPFLELYPEEDPVPLVE--DGSIVRCRRCRSYINPFVQFIEQGRKWRCNICRSKNDVPEGFDNPSGPNDPRSDRYSRPE 255
                         250       260       270       280       290       300       310       320
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 334187096  484 LCRGTVEFVATKEYMVRDPMPAVYFFLIDVSMNAIQTGATAAACNAIQQVLSDLPE-GPRTFVGIATFDSTIHFYNLKRA 562
Cdd:COG5028   256 LKSGVVDFLAPKEYSLRQPPPPVYVFLIDVSFEAIKNGLVKAAIRAILENLDQIPNfDPRTKIAIICFDSSLHFFKLSPD 335
                         330       340       350       360       370       380       390       400
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 334187096  563 LQQPlMLIVPDVQDVYTPLET-DVVVQLSECRQHLELLLDSIPTMFQESKIPESAFGAAVKAAFLAMKSKGGKLMVFQSI 641
Cdd:COG5028   336 LDEQ-MLIVSDLDEPFLPFPSgLFVLPLKSCKQIIETLLDRVPRIFQDNKSPKNALGPALKAAKSLIGGTGGKIIVFLST 414
                         410       420       430       440       450       460       470       480
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 334187096  642 LCSVGVGALSSReaegranmsagEKEAHKLLQPADKTLKTMAIEFAEYQVCVDIFITTQAYVDMASISVIPRTTGGQVYC 721
Cdd:COG5028   415 LPNMGIGKLQLR-----------EDKESSLLSCKDSFYKEFAIECSKVGISVDLFLTSEDYIDVATLSHLCRYTGGQTYF 483
                         490       500       510       520       530       540       550       560
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 334187096  722 YYPFSA--LSDPPKLYNDLKWNITRPQGFEAVMRVRCSQGIQVQEYSGNFCKRIPTDIDLPA------------HDDKLQ 787
Cdd:COG5028   484 YPNFSAtrPNDATKLANDLVSHLSMEIGYEAVMRVRCSTGLRVSSFYGNFFNRSSDLCAFSTmprdtsllvefsIDEKLM 563
                         570       580       590       600       610       620       630       640
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 334187096  788 dGAECAFQCALLYTTIYGERRIRVTTLSLSCTNMLSNLFRAADLDSQFACMLKQAANEIPSKALPLVKEQATNSCINALY 867
Cdd:COG5028   564 -TSDVYFQVALLYTLNDGERRIRVVNLSLPTSSSIREVYASADQLAIACILAKKASTKALNSSLKEARVLINKSMVDILK 642
                         650       660       670       680       690       700       710       720
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 334187096  868 AYRKFCATVTSSGQLILPEALKLFPLYTLALTKSVGLRTDG-RIDDRSFWINYVSSLSTPLAIPLVYPRMISVHDL---- 942
Cdd:COG5028   643 AYKKELVKSNTSTQLPLPANLKLLPLLMLALLKSSAFRSGStPSDIRISALNRLTSLPLKQLMRNIYPTLYALHDMpiea 722
                         730       740       750       760       770       780       790       800
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 334187096  943 DVKDTEGSVLPPPIPLSSEHISNEGVYFLENGEDGLLFVGESVDSDILQKLFAVSSAAEIPN-QFVLQQYDNQLSKKFND 1021
Cdd:COG5028   723 GLPDEGLLVLPSPINATSSLLESGGLYLIDTGQKIFLWFGKDAVPSLLQDLFGVDSLSDIPSgKFTLPPTGNEFNERVRN 802
                         810       820       830       840       850
                  ....*....|....*....|....*....|....*....|....*....|....*...
gi 334187096 1022 AVNEIRRQ-RCSYLRIKLCKKG-EPSGML-FLSYMVEDRTASGPSYVEFLVQVHRQIQ 1076
Cdd:COG5028   803 IIGELRSVnDDSTLPLVLVRGGgDPSLRLwFFSTLVEDKTLNIPSYLDYLQILHEKIK 860
Sec24-like cd01479
Sec24-like: Protein and membrane traffic in eukaryotes is mediated by at least in part by the ...
502-746 3.44e-115

Sec24-like: Protein and membrane traffic in eukaryotes is mediated by at least in part by the budding and fusion of intracellular transport vesicles that selectively carry cargo proteins and lipids from donor to acceptor organelles. The two main classes of vesicular carriers within the endocytic and the biosynthetic pathways are COP- and clathrin-coated vesicles. Formation of COPII vesicles requires the ordered assembly of the coat built from several cytosolic components GTPase Sar1, complexes of Sec23-Sec24 and Sec13-Sec31. The process is initiated by the conversion of GDP to GTP by the GTPase Sar1 which then recruits the heterodimeric complex of Sec23 and Sec24. This heterodimeric complex generates the pre-budding complex. The final step leading to membrane deformation and budding of COPII-coated vesicles is carried by the heterodimeric complex Sec13-Sec31. The members of this CD belong to the Sec23-like family. Sec 24 is very similar to Sec23. The Sec23 and Sec24 polypeptides fold into five distinct domains: a beta-barrel, a zinc finger, a vWA or trunk, an all helical region and a carboxy Gelsolin domain. The members of this subgroup carry a partial MIDAS motif and have the overall Para-Rossmann type fold that is characteristic of this superfamily.


Pssm-ID: 238756 [Multi-domain]  Cd Length: 244  Bit Score: 355.81  E-value: 3.44e-115
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 334187096  502 PMPAVYFFLIDVSMNAIQTGATAAACNAIQQVLSDLP-EGPRTFVGIATFDSTIHFYNLKRALQQPLMLIVPDVQDVYTP 580
Cdd:cd01479     1 PQPAVYVFLIDVSYNAIKSGLLATACEALLSNLDNLPgDDPRTRVGFITFDSTLHFFNLKSSLEQPQMMVVSDLDDPFLP 80
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 334187096  581 LETDVVVQLSECRQHLELLLDSIPTMFQESKIPESAFGAAVKAAFLAMKSKGGKLMVFQSILCSVGVGALSSREAEGraN 660
Cdd:cd01479    81 LPDGLLVNLKESRQVIEDLLDQIPEMFQDTKETESALGPALQAAFLLLKETGGKIIVFQSSLPTLGAGKLKSREDPK--L 158
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 334187096  661 MSAGEKEAHklLQPADKTLKTMAIEFAEYQVCVDIFITTQAYVDMASISVIPRTTGGQVYCY--YPFSALSDPPKLYNDL 738
Cdd:cd01479   159 LSTDKEKQL--LQPQTDFYKKLALECVKSQISVDLFLFSNQYVDVATLGCLSRLTGGQVYYYpsFNFSAPNDVEKLVNEL 236

                  ....*...
gi 334187096  739 KWNITRPQ 746
Cdd:cd01479   237 ARYLTRKI 244
Sec23_trunk pfam04811
Sec23/Sec24 trunk domain; COPII-coated vesicles carry proteins from the endoplasmic reticulum ...
502-742 2.30e-104

Sec23/Sec24 trunk domain; COPII-coated vesicles carry proteins from the endoplasmic reticulum to the Golgi complex. This vesicular transport can be reconstituted by using three cytosolic components containing five proteins: the small GTPase Sar1p, the Sec23p/24p complex, and the Sec13p/Sec31p complex. This domain is known as the trunk domain and has an alpha/beta vWA fold and forms the dimer interface.


Pssm-ID: 398467 [Multi-domain]  Cd Length: 241  Bit Score: 326.90  E-value: 2.30e-104
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 334187096   502 PMPAVYFFLIDVSMNAIQTGATAAACNAIQQVLSDLPEGPRTFVGIATFDSTIHFYNLKRALQQPLMLIVPDVQDVYTPL 581
Cdd:pfam04811    1 PQPPVFLFVIDVSYNAIKSGLLAALKESLLQSLDLLPGDPRARVGFITFDSTVHFFNLGSSLRQPQMLVVSDLQDMFLPL 80
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 334187096   582 ETDVVVQLSECRQHLELLLDSIPTMFQESKIPESAFGAAVKAAFLAMKS--KGGKLMVFQSILCSVGV-GALSSREAEgr 658
Cdd:pfam04811   81 PDRFLVPLSECRFVLEDLLEQLPPMFPVTKRPERCLGPALQAAFLLLKAafTGGKIMVFQGGLPTVGPgGKLKSRLDE-- 158
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 334187096   659 aNMSAGEKEAHKLLQPADKTLKTMAIEFAEYQVCVDIFITTQAYVDMASISVIPRTTGGQVYCYYPFSALSDPPKLYNDL 738
Cdd:pfam04811  159 -SHHGTDKEKAKLVKKADKFYKSLAKECVKQGHSVDLFAFSLDYVDVATLGQLSRLTGGQVYLYPSFQADVDGSKFKQDL 237

                   ....
gi 334187096   739 KWNI 742
Cdd:pfam04811  238 QRYF 241
trunk_domain cd01468
trunk domain. COPII-coated vesicles carry proteins from the endoplasmic reticulum to the Golgi ...
502-740 1.19e-93

trunk domain. COPII-coated vesicles carry proteins from the endoplasmic reticulum to the Golgi complex. This vesicular transport can be reconstituted by using three cytosolic components containing five proteins: the small GTPase Sar1p, the Sec23p/24p complex, and the Sec13p/Sec31p complex. This domain is known as the trunk domain and has an alpha/beta vWA fold and forms the dimer interface. Some members of this family possess a partial MIDAS motif that is a characteristic feature of most vWA domain proteins.


Pssm-ID: 238745 [Multi-domain]  Cd Length: 239  Bit Score: 298.00  E-value: 1.19e-93
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 334187096  502 PMPAVYFFLIDVSMNAIQTGATAAACNAIQQVLSDLPEGPRTFVGIATFDSTIHFYNLKRALQQPLMLIVPDVQDVYTPL 581
Cdd:cd01468     1 PQPPVFVFVIDVSYEAIKEGLLQALKESLLASLDLLPGDPRARVGLITYDSTVHFYNLSSDLAQPKMYVVSDLKDVFLPL 80
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 334187096  582 ETDVVVQLSECRQHLELLLDSIPTMFQE--SKIPESAFGAAVKAAFLAMKSK--GGKLMVFQSILCSVGVGALSSREAEG 657
Cdd:cd01468    81 PDRFLVPLSECKKVIHDLLEQLPPMFWPvpTHRPERCLGPALQAAFLLLKGTfaGGRIIVFQGGLPTVGPGKLKSREDKE 160
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 334187096  658 RANMSagekEAHKLLQPADKTLKTMAIEFAEYQVCVDIFITTQAYVDMASISVIPRTTGGQVYCYYPFSALSDPPKLYND 737
Cdd:cd01468   161 PIRSH----DEAQLLKPATKFYKSLAKECVKSGICVDLFAFSLDYVDVATLKQLAKSTGGQVYLYDSFQAPNDGSKFKQD 236

                  ...
gi 334187096  738 LKW 740
Cdd:cd01468   237 LQR 239
PTZ00395 PTZ00395
Sec24-related protein; Provisional
499-1080 4.57e-43

Sec24-related protein; Provisional


Pssm-ID: 185594 [Multi-domain]  Cd Length: 1560  Bit Score: 171.80  E-value: 4.57e-43
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 334187096  499 VRDPMPAVYFFLIDVSMNAIQTGATAAACNAIQQVLSDLpEGPRTFVGIATFDSTIHFYNLKRALQQPL----------- 567
Cdd:PTZ00395  947 VKNMLPPYFVFVVECSYNAIYNNITYTILEGIRYAVQNV-KCPQTKIAIITFNSSIYFYHCKGGKGVSGeegdggggsgn 1025
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 334187096  568 --MLIVPDVQDVYTPLE-TDVVVQLSECRQHLELLLDSIPTMFQESKIPESAFGAAVKAAFLAMKSKGG--KLMVFQSIL 642
Cdd:PTZ00395 1026 hqVIVMSDVDDPFLPLPlEDLFFGCVEEIDKINTLIDTIKSVSTTMQSYGSCGNSALKIAMDMLKERNGlgSICMFYTTT 1105
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 334187096  643 CSVGVGALssreaegranmsageKEAHKLLQPADKTLK------TMAIEFAEYQVCVDIFI--TTQAYVDMASISVIPRT 714
Cdd:PTZ00395 1106 PNCGIGAI---------------KELKKDLQENFLEVKqkifydSLLLDLYAFNISVDIFIisSNNVRVCVPSLQYVAQN 1170
                         250       260       270       280       290       300       310       320
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 334187096  715 TGGQVYCYYPFSALSDPPKLYNDLKWNITRPQ-GFEAVMRVRCSQGIQVQEY---SGNFCKRIPTD-IDLPA--HD---- 783
Cdd:PTZ00395 1171 TGGKILFVENFLWQKDYKEIYMNIMDTLTSEDiAYCCELKLRYSHHMSVKKLfccNNNFNSIISVDtIKIPKirHDqtfa 1250
                         330       340       350       360       370       380       390       400
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 334187096  784 ------DKLQDGAECAFQCALLYTTIYGERRIRVTTLSLSCTNMLSNLFRAADLDSQFACMLKQAANEIpskalpLVKEQ 857
Cdd:PTZ00395 1251 fllnysDISESKKQIYFQCACIYTNLWGDRFVRLHTTHMNLTSSLSTVFRYTDAEALMNILIKQLCTNI------LHNDN 1324
                         410       420       430       440       450       460       470       480
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 334187096  858 ATNSCINA----LYAYRKFCATVTSSGQLILPEALKLFPLYTLALTKSVGLRTDGRIDDRSFwiNYVSSLSTPLAIPL-- 931
Cdd:PTZ00395 1325 YSKIIIDNlaaiLFSYRINCASSAHSGQLILPDTLKLLPLFTSSLLKHNVTKKEILHDLKVY--SLIKLLSMPIISSLly 1402
                         490       500       510       520       530       540       550       560
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 334187096  932 VYPRMISVH------DLDVKDTEGSV-LPPPIPLSSEHISNEGVYFLENGEDGLLFVGESVDSDilqklFAVSSAAEIPN 1004
Cdd:PTZ00395 1403 VYPVMYVIHikgktnEIDSMDVDDDLfIPKTIPSSAEKIYSNGIYLLDACTHFYLYFGFHSDAN-----FAKEIVGDIPT 1477
                         570       580       590       600       610       620       630       640
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 334187096 1005 QfvlqqyDNQLSKKFNDAVNEIRRQRC--SYLRIKLCKKGEPSGML---------FLSYMVEDRTASGPSYVEFLVQVHR 1073
Cdd:PTZ00395 1478 E------KNAHELNLTDTPNAQKVQRIikNLSRIHHFNKYVPLVMVapksneeehLISLCVEDKADKEYSYVNFLCFIHK 1551

                  ....*..
gi 334187096 1074 QIQLKMN 1080
Cdd:PTZ00395 1552 LVHKRID 1558
Sec23_helical pfam04815
Sec23/Sec24 helical domain; COPII-coated vesicles carry proteins from the endoplasmic ...
830-930 6.32e-29

Sec23/Sec24 helical domain; COPII-coated vesicles carry proteins from the endoplasmic reticulum to the Golgi complex. This vesicular transport can be reconstituted by using three cytosolic components containing five proteins: the small GTPase Sar1p, the Sec23p/24p complex, and the Sec13p/Sec31p complex. This domain is composed of five alpha helices.


Pssm-ID: 461441 [Multi-domain]  Cd Length: 103  Bit Score: 111.44  E-value: 6.32e-29
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 334187096   830 DLDSQFACMLKQAANEIPSKALPLVKEQATNSCINALYAYRKFCATVTSSGQLILPEALKLFPLYTLALTKSVGLRTDG- 908
Cdd:pfam04815    1 DQEAIAVLLAKKAVEKALSSSLSDAREALDNKLVDILAAYRKYCASSSSPGQLILPESLKLLPLYMLALLKSPALRGGNs 80
                           90       100
                   ....*....|....*....|...
gi 334187096   909 -RIDDRSFWINYVSSLSTPLAIP 930
Cdd:pfam04815   81 sPSDERAYARHLLLSLPVEELLL 103
Sec23_BS pfam08033
Sec23/Sec24 beta-sandwich domain;
747-819 1.61e-23

Sec23/Sec24 beta-sandwich domain;


Pssm-ID: 429794 [Multi-domain]  Cd Length: 86  Bit Score: 95.30  E-value: 1.61e-23
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 334187096   747 GFEAVMRVRCSQGIQVQEYSGNFCKRIPTD-IDLPA------------HDDKLQDGAECAFQCALLYTTIYGERRIRVTT 813
Cdd:pfam08033    1 GFNAVLRVRTSKGLKVSGFIGNFVSRSSGDtWKLPSldpdtsyafefdIDEPLPNGSNAYIQFALLYTHSSGERRIRVTT 80

                   ....*.
gi 334187096   814 LSLSCT 819
Cdd:pfam08033   81 VALPVT 86
PHA03247 PHA03247
large tegument protein UL36; Provisional
65-345 1.12e-18

large tegument protein UL36; Provisional


Pssm-ID: 223021 [Multi-domain]  Cd Length: 3151  Bit Score: 92.69  E-value: 1.12e-18
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 334187096   65 PQPFPQQSPSYGAPQRGPSPMSRPGPPAGMARPGGPPPVSQPAgfQSNVPLNRPTGPPSRQPsfgsRPSMPGGPVAQPAA 144
Cdd:PHA03247 2725 PAAARQASPALPAAPAPPAVPAGPATPGGPARPARPPTTAGPP--APAPPAAPAAGPPRRLT----RPAVASLSESRESL 2798
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 334187096  145 SSSGFPAFGPSGSVAAGP--PPGSRPmAFGSPPPVGSGMSMP--PSGMIGGPVSNGHQMVGSGGFPR--GTQFPGAAVTT 218
Cdd:PHA03247 2799 PSPWDPADPPAAVLAPAAalPPAASP-AGPLPPPTSAQPTAPppPPGPPPPSLPLGGSVAPGGDVRRrpPSRSPAAKPAA 2877
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 334187096  219 PQAPYVRPPSAPYARTPPQPLGSHSLSGNPPLTPFTAPSMPPPATFPGAPHGRPAVSglPYGPPSAQVAPPLGFPGQMQP 298
Cdd:PHA03247 2878 PARPPVRRLARPAVSRSTESFALPPDQPERPPQPQAPPPPQPQPQPPPPPQPQPPPP--PPPRPQPPLAPTTDPAGAGEP 2955
                         250       260       270       280       290
                  ....*....|....*....|....*....|....*....|....*....|..
gi 334187096  299 PryGMGPLPnqsmtniptamgQPGATVPGP-----SRIDPNQIPRPGSSSSP 345
Cdd:PHA03247 2956 S--GAVPQP------------WLGALVPGRvavprFRVPQPAPSREAPASST 2993
PHA03247 PHA03247
large tegument protein UL36; Provisional
3-365 1.99e-18

large tegument protein UL36; Provisional


Pssm-ID: 223021 [Multi-domain]  Cd Length: 3151  Bit Score: 91.92  E-value: 1.99e-18
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 334187096    3 APVPPGAPRPNsqqnsgPPNFYPGSQGNSNALADNMQNLSLNRPPPMMPGSGPRPPPPFGQSPQPFPQQSPsygaPQRGP 82
Cdd:PHA03247 2615 SPLPPDTHAPD------PPPPSPSPAANEPDPHPPPTVPPPERPRDDPAPGRVSRPRRARRLGRAAQASSP----PQRPR 2684
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 334187096   83 SPMSRP--GPPAGMARPGGPPPVSQPAGFQSNVPLNRPTGPPSRQPSFGSRPSMPGGPVAQPAASSSGFPAFGPSGSVAA 160
Cdd:PHA03247 2685 RRAARPtvGSLTSLADPPPPPPTPEPAPHALVSATPLPPGPAAARQASPALPAAPAPPAVPAGPATPGGPARPARPPTTA 2764
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 334187096  161 GPPpgsrpmafgSPPPVGSGMSMPPSGMIGGPVSNGHQMVGSGGFPRGTQFPGAAVTTPqAPYVRPPSAPYARTPPQPLG 240
Cdd:PHA03247 2765 GPP---------APAPPAAPAAGPPRRLTRPAVASLSESRESLPSPWDPADPPAAVLAP-AAALPPAASPAGPLPPPTSA 2834
                         250       260       270       280       290       300       310       320
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 334187096  241 SHSLSGNPPltPFTAPSMPPP-ATFPGAPHGRPAVSGLPYGPPSAQVAPP---LGFPGQMQPPRYGMGPLPNQSMTNIPT 316
Cdd:PHA03247 2835 QPTAPPPPP--GPPPPSLPLGgSVAPGGDVRRRPPSRSPAAKPAAPARPPvrrLARPAVSRSTESFALPPDQPERPPQPQ 2912
                         330       340       350       360
                  ....*....|....*....|....*....|....*....|....*....
gi 334187096  317 AMGQPGATVPGPSRIDPNQIPRPGSSSSPTVFETRQSNQANPPPPATSD 365
Cdd:PHA03247 2913 APPPPQPQPQPPPPPQPQPPPPPPPRPQPPLAPTTDPAGAGEPSGAVPQ 2961
PHA03247 PHA03247
large tegument protein UL36; Provisional
4-364 9.47e-17

large tegument protein UL36; Provisional


Pssm-ID: 223021 [Multi-domain]  Cd Length: 3151  Bit Score: 86.15  E-value: 9.47e-17
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 334187096    4 PVPPGAPRPNSQQNSGPPNFYPGSQGNSnaladnmqnlSLNRPPPMMPGSGPRPPPPFGQSPQPFPQQSPSYGAPQRGPS 83
Cdd:PHA03247 2554 PLPPAAPPAAPDRSVPPPRPAPRPSEPA----------VTSRARRPDAPPQSARPRAPVDDRGDPRGPAPPSPLPPDTHA 2623
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 334187096   84 PmsRPGPPAGMARPGGPPPVSQPAGFQSNVPLNRPTGPPSRQPSFGSRPSMPGGPVAQPAA--SSSGFPAFGPSGSVAAG 161
Cdd:PHA03247 2624 P--DPPPPSPSPAANEPDPHPPPTVPPPERPRDDPAPGRVSRPRRARRLGRAAQASSPPQRprRRAARPTVGSLTSLADP 2701
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 334187096  162 PPPGSRPMAfgSPPPVGSGMSMPPsgmigGPVSNGhqmvGSGGFPRGTQFPGAAVTTPQAPY--VRPPSAPYARTPPQPl 239
Cdd:PHA03247 2702 PPPPPTPEP--APHALVSATPLPP-----GPAAAR----QASPALPAAPAPPAVPAGPATPGgpARPARPPTTAGPPAP- 2769
                         250       260       270       280       290       300       310       320
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 334187096  240 gshslsgNPPLTPFTAPsmPPPATFPGAPHGRPAVSGLPYGPPSAQVAPPLGFPGQMQPPRYGMGPLPNQSMTNIPTAMG 319
Cdd:PHA03247 2770 -------APPAAPAAGP--PRRLTRPAVASLSESRESLPSPWDPADPPAAVLAPAAALPPAASPAGPLPPPTSAQPTAPP 2840
                         330       340       350       360
                  ....*....|....*....|....*....|....*....|....*
gi 334187096  320 QPgatvPGPSridPNQIPRPGSSSSPTVFETRQSNQANPPPPATS 364
Cdd:PHA03247 2841 PP----PGPP---PPSLPLGGSVAPGGDVRRRPPSRSPAAKPAAP 2878
PHA03247 PHA03247
large tegument protein UL36; Provisional
65-364 2.65e-16

large tegument protein UL36; Provisional


Pssm-ID: 223021 [Multi-domain]  Cd Length: 3151  Bit Score: 84.60  E-value: 2.65e-16
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 334187096   65 PQPFPQQSPSYGAPQRGPSPMSRPGPP-----AGMARPGGPPPVSQPagfqsNVPLNrPTGPPSRQPSfgSRPSMPGGPV 139
Cdd:PHA03247 2552 PPPLPPAAPPAAPDRSVPPPRPAPRPSepavtSRARRPDAPPQSARP-----RAPVD-DRGDPRGPAP--PSPLPPDTHA 2623
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 334187096  140 AQPAASSSGFPAFGPSGSVAAGPPPGSRPMAFGSPPPVGSGMSMPPSGMIGGPVSNGHqmvgsGGFPRGTQFPGAAVTT- 218
Cdd:PHA03247 2624 PDPPPPSPSPAANEPDPHPPPTVPPPERPRDDPAPGRVSRPRRARRLGRAAQASSPPQ-----RPRRRAARPTVGSLTSl 2698
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 334187096  219 ------PQAPYVRPPsaPYARTPPQPLGSHSLSGNPPLTPfTAPSMPPPATFPGAPHG--RPAVSGLPYGPPSAqvAPPL 290
Cdd:PHA03247 2699 adppppPPTPEPAPH--ALVSATPLPPGPAAARQASPALP-AAPAPPAVPAGPATPGGpaRPARPPTTAGPPAP--APPA 2773
                         250       260       270       280       290       300       310
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*...
gi 334187096  291 GFPGQMQP--PRYGMGPL-PNQSMTNIPTAMGQPGATVPGP-SRIDPNQIPRPGSSSSPTVFETRQSNQANPPPPATS 364
Cdd:PHA03247 2774 APAAGPPRrlTRPAVASLsESRESLPSPWDPADPPAAVLAPaAALPPAASPAGPLPPPTSAQPTAPPPPPGPPPPSLP 2851
zf-Sec23_Sec24 pfam04810
Sec23/Sec24 zinc finger; COPII-coated vesicles carry proteins from the endoplasmic reticulum ...
427-464 1.79e-15

Sec23/Sec24 zinc finger; COPII-coated vesicles carry proteins from the endoplasmic reticulum to the Golgi complex. This vesicular transport can be reconstituted by using three cytosolic components containing five proteins: the small GTPase Sar1p, the Sec23p/24p complex, and the Sec13p/Sec31p complex. This domain is found to be zinc binding domain.


Pssm-ID: 461437 [Multi-domain]  Cd Length: 38  Bit Score: 70.94  E-value: 1.79e-15
                           10        20        30
                   ....*....|....*....|....*....|....*...
gi 334187096   427 PVRCSRCKGYINPFMKFIDQGRKFICNFCGYTDETPRD 464
Cdd:pfam04810    1 PVRCRRCRAYLNPFCQFDFGGKKWTCNFCGTRNPVPPE 38
PLN00162 PLN00162
transport protein sec23; Provisional
380-719 2.96e-12

transport protein sec23; Provisional


Pssm-ID: 215083 [Multi-domain]  Cd Length: 761  Bit Score: 71.12  E-value: 2.96e-12
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 334187096  380 RCTINQIPCTVDLLSTSGMQLALMVQPLalSHPSEEPIQVVDfgeggPVRCSRCKGYINPFMKFIDQGRKFICNFCGYTD 459
Cdd:PLN00162   13 RMSWNVWPSSKIEASKCVIPLAALYTPL--KPLPELPVLPYD-----PLRCRTCRAVLNPYCRVDFQAKIWICPFCFQRN 85
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 334187096  460 ETPRDYHC----NLgPdgrrrdvderPEL--CRGTVEFVATKEyMVRDPMPAVYFFLIDVSMNAIQTGataAACNAIQQV 533
Cdd:PLN00162   86 HFPPHYSSisetNL-P----------AELfpQYTTVEYTLPPG-SGGAPSPPVFVFVVDTCMIEEELG---ALKSALLQA 150
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 334187096  534 LSDLPEGprTFVGIATFDSTIHFYNL----------------------------KRALQQPLMLIVPDVQDvytPLETDV 585
Cdd:PLN00162  151 IALLPEN--ALVGLITFGTHVHVHELgfsecsksyvfrgnkevskdqileqlglGGKKRRPAGGGIAGARD---GLSSSG 225
                         250       260       270       280       290       300       310       320
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 334187096  586 V----VQLSECRQHLELLLDSI---PTMFQESKIPESAFGAA--VKAAFLA--MKSKGGKLMVFQSILCSVGVGALSSRE 654
Cdd:PLN00162  226 VnrflLPASECEFTLNSALEELqkdPWPVPPGHRPARCTGAAlsVAAGLLGacVPGTGARIMAFVGGPCTEGPGAIVSKD 305
                         330       340       350       360       370       380       390
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|..
gi 334187096  655 aegranMSAG-------EKEAHKLLQPADKTLKTMAIEFAEYQVCVDIFITTQAYVDMASISVIPRTTGGQV 719
Cdd:PLN00162  306 ------LSEPirshkdlDKDAAPYYKKAVKFYEGLAKQLVAQGHVLDVFACSLDQVGVAEMKVAVERTGGLV 371
PRK07764 PRK07764
DNA polymerase III subunits gamma and tau; Validated
87-299 5.66e-12

DNA polymerase III subunits gamma and tau; Validated


Pssm-ID: 236090 [Multi-domain]  Cd Length: 824  Bit Score: 70.01  E-value: 5.66e-12
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 334187096   87 RPGPPAGMARPGGPPPvsqPAGFQSNVPLNRPTGPPsrQPSFGSRPSmPGGPVAQPAASSSGFPAFGPSGSVAAGPPPGS 166
Cdd:PRK07764  587 VVGPAPGAAGGEGPPA---PASSGPPEEAARPAAPA--APAAPAAPA-PAGAAAAPAEASAAPAPGVAAPEHHPKHVAVP 660
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 334187096  167 RPMAFGSPPPVGSGMSMPpsgmiggpvsnghqmVGSGGFPRGTQFPGAAVTTPQAPYVRPPSAPYARTPPQPLGSHSLSG 246
Cdd:PRK07764  661 DASDGGDGWPAKAGGAAP---------------AAPPPAPAPAAPAAPAGAAPAQPAPAPAATPPAGQADDPAAQPPQAA 725
                         170       180       190       200       210
                  ....*....|....*....|....*....|....*....|....*....|...
gi 334187096  247 NPPLTPFTAPSMPPPAtfPGAPHGRPAVSGLPYGPPSAQVAPPLGFPGQMQPP 299
Cdd:PRK07764  726 QGASAPSPAADDPVPL--PPEPDDPPDPAGAPAQPPPPPAPAPAAAPAAAPPP 776
Atrophin-1 pfam03154
Atrophin-1 family; Atrophin-1 is the protein product of the dentatorubral-pallidoluysian ...
63-367 2.90e-11

Atrophin-1 family; Atrophin-1 is the protein product of the dentatorubral-pallidoluysian atrophy (DRPLA) gene. DRPLA OMIM:125370 is a progressive neurodegenerative disorder. It is caused by the expansion of a CAG repeat in the DRPLA gene on chromosome 12p. This results in an extended polyglutamine region in atrophin-1, that is thought to confer toxicity to the protein, possibly through altering its interactions with other proteins. The expansion of a CAG repeat is also the underlying defect in six other neurodegenerative disorders, including Huntington's disease. One interaction of expanded polyglutamine repeats that is thought to be pathogenic is that with the short glutamine repeat in the transcriptional coactivator CREB binding protein, CBP. This interaction draws CBP away from its usual nuclear location to the expanded polyglutamine repeat protein aggregates that are characteriztic of the polyglutamine neurodegenerative disorders. This interferes with CBP-mediated transcription and causes cytotoxicity.


Pssm-ID: 460830 [Multi-domain]  Cd Length: 991  Bit Score: 67.87  E-value: 2.90e-11
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 334187096    63 QSPQPFPQQSPSYGAPQRGPSPMSRPGPPAGMarpggPPPVSQPAGFQSNVPLNRPTGPPSRQPSfgsrPSMPGGPVAQP 142
Cdd:pfam03154  185 SPPPPGTTQAATAGPTPSAPSVPPQGSPATSQ-----PPNQTQSTAAPHTLIQQTPTLHPQRLPS----PHPPLQPMTQP 255
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 334187096   143 AASSSGFPAFGPSGSVAAGPPPGSRPMAFGsppPVGSGMSMPPSGMIGGPVSNGHQmvgsGGFPRGTQFPGAAVTTPQAP 222
Cdd:pfam03154  256 PPPSQVSPQPLPQPSLHGQMPPMPHSLQTG---PSHMQHPVPPQPFPLTPQSSQSQ----VPPGPSPAAPGQSQQRIHTP 328
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 334187096   223 yvRPPSAPYARTPP--QPLgshslsgnpPLTPFTAPSMPPPATFPGAPHGRPAVSGlpyGPPSAQVAPPLGFPGQMQPPR 300
Cdd:pfam03154  329 --PSQSQLQSQQPPreQPL---------PPAPLSMPHIKPPPTTPIPQLPNPQSHK---HPPHLSGPSPFQMNSNLPPPP 394
                          250       260       270       280       290       300
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*..
gi 334187096   301 yGMGPLPNQSMTNIPTAMGQPGATVPGPSRIDPNQIPRPGSSSSPTVfetrQSNQANPPPPATSDYV 367
Cdd:pfam03154  395 -ALKPLSSLSTHHPPSAHPPPLQLMPQSQQLPPPPAQPPVLTQSQSL----PPPAASHPPTSGLHQV 456
SEC23 COG5047
Vesicle coat complex COPII, subunit SEC23 [Intracellular trafficking and secretion];
379-943 3.71e-11

Vesicle coat complex COPII, subunit SEC23 [Intracellular trafficking and secretion];


Pssm-ID: 227380 [Multi-domain]  Cd Length: 755  Bit Score: 67.21  E-value: 3.71e-11
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 334187096  379 MRCTINQIPCTVDLLSTSGMQLALMVQPLalsHPSEEpiqvVDFGEGGPVRC-SRCKGYINPFMKfIDQGRKF-ICNFCG 456
Cdd:COG5047    12 IRLTWNVFPATRGDATRTVIPIACLYTPL---HEDDA----LTVNYYEPVKCtAPCKAVLNPYCH-IDERNQSwICPFCN 83
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 334187096  457 YTDETPRDYHcNLGPDgrrrdvDERPELCR--GTVEFVATKEYMVrdpmPAVYFFLIDVsmnAIQTGATAAACNAIQQVL 534
Cdd:COG5047    84 QRNTLPPQYR-DISNA------NLPLELLPqsSTIEYTLSKPVIL----PPVFFFVVDA---CCDEEELTALKDSLIVSL 149
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 334187096  535 SDLPEGPrtFVGIATFDSTIHFYNL------------------KRALQQPLMLIVPDV----------QDVYTPLETDVV 586
Cdd:COG5047   150 SLLPPEA--LVGLITYGTSIQVHELnaenhrrsyvfsgnkeytKENLQELLALSKPTKsggfeskisgIGQFASSRFLLP 227
                         250       260       270       280       290       300       310       320
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 334187096  587 VQlsECRQHLELLLDSI-PTMFQ--ESKIPESAFGAAVKAAFLAM----KSKGGKLMVFQSILCSVGVGALSSRE-AEGR 658
Cdd:COG5047   228 TQ--QCEFKLLNILEQLqPDPWPvpAGKRPLRCTGSALNIASSLLeqcfPNAGCHIVLFAGGPCTVGPGTVVSTElKEPM 305
                         330       340       350       360       370       380       390       400
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 334187096  659 ANMSAGEKEAHKLLQPADKTLKTMAIEFAEYQVCVDIFITTQAYVDMASISVIPRTTGGQVYCYYPFSA---LSDPPKLY 735
Cdd:COG5047   306 RSHHDIESDSAQHSKKATKFYKGLAERVANQGHALDIFAGCLDQIGIMEMEPLTTSTGGALVLSDSFTTsifKQSFQRIF 385
                         410       420       430       440       450       460       470       480
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 334187096  736 NDLKWNITRpQGFEAVMRVRCSQGIQVQEYSGN---FCKR-----------------------------IPTDIDLPAHD 783
Cdd:COG5047   386 NRDSEGYLK-MGFNANMEVKTSKNLKIKGLIGHavsVKKKannisdseigigatnswkmaslspksnyaLYFEIALGAAS 464
                         490       500       510       520       530       540       550       560
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 334187096  784 DKLQDGAECAFQCALLYTTIYGERRIRVTTLSLSCTNM-LSNLFRAADLDSQFACMLKQAANEIPSKALPLVKEQATNSC 862
Cdd:COG5047   465 GSAQRPAEAYIQFITTYQHSSGTYRIRVTTVARMFTDGgLPKINRSFDQEAAAVFMARIAAFKAETEDIIDVFRWIDRNL 544
                         570       580       590       600       610       620       630       640
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 334187096  863 INALYAYRKFCATVTSSgqLILPEALKLFPLYTLALTKSVGLRT-DGRIDDRSFWINYVSSLSTPLAIPLVYPRMISVHD 941
Cdd:COG5047   545 IRLCQKFADYRKDDPSS--FRLDPNFTLYPQFMYHLRRSPFLSVfNNSPDETAFYRHMLNNADVNDSLIMIQPTLQSYSF 622

                  ..
gi 334187096  942 LD 943
Cdd:COG5047   623 EK 624
PHA03378 PHA03378
EBNA-3B; Provisional
77-362 5.44e-11

EBNA-3B; Provisional


Pssm-ID: 223065 [Multi-domain]  Cd Length: 991  Bit Score: 67.01  E-value: 5.44e-11
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 334187096   77 APQRGPSPMSRPGPPAGMARPGGPPPVSQPAGFQSNVPLNRPTGPPSRQPSFGSRP-------SMPGGPVAQPAASSSGF 149
Cdd:PHA03378  516 MEQRVMATLLPPSPPQPRAGRRAPCVYTEDLDIESDEPASTEPVHDQLLPAPGLGPlqiqpltSPTTSQLASSAPSYAQT 595
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 334187096  150 PAFGPSGSVAAGPPPG-SRPMAFGSPppvgSGMSMPPSGMIGGPVSNGHQMVGSGGFPRGTQFPGAAVTTPQAPYVRPPS 228
Cdd:PHA03378  596 PWPVPHPSQTPEPPTTqSHIPETSAP----RQWPMPLRPIPMRPLRMQPITFNVLVFPTPHQPPQVEITPYKPTWTQIGH 671
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 334187096  229 APYARTPPQPLGSHSLSGNPPL--TPFTAPS-MPPPATFPGaPHGRPAVSGLPYGPPSA---QVAPPLGFPGQMQPPRYG 302
Cdd:PHA03378  672 IPYQPSPTGANTMLPIQWAPGTmqPPPRAPTpMRPPAAPPG-RAQRPAAATGRARPPAAapgRARPPAAAPGRARPPAAA 750
                         250       260       270       280       290       300
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*
gi 334187096  303 MGPL--PNQSMTNIPTAMGQPGATVPGP---SRIDPNQIPRPGSSSSPTVFETRQSNQANPPPPA 362
Cdd:PHA03378  751 PGRArpPAAAPGRARPPAAAPGAPTPQPppqAPPAPQQRPRGAPTPQPPPQAGPTSMQLMPRAAP 815
Gelsolin pfam00626
Gelsolin repeat;
949-1014 7.13e-11

Gelsolin repeat;


Pssm-ID: 395501 [Multi-domain]  Cd Length: 76  Bit Score: 59.24  E-value: 7.13e-11
                           10        20        30        40        50        60
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*...
gi 334187096   949 GSVLPPPIPLSSEHISNEGVYFLENGEDGLLFVGEsvDSDILQKLFAVSSAAEIPNQ--FVLQQYDNQ 1014
Cdd:pfam00626    1 KFVLPPPVPLSQESLNSGDCYLLDNGFTIFLWVGK--GSSLLEKLFAALLAAQLDDDerFPLPEVIRV 66
Med15 pfam09606
ARC105 or Med15 subunit of Mediator complex non-fungal; The approx. 70 residue Med15 domain of ...
65-359 7.20e-11

ARC105 or Med15 subunit of Mediator complex non-fungal; The approx. 70 residue Med15 domain of the ARC-Mediator co-activator is a three-helix bundle with marked similarity to the KIX domain. The sterol regulatory element binding protein (SREBP) family of transcription activators use the ARC105 subunit to activate target genes in the regulation of cholesterol and fatty acid homeostasis. In addition, Med15 is a critical transducer of gene activation signals that control early metazoan development.


Pssm-ID: 312941 [Multi-domain]  Cd Length: 732  Bit Score: 66.57  E-value: 7.20e-11
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 334187096    65 PQPFPQQSPSYGAPQRGPSPMSRPGPPAGMARPGGPPPVSQPAGFQSNVPLNRPTGPPSRQPSFGSRPSM--------PG 136
Cdd:pfam09606   73 GGGQQGMPDPINALQNLAGQGTRPQMMGPMGPGPGGPMGQQMGGPGTASNLLASLGRPQMPMGGAGFPSQmsrvgrmqPG 152
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 334187096   137 GPVAQPAASSSGFPAFGPSGSVAAGPPPGsRPMAFGSPPPVGSGMSMPPSGMIGGPVSNGHQMVGSGGfprGTQFPGAAV 216
Cdd:pfam09606  153 GQAGGMMQPSSGQPGSGTPNQMGPNGGPG-QGQAGGMNGGQQGPMGGQMPPQMGVPGMPGPADAGAQM---GQQAQANGG 228
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 334187096   217 TTPQAPYVRPPSAPYARTPPQPLGSHS---LSGNPPLTPFTAPSMPPPATFPGAPHGRPAVSGLPYGPPSAQVAPPLGFP 293
Cdd:pfam09606  229 MNPQQMGGAPNQVAMQQQQPQQQGQQSqlgMGINQMQQMPQGVGGGAGQGGPGQPMGPPGQQPGAMPNVMSIGDQNNYQQ 308
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 334187096   294 GQMQPPRYGM-GPLPNQSMTNIPTAMGQPGATVP---------------GPSRIDPNQIPRPGSSSSPTVFETRQSNQAN 357
Cdd:pfam09606  309 QQTRQQQQQQgGNHPAAHQQQMNQSVGQGGQVVAlgglnhletwnpgnfGGLGANPMQRGQPGMMSSPSPVPGQQVRQVT 388

                   ..
gi 334187096   358 PP 359
Cdd:pfam09606  389 PN 390
PHA03247 PHA03247
large tegument protein UL36; Provisional
3-307 7.46e-11

large tegument protein UL36; Provisional


Pssm-ID: 223021 [Multi-domain]  Cd Length: 3151  Bit Score: 66.89  E-value: 7.46e-11
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 334187096    3 APVPPGAPRPNSQQNSGPPNFYPGSQGNSNALADNMQNLSLNRPPPMMPGSGPRPPPPFGQSPQPF--PQQSPSYGAPQR 80
Cdd:PHA03247 2762 TTAGPPAPAPPAAPAAGPPRRLTRPAVASLSESRESLPSPWDPADPPAAVLAPAAALPPAASPAGPlpPPTSAQPTAPPP 2841
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 334187096   81 GPSPMSRPGPPAGMARPGGPPPVSQPAGFQSNVPLNRPTGPPSRQPsfgsRPSMPGGPVAQPAASSSGFPAFGPSGSVAA 160
Cdd:PHA03247 2842 PPGPPPPSLPLGGSVAPGGDVRRRPPSRSPAAKPAAPARPPVRRLA----RPAVSRSTESFALPPDQPERPPQPQAPPPP 2917
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 334187096  161 GPPPGSRPMAFGSPPPVGSGMSMPPSGMIGGPVSNGHQMVGSGGFPRGTQFPG-AAVTTPQAPYVRpPSAPYARTPPQPL 239
Cdd:PHA03247 2918 QPQPQPPPPPQPQPPPPPPPRPQPPLAPTTDPAGAGEPSGAVPQPWLGALVPGrVAVPRFRVPQPA-PSREAPASSTPPL 2996
                         250       260       270       280       290       300       310       320
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 334187096  240 GSHSLSGNPPLTPFTA---PSMPPPATF------PGAPHGRPAVSGL------------------PYGPPSAQVAPPLGF 292
Cdd:PHA03247 2997 TGHSLSRVSSWASSLAlheETDPPPVSLkqtlwpPDDTEDSDADSLFdsdsersdlealdplppePHDPFAHEPDPATPE 3076
                         330
                  ....*....|....*
gi 334187096  293 PGQMQPPRYGMGPLP 307
Cdd:PHA03247 3077 AGARESPSSQFGPPP 3091
PRK07764 PRK07764
DNA polymerase III subunits gamma and tau; Validated
63-362 4.44e-10

DNA polymerase III subunits gamma and tau; Validated


Pssm-ID: 236090 [Multi-domain]  Cd Length: 824  Bit Score: 63.85  E-value: 4.44e-10
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 334187096   63 QSPQPFPQQSPSYGAPQRGPSPMSRPGPPAGMARPGGPPPVSQPAGFQSNVPLNRPTGPPSrqpsfGSRPSMPGGPVAQP 142
Cdd:PRK07764  398 APSAAAAAPAAAPAPAAAAPAAAAAPAPAAAPQPAPAPAPAPAPPSPAGNAPAGGAPSPPP-----AAAPSAQPAPAPAA 472
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 334187096  143 AASSSGFPAfgPSGSVAAGPPPGSRPMAFGSPPPVGSGMSMP---------------------------PSGMIGGPVSN 195
Cdd:PRK07764  473 APEPTAAPA--PAPPAAPAPAAAPAAPAAPAAPAGADDAATLrerwpeilaavpkrsrktwaillpeatVLGVRGDTLVL 550
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 334187096  196 GH------QMVGSGGFPR----------GTQF---------PGAAVTTPQAPYVRPPSAPYARTPPQPLGSHSLSGNPPL 250
Cdd:PRK07764  551 GFstgglaRRFASPGNAEvlvtalaeelGGDWqveavvgpaPGAAGGEGPPAPASSGPPEEAARPAAPAAPAAPAAPAPA 630
                         250       260       270       280       290       300       310       320
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 334187096  251 TPFTAPSMPPPATFPGAPHGRPAVSGLPYGPPSAQVAPPLGFPGQMQPPRYGMGPLPNQSMTNIPTAMGQPGATVPGPSR 330
Cdd:PRK07764  631 GAAAAPAEASAAPAPGVAAPEHHPKHVAVPDASDGGDGWPAKAGGAAPAAPPPAPAPAAPAAPAGAAPAQPAPAPAATPP 710
                         330       340       350
                  ....*....|....*....|....*....|..
gi 334187096  331 IDPNQIPRPGSSSSPTVFETRQSNQANPPPPA 362
Cdd:PRK07764  711 AGQADDPAAQPPQAAQGASAPSPAADDPVPLP 742
PHA03247 PHA03247
large tegument protein UL36; Provisional
4-343 4.58e-10

large tegument protein UL36; Provisional


Pssm-ID: 223021 [Multi-domain]  Cd Length: 3151  Bit Score: 64.19  E-value: 4.58e-10
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 334187096    4 PVPPGAPrpNSQQNSGPPNFYPGSQGNSNALADNMQNLSLNRPPPMMPGSGPRPPPPFGQSPQPfpqqspsygapqRGPS 83
Cdd:PHA03247 2720 PLPPGPA--AARQASPALPAAPAPPAVPAGPATPGGPARPARPPTTAGPPAPAPPAAPAAGPPR------------RLTR 2785
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 334187096   84 PmsrPGPPAGMARPGGPPPvSQPAGFQSNVPLNRPTGPPSRQPSFGSRPSMPGGPVAQPAASSSGFPAFGPSGSVAAGPP 163
Cdd:PHA03247 2786 P---AVASLSESRESLPSP-WDPADPPAAVLAPAAALPPAASPAGPLPPPTSAQPTAPPPPPGPPPPSLPLGGSVAPGGD 2861
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 334187096  164 PGSRPMAfGSPPPVGSGMSMPPSGMIGGPvsnghqmvgsggfprgtqfpgaAVTTPQAPYVRPPSAPyaRTPPQPLGSHS 243
Cdd:PHA03247 2862 VRRRPPS-RSPAAKPAAPARPPVRRLARP----------------------AVSRSTESFALPPDQP--ERPPQPQAPPP 2916
                         250       260       270       280       290       300       310       320
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 334187096  244 LSGNPPLTPFTAPSMPPPAtfPGAPHGRPAVSGLPYGPPSAQVAPPLGFPGQMQPPRYgmgplpnqSMTNIPTAMGQPGA 323
Cdd:PHA03247 2917 PQPQPQPPPPPQPQPPPPP--PPRPQPPLAPTTDPAGAGEPSGAVPQPWLGALVPGRV--------AVPRFRVPQPAPSR 2986
                         330       340
                  ....*....|....*....|
gi 334187096  324 TVPGPSRIDPNQIPRPGSSS 343
Cdd:PHA03247 2987 EAPASSTPPLTGHSLSRVSS 3006
Atrophin-1 pfam03154
Atrophin-1 family; Atrophin-1 is the protein product of the dentatorubral-pallidoluysian ...
2-361 5.31e-10

Atrophin-1 family; Atrophin-1 is the protein product of the dentatorubral-pallidoluysian atrophy (DRPLA) gene. DRPLA OMIM:125370 is a progressive neurodegenerative disorder. It is caused by the expansion of a CAG repeat in the DRPLA gene on chromosome 12p. This results in an extended polyglutamine region in atrophin-1, that is thought to confer toxicity to the protein, possibly through altering its interactions with other proteins. The expansion of a CAG repeat is also the underlying defect in six other neurodegenerative disorders, including Huntington's disease. One interaction of expanded polyglutamine repeats that is thought to be pathogenic is that with the short glutamine repeat in the transcriptional coactivator CREB binding protein, CBP. This interaction draws CBP away from its usual nuclear location to the expanded polyglutamine repeat protein aggregates that are characteriztic of the polyglutamine neurodegenerative disorders. This interferes with CBP-mediated transcription and causes cytotoxicity.


Pssm-ID: 460830 [Multi-domain]  Cd Length: 991  Bit Score: 63.63  E-value: 5.31e-10
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 334187096     2 VAPVPPGAPRPNSQQNSGPPNFYPG-SQGNSNALADNMQNLSLNRPPPMMPGSGPRPPPPFGQSPQPFPQQS-----PSY 75
Cdd:pfam03154  181 ASPPSPPPPGTTQAATAGPTPSAPSvPPQGSPATSQPPNQTQSTAAPHTLIQQTPTLHPQRLPSPHPPLQPMtqpppPSQ 260
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 334187096    76 GAPQRGPSP-MSRPGPPAGMARPGGPPPVSQPA---GFQSNVPLNRPTGPPSRQPSF-GSRPSMPGGPVAQPAASSSGFP 150
Cdd:pfam03154  261 VSPQPLPQPsLHGQMPPMPHSLQTGPSHMQHPVppqPFPLTPQSSQSQVPPGPSPAApGQSQQRIHTPPSQSQLQSQQPP 340
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 334187096   151 AFGPSgsvaagpPPGSRPMAFGSPPPVgsgmsmPPSGMIGGPVSNGHQMVGSGgfPRGTQFPGaavTTPQAPYVRPPSA- 229
Cdd:pfam03154  341 REQPL-------PPAPLSMPHIKPPPT------TPIPQLPNPQSHKHPPHLSG--PSPFQMNS---NLPPPPALKPLSSl 402
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 334187096   230 -----PYARTPPQPLGSHSLSGNPP------LT-----PFTAPSMPPPATFPGAPHGRP--AVSGLPYGPPSaqVAPPLG 291
Cdd:pfam03154  403 sthhpPSAHPPPLQLMPQSQQLPPPpaqppvLTqsqslPPPAASHPPTSGLHQVPSQSPfpQHPFVPGGPPP--ITPPSG 480
                          330       340       350       360       370       380       390
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 334187096   292 FPGQMQPPRYGMGPLPNQSMTNIPTAMGQPGATVPgPSRIDPNQIPRPGSSSSPtvfetrqsnqanPPPP 361
Cdd:pfam03154  481 PPTSTSSAMPGIQPPSSASVSSSGPVPAAVSCPLP-PVQIKEEALDEAEEPESP------------PPPP 537
PRK07764 PRK07764
DNA polymerase III subunits gamma and tau; Validated
64-367 7.47e-10

DNA polymerase III subunits gamma and tau; Validated


Pssm-ID: 236090 [Multi-domain]  Cd Length: 824  Bit Score: 63.08  E-value: 7.47e-10
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 334187096   64 SPQPFPQQSPSYGAPQRGPSPMSRPgPPAGMARPGGPPPVSQPAGFQSNVPLNRPTGPPSRQPSFGSRPSMPGGPVAQPA 143
Cdd:PRK07764  405 APAAAPAPAAAAPAAAAAPAPAAAP-QPAPAPAPAPAPPSPAGNAPAGGAPSPPPAAAPSAQPAPAPAAAPEPTAAPAPA 483
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 334187096  144 ASSSGFPAFGPSGSVAAGPPPGSRPMAF--GSPPPV------GSGMSMP-------PSGMIGGPVSNGH------QMVGS 202
Cdd:PRK07764  484 PPAAPAPAAAPAAPAAPAAPAGADDAATlrERWPEIlaavpkRSRKTWAillpeatVLGVRGDTLVLGFstgglaRRFAS 563
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 334187096  203 GGFPR----------GTQF---------PGAAVTTPQAPYVRPPSAPYARTPPQPLGSHSLSGNPPLTPFTAPSMPPPAT 263
Cdd:PRK07764  564 PGNAEvlvtalaeelGGDWqveavvgpaPGAAGGEGPPAPASSGPPEEAARPAAPAAPAAPAAPAPAGAAAAPAEASAAP 643
                         250       260       270       280       290       300       310       320
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 334187096  264 FPGAPHGRPAVSGLPYGPPSAQVAPPLGFPGQMQPPRYGMGPLPNQSMTNIPTAMGQPGATVPGPsridpnqiPRPGSSS 343
Cdd:PRK07764  644 APGVAAPEHHPKHVAVPDASDGGDGWPAKAGGAAPAAPPPAPAPAAPAAPAGAAPAQPAPAPAAT--------PPAGQAD 715
                         330       340
                  ....*....|....*....|....
gi 334187096  344 SPTVFETRQSNQANPPPPATSDYV 367
Cdd:PRK07764  716 DPAAQPPQAAQGASAPSPAADDPV 739
Atrophin-1 pfam03154
Atrophin-1 family; Atrophin-1 is the protein product of the dentatorubral-pallidoluysian ...
66-320 2.88e-09

Atrophin-1 family; Atrophin-1 is the protein product of the dentatorubral-pallidoluysian atrophy (DRPLA) gene. DRPLA OMIM:125370 is a progressive neurodegenerative disorder. It is caused by the expansion of a CAG repeat in the DRPLA gene on chromosome 12p. This results in an extended polyglutamine region in atrophin-1, that is thought to confer toxicity to the protein, possibly through altering its interactions with other proteins. The expansion of a CAG repeat is also the underlying defect in six other neurodegenerative disorders, including Huntington's disease. One interaction of expanded polyglutamine repeats that is thought to be pathogenic is that with the short glutamine repeat in the transcriptional coactivator CREB binding protein, CBP. This interaction draws CBP away from its usual nuclear location to the expanded polyglutamine repeat protein aggregates that are characteriztic of the polyglutamine neurodegenerative disorders. This interferes with CBP-mediated transcription and causes cytotoxicity.


Pssm-ID: 460830 [Multi-domain]  Cd Length: 991  Bit Score: 61.32  E-value: 2.88e-09
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 334187096    66 QPFPQQSPSYGAPQRGPSPMSRPGPPAGMARPGGPPPVSQPagfqsnVPlNRPTGPPSRQPSFGSRPSMPGGPVAQPAAs 145
Cdd:pfam03154  322 QQRIHTPPSQSQLQSQQPPREQPLPPAPLSMPHIKPPPTTP------IP-QLPNPQSHKHPPHLSGPSPFQMNSNLPPP- 393
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 334187096   146 ssgfPAFGPSGSVAAGPPPGSRpmafgsPPPVgsgMSMPPSGMIGGPVSNGHQMVGSGGFPRgtqfPGAAVTTPQAPYVR 225
Cdd:pfam03154  394 ----PALKPLSSLSTHHPPSAH------PPPL---QLMPQSQQLPPPPAQPPVLTQSQSLPP----PAASHPPTSGLHQV 456
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 334187096   226 PPSAPYARTPPQPLGSHSL---SGNPPLTPFTAPSMPPPATFPGAPHGR-PAVSGLPYGPPSAQVAPPLGFPGQMQPPRY 301
Cdd:pfam03154  457 PSQSPFPQHPFVPGGPPPItppSGPPTSTSSAMPGIQPPSSASVSSSGPvPAAVSCPLPPVQIKEEALDEAEEPESPPPP 536
                          250
                   ....*....|....*....
gi 334187096   302 GMGPLPNQSMTNIPTAMGQ 320
Cdd:pfam03154  537 PRSPSPEPTVVNTPSHASQ 555
PHA03307 PHA03307
transcriptional regulator ICP4; Provisional
18-364 4.40e-09

transcriptional regulator ICP4; Provisional


Pssm-ID: 223039 [Multi-domain]  Cd Length: 1352  Bit Score: 60.96  E-value: 4.40e-09
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 334187096   18 SGPPNFYPGSQGNSNALADNMQNLSLNRPPPMMPGSGPRPPPPFGQSPQPFPQQSPSYGAPqRGPSPMSRPGPPAGMArP 97
Cdd:PHA03307   37 SGSQGQLVSDSAELAAVTVVAGAAACDRFEPPTGPPPGPGTEAPANESRSTPTWSLSTLAP-ASPAREGSPTPPGPSS-P 114
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 334187096   98 GGPPPVSQPAGFQSNVPLNRPTGPPSRQPSFGSRPSMPGGPVAQPAASSSGFPAFGPSGSVAAGPPPGSRPmafGSPPPV 177
Cdd:PHA03307  115 DPPPPTPPPASPPPSPAPDLSEMLRPVGSPGPPPAASPPAAGASPAAVASDAASSRQAALPLSSPEETARA---PSSPPA 191
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 334187096  178 GSGMSMPPSGMIGGPvsnghqmvGSGGFPRGTQFPGAAVTTPQAPYVRPPSAPYARTPPQPLGS----HSLSGNPPLTPF 253
Cdd:PHA03307  192 EPPPSTPPAAASPRP--------PRRSSPISASASSPAPAPGRSAADDAGASSSDSSSSESSGCgwgpENECPLPRPAPI 263
                         250       260       270       280       290       300       310       320
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 334187096  254 TAPSmPPPATFPGAPHGRPAVSGLPYGPPSAQVAPPlgfpgqmQPPRYGMGPLPNQSmTNIPTAMGQPGATVPGPSRIDP 333
Cdd:PHA03307  264 TLPT-RIWEASGWNGPSSRPGPASSSSSPRERSPSP-------SPSSPGSGPAPSSP-RASSSSSSSRESSSSSTSSSSE 334
                         330       340       350
                  ....*....|....*....|....*....|....
gi 334187096  334 ---NQIPRPGSSSSPTVFETRQSNQANPPPPATS 364
Cdd:PHA03307  335 ssrGAAVSPGPSPSRSPSPSRPPPPADPSSPRKR 368
PRK12323 PRK12323
DNA polymerase III subunit gamma/tau;
122-348 8.44e-09

DNA polymerase III subunit gamma/tau;


Pssm-ID: 237057 [Multi-domain]  Cd Length: 700  Bit Score: 59.50  E-value: 8.44e-09
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 334187096  122 PSRQPSFGSRPSMPGGPVAQPAASSSGFPAFGPSGSVAAGPPPGSRPMAFGSPPPVGSGMSMPPSgMIGGPVSNGHQMVG 201
Cdd:PRK12323  365 PGQSGGGAGPATAAAAPVAQPAPAAAAPAAAAPAPAAPPAAPAAAPAAAAAARAVAAAPARRSPA-PEALAAARQASARG 443
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 334187096  202 SGGFPRgtqfPGAAVTTPQAPYVRPPSAPyARTPPQPLGSHSLSGNPPLTPFTAPSMPPPatFPGAPHGRPAVSGLPYGP 281
Cdd:PRK12323  444 PGGAPA----PAPAPAAAPAAAARPAAAG-PRPVAAAAAAAPARAAPAAAPAPADDDPPP--WEELPPEFASPAPAQPDA 516
                         170       180       190       200       210       220
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*...
gi 334187096  282 -PSAQVAPPLGFPGQMQPPRygmgplPNQSMTNIPTAMGQPGATVPgPSRIDPNQIPRPGSSSSPTVF 348
Cdd:PRK12323  517 aPAGWVAESIPDPATADPDD------AFETLAPAPAAAPAPRAAAA-TEPVVAPRPPRASASGLPDMF 577
PRK07764 PRK07764
DNA polymerase III subunits gamma and tau; Validated
67-366 1.82e-08

DNA polymerase III subunits gamma and tau; Validated


Pssm-ID: 236090 [Multi-domain]  Cd Length: 824  Bit Score: 58.84  E-value: 1.82e-08
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 334187096   67 PFPQQSPSYGAPQRGPSPMSRPGPPAGMARPG-GPPPVSQPAGFQSNVPLNRPTGPPSRQPSFGSRPSMPGGPVAQPAAS 145
Cdd:PRK07764  418 AAAAAPAPAAAPQPAPAPAPAPAPPSPAGNAPaGGAPSPPPAAAPSAQPAPAPAAAPEPTAAPAPAPPAAPAPAAAPAAP 497
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 334187096  146 SSGFPAFGP-----------------------------SGSVAAGPPPGSRPMAFGSPPPVgsgmsmppsGMIGGPVSNG 196
Cdd:PRK07764  498 AAPAAPAGAddaatlrerwpeilaavpkrsrktwaillPEATVLGVRGDTLVLGFSTGGLA---------RRFASPGNAE 568
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 334187096  197 ------HQMVG-----------SGGFPRGTQFPGAAVTTPQAPYVRPPSAPYARTPPQPLGSHSLSGNPPLTPFTAPSMP 259
Cdd:PRK07764  569 vlvtalAEELGgdwqveavvgpAPGAAGGEGPPAPASSGPPEEAARPAAPAAPAAPAAPAPAGAAAAPAEASAAPAPGVA 648
                         250       260       270       280       290       300       310       320
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 334187096  260 PPATFPGAPHGRPAVSGLPYGPPSAQVAPPLGFPGQMQPPrygmGPLPNQSMTNIPTAMGQPGATVPGPSRIDPNQIPRP 339
Cdd:PRK07764  649 APEHHPKHVAVPDASDGGDGWPAKAGGAAPAAPPPAPAPA----APAAPAGAAPAQPAPAPAATPPAGQADDPAAQPPQA 724
                         330       340
                  ....*....|....*....|....*..
gi 334187096  340 GSSSSPtvfeTRQSNQANPPPPATSDY 366
Cdd:PRK07764  725 AQGASA----PSPAADDPVPLPPEPDD 747
PRK12323 PRK12323
DNA polymerase III subunit gamma/tau;
69-290 3.01e-08

DNA polymerase III subunit gamma/tau;


Pssm-ID: 237057 [Multi-domain]  Cd Length: 700  Bit Score: 57.96  E-value: 3.01e-08
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 334187096   69 PQQSPSYGAPQRGPS-PMSRPGPPAGMARPGGPPPVSQPAGFQSNVPLNRPTGPPSRQPSFGSrPSMPGGPVAQPAASSS 147
Cdd:PRK12323  365 PGQSGGGAGPATAAAaPVAQPAPAAAAPAAAAPAPAAPPAAPAAAPAAAAAARAVAAAPARRS-PAPEALAAARQASARG 443
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 334187096  148 GFPAFGPSGSVAAGPPPGSRPMAFGSPPPVGSGMSMPPSGMIGGPVSNGHQmvgsgGFPRGTQFPGA-AVTTPQAPYVRP 226
Cdd:PRK12323  444 PGGAPAPAPAPAAAPAAAARPAAAGPRPVAAAAAAAPARAAPAAAPAPADD-----DPPPWEELPPEfASPAPAQPDAAP 518
                         170       180       190       200       210       220
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*.
gi 334187096  227 PSAPYARTPPQPLGSHSLSGNPPLTPFTAPSMPPPA--TFPGAPHGRPAVSGLPYGPPSAQVAPPL 290
Cdd:PRK12323  519 AGWVAESIPDPATADPDDAFETLAPAPAAAPAPRAAaaTEPVVAPRPPRASASGLPDMFDGDWPAL 584
PRK07764 PRK07764
DNA polymerase III subunits gamma and tau; Validated
64-262 3.25e-08

DNA polymerase III subunits gamma and tau; Validated


Pssm-ID: 236090 [Multi-domain]  Cd Length: 824  Bit Score: 57.69  E-value: 3.25e-08
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 334187096   64 SPQPFPQQSPS-YGAPQRGPSPmSRPGPPAGMARPGGPPPVSQPAGFQSNVPLNRPTGPPSRQPSFGSRPSMPGGPVAQP 142
Cdd:PRK07764  600 PPAPASSGPPEeAARPAAPAAP-AAPAAPAPAGAAAAPAEASAAPAPGVAAPEHHPKHVAVPDASDGGDGWPAKAGGAAP 678
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 334187096  143 AASS-SGFPAFGPSGSVAAGPPPGSRPMAFGSPPPVGSGMSMPPSGMIGG--PVSNGHQMVGSGGFPRGTQFPGAAVTTP 219
Cdd:PRK07764  679 AAPPpAPAPAAPAAPAGAAPAQPAPAPAATPPAGQADDPAAQPPQAAQGAsaPSPAADDPVPLPPEPDDPPDPAGAPAQP 758
                         170       180       190       200
                  ....*....|....*....|....*....|....*....|...
gi 334187096  220 QAPYVRPPSAPYARTPPQPlgshSLSGNPPLTPFTAPSMPPPA 262
Cdd:PRK07764  759 PPPPAPAPAAAPAAAPPPS----PPSEEEEMAEDDAPSMDDED 797
PHA03307 PHA03307
transcriptional regulator ICP4; Provisional
71-371 5.72e-08

transcriptional regulator ICP4; Provisional


Pssm-ID: 223039 [Multi-domain]  Cd Length: 1352  Bit Score: 57.10  E-value: 5.72e-08
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 334187096   71 QSPSYGAPQ--------RGPSPMSRPGPPAGMARPGGPPPVSQPAGFQSNVPLNRPTGPPSRQPSFGSRPSMPGGPVAQP 142
Cdd:PHA03307   33 DDLLSGSQGqlvsdsaeLAAVTVVAGAAACDRFEPPTGPPPGPGTEAPANESRSTPTWSLSTLAPASPAREGSPTPPGPS 112
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 334187096  143 aaSSSGFPAFGPSGSVAAGPPPGSRPMAFGSPPPVGSGMSMPPsgmiggPVSNGHQMVGSGGF-PRGTQFPGAAVTTPQA 221
Cdd:PHA03307  113 --SPDPPPPTPPPASPPPSPAPDLSEMLRPVGSPGPPPAASPP------AAGASPAAVASDAAsSRQAALPLSSPEETAR 184
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 334187096  222 PYVRPPSAPYARTPPQPLGSHSLSGNPPLTPFTAPSMPPPATFPGAPHGRPAVSGLPYGPPSAQVAPPLGFPGqmqpPRY 301
Cdd:PHA03307  185 APSSPPAEPPPSTPPAAASPRPPRRSSPISASASSPAPAPGRSAADDAGASSSDSSSSESSGCGWGPENECPL----PRP 260
                         250       260       270       280       290       300       310
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 334187096  302 GMGPLPNQSMTNIPTAMGQPGATVPGPSRIDPNQIPRPGSSSSptvfetrqSNQANPPPPATSDYVVRDT 371
Cdd:PHA03307  261 APITLPTRIWEASGWNGPSSRPGPASSSSSPRERSPSPSPSSP--------GSGPAPSSPRASSSSSSSR 322
dnaA PRK14086
chromosomal replication initiator protein DnaA;
160-364 9.48e-08

chromosomal replication initiator protein DnaA;


Pssm-ID: 237605 [Multi-domain]  Cd Length: 617  Bit Score: 55.99  E-value: 9.48e-08
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 334187096  160 AGPPPGSRPMAFGSPPPVGSGMSMPPSGMIGGPVSNGHQMvgsgGFPRGTQFPGAAVTTPQApYVRPPSAPYARTPPQPL 239
Cdd:PRK14086   92 AGEPAPPPPHARRTSEPELPRPGRRPYEGYGGPRADDRPP----GLPRQDQLPTARPAYPAY-QQRPEPGAWPRAADDYG 166
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 334187096  240 GSHSLSGNPPLTPFTAPSMPPPATFPGAPhgrpavsglPYGPPSAQVAPPLGFPGQmqpPRYGMGPlPNQSMTNIP---T 316
Cdd:PRK14086  167 WQQQRLGFPPRAPYASPASYAPEQERDRE---------PYDAGRPEYDQRRRDYDH---PRPDWDR-PRRDRTDRPeppP 233
                         170       180       190       200
                  ....*....|....*....|....*....|....*....|....*...
gi 334187096  317 AMGQPGATVPGPSRIDPNQIPrPGSSSSPTVFETRQSNQANPPPPATS 364
Cdd:PRK14086  234 GAGHVHRGGPGPPERDDAPVV-PIRPSAPGPLAAQPAPAPGPGEPTAR 280
PHA03247 PHA03247
large tegument protein UL36; Provisional
130-363 9.62e-08

large tegument protein UL36; Provisional


Pssm-ID: 223021 [Multi-domain]  Cd Length: 3151  Bit Score: 56.87  E-value: 9.62e-08
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 334187096  130 SRPSMPGGPVAQ-PAASSSGFPAFGPSGSVAAGPPPGSRPMAFGSPPPvgsgmSMPPSGMIGGPVS-------NGHQMVG 201
Cdd:PHA03247 2470 LGELFPGAPVYRrPAEARFPFAAGAAPDPGGGGPPDPDAPPAPSRLAP-----AILPDEPVGEPVHprmltwiRGLEELA 2544
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 334187096  202 S--GGFPRGTQFPGAAVTTP--QAPYVRP---PSAPYART-------PPQPLGSHSL---SGNPPLTPftAPSMPPPATF 264
Cdd:PHA03247 2545 SddAGDPPPPLPPAAPPAAPdrSVPPPRPaprPSEPAVTSrarrpdaPPQSARPRAPvddRGDPRGPA--PPSPLPPDTH 2622
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 334187096  265 ---PGAPHGRPAVSGLPYGPPSAQVAPPLGF----PGQMQPPRYGMGPLPNQSMTNIPTAMGQPGA--TV--------PG 327
Cdd:PHA03247 2623 apdPPPPSPSPAANEPDPHPPPTVPPPERPRddpaPGRVSRPRRARRLGRAAQASSPPQRPRRRAArpTVgsltsladPP 2702
                         250       260       270
                  ....*....|....*....|....*....|....*.
gi 334187096  328 PSRIDPNQIPRPGSSSSPTVFETRQSNQANPPPPAT 363
Cdd:PHA03247 2703 PPPPTPEPAPHALVSATPLPPGPAAARQASPALPAA 2738
PRK07764 PRK07764
DNA polymerase III subunits gamma and tau; Validated
63-238 9.99e-08

DNA polymerase III subunits gamma and tau; Validated


Pssm-ID: 236090 [Multi-domain]  Cd Length: 824  Bit Score: 56.15  E-value: 9.99e-08
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 334187096   63 QSPQPFPQQSPSYGAPQRGPSPMSRPGPPAGMARPGGPPPVSQPAGFQSNVPLNRPTGPPSRQPSFGSRPSMPGGPvAQP 142
Cdd:PRK07764  623 APAAPAPAGAAAAPAEASAAPAPGVAAPEHHPKHVAVPDASDGGDGWPAKAGGAAPAAPPPAPAPAAPAAPAGAAP-AQP 701
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 334187096  143 AASSSGFPAFGPSGSVAAGPPPGSRpmAFGSPPPVGSGMSMPPSGMIGGPVSNGHqmvGSGGFPRGTQFPGAAVTTPQAP 222
Cdd:PRK07764  702 APAPAATPPAGQADDPAAQPPQAAQ--GASAPSPAADDPVPLPPEPDDPPDPAGA---PAQPPPPPAPAPAAAPAAAPPP 776
                         170
                  ....*....|....*.
gi 334187096  223 YVRPPSAPYARTPPQP 238
Cdd:PRK07764  777 SPPSEEEEMAEDDAPS 792
PHA03307 PHA03307
transcriptional regulator ICP4; Provisional
3-262 1.51e-07

transcriptional regulator ICP4; Provisional


Pssm-ID: 223039 [Multi-domain]  Cd Length: 1352  Bit Score: 55.95  E-value: 1.51e-07
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 334187096    3 APVPPGAPRPNSQQNSGPPnfyPGSQGNSNALADNmQNLSLNRPPPMMPGSGPRPPPPFGQSPQPFPQQSPSYGAPQRGP 82
Cdd:PHA03307  185 APSSPPAEPPPSTPPAAAS---PRPPRRSSPISAS-ASSPAPAPGRSAADDAGASSSDSSSSESSGCGWGPENECPLPRP 260
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 334187096   83 SPMSRPGPPAGMARPG----GPPPVSQPAGFQSNVPLNRPTGPPSRQPSFGSRPSMPGGPVAQPAASSSGFPAFGPSGSV 158
Cdd:PHA03307  261 APITLPTRIWEASGWNgpssRPGPASSSSSPRERSPSPSPSSPGSGPAPSSPRASSSSSSSRESSSSSTSSSSESSRGAA 340
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 334187096  159 AAGPPPGSRPMAFGSPPPVGSGMSMPPSGMIGGPVSNGHQMVGSGGFPRGtqfpGAAVTTPQAPYVRPPSAPYARTPPQP 238
Cdd:PHA03307  341 VSPGPSPSRSPSPSRPPPPADPSSPRKRPRPSRAPSSPAASAGRPTRRRA----RAAVAGRARRRDATGRFPAGRPRPSP 416
                         250       260       270
                  ....*....|....*....|....*....|..
gi 334187096  239 LGSHSLSGN-----PPLTPFTAP---SMPPPA 262
Cdd:PHA03307  417 LDAGAASGAfyaryPLLTPSGEPwpgSPPPPP 448
PHA03378 PHA03378
EBNA-3B; Provisional
78-347 1.54e-07

EBNA-3B; Provisional


Pssm-ID: 223065 [Multi-domain]  Cd Length: 991  Bit Score: 55.84  E-value: 1.54e-07
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 334187096   78 PQRGPSPMSRP-GPPAGMARPGG-----PPPVSQPAGFQSNVPLNRPTGPPSRQPSFGSRPSMPGGPvAQPAASSSGFPA 151
Cdd:PHA03378  697 PPRAPTPMRPPaAPPGRAQRPAAatgraRPPAAAPGRARPPAAAPGRARPPAAAPGRARPPAAAPGR-ARPPAAAPGAPT 775
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 334187096  152 fgPSGSVAAGPPPGSRPMAFGSP------PPVGSGMSMPPSGMIGGPVSNGHQMVGSGGFPRG---TQFPGAAvtTPQAP 222
Cdd:PHA03378  776 --PQPPPQAPPAPQQRPRGAPTPqpppqaGPTSMQLMPRAAPGQQGPTKQILRQLLTGGVKRGrpsLKKPAAL--ERQAA 851
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 334187096  223 YVRPPSapyartPPQPLGSHSLSgnppltpftAPSMPPPATFPGAPHGRPAVSglpyGPPSAQVAP--PLGFPGQmqppR 300
Cdd:PHA03378  852 AGPTPS------PGSGTSDKIVQ---------APVFYPPVLQPIQVMRQLGSV----RAAAASTVTqaPTEYTGE----R 908
                         250       260       270       280
                  ....*....|....*....|....*....|....*....|....*..
gi 334187096  301 YGMGPLPnqsMTNIPtamgqPGATVPGPSRIDPnQIPRPGSSSSPTV 347
Cdd:PHA03378  909 RGVGPMH---PTDIP-----PSKRAKTDAYVES-QPPHGGQSHSFSV 946
PRK07764 PRK07764
DNA polymerase III subunits gamma and tau; Validated
63-271 2.36e-07

DNA polymerase III subunits gamma and tau; Validated


Pssm-ID: 236090 [Multi-domain]  Cd Length: 824  Bit Score: 54.99  E-value: 2.36e-07
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 334187096   63 QSPQPFPQQSPSYGAPQRGPSPMSRPGPPAGMARPGGPPPVSQPAGfqsnvplnRPTGPPSRQPSFGSRPSMPGGPVAQP 142
Cdd:PRK07764  611 EAARPAAPAAPAAPAAPAPAGAAAAPAEASAAPAPGVAAPEHHPKH--------VAVPDASDGGDGWPAKAGGAAPAAPP 682
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 334187096  143 AASSSGFPafgPSGSVAAGPPPGSRPMAFGSPPPVGSGMSMPPSGMIGG--PVSNGHQMVGSGGFPRGTQFPGAAVTTPQ 220
Cdd:PRK07764  683 PAPAPAAP---AAPAGAAPAQPAPAPAATPPAGQADDPAAQPPQAAQGAsaPSPAADDPVPLPPEPDDPPDPAGAPAQPP 759
                         170       180       190       200       210
                  ....*....|....*....|....*....|....*....|....*....|.
gi 334187096  221 apyvrPPSAPYARTPPQPlgshslsGNPPLTPFTAPSMPPPATFPGAPHGR 271
Cdd:PRK07764  760 -----PPPAPAPAAAPAA-------APPPSPPSEEEEMAEDDAPSMDDEDR 798
PRK12323 PRK12323
DNA polymerase III subunit gamma/tau;
69-287 2.54e-07

DNA polymerase III subunit gamma/tau;


Pssm-ID: 237057 [Multi-domain]  Cd Length: 700  Bit Score: 54.88  E-value: 2.54e-07
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 334187096   69 PQQSPSYGAPQRGPSPMSRPGPPAGMARPGGPPPVSQPAGFQSnvplnRPTGPPSRQPSFGSRPSMPGGPVAQPAASSSG 148
Cdd:PRK12323  397 PAPAAPPAAPAAAPAAAAAARAVAAAPARRSPAPEALAAARQA-----SARGPGGAPAPAPAPAAAPAAAARPAAAGPRP 471
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 334187096  149 FPAFGPSGSVAAGPPPGSRPMAFGSPPpvgsGMSMPPSGMIGGPVSNghqmvgSGGFPRGTQFPGAAVTTPQAPYVRPPS 228
Cdd:PRK12323  472 VAAAAAAAPARAAPAAAPAPADDDPPP----WEELPPEFASPAPAQP------DAAPAGWVAESIPDPATADPDDAFETL 541
                         170       180       190       200       210
                  ....*....|....*....|....*....|....*....|....*....|....*....
gi 334187096  229 APYARTPPQPLGSHSLSGNPPLTPFTAPSMPPPATFPGAPHGRPAvsGLPYGPPSAQVA 287
Cdd:PRK12323  542 APAPAAAPAPRAAAATEPVVAPRPPRASASGLPDMFDGDWPALAA--RLPVRGLAQQLA 598
PRK07003 PRK07003
DNA polymerase III subunit gamma/tau;
88-367 3.64e-07

DNA polymerase III subunit gamma/tau;


Pssm-ID: 235906 [Multi-domain]  Cd Length: 830  Bit Score: 54.47  E-value: 3.64e-07
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 334187096   88 PGPPAGMARPGGPPPVsqpagfqsnVPLNRPTGPPSRQPSFGSRPSMPGGPVAQPAASSSGFPAFGPSGSVAAGPPPGSr 167
Cdd:PRK07003  360 PAVTGGGAPGGGVPAR---------VAGAVPAPGARAAAAVGASAVPAVTAVTGAAGAALAPKAAAAAAATRAEAPPAA- 429
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 334187096  168 pmafgsPPPVGSGMSMPPSGMIGGPVSnghqmvGSGGFPRGTQFPGAAVTTPQAPYVRPPSAPYARTPPQPLGSHSLSGN 247
Cdd:PRK07003  430 ------PAPPATADRGDDAADGDAPVP------AKANARASADSRCDERDAQPPADSGSASAPASDAPPDAAFEPAPRAA 497
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 334187096  248 PPlTPFTAPSMPPPATFPGAPHG-RPAvsglPYGPPSAQVAPPLgfPGQMQPPRYGMGP------LPNQSM--------- 311
Cdd:PRK07003  498 AP-SAATPAAVPDARAPAAASREdAPA----AAAPPAPEARPPT--PAAAAPAARAGGAaaaldvLRNAGMrvssdrgar 570
                         250       260       270       280       290       300       310
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|.
gi 334187096  312 --TNIPTAMGQPGATVPGPSRIdPNQIPRPGSSSSPTVFETR---------QSNQANPP----PPatSDYV 367
Cdd:PRK07003  571 aaAAAKPAAAPAAAPKPAAPRV-AVQVPTPRARAATGDAPPNgaaraeqaaESRGAPPPwediPP--DDYV 638
PHA03307 PHA03307
transcriptional regulator ICP4; Provisional
10-380 3.91e-07

transcriptional regulator ICP4; Provisional


Pssm-ID: 223039 [Multi-domain]  Cd Length: 1352  Bit Score: 54.41  E-value: 3.91e-07
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 334187096   10 PRPNSQQNSGPPNFYPGSQGNSNaladnmqnlSLNRPPPMMPGSGPRPPPPFGQSPQPFPQQSPSYGAPQRGPSPMSRPG 89
Cdd:PHA03307   65 FEPPTGPPPGPGTEAPANESRST---------PTWSLSTLAPASPAREGSPTPPGPSSPDPPPPTPPPASPPPSPAPDLS 135
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 334187096   90 PPAGMARPGGPPPVSQPAGFQSNVPLNRPTGPPSRQPSFGSrpSMPGGPVAQPAASSSGFPAFGPSGSVAAGPPPGSRPM 169
Cdd:PHA03307  136 EMLRPVGSPGPPPAASPPAAGASPAAVASDAASSRQAALPL--SSPEETARAPSSPPAEPPPSTPPAAASPRPPRRSSPI 213
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 334187096  170 AFGSPPPVGSGMSMPPSGMIGGPVSNGH-QMVGSGGFPRGTqfpgaaVTTPQAPYVRPPSAPYARTPPQPLGSHSLSGNP 248
Cdd:PHA03307  214 SASASSPAPAPGRSAADDAGASSSDSSSsESSGCGWGPENE------CPLPRPAPITLPTRIWEASGWNGPSSRPGPASS 287
                         250       260       270       280       290       300       310       320
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 334187096  249 PltpfTAPSMPPPATFPGAPHGRPAVSGLPYGPPSAQVAPP-LGFPGQMQPPRYGMGPLPNQSMTNIPTAMGQPGATVPG 327
Cdd:PHA03307  288 S----SSPRERSPSPSPSSPGSGPAPSSPRASSSSSSSRESsSSSTSSSSESSRGAAVSPGPSPSRSPSPSRPPPPADPS 363
                         330       340       350       360       370
                  ....*....|....*....|....*....|....*....|....*....|...
gi 334187096  328 PSRIDPNQIPRPGSSSSPTVFETRQSNQANPPPPATSdyvvRDTGNCSPRYMR 380
Cdd:PHA03307  364 SPRKRPRPSRAPSSPAASAGRPTRRRARAAVAGRARR----RDATGRFPAGRP 412
PAT1 pfam09770
Topoisomerase II-associated protein PAT1; Members of this family are necessary for accurate ...
100-361 5.16e-07

Topoisomerase II-associated protein PAT1; Members of this family are necessary for accurate chromosome transmission during cell division.


Pssm-ID: 401645 [Multi-domain]  Cd Length: 846  Bit Score: 53.89  E-value: 5.16e-07
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 334187096   100 PPPVSQPAGFQSNVPLNRPTGPPSRQPSFGSRPSMPG-------GPVA--QPAASSSGFPAfGPSGSVAAGPPPGSRPMA 170
Cdd:pfam09770  107 PAARAAQSSAQPPASSLPQYQYASQQSQQPSKPVRTGyekykepEPIPdlQVDASLWGVAP-KKAAAPAPAPQPAAQPAS 185
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 334187096   171 FGSPPP-------VGSGMSMPPSgmiggpvsnghqmvgsggfPRGTQFPGAAVTTPQAPYVrPPSAPYARTPPQPLGSHS 243
Cdd:pfam09770  186 LPAPSRkmmsleeVEAAMRAQAK-------------------KPAQQPAPAPAQPPAAPPA-QQAQQQQQFPPQIQQQQQ 245
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 334187096   244 LSGNPPLTPFTAPSMPPPATFPGaPHGRPAVSGLPYGPPSAQVAPPLGFPGQMQPprygMGPLPNQSMtniptamgqpga 323
Cdd:pfam09770  246 PQQQPQQPQQHPGQGHPVTILQR-PQSPQPDPAQPSIQPQAQQFHQQPPPVPVQP----TQILQNPNR------------ 308
                          250       260       270
                   ....*....|....*....|....*....|....*...
gi 334187096   324 tvPGPSRIDPNQIPRPGSSSSPTVFETRQSNQANPPPP 361
Cdd:pfam09770  309 --LSAARVGYPQNPQPGVQPAPAHQAHRQQGSFGRQAP 344
PHA03307 PHA03307
transcriptional regulator ICP4; Provisional
61-365 7.13e-07

transcriptional regulator ICP4; Provisional


Pssm-ID: 223039 [Multi-domain]  Cd Length: 1352  Bit Score: 53.64  E-value: 7.13e-07
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 334187096   61 FGQSPQPFPQQSPSYGAPQRGPSPMSRPGPPAGMARPGGPPPVSQPAGFQSNVPLNRP--TGPPSRQPSFGSRPSMPGGP 138
Cdd:PHA03307  132 PDLSEMLRPVGSPGPPPAASPPAAGASPAAVASDAASSRQAALPLSSPEETARAPSSPpaEPPPSTPPAAASPRPPRRSS 211
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 334187096  139 VAQPAASSsgfPAFGPSGSVAAGPPPGSRPMAFGSPPPVGSG-MSMPPSGMIGGPVSNGHQMVGSGGFPRGTQF-PGAAV 216
Cdd:PHA03307  212 PISASASS---PAPAPGRSAADDAGASSSDSSSSESSGCGWGpENECPLPRPAPITLPTRIWEASGWNGPSSRPgPASSS 288
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 334187096  217 TTPQAPYVRP----PSAPYARTPPQPLGSHSLS--GNPPLTPFTAPSMPPPATFPGAPHGRPAVSGLPygPPSAQVAPpl 290
Cdd:PHA03307  289 SSPRERSPSPspssPGSGPAPSSPRASSSSSSSreSSSSSTSSSSESSRGAAVSPGPSPSRSPSPSRP--PPPADPSS-- 364
                         250       260       270       280       290       300       310
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*
gi 334187096  291 gfPGQMQPPRYGmGPLPNQSMTNIPTAMGQPGATVPGPSRIDPNQIPRPGSSSSPTVFETRQSNQANPPPPATSD 365
Cdd:PHA03307  365 --PRKRPRPSRA-PSSPAASAGRPTRRRARAAVAGRARRRDATGRFPAGRPRPSPLDAGAASGAFYARYPLLTPS 436
GGN pfam15685
Gametogenetin; GGN is a family of proteins largely found in mammals. It reacts with POG in the ...
63-380 7.52e-07

Gametogenetin; GGN is a family of proteins largely found in mammals. It reacts with POG in the maturation of sperm and is expressed virtually only in the testis. It is found to be associated with the intracellular membrane, binds with GGNBP1 and may be involved in vesicular trafficking.


Pssm-ID: 434857 [Multi-domain]  Cd Length: 668  Bit Score: 53.23  E-value: 7.52e-07
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 334187096    63 QSPQP-----FPQQSPSYGAPQRGPSPMSRPGPPAGMARPGGPP-----PVSQPAGFQSNVPLNRPTGPPSRQPsfgSRP 132
Cdd:pfam15685  182 SDRQPpnrgiTPALATSATSPTDSQAKHIAEGKTAGGACGGAPPqagegEMARFAASESGLSLLCKVTFKSAAP---LCP 258
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 334187096   133 SMPGGPVAQPAA--SSSGFPAFGPSGSVAAGPPPGSRPMAFGSPPPVG---------SGMSMPPSGMIGGPVSNGHQMVG 201
Cdd:pfam15685  259 AAASGPLAAKASlgGGGGGGLFAASGAISCAEVLKQGPLAPGAARPLGevpraaletEGGEGDGEGCSGGPAAPASHARA 338
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 334187096   202 SGGfPRGTQFPGaavTTPQAPYVRPP--------------------------SAPYARTPPQPLGSHSLSG--------- 246
Cdd:pfam15685  339 LPP-PAYTTFPG---SKPKFDWVSPPdgperhfrfngagggigaprrraaalSGPWGSPPPPPGKAHPIPGprrpapall 414
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 334187096   247 NPPLTPFTAPSM------PPPAT---FPGAPHGRPAVSGLPYGPPSAQVAPPLGFP-GQMQPPRYGMGPLPNQSMTnipt 316
Cdd:pfam15685  415 APPMFIFPAPTNgepvrpGPPAPqalLPRPPPPTPPATPPPVPPPIPQLPALQPMPlAAARPPTPRPCPGHGESAL---- 490
                          330       340       350       360       370       380
                   ....*....|....*....|....*....|....*....|....*....|....*....|....
gi 334187096   317 amgQPGATVPGPSRIDPNQIPRPGSSSSPTVFETRQSNQANPPPPATSDYVVRDTGNCSPRYMR 380
Cdd:pfam15685  491 ---APAPTAPLPPALAADQAPAPALAAAPAPSPAPAPATADPLPPAPAPIKARTRKNKGPRAAR 551
SOBP pfam15279
Sine oculis-binding protein; SOBP is associated with syndromic and nonsyndromic intellectual ...
80-306 7.54e-07

Sine oculis-binding protein; SOBP is associated with syndromic and nonsyndromic intellectual disability. It carries a zinc-finger of the zf-C2H2 type at the N-terminus, and a highly characteriztic C-terminal PhPhPhPhPhPh motif. The deduced 873-amino acid protein contains an N-terminal nuclear localization signal (NLS), followed by 2 FCS-type zinc finger motifs, a proline-rich region (PR1), a putative RNA-binding motif region, and a C-terminal NLS embedded in a second proline-rich motif. SOBP is expressed in various human tissues, including developing mouse brain at embryonic day 14. In postnatal and adult mouse brain SOBP is expressed in all neurons, with intense staining in the limbic system. Highest expression is in layer V cortical neurons, hippocampus, pyriform cortex, dorsomedial nucleus of thalamus, amygdala, and hypothalamus. Postnatal expression of SOBP in the limbic system corresponds to a time of active synaptogenesis. the family is also referred to as Jackson circler, JXC1. In seven affected siblings from a consanguineous Israeli Arab family with mental retardation, anterior maxillary protrusion, and strabismus mutations were found in this protein.


Pssm-ID: 464609 [Multi-domain]  Cd Length: 325  Bit Score: 52.51  E-value: 7.54e-07
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 334187096    80 RGPSPMSR----PGPPAGMARPGGPPPVSQPagfqSNVPLNRPTgPPSRQPSFGSRPSmpgGPVAQPAASSSGFPAFGPS 155
Cdd:pfam15279   81 KSASPASTrsesVSPGPSSSASPSSSPTSSN----SSKPLISVA-SSSKLLAPKPHEP---PSLPPPPLPPKKGRRHRPG 152
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 334187096   156 GSVAAGPPPGSRPMAFGSPPPVGSGMSMPPSGMIggPVSNGHQMVgsggfPRGTQFPGAAVTTPQAPyVRPPSAPYARTP 235
Cdd:pfam15279  153 LHPPLGRPPGSPPMSMTPRGLLGKPQQHPPPSPL--PAFMEPSSM-----PPPFLRPPPSIPQPNSP-LSNPMLPGIGPP 224
                          170       180       190       200       210       220       230
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|..
gi 334187096   236 PQPLGSHSLSGNPPLTP-FTAPSMPPPATFPGAPHGRPAVSGLPYGPPSAQVAPPLGFPGQMQPPRYGMGPL 306
Cdd:pfam15279  225 PKPPRNLGPPSNPMHRPpFSPHHPPPPPTPPGPPPGLPPPPPRGFTPPFGPPFPPVNMMPNPPEMNFGLPSL 296
PHA03307 PHA03307
transcriptional regulator ICP4; Provisional
72-289 1.07e-06

transcriptional regulator ICP4; Provisional


Pssm-ID: 223039 [Multi-domain]  Cd Length: 1352  Bit Score: 53.25  E-value: 1.07e-06
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 334187096   72 SPSYGAPQRGPSPMSRPGPPAGMARPGGPPPVSQPAGFqsnvplnRPTGPPsrqpsfgsRPSMPGGPVAQPAASSSGFPA 151
Cdd:PHA03307  759 SNPSLVPAKLAEALALLEPAEPQRGAGSSPPVRAEAAF-------RRPGRL--------RRSGPAADAASRTASKRKSRS 823
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 334187096  152 FGPSGSVAAgPPPGSRPMAFGSPPPVGSGMSMPPSGMIGGPVSNGHQMVGSGGFPRGTQFPGAAVTTPQAPYVRPPSAPY 231
Cdd:PHA03307  824 HTPDGGSES-SGPARPPGAAARPPPARSSESSKSKPAAAGGRARGKNGRRRPRPPEPRARPGAAAPPKAAAAAPPAGAPA 902
                         170       180       190       200       210
                  ....*....|....*....|....*....|....*....|....*....|....*...
gi 334187096  232 ARTPPQPLgshslsgnPPLTPftapsMPppatfPGAPHGRPAVSGLPYGPPSAQVAPP 289
Cdd:PHA03307  903 PRPRPAPR--------VKLGP-----MP-----PGGPDPRGGFRRVPPGDLHTPAPSA 942
PRK14959 PRK14959
DNA polymerase III subunits gamma and tau; Provisional
167-284 1.88e-06

DNA polymerase III subunits gamma and tau; Provisional


Pssm-ID: 184923 [Multi-domain]  Cd Length: 624  Bit Score: 51.99  E-value: 1.88e-06
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 334187096  167 RPMAFGSPPPVGSGMSMPPSGMIGGPVSNGHQMVgSGGFPRGTQFPGAAVTTPQAPYVRP-PSAPYARTPPQPlgshSLS 245
Cdd:PRK14959  372 RPSGGGASAPSGSAAEGPASGGAATIPTPGTQGP-QGTAPAAGMTPSSAAPATPAPSAAPsPRVPWDDAPPAP----PRS 446
                          90       100       110       120
                  ....*....|....*....|....*....|....*....|.
gi 334187096  246 GNPpltPFTAPSMPPPATFPGAPHGRPAVSGLP--YGPPSA 284
Cdd:PRK14959  447 GIP---PRPAPRMPEASPVPGAPDSVASASDAPptLGDPSD 484
PHA03378 PHA03378
EBNA-3B; Provisional
6-222 5.31e-06

EBNA-3B; Provisional


Pssm-ID: 223065 [Multi-domain]  Cd Length: 991  Bit Score: 50.84  E-value: 5.31e-06
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 334187096    6 PPGAPRPNSqqnsgPPNFYPGSQGNSNAladnmqnlSLNRPPPMMPGSGPRPPPPFGQSPQPFPQQSPS-----YGAPQR 80
Cdd:PHA03378  697 PPRAPTPMR-----PPAAPPGRAQRPAA--------ATGRARPPAAAPGRARPPAAAPGRARPPAAAPGrarppAAAPGR 763
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 334187096   81 GPSPMSRPGPPAGMarpggPPPVSQPAGFQ----SNVPLNRPTGPPS------RQPSFGSRPSMPGGPVAQPAASSSGFP 150
Cdd:PHA03378  764 ARPPAAAPGAPTPQ-----PPPQAPPAPQQrprgAPTPQPPPQAGPTsmqlmpRAAPGQQGPTKQILRQLLTGGVKRGRP 838
                         170       180       190       200       210       220       230
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|..
gi 334187096  151 AFGPSGSVAAGPPPGSRPmafgsPPPVGSGMSMPPSGMIGGPVSNGHQMVGSGGFPRgtqfPGAAVTTPQAP 222
Cdd:PHA03378  839 SLKKPAALERQAAAGPTP-----SPGSGTSDKIVQAPVFYPPVLQPIQVMRQLGSVR----AAAASTVTQAP 901
PPE COG5651
PPE-repeat protein [Function unknown];
90-308 2.37e-05

PPE-repeat protein [Function unknown];


Pssm-ID: 444372 [Multi-domain]  Cd Length: 385  Bit Score: 47.97  E-value: 2.37e-05
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 334187096   90 PPAGMARPGGPPPVSqpagfqsnvplNRPTGPPSRQPSFGSRPSMPGGPVAQPAASSSGFPAFGPSGSVAAGPPPGSRPM 169
Cdd:COG5651   170 PPPTITNPGGLLGAQ-----------NAGSGNTSSNPGFANLGLTGLNQVGIGGLNSGSGPIGLNSGPGNTGFAGTGAAA 238
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 334187096  170 AFGSPPPVGSGMSMPPSGMIGGPVSNGHQMVGSGGFPRGT--QFPGAAVTTPQAP--YVRPPSAPYARTPPQPLGSHSLS 245
Cdd:COG5651   239 GAAAAAAAAAAAAGAGASAALASLAATLLNASSLGLAATAasSAATNLGLAGSPLglAGGGAGAAAATGLGLGAGGAAGA 318
                         170       180       190       200       210       220
                  ....*....|....*....|....*....|....*....|....*....|....*....|...
gi 334187096  246 GNPPLTPFTAPSMPPPATfPGAPHGRPAVSGLPYGPPSAQVAPPLGFPGQMQPPRYGMGPLPN 308
Cdd:COG5651   319 AGATGAGAALGAGAAAAA-AGAAAGAGAAAAAAAGGAGGGGGGALGAGGGGGSAGAAAGAASG 380
PRK10263 PRK10263
DNA translocase FtsK; Provisional
88-346 2.79e-05

DNA translocase FtsK; Provisional


Pssm-ID: 236669 [Multi-domain]  Cd Length: 1355  Bit Score: 48.54  E-value: 2.79e-05
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 334187096   88 PGPPAgmarPGGPPPVSQPAGFQSNVPlnrptGPPSRQPSFGSRPS--MPGGPVAQPAASSSG-----FPAFGPSGSVAA 160
Cdd:PRK10263  342 QTPPV----ASVDVPPAQPTVAWQPVP-----GPQTGEPVIAPAPEgyPQQSQYAQPAVQYNEplqqpVQPQQPYYAPAA 412
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 334187096  161 GPPPGSRPMAFGSPPPVGSGMSMPPSGMIGGPVSNGHQmvgsggfPRGTQFPGAAVTTPQAPYVRPPSAPYARTPPQPLG 240
Cdd:PRK10263  413 EQPAQQPYYAPAPEQPAQQPYYAPAPEQPVAGNAWQAE-------EQQSTFAPQSTYQTEQTYQQPAAQEPLYQQPQPVE 485
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 334187096  241 SHSLSGNPPLTPFTAPSMPPPATFPGAPHGR----------------PAVSGLPYGP----PSAQVAPPLGFPGQMQPPR 300
Cdd:PRK10263  486 QQPVVEPEPVVEETKPARPPLYYFEEVEEKRarereqlaawyqpipePVKEPEPIKSslkaPSVAAVPPVEAAAAVSPLA 565
                         250       260       270       280       290
                  ....*....|....*....|....*....|....*....|....*....|....*..
gi 334187096  301 YGMgplpnQSMTNIPTAMGQPGATV--------PGPS---RIDPnQIPRPGSSSSPT 346
Cdd:PRK10263  566 SGV-----KKATLATGAAATVAAPVfslansggPRPQvkeGIGP-QLPRPKRIRVPT 616
PHA03247 PHA03247
large tegument protein UL36; Provisional
66-318 4.65e-05

large tegument protein UL36; Provisional


Pssm-ID: 223021 [Multi-domain]  Cd Length: 3151  Bit Score: 48.01  E-value: 4.65e-05
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 334187096   66 QPFPQQSPSYGAPQRGPSPMSRPgPPAGMARPGGPPPVSQPAGFQSNVPLNRPTGPPSRQPSFGSRPSMPGGPVAQPAAS 145
Cdd:PHA03247  235 EPFVERRVVISHPLRGDIAAPAP-PPVVGEGADRAPETARGATGPPPPPEAAAPNGAAAPPDGVWGAALAGAPLALPAPP 313
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 334187096  146 SsgfPAFGPSGSVAAGPPPGSRPMAFGSPPPvgsgmsmppsgmiggpvsnGHQMVGSGGFP---RGTQFPGAAVTTPQAP 222
Cdd:PHA03247  314 D---PPPPAPAGDAEEEDDEDGAMEVVSPLP-------------------RPRQHYPLGFPkrrRPTWTPPSSLEDLSAG 371
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 334187096  223 YVRPPSAP-------YARTPPQPLGSHSLSGNPPLTPFTAPSMPPPATFPGAPHGRPAVSGLPygPPSAQVAPPLG--FP 293
Cdd:PHA03247  372 RHHPKRASlptrkrrSARHAATPFARGPGGDDQTRPAAPVPASVPTPAPTPVPASAPPPPATP--LPSAEPGSDDGpaPP 449
                         250       260
                  ....*....|....*....|....*
gi 334187096  294 GQMQPPRYGMGPLPNQSMTNIPTAM 318
Cdd:PHA03247  450 PERQPPAPATEPAPDDPDDATRKAL 474
PRK07764 PRK07764
DNA polymerase III subunits gamma and tau; Validated
3-177 4.76e-05

DNA polymerase III subunits gamma and tau; Validated


Pssm-ID: 236090 [Multi-domain]  Cd Length: 824  Bit Score: 47.67  E-value: 4.76e-05
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 334187096    3 APVPPGAPRPNSQQNSGPPNFYPGSQGNSNALADNMQNLSLNRPPPMMPGSGPRPPPPFGQSPQPFPQQSPSYGAPQRGP 82
Cdd:PRK07764  606 SGPPEEAARPAAPAAPAAPAAPAPAGAAAAPAEASAAPAPGVAAPEHHPKHVAVPDASDGGDGWPAKAGGAAPAAPPPAP 685
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 334187096   83 SPMSRPGPPAGMARPGGPPPVSQPAGFQSNVPLNRPTGPP----SRQPSFGSRPSMPGGPVAQPAASSSGFPAFGPSGSV 158
Cdd:PRK07764  686 APAAPAAPAGAAPAQPAPAPAATPPAGQADDPAAQPPQAAqgasAPSPAADDPVPLPPEPDDPPDPAGAPAQPPPPPAPA 765
                         170
                  ....*....|....*....
gi 334187096  159 AAGPPPGSRPMAFGSPPPV 177
Cdd:PRK07764  766 PAAAPAAAPPPSPPSEEEE 784
PAT1 pfam09770
Topoisomerase II-associated protein PAT1; Members of this family are necessary for accurate ...
31-297 5.35e-05

Topoisomerase II-associated protein PAT1; Members of this family are necessary for accurate chromosome transmission during cell division.


Pssm-ID: 401645 [Multi-domain]  Cd Length: 846  Bit Score: 47.34  E-value: 5.35e-05
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 334187096    31 SNALADNMQNLSLNRPPPMMPGSGPRPPPPFGQSPQPFPQQSPSYGAPQR-GPSPMSRPGPPA---------GMARPGGP 100
Cdd:pfam09770   92 SDAIEEEQVRFNRQQPAARAAQSSAQPPASSLPQYQYASQQSQQPSKPVRtGYEKYKEPEPIPdlqvdaslwGVAPKKAA 171
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 334187096   101 PPVSQPAgfqsnvplnrPTGPPSRQPSfGSRPSM------------PGGPVAQPAASSSGFPAfgpsgsvaagPPPGSRP 168
Cdd:pfam09770  172 APAPAPQ----------PAAQPASLPA-PSRKMMsleeveaamraqAKKPAQQPAPAPAQPPA----------APPAQQA 230
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 334187096   169 MAFGSPPPVGSGMSMPpsgmiggpvsngHQMVGSGGFPRGTQFPGAAVTTPQAPYVrPPSAPYARTPPQPLGSHSLSGNP 248
Cdd:pfam09770  231 QQQQQFPPQIQQQQQP------------QQQPQQPQQHPGQGHPVTILQRPQSPQP-DPAQPSIQPQAQQFHQQPPPVPV 297
                          250       260       270       280       290
                   ....*....|....*....|....*....|....*....|....*....|
gi 334187096   249 -PLTPFTAPSMPPPATFPGAPHGRPAVSGLPyGPPSAQVAPPLGFPGQMQ 297
Cdd:pfam09770  298 qPTQILQNPNRLSAARVGYPQNPQPGVQPAP-AHQAHRQQGSFGRQAPII 346
PRK07764 PRK07764
DNA polymerase III subunits gamma and tau; Validated
2-176 6.30e-05

DNA polymerase III subunits gamma and tau; Validated


Pssm-ID: 236090 [Multi-domain]  Cd Length: 824  Bit Score: 47.29  E-value: 6.30e-05
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 334187096    2 VAPVPPGAPRPNSQQNSGPPNFYPGSQGNSNALADNMQNLSlnrPPPMMPGSGPRPPPPFGQSPQPFPQQSPSyGAPQRG 81
Cdd:PRK07764  625 AAPAPAGAAAAPAEASAAPAPGVAAPEHHPKHVAVPDASDG---GDGWPAKAGGAAPAAPPPAPAPAAPAAPA-GAAPAQ 700
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 334187096   82 PSPMSRPGPPAGMARPGGPPPVSQPAGFQSNVPLNRPTGPPSRQPSFGSRPSMPGGPVAQPAASSSGFPAFGPSGSVAAG 161
Cdd:PRK07764  701 PAPAPAATPPAGQADDPAAQPPQAAQGASAPSPAADDPVPLPPEPDDPPDPAGAPAQPPPPPAPAPAAAPAAAPPPSPPS 780
                         170
                  ....*....|....*
gi 334187096  162 PPPGSRPMAFGSPPP 176
Cdd:PRK07764  781 EEEEMAEDDAPSMDD 795
PHA03247 PHA03247
large tegument protein UL36; Provisional
208-365 8.53e-05

large tegument protein UL36; Provisional


Pssm-ID: 223021 [Multi-domain]  Cd Length: 3151  Bit Score: 46.86  E-value: 8.53e-05
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 334187096  208 GTQFPGAAVttpqapYVRPPSA--PYARTP-PQPLGShslsGNPPLTPFTAPSMPPPATFPGAPHGRPA-------VSGL 277
Cdd:PHA03247 2471 GELFPGAPV------YRRPAEArfPFAAGAaPDPGGG----GPPDPDAPPAPSRLAPAILPDEPVGEPVhprmltwIRGL 2540
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 334187096  278 PY---------GPPSAQVAPPLGFPGQMQPPRY----------------GMGPLPNQSMTNIPTAMGQPGATVPGPSRID 332
Cdd:PHA03247 2541 EElasddagdpPPPLPPAAPPAAPDRSVPPPRPaprpsepavtsrarrpDAPPQSARPRAPVDDRGDPRGPAPPSPLPPD 2620
                         170       180       190
                  ....*....|....*....|....*....|...
gi 334187096  333 PNQIPRPGSSSSPTVFETRQSNQANPPPPATSD 365
Cdd:PHA03247 2621 THAPDPPPPSPSPAANEPDPHPPPTVPPPERPR 2653
PBP1 COG5180
PAB1-binding protein, interacts with poly(A)-binding protein [RNA processing and modification]; ...
64-429 9.79e-05

PAB1-binding protein, interacts with poly(A)-binding protein [RNA processing and modification];


Pssm-ID: 444064 [Multi-domain]  Cd Length: 548  Bit Score: 46.21  E-value: 9.79e-05
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 334187096   64 SPQPFPQQSPSYGAPQRGPSPMSRPGPPAGMARPGGPPPVSQPAGF-QSNVPLNRPTGPPSRQPSFGSRPSMPGGPVAQP 142
Cdd:COG5180   139 REATSASAGVALAAALLQRSDPILAKDPDGDSASTLPPPAEKLDKVlTEPRDALKDSPEKLDRPKVEVKDEAQEEPPDLT 218
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 334187096  143 AASSSGFPAFGPSGSVaagpPPGSRPMAFGSPPPVGSGMSMPPSGmiggPVSNGHQMVGSggfprgtqfpgaavttpQAP 222
Cdd:COG5180   219 GGADHPRPEAASSPKV----DPPSTSEARSRPATVDAQPEMRPPA----DAKERRRAAIG-----------------DTP 273
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 334187096  223 YVRPPSAPYARTPPQPLGSHSLSGNPPLTPFTAPSMPPPATFPGAPHGRPAVSGLPYGPPSA-------QVAPPLGFPGQ 295
Cdd:COG5180   274 AAEPPGLPVLEAGSEPQSDAPEAETARPIDVKGVASAPPATRPVRPPGGARDPGTPRPGQPTerpagvpEAASDAGQPPS 353
                         250       260       270       280       290       300       310       320
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 334187096  296 MQPPRYGMGPL-PNQSMTNIPTAMGQPGATVPGPSRIDPNQIPRPGSSSSPTvfetrqsnQANPPPPATSDYVVRDTGNC 374
Cdd:COG5180   354 AYPPAEEAVPGkPLEQGAPRPGSSGGDGAPFQPPNGAPQPGLGRRGAPGPPM--------GAGDLVQAALDGGGRETASL 425
                         330       340       350       360       370
                  ....*....|....*....|....*....|....*....|....*....|....*..
gi 334187096  375 SP--RYMRCTINQIPCTVDLLSTSGMqLALMVQPLALSHPSEEPIQVVDfGEGGPVR 429
Cdd:COG5180   426 GGaaGGAGQGPKADFVPGDAESVSGP-AGLADQAGAAASTAMADFVAPV-TDATPVD 480
PPE COG5651
PPE-repeat protein [Function unknown];
133-347 1.28e-04

PPE-repeat protein [Function unknown];


Pssm-ID: 444372 [Multi-domain]  Cd Length: 385  Bit Score: 45.65  E-value: 1.28e-04
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 334187096  133 SMPGGPVAQPAASSSGFPAFGPSGSVAAGPPPGSRPMAFGSPPPVGSGMSMPPSGMIGGPVSNGHQMVGSGGFPRGTQFP 212
Cdd:COG5651   164 LTPFTQPPPTITNPGGLLGAQNAGSGNTSSNPGFANLGLTGLNQVGIGGLNSGSGPIGLNSGPGNTGFAGTGAAAGAAAA 243
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 334187096  213 GAAVTTPQAPYVRPPSAPYARTPPQPLGSH-------SLSGNPPLTPFTAPSMPPPATFPGAPHGRPAVSGLPYGPPSAq 285
Cdd:COG5651   244 AAAAAAAAGAGASAALASLAATLLNASSLGlaataasSAATNLGLAGSPLGLAGGGAGAAAATGLGLGAGGAAGAAGAT- 322
                         170       180       190       200       210       220
                  ....*....|....*....|....*....|....*....|....*....|....*....|..
gi 334187096  286 vAPPLGFPGQMQPPRYGMGPLPNQSMTNIPTAMGQPGATVPGPSRIDPNQIPRPGSSSSPTV 347
Cdd:COG5651   323 -GAGAALGAGAAAAAAGAAAGAGAAAAAAAGGAGGGGGGALGAGGGGGSAGAAAGAASGGGA 383
PHA03379 PHA03379
EBNA-3A; Provisional
59-367 1.75e-04

EBNA-3A; Provisional


Pssm-ID: 223066 [Multi-domain]  Cd Length: 935  Bit Score: 45.82  E-value: 1.75e-04
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 334187096   59 PPF--GQSPQPFpQQSPSYGA---PQRGPSPMSRPGP----------PAGMARPGGPPPVS----QPAGFQSN----VPL 115
Cdd:PHA03379  420 VEKprPEVPQSL-ETATSHGSaqvPEPPPVHDLEPGPlhdqhsmapcPVAQLPPGPLQDLEpgdqLPGVVQDGrpacAPV 498
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 334187096  116 NRPTGP------------PSRQPSFGSRPSMPGGPVAQPAASSSGFPAFGPSGSVAAGP--PPGSRPMAfgspppvgsgM 181
Cdd:PHA03379  499 PAPAGPivrpweaslsqvPGVAFAPVMPQPMPVEPVPVPTVALERPVCPAPPLIAMQGPgeTSGIVRVR----------E 568
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 334187096  182 SMPPSGMIGGPVSNGHQMVGSGGFPRGTqfpgaavttpqaPYVRPPSAPYARTPPQplgshsLSGNPPLTPFTAPSMPPP 261
Cdd:PHA03379  569 RWRPAPWTPNPPRSPSQMSVRDRLARLR------------AEAQPYQASVEVQPPQ------LTQVSPQQPMEYPLEPEQ 630
                         250       260       270       280       290       300       310       320
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 334187096  262 ATFPGAPHGRPA----VSGLPYGPPSAQVAP---PLGFPGQMQPPRYGMGPLPNQSMTN---IPTAMGQPGATVPGPSRI 331
Cdd:PHA03379  631 QMFPGSPFSQVAdvmrAGGVPAMQPQYFDLPlqqPISQGAPLAPLRASMGPVPPVPATQpqyFDIPLTEPINQGASAAHF 710
                         330       340       350
                  ....*....|....*....|....*....|....*.
gi 334187096  332 DPNQIPRPGSSSSPTVFETRQSNQANPPPPATSDYV 367
Cdd:PHA03379  711 LPQQPMEGPLVPERWMFQGATLSQSVRPGVAQSQYF 746
PHA02682 PHA02682
ORF080 virion core protein; Provisional
212-347 2.06e-04

ORF080 virion core protein; Provisional


Pssm-ID: 177464 [Multi-domain]  Cd Length: 280  Bit Score: 44.47  E-value: 2.06e-04
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 334187096  212 PGAAVTTPQAPYVRPPSAPYARTPPQPLgshslsgnPPLTPFTAPSMPPPATFPG----APHGRPAVSGLPYGPP----- 282
Cdd:PHA02682   86 PACAAPAPACPACAPAAPAPAVTCPAPA--------PACPPATAPTCPPPAVCPAparpAPACPPSTRQCPPAPPlptpk 157
                          90       100       110       120       130       140
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*
gi 334187096  283 SAQVAPPLGFPGQMQPPRYgmgplPNQSMTNIPTAmgqPGATVPGPSRIdPNQIPRPGSSSSPTV 347
Cdd:PHA02682  158 PAPAAKPIFLHNQLPPPDY-----PAASCPTIETA---PAASPVLEPRI-PDKIIDADNDDKDLI 213
PRK08691 PRK08691
DNA polymerase III subunits gamma and tau; Validated
122-372 2.25e-04

DNA polymerase III subunits gamma and tau; Validated


Pssm-ID: 236333 [Multi-domain]  Cd Length: 709  Bit Score: 45.47  E-value: 2.25e-04
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 334187096  122 PSRQPSFGSRPSMPGGPVAQPAASSSGFPAfgpsgsVAAGPPPGSRPMAFGSPPPVGSGMSMPPSGMIGGPVSNGHQmvg 201
Cdd:PRK08691  360 PLAAASCDANAVIENTELQSPSAQTAEKET------AAKKPQPRPEAETAQTPVQTASAAAMPSEGKTAGPVSNQEN--- 430
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 334187096  202 SGGFPRGTQfPGAAVT---TPQAPYVRPPSAPYARTPPQPLGSHSLSGNPPlTPFTAPSMPPPATFPGAPHGRPAVSGLP 278
Cdd:PRK08691  431 NDVPPWEDA-PDEAQTaagTAQTSAKSIQTASEAETPPENQVSKNKAADNE-TDAPLSEVPSENPIQATPNDEAVETETF 508
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 334187096  279 YGPPSAQVAPPLGFPGQMQPPRygmgplpnqsmtniptamgqPGATVPGP--SRIDPNQIPRPGSSSSPTVFETRQSNQA 356
Cdd:PRK08691  509 AHEAPAEPFYGYGFPDNDCPPE--------------------DGAEIPPPdwEHAAPADTAGGGADEEAEAGGIGGNNTP 568
                         250       260
                  ....*....|....*....|
gi 334187096  357 NPPPPATSD----YVVRDTG 372
Cdd:PRK08691  569 SAPPPEFSTenwaAIVRHFA 588
PHA03379 PHA03379
EBNA-3A; Provisional
117-364 2.48e-04

EBNA-3A; Provisional


Pssm-ID: 223066 [Multi-domain]  Cd Length: 935  Bit Score: 45.43  E-value: 2.48e-04
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 334187096  117 RPTGPPSRQPSFGSRPSMPGGPVAQPAASSSGFPAFGPSGSVAAGPPPGSRPMAfgsPPPVGsgmSMPPSGMigGPVSNG 196
Cdd:PHA03379  410 EPTYGTPRPPVEKPRPEVPQSLETATSHGSAQVPEPPPVHDLEPGPLHDQHSMA---PCPVA---QLPPGPL--QDLEPG 481
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 334187096  197 HQMVGSGGFPRGTQFPgaaVTTPQAPYVRPPSAPYARTPPQPLGSHS---LSGNP---PLTPFTAPSMPPPA----TFPG 266
Cdd:PHA03379  482 DQLPGVVQDGRPACAP---VPAPAGPIVRPWEASLSQVPGVAFAPVMpqpMPVEPvpvPTVALERPVCPAPPliamQGPG 558
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 334187096  267 APHGRPAVSgLPYGPPS---AQVAPPLGFPGQMQPPRYGMGPLPNQSMTNI-PTAMGQPGATVPGPSRIDPNQIPRPGSS 342
Cdd:PHA03379  559 ETSGIVRVR-ERWRPAPwtpNPPRSPSQMSVRDRLARLRAEAQPYQASVEVqPPQLTQVSPQQPMEYPLEPEQQMFPGSP 637
                         250       260
                  ....*....|....*....|..
gi 334187096  343 SSPTVFETRQSNQANPPPPATS 364
Cdd:PHA03379  638 FSQVADVMRAGGVPAMQPQYFD 659
PHA03247 PHA03247
large tegument protein UL36; Provisional
70-182 2.66e-04

large tegument protein UL36; Provisional


Pssm-ID: 223021 [Multi-domain]  Cd Length: 3151  Bit Score: 45.31  E-value: 2.66e-04
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 334187096   70 QQSPSYGAPQRGPSPMSRPGPPAGMARPGGPPPVSQPAGFQSnvplnrPTGPPSRQPsfgSRPSMPGGPVAQPAASSSGF 149
Cdd:PHA03247  404 QTRPAAPVPASVPTPAPTPVPASAPPPPATPLPSAEPGSDDG------PAPPPERQP---PAPATEPAPDDPDDATRKAL 474
                          90       100       110
                  ....*....|....*....|....*....|...
gi 334187096  150 PAFGPSGSvaAGPPPGSRPMAFGSPPPVGSGMS 182
Cdd:PHA03247  475 DALRERRP--PEPPGADLAELLGRHPDTAGTVV 505
Gag_spuma pfam03276
Spumavirus gag protein;
165-337 2.85e-04

Spumavirus gag protein;


Pssm-ID: 460872 [Multi-domain]  Cd Length: 614  Bit Score: 44.74  E-value: 2.85e-04
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 334187096   165 GSRPMAFGSPPPVGSGMSMPPSGMIGGPvsnghqmvgsggfprgtqfpgaAVTTPQAPYVRPPSAPYARTPPQPLGSHSL 244
Cdd:pfam03276  177 EISPGAQGGIPPGASFSGLPSLPAIGGI----------------------HLPAIPGIHARAPPGNIARSLGDDIMPSLG 234
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 334187096   245 SGNPPLTPFTAPSMPPPATFPGAP------HGRPAVSGLPYGPPSAQVAPPLGFPGQMQPPRYGMGPLPNQSMTNIPtam 318
Cdd:pfam03276  235 DAGMPQPRFAFHPGNPFAEAEGHPfaeaegERPRDIPRAPRIDAPSAPAIPAIQPIAPPMIPPIGAPIPIPHGASIP--- 311
                          170
                   ....*....|....*....
gi 334187096   319 GQPGATVPGPSRIDPNQIP 337
Cdd:pfam03276  312 GEHIRNPREEPIRLGREAP 330
PHA03377 PHA03377
EBNA-3C; Provisional
3-366 3.05e-04

EBNA-3C; Provisional


Pssm-ID: 177614 [Multi-domain]  Cd Length: 1000  Bit Score: 45.04  E-value: 3.05e-04
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 334187096    3 APVPPGAPRPNSQQNSGPPNFYPGSQGNSNALADNMQNLSLNRPPPMMPGSGPRPPPPFGQSPQpfPQQSPSYGAPQRGP 82
Cdd:PHA03377  535 KVQDGFQRSGRRQKRATPPKVSPSDRGPPKASPPVMAPPSTGPRVMATPSTGPRDMAPPSTGPR--QQAKCKDGPPASGP 612
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 334187096   83 ----SPMSRPGPPAG-----------MARPGGPPPVS--------------QPAGFQSNVPLNRPTGPPSRQPSFGSRPS 133
Cdd:PHA03377  613 hekqPPSSAPRDMAPsvvrmflrerlLEQSTGPKPKSfwemragrdgsgiqQEPSSRRQPATQSTPPRPSWLPSVFVLPS 692
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 334187096  134 MPGGPvAQPAASSSGFPAFGPSGSVAAGPPPGSRPMA--------FGSPPPvgsgmsmPPSGMIGGPVSNGHQMVGSGGF 205
Cdd:PHA03377  693 VDAGR-AQPSEESHLSSMSPTQPISHEEQPRYEDPDDpldlslhpDQAPPP-------SHQAPYSGHEEPQAQQAPYPGY 764
                         250       260       270       280       290       300       310       320
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 334187096  206 --PRGTQFPGAAVTTPQAPYVRPPS-----APYARTPPQPLGSHSLS-------GNPPLTPFTA-PSMPPPATFPGAPHG 270
Cdd:PHA03377  765 wePRPPQAPYLGYQEPQAQGVQVSSypgyaGPWGLRAQHPRYRHSWAywsqypgHGHPQGPWAPrPPHLPPQWDGSAGHG 844
                         330       340       350       360       370       380       390       400
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 334187096  271 RPAVSGLPygPPSAQVAPPLgfPGQMQPPRYgMGPLPNQSMTNIPTAMGQPGATV-PGPSRIDPNQIPRPGSSSSptvfE 349
Cdd:PHA03377  845 QDQVSQFP--HLQSETGPPR--LQLSQVPQL-PYSQTLVSSSAPSWSSPQPRAPIrPIPTRFPPPPMPLQDSMAV----G 915
                         410
                  ....*....|....*..
gi 334187096  350 TRQSNQANPPPPATSDY 366
Cdd:PHA03377  916 CDSSGTACPSMPFASDY 932
PHA03377 PHA03377
EBNA-3C; Provisional
97-364 3.37e-04

EBNA-3C; Provisional


Pssm-ID: 177614 [Multi-domain]  Cd Length: 1000  Bit Score: 44.66  E-value: 3.37e-04
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 334187096   97 PGGPPPVSQPAGFQSNVPLNRPTGPPsrQPSFGSRPSMPGGPV-------AQPAASSSGFPAFGPSGSVAAGPP---PGS 166
Cdd:PHA03377  395 PNMEPVQQRPVMFVSRVPWRKPRTLP--WPTPKTHPVKRTLVKtsgrsdeAEQAQSTPERPGPSDQPSVPVEPAhltPVE 472
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 334187096  167 RPMAF-----GSPPPVGSGMSMPPSGMIGGP---------------VSNGHQMVGSGGFPRGTQFPGAAVT-TPQAPYVR 225
Cdd:PHA03377  473 HTTVIlhqppQSPPTVAIKPAPPPSRRRRGAcvvydddiievidveTTEEEESVTQPAKPHRKVQDGFQRSgRRQKRATP 552
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 334187096  226 PPSAPYARTPP---QPLGSHSLSGNPPLTPftaPSMPPPATFPGAPHGRpAVSGLPYGPPSA---QVAPPLGFPGQMQPP 299
Cdd:PHA03377  553 PKVSPSDRGPPkasPPVMAPPSTGPRVMAT---PSTGPRDMAPPSTGPR-QQAKCKDGPPASgphEKQPPSSAPRDMAPS 628
                         250       260       270       280       290       300       310
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*..
gi 334187096  300 RYGM-----------GPLPnQSMTNIPTAMGQPGATVPGPSRIDPN-QIPRPGSSSSPTVFETRQSNQANPPPPATS 364
Cdd:PHA03377  629 VVRMflrerlleqstGPKP-KSFWEMRAGRDGSGIQQEPSSRRQPAtQSTPPRPSWLPSVFVLPSVDAGRAQPSEES 704
PABP-1234 TIGR01628
polyadenylate binding protein, human types 1, 2, 3, 4 family; These eukaryotic proteins ...
63-172 4.02e-04

polyadenylate binding protein, human types 1, 2, 3, 4 family; These eukaryotic proteins recognize the poly-A of mRNA and consists of four tandem RNA recognition domains at the N-terminus (rrm: pfam00076) followed by a PABP-specific domain (pfam00658) at the C-terminus. The protein is involved in the transport of mRNA's from the nucleus to the cytoplasm. There are four paralogs in Homo sapiens which are expressed in testis, platelets, broadly expressed and of unknown tissue range.


Pssm-ID: 130689 [Multi-domain]  Cd Length: 562  Bit Score: 44.41  E-value: 4.02e-04
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 334187096    63 QSPQPFPQQsPSYGAPQ---RGP--------------SPMSRPGPPAGMARPGGPPPVSQ---PAGFQSNVPLNRPTGPP 122
Cdd:TIGR01628  384 QLPMGSPMG-GAMGQPPyygQGPqqqfngqplgwprmSMMPTPMGPGGPLRPNGLAPMNAvraPSRNAQNAAQKPPMQPV 462
                           90       100       110       120       130
                   ....*....|....*....|....*....|....*....|....*....|
gi 334187096   123 SRQPSFGSRPSMPGGPVAQPAASSSGfpAFGPSGSVAAGPPPGSRPMAFG 172
Cdd:TIGR01628  463 MYPPNYQSLPLSQDLPQPQSTASQGG--QNKKLAQVLASATPQMQKQVLG 510
PRK14959 PRK14959
DNA polymerase III subunits gamma and tau; Provisional
67-194 4.81e-04

DNA polymerase III subunits gamma and tau; Provisional


Pssm-ID: 184923 [Multi-domain]  Cd Length: 624  Bit Score: 44.29  E-value: 4.81e-04
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 334187096   67 PFPQQSPSyGAPQRGPSPMSRPGPPAGMARPGGPPPVSQPAGFQSNVPLNRPTG-PPSRQPSFGSRPSMPGGPVAqPAAS 145
Cdd:PRK14959  367 PVESLRPS-GGGASAPSGSAAEGPASGGAATIPTPGTQGPQGTAPAAGMTPSSAaPATPAPSAAPSPRVPWDDAP-PAPP 444
                          90       100       110       120       130
                  ....*....|....*....|....*....|....*....|....*....|..
gi 334187096  146 SSGFPAFGPSGSVAAGPPPGsRPMAFGSP---PPVGSGMSMPPSGMIGGPVS 194
Cdd:PRK14959  445 RSGIPPRPAPRMPEASPVPG-APDSVASAsdaPPTLGDPSDTAEHTPSGPRT 495
PHA03307 PHA03307
transcriptional regulator ICP4; Provisional
64-185 7.04e-04

transcriptional regulator ICP4; Provisional


Pssm-ID: 223039 [Multi-domain]  Cd Length: 1352  Bit Score: 44.01  E-value: 7.04e-04
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 334187096   64 SPQPFPQQSPSYGAPQRGPSPMSRPGPPAGMARPggPPPVSQPAGFQSNVPLN-RPTGPPSRQPSFGSRPSMPGGPVAQP 142
Cdd:PHA03307  812 ASRTASKRKSRSHTPDGGSESSGPARPPGAAARP--PPARSSESSKSKPAAAGgRARGKNGRRRPRPPEPRARPGAAAPP 889
                          90       100       110       120
                  ....*....|....*....|....*....|....*....|...
gi 334187096  143 AASSSGFPAFGPSGSVAAGPPPGSRPMAFGSPPPVGSGMSMPP 185
Cdd:PHA03307  890 KAAAAAPPAGAPAPRPRPAPRVKLGPMPPGGPDPRGGFRRVPP 932
DUF5585 pfam17823
Family of unknown function (DUF5585); This is a family of unknown function found in chordata.
83-361 9.72e-04

Family of unknown function (DUF5585); This is a family of unknown function found in chordata.


Pssm-ID: 465521 [Multi-domain]  Cd Length: 506  Bit Score: 43.02  E-value: 9.72e-04
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 334187096    83 SPMSRPGPPAGMA-RPGGPPPVSQPAGFQSNVPlNRPTGPPSRQPSF--GSRPSMPG--GPVAQPAASSSGFPAFGPSGS 157
Cdd:pfam17823   99 EPATREGAADGAAsRALAAAASSSPSSAAQSLP-AAIAALPSEAFSAprAAACRANAsaAPRAAIAAASAPHAASPAPRT 177
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 334187096   158 VAAGPPPGSRPMAFGSPPPVGSGMS----MPPSGMIGGPVSNGHQMVGSGGFPRGTQFPGAAVTTPQAPYVRPPS----A 229
Cdd:pfam17823  178 AASSTTAASSTTAASSAPTTAASSApatlTPARGISTAATATGHPAAGTALAAVGNSSPAAGTVTAAVGTVTPAAlatlA 257
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 334187096   230 PYARTPPQPLGSHSLSGNPPLTPFTAPSMPPPATFPG-APHGRPAVSGlpygpPSAQVA---PPLGFPGQMQPPRYGMGP 305
Cdd:pfam17823  258 AAAGTVASAAGTINMGDPHARRLSPAKHMPSDTMARNpAAPMGAQAQG-----PIIQVStdqPVHNTAGEPTPSPSNTTL 332
                          250       260       270       280       290
                   ....*....|....*....|....*....|....*....|....*....|....*.
gi 334187096   306 LPNQSMTNIPTAMGQPGATVPGPSRIDPNQIPRPGSSSSPTVFETRQSNQANPPPP 361
Cdd:pfam17823  333 EPNTPKSVASTNLAVVTTTKAQAKEPSASPVPVLHTSMIPEVEATSPTTQPSPLLP 388
Herpes_BLLF1 pfam05109
Herpes virus major outer envelope glycoprotein (BLLF1); This family consists of the BLLF1 ...
67-364 1.02e-03

Herpes virus major outer envelope glycoprotein (BLLF1); This family consists of the BLLF1 viral late glycoprotein, also termed gp350/220. It is the most abundantly expressed glycoprotein in the viral envelope of the Herpesviruses and is the major antigen responsible for stimulating the production of neutralising antibodies in vivo.


Pssm-ID: 282904 [Multi-domain]  Cd Length: 886  Bit Score: 43.37  E-value: 1.02e-03
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 334187096    67 PFPQQSPSYGAPQRGPSPMSRPGPPAGM--ARPGGPPPVSQPAGFQSNVP-LNRPTGPPSRQPSFGSRPSmPGGPVAQPA 143
Cdd:pfam05109  455 PTNLTAPASTGPTVSTADVTSPTPAGTTsgASPVTPSPSPRDNGTESKAPdMTSPTSAVTTPTPNATSPT-PAVTTPTPN 533
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 334187096   144 ASSSGFPAFGPSGSVAAGPPPGSRPM-AFGSPPPVGSGMSM---PPSGMIGGPVSNGHQmvgsggfprgtqfPGAAVTTP 219
Cdd:pfam05109  534 ATSPTLGKTSPTSAVTTPTPNATSPTpAVTTPTPNATIPTLgktSPTSAVTTPTPNATS-------------PTVGETSP 600
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 334187096   220 QAPYVR-----PPSAPYARTPPQPLGSHSLSGNPPLTPFTAPSMPPpatfpgaphgRPAVSGLPYGPPSAQVAPplgfpg 294
Cdd:pfam05109  601 QANTTNhtlggTSSTPVVTSPPKNATSAVTTGQHNITSSSTSSMSL----------RPSSISETLSPSTSDNST------ 664
                          250       260       270       280       290       300       310
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|...
gi 334187096   295 QMQPPRYGMGPLPNQSMTNI---PTAMGQPGATVPGPSRIDPNQIPRPGSSSSPTVFETRQSNQANPPPPATS 364
Cdd:pfam05109  665 SHMPLLTSAHPTGGENITQVtpaSTSTHHVSTSSPAPRPGTTSQASGPGNSSTSTKPGEVNVTKGTPPKNATS 737
PRK14951 PRK14951
DNA polymerase III subunits gamma and tau; Provisional
212-333 1.06e-03

DNA polymerase III subunits gamma and tau; Provisional


Pssm-ID: 237865 [Multi-domain]  Cd Length: 618  Bit Score: 43.16  E-value: 1.06e-03
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 334187096  212 PGAAVTTPQ----APYVRPPSAPYARTPPQPLGSHSLSGNPPLTPFTAPSMPPPATFPGAPHGRPAVSGLPYGPPSAQVA 287
Cdd:PRK14951  366 PAAAAEAAApaekKTPARPEAAAPAAAPVAQAAAAPAPAAAPAAAASAPAAPPAAAPPAPVAAPAAAAPAAAPAAAPAAV 445
                          90       100       110       120
                  ....*....|....*....|....*....|....*....|....*.
gi 334187096  288 PPLGFPGQMQPPRYGMGPLPNQSMTNIPTAmGQPGATVPGPSRIDP 333
Cdd:PRK14951  446 ALAPAPPAQAAPETVAIPVRVAPEPAVASA-APAPAAAPAAARLTP 490
SepH NF040712
septation protein SepH; Septation protein H (SepH) was firstly characterized in Streptomyces ...
126-274 1.14e-03

septation protein SepH; Septation protein H (SepH) was firstly characterized in Streptomyces venezuelae, and homologs were identified in Mycobacterium smegmatis. SepH contains a N-terminal DUF3071 domain and a conserved C-terminal region. It binds directly to cell division protein FtsZ to stimulate the assembly of FtsZ protofilaments.


Pssm-ID: 468676 [Multi-domain]  Cd Length: 346  Bit Score: 42.45  E-value: 1.14e-03
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 334187096  126 PSFGsRPSMPGGPVAQPAASSSGFPAFGPSGSVAAGPPPGSRPmafgSPPPVGSGMSMPPSGmigGPVSNGHQMVGSGGF 205
Cdd:NF040712  190 PDFG-RPLRPLATVPRLAREPADARPEEVEPAPAAEGAPATDS----DPAEAGTPDDLASAR---RRRAGVEQPEDEPVG 261
                          90       100       110       120       130       140       150
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|..
gi 334187096  206 PRGTQFPGAAVTTPQAPYVRPPSAPYARTPPQPLGSHSLSGNPPLTPF---TAPSMPPPATFPGAPHGRPAV 274
Cdd:NF040712  262 PGAAPAAEPDEATRDAGEPPAPGAAETPEAAEPPAPAPAAPAAPAAPEaeePARPEPPPAPKPKRRRRRASV 333
PRK14951 PRK14951
DNA polymerase III subunits gamma and tau; Provisional
151-326 1.19e-03

DNA polymerase III subunits gamma and tau; Provisional


Pssm-ID: 237865 [Multi-domain]  Cd Length: 618  Bit Score: 42.78  E-value: 1.19e-03
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 334187096  151 AFGPSGSVAAGPPPGSRPMAFGSPPPVGSGmsmppsgmiggpvsnghqmvgsggfPRGTQFPGAAVTTPQAPYVRPPSAP 230
Cdd:PRK14951  363 AFKPAAAAEAAAPAEKKTPARPEAAAPAAA-------------------------PVAQAAAAPAPAAAPAAAASAPAAP 417
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 334187096  231 YARTPPQPLGShslsgnPPLTPFTAPSMPPPATFPGAPHGRPAVSGLPYGPPsAQVAPPlgfPGQMQPPRYGMGPLPNQS 310
Cdd:PRK14951  418 PAAAPPAPVAA------PAAAAPAAAPAAAPAAVALAPAPPAQAAPETVAIP-VRVAPE---PAVASAAPAPAAAPAAAR 487
                         170
                  ....*....|....*..
gi 334187096  311 MTniPTAMGQP-GATVP 326
Cdd:PRK14951  488 LT--PTEEGDVwHATVQ 502
PRK07994 PRK07994
DNA polymerase III subunits gamma and tau; Validated
249-373 1.37e-03

DNA polymerase III subunits gamma and tau; Validated


Pssm-ID: 236138 [Multi-domain]  Cd Length: 647  Bit Score: 42.55  E-value: 1.37e-03
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 334187096  249 PLTPFTAPSMPPPATFPGAPHGRPAVSGLPYGPPSAQVAPPLGFPGQMQPPRYGMGPLPNQSMTNIPTAMGQPGATVPGP 328
Cdd:PRK07994  361 PAAPLPEPEVPPQSAAPAASAQATAAPTAAVAPPQAPAVPPPPASAPQQAPAVPLPETTSQLLAARQQLQRAQGATKAKK 440
                          90       100       110       120
                  ....*....|....*....|....*....|....*....|....*
gi 334187096  329 SRIDPNQIPRPGSSSSPTVFETRQSNQANPPPPATSDYVVRDTGN 373
Cdd:PRK07994  441 SEPAAASRARPVNSALERLASVRPAPSALEKAPAKKEAYRWKATN 485
EP400_N pfam15790
E1A-binding protein p400, N-terminal; EP400_N is a family of eukaryote proteins. the exact ...
65-252 1.51e-03

E1A-binding protein p400, N-terminal; EP400_N is a family of eukaryote proteins. the exact function of this domain is not known. This family is largely low-complexity residues.


Pssm-ID: 434938 [Multi-domain]  Cd Length: 489  Bit Score: 42.29  E-value: 1.51e-03
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 334187096    65 PQPFPQQSPSygapqrgpspMSRPGPPAGMARPGGPPPV-----SQPAGFQSNVPLNRP-----------------TGPP 122
Cdd:pfam15790  129 PQQVQPQSPT----------QHSPVPLQGVQRPGAPGTGlgvcgQSPTRFVDASMLVRQislgspsggghfvyqdgTGLA 198
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 334187096   123 SRQPSFGS-RPSMPGGP-------VAQPAASSSG-FPAFGPSGSVAAgpppGSRPMAFGSPPPVGSGmSMPP--SGMIGG 191
Cdd:pfam15790  199 QIAPGAGQvQLASPGTPgsvrerrLSQPHSQTGGtIHHLGPQSPAAA----GAGLQTLGSPGHITTS-NLPPqiSSIIQG 273
                          170       180       190       200       210       220
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*
gi 334187096   192 pvsnghQMVGSGGFPRGTQFPGAAVTTPQA----PYVRPPSAPYARTPPQPLGShslsgnPPLTP 252
Cdd:pfam15790  274 ------QLARPLGFEKTAQVVVAGAGGPAAsfgiPSSIPPTSPSRTSPPPGLSS------NPLTS 326
PRK07764 PRK07764
DNA polymerase III subunits gamma and tau; Validated
118-274 1.63e-03

DNA polymerase III subunits gamma and tau; Validated


Pssm-ID: 236090 [Multi-domain]  Cd Length: 824  Bit Score: 42.67  E-value: 1.63e-03
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 334187096  118 PTGPPSRQPSFGSRPSMPGGPVAQPAASSSGfPAFGPSGSVAAGPPPGSRPMAFGSPPPVGSGMSMPPSgmiggpvsngh 197
Cdd:PRK07764  386 GVAGGAGAPAAAAPSAAAAAPAAAPAPAAAA-PAAAAAPAPAAAPQPAPAPAPAPAPPSPAGNAPAGGA----------- 453
                          90       100       110       120       130       140       150
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*..
gi 334187096  198 qmvgsggfprGTQFPGAAVTTPQAPYVRPPSAPYARTPPQPLGSHSlsgnppltPFTAPSMPPPATFPGAPHGRPAV 274
Cdd:PRK07764  454 ----------PSPPPAAAPSAQPAPAPAAAPEPTAAPAPAPPAAPA--------PAAAPAAPAAPAAPAGADDAATL 512
PRK12323 PRK12323
DNA polymerase III subunit gamma/tau;
167-377 2.25e-03

DNA polymerase III subunit gamma/tau;


Pssm-ID: 237057 [Multi-domain]  Cd Length: 700  Bit Score: 42.17  E-value: 2.25e-03
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 334187096  167 RPMAFgspPPVGSGMSMPPSGMIGGPVSNGhqmvgsggfPRGTQFPGAAVTTPQAPYVRPPSAPYARTPPQPLGSHSLSG 246
Cdd:PRK12323  359 RMLAF---RPGQSGGGAGPATAAAAPVAQP---------APAAAAPAAAAPAPAAPPAAPAAAPAAAAAARAVAAAPARR 426
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 334187096  247 NPPLTPFTAPSMPPPATFPGAPhgRPAvsglpygpPSAQVAPPLGFPGQMQPPRYGMGPLPNQSMTNIPTAMGQPGAT-- 324
Cdd:PRK12323  427 SPAPEALAAARQASARGPGGAP--APA--------PAPAAAPAAAARPAAAGPRPVAAAAAAAPARAAPAAAPAPADDdp 496
                         170       180       190       200       210       220       230
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|.
gi 334187096  325 -----------VPGPSRIDP-------NQIPRPGSSSSPTVFETRQSNQANPPPPATSDYVVRDTGNCSPR 377
Cdd:PRK12323  497 ppweelppefaSPAPAQPDAapagwvaESIPDPATADPDDAFETLAPAPAAAPAPRAAAATEPVVAPRPPR 567
PRK07764 PRK07764
DNA polymerase III subunits gamma and tau; Validated
132-277 2.98e-03

DNA polymerase III subunits gamma and tau; Validated


Pssm-ID: 236090 [Multi-domain]  Cd Length: 824  Bit Score: 41.51  E-value: 2.98e-03
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 334187096  132 PSMPGGPVAQPAASSSGFPAFGPSGSVAAGPPPGSRPmafGSPPPVGSGMSMPPSGMIGGPVSNGhqmvgsggfPRGTQF 211
Cdd:PRK07764  390 GAGAPAAAAPSAAAAAPAAAPAPAAAAPAAAAAPAPA---AAPQPAPAPAPAPAPPSPAGNAPAG---------GAPSPP 457
                          90       100       110       120       130       140
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*.
gi 334187096  212 PGAAVTTPQAPYVRPPSAPYARTPPQPLGSHSlsgnppltPFTAPSMPPPatfPGAPHGRPAVSGL 277
Cdd:PRK07764  458 PAAAPSAQPAPAPAAAPEPTAAPAPAPPAAPA--------PAAAPAAPAA---PAAPAGADDAATL 512
PRK14951 PRK14951
DNA polymerase III subunits gamma and tau; Provisional
73-239 3.25e-03

DNA polymerase III subunits gamma and tau; Provisional


Pssm-ID: 237865 [Multi-domain]  Cd Length: 618  Bit Score: 41.62  E-value: 3.25e-03
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 334187096   73 PSYGAPQRGPSPMSRPgppagmARPGGPPPVSQPAGFQSNVPLNRPTGPPSRQPSFGSRPSMPGGPVAQPAASSSgfPAF 152
Cdd:PRK14951  366 PAAAAEAAAPAEKKTP------ARPEAAAPAAAPVAQAAAAPAPAAAPAAAASAPAAPPAAAPPAPVAAPAAAAP--AAA 437
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 334187096  153 GPSGSVAAGPPPgsrpmaFGSPPPVGSGMSMPPSgmiggpvsnghqmvgsggfprgTQFPGAAVTTPQAPYVRPPSAPYA 232
Cdd:PRK14951  438 PAAAPAAVALAP------APPAQAAPETVAIPVR----------------------VAPEPAVASAAPAPAAAPAAARLT 489

                  ....*..
gi 334187096  233 RTPPQPL 239
Cdd:PRK14951  490 PTEEGDV 496
PRK07764 PRK07764
DNA polymerase III subunits gamma and tau; Validated
233-370 3.54e-03

DNA polymerase III subunits gamma and tau; Validated


Pssm-ID: 236090 [Multi-domain]  Cd Length: 824  Bit Score: 41.51  E-value: 3.54e-03
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 334187096  233 RTPPQPLGSHSLSGNPPLTPFTAPSMPPPATFPGAPHGRPAVSGLPygPPSAQVAPPLGFPGQMQPPRYGMGPLPNQSmt 312
Cdd:PRK07764  384 RLGVAGGAGAPAAAAPSAAAAAPAAAPAPAAAAPAAAAAPAPAAAP--QPAPAPAPAPAPPSPAGNAPAGGAPSPPPA-- 459
                          90       100       110       120       130
                  ....*....|....*....|....*....|....*....|....*....|....*...
gi 334187096  313 niPTAMGQPGATVPGPSRIDPNQIPRPGSSSSPTVfETRQSNQANPPPPATSDYVVRD 370
Cdd:PRK07764  460 --AAPSAQPAPAPAAAPEPTAAPAPAPPAAPAPAA-APAAPAAPAAPAGADDAATLRE 514
PRK10263 PRK10263
DNA translocase FtsK; Provisional
119-361 3.71e-03

DNA translocase FtsK; Provisional


Pssm-ID: 236669 [Multi-domain]  Cd Length: 1355  Bit Score: 41.61  E-value: 3.71e-03
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 334187096  119 TGPPSRQPSFGS-RPSMPGGPVAQPAASSSGFPAFGPSGSVAAGPPPGSRPMAFGSPPPVGSGMSMPPsgmIGGPvSNGH 197
Cdd:PRK10263  295 SGNRATQPEYDEyDPLLNGAPITEPVAVAAAATTATQSWAAPVEPVTQTPPVASVDVPPAQPTVAWQP---VPGP-QTGE 370
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 334187096  198 QMVGSGgfPRGTQfPGAAVTTPQAPYVRPPSAPYartPPQPLGSHSLSGNPPLTPFTAPSMPPPATFPGaphgrpavsgl 277
Cdd:PRK10263  371 PVIAPA--PEGYP-QQSQYAQPAVQYNEPLQQPV---QPQQPYYAPAAEQPAQQPYYAPAPEQPAQQPY----------- 433
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 334187096  278 pygPPSAQVAPPLGFPGQMQPPRYGMGPLPNQsmtniptamgQPGATVPGPSRIDPNQIPRPGSSSSPTVFETRQSNQAN 357
Cdd:PRK10263  434 ---YAPAPEQPVAGNAWQAEEQQSTFAPQSTY----------QTEQTYQQPAAQEPLYQQPQPVEQQPVVEPEPVVEETK 500

                  ....
gi 334187096  358 PPPP 361
Cdd:PRK10263  501 PARP 504
PRK07764 PRK07764
DNA polymerase III subunits gamma and tau; Validated
135-289 4.33e-03

DNA polymerase III subunits gamma and tau; Validated


Pssm-ID: 236090 [Multi-domain]  Cd Length: 824  Bit Score: 41.12  E-value: 4.33e-03
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 334187096  135 PGGPVAQPAASSSGFPAFGPSGSVAAGPPPGSRPMAFGSPPPVGSGmsmppsgmiggpvsnghqmvgsggfprgTQFPGA 214
Cdd:PRK07764  371 ERGLLARLERLERRLGVAGGAGAPAAAAPSAAAAAPAAAPAPAAAA----------------------------PAAAAA 422
                          90       100       110       120       130       140       150
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*
gi 334187096  215 AVTTPQAPYVRPPSAPYARTPPQPLGSHSLSGNPPLTPFTAPSMPPPATFPGAPHGRPAVSGLPYGPPSAQVAPP 289
Cdd:PRK07764  423 PAPAAAPQPAPAPAPAPAPPSPAGNAPAGGAPSPPPAAAPSAQPAPAPAAAPEPTAAPAPAPPAAPAPAAAPAAP 497
Pro-rich pfam15240
Proline-rich protein; This family includes several eukaryotic proline-rich proteins.
219-345 5.25e-03

Proline-rich protein; This family includes several eukaryotic proline-rich proteins.


Pssm-ID: 464580 [Multi-domain]  Cd Length: 167  Bit Score: 38.87  E-value: 5.25e-03
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 334187096   219 PQAPYVRPPSAPYARTPPQPLGSHSLSGNPPLTpftAPSMPPPATFPGAPHGRPAVSGLPYGPPSAQVAPPLGFPGQMQP 298
Cdd:pfam15240   39 SQQGGQGPQGPPPGGFPPQPPASDDPPGPPPPG---GPQQPPPQGGKQKPQGPPPQGGPRPPPGKPQGPPPQGGNQQQGP 115
                           90       100       110       120       130
                   ....*....|....*....|....*....|....*....|....*....|
gi 334187096   299 PRYGMGPLPNQSMTNIPTAMG---QPGATVPGPSRIDPNQIPRPGSSSSP 345
Cdd:pfam15240  116 PPPGKPQGPPPQGGGPPPQGGnqqGPPPPPPGNPQGPPQRPPQPGNPQGP 165
PRK12727 PRK12727
flagellar biosynthesis protein FlhF;
4-177 5.41e-03

flagellar biosynthesis protein FlhF;


Pssm-ID: 237182 [Multi-domain]  Cd Length: 559  Bit Score: 40.74  E-value: 5.41e-03
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 334187096    4 PVPPGAPRPNSQQNSGPPNFYPGSQGNS-NALADNMQNLSLNRPPPMMPGSGPRPPPPFGQSPQPFP---QQSPSYGAPQ 79
Cdd:PRK12727   63 PATAAAPAPAPQAPTKPAAPVHAPLKLSaNANMSQRQRVASAAEDMIAAMALRQPVSVPRQAPAAAPvraASIPSPAAQA 142
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 334187096   80 RGPSPMSRPGPPAGMARPGGPPPVSqpAGFQSNVPLNRPTGPPSRQPSFGSRPSMPG-GPVAQPAASSSGFPAFGPSGSV 158
Cdd:PRK12727  143 LAHAAAVRTAPRQEHALSAVPEQLF--ADFLTTAPVPRAPVQAPVVAAPAPVPAIAAaLAAHAAYAQDDDEQLDDDGFDL 220
                         170
                  ....*....|....*....
gi 334187096  159 AAGPPPGSRPMAFgsPPPV 177
Cdd:PRK12727  221 DDALPQILPPAAL--PPIV 237
PRK14951 PRK14951
DNA polymerase III subunits gamma and tau; Provisional
64-178 5.57e-03

DNA polymerase III subunits gamma and tau; Provisional


Pssm-ID: 237865 [Multi-domain]  Cd Length: 618  Bit Score: 40.85  E-value: 5.57e-03
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 334187096   64 SPQPFPQQSPSYGAPQRGPSPMSRPGPPAGMARPGGPPPVSQPAGFQSNVPLNRPTGPPSRQPSFGSRPSMPGGPVAQPA 143
Cdd:PRK14951  380 TPARPEAAAPAAAPVAQAAAAPAPAAAPAAAASAPAAPPAAAPPAPVAAPAAAAPAAAPAAAPAAVALAPAPPAQAAPET 459
                          90       100       110
                  ....*....|....*....|....*....|....*
gi 334187096  144 ASSSGFPAFGPSGSVAAGPPPGSRPMAFGSPPPVG 178
Cdd:PRK14951  460 VAIPVRVAPEPAVASAAPAPAAAPAAARLTPTEEG 494
Cytadhesin_P30 pfam07271
Cytadhesin P30/P32; This family consists of several Mycoplasma species specific Cytadhesin P32 ...
63-187 5.92e-03

Cytadhesin P30/P32; This family consists of several Mycoplasma species specific Cytadhesin P32 and P30 proteins. P30 has been found to be membrane associated and localized on the tip organelle. It is thought that it is important in cytadherence and virulence. The N-terminus contains two predicted transmembrane helices followed by a long region of a short 6 residue proline rich repeat.


Pssm-ID: 429374 [Multi-domain]  Cd Length: 275  Bit Score: 39.95  E-value: 5.92e-03
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 334187096    63 QSPQPFPQQSPSYGAPQRGPSPMSRPGPP--AGMA-RPGGP-----PPVSQPAGFQSNVPLNRPTGPPSRQPSFGSRPSM 134
Cdd:pfam07271  144 VANQPQMGINQPQINPQFGPNPQQRIGFPmqPNMGmRPGFNqmpgmPPNQMRPGFNQMPGMPPRPGFPNPMPNMQPRPGF 223
                           90       100       110       120       130
                   ....*....|....*....|....*....|....*....|....*....|...
gi 334187096   135 PGGPVAQPAASSSGFPAFGPSGSVAAGPPPGSRPMAFGSPPPVGsgmsMPPSG 187
Cdd:pfam07271  224 RPQPGPMGNRPGGGFPHPGTPMGPNRMPNPGMNQRPGMAPPRPG----FPPQN 272
dnaA PRK14086
chromosomal replication initiator protein DnaA;
61-271 7.97e-03

chromosomal replication initiator protein DnaA;


Pssm-ID: 237605 [Multi-domain]  Cd Length: 617  Bit Score: 40.19  E-value: 7.97e-03
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 334187096   61 FGQSPQpfpqqspsygAPQRGPSPMSRPGPPAGMARPGGPPPVSQpagfqsnVPLNRPTGP---PSRQPSFGSRPSMPGG 137
Cdd:PRK14086  104 RTSEPE----------LPRPGRRPYEGYGGPRADDRPPGLPRQDQ-------LPTARPAYPayqQRPEPGAWPRAADDYG 166
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 334187096  138 PVAQPAasssGFPAFGPSGSVAAGPPPGSRpmaFGSPPPVGSGMSMPPSgmiggpvsnghqmvGSGGFPRGTQFPGAAVT 217
Cdd:PRK14086  167 WQQQRL----GFPPRAPYASPASYAPEQER---DREPYDAGRPEYDQRR--------------RDYDHPRPDWDRPRRDR 225
                         170       180       190       200       210
                  ....*....|....*....|....*....|....*....|....*....|....*.
gi 334187096  218 TPQaPYVRPPSAPYARTPPQPLGSHSLSGNP--PLTPFTAPSMPPPATFPGAPHGR 271
Cdd:PRK14086  226 TDR-PEPPPGAGHVHRGGPGPPERDDAPVVPirPSAPGPLAAQPAPAPGPGEPTAR 280
 
Blast search parameters
Data Source: Precalculated data, version = cdd.v.3.21
Preset Options:Database: CDSEARCH/cdd   Low complexity filter: no  Composition Based Adjustment: yes   E-value threshold: 0.01

References:

  • Wang J et al. (2023), "The conserved domain database in 2023", Nucleic Acids Res.51(D)384-8.
  • Lu S et al. (2020), "The conserved domain database in 2020", Nucleic Acids Res.48(D)265-8.
  • Marchler-Bauer A et al. (2017), "CDD/SPARCLE: functional classification of proteins via subfamily domain architectures.", Nucleic Acids Res.45(D)200-3.
Help | Disclaimer | Write to the Help Desk
NCBI | NLM | NIH