NCBI Home Page NCBI Site Search page NCBI Guide that lists and describes the NCBI resources
Conserved domains on  [gi|568987261|ref|XP_006518878|]
View 

protein transport protein Sec24C isoform X1 [Mus musculus]

Protein Classification

SEC24 family transport protein( domain architecture ID 1001573)

SEC24 family transport protein is a component of the coat protein complex II (COPII) which promotes the formation of transport vesicles from the endoplasmic reticulum (ER)

Graphical summary

 Zoom to residue level

show extra options »

Show site features     Horizontal zoom: ×

List of domain hits

Name Accession Description Interval E-value
COG5028 super family cl34873
Vesicle coat complex COPII, subunit SEC24/subunit SFB2/subunit SFB3 [Intracellular trafficking ...
181-1117 2.30e-166

Vesicle coat complex COPII, subunit SEC24/subunit SFB2/subunit SFB3 [Intracellular trafficking and secretion];


The actual alignment was detected with superfamily member COG5028:

Pssm-ID: 227361 [Multi-domain]  Cd Length: 861  Bit Score: 513.57  E-value: 2.30e-166
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 568987261  181 SYPQSQAPPLSQAQGHPGVQPPLRSAPPLAS--SFTSPASGGPQMPsmtglLPPGQgfgslpvNQANHvssppaPALPPG 258
Cdd:COG5028     2 SQHKKGVYPQAQSQVHTGAASSKKSARPHRAyaNFSAGQMGMPPYT-----TPPLQ-------QQSRR------QIDQAA 63
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 568987261  259 TQMTgppvpppppmhspqQPGYQLQQNGSFGPARGPQPNYESPYPGAPTFGsqpgppqpLPPKRLDPDAiPSPQLNELPP 338
Cdd:COG5028    64 TAMH--------------NTGANNPAPSVMSPAFQSQQKFSSPYGGSMADG--------TAPKPTNPLV-PVDLFEDQPP 120
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 568987261  339 QqKTRHRIDPdaipspiqvieddrnnrgsepfvtgvRGQVPPLvTTNFLVKDQGNASPRYIRCTSYNIPCTSDMAKQAQV 418
Cdd:COG5028   121 P-ISDLFLPP--------------------------PPIVPPL-TTNFVGSEQSNCSPKYVRSTMYAIPETNDLLKKSKI 172
                         250       260       270       280       290       300       310       320
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 568987261  419 PLAAVIKPLARLPPEEASPYVVDHGEsgPLRCNRCKAYMCPLMTFIEGGRRFQCSFCSCVNDVPPQYFQHLDHTGKRVDA 498
Cdd:COG5028   173 PFGLVIRPFLELYPEEDPVPLVEDGS--IVRCRRCRSYINPFVQFIEQGRKWRCNICRSKNDVPEGFDNPSGPNDPRSDR 250
                         330       340       350       360       370       380       390       400
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 568987261  499 YDRPELSLGSYEFLATVDYckNNKFPSPPAFIFMIDVSYNAIRTGLVRLLCEELKSLLDYLPREGGAeesaIRVGFVTYN 578
Cdd:COG5028   251 YSRPELKSGVVDFLAPKEY--SLRQPPPPVYVFLIDVSFEAIKNGLVKAAIRAILENLDQIPNFDPR----TKIAIICFD 324
                         410       420       430       440       450       460       470       480
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 568987261  579 KVLHFYNVKSSLaQPQMMVVSDVADMFVPLLDG-FLVNVSESRAVITSLLDQIPEMFADTRETETVFAPviqagmeALKA 657
Cdd:COG5028   325 SSLHFFKLSPDL-DEQMLIVSDLDEPFLPFPSGlFVLPLKSCKQIIETLLDRVPRIFQDNKSPKNALGP-------ALKA 396
                         490       500       510       520       530       540       550       560
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 568987261  658 A-----ECAGKLFLFHTSLPIAeAPGKLKNRDDrklintdKEKTLFQPQTGTYQTLAKECVAQGCCVDLFLFPNQYVDVA 732
Cdd:COG5028   397 AksligGTGGKIIVFLSTLPNM-GIGKLQLRED-------KESSLLSCKDSFYKEFAIECSKVGISVDLFLTSEDYIDVA 468
                         570       580       590       600       610       620       630       640
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 568987261  733 TLSVVPQLTGGSVYKYACFQVE--NDQERFLSDLRRDVQKVVGFDAVMRVRTSTGIRAVDFFGAFYMSNTTDVELAGLDG 810
Cdd:COG5028   469 TLSHLCRYTGGQTYFYPNFSATrpNDATKLANDLVSHLSMEIGYEAVMRVRCSTGLRVSSFYGNFFNRSSDLCAFSTMPR 548
                         650       660       670       680       690       700       710       720
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 568987261  811 DKTVTVEFKHDDRLNEEnGALLQCALLYTSCAGQRRLRIHNLALNCCTQLADLYRNCETDTLINYMAKFAYRAVLNSPVK 890
Cdd:COG5028   549 DTSLLVEFSIDEKLMTS-DVYFQVALLYTLNDGERRIRVVNLSLPTSSSIREVYASADQLAIACILAKKASTKALNSSLK 627
                         730       740       750       760       770       780       790       800
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 568987261  891 TVRDTLITQCAQILACYRKNCASPSSAGQLILPECMKLLPVYLNCVLKSDVLQPGAeVTTDDRAYVRQLVSSMDVAETNV 970
Cdd:COG5028   628 EARVLINKSMVDILKAYKKELVKSNTSTQLPLPANLKLLPLLMLALLKSSAFRSGS-TPSDIRISALNRLTSLPLKQLMR 706
                         810       820       830       840       850       860       870       880
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 568987261  971 FFYPRLLPLVRTKSPLDSTAE-----PPAVRASEERLSSGDIYLLENGLNLFVWVGASVQQGVVQSLFNVSSFSQITSGL 1045
Cdd:COG5028   707 NIYPTLYALHDMPIEAGLPDEgllvlPSPINATSSLLESGGLYLIDTGQKIFLWFGKDAVPSLLQDLFGVDSLSDIPSGK 786
                         890       900       910       920       930       940       950
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*.
gi 568987261 1046 SVLPVLDNPLSKKVRGLIDSLRaQRMRYMKLIVVKQED----KLEMLFKHFLVEDKSLsGGASYVDFLCHMHKEIR 1117
Cdd:COG5028   787 FTLPPTGNEFNERVRNIIGELR-SVNDDSTLPLVLVRGggdpSLRLWFFSTLVEDKTL-NIPSYLDYLQILHEKIK 860
Atrophin-1 super family cl38111
Atrophin-1 family; Atrophin-1 is the protein product of the dentatorubral-pallidoluysian ...
25-232 2.16e-08

Atrophin-1 family; Atrophin-1 is the protein product of the dentatorubral-pallidoluysian atrophy (DRPLA) gene. DRPLA OMIM:125370 is a progressive neurodegenerative disorder. It is caused by the expansion of a CAG repeat in the DRPLA gene on chromosome 12p. This results in an extended polyglutamine region in atrophin-1, that is thought to confer toxicity to the protein, possibly through altering its interactions with other proteins. The expansion of a CAG repeat is also the underlying defect in six other neurodegenerative disorders, including Huntington's disease. One interaction of expanded polyglutamine repeats that is thought to be pathogenic is that with the short glutamine repeat in the transcriptional coactivator CREB binding protein, CBP. This interaction draws CBP away from its usual nuclear location to the expanded polyglutamine repeat protein aggregates that are characteriztic of the polyglutamine neurodegenerative disorders. This interferes with CBP-mediated transcription and causes cytotoxicity.


The actual alignment was detected with superfamily member pfam03154:

Pssm-ID: 460830 [Multi-domain]  Cd Length: 991  Bit Score: 58.63  E-value: 2.16e-08
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 568987261    25 QSSYGGQPGPAAPATPYGAYNGPVPGYQQAPPQGVP--RAPPSSGAPPAS------------AAQVPCGQTTYGQFGQG- 89
Cdd:pfam03154  177 QSGAASPPSPPPPGTTQAATAGPTPSAPSVPPQGSPatSQPPNQTQSTAAphtliqqtptlhPQRLPSPHPPLQPMTQPp 256
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 568987261    90 -DIQNGPSST------AQMQRVPGSQQFGPPLAPvvsQPAVLQPYGPPPTSTQvtaqlagmqisgavAQAPPPsglgygP 162
Cdd:pfam03154  257 pPSQVSPQPLpqpslhGQMPPMPHSLQTGPSHMQ---HPVPPQPFPLTPQSSQ--------------SQVPPG------P 313
                          170       180       190       200       210       220       230
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....
gi 568987261   163 PTSLASASGNFPNSGPYGSYPQSQAPPLSQ-----AQGHPGVQPPLRSA-PPLASSFT---SPASGGPQMPSMTGLLPP 232
Cdd:pfam03154  314 SPAAPGQSQQRIHTPPSQSQLQSQQPPREQplppaPLSMPHIKPPPTTPiPQLPNPQShkhPPHLSGPSPFQMNSNLPP 392
 
Name Accession Description Interval E-value
COG5028 COG5028
Vesicle coat complex COPII, subunit SEC24/subunit SFB2/subunit SFB3 [Intracellular trafficking ...
181-1117 2.30e-166

Vesicle coat complex COPII, subunit SEC24/subunit SFB2/subunit SFB3 [Intracellular trafficking and secretion];


Pssm-ID: 227361 [Multi-domain]  Cd Length: 861  Bit Score: 513.57  E-value: 2.30e-166
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 568987261  181 SYPQSQAPPLSQAQGHPGVQPPLRSAPPLAS--SFTSPASGGPQMPsmtglLPPGQgfgslpvNQANHvssppaPALPPG 258
Cdd:COG5028     2 SQHKKGVYPQAQSQVHTGAASSKKSARPHRAyaNFSAGQMGMPPYT-----TPPLQ-------QQSRR------QIDQAA 63
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 568987261  259 TQMTgppvpppppmhspqQPGYQLQQNGSFGPARGPQPNYESPYPGAPTFGsqpgppqpLPPKRLDPDAiPSPQLNELPP 338
Cdd:COG5028    64 TAMH--------------NTGANNPAPSVMSPAFQSQQKFSSPYGGSMADG--------TAPKPTNPLV-PVDLFEDQPP 120
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 568987261  339 QqKTRHRIDPdaipspiqvieddrnnrgsepfvtgvRGQVPPLvTTNFLVKDQGNASPRYIRCTSYNIPCTSDMAKQAQV 418
Cdd:COG5028   121 P-ISDLFLPP--------------------------PPIVPPL-TTNFVGSEQSNCSPKYVRSTMYAIPETNDLLKKSKI 172
                         250       260       270       280       290       300       310       320
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 568987261  419 PLAAVIKPLARLPPEEASPYVVDHGEsgPLRCNRCKAYMCPLMTFIEGGRRFQCSFCSCVNDVPPQYFQHLDHTGKRVDA 498
Cdd:COG5028   173 PFGLVIRPFLELYPEEDPVPLVEDGS--IVRCRRCRSYINPFVQFIEQGRKWRCNICRSKNDVPEGFDNPSGPNDPRSDR 250
                         330       340       350       360       370       380       390       400
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 568987261  499 YDRPELSLGSYEFLATVDYckNNKFPSPPAFIFMIDVSYNAIRTGLVRLLCEELKSLLDYLPREGGAeesaIRVGFVTYN 578
Cdd:COG5028   251 YSRPELKSGVVDFLAPKEY--SLRQPPPPVYVFLIDVSFEAIKNGLVKAAIRAILENLDQIPNFDPR----TKIAIICFD 324
                         410       420       430       440       450       460       470       480
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 568987261  579 KVLHFYNVKSSLaQPQMMVVSDVADMFVPLLDG-FLVNVSESRAVITSLLDQIPEMFADTRETETVFAPviqagmeALKA 657
Cdd:COG5028   325 SSLHFFKLSPDL-DEQMLIVSDLDEPFLPFPSGlFVLPLKSCKQIIETLLDRVPRIFQDNKSPKNALGP-------ALKA 396
                         490       500       510       520       530       540       550       560
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 568987261  658 A-----ECAGKLFLFHTSLPIAeAPGKLKNRDDrklintdKEKTLFQPQTGTYQTLAKECVAQGCCVDLFLFPNQYVDVA 732
Cdd:COG5028   397 AksligGTGGKIIVFLSTLPNM-GIGKLQLRED-------KESSLLSCKDSFYKEFAIECSKVGISVDLFLTSEDYIDVA 468
                         570       580       590       600       610       620       630       640
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 568987261  733 TLSVVPQLTGGSVYKYACFQVE--NDQERFLSDLRRDVQKVVGFDAVMRVRTSTGIRAVDFFGAFYMSNTTDVELAGLDG 810
Cdd:COG5028   469 TLSHLCRYTGGQTYFYPNFSATrpNDATKLANDLVSHLSMEIGYEAVMRVRCSTGLRVSSFYGNFFNRSSDLCAFSTMPR 548
                         650       660       670       680       690       700       710       720
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 568987261  811 DKTVTVEFKHDDRLNEEnGALLQCALLYTSCAGQRRLRIHNLALNCCTQLADLYRNCETDTLINYMAKFAYRAVLNSPVK 890
Cdd:COG5028   549 DTSLLVEFSIDEKLMTS-DVYFQVALLYTLNDGERRIRVVNLSLPTSSSIREVYASADQLAIACILAKKASTKALNSSLK 627
                         730       740       750       760       770       780       790       800
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 568987261  891 TVRDTLITQCAQILACYRKNCASPSSAGQLILPECMKLLPVYLNCVLKSDVLQPGAeVTTDDRAYVRQLVSSMDVAETNV 970
Cdd:COG5028   628 EARVLINKSMVDILKAYKKELVKSNTSTQLPLPANLKLLPLLMLALLKSSAFRSGS-TPSDIRISALNRLTSLPLKQLMR 706
                         810       820       830       840       850       860       870       880
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 568987261  971 FFYPRLLPLVRTKSPLDSTAE-----PPAVRASEERLSSGDIYLLENGLNLFVWVGASVQQGVVQSLFNVSSFSQITSGL 1045
Cdd:COG5028   707 NIYPTLYALHDMPIEAGLPDEgllvlPSPINATSSLLESGGLYLIDTGQKIFLWFGKDAVPSLLQDLFGVDSLSDIPSGK 786
                         890       900       910       920       930       940       950
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*.
gi 568987261 1046 SVLPVLDNPLSKKVRGLIDSLRaQRMRYMKLIVVKQED----KLEMLFKHFLVEDKSLsGGASYVDFLCHMHKEIR 1117
Cdd:COG5028   787 FTLPPTGNEFNERVRNIIGELR-SVNDDSTLPLVLVRGggdpSLRLWFFSTLVEDKTL-NIPSYLDYLQILHEKIK 860
Sec24-like cd01479
Sec24-like: Protein and membrane traffic in eukaryotes is mediated by at least in part by the ...
524-783 1.22e-123

Sec24-like: Protein and membrane traffic in eukaryotes is mediated by at least in part by the budding and fusion of intracellular transport vesicles that selectively carry cargo proteins and lipids from donor to acceptor organelles. The two main classes of vesicular carriers within the endocytic and the biosynthetic pathways are COP- and clathrin-coated vesicles. Formation of COPII vesicles requires the ordered assembly of the coat built from several cytosolic components GTPase Sar1, complexes of Sec23-Sec24 and Sec13-Sec31. The process is initiated by the conversion of GDP to GTP by the GTPase Sar1 which then recruits the heterodimeric complex of Sec23 and Sec24. This heterodimeric complex generates the pre-budding complex. The final step leading to membrane deformation and budding of COPII-coated vesicles is carried by the heterodimeric complex Sec13-Sec31. The members of this CD belong to the Sec23-like family. Sec 24 is very similar to Sec23. The Sec23 and Sec24 polypeptides fold into five distinct domains: a beta-barrel, a zinc finger, a vWA or trunk, an all helical region and a carboxy Gelsolin domain. The members of this subgroup carry a partial MIDAS motif and have the overall Para-Rossmann type fold that is characteristic of this superfamily.


Pssm-ID: 238756 [Multi-domain]  Cd Length: 244  Bit Score: 378.92  E-value: 1.22e-123
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 568987261  524 PSPPAFIFMIDVSYNAIRTGLVRLLCEELKSLLDYLPREggaeESAIRVGFVTYNKVLHFYNVKSSLAQPQMMVVSDVAD 603
Cdd:cd01479     1 PQPAVYVFLIDVSYNAIKSGLLATACEALLSNLDNLPGD----DPRTRVGFITFDSTLHFFNLKSSLEQPQMMVVSDLDD 76
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 568987261  604 MFVPLLDGFLVNVSESRAVITSLLDQIPEMFADTRETETVFAPVIQAGMEALKaaECAGKLFLFHTSLPIAEApGKLKNR 683
Cdd:cd01479    77 PFLPLPDGLLVNLKESRQVIEDLLDQIPEMFQDTKETESALGPALQAAFLLLK--ETGGKIIVFQSSLPTLGA-GKLKSR 153
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 568987261  684 DDRKLINTDKEKTLFQPQTGTYQTLAKECVAQGCCVDLFLFPNQYVDVATLSVVPQLTGGSVYKYAcfqvendqeRFLSD 763
Cdd:cd01479   154 EDPKLLSTDKEKQLLQPQTDFYKKLALECVKSQISVDLFLFSNQYVDVATLGCLSRLTGGQVYYYP---------SFNFS 224
                         250       260
                  ....*....|....*....|
gi 568987261  764 LRRDVQKVVGFDAVMRVRTS 783
Cdd:cd01479   225 APNDVEKLVNELARYLTRKI 244
Sec23_trunk pfam04811
Sec23/Sec24 trunk domain; COPII-coated vesicles carry proteins from the endoplasmic reticulum ...
524-768 1.57e-115

Sec23/Sec24 trunk domain; COPII-coated vesicles carry proteins from the endoplasmic reticulum to the Golgi complex. This vesicular transport can be reconstituted by using three cytosolic components containing five proteins: the small GTPase Sar1p, the Sec23p/24p complex, and the Sec13p/Sec31p complex. This domain is known as the trunk domain and has an alpha/beta vWA fold and forms the dimer interface.


Pssm-ID: 398467 [Multi-domain]  Cd Length: 241  Bit Score: 357.33  E-value: 1.57e-115
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 568987261   524 PSPPAFIFMIDVSYNAIRTGLVRLLCEELKSLLDYLPREggaeeSAIRVGFVTYNKVLHFYNVKSSLAQPQMMVVSDVAD 603
Cdd:pfam04811    1 PQPPVFLFVIDVSYNAIKSGLLAALKESLLQSLDLLPGD-----PRARVGFITFDSTVHFFNLGSSLRQPQMLVVSDLQD 75
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 568987261   604 MFVPLLDGFLVNVSESRAVITSLLDQIPEMFADTRETETVFAPVIQAGMEALKAAECAGKLFLFHTSLPIAEAPGKLKNR 683
Cdd:pfam04811   76 MFLPLPDRFLVPLSECRFVLEDLLEQLPPMFPVTKRPERCLGPALQAAFLLLKAAFTGGKIMVFQGGLPTVGPGGKLKSR 155
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 568987261   684 DDRKLINTDKEKTLFQPQT-GTYQTLAKECVAQGCCVDLFLFPNQYVDVATLSVVPQLTGGSVYKYACFQVENDQERFLS 762
Cdd:pfam04811  156 LDESHHGTDKEKAKLVKKAdKFYKSLAKECVKQGHSVDLFAFSLDYVDVATLGQLSRLTGGQVYLYPSFQADVDGSKFKQ 235

                   ....*.
gi 568987261   763 DLRRDV 768
Cdd:pfam04811  236 DLQRYF 241
PTZ00395 PTZ00395
Sec24-related protein; Provisional
19-1118 3.09e-48

Sec24-related protein; Provisional


Pssm-ID: 185594 [Multi-domain]  Cd Length: 1560  Bit Score: 188.36  E-value: 3.09e-48
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 568987261   19 IYPGYHqssyGGQPGPAAPATPYGAYNGPVPG--YQQAPP--QGVPRAPPSSGAPPASAAQVPCGQTTYGQfgqgdiqng 94
Cdd:PTZ00395  338 IYGGFH----DGSPNAASAGAPFNGLGNQADGghINQVHPdaRGAWAGGPHSNASYNCAAYSNAAQSNAAQ--------- 404
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 568987261   95 psSTAQMQRVPGSQQfGPPLAPVVSQPAVLQPYGPPPTSTQVTAQlagmqisgavaqaPPPSGlgygPPTSlasasgNFP 174
Cdd:PTZ00395  405 --SNAGFSNAGYSNP-GNSNPGYNNAPNSNTPYNNPPNSNTPYSN-------------PPNSN----PPYS------NLP 458
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 568987261  175 NSG-PYGSYPQSQAPPlSQAQGHPGVqpplrsappLASSFTSPASGGPQMPSMTGLLPPGQGFGSlpvNQANHVSSPPAP 253
Cdd:PTZ00395  459 YSNtPYSNAPLSNAPP-SSAKDHHSA---------YHAAYQHRAANQPAANLPTANQPAANNFHG---AAGNSVGNPFAS 525
                         250       260       270       280       290       300       310       320
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 568987261  254 ALPPGTQMTGppvpppppmhspqqpgyqlqqngsfGPARGPQPNYESPYPGAPTFGSQPGPPQPLPPKRLDPDAI--PSP 331
Cdd:PTZ00395  526 RPFGSAPYGG-------------------------NAATTADPNGIAKREDHPEGGTNRQKYEQSDEESVESSSSenSSE 580
                         330       340       350       360       370       380       390       400
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 568987261  332 QLNELPPQ--------QKTRHRIDPDAIPSPIQVIEDDRNNRGSEPFVTgVRGQVPPLVTTNFLVKDQGNASPRYIRCTS 403
Cdd:PTZ00395  581 NENEVTDKgeeiysllKKTINRIDMNKIPRPIINTQEKKKKKNLKVFET-CKYISPPSYYQPYISIDTGKADPRFLKSTL 659
                         410       420       430       440       450       460       470       480
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 568987261  404 YNIPCTSDMAKQAQVPLAAVIKPLARLPPEEASPYV-----VDHGESGP--LRCNRCKAYMcpLMTFIEG-GRRFQCSFC 475
Cdd:PTZ00395  660 YQIPLFSETLKLSQIPFGIIVNPFACLNEGEGIDKIdmkdiINDKEENIeiLRCPKCLGYL--HATILEDiSSSVQCVFC 737
                         490       500       510       520       530       540       550       560
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 568987261  476 SC---VND--------------------------------------VPPQYFQHLD-------HTGKRV----------- 496
Cdd:PTZ00395  738 DTdflINEnvlfdifqynekighkesdhnehgnslspllkgsvdiiIPPIYYHNVNkfkltytYLNKNInqtafmitnki 817
                         570       580       590       600       610       620       630       640
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 568987261  497 -----------------------------DAYDRPELSLGSY-------------------------------------- 509
Cdd:PTZ00395  818 msftkhisnslvandskggnkatsasafgDSGDANFLAGGGYtnyggaggyntydnqsgynnhdvvnnrggsgagnhlyg 897
                         650       660       670       680       690       700       710       720
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 568987261  510 ------EFLATVD------------YCKNN---------------KFPS-----PPAFIFMIDVSYNAIRTGLVRLLCEE 551
Cdd:PTZ00395  898 kdhdvqNFDNVMDnanftihdmknlICEKNgepdsakirrnsflaKYPQvknmlPPYFVFVVECSYNAIYNNITYTILEG 977
                         730       740       750       760       770       780       790       800
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 568987261  552 LKSLLDYL--PReggaeesaIRVGFVTYNKVLHFYNVKSSLAQP-------------QMMVVSDVADMFVPL-LDGFLVN 615
Cdd:PTZ00395  978 IRYAVQNVkcPQ--------TKIAIITFNSSIYFYHCKGGKGVSgeegdggggsgnhQVIVMSDVDDPFLPLpLEDLFFG 1049
                         810       820       830       840       850       860       870       880
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 568987261  616 VSESRAVITSLLDQIPEMFADTRETETVFAPVIQAGMEALKAAECAGKLFLFHTSLPIAeAPGKLKnrddrKLINTDKEK 695
Cdd:PTZ00395 1050 CVEEIDKINTLIDTIKSVSTTMQSYGSCGNSALKIAMDMLKERNGLGSICMFYTTTPNC-GIGAIK-----ELKKDLQEN 1123
                         890       900       910       920       930       940       950       960
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 568987261  696 TLFQPQTGTYQTLAKECVAQGCCVDLFLFP--NQYVDVATLSVVPQLTGGSVYKYACFQVEND-QERFLSDLRRDVQKVV 772
Cdd:PTZ00395 1124 FLEVKQKIFYDSLLLDLYAFNISVDIFIISsnNVRVCVPSLQYVAQNTGGKILFVENFLWQKDyKEIYMNIMDTLTSEDI 1203
                         970       980       990      1000      1010      1020      1030      1040
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 568987261  773 GFDAVMRVRTSTGIRAVDFFGAFYMSNTT----DVELAGLDGDKTVTVEFKHDDRLNEENGALLQCALLYTSCAGQRRLR 848
Cdd:PTZ00395 1204 AYCCELKLRYSHHMSVKKLFCCNNNFNSIisvdTIKIPKIRHDQTFAFLLNYSDISESKKQIYFQCACIYTNLWGDRFVR 1283
                        1050      1060      1070      1080      1090      1100      1110      1120
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 568987261  849 IHNLALNCCTQLADLYRNCETDTLINYMAKFAYRAVLNSpvKTVRDTLITQCAQILACYRKNCASPSSAGQLILPECMKL 928
Cdd:PTZ00395 1284 LHTTHMNLTSSLSTVFRYTDAEALMNILIKQLCTNILHN--DNYSKIIIDNLAAILFSYRINCASSAHSGQLILPDTLKL 1361
                        1130      1140      1150      1160      1170      1180      1190      1200
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 568987261  929 LPVYLNCVLKSDVLQpgAEVTTDDRAYVRQLVSSMDVAETNVFFYPRLLPL-VRTKS-PLDSTAE------PPAVRASEE 1000
Cdd:PTZ00395 1362 LPLFTSSLLKHNVTK--KEILHDLKVYSLIKLLSMPIISSLLYVYPVMYVIhIKGKTnEIDSMDVdddlfiPKTIPSSAE 1439
                        1210      1220      1230      1240      1250      1260      1270      1280
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 568987261 1001 RLSSGDIYLLENGLNLFVWVGASVQQGVVQSLFNVSSFSQITSGLSvlpVLDNPLSKKVRGLIDSL-RAQRM-RYMKLIV 1078
Cdd:PTZ00395 1440 KIYSNGIYLLDACTHFYLYFGFHSDANFAKEIVGDIPTEKNAHELN---LTDTPNAQKVQRIIKNLsRIHHFnKYVPLVM 1516
                        1290      1300      1310      1320
                  ....*....|....*....|....*....|....*....|
gi 568987261 1079 VKQEDKLEMLFKHFLVEDKSlSGGASYVDFLCHMHKEIRQ 1118
Cdd:PTZ00395 1517 VAPKSNEEEHLISLCVEDKA-DKEYSYVNFLCFIHKLVHK 1555
Atrophin-1 pfam03154
Atrophin-1 family; Atrophin-1 is the protein product of the dentatorubral-pallidoluysian ...
25-232 2.16e-08

Atrophin-1 family; Atrophin-1 is the protein product of the dentatorubral-pallidoluysian atrophy (DRPLA) gene. DRPLA OMIM:125370 is a progressive neurodegenerative disorder. It is caused by the expansion of a CAG repeat in the DRPLA gene on chromosome 12p. This results in an extended polyglutamine region in atrophin-1, that is thought to confer toxicity to the protein, possibly through altering its interactions with other proteins. The expansion of a CAG repeat is also the underlying defect in six other neurodegenerative disorders, including Huntington's disease. One interaction of expanded polyglutamine repeats that is thought to be pathogenic is that with the short glutamine repeat in the transcriptional coactivator CREB binding protein, CBP. This interaction draws CBP away from its usual nuclear location to the expanded polyglutamine repeat protein aggregates that are characteriztic of the polyglutamine neurodegenerative disorders. This interferes with CBP-mediated transcription and causes cytotoxicity.


Pssm-ID: 460830 [Multi-domain]  Cd Length: 991  Bit Score: 58.63  E-value: 2.16e-08
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 568987261    25 QSSYGGQPGPAAPATPYGAYNGPVPGYQQAPPQGVP--RAPPSSGAPPAS------------AAQVPCGQTTYGQFGQG- 89
Cdd:pfam03154  177 QSGAASPPSPPPPGTTQAATAGPTPSAPSVPPQGSPatSQPPNQTQSTAAphtliqqtptlhPQRLPSPHPPLQPMTQPp 256
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 568987261    90 -DIQNGPSST------AQMQRVPGSQQFGPPLAPvvsQPAVLQPYGPPPTSTQvtaqlagmqisgavAQAPPPsglgygP 162
Cdd:pfam03154  257 pPSQVSPQPLpqpslhGQMPPMPHSLQTGPSHMQ---HPVPPQPFPLTPQSSQ--------------SQVPPG------P 313
                          170       180       190       200       210       220       230
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....
gi 568987261   163 PTSLASASGNFPNSGPYGSYPQSQAPPLSQ-----AQGHPGVQPPLRSA-PPLASSFT---SPASGGPQMPSMTGLLPP 232
Cdd:pfam03154  314 SPAAPGQSQQRIHTPPSQSQLQSQQPPREQplppaPLSMPHIKPPPTTPiPQLPNPQShkhPPHLSGPSPFQMNSNLPP 392
PHA03247 PHA03247
large tegument protein UL36; Provisional
7-344 6.13e-08

large tegument protein UL36; Provisional


Pssm-ID: 223021 [Multi-domain]  Cd Length: 3151  Bit Score: 57.26  E-value: 6.13e-08
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 568987261    7 APPVPPYGQNQPIYPGYHQSSYGGQPGPAAPATPYGAYNGPVPGYQQAPPQGVPRAPPSSGAPPASAAQVPCGQTTygqf 86
Cdd:PHA03247 2709 EPAPHALVSATPLPPGPAAARQASPALPAAPAPPAVPAGPATPGGPARPARPPTTAGPPAPAPPAAPAAGPPRRLT---- 2784
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 568987261   87 gqgdIQNGPSSTAQMQRVPGSQQFGPPLAPVVSQPAVLQPYG------PPPTSTQVTAqlagmqisgavaqAPPPSGLgy 160
Cdd:PHA03247 2785 ----RPAVASLSESRESLPSPWDPADPPAAVLAPAAALPPAAspagplPPPTSAQPTA-------------PPPPPGP-- 2845
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 568987261  161 gPPTSLASASGNFPnSGPYGSYPQSQAPPLSQA-QGHPGVQPPLRSAPPLAS-SFTSPASGgPQMPSMTGLLPPGQGFGS 238
Cdd:PHA03247 2846 -PPPSLPLGGSVAP-GGDVRRRPPSRSPAAKPAaPARPPVRRLARPAVSRSTeSFALPPDQ-PERPPQPQAPPPPQPQPQ 2922
                         250       260       270       280       290       300       310       320
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 568987261  239 LPVnqanhvssppapalppgtqmtgpPVPPPPPMHSPQQPGYQLQQNGSFGPARGPQPNYESPYPGAPTFGSqpgppqPL 318
Cdd:PHA03247 2923 PPP-----------------------PPQPQPPPPPPPRPQPPLAPTTDPAGAGEPSGAVPQPWLGALVPGR------VA 2973
                         330       340
                  ....*....|....*....|....*.
gi 568987261  319 PPKRLDPDAIPSPQLNELPPQQKTRH 344
Cdd:PHA03247 2974 VPRFRVPQPAPSREAPASSTPPLTGH 2999
PABP-1234 TIGR01628
polyadenylate binding protein, human types 1, 2, 3, 4 family; These eukaryotic proteins ...
1-118 2.17e-04

polyadenylate binding protein, human types 1, 2, 3, 4 family; These eukaryotic proteins recognize the poly-A of mRNA and consists of four tandem RNA recognition domains at the N-terminus (rrm: pfam00076) followed by a PABP-specific domain (pfam00658) at the C-terminus. The protein is involved in the transport of mRNA's from the nucleus to the cytoplasm. There are four paralogs in Homo sapiens which are expressed in testis, platelets, broadly expressed and of unknown tissue range.


Pssm-ID: 130689 [Multi-domain]  Cd Length: 562  Bit Score: 45.18  E-value: 2.17e-04
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 568987261     1 MNVNQSAPPVP---PYGQNQPiYPGYHqssyGGQPGPAAPAtPYGAYNGPVPGYQQA----PPQGVPRAPPSSGAPPASA 73
Cdd:TIGR01628  403 QGPQQQFNGQPlgwPRMSMMP-TPMGP----GGPLRPNGLA-PMNAVRAPSRNAQNAaqkpPMQPVMYPPNYQSLPLSQD 476
                           90       100       110       120
                   ....*....|....*....|....*....|....*....|....*..
gi 568987261    74 AQVPcgQTTYGQFGQGD--IQNGPSSTAQMQRvpgsQQFGPPLAPVV 118
Cdd:TIGR01628  477 LPQP--QSTASQGGQNKklAQVLASATPQMQK----QVLGERLFPLV 517
COG3416 COG3416
Uncharacterized conserved protein, DUF2076 domain [Function unknown];
14-67 1.33e-03

Uncharacterized conserved protein, DUF2076 domain [Function unknown];


Pssm-ID: 442642 [Multi-domain]  Cd Length: 237  Bit Score: 41.55  E-value: 1.33e-03
                          10        20        30        40        50
                  ....*....|....*....|....*....|....*....|....*....|....
gi 568987261   14 GQNQPIYPGYHQSSYGGQPGPAAPATPYGAYNGPVPGYQQaPPQGVPRAPPSSG 67
Cdd:COG3416    91 GGGQRPPPAPQPSQPGPQQQPAPPSGPWGQAAPQQPGYGQ-PQYGQPAAGPSGG 143
 
Name Accession Description Interval E-value
COG5028 COG5028
Vesicle coat complex COPII, subunit SEC24/subunit SFB2/subunit SFB3 [Intracellular trafficking ...
181-1117 2.30e-166

Vesicle coat complex COPII, subunit SEC24/subunit SFB2/subunit SFB3 [Intracellular trafficking and secretion];


Pssm-ID: 227361 [Multi-domain]  Cd Length: 861  Bit Score: 513.57  E-value: 2.30e-166
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 568987261  181 SYPQSQAPPLSQAQGHPGVQPPLRSAPPLAS--SFTSPASGGPQMPsmtglLPPGQgfgslpvNQANHvssppaPALPPG 258
Cdd:COG5028     2 SQHKKGVYPQAQSQVHTGAASSKKSARPHRAyaNFSAGQMGMPPYT-----TPPLQ-------QQSRR------QIDQAA 63
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 568987261  259 TQMTgppvpppppmhspqQPGYQLQQNGSFGPARGPQPNYESPYPGAPTFGsqpgppqpLPPKRLDPDAiPSPQLNELPP 338
Cdd:COG5028    64 TAMH--------------NTGANNPAPSVMSPAFQSQQKFSSPYGGSMADG--------TAPKPTNPLV-PVDLFEDQPP 120
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 568987261  339 QqKTRHRIDPdaipspiqvieddrnnrgsepfvtgvRGQVPPLvTTNFLVKDQGNASPRYIRCTSYNIPCTSDMAKQAQV 418
Cdd:COG5028   121 P-ISDLFLPP--------------------------PPIVPPL-TTNFVGSEQSNCSPKYVRSTMYAIPETNDLLKKSKI 172
                         250       260       270       280       290       300       310       320
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 568987261  419 PLAAVIKPLARLPPEEASPYVVDHGEsgPLRCNRCKAYMCPLMTFIEGGRRFQCSFCSCVNDVPPQYFQHLDHTGKRVDA 498
Cdd:COG5028   173 PFGLVIRPFLELYPEEDPVPLVEDGS--IVRCRRCRSYINPFVQFIEQGRKWRCNICRSKNDVPEGFDNPSGPNDPRSDR 250
                         330       340       350       360       370       380       390       400
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 568987261  499 YDRPELSLGSYEFLATVDYckNNKFPSPPAFIFMIDVSYNAIRTGLVRLLCEELKSLLDYLPREGGAeesaIRVGFVTYN 578
Cdd:COG5028   251 YSRPELKSGVVDFLAPKEY--SLRQPPPPVYVFLIDVSFEAIKNGLVKAAIRAILENLDQIPNFDPR----TKIAIICFD 324
                         410       420       430       440       450       460       470       480
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 568987261  579 KVLHFYNVKSSLaQPQMMVVSDVADMFVPLLDG-FLVNVSESRAVITSLLDQIPEMFADTRETETVFAPviqagmeALKA 657
Cdd:COG5028   325 SSLHFFKLSPDL-DEQMLIVSDLDEPFLPFPSGlFVLPLKSCKQIIETLLDRVPRIFQDNKSPKNALGP-------ALKA 396
                         490       500       510       520       530       540       550       560
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 568987261  658 A-----ECAGKLFLFHTSLPIAeAPGKLKNRDDrklintdKEKTLFQPQTGTYQTLAKECVAQGCCVDLFLFPNQYVDVA 732
Cdd:COG5028   397 AksligGTGGKIIVFLSTLPNM-GIGKLQLRED-------KESSLLSCKDSFYKEFAIECSKVGISVDLFLTSEDYIDVA 468
                         570       580       590       600       610       620       630       640
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 568987261  733 TLSVVPQLTGGSVYKYACFQVE--NDQERFLSDLRRDVQKVVGFDAVMRVRTSTGIRAVDFFGAFYMSNTTDVELAGLDG 810
Cdd:COG5028   469 TLSHLCRYTGGQTYFYPNFSATrpNDATKLANDLVSHLSMEIGYEAVMRVRCSTGLRVSSFYGNFFNRSSDLCAFSTMPR 548
                         650       660       670       680       690       700       710       720
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 568987261  811 DKTVTVEFKHDDRLNEEnGALLQCALLYTSCAGQRRLRIHNLALNCCTQLADLYRNCETDTLINYMAKFAYRAVLNSPVK 890
Cdd:COG5028   549 DTSLLVEFSIDEKLMTS-DVYFQVALLYTLNDGERRIRVVNLSLPTSSSIREVYASADQLAIACILAKKASTKALNSSLK 627
                         730       740       750       760       770       780       790       800
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 568987261  891 TVRDTLITQCAQILACYRKNCASPSSAGQLILPECMKLLPVYLNCVLKSDVLQPGAeVTTDDRAYVRQLVSSMDVAETNV 970
Cdd:COG5028   628 EARVLINKSMVDILKAYKKELVKSNTSTQLPLPANLKLLPLLMLALLKSSAFRSGS-TPSDIRISALNRLTSLPLKQLMR 706
                         810       820       830       840       850       860       870       880
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 568987261  971 FFYPRLLPLVRTKSPLDSTAE-----PPAVRASEERLSSGDIYLLENGLNLFVWVGASVQQGVVQSLFNVSSFSQITSGL 1045
Cdd:COG5028   707 NIYPTLYALHDMPIEAGLPDEgllvlPSPINATSSLLESGGLYLIDTGQKIFLWFGKDAVPSLLQDLFGVDSLSDIPSGK 786
                         890       900       910       920       930       940       950
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*.
gi 568987261 1046 SVLPVLDNPLSKKVRGLIDSLRaQRMRYMKLIVVKQED----KLEMLFKHFLVEDKSLsGGASYVDFLCHMHKEIR 1117
Cdd:COG5028   787 FTLPPTGNEFNERVRNIIGELR-SVNDDSTLPLVLVRGggdpSLRLWFFSTLVEDKTL-NIPSYLDYLQILHEKIK 860
Sec24-like cd01479
Sec24-like: Protein and membrane traffic in eukaryotes is mediated by at least in part by the ...
524-783 1.22e-123

Sec24-like: Protein and membrane traffic in eukaryotes is mediated by at least in part by the budding and fusion of intracellular transport vesicles that selectively carry cargo proteins and lipids from donor to acceptor organelles. The two main classes of vesicular carriers within the endocytic and the biosynthetic pathways are COP- and clathrin-coated vesicles. Formation of COPII vesicles requires the ordered assembly of the coat built from several cytosolic components GTPase Sar1, complexes of Sec23-Sec24 and Sec13-Sec31. The process is initiated by the conversion of GDP to GTP by the GTPase Sar1 which then recruits the heterodimeric complex of Sec23 and Sec24. This heterodimeric complex generates the pre-budding complex. The final step leading to membrane deformation and budding of COPII-coated vesicles is carried by the heterodimeric complex Sec13-Sec31. The members of this CD belong to the Sec23-like family. Sec 24 is very similar to Sec23. The Sec23 and Sec24 polypeptides fold into five distinct domains: a beta-barrel, a zinc finger, a vWA or trunk, an all helical region and a carboxy Gelsolin domain. The members of this subgroup carry a partial MIDAS motif and have the overall Para-Rossmann type fold that is characteristic of this superfamily.


Pssm-ID: 238756 [Multi-domain]  Cd Length: 244  Bit Score: 378.92  E-value: 1.22e-123
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 568987261  524 PSPPAFIFMIDVSYNAIRTGLVRLLCEELKSLLDYLPREggaeESAIRVGFVTYNKVLHFYNVKSSLAQPQMMVVSDVAD 603
Cdd:cd01479     1 PQPAVYVFLIDVSYNAIKSGLLATACEALLSNLDNLPGD----DPRTRVGFITFDSTLHFFNLKSSLEQPQMMVVSDLDD 76
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 568987261  604 MFVPLLDGFLVNVSESRAVITSLLDQIPEMFADTRETETVFAPVIQAGMEALKaaECAGKLFLFHTSLPIAEApGKLKNR 683
Cdd:cd01479    77 PFLPLPDGLLVNLKESRQVIEDLLDQIPEMFQDTKETESALGPALQAAFLLLK--ETGGKIIVFQSSLPTLGA-GKLKSR 153
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 568987261  684 DDRKLINTDKEKTLFQPQTGTYQTLAKECVAQGCCVDLFLFPNQYVDVATLSVVPQLTGGSVYKYAcfqvendqeRFLSD 763
Cdd:cd01479   154 EDPKLLSTDKEKQLLQPQTDFYKKLALECVKSQISVDLFLFSNQYVDVATLGCLSRLTGGQVYYYP---------SFNFS 224
                         250       260
                  ....*....|....*....|
gi 568987261  764 LRRDVQKVVGFDAVMRVRTS 783
Cdd:cd01479   225 APNDVEKLVNELARYLTRKI 244
Sec23_trunk pfam04811
Sec23/Sec24 trunk domain; COPII-coated vesicles carry proteins from the endoplasmic reticulum ...
524-768 1.57e-115

Sec23/Sec24 trunk domain; COPII-coated vesicles carry proteins from the endoplasmic reticulum to the Golgi complex. This vesicular transport can be reconstituted by using three cytosolic components containing five proteins: the small GTPase Sar1p, the Sec23p/24p complex, and the Sec13p/Sec31p complex. This domain is known as the trunk domain and has an alpha/beta vWA fold and forms the dimer interface.


Pssm-ID: 398467 [Multi-domain]  Cd Length: 241  Bit Score: 357.33  E-value: 1.57e-115
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 568987261   524 PSPPAFIFMIDVSYNAIRTGLVRLLCEELKSLLDYLPREggaeeSAIRVGFVTYNKVLHFYNVKSSLAQPQMMVVSDVAD 603
Cdd:pfam04811    1 PQPPVFLFVIDVSYNAIKSGLLAALKESLLQSLDLLPGD-----PRARVGFITFDSTVHFFNLGSSLRQPQMLVVSDLQD 75
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 568987261   604 MFVPLLDGFLVNVSESRAVITSLLDQIPEMFADTRETETVFAPVIQAGMEALKAAECAGKLFLFHTSLPIAEAPGKLKNR 683
Cdd:pfam04811   76 MFLPLPDRFLVPLSECRFVLEDLLEQLPPMFPVTKRPERCLGPALQAAFLLLKAAFTGGKIMVFQGGLPTVGPGGKLKSR 155
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 568987261   684 DDRKLINTDKEKTLFQPQT-GTYQTLAKECVAQGCCVDLFLFPNQYVDVATLSVVPQLTGGSVYKYACFQVENDQERFLS 762
Cdd:pfam04811  156 LDESHHGTDKEKAKLVKKAdKFYKSLAKECVKQGHSVDLFAFSLDYVDVATLGQLSRLTGGQVYLYPSFQADVDGSKFKQ 235

                   ....*.
gi 568987261   763 DLRRDV 768
Cdd:pfam04811  236 DLQRYF 241
trunk_domain cd01468
trunk domain. COPII-coated vesicles carry proteins from the endoplasmic reticulum to the Golgi ...
524-766 2.32e-103

trunk domain. COPII-coated vesicles carry proteins from the endoplasmic reticulum to the Golgi complex. This vesicular transport can be reconstituted by using three cytosolic components containing five proteins: the small GTPase Sar1p, the Sec23p/24p complex, and the Sec13p/Sec31p complex. This domain is known as the trunk domain and has an alpha/beta vWA fold and forms the dimer interface. Some members of this family possess a partial MIDAS motif that is a characteristic feature of most vWA domain proteins.


Pssm-ID: 238745 [Multi-domain]  Cd Length: 239  Bit Score: 324.97  E-value: 2.32e-103
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 568987261  524 PSPPAFIFMIDVSYNAIRTGLVRLLCEELKSLLDYLPREGGAeesaiRVGFVTYNKVLHFYNVKSSLAQPQMMVVSDVAD 603
Cdd:cd01468     1 PQPPVFVFVIDVSYEAIKEGLLQALKESLLASLDLLPGDPRA-----RVGLITYDSTVHFYNLSSDLAQPKMYVVSDLKD 75
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 568987261  604 MFVPLLDGFLVNVSESRAVITSLLDQIPEMFAD--TRETETVFAPVIQAGMEALKAAECAGKLFLFHTSLPIAEaPGKLK 681
Cdd:cd01468    76 VFLPLPDRFLVPLSECKKVIHDLLEQLPPMFWPvpTHRPERCLGPALQAAFLLLKGTFAGGRIIVFQGGLPTVG-PGKLK 154
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 568987261  682 NRDDRKLINTDKEKTLFQPQTGTYQTLAKECVAQGCCVDLFLFPNQYVDVATLSVVPQLTGGSVYKYACFQVENDQERFL 761
Cdd:cd01468   155 SREDKEPIRSHDEAQLLKPATKFYKSLAKECVKSGICVDLFAFSLDYVDVATLKQLAKSTGGQVYLYDSFQAPNDGSKFK 234

                  ....*
gi 568987261  762 SDLRR 766
Cdd:cd01468   235 QDLQR 239
PTZ00395 PTZ00395
Sec24-related protein; Provisional
19-1118 3.09e-48

Sec24-related protein; Provisional


Pssm-ID: 185594 [Multi-domain]  Cd Length: 1560  Bit Score: 188.36  E-value: 3.09e-48
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 568987261   19 IYPGYHqssyGGQPGPAAPATPYGAYNGPVPG--YQQAPP--QGVPRAPPSSGAPPASAAQVPCGQTTYGQfgqgdiqng 94
Cdd:PTZ00395  338 IYGGFH----DGSPNAASAGAPFNGLGNQADGghINQVHPdaRGAWAGGPHSNASYNCAAYSNAAQSNAAQ--------- 404
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 568987261   95 psSTAQMQRVPGSQQfGPPLAPVVSQPAVLQPYGPPPTSTQVTAQlagmqisgavaqaPPPSGlgygPPTSlasasgNFP 174
Cdd:PTZ00395  405 --SNAGFSNAGYSNP-GNSNPGYNNAPNSNTPYNNPPNSNTPYSN-------------PPNSN----PPYS------NLP 458
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 568987261  175 NSG-PYGSYPQSQAPPlSQAQGHPGVqpplrsappLASSFTSPASGGPQMPSMTGLLPPGQGFGSlpvNQANHVSSPPAP 253
Cdd:PTZ00395  459 YSNtPYSNAPLSNAPP-SSAKDHHSA---------YHAAYQHRAANQPAANLPTANQPAANNFHG---AAGNSVGNPFAS 525
                         250       260       270       280       290       300       310       320
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 568987261  254 ALPPGTQMTGppvpppppmhspqqpgyqlqqngsfGPARGPQPNYESPYPGAPTFGSQPGPPQPLPPKRLDPDAI--PSP 331
Cdd:PTZ00395  526 RPFGSAPYGG-------------------------NAATTADPNGIAKREDHPEGGTNRQKYEQSDEESVESSSSenSSE 580
                         330       340       350       360       370       380       390       400
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 568987261  332 QLNELPPQ--------QKTRHRIDPDAIPSPIQVIEDDRNNRGSEPFVTgVRGQVPPLVTTNFLVKDQGNASPRYIRCTS 403
Cdd:PTZ00395  581 NENEVTDKgeeiysllKKTINRIDMNKIPRPIINTQEKKKKKNLKVFET-CKYISPPSYYQPYISIDTGKADPRFLKSTL 659
                         410       420       430       440       450       460       470       480
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 568987261  404 YNIPCTSDMAKQAQVPLAAVIKPLARLPPEEASPYV-----VDHGESGP--LRCNRCKAYMcpLMTFIEG-GRRFQCSFC 475
Cdd:PTZ00395  660 YQIPLFSETLKLSQIPFGIIVNPFACLNEGEGIDKIdmkdiINDKEENIeiLRCPKCLGYL--HATILEDiSSSVQCVFC 737
                         490       500       510       520       530       540       550       560
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 568987261  476 SC---VND--------------------------------------VPPQYFQHLD-------HTGKRV----------- 496
Cdd:PTZ00395  738 DTdflINEnvlfdifqynekighkesdhnehgnslspllkgsvdiiIPPIYYHNVNkfkltytYLNKNInqtafmitnki 817
                         570       580       590       600       610       620       630       640
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 568987261  497 -----------------------------DAYDRPELSLGSY-------------------------------------- 509
Cdd:PTZ00395  818 msftkhisnslvandskggnkatsasafgDSGDANFLAGGGYtnyggaggyntydnqsgynnhdvvnnrggsgagnhlyg 897
                         650       660       670       680       690       700       710       720
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 568987261  510 ------EFLATVD------------YCKNN---------------KFPS-----PPAFIFMIDVSYNAIRTGLVRLLCEE 551
Cdd:PTZ00395  898 kdhdvqNFDNVMDnanftihdmknlICEKNgepdsakirrnsflaKYPQvknmlPPYFVFVVECSYNAIYNNITYTILEG 977
                         730       740       750       760       770       780       790       800
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 568987261  552 LKSLLDYL--PReggaeesaIRVGFVTYNKVLHFYNVKSSLAQP-------------QMMVVSDVADMFVPL-LDGFLVN 615
Cdd:PTZ00395  978 IRYAVQNVkcPQ--------TKIAIITFNSSIYFYHCKGGKGVSgeegdggggsgnhQVIVMSDVDDPFLPLpLEDLFFG 1049
                         810       820       830       840       850       860       870       880
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 568987261  616 VSESRAVITSLLDQIPEMFADTRETETVFAPVIQAGMEALKAAECAGKLFLFHTSLPIAeAPGKLKnrddrKLINTDKEK 695
Cdd:PTZ00395 1050 CVEEIDKINTLIDTIKSVSTTMQSYGSCGNSALKIAMDMLKERNGLGSICMFYTTTPNC-GIGAIK-----ELKKDLQEN 1123
                         890       900       910       920       930       940       950       960
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 568987261  696 TLFQPQTGTYQTLAKECVAQGCCVDLFLFP--NQYVDVATLSVVPQLTGGSVYKYACFQVEND-QERFLSDLRRDVQKVV 772
Cdd:PTZ00395 1124 FLEVKQKIFYDSLLLDLYAFNISVDIFIISsnNVRVCVPSLQYVAQNTGGKILFVENFLWQKDyKEIYMNIMDTLTSEDI 1203
                         970       980       990      1000      1010      1020      1030      1040
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 568987261  773 GFDAVMRVRTSTGIRAVDFFGAFYMSNTT----DVELAGLDGDKTVTVEFKHDDRLNEENGALLQCALLYTSCAGQRRLR 848
Cdd:PTZ00395 1204 AYCCELKLRYSHHMSVKKLFCCNNNFNSIisvdTIKIPKIRHDQTFAFLLNYSDISESKKQIYFQCACIYTNLWGDRFVR 1283
                        1050      1060      1070      1080      1090      1100      1110      1120
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 568987261  849 IHNLALNCCTQLADLYRNCETDTLINYMAKFAYRAVLNSpvKTVRDTLITQCAQILACYRKNCASPSSAGQLILPECMKL 928
Cdd:PTZ00395 1284 LHTTHMNLTSSLSTVFRYTDAEALMNILIKQLCTNILHN--DNYSKIIIDNLAAILFSYRINCASSAHSGQLILPDTLKL 1361
                        1130      1140      1150      1160      1170      1180      1190      1200
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 568987261  929 LPVYLNCVLKSDVLQpgAEVTTDDRAYVRQLVSSMDVAETNVFFYPRLLPL-VRTKS-PLDSTAE------PPAVRASEE 1000
Cdd:PTZ00395 1362 LPLFTSSLLKHNVTK--KEILHDLKVYSLIKLLSMPIISSLLYVYPVMYVIhIKGKTnEIDSMDVdddlfiPKTIPSSAE 1439
                        1210      1220      1230      1240      1250      1260      1270      1280
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 568987261 1001 RLSSGDIYLLENGLNLFVWVGASVQQGVVQSLFNVSSFSQITSGLSvlpVLDNPLSKKVRGLIDSL-RAQRM-RYMKLIV 1078
Cdd:PTZ00395 1440 KIYSNGIYLLDACTHFYLYFGFHSDANFAKEIVGDIPTEKNAHELN---LTDTPNAQKVQRIIKNLsRIHHFnKYVPLVM 1516
                        1290      1300      1310      1320
                  ....*....|....*....|....*....|....*....|
gi 568987261 1079 VKQEDKLEMLFKHFLVEDKSlSGGASYVDFLCHMHKEIRQ 1118
Cdd:PTZ00395 1517 VAPKSNEEEHLISLCVEDKA-DKEYSYVNFLCFIHKLVHK 1555
Sec23_helical pfam04815
Sec23/Sec24 helical domain; COPII-coated vesicles carry proteins from the endoplasmic ...
870-968 2.77e-35

Sec23/Sec24 helical domain; COPII-coated vesicles carry proteins from the endoplasmic reticulum to the Golgi complex. This vesicular transport can be reconstituted by using three cytosolic components containing five proteins: the small GTPase Sar1p, the Sec23p/24p complex, and the Sec13p/Sec31p complex. This domain is composed of five alpha helices.


Pssm-ID: 461441 [Multi-domain]  Cd Length: 103  Bit Score: 129.54  E-value: 2.77e-35
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 568987261   870 DTLINYMAKFAYRAVLNSPVKTVRDTLITQCAQILACYRKNCASPSSAGQLILPECMKLLPVYLNCVLKSDVLQPGAEVT 949
Cdd:pfam04815    3 EAIAVLLAKKAVEKALSSSLSDAREALDNKLVDILAAYRKYCASSSSPGQLILPESLKLLPLYMLALLKSPALRGGNSSP 82
                           90
                   ....*....|....*....
gi 568987261   950 TDDRAYVRQLVSSMDVAET 968
Cdd:pfam04815   83 SDERAYARHLLLSLPVEEL 101
Sec23_BS pfam08033
Sec23/Sec24 beta-sandwich domain;
773-856 3.30e-28

Sec23/Sec24 beta-sandwich domain;


Pssm-ID: 429794 [Multi-domain]  Cd Length: 86  Bit Score: 108.78  E-value: 3.30e-28
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 568987261   773 GFDAVMRVRTSTGIRAVDFFGAFYMSNTTD-VELAGLDGDKTVTVEFKHDDRLNEENGALLQCALLYTSCAGQRRLRIHN 851
Cdd:pfam08033    1 GFNAVLRVRTSKGLKVSGFIGNFVSRSSGDtWKLPSLDPDTSYAFEFDIDEPLPNGSNAYIQFALLYTHSSGERRIRVTT 80

                   ....*
gi 568987261   852 LALNC 856
Cdd:pfam08033   81 VALPV 85
PLN00162 PLN00162
transport protein sec23; Provisional
403-849 1.88e-17

transport protein sec23; Provisional


Pssm-ID: 215083 [Multi-domain]  Cd Length: 761  Bit Score: 87.69  E-value: 1.88e-17
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 568987261  403 SYNI-PCTSDMAKQAQVPLAAVIKPLARLPPEEASPYvvdhgesGPLRCNRCKAYMCPLMTFIEGGRRFQCSFCSCVNDV 481
Cdd:PLN00162   15 SWNVwPSSKIEASKCVIPLAALYTPLKPLPELPVLPY-------DPLRCRTCRAVLNPYCRVDFQAKIWICPFCFQRNHF 87
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 568987261  482 PPQYF----QHLDhtgkrvdaydrPELslgsYEFLATVDY---CKNNKFPSPPAFIFMIDVSynAIRTGLvRLLCEELKS 554
Cdd:PLN00162   88 PPHYSsiseTNLP-----------AEL----FPQYTTVEYtlpPGSGGAPSPPVFVFVVDTC--MIEEEL-GALKSALLQ 149
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 568987261  555 LLDYLPreggaeESAiRVGFVTY----------------------------NKVLHFYNVKSSLAQPQMMVVSDVADMFV 606
Cdd:PLN00162  150 AIALLP------ENA-LVGLITFgthvhvhelgfsecsksyvfrgnkevskDQILEQLGLGGKKRRPAGGGIAGARDGLS 222
                         250       260       270       280       290       300       310       320
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 568987261  607 PL-LDGFLVNVSESRAVITSLLDQI-PEMF---ADTRETE-TVFAPVIQAGMEALKAAECAGKLFLFhTSLPIAEAPGKL 680
Cdd:PLN00162  223 SSgVNRFLLPASECEFTLNSALEELqKDPWpvpPGHRPARcTGAALSVAAGLLGACVPGTGARIMAF-VGGPCTEGPGAI 301
                         330       340       350       360       370       380       390       400
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 568987261  681 KNRDDRKLINTDKE-----KTLFQPQTGTYQTLAKECVAQGCCVDLFLFPNQYVDVATLSVVPQLTGGSVYKYACFqven 755
Cdd:PLN00162  302 VSKDLSEPIRSHKDldkdaAPYYKKAVKFYEGLAKQLVAQGHVLDVFACSLDQVGVAEMKVAVERTGGLVVLAESF---- 377
                         410       420       430       440       450       460       470       480
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 568987261  756 DQERFLSDLRRDVQKV------VGFDAVMRVRTSTGIRAVDFFG---------------AFYMSNTTDVELAGLDGDKTV 814
Cdd:PLN00162  378 GHSVFKDSLRRVFERDgegslgLSFNGTFEVNCSKDVKVQGAIGpcaslekkgpsvsdtEIGEGGTTAWKLCGLDKKTSL 457
                         490       500       510       520
                  ....*....|....*....|....*....|....*....|
gi 568987261  815 TVEF----KHDDRLNEENGAL-LQCALLYTSCAGQRRLRI 849
Cdd:PLN00162  458 AVFFevanSGQSNPQPPGQQFfLQFLTRYQHSNGQTRLRV 497
SEC23 COG5047
Vesicle coat complex COPII, subunit SEC23 [Intracellular trafficking and secretion];
399-1026 1.79e-15

Vesicle coat complex COPII, subunit SEC23 [Intracellular trafficking and secretion];


Pssm-ID: 227380 [Multi-domain]  Cd Length: 755  Bit Score: 81.47  E-value: 1.79e-15
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 568987261  399 IRCTSYNIPCTSDMAKQAQVPLAAVIKPLARLPPEEASPYvvdhgesGPLRCNR-CKAYMCPLMTFIEGGRRFQCSFCSC 477
Cdd:COG5047    12 IRLTWNVFPATRGDATRTVIPIACLYTPLHEDDALTVNYY-------EPVKCTApCKAVLNPYCHIDERNQSWICPFCNQ 84
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 568987261  478 VNDVPPQYfqhLDHTGKRVDaydrPELSLGSyeflATVDYCKNNKFPSPPAFIFMIDVSYNAIRtglVRLLCEELKSLLD 557
Cdd:COG5047    85 RNTLPPQY---RDISNANLP----LELLPQS----STIEYTLSKPVILPPVFFFVVDACCDEEE---LTALKDSLIVSLS 150
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 568987261  558 YLPREggaeesAIrVGFVTYNKVLHFYNVkSSLAQPQMMVVSDVADMFVPLLD--------------------------- 610
Cdd:COG5047   151 LLPPE------AL-VGLITYGTSIQVHEL-NAENHRRSYVFSGNKEYTKENLQellalskptksggfeskisgigqfass 222
                         250       260       270       280       290       300       310       320
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 568987261  611 GFLVNVSESRAVITSLLDQI-PEMF---ADTRETE-TVFAPVIQAGMEALKAAECAGKLFLFhTSLPIAEAPGKLKNRDD 685
Cdd:COG5047   223 RFLLPTQQCEFKLLNILEQLqPDPWpvpAGKRPLRcTGSALNIASSLLEQCFPNAGCHIVLF-AGGPCTVGPGTVVSTEL 301
                         330       340       350       360       370       380       390       400
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 568987261  686 RK------LINTDKEKtLFQPQTGTYQTLAKECVAQGCCVDLFLFPNQYVDVATLSVVPQLTGGSVYKYACFQVENDQER 759
Cdd:COG5047   302 KEpmrshhDIESDSAQ-HSKKATKFYKGLAERVANQGHALDIFAGCLDQIGIMEMEPLTTSTGGALVLSDSFTTSIFKQS 380
                         410       420       430       440       450       460       470       480
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 568987261  760 FLSDLRRDVQK--VVGFDAVMRVRTSTGIRAVDFFG---------------AFYMSNTTDVELAGLDGDKTVTVEFKHDD 822
Cdd:COG5047   381 FQRIFNRDSEGylKMGFNANMEVKTSKNLKIKGLIGhavsvkkkannisdsEIGIGATNSWKMASLSPKSNYALYFEIAL 460
                         490       500       510       520       530       540       550       560
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 568987261  823 RLNEENG-----ALLQCALLYTSCAGQRRLRIHNLALNCCTQLADL-YRNCETDTLINYMAKFA-YRAVLNSPVKTVR-- 893
Cdd:COG5047   461 GAASGSAqrpaeAYIQFITTYQHSSGTYRIRVTTVARMFTDGGLPKiNRSFDQEAAAVFMARIAaFKAETEDIIDVFRwi 540
                         570       580       590       600       610       620       630       640
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 568987261  894 -DTLITQCaQILACYRKNcaSPSSAGqliLPECMKLLPVYLNCVLKSDVLQPGAEvTTDDRAYVRQLVSSMDVAETNVFF 972
Cdd:COG5047   541 dRNLIRLC-QKFADYRKD--DPSSFR---LDPNFTLYPQFMYHLRRSPFLSVFNN-SPDETAFYRHMLNNADVNDSLIMI 613
                         650       660       670       680       690
                  ....*....|....*....|....*....|....*....|....*....|....*...
gi 568987261  973 YPRLLPLVRTKSP----LDSTAEPPAVraseerlssgdIYLLENGLNLFVWVGASVQQ 1026
Cdd:COG5047   614 QPTLQSYSFEKGGvpvlLDSVSVKPDV-----------ILLLDTFFHILIFHGSYIAQ 660
zf-Sec23_Sec24 pfam04810
Sec23/Sec24 zinc finger; COPII-coated vesicles carry proteins from the endoplasmic reticulum ...
447-484 7.15e-15

Sec23/Sec24 zinc finger; COPII-coated vesicles carry proteins from the endoplasmic reticulum to the Golgi complex. This vesicular transport can be reconstituted by using three cytosolic components containing five proteins: the small GTPase Sar1p, the Sec23p/24p complex, and the Sec13p/Sec31p complex. This domain is found to be zinc binding domain.


Pssm-ID: 461437 [Multi-domain]  Cd Length: 38  Bit Score: 69.40  E-value: 7.15e-15
                           10        20        30
                   ....*....|....*....|....*....|....*...
gi 568987261   447 PLRCNRCKAYMCPLMTFIEGGRRFQCSFCSCVNDVPPQ 484
Cdd:pfam04810    1 PVRCRRCRAYLNPFCQFDFGGKKWTCNFCGTRNPVPPE 38
Gelsolin pfam00626
Gelsolin repeat;
991-1063 1.15e-11

Gelsolin repeat;


Pssm-ID: 395501 [Multi-domain]  Cd Length: 76  Bit Score: 61.55  E-value: 1.15e-11
                           10        20        30        40        50        60        70
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*
gi 568987261   991 EPPAVRASEERLSSGDIYLLENGLNLFVWVGASVQQgvVQSLFNVSSFSQI-TSGLSVLPVLDN-PLSKKVRGLI 1063
Cdd:pfam00626    4 LPPPVPLSQESLNSGDCYLLDNGFTIFLWVGKGSSL--LEKLFAALLAAQLdDDERFPLPEVIRvPQGKEPARFL 76
Atrophin-1 pfam03154
Atrophin-1 family; Atrophin-1 is the protein product of the dentatorubral-pallidoluysian ...
25-232 2.16e-08

Atrophin-1 family; Atrophin-1 is the protein product of the dentatorubral-pallidoluysian atrophy (DRPLA) gene. DRPLA OMIM:125370 is a progressive neurodegenerative disorder. It is caused by the expansion of a CAG repeat in the DRPLA gene on chromosome 12p. This results in an extended polyglutamine region in atrophin-1, that is thought to confer toxicity to the protein, possibly through altering its interactions with other proteins. The expansion of a CAG repeat is also the underlying defect in six other neurodegenerative disorders, including Huntington's disease. One interaction of expanded polyglutamine repeats that is thought to be pathogenic is that with the short glutamine repeat in the transcriptional coactivator CREB binding protein, CBP. This interaction draws CBP away from its usual nuclear location to the expanded polyglutamine repeat protein aggregates that are characteriztic of the polyglutamine neurodegenerative disorders. This interferes with CBP-mediated transcription and causes cytotoxicity.


Pssm-ID: 460830 [Multi-domain]  Cd Length: 991  Bit Score: 58.63  E-value: 2.16e-08
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 568987261    25 QSSYGGQPGPAAPATPYGAYNGPVPGYQQAPPQGVP--RAPPSSGAPPAS------------AAQVPCGQTTYGQFGQG- 89
Cdd:pfam03154  177 QSGAASPPSPPPPGTTQAATAGPTPSAPSVPPQGSPatSQPPNQTQSTAAphtliqqtptlhPQRLPSPHPPLQPMTQPp 256
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 568987261    90 -DIQNGPSST------AQMQRVPGSQQFGPPLAPvvsQPAVLQPYGPPPTSTQvtaqlagmqisgavAQAPPPsglgygP 162
Cdd:pfam03154  257 pPSQVSPQPLpqpslhGQMPPMPHSLQTGPSHMQ---HPVPPQPFPLTPQSSQ--------------SQVPPG------P 313
                          170       180       190       200       210       220       230
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....
gi 568987261   163 PTSLASASGNFPNSGPYGSYPQSQAPPLSQ-----AQGHPGVQPPLRSA-PPLASSFT---SPASGGPQMPSMTGLLPP 232
Cdd:pfam03154  314 SPAAPGQSQQRIHTPPSQSQLQSQQPPREQplppaPLSMPHIKPPPTTPiPQLPNPQShkhPPHLSGPSPFQMNSNLPP 392
Retinal pfam15449
Retinal protein; This family of proteins is found in the photoreceptor cells of the retina. ...
12-235 3.15e-08

Retinal protein; This family of proteins is found in the photoreceptor cells of the retina. Mutations of the gene encoding this protein have been associated with retinal disorders such as retinitis pigmentosa and late-onset progressive retinal atrophy. The function of this family of proteins is unknown, but it is likely to be important in the development and function of the retina.


Pssm-ID: 464722 [Multi-domain]  Cd Length: 1293  Bit Score: 58.25  E-value: 3.15e-08
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 568987261    12 PYGQNQPIYPGYHQSSYGGQPGPAAPAT--PYGAYNGPvpgyqQAPPQGVPRAPP-SSGAPPASAAQVPcgQTTYGQFG- 87
Cdd:pfam15449  964 LSKQPRKAIPWHHSSHTSGQSRTSEPSLarPTRGPHSP-----EAPRQSQERSPPlVRKASPTRAHWAP--RADKRHPSl 1036
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 568987261    88 ---QGDIQngpSSTAQMQRVPGsqqfgPPLAPVVSQPAVLQPYGPPPTSTQVTAQLAGMQISGAVAQAPPPSGLGYGPPT 164
Cdd:pfam15449 1037 pssHRPAQ---PSLPTVQRSPS-----PPLSPRAPSPPRSPRVLSPPTSKKRTSPPPQHKLPSPPPESPPAQHKLSSPPT 1108
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 568987261   165 SLASASGnfPNSGPygsypqSQAPPLSQAQGHPGV------QPPLRSAPPLASSFTSPASGGP---QMPSMTG--LLPPG 233
Cdd:pfam15449 1109 QRTEASS--PSSGP------SPSPPTSPSQGHKETrdsedsQAATAKASGNTCSIFCPATSSLfeaKSPFSTAhpLLPPE 1180

                   ..
gi 568987261   234 QG 235
Cdd:pfam15449 1181 AG 1182
PHA03247 PHA03247
large tegument protein UL36; Provisional
7-344 6.13e-08

large tegument protein UL36; Provisional


Pssm-ID: 223021 [Multi-domain]  Cd Length: 3151  Bit Score: 57.26  E-value: 6.13e-08
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 568987261    7 APPVPPYGQNQPIYPGYHQSSYGGQPGPAAPATPYGAYNGPVPGYQQAPPQGVPRAPPSSGAPPASAAQVPCGQTTygqf 86
Cdd:PHA03247 2709 EPAPHALVSATPLPPGPAAARQASPALPAAPAPPAVPAGPATPGGPARPARPPTTAGPPAPAPPAAPAAGPPRRLT---- 2784
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 568987261   87 gqgdIQNGPSSTAQMQRVPGSQQFGPPLAPVVSQPAVLQPYG------PPPTSTQVTAqlagmqisgavaqAPPPSGLgy 160
Cdd:PHA03247 2785 ----RPAVASLSESRESLPSPWDPADPPAAVLAPAAALPPAAspagplPPPTSAQPTA-------------PPPPPGP-- 2845
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 568987261  161 gPPTSLASASGNFPnSGPYGSYPQSQAPPLSQA-QGHPGVQPPLRSAPPLAS-SFTSPASGgPQMPSMTGLLPPGQGFGS 238
Cdd:PHA03247 2846 -PPPSLPLGGSVAP-GGDVRRRPPSRSPAAKPAaPARPPVRRLARPAVSRSTeSFALPPDQ-PERPPQPQAPPPPQPQPQ 2922
                         250       260       270       280       290       300       310       320
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 568987261  239 LPVnqanhvssppapalppgtqmtgpPVPPPPPMHSPQQPGYQLQQNGSFGPARGPQPNYESPYPGAPTFGSqpgppqPL 318
Cdd:PHA03247 2923 PPP-----------------------PPQPQPPPPPPPRPQPPLAPTTDPAGAGEPSGAVPQPWLGALVPGR------VA 2973
                         330       340
                  ....*....|....*....|....*.
gi 568987261  319 PPKRLDPDAIPSPQLNELPPQQKTRH 344
Cdd:PHA03247 2974 VPRFRVPQPAPSREAPASSTPPLTGH 2999
Atrophin-1 pfam03154
Atrophin-1 family; Atrophin-1 is the protein product of the dentatorubral-pallidoluysian ...
5-306 6.75e-08

Atrophin-1 family; Atrophin-1 is the protein product of the dentatorubral-pallidoluysian atrophy (DRPLA) gene. DRPLA OMIM:125370 is a progressive neurodegenerative disorder. It is caused by the expansion of a CAG repeat in the DRPLA gene on chromosome 12p. This results in an extended polyglutamine region in atrophin-1, that is thought to confer toxicity to the protein, possibly through altering its interactions with other proteins. The expansion of a CAG repeat is also the underlying defect in six other neurodegenerative disorders, including Huntington's disease. One interaction of expanded polyglutamine repeats that is thought to be pathogenic is that with the short glutamine repeat in the transcriptional coactivator CREB binding protein, CBP. This interaction draws CBP away from its usual nuclear location to the expanded polyglutamine repeat protein aggregates that are characteriztic of the polyglutamine neurodegenerative disorders. This interferes with CBP-mediated transcription and causes cytotoxicity.


Pssm-ID: 460830 [Multi-domain]  Cd Length: 991  Bit Score: 57.08  E-value: 6.75e-08
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 568987261     5 QSAPPVPPygqnQPIYPGYHQSSYGGqPGPAAPATPygayNGPVPGYQQAPPQGVPRAPPSS---GAPPASAAQVPCGQT 81
Cdd:pfam03154  177 QSGAASPP----SPPPPGTTQAATAG-PTPSAPSVP----PQGSPATSQPPNQTQSTAAPHTliqQTPTLHPQRLPSPHP 247
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 568987261    82 TYGQFGQG--DIQNGPSST------AQMQRVPGSQQFGPPLAPvvsQPAVLQPYGPPPTSTQvtaqlagmqisgavAQAP 153
Cdd:pfam03154  248 PLQPMTQPppPSQVSPQPLpqpslhGQMPPMPHSLQTGPSHMQ---HPVPPQPFPLTPQSSQ--------------SQVP 310
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 568987261   154 PPsglgygPPTSLASASGNFPNSGPYGSYPQSQAPPLSQA-----QGHPGVQPPLRSA-PPLASSFT---SPASGGPQMP 224
Cdd:pfam03154  311 PG------PSPAAPGQSQQRIHTPPSQSQLQSQQPPREQPlppapLSMPHIKPPPTTPiPQLPNPQShkhPPHLSGPSPF 384
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 568987261   225 SMTGLLPPG---QGFGSLPVNQANHVSSPPAPALPPGTQMTGPPVPPPPpmhspqqpgyqLQQNGSFGPARGPQPNYESP 301
Cdd:pfam03154  385 QMNSNLPPPpalKPLSSLSTHHPPSAHPPPLQLMPQSQQLPPPPAQPPV-----------LTQSQSLPPPAASHPPTSGL 453

                   ....*
gi 568987261   302 YPGAP 306
Cdd:pfam03154  454 HQVPS 458
PHA03247 PHA03247
large tegument protein UL36; Provisional
4-385 2.99e-07

large tegument protein UL36; Provisional


Pssm-ID: 223021 [Multi-domain]  Cd Length: 3151  Bit Score: 54.94  E-value: 2.99e-07
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 568987261    4 NQSAPPVPPygQNQPIYPGY----HQSSYGGQPG-PAAPATPYGAYNGPVPGYQQAPPQGVPRAPPSSGAPPASAAQVPc 78
Cdd:PHA03247 2565 DRSVPPPRP--APRPSEPAVtsraRRPDAPPQSArPRAPVDDRGDPRGPAPPSPLPPDTHAPDPPPPSPSPAANEPDPH- 2641
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 568987261   79 GQTTYGQFGQGDIQNGPSSTAQMQRVPGSQQFGPPLAPvvsqPAVLQPYGPPPTSTQVTAQLAGMQISGAVAQAPPPSGL 158
Cdd:PHA03247 2642 PPPTVPPPERPRDDPAPGRVSRPRRARRLGRAAQASSP----PQRPRRRAARPTVGSLTSLADPPPPPPTPEPAPHALVS 2717
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 568987261  159 GYGPPTSLASASGNFPNSGPYGSYPQSQAPPLSQAQGHPGVQPPLRSAPPLASSFTSPASGGPQMPSMTGLLPPGQGFGS 238
Cdd:PHA03247 2718 ATPLPPGPAAARQASPALPAAPAPPAVPAGPATPGGPARPARPPTTAGPPAPAPPAAPAAGPPRRLTRPAVASLSESRES 2797
                         250       260       270       280       290       300       310       320
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 568987261  239 LPVNQANHVSSPPAPALPPGTQMTGPPVPPPPPMHSPqqpgyqlqQNGSFGPARGPQPNYESP----YPGA-----PTFG 309
Cdd:PHA03247 2798 LPSPWDPADPPAAVLAPAAALPPAASPAGPLPPPTSA--------QPTAPPPPPGPPPPSLPLggsvAPGGdvrrrPPSR 2869
                         330       340       350       360       370       380       390       400
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 568987261  310 SQPGPPQPLPPKRLD----PDAIPSPQLNELPPQQKTRHRiDPDAIPSPIQVIEDDRNNRGSEPFVTGVRGQVPPLVTTN 385
Cdd:PHA03247 2870 SPAAKPAAPARPPVRrlarPAVSRSTESFALPPDQPERPP-QPQAPPPPQPQPQPPPPPQPQPPPPPPPRPQPPLAPTTD 2948
PRK07003 PRK07003
DNA polymerase III subunit gamma/tau;
29-219 3.75e-07

DNA polymerase III subunit gamma/tau;


Pssm-ID: 235906 [Multi-domain]  Cd Length: 830  Bit Score: 54.47  E-value: 3.75e-07
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 568987261   29 GGQPGPAAPATPYGAYNGPVP------GYQQAPPQGVPRAPPSSGAPPASAAQVPCGQTTYGQFGQGDIQNGPSSTAQMQ 102
Cdd:PRK07003  365 GGAPGGGVPARVAGAVPAPGAraaaavGASAVPAVTAVTGAAGAALAPKAAAAAAATRAEAPPAAPAPPATADRGDDAAD 444
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 568987261  103 RVPGSQQFGPPLAPVVSQPAvlqPYGPPPTSTQVTAQLAGMQISGAVAQAPPPSGLGYGPPTSLASASGNFPNSGPYGSY 182
Cdd:PRK07003  445 GDAPVPAKANARASADSRCD---ERDAQPPADSGSASAPASDAPPDAAFEPAPRAAAPSAATPAAVPDARAPAAASREDA 521
                         170       180       190
                  ....*....|....*....|....*....|....*..
gi 568987261  183 PQSQAPPlsqaqghpgvQPPLRSAPPLASSFTSPASG 219
Cdd:PRK07003  522 PAAAAPP----------APEARPPTPAAAAPAARAGG 548
PRK07764 PRK07764
DNA polymerase III subunits gamma and tau; Validated
29-224 4.60e-07

DNA polymerase III subunits gamma and tau; Validated


Pssm-ID: 236090 [Multi-domain]  Cd Length: 824  Bit Score: 54.22  E-value: 4.60e-07
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 568987261   29 GGQPGPAAPATPYGAYNGPVPGYQQAPPQGVPRAPPSSGAPPASAAQVPCGQTTYGQFGQGDIQNGPSSTAQMQRVPGSQ 108
Cdd:PRK07764  595 AGGEGPPAPASSGPPEEAARPAAPAAPAAPAAPAPAGAAAAPAEASAAPAPGVAAPEHHPKHVAVPDASDGGDGWPAKAG 674
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 568987261  109 QFGPPLAPVVSQPAVLQPYGPPPTSTQVTAQLAGMQISGAVAQAPPPSglgyGPPTSLASASGNFPNSGPYGSYPQSQAP 188
Cdd:PRK07764  675 GAAPAAPPPAPAPAAPAAPAGAAPAQPAPAPAATPPAGQADDPAAQPP----QAAQGASAPSPAADDPVPLPPEPDDPPD 750
                         170       180       190
                  ....*....|....*....|....*....|....*.
gi 568987261  189 PLSQAQGHPGVQPPLRSAPPLASSFTSPASGGPQMP 224
Cdd:PRK07764  751 PAGAPAQPPPPPAPAPAAAPAAAPPPSPPSEEEEMA 786
PRK12323 PRK12323
DNA polymerase III subunit gamma/tau;
29-240 8.50e-07

DNA polymerase III subunit gamma/tau;


Pssm-ID: 237057 [Multi-domain]  Cd Length: 700  Bit Score: 53.34  E-value: 8.50e-07
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 568987261   29 GGQPGPAAPATpygAYNGPVPgyQQAPPQGVPR-APPSSGAPPASAAQVPCGQTTYGQFGQGDIQNGPSSTA-----QMQ 102
Cdd:PRK12323  366 GQSGGGAGPAT---AAAAPVA--QPAPAAAAPAaAAPAPAAPPAAPAAAPAAAAAARAVAAAPARRSPAPEAlaaarQAS 440
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 568987261  103 RVPGSQQFGPPLAPVVS-----QPAVLQPYGPPPTSTQVTAQLAGMQISGAVAQAPPPSGlgyGPPTSLASASGNFPNSG 177
Cdd:PRK12323  441 ARGPGGAPAPAPAPAAApaaaaRPAAAGPRPVAAAAAAAPARAAPAAAPAPADDDPPPWE---ELPPEFASPAPAQPDAA 517
                         170       180       190       200       210       220
                  ....*....|....*....|....*....|....*....|....*....|....*....|...
gi 568987261  178 PYGSYPQSQAPPLSQAQghPGVQPPLRSAPPLASSFTSPASGGPQMPSMtgllPPGQGFGSLP 240
Cdd:PRK12323  518 PAGWVAESIPDPATADP--DDAFETLAPAPAAAPAPRAAAATEPVVAPR----PPRASASGLP 574
PHA03307 PHA03307
transcriptional regulator ICP4; Provisional
25-225 1.52e-06

transcriptional regulator ICP4; Provisional


Pssm-ID: 223039 [Multi-domain]  Cd Length: 1352  Bit Score: 52.48  E-value: 1.52e-06
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 568987261   25 QSSYGGQPGPAAPATPYGAYNGPVPGY---QQAPPQGVPRAPPSSGAPPASAAQVPCGQTTYGQfgQGDIQNGPSSTAqm 101
Cdd:PHA03307  108 PPGPSSPDPPPPTPPPASPPPSPAPDLsemLRPVGSPGPPPAASPPAAGASPAAVASDAASSRQ--AALPLSSPEETA-- 183
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 568987261  102 qRVPGSqqfGPPLAPVVSQPAVLQPYGPPPTSTQVTAQLAGmQISGAVAQAPPPSGLGYGPPTSLASASGNFPNSGPYGS 181
Cdd:PHA03307  184 -RAPSS---PPAEPPPSTPPAAASPRPPRRSSPISASASSP-APAPGRSAADDAGASSSDSSSSESSGCGWGPENECPLP 258
                         170       180       190       200
                  ....*....|....*....|....*....|....*....|....
gi 568987261  182 YPQSQAPPLSQAQGHPGVQPPLRsAPPLASSFTSPASGGPQMPS 225
Cdd:PHA03307  259 RPAPITLPTRIWEASGWNGPSSR-PGPASSSSSPRERSPSPSPS 301
Atrophin-1 pfam03154
Atrophin-1 family; Atrophin-1 is the protein product of the dentatorubral-pallidoluysian ...
5-354 1.67e-06

Atrophin-1 family; Atrophin-1 is the protein product of the dentatorubral-pallidoluysian atrophy (DRPLA) gene. DRPLA OMIM:125370 is a progressive neurodegenerative disorder. It is caused by the expansion of a CAG repeat in the DRPLA gene on chromosome 12p. This results in an extended polyglutamine region in atrophin-1, that is thought to confer toxicity to the protein, possibly through altering its interactions with other proteins. The expansion of a CAG repeat is also the underlying defect in six other neurodegenerative disorders, including Huntington's disease. One interaction of expanded polyglutamine repeats that is thought to be pathogenic is that with the short glutamine repeat in the transcriptional coactivator CREB binding protein, CBP. This interaction draws CBP away from its usual nuclear location to the expanded polyglutamine repeat protein aggregates that are characteriztic of the polyglutamine neurodegenerative disorders. This interferes with CBP-mediated transcription and causes cytotoxicity.


Pssm-ID: 460830 [Multi-domain]  Cd Length: 991  Bit Score: 52.46  E-value: 1.67e-06
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 568987261     5 QSAPPVPPYGQN--QPIYPGYHQssyggqpGPAAPAtPYGAYNGPVPGYQQAPPQGVPRAPPSSGA--PPASAAQVPcgq 80
Cdd:pfam03154  250 QPMTQPPPPSQVspQPLPQPSLH-------GQMPPM-PHSLQTGPSHMQHPVPPQPFPLTPQSSQSqvPPGPSPAAP--- 318
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 568987261    81 ttyGQFGQGDIQNGPSSTAQMQRVPGSQQFGP---PLAPVVSQPAVLQPYGPPPTSTQVTAQLAGMQISGAVAQAPPPSG 157
Cdd:pfam03154  319 ---GQSQQRIHTPPSQSQLQSQQPPREQPLPPaplSMPHIKPPPTTPIPQLPNPQSHKHPPHLSGPSPFQMNSNLPPPPA 395
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 568987261   158 LgygppTSLASASGNFPNSG---PYGSYPQSQ---APP-----LSQAQGHPgvqPPLRSAPPLASSFTSPA-SGGPQMPS 225
Cdd:pfam03154  396 L-----KPLSSLSTHHPPSAhppPLQLMPQSQqlpPPPaqppvLTQSQSLP---PPAASHPPTSGLHQVPSqSPFPQHPF 467
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 568987261   226 MTGLLPPgqgfgslpvnqanhVSSPPAPALPPGTQMTGppvpppppmhspqqpgyqLQQNGSFGPARGpqpnyeSPYPGA 305
Cdd:pfam03154  468 VPGGPPP--------------ITPPSGPPTSTSSAMPG------------------IQPPSSASVSSS------GPVPAA 509
                          330       340       350       360
                   ....*....|....*....|....*....|....*....|....*....
gi 568987261   306 PTfgsqpgppQPLPPKRLDPDAIPSPQLNELPPQQKTRHRIDPDAIPSP 354
Cdd:pfam03154  510 VS--------CPLPPVQIKEEALDEAEEPESPPPPPRSPSPEPTVVNTP 550
PHA03377 PHA03377
EBNA-3C; Provisional
8-197 2.64e-06

EBNA-3C; Provisional


Pssm-ID: 177614 [Multi-domain]  Cd Length: 1000  Bit Score: 51.98  E-value: 2.64e-06
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 568987261    8 PPVPPYGQNQPIYPGYHQSSYGGQPGPAAPATPYGAYNGPVPGYQQAP----PQGvPRAPPSSGAPPASAAQVPCGQTTY 83
Cdd:PHA03377  770 PQAPYLGYQEPQAQGVQVSSYPGYAGPWGLRAQHPRYRHSWAYWSQYPghghPQG-PWAPRPPHLPPQWDGSAGHGQDQV 848
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 568987261   84 GQFGQGDIQNGPSS--TAQMQRVPGSQQFGPPLAPVVSQPAVLQPYGPPPTstqvtaqlagmqisgavaQAPPPSGLGYG 161
Cdd:PHA03377  849 SQFPHLQSETGPPRlqLSQVPQLPYSQTLVSSSAPSWSSPQPRAPIRPIPT------------------RFPPPPMPLQD 910
                         170       180       190
                  ....*....|....*....|....*....|....*..
gi 568987261  162 PPTSLASASGNFPNSGPYGS-YPQSQAPPLSQAQGHP 197
Cdd:PHA03377  911 SMAVGCDSSGTACPSMPFASdYSQGAFTPLDINAQTP 947
PHA03377 PHA03377
EBNA-3C; Provisional
21-232 3.47e-06

EBNA-3C; Provisional


Pssm-ID: 177614 [Multi-domain]  Cd Length: 1000  Bit Score: 51.59  E-value: 3.47e-06
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 568987261   21 PGYHQSSYGGQPGPAAPATPYGAYNGPVP------GYQQAPPQGVPRAP-PSSGAPPASAAQVPCGQTTYgqfgqgdiqn 93
Cdd:PHA03377  741 PPSHQAPYSGHEEPQAQQAPYPGYWEPRPpqapylGYQEPQAQGVQVSSyPGYAGPWGLRAQHPRYRHSW---------- 810
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 568987261   94 gpsstAQMQRVPGsqqFGPPLAPVVSQPAVLQPYGPPptstqvTAQLAGMQISGAVAQAPPPsglgyGPPTslasasgnf 173
Cdd:PHA03377  811 -----AYWSQYPG---HGHPQGPWAPRPPHLPPQWDG------SAGHGQDQVSQFPHLQSET-----GPPR--------- 862
                         170       180       190       200       210
                  ....*....|....*....|....*....|....*....|....*....|....*....
gi 568987261  174 pnsgpygsypqsqaPPLSQAQGHPGVQPPLRSAPPlasSFTSPASGGPQMPSMTGLLPP 232
Cdd:PHA03377  863 --------------LQLSQVPQLPYSQTLVSSSAP---SWSSPQPRAPIRPIPTRFPPP 904
Med15 pfam09606
ARC105 or Med15 subunit of Mediator complex non-fungal; The approx. 70 residue Med15 domain of ...
71-240 8.72e-06

ARC105 or Med15 subunit of Mediator complex non-fungal; The approx. 70 residue Med15 domain of the ARC-Mediator co-activator is a three-helix bundle with marked similarity to the KIX domain. The sterol regulatory element binding protein (SREBP) family of transcription activators use the ARC105 subunit to activate target genes in the regulation of cholesterol and fatty acid homeostasis. In addition, Med15 is a critical transducer of gene activation signals that control early metazoan development.


Pssm-ID: 312941 [Multi-domain]  Cd Length: 732  Bit Score: 50.01  E-value: 8.72e-06
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 568987261    71 ASAAQVPCGQTTYGQFGQGDIQNGPSSTAqMQRVPGSQQFGPPLAPVVS--QPAVLQPYGPPPTSTQVTAQL--AGMQIS 146
Cdd:pfam09606   57 AAQQQQPQGGQGNGGMGGGQQGMPDPINA-LQNLAGQGTRPQMMGPMGPgpGGPMGQQMGGPGTASNLLASLgrPQMPMG 135
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 568987261   147 GA--------VAQAPPPSGLGYGPPTSLASASGNFPNS-GPYGSYPQSQAP-PLSQAQGHPGVQPPLRSAPPLASSFTSP 216
Cdd:pfam09606  136 GAgfpsqmsrVGRMQPGGQAGGMMQPSSGQPGSGTPNQmGPNGGPGQGQAGgMNGGQQGPMGGQMPPQMGVPGMPGPADA 215
                          170       180
                   ....*....|....*....|....
gi 568987261   217 ASGGPQMPSMTGLLPPGQGFGSLP 240
Cdd:pfam09606  216 GAQMGQQAQANGGMNPQQMGGAPN 239
PRK07764 PRK07764
DNA polymerase III subunits gamma and tau; Validated
29-225 1.10e-05

DNA polymerase III subunits gamma and tau; Validated


Pssm-ID: 236090 [Multi-domain]  Cd Length: 824  Bit Score: 49.60  E-value: 1.10e-05
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 568987261   29 GGQPGPAAPATPygayngpvPGYQQAPPQGVPRAPPSSGAPPASAAQVPCGQTTYGQFGQGDiqnGPSSTAQMQRVPGSQ 108
Cdd:PRK07764  589 GPAPGAAGGEGP--------PAPASSGPPEEAARPAAPAAPAAPAAPAPAGAAAAPAEASAA---PAPGVAAPEHHPKHV 657
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 568987261  109 QFGPPLAPVVSQPAVLQPygPPPTSTQVTAQLAGMQISGAVAQAPPPSGLGYGPPTSLASASGNFPNSGPYGSYPQS--- 185
Cdd:PRK07764  658 AVPDASDGGDGWPAKAGG--AAPAAPPPAPAPAAPAAPAGAAPAQPAPAPAATPPAGQADDPAAQPPQAAQGASAPSpaa 735
                         170       180       190       200
                  ....*....|....*....|....*....|....*....|..
gi 568987261  186 --QAPPLSQAQGHPGVQPPLRSAPPLASSFTSPASGGPQMPS 225
Cdd:PRK07764  736 ddPVPLPPEPDDPPDPAGAPAQPPPPPAPAPAAAPAAAPPPS 777
Med15 pfam09606
ARC105 or Med15 subunit of Mediator complex non-fungal; The approx. 70 residue Med15 domain of ...
4-297 1.81e-05

ARC105 or Med15 subunit of Mediator complex non-fungal; The approx. 70 residue Med15 domain of the ARC-Mediator co-activator is a three-helix bundle with marked similarity to the KIX domain. The sterol regulatory element binding protein (SREBP) family of transcription activators use the ARC105 subunit to activate target genes in the regulation of cholesterol and fatty acid homeostasis. In addition, Med15 is a critical transducer of gene activation signals that control early metazoan development.


Pssm-ID: 312941 [Multi-domain]  Cd Length: 732  Bit Score: 48.85  E-value: 1.81e-05
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 568987261     4 NQSAPPVPPYGQNQPIYPGYHQSSYGGQPGP---AAPATPYGAYNGPVPGYQQAPPQGVPRAPPSSGAP-------PASA 73
Cdd:pfam09606  112 QQMGGPGTASNLLASLGRPQMPMGGAGFPSQmsrVGRMQPGGQAGGMMQPSSGQPGSGTPNQMGPNGGPgqgqaggMNGG 191
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 568987261    74 AQVPCGQTTYGQFGQGdIQNGPSST-AQMQRVPGSQQFGPPLAPVVSQPAVLQPYGPP-PTSTQVTAQLAG---MQISGA 148
Cdd:pfam09606  192 QQGPMGGQMPPQMGVP-GMPGPADAgAQMGQQAQANGGMNPQQMGGAPNQVAMQQQQPqQQGQQSQLGMGInqmQQMPQG 270
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 568987261   149 VAQAPPPSGLG--YGPPTSLASASGNFPNSGPYGSYPQSQAPPLSQAQG--HPGVQPPLRSAPPLASSFTSPASGGPQMP 224
Cdd:pfam09606  271 VGGGAGQGGPGqpMGPPGQQPGAMPNVMSIGDQNNYQQQQTRQQQQQQGgnHPAAHQQQMNQSVGQGGQVVALGGLNHLE 350
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 568987261   225 SMTGLLPPGQGFGSLPVNQANHVSSPPAPALPPGTQMTGP---PVPPPPPMHSPQQPGYQLQQNGSFG----PARGPQPN 297
Cdd:pfam09606  351 TWNPGNFGGLGANPMQRGQPGMMSSPSPVPGQQVRQVTPNqfmRQSPQPSVPSPQGPGSQPPQSHPGGmipsPALIPSPS 430
PHA03378 PHA03378
EBNA-3B; Provisional
6-221 3.19e-05

EBNA-3B; Provisional


Pssm-ID: 223065 [Multi-domain]  Cd Length: 991  Bit Score: 48.14  E-value: 3.19e-05
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 568987261    6 SAPPVPPYGQNQPiypgyhQSSYGGQPGPAAP---ATPYGAYNGPVPGYQQAP-----PQGVP-RAPPSSGAPPASAaqv 76
Cdd:PHA03378  705 RPPAAPPGRAQRP------AAATGRARPPAAApgrARPPAAAPGRARPPAAAPgrarpPAAAPgRARPPAAAPGAPT--- 775
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 568987261   77 PCGQTTYGQFGQGDIQNGPSSTAQMQRVPGSQQFGPPLAPVVSQPA-----VLQPYGPPPTSTQVTAQLAGMQISGAVAQ 151
Cdd:PHA03378  776 PQPPPQAPPAPQQRPRGAPTPQPPPQAGPTSMQLMPRAAPGQQGPTkqilrQLLTGGVKRGRPSLKKPAALERQAAAGPT 855
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 568987261  152 APPPSGLG---------YGPPTSLASASGNFPNSGPYGSYPQSQAPplSQAQG--------HPGVQPPLRSAPPLASSFT 214
Cdd:PHA03378  856 PSPGSGTSdkivqapvfYPPVLQPIQVMRQLGSVRAAAASTVTQAP--TEYTGerrgvgpmHPTDIPPSKRAKTDAYVES 933

                  ....*..
gi 568987261  215 SPASGGP 221
Cdd:PHA03378  934 QPPHGGQ 940
PRK07764 PRK07764
DNA polymerase III subunits gamma and tau; Validated
28-184 5.10e-05

DNA polymerase III subunits gamma and tau; Validated


Pssm-ID: 236090 [Multi-domain]  Cd Length: 824  Bit Score: 47.67  E-value: 5.10e-05
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 568987261   28 YGGQPGPAAPATPYGAYNGPVPgyqQAPPQGVPRAPPSSGAPPASAAQVPcgqttygqfgqgdiqnGPSSTAQMQRVPGS 107
Cdd:PRK07764  387 VAGGAGAPAAAAPSAAAAAPAA---APAPAAAAPAAAAAPAPAAAPQPAP----------------APAPAPAPPSPAGN 447
                          90       100       110       120       130       140       150
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*..
gi 568987261  108 QQFGPPLAPVVSQPAVLQPyGPPPTSTQVTAQLAGMQISGAVAQAPPPsglgyGPPTSLASASGNFPNSGPYGSYPQ 184
Cdd:PRK07764  448 APAGGAPSPPPAAAPSAQP-APAPAAAPEPTAAPAPAPPAAPAPAAAP-----AAPAAPAAPAGADDAATLRERWPE 518
Gag_spuma pfam03276
Spumavirus gag protein;
92-241 5.18e-05

Spumavirus gag protein;


Pssm-ID: 460872 [Multi-domain]  Cd Length: 614  Bit Score: 47.43  E-value: 5.18e-05
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 568987261    92 QNGPSSTAQMQRVPGSQQFGPPLAPVVSQPAVLQPYGPPPTSTQvtaqlagMQISGAVAQAPPPSGLGYGPPTSLASasg 171
Cdd:pfam03276  175 LAEISPGAQGGIPPGASFSGLPSLPAIGGIHLPAIPGIHARAPP-------GNIARSLGDDIMPSLGDAGMPQPRFA--- 244
                           90       100       110       120       130       140       150
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|..
gi 568987261   172 nFPNSGPYGSYPQSqaPPLSQAQGHPGVQP--PLRSApPLASSFTSPASGGPQMPSMTGLLPPGQGFGSLPV 241
Cdd:pfam03276  245 -FHPGNPFAEAEGH--PFAEAEGERPRDIPraPRIDA-PSAPAIPAIQPIAPPMIPPIGAPIPIPHGASIPG 312
PHA03247 PHA03247
large tegument protein UL36; Provisional
6-310 8.77e-05

large tegument protein UL36; Provisional


Pssm-ID: 223021 [Multi-domain]  Cd Length: 3151  Bit Score: 46.86  E-value: 8.77e-05
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 568987261    6 SAPPVPPYGQNQPIYPGYHQSSYG----GQPGPAAPATPYGAYNGPVPGYQQAPPQGVPRAPPSSG---APPASAAQVPC 78
Cdd:PHA03247 2769 PAPPAAPAAGPPRRLTRPAVASLSesreSLPSPWDPADPPAAVLAPAAALPPAASPAGPLPPPTSAqptAPPPPPGPPPP 2848
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 568987261   79 GQTTYGQFGQGdiqnGPSSTAQMQRVPGSQQFGPPLAPV--VSQPAVLQPYGP---PPTSTQVTAQLAGMQISGAVAQAP 153
Cdd:PHA03247 2849 SLPLGGSVAPG----GDVRRRPPSRSPAAKPAAPARPPVrrLARPAVSRSTESfalPPDQPERPPQPQAPPPPQPQPQPP 2924
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 568987261  154 PPSGLGYGPPTSLASASGNFPNSGPYG-SYPQSQAPPLSQAQGHPG-VQPPLRSAPPLASSFTSPAsggPQMPSMTGLLP 231
Cdd:PHA03247 2925 PPPQPQPPPPPPPRPQPPLAPTTDPAGaGEPSGAVPQPWLGALVPGrVAVPRFRVPQPAPSREAPA---SSTPPLTGHSL 3001
                         250       260       270       280       290       300       310       320
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 568987261  232 PGqgFGSLPVNQANHVSSPPAPALPPGTQMTGPPVPPPPPMHSPQQPGYQLQQnGSFGPARGPQ--PNYESPYPGAPTFG 309
Cdd:PHA03247 3002 SR--VSSWASSLALHEETDPPPVSLKQTLWPPDDTEDSDADSLFDSDSERSDL-EALDPLPPEPhdPFAHEPDPATPEAG 3078

                  .
gi 568987261  310 S 310
Cdd:PHA03247 3079 A 3079
PHA03378 PHA03378
EBNA-3B; Provisional
11-234 9.76e-05

EBNA-3B; Provisional


Pssm-ID: 223065 [Multi-domain]  Cd Length: 991  Bit Score: 46.60  E-value: 9.76e-05
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 568987261   11 PPYGQNQPIYPGYHQSSY---GGQPGPAAPATPYGAYNGPVPGYQQAPPQGVPRAP-------------PSSGAPPAsaA 74
Cdd:PHA03378  580 PTTSQLASSAPSYAQTPWpvpHPSQTPEPPTTQSHIPETSAPRQWPMPLRPIPMRPlrmqpitfnvlvfPTPHQPPQ--V 657
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 568987261   75 QVPCGQTTYGQFGQGDIQNGPSSTAQMQRV---PGSQQfGPPLAPVVSQPavlqPYGPPPTSTQVTAQLAGMQISGAV-- 149
Cdd:PHA03378  658 EITPYKPTWTQIGHIPYQPSPTGANTMLPIqwaPGTMQ-PPPRAPTPMRP----PAAPPGRAQRPAAATGRARPPAAApg 732
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 568987261  150 AQAPPPSGLGYGPPTSLASASGNFPNSGPYGSYPQSQAPPLSQAQGHPGVQPPLRSAPPLASSFTSPASGGPQMPSMTGL 229
Cdd:PHA03378  733 RARPPAAAPGRARPPAAAPGRARPPAAAPGRARPPAAAPGAPTPQPPPQAPPAPQQRPRGAPTPQPPPQAGPTSMQLMPR 812

                  ....*
gi 568987261  230 LPPGQ 234
Cdd:PHA03378  813 AAPGQ 817
PAT1 pfam09770
Topoisomerase II-associated protein PAT1; Members of this family are necessary for accurate ...
31-194 1.33e-04

Topoisomerase II-associated protein PAT1; Members of this family are necessary for accurate chromosome transmission during cell division.


Pssm-ID: 401645 [Multi-domain]  Cd Length: 846  Bit Score: 46.18  E-value: 1.33e-04
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 568987261    31 QPGPAAPATPYGAYNGPVPGYQQAPPQgvprappssgappASAAQVPCGQTTYGQFGQGdiQNGPSStaQMQRvPGSQQF 110
Cdd:pfam09770  213 QPAPAPAQPPAAPPAQQAQQQQQFPPQ-------------IQQQQQPQQQPQQPQQHPG--QGHPVT--ILQR-PQSPQP 274
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 568987261   111 GPPlAPVVSQPAVLQPYGPPPTSTQVTAQLAGMQ-ISGAVAQAP--PPSGLGYGPPTSLASASGNFPNSGPYGSYPQsQA 187
Cdd:pfam09770  275 DPA-QPSIQPQAQQFHQQPPPVPVQPTQILQNPNrLSAARVGYPqnPQPGVQPAPAHQAHRQQGSFGRQAPIITHPQ-QL 352

                   ....*..
gi 568987261   188 PPLSQAQ 194
Cdd:pfam09770  353 AQLSEEE 359
PABP-1234 TIGR01628
polyadenylate binding protein, human types 1, 2, 3, 4 family; These eukaryotic proteins ...
1-118 2.17e-04

polyadenylate binding protein, human types 1, 2, 3, 4 family; These eukaryotic proteins recognize the poly-A of mRNA and consists of four tandem RNA recognition domains at the N-terminus (rrm: pfam00076) followed by a PABP-specific domain (pfam00658) at the C-terminus. The protein is involved in the transport of mRNA's from the nucleus to the cytoplasm. There are four paralogs in Homo sapiens which are expressed in testis, platelets, broadly expressed and of unknown tissue range.


Pssm-ID: 130689 [Multi-domain]  Cd Length: 562  Bit Score: 45.18  E-value: 2.17e-04
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 568987261     1 MNVNQSAPPVP---PYGQNQPiYPGYHqssyGGQPGPAAPAtPYGAYNGPVPGYQQA----PPQGVPRAPPSSGAPPASA 73
Cdd:TIGR01628  403 QGPQQQFNGQPlgwPRMSMMP-TPMGP----GGPLRPNGLA-PMNAVRAPSRNAQNAaqkpPMQPVMYPPNYQSLPLSQD 476
                           90       100       110       120
                   ....*....|....*....|....*....|....*....|....*..
gi 568987261    74 AQVPcgQTTYGQFGQGD--IQNGPSSTAQMQRvpgsQQFGPPLAPVV 118
Cdd:TIGR01628  477 LPQP--QSTASQGGQNKklAQVLASATPQMQK----QVLGERLFPLV 517
PRK10263 PRK10263
DNA translocase FtsK; Provisional
11-209 2.84e-04

DNA translocase FtsK; Provisional


Pssm-ID: 236669 [Multi-domain]  Cd Length: 1355  Bit Score: 45.08  E-value: 2.84e-04
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 568987261   11 PPYGQNQPIYPGyhQSSYGGQPGPAAPATPYGAYNGPVPGYQQAPPqgVPRAPPSSGAPPASAAQVPCGQT--------- 81
Cdd:PRK10263  302 PEYDEYDPLLNG--APITEPVAVAAAATTATQSWAAPVEPVTQTPP--VASVDVPPAQPTVAWQPVPGPQTgepviapap 377
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 568987261   82 -TYGQFGQGDIQNGPSSTAQMQRVPGSQQFGPPLAPVVSQPAVLQPYGPPPTSTQVTAQLAGMQISGAVAQAPPPSGLGY 160
Cdd:PRK10263  378 eGYPQQSQYAQPAVQYNEPLQQPVQPQQPYYAPAAEQPAQQPYYAPAPEQPAQQPYYAPAPEQPVAGNAWQAEEQQSTFA 457
                         170       180       190       200
                  ....*....|....*....|....*....|....*....|....*....
gi 568987261  161 GPPTSLASASGNFPNSGPYGSYPQSQAPPLSQAQGHPGVQPPLRSAPPL 209
Cdd:PRK10263  458 PQSTYQTEQTYQQPAAQEPLYQQPQPVEQQPVVEPEPVVEETKPARPPL 506
Atrophin-1 pfam03154
Atrophin-1 family; Atrophin-1 is the protein product of the dentatorubral-pallidoluysian ...
119-306 2.94e-04

Atrophin-1 family; Atrophin-1 is the protein product of the dentatorubral-pallidoluysian atrophy (DRPLA) gene. DRPLA OMIM:125370 is a progressive neurodegenerative disorder. It is caused by the expansion of a CAG repeat in the DRPLA gene on chromosome 12p. This results in an extended polyglutamine region in atrophin-1, that is thought to confer toxicity to the protein, possibly through altering its interactions with other proteins. The expansion of a CAG repeat is also the underlying defect in six other neurodegenerative disorders, including Huntington's disease. One interaction of expanded polyglutamine repeats that is thought to be pathogenic is that with the short glutamine repeat in the transcriptional coactivator CREB binding protein, CBP. This interaction draws CBP away from its usual nuclear location to the expanded polyglutamine repeat protein aggregates that are characteriztic of the polyglutamine neurodegenerative disorders. This interferes with CBP-mediated transcription and causes cytotoxicity.


Pssm-ID: 460830 [Multi-domain]  Cd Length: 991  Bit Score: 45.14  E-value: 2.94e-04
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 568987261   119 SQPAVLQPYGPPPtstqVTAQLAGMQISGAVAQAPPPSGLGYGPPTSlasasgnfPNSGPYGSYPQSQAPPLSQAQGHPG 198
Cdd:pfam03154  169 TQPPVLQAQSGAA----SPPSPPPPGTTQAATAGPTPSAPSVPPQGS--------PATSQPPNQTQSTAAPHTLIQQTPT 236
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 568987261   199 VQP--------PLRSAPPLASSFTSPASGGPQmPSMTGLLPP-GQGFGSLPVNQANHVSSPPAPALPPGTQMTGPPVPPP 269
Cdd:pfam03154  237 LHPqrlpsphpPLQPMTQPPPPSQVSPQPLPQ-PSLHGQMPPmPHSLQTGPSHMQHPVPPQPFPLTPQSSQSQVPPGPSP 315
                          170       180       190
                   ....*....|....*....|....*....|....*..
gi 568987261   270 PPMHSPQQpgyQLQQNGSFGPARGPQPNYESPYPGAP 306
Cdd:pfam03154  316 AAPGQSQQ---RIHTPPSQSQLQSQQPPREQPLPPAP 349
PRK07764 PRK07764
DNA polymerase III subunits gamma and tau; Validated
6-156 3.60e-04

DNA polymerase III subunits gamma and tau; Validated


Pssm-ID: 236090 [Multi-domain]  Cd Length: 824  Bit Score: 44.59  E-value: 3.60e-04
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 568987261    6 SAPPVPPYGQNQPIYPGYHQSSYGGQPGPAAPATPYGAYNG-PVPGYQQAPPQGVPRAPPSSGAPPASAAQVPcgQTTYG 84
Cdd:PRK07764  634 AAPAEASAAPAPGVAAPEHHPKHVAVPDASDGGDGWPAKAGgAAPAAPPPAPAPAAPAAPAGAAPAQPAPAPA--ATPPA 711
                          90       100       110       120       130       140       150
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|..
gi 568987261   85 QFGQGDIQNGPSSTAQMQRVPGSQQFGPPLAPVVSQPAVLQPYGPPPTSTQVTAQLAGmqiSGAVAQAPPPS 156
Cdd:PRK07764  712 GQADDPAAQPPQAAQGASAPSPAADDPVPLPPEPDDPPDPAGAPAQPPPPPAPAPAAA---PAAAPPPSPPS 780
Treacle pfam03546
Treacher Collins syndrome protein Treacle;
34-221 4.76e-04

Treacher Collins syndrome protein Treacle;


Pssm-ID: 460967 [Multi-domain]  Cd Length: 531  Bit Score: 44.29  E-value: 4.76e-04
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 568987261    34 PAAPATPYGAYNGPVPGYQQA-------PPQGVPRAPPSSGAPPASAAQVpcgqttygqfGQGDIQNGPSSTAQmqrvpG 106
Cdd:pfam03546   39 PAAKTPLQAKPSGKTPQVRAAsapakesPRKGAPPVPPGKTGPAAAQAQA----------GKPEEDSESSSEES-----D 103
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 568987261   107 SQQFGPPLAPVVSQPAVLQPYGPPPtstQV-TAQLAGMQISGAVAQAPPPSGLGYGPPTSLASASGNFPNSGPYGSYPQS 185
Cdd:pfam03546  104 SDGETPAAATLTTSPAQVKPLGKNS---QVrPASTVGKGPSGKGANPAPPGKAGSAAPLVQVGKKEEDSESSSEESDSEG 180
                          170       180       190
                   ....*....|....*....|....*....|....*...
gi 568987261   186 QAPPLSQAQGHPGVQPPLRSA--PPLASSFTSPASGGP 221
Cdd:pfam03546  181 EAPPAATQAKPSGKILQVRPAsgPAKGAAPAPPQKAGP 218
PRK07003 PRK07003
DNA polymerase III subunit gamma/tau;
6-208 7.86e-04

DNA polymerase III subunit gamma/tau;


Pssm-ID: 235906 [Multi-domain]  Cd Length: 830  Bit Score: 43.68  E-value: 7.86e-04
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 568987261    6 SAPPVPPYGQNQPIYPGYHQSSYGGQPGPAAPATPYGAYNGPVPGYQQAPPQGVPR---APPSSGAPPASAAQVPCGQTT 82
Cdd:PRK07003  420 ATRAEAPPAAPAPPATADRGDDAADGDAPVPAKANARASADSRCDERDAQPPADSGsasAPASDAPPDAAFEPAPRAAAP 499
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 568987261   83 YGQFGQGDIQNGPSSTAQMQRVPGSQQFGPPLAPVVSQPAVlqpyGPPPTSTQVTAQL-----AGMQIS-----GAVAQA 152
Cdd:PRK07003  500 SAATPAAVPDARAPAAASREDAPAAAAPPAPEARPPTPAAA----APAARAGGAAAALdvlrnAGMRVSsdrgaRAAAAA 575
                         170       180       190       200       210
                  ....*....|....*....|....*....|....*....|....*....|....*....
gi 568987261  153 PPPSGLGYGPPTSLASASGNFPNSGPYGSYPQ---SQAPPLSQAQGHPGVQPPLRSAPP 208
Cdd:PRK07003  576 KPAAAPAAAPKPAAPRVAVQVPTPRARAATGDappNGAARAEQAAESRGAPPPWEDIPP 634
PRK14951 PRK14951
DNA polymerase III subunits gamma and tau; Provisional
32-173 8.61e-04

DNA polymerase III subunits gamma and tau; Provisional


Pssm-ID: 237865 [Multi-domain]  Cd Length: 618  Bit Score: 43.55  E-value: 8.61e-04
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 568987261   32 PGPAAPATPYGAYNGPVPGYQQAP---PQGVPRAPPSSGAPPASAAQVPCGQTTYgqfgqgdIQNGPSSTAQMQRvpgsq 108
Cdd:PRK14951  366 PAAAAEAAAPAEKKTPARPEAAAPaaaPVAQAAAAPAPAAAPAAAASAPAAPPAA-------APPAPVAAPAAAA----- 433
                          90       100       110       120       130       140
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*
gi 568987261  109 qfGPPLAPVVSQPAVLQPYGPPPTSTQVTAQLAGMQISGAVAQAPPPSGLGYGPPTSLASASGNF 173
Cdd:PRK14951  434 --PAAAPAAAPAAVALAPAPPAQAAPETVAIPVRVAPEPAVASAAPAPAAAPAAARLTPTEEGDV 496
COG3416 COG3416
Uncharacterized conserved protein, DUF2076 domain [Function unknown];
14-67 1.33e-03

Uncharacterized conserved protein, DUF2076 domain [Function unknown];


Pssm-ID: 442642 [Multi-domain]  Cd Length: 237  Bit Score: 41.55  E-value: 1.33e-03
                          10        20        30        40        50
                  ....*....|....*....|....*....|....*....|....*....|....
gi 568987261   14 GQNQPIYPGYHQSSYGGQPGPAAPATPYGAYNGPVPGYQQaPPQGVPRAPPSSG 67
Cdd:COG3416    91 GGGQRPPPAPQPSQPGPQQQPAPPSGPWGQAAPQQPGYGQ-PQYGQPAAGPSGG 143
Gly-rich_Ago1 pfam12764
Glycine-rich region of argonaut; This domain is often found at the very N-terminal of ...
9-105 1.45e-03

Glycine-rich region of argonaut; This domain is often found at the very N-terminal of argonaut-like proteins.


Pssm-ID: 463691 [Multi-domain]  Cd Length: 103  Bit Score: 39.16  E-value: 1.45e-03
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 568987261     9 PVPPYGQNQPIYPGYHQSSYGGQPGPAAPATPygayngPVPGYQQAPP---QGVPRAPPSSGAPPAS--AAQVPCGQTTY 83
Cdd:pfam12764    8 PRPRGGPPQQYYGGGRGGSGGRGPPSGGPSRP------PVPELHQATQvqyQAVVTQPSPSGAGSSSqpTAEVSTGQVAQ 81
                           90       100
                   ....*....|....*....|..
gi 568987261    84 gQFGQGDIQNGPSSTAQMQRVP 105
Cdd:pfam12764   82 -QFQQLSVQDQSSSSQAIQPAP 102
PAT1 pfam09770
Topoisomerase II-associated protein PAT1; Members of this family are necessary for accurate ...
47-341 1.79e-03

Topoisomerase II-associated protein PAT1; Members of this family are necessary for accurate chromosome transmission during cell division.


Pssm-ID: 401645 [Multi-domain]  Cd Length: 846  Bit Score: 42.33  E-value: 1.79e-03
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 568987261    47 PVPGYQQAPPQGVPRAPPSSGAPPASAAQVPCGQTTYGQFGQ-GDIQngpsSTAQMQRVPGSQQFGPPLAPVVS-QPAVL 124
Cdd:pfam09770  111 AAQSSAQPPASSLPQYQYASQQSQQPSKPVRTGYEKYKEPEPiPDLQ----VDASLWGVAPKKAAAPAPAPQPAaQPASL 186
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 568987261   125 QPYGPPPTSTQ-VTAQLAgMQISGAVAQAPPPSglgYGPPTSLASASGNFPNSGPygsyPQSQAPPLSQAQGHPGVQPPL 203
Cdd:pfam09770  187 PAPSRKMMSLEeVEAAMR-AQAKKPAQQPAPAP---AQPPAAPPAQQAQQQQQFP----PQIQQQQQPQQQPQQPQQHPG 258
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 568987261   204 RSAPPlaSSFTSPASGGPQmPSMTGLLPPGQGFGSLPVNQANHVssppapalppgTQmtgppvpppppmhspqqpgyQLQ 283
Cdd:pfam09770  259 QGHPV--TILQRPQSPQPD-PAQPSIQPQAQQFHQQPPPVPVQP-----------TQ--------------------ILQ 304
                          250       260       270       280       290
                   ....*....|....*....|....*....|....*....|....*....|....*...
gi 568987261   284 QNGSFGPARGPQPNYesPYPGAPTFGSQPGPPQPLPPKRLDPDAIPSPQLNELPPQQK 341
Cdd:pfam09770  305 NPNRLSAARVGYPQN--PQPGVQPAPAHQAHRQQGSFGRQAPIITHPQQLAQLSEEEK 360
hnRNP-R-Q TIGR01648
heterogeneous nuclear ribonucleoprotein R, Q family; Sequences in this subfamily include the ...
11-201 1.82e-03

heterogeneous nuclear ribonucleoprotein R, Q family; Sequences in this subfamily include the human heterogeneous nuclear ribonucleoproteins (hnRNP) R, Q, and APOBEC-1 complementation factor (aka APOBEC-1 stimulating protein). These proteins contain three RNA recognition domains (rrm: pfam00076) and a somewhat variable C-terminal domain.


Pssm-ID: 273732 [Multi-domain]  Cd Length: 578  Bit Score: 42.29  E-value: 1.82e-03
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 568987261    11 PPYGQN--QPIYPGYHQSSyGGQPGPaapatpygaYNGPVPGYQQAPPQGVPRAPPSSGAPPASAAQvpcgqttYGQFGQ 88
Cdd:TIGR01648  387 PPYGYEayYGDYYGYHDYR-GKYEDK---------YYGYDPGMELTPMNPVRGKPGGRGGRPAIPPP-------RGRKNG 449
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 568987261    89 GdiqnGPSSTAQMQRVPGSQQFGPPLApvVSQPAVLQPYGPPPTSTQVTaqlagmqisGAVAQAPPPSGLGYGPPTSlaS 168
Cdd:TIGR01648  450 A----PPPAIGQDGRQLFLYKITIPAG--YSQRPAPHPLGPPRGSAFVR---------GARGGPAQYQQRGRGSRTS--R 512
                          170       180       190
                   ....*....|....*....|....*....|....*...
gi 568987261   169 ASGNFPNSGPYG-SYPQSQAP----PLSQAQGHPGVQP 201
Cdd:TIGR01648  513 GNGRGGTAGGKRkAFDGYAQPdataRQTNNQQNWGAQP 550
PHA03247 PHA03247
large tegument protein UL36; Provisional
3-218 1.85e-03

large tegument protein UL36; Provisional


Pssm-ID: 223021 [Multi-domain]  Cd Length: 3151  Bit Score: 42.62  E-value: 1.85e-03
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 568987261    3 VNQSAPPVPPYGQNQPiypgyhQSSYGGQPGPAAPATPYGAYNGPVPGYQQAPPQGVPRaPPSSGAPPASAAQVPCGQTT 82
Cdd:PHA03247 2886 LARPAVSRSTESFALP------PDQPERPPQPQAPPPPQPQPQPPPPPQPQPPPPPPPR-PQPPLAPTTDPAGAGEPSGA 2958
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 568987261   83 YGQFGQGDIQNGpsstaqmqRVPGSQQFGPPLAPVVSQPAvlqPYGPPPTSTQVTAqLAGMQISGA--VAQAPPPSGL-- 158
Cdd:PHA03247 2959 VPQPWLGALVPG--------RVAVPRFRVPQPAPSREAPA---SSTPPLTGHSLSR-VSSWASSLAlhEETDPPPVSLkq 3026
                         170       180       190       200       210       220
                  ....*....|....*....|....*....|....*....|....*....|....*....|
gi 568987261  159 GYGPPTSLASASGNFPNSGPYGSYPQSQAPPLSQAQGHPGVQPPlRSAPPLASSFTSPAS 218
Cdd:PHA03247 3027 TLWPPDDTEDSDADSLFDSDSERSDLEALDPLPPEPHDPFAHEP-DPATPEAGARESPSS 3085
PLN03209 PLN03209
translocon at the inner envelope of chloroplast subunit 62; Provisional
29-247 2.48e-03

translocon at the inner envelope of chloroplast subunit 62; Provisional


Pssm-ID: 178748 [Multi-domain]  Cd Length: 576  Bit Score: 41.84  E-value: 2.48e-03
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 568987261   29 GGQPGPAAPATPygayNGPVPGYQQAPPQG---VPR-----------APPSSGAP--PASAAQVPCGQTTYGQFGQGDIQ 92
Cdd:PLN03209  338 GPKPVPTKPVTP----EAPSPPIEEEPPQPkavVPRplspytayedlKPPTSPIPtpPSSSPASSKSVDAVAKPAEPDVV 413
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 568987261   93 NGP---SSTAQMQRVPGSQQFGPPLAPVVSQPAVLQPYGPPPTstqvtaqlagmqisgavaqapPPSGLGygPPTSLASA 169
Cdd:PLN03209  414 PSPgsaSNVPEVEPAQVEAKKTRPLSPYARYEDLKPPTSPSPT---------------------APTGVS--PSVSSTSS 470
                         170       180       190       200       210       220       230
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*...
gi 568987261  170 SGNFPNSGPYGSYPQSQAPPLSQAQGHPGVQPPLRSAPPLASSFTSPASGGPQMPSMTGLLPPGQGFGSLPVNQANHV 247
Cdd:PLN03209  471 VPAVPDTAPATAATDAAAPPPANMRPLSPYAVYDDLKPPTSPSPAAPVGKVAPSSTNEVVKVGNSAPPTALADEQHHA 548
PRK12323 PRK12323
DNA polymerase III subunit gamma/tau;
6-214 2.73e-03

DNA polymerase III subunit gamma/tau;


Pssm-ID: 237057 [Multi-domain]  Cd Length: 700  Bit Score: 41.79  E-value: 2.73e-03
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 568987261    6 SAPPVPPYGQNQPIYPGYHQSSYGGQPGPAAPATPygayngpvPGYQQAPPQGVPRAPPSSGAPPASAAQVpcgqttygq 85
Cdd:PRK12323  400 AAPPAAPAAAPAAAAAARAVAAAPARRSPAPEALA--------AARQASARGPGGAPAPAPAPAAAPAAAA--------- 462
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 568987261   86 fgqgdiqngPSSTAQMQRVPGSQQFGPPLAPVVSQPAVlQPYGPPPTStqvtaQLAGMQISGAVAQ---APPPSGLGYGP 162
Cdd:PRK12323  463 ---------RPAAAGPRPVAAAAAAAPARAAPAAAPAP-ADDDPPPWE-----ELPPEFASPAPAQpdaAPAGWVAESIP 527
                         170       180       190       200       210
                  ....*....|....*....|....*....|....*....|....*....|..
gi 568987261  163 PTSLASASGNFPNSGPYGSYPQSQAPPLSQAQGHPGVqPPLRSAPPLASSFT 214
Cdd:PRK12323  528 DPATADPDDAFETLAPAPAAAPAPRAAAATEPVVAPR-PPRASASGLPDMFD 578
PRK10263 PRK10263
DNA translocase FtsK; Provisional
33-135 2.86e-03

DNA translocase FtsK; Provisional


Pssm-ID: 236669 [Multi-domain]  Cd Length: 1355  Bit Score: 41.99  E-value: 2.86e-03
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 568987261   33 GPAAP-----ATPYGAYNGPVPGYQQAPPQGVPRAPPSSGAPPasaaQVPCGQTTYGQFGQGDIQNGPSSTAQMQRVPGS 107
Cdd:PRK10263  739 GPHEPlftpiVEPVQQPQQPVAPQQQYQQPQQPVAPQPQYQQP----QQPVAPQPQYQQPQQPVAPQPQYQQPQQPVAPQ 814
                          90       100
                  ....*....|....*....|....*...
gi 568987261  108 QQFGPPLAPVVSQPAVLQPYGPPPTSTQ 135
Cdd:PRK10263  815 PQYQQPQQPVAPQPQYQQPQQPVAPQPQ 842
PRK07764 PRK07764
DNA polymerase III subunits gamma and tau; Validated
59-219 3.88e-03

DNA polymerase III subunits gamma and tau; Validated


Pssm-ID: 236090 [Multi-domain]  Cd Length: 824  Bit Score: 41.51  E-value: 3.88e-03
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 568987261   59 VPRAPPSSGAPPASAAQVPCGQTTYGQfgqgdiqnGPSSTAQmqrvPGSQQFGPPlAPVVSQPAVlQPYGPPPTSTQVTA 138
Cdd:PRK07764  364 LPSASDDERGLLARLERLERRLGVAGG--------AGAPAAA----APSAAAAAP-AAAPAPAAA-APAAAAAPAPAAAP 429
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 568987261  139 QLAGMQISG-AVAQAPPPSGLGYGPPTSLASASGNFPNSGPYGSYPQSQAPPLSQAQGHPGVQPPLRSAPPLASSFTSPA 217
Cdd:PRK07764  430 QPAPAPAPApAPPSPAGNAPAGGAPSPPPAAAPSAQPAPAPAAAPEPTAAPAPAPPAAPAPAAAPAAPAAPAAPAGADDA 509

                  ..
gi 568987261  218 SG 219
Cdd:PRK07764  510 AT 511
PBP1 COG5180
PAB1-binding protein, interacts with poly(A)-binding protein [RNA processing and modification]; ...
3-240 5.85e-03

PAB1-binding protein, interacts with poly(A)-binding protein [RNA processing and modification];


Pssm-ID: 444064 [Multi-domain]  Cd Length: 548  Bit Score: 40.82  E-value: 5.85e-03
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 568987261    3 VNQSAPPVPPYGQNQPiypgyhQSSYGGQPGPAAPATPYGAYNGPVPGYQQAPPQGVPRAP-PSSGAPPASAAQVPCGQT 81
Cdd:COG5180   199 LDRPKVEVKDEAQEEP------PDLTGGADHPRPEAASSPKVDPPSTSEARSRPATVDAQPeMRPPADAKERRRAAIGDT 272
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 568987261   82 TYGQF-GQGDIQNGP--------SSTAQMQRVPGSQQFGPPLAPVVSQPAVLQPYGPPPTstQVTAQLAGMQISGAVAQA 152
Cdd:COG5180   273 PAAEPpGLPVLEAGSepqsdapeAETARPIDVKGVASAPPATRPVRPPGGARDPGTPRPG--QPTERPAGVPEAASDAGQ 350
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 568987261  153 PP-----PSGLGYGPPTSLASASGNFPNSGPYGSYPQSQAPPlsQAQGHPGVQPPLRSAPPLAssftsPASGGPQMPSMT 227
Cdd:COG5180   351 PPsayppAEEAVPGKPLEQGAPRPGSSGGDGAPFQPPNGAPQ--PGLGRRGAPGPPMGAGDLV-----QAALDGGGRETA 423
                         250
                  ....*....|...
gi 568987261  228 GLLPPGQGFGSLP 240
Cdd:COG5180   424 SLGGAAGGAGQGP 436
PRK12323 PRK12323
DNA polymerase III subunit gamma/tau;
5-174 5.98e-03

DNA polymerase III subunit gamma/tau;


Pssm-ID: 237057 [Multi-domain]  Cd Length: 700  Bit Score: 40.63  E-value: 5.98e-03
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 568987261    5 QSAPPVPPYGQNQPIYPGYHQSSYGGQPGPAAPATPygaynGPVPGYQQAPPQGVPRAPPSSgAPPASAAQVPCGQTTYG 84
Cdd:PRK12323  419 VAAAPARRSPAPEALAAARQASARGPGGAPAPAPAP-----AAAPAAAARPAAAGPRPVAAA-AAAAPARAAPAAAPAPA 492
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 568987261   85 QFGQGDIQNGPSSTAQMQRVPGSQQFGPPLAPVVSQPAVLQPYGPPPTSTQVTAQLAGMQISGAVAQAPPPSGLGYGPPT 164
Cdd:PRK12323  493 DDDPPPWEELPPEFASPAPAQPDAAPAGWVAESIPDPATADPDDAFETLAPAPAAAPAPRAAAATEPVVAPRPPRASASG 572
                         170
                  ....*....|
gi 568987261  165 SLASASGNFP 174
Cdd:PRK12323  573 LPDMFDGDWP 582
dnaA PRK14086
chromosomal replication initiator protein DnaA;
3-221 7.26e-03

chromosomal replication initiator protein DnaA;


Pssm-ID: 237605 [Multi-domain]  Cd Length: 617  Bit Score: 40.19  E-value: 7.26e-03
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 568987261    3 VNQSAPPVPPYGQNQPIYPGYHQSSYGGQP-----GPAAPATPYGAYNGPV-PGYQQAPPQGVPRAPPssGAPPASAAQV 76
Cdd:PRK14086   88 VDPSAGEPAPPPPHARRTSEPELPRPGRRPyegygGPRADDRPPGLPRQDQlPTARPAYPAYQQRPEP--GAWPRAADDY 165
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 568987261   77 PCGQTTYGqFGQGDIQNGPSSTAqmqrvPGSQQFGPPlaPVVSQPAVLQPYGPPptstqvtaqlagmqiSGAVAQAPPPS 156
Cdd:PRK14086  166 GWQQQRLG-FPPRAPYASPASYA-----PEQERDREP--YDAGRPEYDQRRRDY---------------DHPRPDWDRPR 222
                         170       180       190       200       210       220
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*
gi 568987261  157 GLGYGPPTSlASASGNFPNSGPYGSYPQSqAPPLSQAQGHPGvqpplrsaPPLASSFTSPASGGP 221
Cdd:PRK14086  223 RDRTDRPEP-PPGAGHVHRGGPGPPERDD-APVVPIRPSAPG--------PLAAQPAPAPGPGEP 277
 
Blast search parameters
Data Source: Precalculated data, version = cdd.v.3.21
Preset Options:Database: CDSEARCH/cdd   Low complexity filter: no  Composition Based Adjustment: yes   E-value threshold: 0.01

References:

  • Wang J et al. (2023), "The conserved domain database in 2023", Nucleic Acids Res.51(D)384-8.
  • Lu S et al. (2020), "The conserved domain database in 2020", Nucleic Acids Res.48(D)265-8.
  • Marchler-Bauer A et al. (2017), "CDD/SPARCLE: functional classification of proteins via subfamily domain architectures.", Nucleic Acids Res.45(D)200-3.
Help | Disclaimer | Write to the Help Desk
NCBI | NLM | NIH