|
Name |
Accession |
Description |
Interval |
E-value |
| COG5028 |
COG5028 |
Vesicle coat complex COPII, subunit SEC24/subunit SFB2/subunit SFB3 [Intracellular trafficking ... |
181-1117 |
2.30e-166 |
|
Vesicle coat complex COPII, subunit SEC24/subunit SFB2/subunit SFB3 [Intracellular trafficking and secretion];
Pssm-ID: 227361 [Multi-domain] Cd Length: 861 Bit Score: 513.57 E-value: 2.30e-166
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 568987261 181 SYPQSQAPPLSQAQGHPGVQPPLRSAPPLAS--SFTSPASGGPQMPsmtglLPPGQgfgslpvNQANHvssppaPALPPG 258
Cdd:COG5028 2 SQHKKGVYPQAQSQVHTGAASSKKSARPHRAyaNFSAGQMGMPPYT-----TPPLQ-------QQSRR------QIDQAA 63
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 568987261 259 TQMTgppvpppppmhspqQPGYQLQQNGSFGPARGPQPNYESPYPGAPTFGsqpgppqpLPPKRLDPDAiPSPQLNELPP 338
Cdd:COG5028 64 TAMH--------------NTGANNPAPSVMSPAFQSQQKFSSPYGGSMADG--------TAPKPTNPLV-PVDLFEDQPP 120
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 568987261 339 QqKTRHRIDPdaipspiqvieddrnnrgsepfvtgvRGQVPPLvTTNFLVKDQGNASPRYIRCTSYNIPCTSDMAKQAQV 418
Cdd:COG5028 121 P-ISDLFLPP--------------------------PPIVPPL-TTNFVGSEQSNCSPKYVRSTMYAIPETNDLLKKSKI 172
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 568987261 419 PLAAVIKPLARLPPEEASPYVVDHGEsgPLRCNRCKAYMCPLMTFIEGGRRFQCSFCSCVNDVPPQYFQHLDHTGKRVDA 498
Cdd:COG5028 173 PFGLVIRPFLELYPEEDPVPLVEDGS--IVRCRRCRSYINPFVQFIEQGRKWRCNICRSKNDVPEGFDNPSGPNDPRSDR 250
|
330 340 350 360 370 380 390 400
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 568987261 499 YDRPELSLGSYEFLATVDYckNNKFPSPPAFIFMIDVSYNAIRTGLVRLLCEELKSLLDYLPREGGAeesaIRVGFVTYN 578
Cdd:COG5028 251 YSRPELKSGVVDFLAPKEY--SLRQPPPPVYVFLIDVSFEAIKNGLVKAAIRAILENLDQIPNFDPR----TKIAIICFD 324
|
410 420 430 440 450 460 470 480
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 568987261 579 KVLHFYNVKSSLaQPQMMVVSDVADMFVPLLDG-FLVNVSESRAVITSLLDQIPEMFADTRETETVFAPviqagmeALKA 657
Cdd:COG5028 325 SSLHFFKLSPDL-DEQMLIVSDLDEPFLPFPSGlFVLPLKSCKQIIETLLDRVPRIFQDNKSPKNALGP-------ALKA 396
|
490 500 510 520 530 540 550 560
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 568987261 658 A-----ECAGKLFLFHTSLPIAeAPGKLKNRDDrklintdKEKTLFQPQTGTYQTLAKECVAQGCCVDLFLFPNQYVDVA 732
Cdd:COG5028 397 AksligGTGGKIIVFLSTLPNM-GIGKLQLRED-------KESSLLSCKDSFYKEFAIECSKVGISVDLFLTSEDYIDVA 468
|
570 580 590 600 610 620 630 640
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 568987261 733 TLSVVPQLTGGSVYKYACFQVE--NDQERFLSDLRRDVQKVVGFDAVMRVRTSTGIRAVDFFGAFYMSNTTDVELAGLDG 810
Cdd:COG5028 469 TLSHLCRYTGGQTYFYPNFSATrpNDATKLANDLVSHLSMEIGYEAVMRVRCSTGLRVSSFYGNFFNRSSDLCAFSTMPR 548
|
650 660 670 680 690 700 710 720
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 568987261 811 DKTVTVEFKHDDRLNEEnGALLQCALLYTSCAGQRRLRIHNLALNCCTQLADLYRNCETDTLINYMAKFAYRAVLNSPVK 890
Cdd:COG5028 549 DTSLLVEFSIDEKLMTS-DVYFQVALLYTLNDGERRIRVVNLSLPTSSSIREVYASADQLAIACILAKKASTKALNSSLK 627
|
730 740 750 760 770 780 790 800
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 568987261 891 TVRDTLITQCAQILACYRKNCASPSSAGQLILPECMKLLPVYLNCVLKSDVLQPGAeVTTDDRAYVRQLVSSMDVAETNV 970
Cdd:COG5028 628 EARVLINKSMVDILKAYKKELVKSNTSTQLPLPANLKLLPLLMLALLKSSAFRSGS-TPSDIRISALNRLTSLPLKQLMR 706
|
810 820 830 840 850 860 870 880
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 568987261 971 FFYPRLLPLVRTKSPLDSTAE-----PPAVRASEERLSSGDIYLLENGLNLFVWVGASVQQGVVQSLFNVSSFSQITSGL 1045
Cdd:COG5028 707 NIYPTLYALHDMPIEAGLPDEgllvlPSPINATSSLLESGGLYLIDTGQKIFLWFGKDAVPSLLQDLFGVDSLSDIPSGK 786
|
890 900 910 920 930 940 950
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*.
gi 568987261 1046 SVLPVLDNPLSKKVRGLIDSLRaQRMRYMKLIVVKQED----KLEMLFKHFLVEDKSLsGGASYVDFLCHMHKEIR 1117
Cdd:COG5028 787 FTLPPTGNEFNERVRNIIGELR-SVNDDSTLPLVLVRGggdpSLRLWFFSTLVEDKTL-NIPSYLDYLQILHEKIK 860
|
|
| Sec24-like |
cd01479 |
Sec24-like: Protein and membrane traffic in eukaryotes is mediated at least in part by the ... |
524-783 |
1.22e-123 |
|
Sec24-like: Protein and membrane traffic in eukaryotes is mediated at least in part by the budding and fusion of intracellular transport vesicles that selectively carry cargo proteins and lipids from donor to acceptor organelles. The two main classes of vesicular carriers within the endocytic and the biosynthetic pathways are COP- and clathrin-coated vesicles. Formation of COPII vesicles requires the ordered assembly of the coat, which is built from several cytosolic components: the GTPase Sar1 and the complexes Sec23-Sec24 and Sec13-Sec31. The process is initiated by the exchange of GDP for GTP on the GTPase Sar1, which then recruits the heterodimeric complex of Sec23 and Sec24. This heterodimeric complex generates the pre-budding complex. The final step, leading to membrane deformation and budding of COPII-coated vesicles, is carried out by the heterodimeric complex Sec13-Sec31. The members of this CD belong to the Sec23-like family. Sec24 is very similar to Sec23. The Sec23 and Sec24 polypeptides fold into five distinct domains: a beta-barrel, a zinc finger, a vWA or trunk domain, an all-helical region, and a carboxy-terminal gelsolin domain. The members of this subgroup carry a partial MIDAS motif and have the overall Para-Rossmann type fold that is characteristic of this superfamily.
Pssm-ID: 238756 [Multi-domain] Cd Length: 244 Bit Score: 378.92 E-value: 1.22e-123
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 568987261 524 PSPPAFIFMIDVSYNAIRTGLVRLLCEELKSLLDYLPREggaeESAIRVGFVTYNKVLHFYNVKSSLAQPQMMVVSDVAD 603
Cdd:cd01479 1 PQPAVYVFLIDVSYNAIKSGLLATACEALLSNLDNLPGD----DPRTRVGFITFDSTLHFFNLKSSLEQPQMMVVSDLDD 76
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 568987261 604 MFVPLLDGFLVNVSESRAVITSLLDQIPEMFADTRETETVFAPVIQAGMEALKaaECAGKLFLFHTSLPIAEApGKLKNR 683
Cdd:cd01479 77 PFLPLPDGLLVNLKESRQVIEDLLDQIPEMFQDTKETESALGPALQAAFLLLK--ETGGKIIVFQSSLPTLGA-GKLKSR 153
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 568987261 684 DDRKLINTDKEKTLFQPQTGTYQTLAKECVAQGCCVDLFLFPNQYVDVATLSVVPQLTGGSVYKYAcfqvendqeRFLSD 763
Cdd:cd01479 154 EDPKLLSTDKEKQLLQPQTDFYKKLALECVKSQISVDLFLFSNQYVDVATLGCLSRLTGGQVYYYP---------SFNFS 224
|
250 260
....*....|....*....|
gi 568987261 764 LRRDVQKVVGFDAVMRVRTS 783
Cdd:cd01479 225 APNDVEKLVNELARYLTRKI 244
|
|
| Sec23_trunk |
pfam04811 |
Sec23/Sec24 trunk domain; COPII-coated vesicles carry proteins from the endoplasmic reticulum ... |
524-768 |
1.57e-115 |
|
Sec23/Sec24 trunk domain; COPII-coated vesicles carry proteins from the endoplasmic reticulum to the Golgi complex. This vesicular transport can be reconstituted by using three cytosolic components containing five proteins: the small GTPase Sar1p, the Sec23p/24p complex, and the Sec13p/Sec31p complex. This domain is known as the trunk domain and has an alpha/beta vWA fold and forms the dimer interface.
Pssm-ID: 398467 [Multi-domain] Cd Length: 241 Bit Score: 357.33 E-value: 1.57e-115
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 568987261 524 PSPPAFIFMIDVSYNAIRTGLVRLLCEELKSLLDYLPREggaeeSAIRVGFVTYNKVLHFYNVKSSLAQPQMMVVSDVAD 603
Cdd:pfam04811 1 PQPPVFLFVIDVSYNAIKSGLLAALKESLLQSLDLLPGD-----PRARVGFITFDSTVHFFNLGSSLRQPQMLVVSDLQD 75
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 568987261 604 MFVPLLDGFLVNVSESRAVITSLLDQIPEMFADTRETETVFAPVIQAGMEALKAAECAGKLFLFHTSLPIAEAPGKLKNR 683
Cdd:pfam04811 76 MFLPLPDRFLVPLSECRFVLEDLLEQLPPMFPVTKRPERCLGPALQAAFLLLKAAFTGGKIMVFQGGLPTVGPGGKLKSR 155
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 568987261 684 DDRKLINTDKEKTLFQPQT-GTYQTLAKECVAQGCCVDLFLFPNQYVDVATLSVVPQLTGGSVYKYACFQVENDQERFLS 762
Cdd:pfam04811 156 LDESHHGTDKEKAKLVKKAdKFYKSLAKECVKQGHSVDLFAFSLDYVDVATLGQLSRLTGGQVYLYPSFQADVDGSKFKQ 235
|
....*.
gi 568987261 763 DLRRDV 768
Cdd:pfam04811 236 DLQRYF 241
|
|
| PTZ00395 |
PTZ00395 |
Sec24-related protein; Provisional |
19-1118 |
3.09e-48 |
|
Sec24-related protein; Provisional
Pssm-ID: 185594 [Multi-domain] Cd Length: 1560 Bit Score: 188.36 E-value: 3.09e-48
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 568987261 19 IYPGYHqssyGGQPGPAAPATPYGAYNGPVPG--YQQAPP--QGVPRAPPSSGAPPASAAQVPCGQTTYGQfgqgdiqng 94
Cdd:PTZ00395 338 IYGGFH----DGSPNAASAGAPFNGLGNQADGghINQVHPdaRGAWAGGPHSNASYNCAAYSNAAQSNAAQ--------- 404
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 568987261 95 psSTAQMQRVPGSQQfGPPLAPVVSQPAVLQPYGPPPTSTQVTAQlagmqisgavaqaPPPSGlgygPPTSlasasgNFP 174
Cdd:PTZ00395 405 --SNAGFSNAGYSNP-GNSNPGYNNAPNSNTPYNNPPNSNTPYSN-------------PPNSN----PPYS------NLP 458
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 568987261 175 NSG-PYGSYPQSQAPPlSQAQGHPGVqpplrsappLASSFTSPASGGPQMPSMTGLLPPGQGFGSlpvNQANHVSSPPAP 253
Cdd:PTZ00395 459 YSNtPYSNAPLSNAPP-SSAKDHHSA---------YHAAYQHRAANQPAANLPTANQPAANNFHG---AAGNSVGNPFAS 525
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 568987261 254 ALPPGTQMTGppvpppppmhspqqpgyqlqqngsfGPARGPQPNYESPYPGAPTFGSQPGPPQPLPPKRLDPDAI--PSP 331
Cdd:PTZ00395 526 RPFGSAPYGG-------------------------NAATTADPNGIAKREDHPEGGTNRQKYEQSDEESVESSSSenSSE 580
|
330 340 350 360 370 380 390 400
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 568987261 332 QLNELPPQ--------QKTRHRIDPDAIPSPIQVIEDDRNNRGSEPFVTgVRGQVPPLVTTNFLVKDQGNASPRYIRCTS 403
Cdd:PTZ00395 581 NENEVTDKgeeiysllKKTINRIDMNKIPRPIINTQEKKKKKNLKVFET-CKYISPPSYYQPYISIDTGKADPRFLKSTL 659
|
410 420 430 440 450 460 470 480
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 568987261 404 YNIPCTSDMAKQAQVPLAAVIKPLARLPPEEASPYV-----VDHGESGP--LRCNRCKAYMcpLMTFIEG-GRRFQCSFC 475
Cdd:PTZ00395 660 YQIPLFSETLKLSQIPFGIIVNPFACLNEGEGIDKIdmkdiINDKEENIeiLRCPKCLGYL--HATILEDiSSSVQCVFC 737
|
490 500 510 520 530 540 550 560
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 568987261 476 SC---VND--------------------------------------VPPQYFQHLD-------HTGKRV----------- 496
Cdd:PTZ00395 738 DTdflINEnvlfdifqynekighkesdhnehgnslspllkgsvdiiIPPIYYHNVNkfkltytYLNKNInqtafmitnki 817
|
570 580 590 600 610 620 630 640
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 568987261 497 -----------------------------DAYDRPELSLGSY-------------------------------------- 509
Cdd:PTZ00395 818 msftkhisnslvandskggnkatsasafgDSGDANFLAGGGYtnyggaggyntydnqsgynnhdvvnnrggsgagnhlyg 897
|
650 660 670 680 690 700 710 720
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 568987261 510 ------EFLATVD------------YCKNN---------------KFPS-----PPAFIFMIDVSYNAIRTGLVRLLCEE 551
Cdd:PTZ00395 898 kdhdvqNFDNVMDnanftihdmknlICEKNgepdsakirrnsflaKYPQvknmlPPYFVFVVECSYNAIYNNITYTILEG 977
|
730 740 750 760 770 780 790 800
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 568987261 552 LKSLLDYL--PReggaeesaIRVGFVTYNKVLHFYNVKSSLAQP-------------QMMVVSDVADMFVPL-LDGFLVN 615
Cdd:PTZ00395 978 IRYAVQNVkcPQ--------TKIAIITFNSSIYFYHCKGGKGVSgeegdggggsgnhQVIVMSDVDDPFLPLpLEDLFFG 1049
|
810 820 830 840 850 860 870 880
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 568987261 616 VSESRAVITSLLDQIPEMFADTRETETVFAPVIQAGMEALKAAECAGKLFLFHTSLPIAeAPGKLKnrddrKLINTDKEK 695
Cdd:PTZ00395 1050 CVEEIDKINTLIDTIKSVSTTMQSYGSCGNSALKIAMDMLKERNGLGSICMFYTTTPNC-GIGAIK-----ELKKDLQEN 1123
|
890 900 910 920 930 940 950 960
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 568987261 696 TLFQPQTGTYQTLAKECVAQGCCVDLFLFP--NQYVDVATLSVVPQLTGGSVYKYACFQVEND-QERFLSDLRRDVQKVV 772
Cdd:PTZ00395 1124 FLEVKQKIFYDSLLLDLYAFNISVDIFIISsnNVRVCVPSLQYVAQNTGGKILFVENFLWQKDyKEIYMNIMDTLTSEDI 1203
|
970 980 990 1000 1010 1020 1030 1040
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 568987261 773 GFDAVMRVRTSTGIRAVDFFGAFYMSNTT----DVELAGLDGDKTVTVEFKHDDRLNEENGALLQCALLYTSCAGQRRLR 848
Cdd:PTZ00395 1204 AYCCELKLRYSHHMSVKKLFCCNNNFNSIisvdTIKIPKIRHDQTFAFLLNYSDISESKKQIYFQCACIYTNLWGDRFVR 1283
|
1050 1060 1070 1080 1090 1100 1110 1120
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 568987261 849 IHNLALNCCTQLADLYRNCETDTLINYMAKFAYRAVLNSpvKTVRDTLITQCAQILACYRKNCASPSSAGQLILPECMKL 928
Cdd:PTZ00395 1284 LHTTHMNLTSSLSTVFRYTDAEALMNILIKQLCTNILHN--DNYSKIIIDNLAAILFSYRINCASSAHSGQLILPDTLKL 1361
|
1130 1140 1150 1160 1170 1180 1190 1200
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 568987261 929 LPVYLNCVLKSDVLQpgAEVTTDDRAYVRQLVSSMDVAETNVFFYPRLLPL-VRTKS-PLDSTAE------PPAVRASEE 1000
Cdd:PTZ00395 1362 LPLFTSSLLKHNVTK--KEILHDLKVYSLIKLLSMPIISSLLYVYPVMYVIhIKGKTnEIDSMDVdddlfiPKTIPSSAE 1439
|
1210 1220 1230 1240 1250 1260 1270 1280
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 568987261 1001 RLSSGDIYLLENGLNLFVWVGASVQQGVVQSLFNVSSFSQITSGLSvlpVLDNPLSKKVRGLIDSL-RAQRM-RYMKLIV 1078
Cdd:PTZ00395 1440 KIYSNGIYLLDACTHFYLYFGFHSDANFAKEIVGDIPTEKNAHELN---LTDTPNAQKVQRIIKNLsRIHHFnKYVPLVM 1516
|
1290 1300 1310 1320
....*....|....*....|....*....|....*....|
gi 568987261 1079 VKQEDKLEMLFKHFLVEDKSlSGGASYVDFLCHMHKEIRQ 1118
Cdd:PTZ00395 1517 VAPKSNEEEHLISLCVEDKA-DKEYSYVNFLCFIHKLVHK 1555
|
|
| Atrophin-1 |
pfam03154 |
Atrophin-1 family; Atrophin-1 is the protein product of the dentatorubral-pallidoluysian ... |
25-232 |
2.16e-08 |
|
Atrophin-1 family; Atrophin-1 is the protein product of the dentatorubral-pallidoluysian atrophy (DRPLA) gene. DRPLA (OMIM:125370) is a progressive neurodegenerative disorder. It is caused by the expansion of a CAG repeat in the DRPLA gene on chromosome 12p. This results in an extended polyglutamine region in atrophin-1 that is thought to confer toxicity to the protein, possibly through altering its interactions with other proteins. The expansion of a CAG repeat is also the underlying defect in six other neurodegenerative disorders, including Huntington's disease. One interaction of expanded polyglutamine repeats that is thought to be pathogenic is that with the short glutamine repeat in the transcriptional coactivator CREB binding protein, CBP. This interaction draws CBP away from its usual nuclear location to the expanded polyglutamine repeat protein aggregates that are characteristic of the polyglutamine neurodegenerative disorders. This interferes with CBP-mediated transcription and causes cytotoxicity.
Pssm-ID: 460830 [Multi-domain] Cd Length: 991 Bit Score: 58.63 E-value: 2.16e-08
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 568987261 25 QSSYGGQPGPAAPATPYGAYNGPVPGYQQAPPQGVP--RAPPSSGAPPAS------------AAQVPCGQTTYGQFGQG- 89
Cdd:pfam03154 177 QSGAASPPSPPPPGTTQAATAGPTPSAPSVPPQGSPatSQPPNQTQSTAAphtliqqtptlhPQRLPSPHPPLQPMTQPp 256
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 568987261 90 -DIQNGPSST------AQMQRVPGSQQFGPPLAPvvsQPAVLQPYGPPPTSTQvtaqlagmqisgavAQAPPPsglgygP 162
Cdd:pfam03154 257 pPSQVSPQPLpqpslhGQMPPMPHSLQTGPSHMQ---HPVPPQPFPLTPQSSQ--------------SQVPPG------P 313
|
170 180 190 200 210 220 230
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....
gi 568987261 163 PTSLASASGNFPNSGPYGSYPQSQAPPLSQ-----AQGHPGVQPPLRSA-PPLASSFT---SPASGGPQMPSMTGLLPP 232
Cdd:pfam03154 314 SPAAPGQSQQRIHTPPSQSQLQSQQPPREQplppaPLSMPHIKPPPTTPiPQLPNPQShkhPPHLSGPSPFQMNSNLPP 392
|
|
| PHA03247 |
PHA03247 |
large tegument protein UL36; Provisional |
7-344 |
6.13e-08 |
|
large tegument protein UL36; Provisional
Pssm-ID: 223021 [Multi-domain] Cd Length: 3151 Bit Score: 57.26 E-value: 6.13e-08
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 568987261 7 APPVPPYGQNQPIYPGYHQSSYGGQPGPAAPATPYGAYNGPVPGYQQAPPQGVPRAPPSSGAPPASAAQVPCGQTTygqf 86
Cdd:PHA03247 2709 EPAPHALVSATPLPPGPAAARQASPALPAAPAPPAVPAGPATPGGPARPARPPTTAGPPAPAPPAAPAAGPPRRLT---- 2784
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 568987261 87 gqgdIQNGPSSTAQMQRVPGSQQFGPPLAPVVSQPAVLQPYG------PPPTSTQVTAqlagmqisgavaqAPPPSGLgy 160
Cdd:PHA03247 2785 ----RPAVASLSESRESLPSPWDPADPPAAVLAPAAALPPAAspagplPPPTSAQPTA-------------PPPPPGP-- 2845
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 568987261 161 gPPTSLASASGNFPnSGPYGSYPQSQAPPLSQA-QGHPGVQPPLRSAPPLAS-SFTSPASGgPQMPSMTGLLPPGQGFGS 238
Cdd:PHA03247 2846 -PPPSLPLGGSVAP-GGDVRRRPPSRSPAAKPAaPARPPVRRLARPAVSRSTeSFALPPDQ-PERPPQPQAPPPPQPQPQ 2922
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 568987261 239 LPVnqanhvssppapalppgtqmtgpPVPPPPPMHSPQQPGYQLQQNGSFGPARGPQPNYESPYPGAPTFGSqpgppqPL 318
Cdd:PHA03247 2923 PPP-----------------------PPQPQPPPPPPPRPQPPLAPTTDPAGAGEPSGAVPQPWLGALVPGR------VA 2973
|
330 340
....*....|....*....|....*.
gi 568987261 319 PPKRLDPDAIPSPQLNELPPQQKTRH 344
Cdd:PHA03247 2974 VPRFRVPQPAPSREAPASSTPPLTGH 2999
|
|
| PABP-1234 |
TIGR01628 |
polyadenylate binding protein, human types 1, 2, 3, 4 family; These eukaryotic proteins ... |
1-118 |
2.17e-04 |
|
polyadenylate binding protein, human types 1, 2, 3, 4 family; These eukaryotic proteins recognize the poly-A of mRNA and consist of four tandem RNA recognition domains at the N-terminus (rrm: pfam00076) followed by a PABP-specific domain (pfam00658) at the C-terminus. The protein is involved in the transport of mRNAs from the nucleus to the cytoplasm. There are four paralogs in Homo sapiens: one expressed in testis, one expressed in platelets, one broadly expressed, and one of unknown tissue range.
Pssm-ID: 130689 [Multi-domain] Cd Length: 562 Bit Score: 45.18 E-value: 2.17e-04
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 568987261 1 MNVNQSAPPVP---PYGQNQPiYPGYHqssyGGQPGPAAPAtPYGAYNGPVPGYQQA----PPQGVPRAPPSSGAPPASA 73
Cdd:TIGR01628 403 QGPQQQFNGQPlgwPRMSMMP-TPMGP----GGPLRPNGLA-PMNAVRAPSRNAQNAaqkpPMQPVMYPPNYQSLPLSQD 476
|
90 100 110 120
....*....|....*....|....*....|....*....|....*..
gi 568987261 74 AQVPcgQTTYGQFGQGD--IQNGPSSTAQMQRvpgsQQFGPPLAPVV 118
Cdd:TIGR01628 477 LPQP--QSTASQGGQNKklAQVLASATPQMQK----QVLGERLFPLV 517
|
|
| COG3416 |
COG3416 |
Uncharacterized conserved protein, DUF2076 domain [Function unknown]; |
14-67 |
1.33e-03 |
|
Uncharacterized conserved protein, DUF2076 domain [Function unknown];
Pssm-ID: 442642 [Multi-domain] Cd Length: 237 Bit Score: 41.55 E-value: 1.33e-03
10 20 30 40 50
....*....|....*....|....*....|....*....|....*....|....
gi 568987261 14 GQNQPIYPGYHQSSYGGQPGPAAPATPYGAYNGPVPGYQQaPPQGVPRAPPSSG 67
Cdd:COG3416 91 GGGQRPPPAPQPSQPGPQQQPAPPSGPWGQAAPQQPGYGQ-PQYGQPAAGPSGG 143
|
|
|
|
Name |
Accession |
Description |
Interval |
E-value |
| COG5028 |
COG5028 |
Vesicle coat complex COPII, subunit SEC24/subunit SFB2/subunit SFB3 [Intracellular trafficking ... |
181-1117 |
2.30e-166 |
|
Vesicle coat complex COPII, subunit SEC24/subunit SFB2/subunit SFB3 [Intracellular trafficking and secretion];
Pssm-ID: 227361 [Multi-domain] Cd Length: 861 Bit Score: 513.57 E-value: 2.30e-166
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 568987261 181 SYPQSQAPPLSQAQGHPGVQPPLRSAPPLAS--SFTSPASGGPQMPsmtglLPPGQgfgslpvNQANHvssppaPALPPG 258
Cdd:COG5028 2 SQHKKGVYPQAQSQVHTGAASSKKSARPHRAyaNFSAGQMGMPPYT-----TPPLQ-------QQSRR------QIDQAA 63
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 568987261 259 TQMTgppvpppppmhspqQPGYQLQQNGSFGPARGPQPNYESPYPGAPTFGsqpgppqpLPPKRLDPDAiPSPQLNELPP 338
Cdd:COG5028 64 TAMH--------------NTGANNPAPSVMSPAFQSQQKFSSPYGGSMADG--------TAPKPTNPLV-PVDLFEDQPP 120
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 568987261 339 QqKTRHRIDPdaipspiqvieddrnnrgsepfvtgvRGQVPPLvTTNFLVKDQGNASPRYIRCTSYNIPCTSDMAKQAQV 418
Cdd:COG5028 121 P-ISDLFLPP--------------------------PPIVPPL-TTNFVGSEQSNCSPKYVRSTMYAIPETNDLLKKSKI 172
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 568987261 419 PLAAVIKPLARLPPEEASPYVVDHGEsgPLRCNRCKAYMCPLMTFIEGGRRFQCSFCSCVNDVPPQYFQHLDHTGKRVDA 498
Cdd:COG5028 173 PFGLVIRPFLELYPEEDPVPLVEDGS--IVRCRRCRSYINPFVQFIEQGRKWRCNICRSKNDVPEGFDNPSGPNDPRSDR 250
|
330 340 350 360 370 380 390 400
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 568987261 499 YDRPELSLGSYEFLATVDYckNNKFPSPPAFIFMIDVSYNAIRTGLVRLLCEELKSLLDYLPREGGAeesaIRVGFVTYN 578
Cdd:COG5028 251 YSRPELKSGVVDFLAPKEY--SLRQPPPPVYVFLIDVSFEAIKNGLVKAAIRAILENLDQIPNFDPR----TKIAIICFD 324
|
410 420 430 440 450 460 470 480
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 568987261 579 KVLHFYNVKSSLaQPQMMVVSDVADMFVPLLDG-FLVNVSESRAVITSLLDQIPEMFADTRETETVFAPviqagmeALKA 657
Cdd:COG5028 325 SSLHFFKLSPDL-DEQMLIVSDLDEPFLPFPSGlFVLPLKSCKQIIETLLDRVPRIFQDNKSPKNALGP-------ALKA 396
|
490 500 510 520 530 540 550 560
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 568987261 658 A-----ECAGKLFLFHTSLPIAeAPGKLKNRDDrklintdKEKTLFQPQTGTYQTLAKECVAQGCCVDLFLFPNQYVDVA 732
Cdd:COG5028 397 AksligGTGGKIIVFLSTLPNM-GIGKLQLRED-------KESSLLSCKDSFYKEFAIECSKVGISVDLFLTSEDYIDVA 468
|
570 580 590 600 610 620 630 640
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 568987261 733 TLSVVPQLTGGSVYKYACFQVE--NDQERFLSDLRRDVQKVVGFDAVMRVRTSTGIRAVDFFGAFYMSNTTDVELAGLDG 810
Cdd:COG5028 469 TLSHLCRYTGGQTYFYPNFSATrpNDATKLANDLVSHLSMEIGYEAVMRVRCSTGLRVSSFYGNFFNRSSDLCAFSTMPR 548
|
650 660 670 680 690 700 710 720
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 568987261 811 DKTVTVEFKHDDRLNEEnGALLQCALLYTSCAGQRRLRIHNLALNCCTQLADLYRNCETDTLINYMAKFAYRAVLNSPVK 890
Cdd:COG5028 549 DTSLLVEFSIDEKLMTS-DVYFQVALLYTLNDGERRIRVVNLSLPTSSSIREVYASADQLAIACILAKKASTKALNSSLK 627
|
730 740 750 760 770 780 790 800
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 568987261 891 TVRDTLITQCAQILACYRKNCASPSSAGQLILPECMKLLPVYLNCVLKSDVLQPGAeVTTDDRAYVRQLVSSMDVAETNV 970
Cdd:COG5028 628 EARVLINKSMVDILKAYKKELVKSNTSTQLPLPANLKLLPLLMLALLKSSAFRSGS-TPSDIRISALNRLTSLPLKQLMR 706
|
810 820 830 840 850 860 870 880
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 568987261 971 FFYPRLLPLVRTKSPLDSTAE-----PPAVRASEERLSSGDIYLLENGLNLFVWVGASVQQGVVQSLFNVSSFSQITSGL 1045
Cdd:COG5028 707 NIYPTLYALHDMPIEAGLPDEgllvlPSPINATSSLLESGGLYLIDTGQKIFLWFGKDAVPSLLQDLFGVDSLSDIPSGK 786
|
890 900 910 920 930 940 950
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*.
gi 568987261 1046 SVLPVLDNPLSKKVRGLIDSLRaQRMRYMKLIVVKQED----KLEMLFKHFLVEDKSLsGGASYVDFLCHMHKEIR 1117
Cdd:COG5028 787 FTLPPTGNEFNERVRNIIGELR-SVNDDSTLPLVLVRGggdpSLRLWFFSTLVEDKTL-NIPSYLDYLQILHEKIK 860
|
|
| Sec24-like |
cd01479 |
Sec24-like: Protein and membrane traffic in eukaryotes is mediated at least in part by the ... |
524-783 |
1.22e-123 |
|
Sec24-like: Protein and membrane traffic in eukaryotes is mediated at least in part by the budding and fusion of intracellular transport vesicles that selectively carry cargo proteins and lipids from donor to acceptor organelles. The two main classes of vesicular carriers within the endocytic and the biosynthetic pathways are COP- and clathrin-coated vesicles. Formation of COPII vesicles requires the ordered assembly of the coat, which is built from several cytosolic components: the GTPase Sar1 and the complexes Sec23-Sec24 and Sec13-Sec31. The process is initiated by the exchange of GDP for GTP on the GTPase Sar1, which then recruits the heterodimeric complex of Sec23 and Sec24. This heterodimeric complex generates the pre-budding complex. The final step, leading to membrane deformation and budding of COPII-coated vesicles, is carried out by the heterodimeric complex Sec13-Sec31. The members of this CD belong to the Sec23-like family. Sec24 is very similar to Sec23. The Sec23 and Sec24 polypeptides fold into five distinct domains: a beta-barrel, a zinc finger, a vWA or trunk domain, an all-helical region, and a carboxy-terminal gelsolin domain. The members of this subgroup carry a partial MIDAS motif and have the overall Para-Rossmann type fold that is characteristic of this superfamily.
Pssm-ID: 238756 [Multi-domain] Cd Length: 244 Bit Score: 378.92 E-value: 1.22e-123
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 568987261 524 PSPPAFIFMIDVSYNAIRTGLVRLLCEELKSLLDYLPREggaeESAIRVGFVTYNKVLHFYNVKSSLAQPQMMVVSDVAD 603
Cdd:cd01479 1 PQPAVYVFLIDVSYNAIKSGLLATACEALLSNLDNLPGD----DPRTRVGFITFDSTLHFFNLKSSLEQPQMMVVSDLDD 76
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 568987261 604 MFVPLLDGFLVNVSESRAVITSLLDQIPEMFADTRETETVFAPVIQAGMEALKaaECAGKLFLFHTSLPIAEApGKLKNR 683
Cdd:cd01479 77 PFLPLPDGLLVNLKESRQVIEDLLDQIPEMFQDTKETESALGPALQAAFLLLK--ETGGKIIVFQSSLPTLGA-GKLKSR 153
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 568987261 684 DDRKLINTDKEKTLFQPQTGTYQTLAKECVAQGCCVDLFLFPNQYVDVATLSVVPQLTGGSVYKYAcfqvendqeRFLSD 763
Cdd:cd01479 154 EDPKLLSTDKEKQLLQPQTDFYKKLALECVKSQISVDLFLFSNQYVDVATLGCLSRLTGGQVYYYP---------SFNFS 224
|
250 260
....*....|....*....|
gi 568987261 764 LRRDVQKVVGFDAVMRVRTS 783
Cdd:cd01479 225 APNDVEKLVNELARYLTRKI 244
|
|
| Sec23_trunk |
pfam04811 |
Sec23/Sec24 trunk domain; COPII-coated vesicles carry proteins from the endoplasmic reticulum ... |
524-768 |
1.57e-115 |
|
Sec23/Sec24 trunk domain; COPII-coated vesicles carry proteins from the endoplasmic reticulum to the Golgi complex. This vesicular transport can be reconstituted by using three cytosolic components containing five proteins: the small GTPase Sar1p, the Sec23p/24p complex, and the Sec13p/Sec31p complex. This domain is known as the trunk domain and has an alpha/beta vWA fold and forms the dimer interface.
Pssm-ID: 398467 [Multi-domain] Cd Length: 241 Bit Score: 357.33 E-value: 1.57e-115
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 568987261 524 PSPPAFIFMIDVSYNAIRTGLVRLLCEELKSLLDYLPREggaeeSAIRVGFVTYNKVLHFYNVKSSLAQPQMMVVSDVAD 603
Cdd:pfam04811 1 PQPPVFLFVIDVSYNAIKSGLLAALKESLLQSLDLLPGD-----PRARVGFITFDSTVHFFNLGSSLRQPQMLVVSDLQD 75
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 568987261 604 MFVPLLDGFLVNVSESRAVITSLLDQIPEMFADTRETETVFAPVIQAGMEALKAAECAGKLFLFHTSLPIAEAPGKLKNR 683
Cdd:pfam04811 76 MFLPLPDRFLVPLSECRFVLEDLLEQLPPMFPVTKRPERCLGPALQAAFLLLKAAFTGGKIMVFQGGLPTVGPGGKLKSR 155
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 568987261 684 DDRKLINTDKEKTLFQPQT-GTYQTLAKECVAQGCCVDLFLFPNQYVDVATLSVVPQLTGGSVYKYACFQVENDQERFLS 762
Cdd:pfam04811 156 LDESHHGTDKEKAKLVKKAdKFYKSLAKECVKQGHSVDLFAFSLDYVDVATLGQLSRLTGGQVYLYPSFQADVDGSKFKQ 235
|
....*.
gi 568987261 763 DLRRDV 768
Cdd:pfam04811 236 DLQRYF 241
|
|
| trunk_domain |
cd01468 |
trunk domain. COPII-coated vesicles carry proteins from the endoplasmic reticulum to the Golgi ... |
524-766 |
2.32e-103 |
|
trunk domain. COPII-coated vesicles carry proteins from the endoplasmic reticulum to the Golgi complex. This vesicular transport can be reconstituted by using three cytosolic components containing five proteins: the small GTPase Sar1p, the Sec23p/24p complex, and the Sec13p/Sec31p complex. This domain is known as the trunk domain and has an alpha/beta vWA fold and forms the dimer interface. Some members of this family possess a partial MIDAS motif that is a characteristic feature of most vWA domain proteins.
Pssm-ID: 238745 [Multi-domain] Cd Length: 239 Bit Score: 324.97 E-value: 2.32e-103
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 568987261 524 PSPPAFIFMIDVSYNAIRTGLVRLLCEELKSLLDYLPREGGAeesaiRVGFVTYNKVLHFYNVKSSLAQPQMMVVSDVAD 603
Cdd:cd01468 1 PQPPVFVFVIDVSYEAIKEGLLQALKESLLASLDLLPGDPRA-----RVGLITYDSTVHFYNLSSDLAQPKMYVVSDLKD 75
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 568987261 604 MFVPLLDGFLVNVSESRAVITSLLDQIPEMFAD--TRETETVFAPVIQAGMEALKAAECAGKLFLFHTSLPIAEaPGKLK 681
Cdd:cd01468 76 VFLPLPDRFLVPLSECKKVIHDLLEQLPPMFWPvpTHRPERCLGPALQAAFLLLKGTFAGGRIIVFQGGLPTVG-PGKLK 154
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 568987261 682 NRDDRKLINTDKEKTLFQPQTGTYQTLAKECVAQGCCVDLFLFPNQYVDVATLSVVPQLTGGSVYKYACFQVENDQERFL 761
Cdd:cd01468 155 SREDKEPIRSHDEAQLLKPATKFYKSLAKECVKSGICVDLFAFSLDYVDVATLKQLAKSTGGQVYLYDSFQAPNDGSKFK 234
|
....*
gi 568987261 762 SDLRR 766
Cdd:cd01468 235 QDLQR 239
|
|
| PTZ00395 |
PTZ00395 |
Sec24-related protein; Provisional |
19-1118 |
3.09e-48 |
|
Sec24-related protein; Provisional
Pssm-ID: 185594 [Multi-domain] Cd Length: 1560 Bit Score: 188.36 E-value: 3.09e-48
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 568987261 19 IYPGYHqssyGGQPGPAAPATPYGAYNGPVPG--YQQAPP--QGVPRAPPSSGAPPASAAQVPCGQTTYGQfgqgdiqng 94
Cdd:PTZ00395 338 IYGGFH----DGSPNAASAGAPFNGLGNQADGghINQVHPdaRGAWAGGPHSNASYNCAAYSNAAQSNAAQ--------- 404
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 568987261 95 psSTAQMQRVPGSQQfGPPLAPVVSQPAVLQPYGPPPTSTQVTAQlagmqisgavaqaPPPSGlgygPPTSlasasgNFP 174
Cdd:PTZ00395 405 --SNAGFSNAGYSNP-GNSNPGYNNAPNSNTPYNNPPNSNTPYSN-------------PPNSN----PPYS------NLP 458
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 568987261 175 NSG-PYGSYPQSQAPPlSQAQGHPGVqpplrsappLASSFTSPASGGPQMPSMTGLLPPGQGFGSlpvNQANHVSSPPAP 253
Cdd:PTZ00395 459 YSNtPYSNAPLSNAPP-SSAKDHHSA---------YHAAYQHRAANQPAANLPTANQPAANNFHG---AAGNSVGNPFAS 525
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 568987261 254 ALPPGTQMTGppvpppppmhspqqpgyqlqqngsfGPARGPQPNYESPYPGAPTFGSQPGPPQPLPPKRLDPDAI--PSP 331
Cdd:PTZ00395 526 RPFGSAPYGG-------------------------NAATTADPNGIAKREDHPEGGTNRQKYEQSDEESVESSSSenSSE 580
|
330 340 350 360 370 380 390 400
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 568987261 332 QLNELPPQ--------QKTRHRIDPDAIPSPIQVIEDDRNNRGSEPFVTgVRGQVPPLVTTNFLVKDQGNASPRYIRCTS 403
Cdd:PTZ00395 581 NENEVTDKgeeiysllKKTINRIDMNKIPRPIINTQEKKKKKNLKVFET-CKYISPPSYYQPYISIDTGKADPRFLKSTL 659
|
410 420 430 440 450 460 470 480
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 568987261 404 YNIPCTSDMAKQAQVPLAAVIKPLARLPPEEASPYV-----VDHGESGP--LRCNRCKAYMcpLMTFIEG-GRRFQCSFC 475
Cdd:PTZ00395 660 YQIPLFSETLKLSQIPFGIIVNPFACLNEGEGIDKIdmkdiINDKEENIeiLRCPKCLGYL--HATILEDiSSSVQCVFC 737
|
490 500 510 520 530 540 550 560
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 568987261 476 SC---VND--------------------------------------VPPQYFQHLD-------HTGKRV----------- 496
Cdd:PTZ00395 738 DTdflINEnvlfdifqynekighkesdhnehgnslspllkgsvdiiIPPIYYHNVNkfkltytYLNKNInqtafmitnki 817
|
570 580 590 600 610 620 630 640
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 568987261 497 -----------------------------DAYDRPELSLGSY-------------------------------------- 509
Cdd:PTZ00395 818 msftkhisnslvandskggnkatsasafgDSGDANFLAGGGYtnyggaggyntydnqsgynnhdvvnnrggsgagnhlyg 897
|
650 660 670 680 690 700 710 720
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 568987261 510 ------EFLATVD------------YCKNN---------------KFPS-----PPAFIFMIDVSYNAIRTGLVRLLCEE 551
Cdd:PTZ00395 898 kdhdvqNFDNVMDnanftihdmknlICEKNgepdsakirrnsflaKYPQvknmlPPYFVFVVECSYNAIYNNITYTILEG 977
|
730 740 750 760 770 780 790 800
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 568987261 552 LKSLLDYL--PReggaeesaIRVGFVTYNKVLHFYNVKSSLAQP-------------QMMVVSDVADMFVPL-LDGFLVN 615
Cdd:PTZ00395 978 IRYAVQNVkcPQ--------TKIAIITFNSSIYFYHCKGGKGVSgeegdggggsgnhQVIVMSDVDDPFLPLpLEDLFFG 1049
|
810 820 830 840 850 860 870 880
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 568987261 616 VSESRAVITSLLDQIPEMFADTRETETVFAPVIQAGMEALKAAECAGKLFLFHTSLPIAeAPGKLKnrddrKLINTDKEK 695
Cdd:PTZ00395 1050 CVEEIDKINTLIDTIKSVSTTMQSYGSCGNSALKIAMDMLKERNGLGSICMFYTTTPNC-GIGAIK-----ELKKDLQEN 1123
|
890 900 910 920 930 940 950 960
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 568987261 696 TLFQPQTGTYQTLAKECVAQGCCVDLFLFP--NQYVDVATLSVVPQLTGGSVYKYACFQVEND-QERFLSDLRRDVQKVV 772
Cdd:PTZ00395 1124 FLEVKQKIFYDSLLLDLYAFNISVDIFIISsnNVRVCVPSLQYVAQNTGGKILFVENFLWQKDyKEIYMNIMDTLTSEDI 1203
|
970 980 990 1000 1010 1020 1030 1040
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 568987261 773 GFDAVMRVRTSTGIRAVDFFGAFYMSNTT----DVELAGLDGDKTVTVEFKHDDRLNEENGALLQCALLYTSCAGQRRLR 848
Cdd:PTZ00395 1204 AYCCELKLRYSHHMSVKKLFCCNNNFNSIisvdTIKIPKIRHDQTFAFLLNYSDISESKKQIYFQCACIYTNLWGDRFVR 1283
|
1050 1060 1070 1080 1090 1100 1110 1120
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 568987261 849 IHNLALNCCTQLADLYRNCETDTLINYMAKFAYRAVLNSpvKTVRDTLITQCAQILACYRKNCASPSSAGQLILPECMKL 928
Cdd:PTZ00395 1284 LHTTHMNLTSSLSTVFRYTDAEALMNILIKQLCTNILHN--DNYSKIIIDNLAAILFSYRINCASSAHSGQLILPDTLKL 1361
|
1130 1140 1150 1160 1170 1180 1190 1200
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 568987261 929 LPVYLNCVLKSDVLQpgAEVTTDDRAYVRQLVSSMDVAETNVFFYPRLLPL-VRTKS-PLDSTAE------PPAVRASEE 1000
Cdd:PTZ00395 1362 LPLFTSSLLKHNVTK--KEILHDLKVYSLIKLLSMPIISSLLYVYPVMYVIhIKGKTnEIDSMDVdddlfiPKTIPSSAE 1439
|
1210 1220 1230 1240 1250 1260 1270 1280
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 568987261 1001 RLSSGDIYLLENGLNLFVWVGASVQQGVVQSLFNVSSFSQITSGLSvlpVLDNPLSKKVRGLIDSL-RAQRM-RYMKLIV 1078
Cdd:PTZ00395 1440 KIYSNGIYLLDACTHFYLYFGFHSDANFAKEIVGDIPTEKNAHELN---LTDTPNAQKVQRIIKNLsRIHHFnKYVPLVM 1516
|
1290 1300 1310 1320
....*....|....*....|....*....|....*....|
gi 568987261 1079 VKQEDKLEMLFKHFLVEDKSlSGGASYVDFLCHMHKEIRQ 1118
Cdd:PTZ00395 1517 VAPKSNEEEHLISLCVEDKA-DKEYSYVNFLCFIHKLVHK 1555
|
|
| Sec23_helical |
pfam04815 |
Sec23/Sec24 helical domain; COPII-coated vesicles carry proteins from the endoplasmic ... |
870-968 |
2.77e-35 |
|
Sec23/Sec24 helical domain; COPII-coated vesicles carry proteins from the endoplasmic reticulum to the Golgi complex. This vesicular transport can be reconstituted by using three cytosolic components containing five proteins: the small GTPase Sar1p, the Sec23p/24p complex, and the Sec13p/Sec31p complex. This domain is composed of five alpha helices.
Pssm-ID: 461441 [Multi-domain] Cd Length: 103 Bit Score: 129.54 E-value: 2.77e-35
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 568987261 870 DTLINYMAKFAYRAVLNSPVKTVRDTLITQCAQILACYRKNCASPSSAGQLILPECMKLLPVYLNCVLKSDVLQPGAEVT 949
Cdd:pfam04815 3 EAIAVLLAKKAVEKALSSSLSDAREALDNKLVDILAAYRKYCASSSSPGQLILPESLKLLPLYMLALLKSPALRGGNSSP 82
|
90
....*....|....*....
gi 568987261 950 TDDRAYVRQLVSSMDVAET 968
Cdd:pfam04815 83 SDERAYARHLLLSLPVEEL 101
|
|
| Sec23_BS |
pfam08033 |
Sec23/Sec24 beta-sandwich domain; |
773-856 |
3.30e-28 |
|
Sec23/Sec24 beta-sandwich domain;
Pssm-ID: 429794 [Multi-domain] Cd Length: 86 Bit Score: 108.78 E-value: 3.30e-28
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 568987261 773 GFDAVMRVRTSTGIRAVDFFGAFYMSNTTD-VELAGLDGDKTVTVEFKHDDRLNEENGALLQCALLYTSCAGQRRLRIHN 851
Cdd:pfam08033 1 GFNAVLRVRTSKGLKVSGFIGNFVSRSSGDtWKLPSLDPDTSYAFEFDIDEPLPNGSNAYIQFALLYTHSSGERRIRVTT 80
|
....*
gi 568987261 852 LALNC 856
Cdd:pfam08033 81 VALPV 85
|
|
| PLN00162 |
PLN00162 |
transport protein sec23; Provisional |
403-849 |
1.88e-17 |
|
transport protein sec23; Provisional
Pssm-ID: 215083 [Multi-domain] Cd Length: 761 Bit Score: 87.69 E-value: 1.88e-17
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 568987261 403 SYNI-PCTSDMAKQAQVPLAAVIKPLARLPPEEASPYvvdhgesGPLRCNRCKAYMCPLMTFIEGGRRFQCSFCSCVNDV 481
Cdd:PLN00162 15 SWNVwPSSKIEASKCVIPLAALYTPLKPLPELPVLPY-------DPLRCRTCRAVLNPYCRVDFQAKIWICPFCFQRNHF 87
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 568987261 482 PPQYF----QHLDhtgkrvdaydrPELslgsYEFLATVDY---CKNNKFPSPPAFIFMIDVSynAIRTGLvRLLCEELKS 554
Cdd:PLN00162 88 PPHYSsiseTNLP-----------AEL----FPQYTTVEYtlpPGSGGAPSPPVFVFVVDTC--MIEEEL-GALKSALLQ 149
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 568987261 555 LLDYLPreggaeESAiRVGFVTY----------------------------NKVLHFYNVKSSLAQPQMMVVSDVADMFV 606
Cdd:PLN00162 150 AIALLP------ENA-LVGLITFgthvhvhelgfsecsksyvfrgnkevskDQILEQLGLGGKKRRPAGGGIAGARDGLS 222
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 568987261 607 PL-LDGFLVNVSESRAVITSLLDQI-PEMF---ADTRETE-TVFAPVIQAGMEALKAAECAGKLFLFhTSLPIAEAPGKL 680
Cdd:PLN00162 223 SSgVNRFLLPASECEFTLNSALEELqKDPWpvpPGHRPARcTGAALSVAAGLLGACVPGTGARIMAF-VGGPCTEGPGAI 301
|
330 340 350 360 370 380 390 400
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 568987261 681 KNRDDRKLINTDKE-----KTLFQPQTGTYQTLAKECVAQGCCVDLFLFPNQYVDVATLSVVPQLTGGSVYKYACFqven 755
Cdd:PLN00162 302 VSKDLSEPIRSHKDldkdaAPYYKKAVKFYEGLAKQLVAQGHVLDVFACSLDQVGVAEMKVAVERTGGLVVLAESF---- 377
|
410 420 430 440 450 460 470 480
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 568987261 756 DQERFLSDLRRDVQKV------VGFDAVMRVRTSTGIRAVDFFG---------------AFYMSNTTDVELAGLDGDKTV 814
Cdd:PLN00162 378 GHSVFKDSLRRVFERDgegslgLSFNGTFEVNCSKDVKVQGAIGpcaslekkgpsvsdtEIGEGGTTAWKLCGLDKKTSL 457
|
490 500 510 520
....*....|....*....|....*....|....*....|
gi 568987261 815 TVEF----KHDDRLNEENGAL-LQCALLYTSCAGQRRLRI 849
Cdd:PLN00162 458 AVFFevanSGQSNPQPPGQQFfLQFLTRYQHSNGQTRLRV 497
|
|
| SEC23 |
COG5047 |
Vesicle coat complex COPII, subunit SEC23 [Intracellular trafficking and secretion]; |
399-1026 |
1.79e-15 |
|
Vesicle coat complex COPII, subunit SEC23 [Intracellular trafficking and secretion];
Pssm-ID: 227380 [Multi-domain] Cd Length: 755 Bit Score: 81.47 E-value: 1.79e-15
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 568987261 399 IRCTSYNIPCTSDMAKQAQVPLAAVIKPLARLPPEEASPYvvdhgesGPLRCNR-CKAYMCPLMTFIEGGRRFQCSFCSC 477
Cdd:COG5047 12 IRLTWNVFPATRGDATRTVIPIACLYTPLHEDDALTVNYY-------EPVKCTApCKAVLNPYCHIDERNQSWICPFCNQ 84
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 568987261 478 VNDVPPQYfqhLDHTGKRVDaydrPELSLGSyeflATVDYCKNNKFPSPPAFIFMIDVSYNAIRtglVRLLCEELKSLLD 557
Cdd:COG5047 85 RNTLPPQY---RDISNANLP----LELLPQS----STIEYTLSKPVILPPVFFFVVDACCDEEE---LTALKDSLIVSLS 150
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 568987261 558 YLPREggaeesAIrVGFVTYNKVLHFYNVkSSLAQPQMMVVSDVADMFVPLLD--------------------------- 610
Cdd:COG5047 151 LLPPE------AL-VGLITYGTSIQVHEL-NAENHRRSYVFSGNKEYTKENLQellalskptksggfeskisgigqfass 222
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 568987261 611 GFLVNVSESRAVITSLLDQI-PEMF---ADTRETE-TVFAPVIQAGMEALKAAECAGKLFLFhTSLPIAEAPGKLKNRDD 685
Cdd:COG5047 223 RFLLPTQQCEFKLLNILEQLqPDPWpvpAGKRPLRcTGSALNIASSLLEQCFPNAGCHIVLF-AGGPCTVGPGTVVSTEL 301
|
330 340 350 360 370 380 390 400
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 568987261 686 RK------LINTDKEKtLFQPQTGTYQTLAKECVAQGCCVDLFLFPNQYVDVATLSVVPQLTGGSVYKYACFQVENDQER 759
Cdd:COG5047 302 KEpmrshhDIESDSAQ-HSKKATKFYKGLAERVANQGHALDIFAGCLDQIGIMEMEPLTTSTGGALVLSDSFTTSIFKQS 380
|
410 420 430 440 450 460 470 480
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 568987261 760 FLSDLRRDVQK--VVGFDAVMRVRTSTGIRAVDFFG---------------AFYMSNTTDVELAGLDGDKTVTVEFKHDD 822
Cdd:COG5047 381 FQRIFNRDSEGylKMGFNANMEVKTSKNLKIKGLIGhavsvkkkannisdsEIGIGATNSWKMASLSPKSNYALYFEIAL 460
|
490 500 510 520 530 540 550 560
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 568987261 823 RLNEENG-----ALLQCALLYTSCAGQRRLRIHNLALNCCTQLADL-YRNCETDTLINYMAKFA-YRAVLNSPVKTVR-- 893
Cdd:COG5047 461 GAASGSAqrpaeAYIQFITTYQHSSGTYRIRVTTVARMFTDGGLPKiNRSFDQEAAAVFMARIAaFKAETEDIIDVFRwi 540
|
570 580 590 600 610 620 630 640
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 568987261 894 -DTLITQCaQILACYRKNcaSPSSAGqliLPECMKLLPVYLNCVLKSDVLQPGAEvTTDDRAYVRQLVSSMDVAETNVFF 972
Cdd:COG5047 541 dRNLIRLC-QKFADYRKD--DPSSFR---LDPNFTLYPQFMYHLRRSPFLSVFNN-SPDETAFYRHMLNNADVNDSLIMI 613
|
650 660 670 680 690
....*....|....*....|....*....|....*....|....*....|....*...
gi 568987261 973 YPRLLPLVRTKSP----LDSTAEPPAVraseerlssgdIYLLENGLNLFVWVGASVQQ 1026
Cdd:COG5047 614 QPTLQSYSFEKGGvpvlLDSVSVKPDV-----------ILLLDTFFHILIFHGSYIAQ 660
|
|
| zf-Sec23_Sec24 |
pfam04810 |
Sec23/Sec24 zinc finger; COPII-coated vesicles carry proteins from the endoplasmic reticulum ... |
447-484 |
7.15e-15 |
|
Sec23/Sec24 zinc finger; COPII-coated vesicles carry proteins from the endoplasmic reticulum to the Golgi complex. This vesicular transport can be reconstituted by using three cytosolic components containing five proteins: the small GTPase Sar1p, the Sec23p/24p complex, and the Sec13p/Sec31p complex. This domain is a zinc-binding domain.
Pssm-ID: 461437 [Multi-domain] Cd Length: 38 Bit Score: 69.40 E-value: 7.15e-15
10 20 30
....*....|....*....|....*....|....*...
gi 568987261 447 PLRCNRCKAYMCPLMTFIEGGRRFQCSFCSCVNDVPPQ 484
Cdd:pfam04810 1 PVRCRRCRAYLNPFCQFDFGGKKWTCNFCGTRNPVPPE 38
|
|
| Gelsolin |
pfam00626 |
Gelsolin repeat; |
991-1063 |
1.15e-11 |
|
Gelsolin repeat;
Pssm-ID: 395501 [Multi-domain] Cd Length: 76 Bit Score: 61.55 E-value: 1.15e-11
10 20 30 40 50 60 70
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*
gi 568987261 991 EPPAVRASEERLSSGDIYLLENGLNLFVWVGASVQQgvVQSLFNVSSFSQI-TSGLSVLPVLDN-PLSKKVRGLI 1063
Cdd:pfam00626 4 LPPPVPLSQESLNSGDCYLLDNGFTIFLWVGKGSSL--LEKLFAALLAAQLdDDERFPLPEVIRvPQGKEPARFL 76
|
|
| Atrophin-1 |
pfam03154 |
Atrophin-1 family; Atrophin-1 is the protein product of the dentatorubral-pallidoluysian ... |
25-232 |
2.16e-08 |
|
Atrophin-1 family; Atrophin-1 is the protein product of the dentatorubral-pallidoluysian atrophy (DRPLA) gene. DRPLA (OMIM:125370) is a progressive neurodegenerative disorder. It is caused by the expansion of a CAG repeat in the DRPLA gene on chromosome 12p. This results in an extended polyglutamine region in atrophin-1, which is thought to confer toxicity to the protein, possibly through altering its interactions with other proteins. The expansion of a CAG repeat is also the underlying defect in six other neurodegenerative disorders, including Huntington's disease. One interaction of expanded polyglutamine repeats that is thought to be pathogenic is that with the short glutamine repeat in the transcriptional coactivator CREB binding protein, CBP. This interaction draws CBP away from its usual nuclear location to the expanded polyglutamine repeat protein aggregates that are characteristic of the polyglutamine neurodegenerative disorders. This interferes with CBP-mediated transcription and causes cytotoxicity.
Pssm-ID: 460830 [Multi-domain] Cd Length: 991 Bit Score: 58.63 E-value: 2.16e-08
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 568987261 25 QSSYGGQPGPAAPATPYGAYNGPVPGYQQAPPQGVP--RAPPSSGAPPAS------------AAQVPCGQTTYGQFGQG- 89
Cdd:pfam03154 177 QSGAASPPSPPPPGTTQAATAGPTPSAPSVPPQGSPatSQPPNQTQSTAAphtliqqtptlhPQRLPSPHPPLQPMTQPp 256
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 568987261 90 -DIQNGPSST------AQMQRVPGSQQFGPPLAPvvsQPAVLQPYGPPPTSTQvtaqlagmqisgavAQAPPPsglgygP 162
Cdd:pfam03154 257 pPSQVSPQPLpqpslhGQMPPMPHSLQTGPSHMQ---HPVPPQPFPLTPQSSQ--------------SQVPPG------P 313
|
170 180 190 200 210 220 230
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....
gi 568987261 163 PTSLASASGNFPNSGPYGSYPQSQAPPLSQ-----AQGHPGVQPPLRSA-PPLASSFT---SPASGGPQMPSMTGLLPP 232
Cdd:pfam03154 314 SPAAPGQSQQRIHTPPSQSQLQSQQPPREQplppaPLSMPHIKPPPTTPiPQLPNPQShkhPPHLSGPSPFQMNSNLPP 392
|
|
| Retinal |
pfam15449 |
Retinal protein; This family of proteins is found in the photoreceptor cells of the retina. ... |
12-235 |
3.15e-08 |
|
Retinal protein; This family of proteins is found in the photoreceptor cells of the retina. Mutations of the gene encoding this protein have been associated with retinal disorders such as retinitis pigmentosa and late-onset progressive retinal atrophy. The function of this family of proteins is unknown, but it is likely to be important in the development and function of the retina.
Pssm-ID: 464722 [Multi-domain] Cd Length: 1293 Bit Score: 58.25 E-value: 3.15e-08
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 568987261 12 PYGQNQPIYPGYHQSSYGGQPGPAAPAT--PYGAYNGPvpgyqQAPPQGVPRAPP-SSGAPPASAAQVPcgQTTYGQFG- 87
Cdd:pfam15449 964 LSKQPRKAIPWHHSSHTSGQSRTSEPSLarPTRGPHSP-----EAPRQSQERSPPlVRKASPTRAHWAP--RADKRHPSl 1036
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 568987261 88 ---QGDIQngpSSTAQMQRVPGsqqfgPPLAPVVSQPAVLQPYGPPPTSTQVTAQLAGMQISGAVAQAPPPSGLGYGPPT 164
Cdd:pfam15449 1037 pssHRPAQ---PSLPTVQRSPS-----PPLSPRAPSPPRSPRVLSPPTSKKRTSPPPQHKLPSPPPESPPAQHKLSSPPT 1108
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 568987261 165 SLASASGnfPNSGPygsypqSQAPPLSQAQGHPGV------QPPLRSAPPLASSFTSPASGGP---QMPSMTG--LLPPG 233
Cdd:pfam15449 1109 QRTEASS--PSSGP------SPSPPTSPSQGHKETrdsedsQAATAKASGNTCSIFCPATSSLfeaKSPFSTAhpLLPPE 1180
|
..
gi 568987261 234 QG 235
Cdd:pfam15449 1181 AG 1182
|
|
| PHA03247 |
PHA03247 |
large tegument protein UL36; Provisional |
7-344 |
6.13e-08 |
|
large tegument protein UL36; Provisional
Pssm-ID: 223021 [Multi-domain] Cd Length: 3151 Bit Score: 57.26 E-value: 6.13e-08
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 568987261 7 APPVPPYGQNQPIYPGYHQSSYGGQPGPAAPATPYGAYNGPVPGYQQAPPQGVPRAPPSSGAPPASAAQVPCGQTTygqf 86
Cdd:PHA03247 2709 EPAPHALVSATPLPPGPAAARQASPALPAAPAPPAVPAGPATPGGPARPARPPTTAGPPAPAPPAAPAAGPPRRLT---- 2784
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 568987261 87 gqgdIQNGPSSTAQMQRVPGSQQFGPPLAPVVSQPAVLQPYG------PPPTSTQVTAqlagmqisgavaqAPPPSGLgy 160
Cdd:PHA03247 2785 ----RPAVASLSESRESLPSPWDPADPPAAVLAPAAALPPAAspagplPPPTSAQPTA-------------PPPPPGP-- 2845
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 568987261 161 gPPTSLASASGNFPnSGPYGSYPQSQAPPLSQA-QGHPGVQPPLRSAPPLAS-SFTSPASGgPQMPSMTGLLPPGQGFGS 238
Cdd:PHA03247 2846 -PPPSLPLGGSVAP-GGDVRRRPPSRSPAAKPAaPARPPVRRLARPAVSRSTeSFALPPDQ-PERPPQPQAPPPPQPQPQ 2922
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 568987261 239 LPVnqanhvssppapalppgtqmtgpPVPPPPPMHSPQQPGYQLQQNGSFGPARGPQPNYESPYPGAPTFGSqpgppqPL 318
Cdd:PHA03247 2923 PPP-----------------------PPQPQPPPPPPPRPQPPLAPTTDPAGAGEPSGAVPQPWLGALVPGR------VA 2973
|
330 340
....*....|....*....|....*.
gi 568987261 319 PPKRLDPDAIPSPQLNELPPQQKTRH 344
Cdd:PHA03247 2974 VPRFRVPQPAPSREAPASSTPPLTGH 2999
|
|
| Atrophin-1 |
pfam03154 |
Atrophin-1 family; Atrophin-1 is the protein product of the dentatorubral-pallidoluysian ... |
5-306 |
6.75e-08 |
|
Atrophin-1 family; Atrophin-1 is the protein product of the dentatorubral-pallidoluysian atrophy (DRPLA) gene. DRPLA (OMIM:125370) is a progressive neurodegenerative disorder. It is caused by the expansion of a CAG repeat in the DRPLA gene on chromosome 12p. This results in an extended polyglutamine region in atrophin-1, which is thought to confer toxicity to the protein, possibly through altering its interactions with other proteins. The expansion of a CAG repeat is also the underlying defect in six other neurodegenerative disorders, including Huntington's disease. One interaction of expanded polyglutamine repeats that is thought to be pathogenic is that with the short glutamine repeat in the transcriptional coactivator CREB binding protein, CBP. This interaction draws CBP away from its usual nuclear location to the expanded polyglutamine repeat protein aggregates that are characteristic of the polyglutamine neurodegenerative disorders. This interferes with CBP-mediated transcription and causes cytotoxicity.
Pssm-ID: 460830 [Multi-domain] Cd Length: 991 Bit Score: 57.08 E-value: 6.75e-08
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 568987261 5 QSAPPVPPygqnQPIYPGYHQSSYGGqPGPAAPATPygayNGPVPGYQQAPPQGVPRAPPSS---GAPPASAAQVPCGQT 81
Cdd:pfam03154 177 QSGAASPP----SPPPPGTTQAATAG-PTPSAPSVP----PQGSPATSQPPNQTQSTAAPHTliqQTPTLHPQRLPSPHP 247
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 568987261 82 TYGQFGQG--DIQNGPSST------AQMQRVPGSQQFGPPLAPvvsQPAVLQPYGPPPTSTQvtaqlagmqisgavAQAP 153
Cdd:pfam03154 248 PLQPMTQPppPSQVSPQPLpqpslhGQMPPMPHSLQTGPSHMQ---HPVPPQPFPLTPQSSQ--------------SQVP 310
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 568987261 154 PPsglgygPPTSLASASGNFPNSGPYGSYPQSQAPPLSQA-----QGHPGVQPPLRSA-PPLASSFT---SPASGGPQMP 224
Cdd:pfam03154 311 PG------PSPAAPGQSQQRIHTPPSQSQLQSQQPPREQPlppapLSMPHIKPPPTTPiPQLPNPQShkhPPHLSGPSPF 384
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 568987261 225 SMTGLLPPG---QGFGSLPVNQANHVSSPPAPALPPGTQMTGPPVPPPPpmhspqqpgyqLQQNGSFGPARGPQPNYESP 301
Cdd:pfam03154 385 QMNSNLPPPpalKPLSSLSTHHPPSAHPPPLQLMPQSQQLPPPPAQPPV-----------LTQSQSLPPPAASHPPTSGL 453
|
....*
gi 568987261 302 YPGAP 306
Cdd:pfam03154 454 HQVPS 458
|
|
| PHA03247 |
PHA03247 |
large tegument protein UL36; Provisional |
4-385 |
2.99e-07 |
|
large tegument protein UL36; Provisional
Pssm-ID: 223021 [Multi-domain] Cd Length: 3151 Bit Score: 54.94 E-value: 2.99e-07
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 568987261 4 NQSAPPVPPygQNQPIYPGY----HQSSYGGQPG-PAAPATPYGAYNGPVPGYQQAPPQGVPRAPPSSGAPPASAAQVPc 78
Cdd:PHA03247 2565 DRSVPPPRP--APRPSEPAVtsraRRPDAPPQSArPRAPVDDRGDPRGPAPPSPLPPDTHAPDPPPPSPSPAANEPDPH- 2641
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 568987261 79 GQTTYGQFGQGDIQNGPSSTAQMQRVPGSQQFGPPLAPvvsqPAVLQPYGPPPTSTQVTAQLAGMQISGAVAQAPPPSGL 158
Cdd:PHA03247 2642 PPPTVPPPERPRDDPAPGRVSRPRRARRLGRAAQASSP----PQRPRRRAARPTVGSLTSLADPPPPPPTPEPAPHALVS 2717
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 568987261 159 GYGPPTSLASASGNFPNSGPYGSYPQSQAPPLSQAQGHPGVQPPLRSAPPLASSFTSPASGGPQMPSMTGLLPPGQGFGS 238
Cdd:PHA03247 2718 ATPLPPGPAAARQASPALPAAPAPPAVPAGPATPGGPARPARPPTTAGPPAPAPPAAPAAGPPRRLTRPAVASLSESRES 2797
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 568987261 239 LPVNQANHVSSPPAPALPPGTQMTGPPVPPPPPMHSPqqpgyqlqQNGSFGPARGPQPNYESP----YPGA-----PTFG 309
Cdd:PHA03247 2798 LPSPWDPADPPAAVLAPAAALPPAASPAGPLPPPTSA--------QPTAPPPPPGPPPPSLPLggsvAPGGdvrrrPPSR 2869
|
330 340 350 360 370 380 390 400
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 568987261 310 SQPGPPQPLPPKRLD----PDAIPSPQLNELPPQQKTRHRiDPDAIPSPIQVIEDDRNNRGSEPFVTGVRGQVPPLVTTN 385
Cdd:PHA03247 2870 SPAAKPAAPARPPVRrlarPAVSRSTESFALPPDQPERPP-QPQAPPPPQPQPQPPPPPQPQPPPPPPPRPQPPLAPTTD 2948
|
|
| PRK07003 |
PRK07003 |
DNA polymerase III subunit gamma/tau; |
29-219 |
3.75e-07 |
|
DNA polymerase III subunit gamma/tau;
Pssm-ID: 235906 [Multi-domain] Cd Length: 830 Bit Score: 54.47 E-value: 3.75e-07
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 568987261 29 GGQPGPAAPATPYGAYNGPVP------GYQQAPPQGVPRAPPSSGAPPASAAQVPCGQTTYGQFGQGDIQNGPSSTAQMQ 102
Cdd:PRK07003 365 GGAPGGGVPARVAGAVPAPGAraaaavGASAVPAVTAVTGAAGAALAPKAAAAAAATRAEAPPAAPAPPATADRGDDAAD 444
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 568987261 103 RVPGSQQFGPPLAPVVSQPAvlqPYGPPPTSTQVTAQLAGMQISGAVAQAPPPSGLGYGPPTSLASASGNFPNSGPYGSY 182
Cdd:PRK07003 445 GDAPVPAKANARASADSRCD---ERDAQPPADSGSASAPASDAPPDAAFEPAPRAAAPSAATPAAVPDARAPAAASREDA 521
|
170 180 190
....*....|....*....|....*....|....*..
gi 568987261 183 PQSQAPPlsqaqghpgvQPPLRSAPPLASSFTSPASG 219
Cdd:PRK07003 522 PAAAAPP----------APEARPPTPAAAAPAARAGG 548
|
|
| PRK07764 |
PRK07764 |
DNA polymerase III subunits gamma and tau; Validated |
29-224 |
4.60e-07 |
|
DNA polymerase III subunits gamma and tau; Validated
Pssm-ID: 236090 [Multi-domain] Cd Length: 824 Bit Score: 54.22 E-value: 4.60e-07
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 568987261 29 GGQPGPAAPATPYGAYNGPVPGYQQAPPQGVPRAPPSSGAPPASAAQVPCGQTTYGQFGQGDIQNGPSSTAQMQRVPGSQ 108
Cdd:PRK07764 595 AGGEGPPAPASSGPPEEAARPAAPAAPAAPAAPAPAGAAAAPAEASAAPAPGVAAPEHHPKHVAVPDASDGGDGWPAKAG 674
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 568987261 109 QFGPPLAPVVSQPAVLQPYGPPPTSTQVTAQLAGMQISGAVAQAPPPSglgyGPPTSLASASGNFPNSGPYGSYPQSQAP 188
Cdd:PRK07764 675 GAAPAAPPPAPAPAAPAAPAGAAPAQPAPAPAATPPAGQADDPAAQPP----QAAQGASAPSPAADDPVPLPPEPDDPPD 750
|
170 180 190
....*....|....*....|....*....|....*.
gi 568987261 189 PLSQAQGHPGVQPPLRSAPPLASSFTSPASGGPQMP 224
Cdd:PRK07764 751 PAGAPAQPPPPPAPAPAAAPAAAPPPSPPSEEEEMA 786
|
|
| PRK12323 |
PRK12323 |
DNA polymerase III subunit gamma/tau; |
29-240 |
8.50e-07 |
|
DNA polymerase III subunit gamma/tau;
Pssm-ID: 237057 [Multi-domain] Cd Length: 700 Bit Score: 53.34 E-value: 8.50e-07
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 568987261 29 GGQPGPAAPATpygAYNGPVPgyQQAPPQGVPR-APPSSGAPPASAAQVPCGQTTYGQFGQGDIQNGPSSTA-----QMQ 102
Cdd:PRK12323 366 GQSGGGAGPAT---AAAAPVA--QPAPAAAAPAaAAPAPAAPPAAPAAAPAAAAAARAVAAAPARRSPAPEAlaaarQAS 440
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 568987261 103 RVPGSQQFGPPLAPVVS-----QPAVLQPYGPPPTSTQVTAQLAGMQISGAVAQAPPPSGlgyGPPTSLASASGNFPNSG 177
Cdd:PRK12323 441 ARGPGGAPAPAPAPAAApaaaaRPAAAGPRPVAAAAAAAPARAAPAAAPAPADDDPPPWE---ELPPEFASPAPAQPDAA 517
|
170 180 190 200 210 220
....*....|....*....|....*....|....*....|....*....|....*....|...
gi 568987261 178 PYGSYPQSQAPPLSQAQghPGVQPPLRSAPPLASSFTSPASGGPQMPSMtgllPPGQGFGSLP 240
Cdd:PRK12323 518 PAGWVAESIPDPATADP--DDAFETLAPAPAAAPAPRAAAATEPVVAPR----PPRASASGLP 574
|
|
| PHA03307 |
PHA03307 |
transcriptional regulator ICP4; Provisional |
25-225 |
1.52e-06 |
|
transcriptional regulator ICP4; Provisional
Pssm-ID: 223039 [Multi-domain] Cd Length: 1352 Bit Score: 52.48 E-value: 1.52e-06
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 568987261 25 QSSYGGQPGPAAPATPYGAYNGPVPGY---QQAPPQGVPRAPPSSGAPPASAAQVPCGQTTYGQfgQGDIQNGPSSTAqm 101
Cdd:PHA03307 108 PPGPSSPDPPPPTPPPASPPPSPAPDLsemLRPVGSPGPPPAASPPAAGASPAAVASDAASSRQ--AALPLSSPEETA-- 183
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 568987261 102 qRVPGSqqfGPPLAPVVSQPAVLQPYGPPPTSTQVTAQLAGmQISGAVAQAPPPSGLGYGPPTSLASASGNFPNSGPYGS 181
Cdd:PHA03307 184 -RAPSS---PPAEPPPSTPPAAASPRPPRRSSPISASASSP-APAPGRSAADDAGASSSDSSSSESSGCGWGPENECPLP 258
|
170 180 190 200
....*....|....*....|....*....|....*....|....
gi 568987261 182 YPQSQAPPLSQAQGHPGVQPPLRsAPPLASSFTSPASGGPQMPS 225
Cdd:PHA03307 259 RPAPITLPTRIWEASGWNGPSSR-PGPASSSSSPRERSPSPSPS 301
|
|
| Atrophin-1 |
pfam03154 |
Atrophin-1 family; Atrophin-1 is the protein product of the dentatorubral-pallidoluysian ... |
5-354 |
1.67e-06 |
|
Atrophin-1 family; Atrophin-1 is the protein product of the dentatorubral-pallidoluysian atrophy (DRPLA) gene. DRPLA (OMIM:125370) is a progressive neurodegenerative disorder. It is caused by the expansion of a CAG repeat in the DRPLA gene on chromosome 12p. This results in an extended polyglutamine region in atrophin-1, which is thought to confer toxicity to the protein, possibly through altering its interactions with other proteins. The expansion of a CAG repeat is also the underlying defect in six other neurodegenerative disorders, including Huntington's disease. One interaction of expanded polyglutamine repeats that is thought to be pathogenic is that with the short glutamine repeat in the transcriptional coactivator CREB binding protein, CBP. This interaction draws CBP away from its usual nuclear location to the expanded polyglutamine repeat protein aggregates that are characteristic of the polyglutamine neurodegenerative disorders. This interferes with CBP-mediated transcription and causes cytotoxicity.
Pssm-ID: 460830 [Multi-domain] Cd Length: 991 Bit Score: 52.46 E-value: 1.67e-06
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 568987261 5 QSAPPVPPYGQN--QPIYPGYHQssyggqpGPAAPAtPYGAYNGPVPGYQQAPPQGVPRAPPSSGA--PPASAAQVPcgq 80
Cdd:pfam03154 250 QPMTQPPPPSQVspQPLPQPSLH-------GQMPPM-PHSLQTGPSHMQHPVPPQPFPLTPQSSQSqvPPGPSPAAP--- 318
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 568987261 81 ttyGQFGQGDIQNGPSSTAQMQRVPGSQQFGP---PLAPVVSQPAVLQPYGPPPTSTQVTAQLAGMQISGAVAQAPPPSG 157
Cdd:pfam03154 319 ---GQSQQRIHTPPSQSQLQSQQPPREQPLPPaplSMPHIKPPPTTPIPQLPNPQSHKHPPHLSGPSPFQMNSNLPPPPA 395
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 568987261 158 LgygppTSLASASGNFPNSG---PYGSYPQSQ---APP-----LSQAQGHPgvqPPLRSAPPLASSFTSPA-SGGPQMPS 225
Cdd:pfam03154 396 L-----KPLSSLSTHHPPSAhppPLQLMPQSQqlpPPPaqppvLTQSQSLP---PPAASHPPTSGLHQVPSqSPFPQHPF 467
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 568987261 226 MTGLLPPgqgfgslpvnqanhVSSPPAPALPPGTQMTGppvpppppmhspqqpgyqLQQNGSFGPARGpqpnyeSPYPGA 305
Cdd:pfam03154 468 VPGGPPP--------------ITPPSGPPTSTSSAMPG------------------IQPPSSASVSSS------GPVPAA 509
|
330 340 350 360
....*....|....*....|....*....|....*....|....*....
gi 568987261 306 PTfgsqpgppQPLPPKRLDPDAIPSPQLNELPPQQKTRHRIDPDAIPSP 354
Cdd:pfam03154 510 VS--------CPLPPVQIKEEALDEAEEPESPPPPPRSPSPEPTVVNTP 550
|
|
| PHA03377 |
PHA03377 |
EBNA-3C; Provisional |
8-197 |
2.64e-06 |
|
EBNA-3C; Provisional
Pssm-ID: 177614 [Multi-domain] Cd Length: 1000 Bit Score: 51.98 E-value: 2.64e-06
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 568987261 8 PPVPPYGQNQPIYPGYHQSSYGGQPGPAAPATPYGAYNGPVPGYQQAP----PQGvPRAPPSSGAPPASAAQVPCGQTTY 83
Cdd:PHA03377 770 PQAPYLGYQEPQAQGVQVSSYPGYAGPWGLRAQHPRYRHSWAYWSQYPghghPQG-PWAPRPPHLPPQWDGSAGHGQDQV 848
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 568987261 84 GQFGQGDIQNGPSS--TAQMQRVPGSQQFGPPLAPVVSQPAVLQPYGPPPTstqvtaqlagmqisgavaQAPPPSGLGYG 161
Cdd:PHA03377 849 SQFPHLQSETGPPRlqLSQVPQLPYSQTLVSSSAPSWSSPQPRAPIRPIPT------------------RFPPPPMPLQD 910
|
170 180 190
....*....|....*....|....*....|....*..
gi 568987261 162 PPTSLASASGNFPNSGPYGS-YPQSQAPPLSQAQGHP 197
Cdd:PHA03377 911 SMAVGCDSSGTACPSMPFASdYSQGAFTPLDINAQTP 947
|
|
| PHA03377 |
PHA03377 |
EBNA-3C; Provisional |
21-232 |
3.47e-06 |
|
EBNA-3C; Provisional
Pssm-ID: 177614 [Multi-domain] Cd Length: 1000 Bit Score: 51.59 E-value: 3.47e-06
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 568987261 21 PGYHQSSYGGQPGPAAPATPYGAYNGPVP------GYQQAPPQGVPRAP-PSSGAPPASAAQVPCGQTTYgqfgqgdiqn 93
Cdd:PHA03377 741 PPSHQAPYSGHEEPQAQQAPYPGYWEPRPpqapylGYQEPQAQGVQVSSyPGYAGPWGLRAQHPRYRHSW---------- 810
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 568987261 94 gpsstAQMQRVPGsqqFGPPLAPVVSQPAVLQPYGPPptstqvTAQLAGMQISGAVAQAPPPsglgyGPPTslasasgnf 173
Cdd:PHA03377 811 -----AYWSQYPG---HGHPQGPWAPRPPHLPPQWDG------SAGHGQDQVSQFPHLQSET-----GPPR--------- 862
|
170 180 190 200 210
....*....|....*....|....*....|....*....|....*....|....*....
gi 568987261 174 pnsgpygsypqsqaPPLSQAQGHPGVQPPLRSAPPlasSFTSPASGGPQMPSMTGLLPP 232
Cdd:PHA03377 863 --------------LQLSQVPQLPYSQTLVSSSAP---SWSSPQPRAPIRPIPTRFPPP 904
|
|
| Med15 |
pfam09606 |
ARC105 or Med15 subunit of Mediator complex non-fungal; The approx. 70 residue Med15 domain of ... |
71-240 |
8.72e-06 |
|
ARC105 or Med15 subunit of Mediator complex non-fungal; The approximately 70-residue Med15 domain of the ARC-Mediator co-activator is a three-helix bundle with marked similarity to the KIX domain. The sterol regulatory element binding protein (SREBP) family of transcription activators uses the ARC105 subunit to activate target genes in the regulation of cholesterol and fatty acid homeostasis. In addition, Med15 is a critical transducer of gene activation signals that control early metazoan development.
Pssm-ID: 312941 [Multi-domain] Cd Length: 732 Bit Score: 50.01 E-value: 8.72e-06
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 568987261 71 ASAAQVPCGQTTYGQFGQGDIQNGPSSTAqMQRVPGSQQFGPPLAPVVS--QPAVLQPYGPPPTSTQVTAQL--AGMQIS 146
Cdd:pfam09606 57 AAQQQQPQGGQGNGGMGGGQQGMPDPINA-LQNLAGQGTRPQMMGPMGPgpGGPMGQQMGGPGTASNLLASLgrPQMPMG 135
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 568987261 147 GA--------VAQAPPPSGLGYGPPTSLASASGNFPNS-GPYGSYPQSQAP-PLSQAQGHPGVQPPLRSAPPLASSFTSP 216
Cdd:pfam09606 136 GAgfpsqmsrVGRMQPGGQAGGMMQPSSGQPGSGTPNQmGPNGGPGQGQAGgMNGGQQGPMGGQMPPQMGVPGMPGPADA 215
|
170 180
....*....|....*....|....
gi 568987261 217 ASGGPQMPSMTGLLPPGQGFGSLP 240
Cdd:pfam09606 216 GAQMGQQAQANGGMNPQQMGGAPN 239
|
|
| PRK07764 |
PRK07764 |
DNA polymerase III subunits gamma and tau; Validated |
29-225 |
1.10e-05 |
|
DNA polymerase III subunits gamma and tau; Validated
Pssm-ID: 236090 [Multi-domain] Cd Length: 824 Bit Score: 49.60 E-value: 1.10e-05
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 568987261 29 GGQPGPAAPATPygayngpvPGYQQAPPQGVPRAPPSSGAPPASAAQVPCGQTTYGQFGQGDiqnGPSSTAQMQRVPGSQ 108
Cdd:PRK07764 589 GPAPGAAGGEGP--------PAPASSGPPEEAARPAAPAAPAAPAAPAPAGAAAAPAEASAA---PAPGVAAPEHHPKHV 657
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 568987261 109 QFGPPLAPVVSQPAVLQPygPPPTSTQVTAQLAGMQISGAVAQAPPPSGLGYGPPTSLASASGNFPNSGPYGSYPQS--- 185
Cdd:PRK07764 658 AVPDASDGGDGWPAKAGG--AAPAAPPPAPAPAAPAAPAGAAPAQPAPAPAATPPAGQADDPAAQPPQAAQGASAPSpaa 735
|
170 180 190 200
....*....|....*....|....*....|....*....|..
gi 568987261 186 --QAPPLSQAQGHPGVQPPLRSAPPLASSFTSPASGGPQMPS 225
Cdd:PRK07764 736 ddPVPLPPEPDDPPDPAGAPAQPPPPPAPAPAAAPAAAPPPS 777
|
|
| Med15 |
pfam09606 |
ARC105 or Med15 subunit of Mediator complex non-fungal; The approx. 70 residue Med15 domain of ... |
4-297 |
1.81e-05 |
|
ARC105 or Med15 subunit of Mediator complex non-fungal; The approx. 70 residue Med15 domain of the ARC-Mediator co-activator is a three-helix bundle with marked similarity to the KIX domain. The sterol regulatory element binding protein (SREBP) family of transcription activators uses the ARC105 subunit to activate target genes in the regulation of cholesterol and fatty acid homeostasis. In addition, Med15 is a critical transducer of gene activation signals that control early metazoan development.
Pssm-ID: 312941 [Multi-domain] Cd Length: 732 Bit Score: 48.85 E-value: 1.81e-05
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 568987261 4 NQSAPPVPPYGQNQPIYPGYHQSSYGGQPGP---AAPATPYGAYNGPVPGYQQAPPQGVPRAPPSSGAP-------PASA 73
Cdd:pfam09606 112 QQMGGPGTASNLLASLGRPQMPMGGAGFPSQmsrVGRMQPGGQAGGMMQPSSGQPGSGTPNQMGPNGGPgqgqaggMNGG 191
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 568987261 74 AQVPCGQTTYGQFGQGdIQNGPSST-AQMQRVPGSQQFGPPLAPVVSQPAVLQPYGPP-PTSTQVTAQLAG---MQISGA 148
Cdd:pfam09606 192 QQGPMGGQMPPQMGVP-GMPGPADAgAQMGQQAQANGGMNPQQMGGAPNQVAMQQQQPqQQGQQSQLGMGInqmQQMPQG 270
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 568987261 149 VAQAPPPSGLG--YGPPTSLASASGNFPNSGPYGSYPQSQAPPLSQAQG--HPGVQPPLRSAPPLASSFTSPASGGPQMP 224
Cdd:pfam09606 271 VGGGAGQGGPGqpMGPPGQQPGAMPNVMSIGDQNNYQQQQTRQQQQQQGgnHPAAHQQQMNQSVGQGGQVVALGGLNHLE 350
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 568987261 225 SMTGLLPPGQGFGSLPVNQANHVSSPPAPALPPGTQMTGP---PVPPPPPMHSPQQPGYQLQQNGSFG----PARGPQPN 297
Cdd:pfam09606 351 TWNPGNFGGLGANPMQRGQPGMMSSPSPVPGQQVRQVTPNqfmRQSPQPSVPSPQGPGSQPPQSHPGGmipsPALIPSPS 430
|
|
| PHA03378 |
PHA03378 |
EBNA-3B; Provisional |
6-221 |
3.19e-05 |
|
EBNA-3B; Provisional
Pssm-ID: 223065 [Multi-domain] Cd Length: 991 Bit Score: 48.14 E-value: 3.19e-05
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 568987261 6 SAPPVPPYGQNQPiypgyhQSSYGGQPGPAAP---ATPYGAYNGPVPGYQQAP-----PQGVP-RAPPSSGAPPASAaqv 76
Cdd:PHA03378 705 RPPAAPPGRAQRP------AAATGRARPPAAApgrARPPAAAPGRARPPAAAPgrarpPAAAPgRARPPAAAPGAPT--- 775
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 568987261 77 PCGQTTYGQFGQGDIQNGPSSTAQMQRVPGSQQFGPPLAPVVSQPA-----VLQPYGPPPTSTQVTAQLAGMQISGAVAQ 151
Cdd:PHA03378 776 PQPPPQAPPAPQQRPRGAPTPQPPPQAGPTSMQLMPRAAPGQQGPTkqilrQLLTGGVKRGRPSLKKPAALERQAAAGPT 855
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 568987261 152 APPPSGLG---------YGPPTSLASASGNFPNSGPYGSYPQSQAPplSQAQG--------HPGVQPPLRSAPPLASSFT 214
Cdd:PHA03378 856 PSPGSGTSdkivqapvfYPPVLQPIQVMRQLGSVRAAAASTVTQAP--TEYTGerrgvgpmHPTDIPPSKRAKTDAYVES 933
|
....*..
gi 568987261 215 SPASGGP 221
Cdd:PHA03378 934 QPPHGGQ 940
|
|
| PRK07764 |
PRK07764 |
DNA polymerase III subunits gamma and tau; Validated |
28-184 |
5.10e-05 |
|
DNA polymerase III subunits gamma and tau; Validated
Pssm-ID: 236090 [Multi-domain] Cd Length: 824 Bit Score: 47.67 E-value: 5.10e-05
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 568987261 28 YGGQPGPAAPATPYGAYNGPVPgyqQAPPQGVPRAPPSSGAPPASAAQVPcgqttygqfgqgdiqnGPSSTAQMQRVPGS 107
Cdd:PRK07764 387 VAGGAGAPAAAAPSAAAAAPAA---APAPAAAAPAAAAAPAPAAAPQPAP----------------APAPAPAPPSPAGN 447
|
90 100 110 120 130 140 150
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*..
gi 568987261 108 QQFGPPLAPVVSQPAVLQPyGPPPTSTQVTAQLAGMQISGAVAQAPPPsglgyGPPTSLASASGNFPNSGPYGSYPQ 184
Cdd:PRK07764 448 APAGGAPSPPPAAAPSAQP-APAPAAAPEPTAAPAPAPPAAPAPAAAP-----AAPAAPAAPAGADDAATLRERWPE 518
|
|
| Gag_spuma |
pfam03276 |
Spumavirus gag protein; |
92-241 |
5.18e-05 |
|
Spumavirus gag protein;
Pssm-ID: 460872 [Multi-domain] Cd Length: 614 Bit Score: 47.43 E-value: 5.18e-05
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 568987261 92 QNGPSSTAQMQRVPGSQQFGPPLAPVVSQPAVLQPYGPPPTSTQvtaqlagMQISGAVAQAPPPSGLGYGPPTSLASasg 171
Cdd:pfam03276 175 LAEISPGAQGGIPPGASFSGLPSLPAIGGIHLPAIPGIHARAPP-------GNIARSLGDDIMPSLGDAGMPQPRFA--- 244
|
90 100 110 120 130 140 150
....*....|....*....|....*....|....*....|....*....|....*....|....*....|..
gi 568987261 172 nFPNSGPYGSYPQSqaPPLSQAQGHPGVQP--PLRSApPLASSFTSPASGGPQMPSMTGLLPPGQGFGSLPV 241
Cdd:pfam03276 245 -FHPGNPFAEAEGH--PFAEAEGERPRDIPraPRIDA-PSAPAIPAIQPIAPPMIPPIGAPIPIPHGASIPG 312
|
|
| PHA03247 |
PHA03247 |
large tegument protein UL36; Provisional |
6-310 |
8.77e-05 |
|
large tegument protein UL36; Provisional
Pssm-ID: 223021 [Multi-domain] Cd Length: 3151 Bit Score: 46.86 E-value: 8.77e-05
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 568987261 6 SAPPVPPYGQNQPIYPGYHQSSYG----GQPGPAAPATPYGAYNGPVPGYQQAPPQGVPRAPPSSG---APPASAAQVPC 78
Cdd:PHA03247 2769 PAPPAAPAAGPPRRLTRPAVASLSesreSLPSPWDPADPPAAVLAPAAALPPAASPAGPLPPPTSAqptAPPPPPGPPPP 2848
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 568987261 79 GQTTYGQFGQGdiqnGPSSTAQMQRVPGSQQFGPPLAPV--VSQPAVLQPYGP---PPTSTQVTAQLAGMQISGAVAQAP 153
Cdd:PHA03247 2849 SLPLGGSVAPG----GDVRRRPPSRSPAAKPAAPARPPVrrLARPAVSRSTESfalPPDQPERPPQPQAPPPPQPQPQPP 2924
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 568987261 154 PPSGLGYGPPTSLASASGNFPNSGPYG-SYPQSQAPPLSQAQGHPG-VQPPLRSAPPLASSFTSPAsggPQMPSMTGLLP 231
Cdd:PHA03247 2925 PPPQPQPPPPPPPRPQPPLAPTTDPAGaGEPSGAVPQPWLGALVPGrVAVPRFRVPQPAPSREAPA---SSTPPLTGHSL 3001
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 568987261 232 PGqgFGSLPVNQANHVSSPPAPALPPGTQMTGPPVPPPPPMHSPQQPGYQLQQnGSFGPARGPQ--PNYESPYPGAPTFG 309
Cdd:PHA03247 3002 SR--VSSWASSLALHEETDPPPVSLKQTLWPPDDTEDSDADSLFDSDSERSDL-EALDPLPPEPhdPFAHEPDPATPEAG 3078
|
.
gi 568987261 310 S 310
Cdd:PHA03247 3079 A 3079
|
|
| PHA03378 |
PHA03378 |
EBNA-3B; Provisional |
11-234 |
9.76e-05 |
|
EBNA-3B; Provisional
Pssm-ID: 223065 [Multi-domain] Cd Length: 991 Bit Score: 46.60 E-value: 9.76e-05
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 568987261 11 PPYGQNQPIYPGYHQSSY---GGQPGPAAPATPYGAYNGPVPGYQQAPPQGVPRAP-------------PSSGAPPAsaA 74
Cdd:PHA03378 580 PTTSQLASSAPSYAQTPWpvpHPSQTPEPPTTQSHIPETSAPRQWPMPLRPIPMRPlrmqpitfnvlvfPTPHQPPQ--V 657
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 568987261 75 QVPCGQTTYGQFGQGDIQNGPSSTAQMQRV---PGSQQfGPPLAPVVSQPavlqPYGPPPTSTQVTAQLAGMQISGAV-- 149
Cdd:PHA03378 658 EITPYKPTWTQIGHIPYQPSPTGANTMLPIqwaPGTMQ-PPPRAPTPMRP----PAAPPGRAQRPAAATGRARPPAAApg 732
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 568987261 150 AQAPPPSGLGYGPPTSLASASGNFPNSGPYGSYPQSQAPPLSQAQGHPGVQPPLRSAPPLASSFTSPASGGPQMPSMTGL 229
Cdd:PHA03378 733 RARPPAAAPGRARPPAAAPGRARPPAAAPGRARPPAAAPGAPTPQPPPQAPPAPQQRPRGAPTPQPPPQAGPTSMQLMPR 812
|
....*
gi 568987261 230 LPPGQ 234
Cdd:PHA03378 813 AAPGQ 817
|
|
| PAT1 |
pfam09770 |
Topoisomerase II-associated protein PAT1; Members of this family are necessary for accurate ... |
31-194 |
1.33e-04 |
|
Topoisomerase II-associated protein PAT1; Members of this family are necessary for accurate chromosome transmission during cell division.
Pssm-ID: 401645 [Multi-domain] Cd Length: 846 Bit Score: 46.18 E-value: 1.33e-04
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 568987261 31 QPGPAAPATPYGAYNGPVPGYQQAPPQgvprappssgappASAAQVPCGQTTYGQFGQGdiQNGPSStaQMQRvPGSQQF 110
Cdd:pfam09770 213 QPAPAPAQPPAAPPAQQAQQQQQFPPQ-------------IQQQQQPQQQPQQPQQHPG--QGHPVT--ILQR-PQSPQP 274
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 568987261 111 GPPlAPVVSQPAVLQPYGPPPTSTQVTAQLAGMQ-ISGAVAQAP--PPSGLGYGPPTSLASASGNFPNSGPYGSYPQsQA 187
Cdd:pfam09770 275 DPA-QPSIQPQAQQFHQQPPPVPVQPTQILQNPNrLSAARVGYPqnPQPGVQPAPAHQAHRQQGSFGRQAPIITHPQ-QL 352
|
....*..
gi 568987261 188 PPLSQAQ 194
Cdd:pfam09770 353 AQLSEEE 359
|
|
| PABP-1234 |
TIGR01628 |
polyadenylate binding protein, human types 1, 2, 3, 4 family; These eukaryotic proteins ... |
1-118 |
2.17e-04 |
|
polyadenylate binding protein, human types 1, 2, 3, 4 family; These eukaryotic proteins recognize the poly-A of mRNA and consist of four tandem RNA recognition domains at the N-terminus (rrm: pfam00076) followed by a PABP-specific domain (pfam00658) at the C-terminus. The protein is involved in the transport of mRNAs from the nucleus to the cytoplasm. There are four paralogs in Homo sapiens: one expressed in testis, one in platelets, one broadly expressed, and one of unknown tissue range.
Pssm-ID: 130689 [Multi-domain] Cd Length: 562 Bit Score: 45.18 E-value: 2.17e-04
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 568987261 1 MNVNQSAPPVP---PYGQNQPiYPGYHqssyGGQPGPAAPAtPYGAYNGPVPGYQQA----PPQGVPRAPPSSGAPPASA 73
Cdd:TIGR01628 403 QGPQQQFNGQPlgwPRMSMMP-TPMGP----GGPLRPNGLA-PMNAVRAPSRNAQNAaqkpPMQPVMYPPNYQSLPLSQD 476
|
90 100 110 120
....*....|....*....|....*....|....*....|....*..
gi 568987261 74 AQVPcgQTTYGQFGQGD--IQNGPSSTAQMQRvpgsQQFGPPLAPVV 118
Cdd:TIGR01628 477 LPQP--QSTASQGGQNKklAQVLASATPQMQK----QVLGERLFPLV 517
|
|
| PRK10263 |
PRK10263 |
DNA translocase FtsK; Provisional |
11-209 |
2.84e-04 |
|
DNA translocase FtsK; Provisional
Pssm-ID: 236669 [Multi-domain] Cd Length: 1355 Bit Score: 45.08 E-value: 2.84e-04
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 568987261 11 PPYGQNQPIYPGyhQSSYGGQPGPAAPATPYGAYNGPVPGYQQAPPqgVPRAPPSSGAPPASAAQVPCGQT--------- 81
Cdd:PRK10263 302 PEYDEYDPLLNG--APITEPVAVAAAATTATQSWAAPVEPVTQTPP--VASVDVPPAQPTVAWQPVPGPQTgepviapap 377
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 568987261 82 -TYGQFGQGDIQNGPSSTAQMQRVPGSQQFGPPLAPVVSQPAVLQPYGPPPTSTQVTAQLAGMQISGAVAQAPPPSGLGY 160
Cdd:PRK10263 378 eGYPQQSQYAQPAVQYNEPLQQPVQPQQPYYAPAAEQPAQQPYYAPAPEQPAQQPYYAPAPEQPVAGNAWQAEEQQSTFA 457
|
170 180 190 200
....*....|....*....|....*....|....*....|....*....
gi 568987261 161 GPPTSLASASGNFPNSGPYGSYPQSQAPPLSQAQGHPGVQPPLRSAPPL 209
Cdd:PRK10263 458 PQSTYQTEQTYQQPAAQEPLYQQPQPVEQQPVVEPEPVVEETKPARPPL 506
|
|
| Atrophin-1 |
pfam03154 |
Atrophin-1 family; Atrophin-1 is the protein product of the dentatorubral-pallidoluysian ... |
119-306 |
2.94e-04 |
|
Atrophin-1 family; Atrophin-1 is the protein product of the dentatorubral-pallidoluysian atrophy (DRPLA) gene. DRPLA OMIM:125370 is a progressive neurodegenerative disorder. It is caused by the expansion of a CAG repeat in the DRPLA gene on chromosome 12p. This results in an extended polyglutamine region in atrophin-1 that is thought to confer toxicity to the protein, possibly through altering its interactions with other proteins. The expansion of a CAG repeat is also the underlying defect in six other neurodegenerative disorders, including Huntington's disease. One interaction of expanded polyglutamine repeats that is thought to be pathogenic is that with the short glutamine repeat in the transcriptional coactivator CREB binding protein, CBP. This interaction draws CBP away from its usual nuclear location to the expanded polyglutamine repeat protein aggregates that are characteristic of the polyglutamine neurodegenerative disorders. This interferes with CBP-mediated transcription and causes cytotoxicity.
Pssm-ID: 460830 [Multi-domain] Cd Length: 991 Bit Score: 45.14 E-value: 2.94e-04
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 568987261 119 SQPAVLQPYGPPPtstqVTAQLAGMQISGAVAQAPPPSGLGYGPPTSlasasgnfPNSGPYGSYPQSQAPPLSQAQGHPG 198
Cdd:pfam03154 169 TQPPVLQAQSGAA----SPPSPPPPGTTQAATAGPTPSAPSVPPQGS--------PATSQPPNQTQSTAAPHTLIQQTPT 236
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 568987261 199 VQP--------PLRSAPPLASSFTSPASGGPQmPSMTGLLPP-GQGFGSLPVNQANHVSSPPAPALPPGTQMTGPPVPPP 269
Cdd:pfam03154 237 LHPqrlpsphpPLQPMTQPPPPSQVSPQPLPQ-PSLHGQMPPmPHSLQTGPSHMQHPVPPQPFPLTPQSSQSQVPPGPSP 315
|
170 180 190
....*....|....*....|....*....|....*..
gi 568987261 270 PPMHSPQQpgyQLQQNGSFGPARGPQPNYESPYPGAP 306
Cdd:pfam03154 316 AAPGQSQQ---RIHTPPSQSQLQSQQPPREQPLPPAP 349
|
|
| PRK07764 |
PRK07764 |
DNA polymerase III subunits gamma and tau; Validated |
6-156 |
3.60e-04 |
|
DNA polymerase III subunits gamma and tau; Validated
Pssm-ID: 236090 [Multi-domain] Cd Length: 824 Bit Score: 44.59 E-value: 3.60e-04
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 568987261 6 SAPPVPPYGQNQPIYPGYHQSSYGGQPGPAAPATPYGAYNG-PVPGYQQAPPQGVPRAPPSSGAPPASAAQVPcgQTTYG 84
Cdd:PRK07764 634 AAPAEASAAPAPGVAAPEHHPKHVAVPDASDGGDGWPAKAGgAAPAAPPPAPAPAAPAAPAGAAPAQPAPAPA--ATPPA 711
|
90 100 110 120 130 140 150
....*....|....*....|....*....|....*....|....*....|....*....|....*....|..
gi 568987261 85 QFGQGDIQNGPSSTAQMQRVPGSQQFGPPLAPVVSQPAVLQPYGPPPTSTQVTAQLAGmqiSGAVAQAPPPS 156
Cdd:PRK07764 712 GQADDPAAQPPQAAQGASAPSPAADDPVPLPPEPDDPPDPAGAPAQPPPPPAPAPAAA---PAAAPPPSPPS 780
|
|
| Treacle |
pfam03546 |
Treacher Collins syndrome protein Treacle; |
34-221 |
4.76e-04 |
|
Treacher Collins syndrome protein Treacle;
Pssm-ID: 460967 [Multi-domain] Cd Length: 531 Bit Score: 44.29 E-value: 4.76e-04
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 568987261 34 PAAPATPYGAYNGPVPGYQQA-------PPQGVPRAPPSSGAPPASAAQVpcgqttygqfGQGDIQNGPSSTAQmqrvpG 106
Cdd:pfam03546 39 PAAKTPLQAKPSGKTPQVRAAsapakesPRKGAPPVPPGKTGPAAAQAQA----------GKPEEDSESSSEES-----D 103
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 568987261 107 SQQFGPPLAPVVSQPAVLQPYGPPPtstQV-TAQLAGMQISGAVAQAPPPSGLGYGPPTSLASASGNFPNSGPYGSYPQS 185
Cdd:pfam03546 104 SDGETPAAATLTTSPAQVKPLGKNS---QVrPASTVGKGPSGKGANPAPPGKAGSAAPLVQVGKKEEDSESSSEESDSEG 180
|
170 180 190
....*....|....*....|....*....|....*...
gi 568987261 186 QAPPLSQAQGHPGVQPPLRSA--PPLASSFTSPASGGP 221
Cdd:pfam03546 181 EAPPAATQAKPSGKILQVRPAsgPAKGAAPAPPQKAGP 218
|
|
| PRK07003 |
PRK07003 |
DNA polymerase III subunit gamma/tau; |
6-208 |
7.86e-04 |
|
DNA polymerase III subunit gamma/tau;
Pssm-ID: 235906 [Multi-domain] Cd Length: 830 Bit Score: 43.68 E-value: 7.86e-04
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 568987261 6 SAPPVPPYGQNQPIYPGYHQSSYGGQPGPAAPATPYGAYNGPVPGYQQAPPQGVPR---APPSSGAPPASAAQVPCGQTT 82
Cdd:PRK07003 420 ATRAEAPPAAPAPPATADRGDDAADGDAPVPAKANARASADSRCDERDAQPPADSGsasAPASDAPPDAAFEPAPRAAAP 499
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 568987261 83 YGQFGQGDIQNGPSSTAQMQRVPGSQQFGPPLAPVVSQPAVlqpyGPPPTSTQVTAQL-----AGMQIS-----GAVAQA 152
Cdd:PRK07003 500 SAATPAAVPDARAPAAASREDAPAAAAPPAPEARPPTPAAA----APAARAGGAAAALdvlrnAGMRVSsdrgaRAAAAA 575
|
170 180 190 200 210
....*....|....*....|....*....|....*....|....*....|....*....
gi 568987261 153 PPPSGLGYGPPTSLASASGNFPNSGPYGSYPQ---SQAPPLSQAQGHPGVQPPLRSAPP 208
Cdd:PRK07003 576 KPAAAPAAAPKPAAPRVAVQVPTPRARAATGDappNGAARAEQAAESRGAPPPWEDIPP 634
|
|
| PRK14951 |
PRK14951 |
DNA polymerase III subunits gamma and tau; Provisional |
32-173 |
8.61e-04 |
|
DNA polymerase III subunits gamma and tau; Provisional
Pssm-ID: 237865 [Multi-domain] Cd Length: 618 Bit Score: 43.55 E-value: 8.61e-04
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 568987261 32 PGPAAPATPYGAYNGPVPGYQQAP---PQGVPRAPPSSGAPPASAAQVPCGQTTYgqfgqgdIQNGPSSTAQMQRvpgsq 108
Cdd:PRK14951 366 PAAAAEAAAPAEKKTPARPEAAAPaaaPVAQAAAAPAPAAAPAAAASAPAAPPAA-------APPAPVAAPAAAA----- 433
|
90 100 110 120 130 140
....*....|....*....|....*....|....*....|....*....|....*....|....*
gi 568987261 109 qfGPPLAPVVSQPAVLQPYGPPPTSTQVTAQLAGMQISGAVAQAPPPSGLGYGPPTSLASASGNF 173
Cdd:PRK14951 434 --PAAAPAAAPAAVALAPAPPAQAAPETVAIPVRVAPEPAVASAAPAPAAAPAAARLTPTEEGDV 496
|
|
| COG3416 |
COG3416 |
Uncharacterized conserved protein, DUF2076 domain [Function unknown]; |
14-67 |
1.33e-03 |
|
Uncharacterized conserved protein, DUF2076 domain [Function unknown];
Pssm-ID: 442642 [Multi-domain] Cd Length: 237 Bit Score: 41.55 E-value: 1.33e-03
10 20 30 40 50
....*....|....*....|....*....|....*....|....*....|....
gi 568987261 14 GQNQPIYPGYHQSSYGGQPGPAAPATPYGAYNGPVPGYQQaPPQGVPRAPPSSG 67
Cdd:COG3416 91 GGGQRPPPAPQPSQPGPQQQPAPPSGPWGQAAPQQPGYGQ-PQYGQPAAGPSGG 143
|
|
| Gly-rich_Ago1 |
pfam12764 |
Glycine-rich region of argonaut; This domain is often found at the very N-terminal of ... |
9-105 |
1.45e-03 |
|
Glycine-rich region of argonaut; This domain is often found at the very N-terminal of argonaut-like proteins.
Pssm-ID: 463691 [Multi-domain] Cd Length: 103 Bit Score: 39.16 E-value: 1.45e-03
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 568987261 9 PVPPYGQNQPIYPGYHQSSYGGQPGPAAPATPygayngPVPGYQQAPP---QGVPRAPPSSGAPPAS--AAQVPCGQTTY 83
Cdd:pfam12764 8 PRPRGGPPQQYYGGGRGGSGGRGPPSGGPSRP------PVPELHQATQvqyQAVVTQPSPSGAGSSSqpTAEVSTGQVAQ 81
|
90 100
....*....|....*....|..
gi 568987261 84 gQFGQGDIQNGPSSTAQMQRVP 105
Cdd:pfam12764 82 -QFQQLSVQDQSSSSQAIQPAP 102
|
|
| PAT1 |
pfam09770 |
Topoisomerase II-associated protein PAT1; Members of this family are necessary for accurate ... |
47-341 |
1.79e-03 |
|
Topoisomerase II-associated protein PAT1; Members of this family are necessary for accurate chromosome transmission during cell division.
Pssm-ID: 401645 [Multi-domain] Cd Length: 846 Bit Score: 42.33 E-value: 1.79e-03
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 568987261 47 PVPGYQQAPPQGVPRAPPSSGAPPASAAQVPCGQTTYGQFGQ-GDIQngpsSTAQMQRVPGSQQFGPPLAPVVS-QPAVL 124
Cdd:pfam09770 111 AAQSSAQPPASSLPQYQYASQQSQQPSKPVRTGYEKYKEPEPiPDLQ----VDASLWGVAPKKAAAPAPAPQPAaQPASL 186
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 568987261 125 QPYGPPPTSTQ-VTAQLAgMQISGAVAQAPPPSglgYGPPTSLASASGNFPNSGPygsyPQSQAPPLSQAQGHPGVQPPL 203
Cdd:pfam09770 187 PAPSRKMMSLEeVEAAMR-AQAKKPAQQPAPAP---AQPPAAPPAQQAQQQQQFP----PQIQQQQQPQQQPQQPQQHPG 258
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 568987261 204 RSAPPlaSSFTSPASGGPQmPSMTGLLPPGQGFGSLPVNQANHVssppapalppgTQmtgppvpppppmhspqqpgyQLQ 283
Cdd:pfam09770 259 QGHPV--TILQRPQSPQPD-PAQPSIQPQAQQFHQQPPPVPVQP-----------TQ--------------------ILQ 304
|
250 260 270 280 290
....*....|....*....|....*....|....*....|....*....|....*...
gi 568987261 284 QNGSFGPARGPQPNYesPYPGAPTFGSQPGPPQPLPPKRLDPDAIPSPQLNELPPQQK 341
Cdd:pfam09770 305 NPNRLSAARVGYPQN--PQPGVQPAPAHQAHRQQGSFGRQAPIITHPQQLAQLSEEEK 360
|
|
| hnRNP-R-Q |
TIGR01648 |
heterogeneous nuclear ribonucleoprotein R, Q family; Sequences in this subfamily include the ... |
11-201 |
1.82e-03 |
|
heterogeneous nuclear ribonucleoprotein R, Q family; Sequences in this subfamily include the human heterogeneous nuclear ribonucleoproteins (hnRNP) R, Q, and APOBEC-1 complementation factor (aka APOBEC-1 stimulating protein). These proteins contain three RNA recognition domains (rrm: pfam00076) and a somewhat variable C-terminal domain.
Pssm-ID: 273732 [Multi-domain] Cd Length: 578 Bit Score: 42.29 E-value: 1.82e-03
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 568987261 11 PPYGQN--QPIYPGYHQSSyGGQPGPaapatpygaYNGPVPGYQQAPPQGVPRAPPSSGAPPASAAQvpcgqttYGQFGQ 88
Cdd:TIGR01648 387 PPYGYEayYGDYYGYHDYR-GKYEDK---------YYGYDPGMELTPMNPVRGKPGGRGGRPAIPPP-------RGRKNG 449
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 568987261 89 GdiqnGPSSTAQMQRVPGSQQFGPPLApvVSQPAVLQPYGPPPTSTQVTaqlagmqisGAVAQAPPPSGLGYGPPTSlaS 168
Cdd:TIGR01648 450 A----PPPAIGQDGRQLFLYKITIPAG--YSQRPAPHPLGPPRGSAFVR---------GARGGPAQYQQRGRGSRTS--R 512
|
170 180 190
....*....|....*....|....*....|....*...
gi 568987261 169 ASGNFPNSGPYG-SYPQSQAP----PLSQAQGHPGVQP 201
Cdd:TIGR01648 513 GNGRGGTAGGKRkAFDGYAQPdataRQTNNQQNWGAQP 550
|
|
| PHA03247 |
PHA03247 |
large tegument protein UL36; Provisional |
3-218 |
1.85e-03 |
|
large tegument protein UL36; Provisional
Pssm-ID: 223021 [Multi-domain] Cd Length: 3151 Bit Score: 42.62 E-value: 1.85e-03
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 568987261 3 VNQSAPPVPPYGQNQPiypgyhQSSYGGQPGPAAPATPYGAYNGPVPGYQQAPPQGVPRaPPSSGAPPASAAQVPCGQTT 82
Cdd:PHA03247 2886 LARPAVSRSTESFALP------PDQPERPPQPQAPPPPQPQPQPPPPPQPQPPPPPPPR-PQPPLAPTTDPAGAGEPSGA 2958
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 568987261 83 YGQFGQGDIQNGpsstaqmqRVPGSQQFGPPLAPVVSQPAvlqPYGPPPTSTQVTAqLAGMQISGA--VAQAPPPSGL-- 158
Cdd:PHA03247 2959 VPQPWLGALVPG--------RVAVPRFRVPQPAPSREAPA---SSTPPLTGHSLSR-VSSWASSLAlhEETDPPPVSLkq 3026
|
170 180 190 200 210 220
....*....|....*....|....*....|....*....|....*....|....*....|
gi 568987261 159 GYGPPTSLASASGNFPNSGPYGSYPQSQAPPLSQAQGHPGVQPPlRSAPPLASSFTSPAS 218
Cdd:PHA03247 3027 TLWPPDDTEDSDADSLFDSDSERSDLEALDPLPPEPHDPFAHEP-DPATPEAGARESPSS 3085
|
|
| PLN03209 |
PLN03209 |
translocon at the inner envelope of chloroplast subunit 62; Provisional |
29-247 |
2.48e-03 |
|
translocon at the inner envelope of chloroplast subunit 62; Provisional
Pssm-ID: 178748 [Multi-domain] Cd Length: 576 Bit Score: 41.84 E-value: 2.48e-03
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 568987261 29 GGQPGPAAPATPygayNGPVPGYQQAPPQG---VPR-----------APPSSGAP--PASAAQVPCGQTTYGQFGQGDIQ 92
Cdd:PLN03209 338 GPKPVPTKPVTP----EAPSPPIEEEPPQPkavVPRplspytayedlKPPTSPIPtpPSSSPASSKSVDAVAKPAEPDVV 413
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 568987261 93 NGP---SSTAQMQRVPGSQQFGPPLAPVVSQPAVLQPYGPPPTstqvtaqlagmqisgavaqapPPSGLGygPPTSLASA 169
Cdd:PLN03209 414 PSPgsaSNVPEVEPAQVEAKKTRPLSPYARYEDLKPPTSPSPT---------------------APTGVS--PSVSSTSS 470
|
170 180 190 200 210 220 230
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*...
gi 568987261 170 SGNFPNSGPYGSYPQSQAPPLSQAQGHPGVQPPLRSAPPLASSFTSPASGGPQMPSMTGLLPPGQGFGSLPVNQANHV 247
Cdd:PLN03209 471 VPAVPDTAPATAATDAAAPPPANMRPLSPYAVYDDLKPPTSPSPAAPVGKVAPSSTNEVVKVGNSAPPTALADEQHHA 548
|
|
| PRK12323 |
PRK12323 |
DNA polymerase III subunit gamma/tau; |
6-214 |
2.73e-03 |
|
DNA polymerase III subunit gamma/tau;
Pssm-ID: 237057 [Multi-domain] Cd Length: 700 Bit Score: 41.79 E-value: 2.73e-03
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 568987261 6 SAPPVPPYGQNQPIYPGYHQSSYGGQPGPAAPATPygayngpvPGYQQAPPQGVPRAPPSSGAPPASAAQVpcgqttygq 85
Cdd:PRK12323 400 AAPPAAPAAAPAAAAAARAVAAAPARRSPAPEALA--------AARQASARGPGGAPAPAPAPAAAPAAAA--------- 462
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 568987261 86 fgqgdiqngPSSTAQMQRVPGSQQFGPPLAPVVSQPAVlQPYGPPPTStqvtaQLAGMQISGAVAQ---APPPSGLGYGP 162
Cdd:PRK12323 463 ---------RPAAAGPRPVAAAAAAAPARAAPAAAPAP-ADDDPPPWE-----ELPPEFASPAPAQpdaAPAGWVAESIP 527
|
170 180 190 200 210
....*....|....*....|....*....|....*....|....*....|..
gi 568987261 163 PTSLASASGNFPNSGPYGSYPQSQAPPLSQAQGHPGVqPPLRSAPPLASSFT 214
Cdd:PRK12323 528 DPATADPDDAFETLAPAPAAAPAPRAAAATEPVVAPR-PPRASASGLPDMFD 578
|
|
| PRK10263 |
PRK10263 |
DNA translocase FtsK; Provisional |
33-135 |
2.86e-03 |
|
DNA translocase FtsK; Provisional
Pssm-ID: 236669 [Multi-domain] Cd Length: 1355 Bit Score: 41.99 E-value: 2.86e-03
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 568987261 33 GPAAP-----ATPYGAYNGPVPGYQQAPPQGVPRAPPSSGAPPasaaQVPCGQTTYGQFGQGDIQNGPSSTAQMQRVPGS 107
Cdd:PRK10263 739 GPHEPlftpiVEPVQQPQQPVAPQQQYQQPQQPVAPQPQYQQP----QQPVAPQPQYQQPQQPVAPQPQYQQPQQPVAPQ 814
|
90 100
....*....|....*....|....*...
gi 568987261 108 QQFGPPLAPVVSQPAVLQPYGPPPTSTQ 135
Cdd:PRK10263 815 PQYQQPQQPVAPQPQYQQPQQPVAPQPQ 842
|
|
| PRK07764 |
PRK07764 |
DNA polymerase III subunits gamma and tau; Validated |
59-219 |
3.88e-03 |
|
DNA polymerase III subunits gamma and tau; Validated
Pssm-ID: 236090 [Multi-domain] Cd Length: 824 Bit Score: 41.51 E-value: 3.88e-03
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 568987261 59 VPRAPPSSGAPPASAAQVPCGQTTYGQfgqgdiqnGPSSTAQmqrvPGSQQFGPPlAPVVSQPAVlQPYGPPPTSTQVTA 138
Cdd:PRK07764 364 LPSASDDERGLLARLERLERRLGVAGG--------AGAPAAA----APSAAAAAP-AAAPAPAAA-APAAAAAPAPAAAP 429
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 568987261 139 QLAGMQISG-AVAQAPPPSGLGYGPPTSLASASGNFPNSGPYGSYPQSQAPPLSQAQGHPGVQPPLRSAPPLASSFTSPA 217
Cdd:PRK07764 430 QPAPAPAPApAPPSPAGNAPAGGAPSPPPAAAPSAQPAPAPAAAPEPTAAPAPAPPAAPAPAAAPAAPAAPAAPAGADDA 509
|
..
gi 568987261 218 SG 219
Cdd:PRK07764 510 AT 511
|
|
| PBP1 |
COG5180 |
PAB1-binding protein, interacts with poly(A)-binding protein [RNA processing and modification]; ... |
3-240 |
5.85e-03 |
|
PAB1-binding protein, interacts with poly(A)-binding protein [RNA processing and modification];
Pssm-ID: 444064 [Multi-domain] Cd Length: 548 Bit Score: 40.82 E-value: 5.85e-03
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 568987261 3 VNQSAPPVPPYGQNQPiypgyhQSSYGGQPGPAAPATPYGAYNGPVPGYQQAPPQGVPRAP-PSSGAPPASAAQVPCGQT 81
Cdd:COG5180 199 LDRPKVEVKDEAQEEP------PDLTGGADHPRPEAASSPKVDPPSTSEARSRPATVDAQPeMRPPADAKERRRAAIGDT 272
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 568987261 82 TYGQF-GQGDIQNGP--------SSTAQMQRVPGSQQFGPPLAPVVSQPAVLQPYGPPPTstQVTAQLAGMQISGAVAQA 152
Cdd:COG5180 273 PAAEPpGLPVLEAGSepqsdapeAETARPIDVKGVASAPPATRPVRPPGGARDPGTPRPG--QPTERPAGVPEAASDAGQ 350
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 568987261 153 PP-----PSGLGYGPPTSLASASGNFPNSGPYGSYPQSQAPPlsQAQGHPGVQPPLRSAPPLAssftsPASGGPQMPSMT 227
Cdd:COG5180 351 PPsayppAEEAVPGKPLEQGAPRPGSSGGDGAPFQPPNGAPQ--PGLGRRGAPGPPMGAGDLV-----QAALDGGGRETA 423
|
250
....*....|...
gi 568987261 228 GLLPPGQGFGSLP 240
Cdd:COG5180 424 SLGGAAGGAGQGP 436
|
|
| PRK12323 |
PRK12323 |
DNA polymerase III subunit gamma/tau; |
5-174 |
5.98e-03 |
|
DNA polymerase III subunit gamma/tau;
Pssm-ID: 237057 [Multi-domain] Cd Length: 700 Bit Score: 40.63 E-value: 5.98e-03
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 568987261 5 QSAPPVPPYGQNQPIYPGYHQSSYGGQPGPAAPATPygaynGPVPGYQQAPPQGVPRAPPSSgAPPASAAQVPCGQTTYG 84
Cdd:PRK12323 419 VAAAPARRSPAPEALAAARQASARGPGGAPAPAPAP-----AAAPAAAARPAAAGPRPVAAA-AAAAPARAAPAAAPAPA 492
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 568987261 85 QFGQGDIQNGPSSTAQMQRVPGSQQFGPPLAPVVSQPAVLQPYGPPPTSTQVTAQLAGMQISGAVAQAPPPSGLGYGPPT 164
Cdd:PRK12323 493 DDDPPPWEELPPEFASPAPAQPDAAPAGWVAESIPDPATADPDDAFETLAPAPAAAPAPRAAAATEPVVAPRPPRASASG 572
|
170
....*....|
gi 568987261 165 SLASASGNFP 174
Cdd:PRK12323 573 LPDMFDGDWP 582
|
|
| dnaA |
PRK14086 |
chromosomal replication initiator protein DnaA; |
3-221 |
7.26e-03 |
|
chromosomal replication initiator protein DnaA;
Pssm-ID: 237605 [Multi-domain] Cd Length: 617 Bit Score: 40.19 E-value: 7.26e-03
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 568987261 3 VNQSAPPVPPYGQNQPIYPGYHQSSYGGQP-----GPAAPATPYGAYNGPV-PGYQQAPPQGVPRAPPssGAPPASAAQV 76
Cdd:PRK14086 88 VDPSAGEPAPPPPHARRTSEPELPRPGRRPyegygGPRADDRPPGLPRQDQlPTARPAYPAYQQRPEP--GAWPRAADDY 165
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 568987261 77 PCGQTTYGqFGQGDIQNGPSSTAqmqrvPGSQQFGPPlaPVVSQPAVLQPYGPPptstqvtaqlagmqiSGAVAQAPPPS 156
Cdd:PRK14086 166 GWQQQRLG-FPPRAPYASPASYA-----PEQERDREP--YDAGRPEYDQRRRDY---------------DHPRPDWDRPR 222
|
170 180 190 200 210 220
....*....|....*....|....*....|....*....|....*....|....*....|....*
gi 568987261 157 GLGYGPPTSlASASGNFPNSGPYGSYPQSqAPPLSQAQGHPGvqpplrsaPPLASSFTSPASGGP 221
Cdd:PRK14086 223 RDRTDRPEP-PPGAGHVHRGGPGPPERDD-APVVPIRPSAPG--------PLAAQPAPAPGPGEP 277
|
|
|