NCBI Home Page NCBI Site Search page NCBI Guide that lists and describes the NCBI resources
Conserved domains on  [gi|1039018156|gb|ANM67456|]
View 

Sec23/Sec24 protein transport family protein [Arabidopsis thaliana]

Protein Classification

vWA domain-containing protein( domain architecture ID 13416197)

vWA (von Willebrand factor type A) domain-containing protein may be involved in one of a wide variety of important cellular functions, including basal membrane formation, cell migration, cell differentiation, adhesion, haemostasis, signaling, chromosomal stability, malignant transformation and immune defenses

Graphical summary

 Zoom to residue level

show extra options »

Show site features     Horizontal zoom: ×

List of domain hits

Name Accession Description Interval E-value
trunk_domain cd01468
trunk domain. COPII-coated vesicles carry proteins from the endoplasmic reticulum to the Golgi ...
275-518 1.54e-68

trunk domain. COPII-coated vesicles carry proteins from the endoplasmic reticulum to the Golgi complex. This vesicular transport can be reconstituted by using three cytosolic components containing five proteins: the small GTPase Sar1p, the Sec23p/24p complex, and the Sec13p/Sec31p complex. This domain is known as the trunk domain and has an alpha/beta vWA fold and forms the dimer interface. Some members of this family possess a partial MIDAS motif that is a characteristic feature of most vWA domain proteins.


:

Pssm-ID: 238745 [Multi-domain]  Cd Length: 239  Bit Score: 227.51  E-value: 1.54e-68
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039018156 275 DSRTSAPVVLVIDECLDEPHLQHLQSSLHAFVDSLP--QTTRLGIILYGRTVSIYDFSEDSvasadviSGAKSPSAESMK 352
Cdd:cd01468     2 QPPVFVFVIDVSYEAIKEGLLQALKESLLASLDLLPgdPRARVGLITYDSTVHFYNLSSDL-------AQPKMYVVSDLK 74
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039018156 353 AL-IYGTGVYLSPMHASLKVAHEIFSSLRPYTLNVPEASRDRCLGTAVEAALAIIQGPSAemsrgvvrraggNSRIIVCA 431
Cdd:cd01468    75 DVfLPLPDRFLVPLSECKKVIHDLLEQLPPMFWPVPTHRPERCLGPALQAAFLLLKGTFA------------GGRIIVFQ 142
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039018156 432 GGPITYGPGSVPHSMSHPNYP-----YMEKTAIKWMENLGREAHRHNTVVDILCAGTCPLRVPILQPLAKASGGVLVLHD 506
Cdd:cd01468   143 GGLPTVGPGKLKSREDKEPIRshdeaQLLKPATKFYKSLAKECVKSGICVDLFAFSLDYVDVATLKQLAKSTGGQVYLYD 222
                         250
                  ....*....|....*..
gi 1039018156 507 DF-----GEAFGVDLQR 518
Cdd:cd01468   223 SFqapndGSKFKQDLQR 239
SEC23 super family cl34880
Vesicle coat complex COPII, subunit SEC23 [Intracellular trafficking and secretion];
196-826 2.55e-49

Vesicle coat complex COPII, subunit SEC23 [Intracellular trafficking and secretion];


The actual alignment was detected with superfamily member COG5047:

Pssm-ID: 227380 [Multi-domain]  Cd Length: 755  Bit Score: 187.01  E-value: 2.55e-49
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039018156 196 IIQRDPHRCL-NCGAYSNPYSSILIGSGQWQCVICENMNGSKGEYVASSKNELQnfPELSLPLVDYVQTGNKRPGFVPAs 274
Cdd:COG5047    48 VNYYEPVKCTaPCKAVLNPYCHIDERNQSWICPFCNQRNTLPPQYRDISNANLP--LELLPQSSTIEYTLSKPVILPPV- 124
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039018156 275 dsrtsapVVLVIDECLDEPHLQHLQSSLHAFVDSLPQTTRLGIILYGRTVSIYDFSEDSVASADVISGAKSPSAESMKAL 354
Cdd:COG5047   125 -------FFFVVDACCDEEELTALKDSLIVSLSLLPPEALVGLITYGTSIQVHELNAENHRRSYVFSGNKEYTKENLQEL 197
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039018156 355 ---------------IYGTGV-----YLSPMHASLKVAHEIFSSLRPYTLNVPEASRD-RCLGTAVEAALAIIQGPSAEM 413
Cdd:COG5047   198 lalskptksggfeskISGIGQfassrFLLPTQQCEFKLLNILEQLQPDPWPVPAGKRPlRCTGSALNIASSLLEQCFPNA 277
                         250       260       270       280       290       300       310       320
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039018156 414 SrgvvrraggnSRIIVCAGGPITYGPGSV----------PHSMSHPNYPYMEKTAIKWMENLGREAHRHNTVVDILCAgt 483
Cdd:COG5047   278 G----------CHIVLFAGGPCTVGPGTVvstelkepmrSHHDIESDSAQHSKKATKFYKGLAERVANQGHALDIFAG-- 345
                         330       340       350       360       370       380       390       400
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039018156 484 CPLRVPILQ--PLAKASGGVLVLHDDFGEA-FGVDLQRAATR------AAGSHGLLEVRCSDDILITQVIGPG----EEA 550
Cdd:COG5047   346 CLDQIGIMEmePLTTSTGGALVLSDSFTTSiFKQSFQRIFNRdsegylKMGFNANMEVKTSKNLKIKGLIGHAvsvkKKA 425
                         410       420       430       440       450       460       470       480
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039018156 551 HSETHETFKSDAALSIQMLSVEETQSFSLSMENKRDIKSDHV------FFQFAFHYSDVYQADVSRVITFKLPTVDSISA 624
Cdd:COG5047   426 NNISDSEIGIGATNSWKMASLSPKSNYALYFEIALGAASGSAqrpaeaYIQFITTYQHSSGTYRIRVTTVARMFTDGGLP 505
                         490       500       510       520       530       540       550       560
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039018156 625 YLQSVEDEASAVLISKRTLLLAKNQKDAVDMRATVDERIKDIALKFgSQVPKSKLYSF--PKELSSLPELLFHLRRGPLL 702
Cdd:COG5047   506 KINRSFDQEAAAVFMARIAAFKAETEDIIDVFRWIDRNLIRLCQKF-ADYRKDDPSSFrlDPNFTLYPQFMYHLRRSPFL 584
                         570       580       590       600       610       620       630       640
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039018156 703 GNIIGHEDERSVLRNLFLNASFDLSLRMVAP--RCLMHQEGGTFEELPAydLSMQSDKAVILD--------HGTDVFIWL 772
Cdd:COG5047   585 SVFNNSPDETAFYRHMLNNADVNDSLIMIQPtlQSYSFEKGGVPVLLDS--VSVKPDVILLLDtffhilifHGSYIAQWR 662
                         650       660       670       680       690
                  ....*....|....*....|....*....|....*....|....*....|....*.
gi 1039018156 773 GA--ELSADEVKSAAVLAACRTLAEELTEFRFPAPRILAFKEGSSQARYFVCRLIP 826
Cdd:COG5047   663 NAgyQEQPEYLNLKELLEAPRLEAAELLQDRFPIPRFIVTEQGGSQARFLLSKINP 718
PHA03247 super family cl33720
large tegument protein UL36; Provisional
17-128 5.66e-04

large tegument protein UL36; Provisional


The actual alignment was detected with superfamily member PHA03247:

Pssm-ID: 223021 [Multi-domain]  Cd Length: 3151  Bit Score: 43.77  E-value: 5.66e-04
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039018156   17 PLEPNRPSPQPDRTPVPHSPPVVAS-------PIPPRFPQPSFRPDQMSSPSMKSPSLLSPANGIRTGSPIPRLS--TPP 87
Cdd:PHA03247  2670 LGRAAQASSPPQRPRRRAARPTVGSltsladpPPPPPTPEPAPHALVSATPLPPGPAAARQASPALPAAPAPPAVpaGPA 2749
                           90       100       110       120
                   ....*....|....*....|....*....|....*....|....*...
gi 1039018156   88 GPPVFNTPVKPAA-------VPFRTSPATPQPMAYSSANSSLPVSTPS 128
Cdd:PHA03247  2750 TPGGPARPARPPTtagppapAPPAAPAAGPPRRLTRPAVASLSESRES 2797
 
Name Accession Description Interval E-value
trunk_domain cd01468
trunk domain. COPII-coated vesicles carry proteins from the endoplasmic reticulum to the Golgi ...
275-518 1.54e-68

trunk domain. COPII-coated vesicles carry proteins from the endoplasmic reticulum to the Golgi complex. This vesicular transport can be reconstituted by using three cytosolic components containing five proteins: the small GTPase Sar1p, the Sec23p/24p complex, and the Sec13p/Sec31p complex. This domain is known as the trunk domain and has an alpha/beta vWA fold and forms the dimer interface. Some members of this family possess a partial MIDAS motif that is a characteristic feature of most vWA domain proteins.


Pssm-ID: 238745 [Multi-domain]  Cd Length: 239  Bit Score: 227.51  E-value: 1.54e-68
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039018156 275 DSRTSAPVVLVIDECLDEPHLQHLQSSLHAFVDSLP--QTTRLGIILYGRTVSIYDFSEDSvasadviSGAKSPSAESMK 352
Cdd:cd01468     2 QPPVFVFVIDVSYEAIKEGLLQALKESLLASLDLLPgdPRARVGLITYDSTVHFYNLSSDL-------AQPKMYVVSDLK 74
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039018156 353 AL-IYGTGVYLSPMHASLKVAHEIFSSLRPYTLNVPEASRDRCLGTAVEAALAIIQGPSAemsrgvvrraggNSRIIVCA 431
Cdd:cd01468    75 DVfLPLPDRFLVPLSECKKVIHDLLEQLPPMFWPVPTHRPERCLGPALQAAFLLLKGTFA------------GGRIIVFQ 142
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039018156 432 GGPITYGPGSVPHSMSHPNYP-----YMEKTAIKWMENLGREAHRHNTVVDILCAGTCPLRVPILQPLAKASGGVLVLHD 506
Cdd:cd01468   143 GGLPTVGPGKLKSREDKEPIRshdeaQLLKPATKFYKSLAKECVKSGICVDLFAFSLDYVDVATLKQLAKSTGGQVYLYD 222
                         250
                  ....*....|....*..
gi 1039018156 507 DF-----GEAFGVDLQR 518
Cdd:cd01468   223 SFqapndGSKFKQDLQR 239
SEC23 COG5047
Vesicle coat complex COPII, subunit SEC23 [Intracellular trafficking and secretion];
196-826 2.55e-49

Vesicle coat complex COPII, subunit SEC23 [Intracellular trafficking and secretion];


Pssm-ID: 227380 [Multi-domain]  Cd Length: 755  Bit Score: 187.01  E-value: 2.55e-49
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039018156 196 IIQRDPHRCL-NCGAYSNPYSSILIGSGQWQCVICENMNGSKGEYVASSKNELQnfPELSLPLVDYVQTGNKRPGFVPAs 274
Cdd:COG5047    48 VNYYEPVKCTaPCKAVLNPYCHIDERNQSWICPFCNQRNTLPPQYRDISNANLP--LELLPQSSTIEYTLSKPVILPPV- 124
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039018156 275 dsrtsapVVLVIDECLDEPHLQHLQSSLHAFVDSLPQTTRLGIILYGRTVSIYDFSEDSVASADVISGAKSPSAESMKAL 354
Cdd:COG5047   125 -------FFFVVDACCDEEELTALKDSLIVSLSLLPPEALVGLITYGTSIQVHELNAENHRRSYVFSGNKEYTKENLQEL 197
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039018156 355 ---------------IYGTGV-----YLSPMHASLKVAHEIFSSLRPYTLNVPEASRD-RCLGTAVEAALAIIQGPSAEM 413
Cdd:COG5047   198 lalskptksggfeskISGIGQfassrFLLPTQQCEFKLLNILEQLQPDPWPVPAGKRPlRCTGSALNIASSLLEQCFPNA 277
                         250       260       270       280       290       300       310       320
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039018156 414 SrgvvrraggnSRIIVCAGGPITYGPGSV----------PHSMSHPNYPYMEKTAIKWMENLGREAHRHNTVVDILCAgt 483
Cdd:COG5047   278 G----------CHIVLFAGGPCTVGPGTVvstelkepmrSHHDIESDSAQHSKKATKFYKGLAERVANQGHALDIFAG-- 345
                         330       340       350       360       370       380       390       400
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039018156 484 CPLRVPILQ--PLAKASGGVLVLHDDFGEA-FGVDLQRAATR------AAGSHGLLEVRCSDDILITQVIGPG----EEA 550
Cdd:COG5047   346 CLDQIGIMEmePLTTSTGGALVLSDSFTTSiFKQSFQRIFNRdsegylKMGFNANMEVKTSKNLKIKGLIGHAvsvkKKA 425
                         410       420       430       440       450       460       470       480
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039018156 551 HSETHETFKSDAALSIQMLSVEETQSFSLSMENKRDIKSDHV------FFQFAFHYSDVYQADVSRVITFKLPTVDSISA 624
Cdd:COG5047   426 NNISDSEIGIGATNSWKMASLSPKSNYALYFEIALGAASGSAqrpaeaYIQFITTYQHSSGTYRIRVTTVARMFTDGGLP 505
                         490       500       510       520       530       540       550       560
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039018156 625 YLQSVEDEASAVLISKRTLLLAKNQKDAVDMRATVDERIKDIALKFgSQVPKSKLYSF--PKELSSLPELLFHLRRGPLL 702
Cdd:COG5047   506 KINRSFDQEAAAVFMARIAAFKAETEDIIDVFRWIDRNLIRLCQKF-ADYRKDDPSSFrlDPNFTLYPQFMYHLRRSPFL 584
                         570       580       590       600       610       620       630       640
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039018156 703 GNIIGHEDERSVLRNLFLNASFDLSLRMVAP--RCLMHQEGGTFEELPAydLSMQSDKAVILD--------HGTDVFIWL 772
Cdd:COG5047   585 SVFNNSPDETAFYRHMLNNADVNDSLIMIQPtlQSYSFEKGGVPVLLDS--VSVKPDVILLLDtffhilifHGSYIAQWR 662
                         650       660       670       680       690
                  ....*....|....*....|....*....|....*....|....*....|....*.
gi 1039018156 773 GA--ELSADEVKSAAVLAACRTLAEELTEFRFPAPRILAFKEGSSQARYFVCRLIP 826
Cdd:COG5047   663 NAgyQEQPEYLNLKELLEAPRLEAAELLQDRFPIPRFIVTEQGGSQARFLLSKINP 718
PLN00162 PLN00162
transport protein sec23; Provisional
180-826 1.07e-33

transport protein sec23; Provisional


Pssm-ID: 215083 [Multi-domain]  Cd Length: 761  Bit Score: 138.92  E-value: 1.07e-33
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039018156 180 FGAIVSAGREISPGPqIIQRDPHRCLNCGAYSNPYSSILIGSGQWQCVICENMNGSKGEYVASSKN----ELqnFPELSL 255
Cdd:PLN00162   33 LAALYTPLKPLPELP-VLPYDPLRCRTCRAVLNPYCRVDFQAKIWICPFCFQRNHFPPHYSSISETnlpaEL--FPQYTT 109
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039018156 256 plVDYVQTgnkrpgfvPASDSRTSAPV-VLVIDECLDEPHLQHLQSSLHAFVDSLPQTTRLGIILYGRTVSIYDFSEDSV 334
Cdd:PLN00162  110 --VEYTLP--------PGSGGAPSPPVfVFVVDTCMIEEELGALKSALLQAIALLPENALVGLITFGTHVHVHELGFSEC 179
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039018156 335 ASADVISGAKSPSAESMKALIygtGVYLSPMHASLKV---AHEIFSS------LRP-----YTLN--VPEASRD------ 392
Cdd:PLN00162  180 SKSYVFRGNKEVSKDQILEQL---GLGGKKRRPAGGGiagARDGLSSsgvnrfLLPaseceFTLNsaLEELQKDpwpvpp 256
                         250       260       270       280       290       300       310       320
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039018156 393 -----RCLGTAVEAALAIIQGPSaemsrgvvrrAGGNSRIIVCAGGPITYGPGS-VPHSMSHP----------NYPYMEK 456
Cdd:PLN00162  257 ghrpaRCTGAALSVAAGLLGACV----------PGTGARIMAFVGGPCTEGPGAiVSKDLSEPirshkdldkdAAPYYKK 326
                         330       340       350       360       370       380       390       400
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039018156 457 tAIKWMENLGREAHRHNTVVDIL-CA----GTCPLRVPILQplakaSGGVLVLHDDFG-EAFGVDLQRAATRAAG----- 525
Cdd:PLN00162  327 -AVKFYEGLAKQLVAQGHVLDVFaCSldqvGVAEMKVAVER-----TGGLVVLAESFGhSVFKDSLRRVFERDGEgslgl 400
                         410       420       430       440       450       460       470       480
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039018156 526 -SHGLLEVRCSDDILITQVIGPGeeahsethetfksdAALSIQMLSVEE-------TQSFSLSMENKR-------DIKSD 590
Cdd:PLN00162  401 sFNGTFEVNCSKDVKVQGAIGPC--------------ASLEKKGPSVSDteigeggTTAWKLCGLDKKtslavffEVANS 466
                         490       500       510       520       530       540       550       560
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039018156 591 HV----------FFQFAFHYSDVYQADVSRVITFKLPTVDSISA--YLQSVEDEASAVLISKRTLLLAKNQkDAVDMRAT 658
Cdd:PLN00162  467 GQsnpqppgqqfFLQFLTRYQHSNGQTRLRVTTVTRRWVEGSSSeeLVAGFDQEAAAVVMARLASHKMETE-EEFDATRW 545
                         570       580       590       600       610       620       630       640
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039018156 659 VDERIKDIALKFGSQV---PKSklYSFPKELSSLPELLFHLRRGPLLGNIIGHEDERSVLRnLFLN-ASFDLSLRMVAPR 734
Cdd:PLN00162  546 LDRALIRLCSKFGDYRkddPSS--FRLSPNFSLYPQFMFNLRRSQFVQVFNNSPDETAYFR-MMLNrENVTNSLVMIQPT 622
                         650       660       670       680       690       700       710       720
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039018156 735 CLMHqeggTFEELPAYDL----SMQSDKAVILD--------HGTDVFIW--LGAELSADEVKSAAVLAACRTLAEELTEF 800
Cdd:PLN00162  623 LISY----SFNGPPEPVLldvaSIAADRILLLDsyfsvvifHGSTIAQWrkAGYHNQPEHEAFAQLLEAPQADAQAIIKE 698
                         730       740
                  ....*....|....*....|....*.
gi 1039018156 801 RFPAPRILAFKEGSSQARYFVCRLIP 826
Cdd:PLN00162  699 RFPVPRLVVCDQHGSQARFLLAKLNP 724
gelsolin_like cd11280
Tandemly repeated domains found in gelsolin, severin, villin, and related proteins; Gelsolin ...
733-824 4.76e-18

Tandemly repeated domains found in gelsolin, severin, villin, and related proteins; Gelsolin repeats occur in gelsolin, severin, villin, advillin, villidin, supervillin, flightless, quail, fragmin, and other proteins, usually in several copies. They co-occur with villin headpiece domains, leucine-rich repeats, and several other domains. These gelsolin-related actin binding proteins (GRABPs) play regulatory roles in the assembly and disassembly of actin filaments; they are involved in F-actin capping, uncapping, severing, or the nucleation of actin filaments. Severing of actin filaments is Ca2+ dependent. Villins are also linked to generating bundles of F-actin with uniform filament polarity, which is most likely mediated by their extra villin headpiece domain. Many family members have also adopted functions in the nucleus, including the regulation of transcription. Supervillin, gelsolin, and flightless I are involved in intracellular signaling via nuclear hormone receptors. The gelsolin-like domain is distantly related to the actin depolymerizing domains found in cofilin and similar proteins.


Pssm-ID: 200436  Cd Length: 88  Bit Score: 79.72  E-value: 4.76e-18
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039018156 733 PRCLMHQEG---GTFEELPAYDLSMQSDKAVILDHGTDVFIWLGAElsadevKSAAVLAACRTLAEELTEFRFPAPRILA 809
Cdd:cd11280     1 PPRLYRVRGskaIEIEEVPLASSSLDSDDVFVLDTGSEIYIWQGRA------SSQAELAAAALLAKELDEERKGKPEIVR 74
                          90
                  ....*....|....*
gi 1039018156 810 FKEGSSqARYFVCRL 824
Cdd:cd11280    75 IRQGQE-PREFWSLF 88
Sec23_BS pfam08033
Sec23/Sec24 beta-sandwich domain;
525-619 1.92e-13

Sec23/Sec24 beta-sandwich domain;


Pssm-ID: 429794 [Multi-domain]  Cd Length: 86  Bit Score: 66.41  E-value: 1.92e-13
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039018156 525 GSHGLLEVRCSDDILITQVIGPGeeahsethetFKSDAALSIQMLSVEETQSFSLSMENKRDIKS-DHVFFQFAFHYSDV 603
Cdd:pfam08033   1 GFNAVLRVRTSKGLKVSGFIGNF----------VSRSSGDTWKLPSLDPDTSYAFEFDIDEPLPNgSNAYIQFALLYTHS 70
                          90
                  ....*....|....*.
gi 1039018156 604 YQADVSRVITFKLPTV 619
Cdd:pfam08033  71 SGERRIRVTTVALPVT 86
Sec23_trunk pfam04811
Sec23/Sec24 trunk domain; COPII-coated vesicles carry proteins from the endoplasmic reticulum ...
283-518 3.36e-08

Sec23/Sec24 trunk domain; COPII-coated vesicles carry proteins from the endoplasmic reticulum to the Golgi complex. This vesicular transport can be reconstituted by using three cytosolic components containing five proteins: the small GTPase Sar1p, the Sec23p/24p complex, and the Sec13p/Sec31p complex. This domain is known as the trunk domain and has an alpha/beta vWA fold and forms the dimer interface.


Pssm-ID: 398467 [Multi-domain]  Cd Length: 241  Bit Score: 55.34  E-value: 3.36e-08
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039018156 283 VLVIDECLD---EPHLQHLQSSLHAFVDSLPQ--TTRLGIILYGRTVSIYDFSEDSVASADVISG----AKSPSAESMka 353
Cdd:pfam04811   7 LFVIDVSYNaikSGLLAALKESLLQSLDLLPGdpRARVGFITFDSTVHFFNLGSSLRQPQMLVVSdlqdMFLPLPDRF-- 84
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039018156 354 liygtgvyLSPMHASLKVAHEIFSSL-RPYTLN-VPEasrdRCLGTAVEAALAIIQGPSAemsrgvvrraGGnsRIIVCA 431
Cdd:pfam04811  85 --------LVPLSECRFVLEDLLEQLpPMFPVTkRPE----RCLGPALQAAFLLLKAAFT----------GG--KIMVFQ 140
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039018156 432 GGPITYGPGSV--------PHSMSHPNYPYMeKTAIKWMENLGREAHRHNTVVDILCAGTCPLRVPILQPLAKASGGVLV 503
Cdd:pfam04811 141 GGLPTVGPGGKlksrldesHHGTDKEKAKLV-KKADKFYKSLAKECVKQGHSVDLFAFSLDYVDVATLGQLSRLTGGQVY 219
                         250       260
                  ....*....|....*....|
gi 1039018156 504 LHDDF-----GEAFGVDLQR 518
Cdd:pfam04811 220 LYPSFqadvdGSKFKQDLQR 239
GEL smart00262
Gelsolin homology domain; Gelsolin/severin/villin homology domain. Calcium-binding and ...
753-814 2.37e-04

Gelsolin homology domain; Gelsolin/severin/villin homology domain. Calcium-binding and actin-binding. Both intra- and extracellular domains.


Pssm-ID: 214590 [Multi-domain]  Cd Length: 90  Bit Score: 40.74  E-value: 2.37e-04
                           10        20        30        40        50        60
                   ....*....|....*....|....*....|....*....|....*....|....*....|...
gi 1039018156  753 SMQSDKAVILDHGTDVFIWLGAELSADEVKSAAvlaacrTLAEEL-TEFRFPAPRILAFKEGS 814
Cdd:smart00262  22 SLNSGDCYILDTGSEIYVWVGKKSSQDEKKKAA------ELAVELdDTLGPGPVQVRVVDEGK 78
PHA03247 PHA03247
large tegument protein UL36; Provisional
17-128 5.66e-04

large tegument protein UL36; Provisional


Pssm-ID: 223021 [Multi-domain]  Cd Length: 3151  Bit Score: 43.77  E-value: 5.66e-04
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039018156   17 PLEPNRPSPQPDRTPVPHSPPVVAS-------PIPPRFPQPSFRPDQMSSPSMKSPSLLSPANGIRTGSPIPRLS--TPP 87
Cdd:PHA03247  2670 LGRAAQASSPPQRPRRRAARPTVGSltsladpPPPPPTPEPAPHALVSATPLPPGPAAARQASPALPAAPAPPAVpaGPA 2749
                           90       100       110       120
                   ....*....|....*....|....*....|....*....|....*...
gi 1039018156   88 GPPVFNTPVKPAA-------VPFRTSPATPQPMAYSSANSSLPVSTPS 128
Cdd:PHA03247  2750 TPGGPARPARPPTtagppapAPPAAPAAGPPRRLTRPAVASLSESRES 2797
Atrophin-1 pfam03154
Atrophin-1 family; Atrophin-1 is the protein product of the dentatorubral-pallidoluysian ...
2-152 1.76e-03

Atrophin-1 family; Atrophin-1 is the protein product of the dentatorubral-pallidoluysian atrophy (DRPLA) gene. DRPLA OMIM:125370 is a progressive neurodegenerative disorder. It is caused by the expansion of a CAG repeat in the DRPLA gene on chromosome 12p. This results in an extended polyglutamine region in atrophin-1, that is thought to confer toxicity to the protein, possibly through altering its interactions with other proteins. The expansion of a CAG repeat is also the underlying defect in six other neurodegenerative disorders, including Huntington's disease. One interaction of expanded polyglutamine repeats that is thought to be pathogenic is that with the short glutamine repeat in the transcriptional coactivator CREB binding protein, CBP. This interaction draws CBP away from its usual nuclear location to the expanded polyglutamine repeat protein aggregates that are characteriztic of the polyglutamine neurodegenerative disorders. This interferes with CBP-mediated transcription and causes cytotoxicity.


Pssm-ID: 460830 [Multi-domain]  Cd Length: 991  Bit Score: 42.06  E-value: 1.76e-03
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039018156   2 ANLPKssvnyPGTLTPLEpNRPSPQPdrtPVPHSPPVVASPIPPRFPQPSFRPDQMSSPSMKspsllsPANGIRTGSPIP 81
Cdd:pfam03154 388 SNLPP-----PPALKPLS-SLSTHHP---PSAHPPPLQLMPQSQQLPPPPAQPPVLTQSQSL------PPPAASHPPTSG 452
                          90       100       110       120       130       140       150
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|..
gi 1039018156  82 RLSTPPGPPVFNTPVKPAAVPFRTSPATPQPMAYSSANSSLPVSTPSFYSNGSSVGS-QRDLPDVVRMEEPI 152
Cdd:pfam03154 453 LHQVPSQSPFPQHPFVPGGPPPITPPSGPPTSTSSAMPGIQPPSSASVSSSGPVPAAvSCPLPPVQIKEEAL 524
PspC_subgroup_2 NF033839
pneumococcal surface protein PspC, LPXTG-anchored form; The pneumococcal surface protein PspC, ...
3-143 7.85e-03

pneumococcal surface protein PspC, LPXTG-anchored form; The pneumococcal surface protein PspC, as described in Streptococcus pneumoniae, is a repetitive and highly variable protein, recognized by a conserved N-terminal domain and also by genomic location. This form, subgroup 2, is anchored covalently after cleavage by sortase at a C-terminal LPXTG site. The other form, subgroup 1, has variable numbers of a choline-binding repeat in the C-terminal region, and is also known as choline-binding protein A.


Pssm-ID: 468202 [Multi-domain]  Cd Length: 557  Bit Score: 39.75  E-value: 7.85e-03
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039018156   3 NLPKSSVNyPGTLTPLEPNRPSPQPDRTPVPHSP--------PVVASPIP-----PRFPQPSFRPDQMSSPSMKSPSLLS 69
Cdd:NF033839  282 DTPKEPGN-KKPSAPKPGMQPSPQPEKKEVKPEPetpkpevkPQLEKPKPevkpqPEKPKPEVKPQLETPKPEVKPQPEK 360
                          90       100       110       120       130       140       150
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....
gi 1039018156  70 PANGIRTGSPIPRLSTPPGPPVFNTPVKPaavpfrtSPATPQPMAYSSANSSLPVSTPSFYSNGSSVGSQRDLP 143
Cdd:NF033839  361 PKPEVKPQPEKPKPEVKPQPETPKPEVKP-------QPEKPKPEVKPQPEKPKPEVKPQPEKPKPEVKPQPEKP 427
 
Name Accession Description Interval E-value
trunk_domain cd01468
trunk domain. COPII-coated vesicles carry proteins from the endoplasmic reticulum to the Golgi ...
275-518 1.54e-68

trunk domain. COPII-coated vesicles carry proteins from the endoplasmic reticulum to the Golgi complex. This vesicular transport can be reconstituted by using three cytosolic components containing five proteins: the small GTPase Sar1p, the Sec23p/24p complex, and the Sec13p/Sec31p complex. This domain is known as the trunk domain and has an alpha/beta vWA fold and forms the dimer interface. Some members of this family possess a partial MIDAS motif that is a characteristic feature of most vWA domain proteins.


Pssm-ID: 238745 [Multi-domain]  Cd Length: 239  Bit Score: 227.51  E-value: 1.54e-68
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039018156 275 DSRTSAPVVLVIDECLDEPHLQHLQSSLHAFVDSLP--QTTRLGIILYGRTVSIYDFSEDSvasadviSGAKSPSAESMK 352
Cdd:cd01468     2 QPPVFVFVIDVSYEAIKEGLLQALKESLLASLDLLPgdPRARVGLITYDSTVHFYNLSSDL-------AQPKMYVVSDLK 74
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039018156 353 AL-IYGTGVYLSPMHASLKVAHEIFSSLRPYTLNVPEASRDRCLGTAVEAALAIIQGPSAemsrgvvrraggNSRIIVCA 431
Cdd:cd01468    75 DVfLPLPDRFLVPLSECKKVIHDLLEQLPPMFWPVPTHRPERCLGPALQAAFLLLKGTFA------------GGRIIVFQ 142
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039018156 432 GGPITYGPGSVPHSMSHPNYP-----YMEKTAIKWMENLGREAHRHNTVVDILCAGTCPLRVPILQPLAKASGGVLVLHD 506
Cdd:cd01468   143 GGLPTVGPGKLKSREDKEPIRshdeaQLLKPATKFYKSLAKECVKSGICVDLFAFSLDYVDVATLKQLAKSTGGQVYLYD 222
                         250
                  ....*....|....*..
gi 1039018156 507 DF-----GEAFGVDLQR 518
Cdd:cd01468   223 SFqapndGSKFKQDLQR 239
SEC23 COG5047
Vesicle coat complex COPII, subunit SEC23 [Intracellular trafficking and secretion];
196-826 2.55e-49

Vesicle coat complex COPII, subunit SEC23 [Intracellular trafficking and secretion];


Pssm-ID: 227380 [Multi-domain]  Cd Length: 755  Bit Score: 187.01  E-value: 2.55e-49
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039018156 196 IIQRDPHRCL-NCGAYSNPYSSILIGSGQWQCVICENMNGSKGEYVASSKNELQnfPELSLPLVDYVQTGNKRPGFVPAs 274
Cdd:COG5047    48 VNYYEPVKCTaPCKAVLNPYCHIDERNQSWICPFCNQRNTLPPQYRDISNANLP--LELLPQSSTIEYTLSKPVILPPV- 124
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039018156 275 dsrtsapVVLVIDECLDEPHLQHLQSSLHAFVDSLPQTTRLGIILYGRTVSIYDFSEDSVASADVISGAKSPSAESMKAL 354
Cdd:COG5047   125 -------FFFVVDACCDEEELTALKDSLIVSLSLLPPEALVGLITYGTSIQVHELNAENHRRSYVFSGNKEYTKENLQEL 197
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039018156 355 ---------------IYGTGV-----YLSPMHASLKVAHEIFSSLRPYTLNVPEASRD-RCLGTAVEAALAIIQGPSAEM 413
Cdd:COG5047   198 lalskptksggfeskISGIGQfassrFLLPTQQCEFKLLNILEQLQPDPWPVPAGKRPlRCTGSALNIASSLLEQCFPNA 277
                         250       260       270       280       290       300       310       320
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039018156 414 SrgvvrraggnSRIIVCAGGPITYGPGSV----------PHSMSHPNYPYMEKTAIKWMENLGREAHRHNTVVDILCAgt 483
Cdd:COG5047   278 G----------CHIVLFAGGPCTVGPGTVvstelkepmrSHHDIESDSAQHSKKATKFYKGLAERVANQGHALDIFAG-- 345
                         330       340       350       360       370       380       390       400
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039018156 484 CPLRVPILQ--PLAKASGGVLVLHDDFGEA-FGVDLQRAATR------AAGSHGLLEVRCSDDILITQVIGPG----EEA 550
Cdd:COG5047   346 CLDQIGIMEmePLTTSTGGALVLSDSFTTSiFKQSFQRIFNRdsegylKMGFNANMEVKTSKNLKIKGLIGHAvsvkKKA 425
                         410       420       430       440       450       460       470       480
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039018156 551 HSETHETFKSDAALSIQMLSVEETQSFSLSMENKRDIKSDHV------FFQFAFHYSDVYQADVSRVITFKLPTVDSISA 624
Cdd:COG5047   426 NNISDSEIGIGATNSWKMASLSPKSNYALYFEIALGAASGSAqrpaeaYIQFITTYQHSSGTYRIRVTTVARMFTDGGLP 505
                         490       500       510       520       530       540       550       560
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039018156 625 YLQSVEDEASAVLISKRTLLLAKNQKDAVDMRATVDERIKDIALKFgSQVPKSKLYSF--PKELSSLPELLFHLRRGPLL 702
Cdd:COG5047   506 KINRSFDQEAAAVFMARIAAFKAETEDIIDVFRWIDRNLIRLCQKF-ADYRKDDPSSFrlDPNFTLYPQFMYHLRRSPFL 584
                         570       580       590       600       610       620       630       640
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039018156 703 GNIIGHEDERSVLRNLFLNASFDLSLRMVAP--RCLMHQEGGTFEELPAydLSMQSDKAVILD--------HGTDVFIWL 772
Cdd:COG5047   585 SVFNNSPDETAFYRHMLNNADVNDSLIMIQPtlQSYSFEKGGVPVLLDS--VSVKPDVILLLDtffhilifHGSYIAQWR 662
                         650       660       670       680       690
                  ....*....|....*....|....*....|....*....|....*....|....*.
gi 1039018156 773 GA--ELSADEVKSAAVLAACRTLAEELTEFRFPAPRILAFKEGSSQARYFVCRLIP 826
Cdd:COG5047   663 NAgyQEQPEYLNLKELLEAPRLEAAELLQDRFPIPRFIVTEQGGSQARFLLSKINP 718
PLN00162 PLN00162
transport protein sec23; Provisional
180-826 1.07e-33

transport protein sec23; Provisional


Pssm-ID: 215083 [Multi-domain]  Cd Length: 761  Bit Score: 138.92  E-value: 1.07e-33
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039018156 180 FGAIVSAGREISPGPqIIQRDPHRCLNCGAYSNPYSSILIGSGQWQCVICENMNGSKGEYVASSKN----ELqnFPELSL 255
Cdd:PLN00162   33 LAALYTPLKPLPELP-VLPYDPLRCRTCRAVLNPYCRVDFQAKIWICPFCFQRNHFPPHYSSISETnlpaEL--FPQYTT 109
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039018156 256 plVDYVQTgnkrpgfvPASDSRTSAPV-VLVIDECLDEPHLQHLQSSLHAFVDSLPQTTRLGIILYGRTVSIYDFSEDSV 334
Cdd:PLN00162  110 --VEYTLP--------PGSGGAPSPPVfVFVVDTCMIEEELGALKSALLQAIALLPENALVGLITFGTHVHVHELGFSEC 179
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039018156 335 ASADVISGAKSPSAESMKALIygtGVYLSPMHASLKV---AHEIFSS------LRP-----YTLN--VPEASRD------ 392
Cdd:PLN00162  180 SKSYVFRGNKEVSKDQILEQL---GLGGKKRRPAGGGiagARDGLSSsgvnrfLLPaseceFTLNsaLEELQKDpwpvpp 256
                         250       260       270       280       290       300       310       320
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039018156 393 -----RCLGTAVEAALAIIQGPSaemsrgvvrrAGGNSRIIVCAGGPITYGPGS-VPHSMSHP----------NYPYMEK 456
Cdd:PLN00162  257 ghrpaRCTGAALSVAAGLLGACV----------PGTGARIMAFVGGPCTEGPGAiVSKDLSEPirshkdldkdAAPYYKK 326
                         330       340       350       360       370       380       390       400
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039018156 457 tAIKWMENLGREAHRHNTVVDIL-CA----GTCPLRVPILQplakaSGGVLVLHDDFG-EAFGVDLQRAATRAAG----- 525
Cdd:PLN00162  327 -AVKFYEGLAKQLVAQGHVLDVFaCSldqvGVAEMKVAVER-----TGGLVVLAESFGhSVFKDSLRRVFERDGEgslgl 400
                         410       420       430       440       450       460       470       480
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039018156 526 -SHGLLEVRCSDDILITQVIGPGeeahsethetfksdAALSIQMLSVEE-------TQSFSLSMENKR-------DIKSD 590
Cdd:PLN00162  401 sFNGTFEVNCSKDVKVQGAIGPC--------------ASLEKKGPSVSDteigeggTTAWKLCGLDKKtslavffEVANS 466
                         490       500       510       520       530       540       550       560
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039018156 591 HV----------FFQFAFHYSDVYQADVSRVITFKLPTVDSISA--YLQSVEDEASAVLISKRTLLLAKNQkDAVDMRAT 658
Cdd:PLN00162  467 GQsnpqppgqqfFLQFLTRYQHSNGQTRLRVTTVTRRWVEGSSSeeLVAGFDQEAAAVVMARLASHKMETE-EEFDATRW 545
                         570       580       590       600       610       620       630       640
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039018156 659 VDERIKDIALKFGSQV---PKSklYSFPKELSSLPELLFHLRRGPLLGNIIGHEDERSVLRnLFLN-ASFDLSLRMVAPR 734
Cdd:PLN00162  546 LDRALIRLCSKFGDYRkddPSS--FRLSPNFSLYPQFMFNLRRSQFVQVFNNSPDETAYFR-MMLNrENVTNSLVMIQPT 622
                         650       660       670       680       690       700       710       720
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039018156 735 CLMHqeggTFEELPAYDL----SMQSDKAVILD--------HGTDVFIW--LGAELSADEVKSAAVLAACRTLAEELTEF 800
Cdd:PLN00162  623 LISY----SFNGPPEPVLldvaSIAADRILLLDsyfsvvifHGSTIAQWrkAGYHNQPEHEAFAQLLEAPQADAQAIIKE 698
                         730       740
                  ....*....|....*....|....*.
gi 1039018156 801 RFPAPRILAFKEGSSQARYFVCRLIP 826
Cdd:PLN00162  699 RFPVPRLVVCDQHGSQARFLLAKLNP 724
COG5028 COG5028
Vesicle coat complex COPII, subunit SEC24/subunit SFB2/subunit SFB3 [Intracellular trafficking ...
1-773 1.25e-20

Vesicle coat complex COPII, subunit SEC24/subunit SFB2/subunit SFB3 [Intracellular trafficking and secretion];


Pssm-ID: 227361 [Multi-domain]  Cd Length: 861  Bit Score: 97.55  E-value: 1.25e-20
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039018156   1 MANLPKSsvNYPGTLTPLEPN----RPSPQPDRTPVPHSPPVVASPIPPRFPQPSfrpdqmsspsmkspsllspangiRT 76
Cdd:COG5028     1 MSQHKKG--VYPQAQSQVHTGaassKKSARPHRAYANFSAGQMGMPPYTTPPLQQ-----------------------QS 55
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039018156  77 GSPIPRLSTP-------PGPPVFNTPV----KPAAVPFRTSPATPQ--PMAYSSANSSLPVSTPSFYSNGSSVGSQRDLP 143
Cdd:COG5028    56 RRQIDQAATAmhntganNPAPSVMSPAfqsqQKFSSPYGGSMADGTapKPTNPLVPVDLFEDQPPPISDLFLPPPPIVPP 135
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039018156 144 D--VVRMEEPIAADSPYVLFSANKVLKQKKLANVASLGFGAIV----SAGREISPGPQIIQRDPHRCLNCGAYSNPYSSI 217
Cdd:COG5028   136 LttNFVGSEQSNCSPKYVRSTMYAIPETNDLLKKSKIPFGLVIrpflELYPEEDPVPLVEDGSIVRCRRCRSYINPFVQF 215
                         250       260       270       280       290       300       310       320
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039018156 218 LIGSGQWQCVICENMNGSKGEYVASS-----KNELQNFPELSLPLVDYVQTGNKR------PGFV-------PASDSRTS 279
Cdd:COG5028   216 IEQGRKWRCNICRSKNDVPEGFDNPSgpndpRSDRYSRPELKSGVVDFLAPKEYSlrqpppPVYVflidvsfEAIKNGLV 295
                         330       340       350       360       370       380       390       400
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039018156 280 APVVLVIDECLDephlqhlqsslhaFVDSLPQTTRLGIILYGRTVSIYDFSEDSVASADVISGAKSPSaesmkaLIYGTG 359
Cdd:COG5028   296 KAAIRAILENLD-------------QIPNFDPRTKIAIICFDSSLHFFKLSPDLDEQMLIVSDLDEPF------LPFPSG 356
                         410       420       430       440       450       460       470       480
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039018156 360 VYLSPMHAS-------LKVAHEIFSSLRpytlnVPEAsrdrCLGTAVEAALAIIQGpsaemsrgvvrrAGGnsRIIVCAG 432
Cdd:COG5028   357 LFVLPLKSCkqiietlLDRVPRIFQDNK-----SPKN----ALGPALKAAKSLIGG------------TGG--KIIVFLS 413
                         490       500       510       520       530       540       550       560
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039018156 433 GPITYGPGSVPHSmsHPNYPYMEKTAIKWMENLGREAHRHNTVVDILCAGTCPLRVPILQPLAKASGGVLVLHDDF---- 508
Cdd:COG5028   414 TLPNMGIGKLQLR--EDKESSLLSCKDSFYKEFAIECSKVGISVDLFLTSEDYIDVATLSHLCRYTGGQTYFYPNFsatr 491
                         570       580       590       600       610       620       630       640
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039018156 509 ---GEAFGVDLQRAATRAAGSHGLLEVRCSDDILITQVIGpgeeahsetHETFKSDAALSIQMLSVEETQSFSLSMENKr 585
Cdd:COG5028   492 pndATKLANDLVSHLSMEIGYEAVMRVRCSTGLRVSSFYG---------NFFNRSSDLCAFSTMPRDTSLLVEFSIDEK- 561
                         650       660       670       680       690       700       710       720
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039018156 586 dIKSDHVFFQFAFHYSDVYQADVSRVITFKLPTVDSISAYLQSVEDEASAVLISKRTLLLAKNqKDAVDMRATVDERIKD 665
Cdd:COG5028   562 -LMTSDVYFQVALLYTLNDGERRIRVVNLSLPTSSSIREVYASADQLAIACILAKKASTKALN-SSLKEARVLINKSMVD 639
                         730       740       750       760       770       780       790       800
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039018156 666 IALKFGSQVPKSK---LYSFPKELSSLPELLFHLRRGPLLGNIIGHEDERSVLRNLFLNASFDLSLRMVAPRCL----MH 738
Cdd:COG5028   640 ILKAYKKELVKSNtstQLPLPANLKLLPLLMLALLKSSAFRSGSTPSDIRISALNRLTSLPLKQLMRNIYPTLYalhdMP 719
                         810       820       830       840
                  ....*....|....*....|....*....|....*....|...
gi 1039018156 739 QEGGTFEE--------LPAYDLSMQSDKAVILDHGTDVFIWLG 773
Cdd:COG5028   720 IEAGLPDEgllvlpspINATSSLLESGGLYLIDTGQKIFLWFG 762
gelsolin_like cd11280
Tandemly repeated domains found in gelsolin, severin, villin, and related proteins; Gelsolin ...
733-824 4.76e-18

Tandemly repeated domains found in gelsolin, severin, villin, and related proteins; Gelsolin repeats occur in gelsolin, severin, villin, advillin, villidin, supervillin, flightless, quail, fragmin, and other proteins, usually in several copies. They co-occur with villin headpiece domains, leucine-rich repeats, and several other domains. These gelsolin-related actin binding proteins (GRABPs) play regulatory roles in the assembly and disassembly of actin filaments; they are involved in F-actin capping, uncapping, severing, or the nucleation of actin filaments. Severing of actin filaments is Ca2+ dependent. Villins are also linked to generating bundles of F-actin with uniform filament polarity, which is most likely mediated by their extra villin headpiece domain. Many family members have also adopted functions in the nucleus, including the regulation of transcription. Supervillin, gelsolin, and flightless I are involved in intracellular signaling via nuclear hormone receptors. The gelsolin-like domain is distantly related to the actin depolymerizing domains found in cofilin and similar proteins.


Pssm-ID: 200436  Cd Length: 88  Bit Score: 79.72  E-value: 4.76e-18
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039018156 733 PRCLMHQEG---GTFEELPAYDLSMQSDKAVILDHGTDVFIWLGAElsadevKSAAVLAACRTLAEELTEFRFPAPRILA 809
Cdd:cd11280     1 PPRLYRVRGskaIEIEEVPLASSSLDSDDVFVLDTGSEIYIWQGRA------SSQAELAAAALLAKELDEERKGKPEIVR 74
                          90
                  ....*....|....*
gi 1039018156 810 FKEGSSqARYFVCRL 824
Cdd:cd11280    75 IRQGQE-PREFWSLF 88
Sec23-like cd01478
Sec23-like: Protein and membrane traffic in eukaryotes is mediated by at least in part by the ...
278-518 4.01e-17

Sec23-like: Protein and membrane traffic in eukaryotes is mediated by at least in part by the budding and fusion of intracellular transport vesicles that selectively carry cargo proteins and lipids from donor to acceptor organelles. The two main classes of vesicular carriers within the endocytic and the biosynthetic pathways are COP- and clathrin-coated vesicles. Formation of COPII vesicles requires the ordered assembly of the coat built from several cytosolic components GTPase Sar1, complexes of Sec23-Sec24 and Sec13-Sec31. The process is initiated by the conversion of GDP to GTP by the GTPase Sar1 which then recruits the heterodimeric complex of Sec23 and Sec24. This heterodimeric complex generates the pre-budding complex. The final step leading to membrane deformation and budding of COPII-coated vesicles is carried by the heterodimeric complex Sec13-Sec31. The members of this CD belong to the Sec23-like family. Sec 23 is very similar to Sec24. The Sec23 and Sec24 polypeptides fold into five distinct domains: a beta-barrel, a zinc finger, a vWA or trunk, an all helical region and a carboxy Gelsolin domain. The members of this subgroup lack the consensus MIDAS motif but have the overall Para-Rossmann type fold that is characteristic of this superfamily.


Pssm-ID: 238755 [Multi-domain]  Cd Length: 267  Bit Score: 82.42  E-value: 4.01e-17
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039018156 278 TSAPVVL-VIDECLDEPHLQHLQSSLHAFVDSLPQTTRLGIILYGRTVSIYDFSEDSVASADVISGAKSPSAESMKALIY 356
Cdd:cd01478     1 TSPPVFLfVVDTCMDEEELDALKESLIMSLSLLPPNALVGLITFGTMVQVHELGFEECSKSYVFRGNKDYTAKQIQDMLG 80
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039018156 357 GTGVYLSPMHASLKVA-----------------------HEIFSSLRPYTLNVPEASRD-RCLGTAVEAALAIIQgpsae 412
Cdd:cd01478    81 LGGPAMRPSASQHPGAgnplpsaaasrfllpvsqceftlTDLLEQLQPDPWPVPAGHRPlRCTGVALSIAVGLLE----- 155
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039018156 413 msrGVVRRAGGnsRIIVCAGGPITYGPGSV-------P----HSMSHPNYPYMEKtAIKWMENLGREAHRHNTVVDILcA 481
Cdd:cd01478   156 ---ACFPNTGA--RIMLFAGGPCTVGPGAVvstelkdPirshHDIDKDNAKYYKK-AVKFYDSLAKRLAANGHAVDIF-A 228
                         250       260       270       280
                  ....*....|....*....|....*....|....*....|
gi 1039018156 482 GtCPLRVPILQ--PLAKASGGVLVLHDDFGEA-FGVDLQR 518
Cdd:cd01478   229 G-CLDQVGLLEmkVLVNSTGGHVVLSDSFTTSiFKQSFQR 267
Sec23_BS pfam08033
Sec23/Sec24 beta-sandwich domain;
525-619 1.92e-13

Sec23/Sec24 beta-sandwich domain;


Pssm-ID: 429794 [Multi-domain]  Cd Length: 86  Bit Score: 66.41  E-value: 1.92e-13
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039018156 525 GSHGLLEVRCSDDILITQVIGPGeeahsethetFKSDAALSIQMLSVEETQSFSLSMENKRDIKS-DHVFFQFAFHYSDV 603
Cdd:pfam08033   1 GFNAVLRVRTSKGLKVSGFIGNF----------VSRSSGDTWKLPSLDPDTSYAFEFDIDEPLPNgSNAYIQFALLYTHS 70
                          90
                  ....*....|....*.
gi 1039018156 604 YQADVSRVITFKLPTV 619
Cdd:pfam08033  71 SGERRIRVTTVALPVT 86
Sec23_helical pfam04815
Sec23/Sec24 helical domain; COPII-coated vesicles carry proteins from the endoplasmic ...
632-729 3.53e-12

Sec23/Sec24 helical domain; COPII-coated vesicles carry proteins from the endoplasmic reticulum to the Golgi complex. This vesicular transport can be reconstituted by using three cytosolic components containing five proteins: the small GTPase Sar1p, the Sec23p/24p complex, and the Sec13p/Sec31p complex. This domain is composed of five alpha helices.


Pssm-ID: 461441 [Multi-domain]  Cd Length: 103  Bit Score: 63.29  E-value: 3.53e-12
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039018156 632 EASAVLISKRTLLLAKNQKDAvDMRATVDERIKDIALKFGSQVPKSKLYS---FPKELSSLPELLFHLRRGPLL--GNII 706
Cdd:pfam04815   3 EAIAVLLAKKAVEKALSSSLS-DAREALDNKLVDILAAYRKYCASSSSPGqliLPESLKLLPLYMLALLKSPALrgGNSS 81
                          90       100
                  ....*....|....*....|...
gi 1039018156 707 gHEDERSVLRNLFLNASFDLSLR 729
Cdd:pfam04815  82 -PSDERAYARHLLLSLPVEELLL 103
zf-Sec23_Sec24 pfam04810
Sec23/Sec24 zinc finger; COPII-coated vesicles carry proteins from the endoplasmic reticulum ...
201-233 1.79e-10

Sec23/Sec24 zinc finger; COPII-coated vesicles carry proteins from the endoplasmic reticulum to the Golgi complex. This vesicular transport can be reconstituted by using three cytosolic components containing five proteins: the small GTPase Sar1p, the Sec23p/24p complex, and the Sec13p/Sec31p complex. This domain is found to be zinc binding domain.


Pssm-ID: 461437 [Multi-domain]  Cd Length: 38  Bit Score: 56.30  E-value: 1.79e-10
                          10        20        30
                  ....*....|....*....|....*....|...
gi 1039018156 201 PHRCLNCGAYSNPYSSILIGSGQWQCVICENMN 233
Cdd:pfam04810   1 PVRCRRCRAYLNPFCQFDFGGKKWTCNFCGTRN 33
Sec23_trunk pfam04811
Sec23/Sec24 trunk domain; COPII-coated vesicles carry proteins from the endoplasmic reticulum ...
283-518 3.36e-08

Sec23/Sec24 trunk domain; COPII-coated vesicles carry proteins from the endoplasmic reticulum to the Golgi complex. This vesicular transport can be reconstituted by using three cytosolic components containing five proteins: the small GTPase Sar1p, the Sec23p/24p complex, and the Sec13p/Sec31p complex. This domain is known as the trunk domain and has an alpha/beta vWA fold and forms the dimer interface.


Pssm-ID: 398467 [Multi-domain]  Cd Length: 241  Bit Score: 55.34  E-value: 3.36e-08
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039018156 283 VLVIDECLD---EPHLQHLQSSLHAFVDSLPQ--TTRLGIILYGRTVSIYDFSEDSVASADVISG----AKSPSAESMka 353
Cdd:pfam04811   7 LFVIDVSYNaikSGLLAALKESLLQSLDLLPGdpRARVGFITFDSTVHFFNLGSSLRQPQMLVVSdlqdMFLPLPDRF-- 84
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039018156 354 liygtgvyLSPMHASLKVAHEIFSSL-RPYTLN-VPEasrdRCLGTAVEAALAIIQGPSAemsrgvvrraGGnsRIIVCA 431
Cdd:pfam04811  85 --------LVPLSECRFVLEDLLEQLpPMFPVTkRPE----RCLGPALQAAFLLLKAAFT----------GG--KIMVFQ 140
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039018156 432 GGPITYGPGSV--------PHSMSHPNYPYMeKTAIKWMENLGREAHRHNTVVDILCAGTCPLRVPILQPLAKASGGVLV 503
Cdd:pfam04811 141 GGLPTVGPGGKlksrldesHHGTDKEKAKLV-KKADKFYKSLAKECVKQGHSVDLFAFSLDYVDVATLGQLSRLTGGQVY 219
                         250       260
                  ....*....|....*....|
gi 1039018156 504 LHDDF-----GEAFGVDLQR 518
Cdd:pfam04811 220 LYPSFqadvdGSKFKQDLQR 239
Gelsolin pfam00626
Gelsolin repeat;
753-818 3.46e-05

Gelsolin repeat;


Pssm-ID: 395501 [Multi-domain]  Cd Length: 76  Bit Score: 42.68  E-value: 3.46e-05
                          10        20        30        40        50        60
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*.
gi 1039018156 753 SMQSDKAVILDHGTDVFIWLGAELSADEVKSAAVLAacrtlAEELTEFRFPAPRILAFKEGSSQAR 818
Cdd:pfam00626  14 SLNSGDCYLLDNGFTIFLWVGKGSSLLEKLFAALLA-----AQLDDDERFPLPEVIRVPQGKEPAR 74
GEL smart00262
Gelsolin homology domain; Gelsolin/severin/villin homology domain. Calcium-binding and ...
753-814 2.37e-04

Gelsolin homology domain; Gelsolin/severin/villin homology domain. Calcium-binding and actin-binding. Both intra- and extracellular domains.


Pssm-ID: 214590 [Multi-domain]  Cd Length: 90  Bit Score: 40.74  E-value: 2.37e-04
                           10        20        30        40        50        60
                   ....*....|....*....|....*....|....*....|....*....|....*....|...
gi 1039018156  753 SMQSDKAVILDHGTDVFIWLGAELSADEVKSAAvlaacrTLAEEL-TEFRFPAPRILAFKEGS 814
Cdd:smart00262  22 SLNSGDCYILDTGSEIYVWVGKKSSQDEKKKAA------ELAVELdDTLGPGPVQVRVVDEGK 78
PHA03247 PHA03247
large tegument protein UL36; Provisional
17-128 5.66e-04

large tegument protein UL36; Provisional


Pssm-ID: 223021 [Multi-domain]  Cd Length: 3151  Bit Score: 43.77  E-value: 5.66e-04
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039018156   17 PLEPNRPSPQPDRTPVPHSPPVVAS-------PIPPRFPQPSFRPDQMSSPSMKSPSLLSPANGIRTGSPIPRLS--TPP 87
Cdd:PHA03247  2670 LGRAAQASSPPQRPRRRAARPTVGSltsladpPPPPPTPEPAPHALVSATPLPPGPAAARQASPALPAAPAPPAVpaGPA 2749
                           90       100       110       120
                   ....*....|....*....|....*....|....*....|....*...
gi 1039018156   88 GPPVFNTPVKPAA-------VPFRTSPATPQPMAYSSANSSLPVSTPS 128
Cdd:PHA03247  2750 TPGGPARPARPPTtagppapAPPAAPAAGPPRRLTRPAVASLSESRES 2797
gelsolin_S3_like cd11292
Gelsolin sub-domain 3-like domain found in gelsolin, severin, villin, and related proteins; ...
755-788 9.30e-04

Gelsolin sub-domain 3-like domain found in gelsolin, severin, villin, and related proteins; Gelsolin repeats occur in gelsolin, severin, villin, advillin, villidin, supervillin, flightless, quail, fragmin, and other proteins, usually in several copies. They co-occur with villin headpiece domains, leucine-rich repeats, and several other domains. These gelsolin-related actin binding proteins (GRABPs) play regulatory roles in the assembly and disassembly of actin filaments; they are involved in F-actin capping, uncapping, severing, or the nucleation of actin filaments. Severing of actin filaments is Ca2+ dependent. Villins are also linked to generating bundles of F-actin with uniform filament polarity, which is most likely mediated by their extra villin headpiece domain. Many family members have also adopted functions in the nucleus, including the regulation of transcription. Supervillin, gelsolin, and flightless I are involved in intracellular signaling via nuclear hormone receptors. The gelsolin-like domain is distantly related to the actin depolymerizing domains found in cofilin and similar proteins.


Pssm-ID: 200448  Cd Length: 98  Bit Score: 39.15  E-value: 9.30e-04
                          10        20        30
                  ....*....|....*....|....*....|....
gi 1039018156 755 QSDKAVILDHGTDVFIWLGAELSADEVKSAAVLA 788
Cdd:cd11292    32 DSEDCYILDCGSEIFVWVGKGASLDERKAALKNA 65
PHA03247 PHA03247
large tegument protein UL36; Provisional
17-157 1.36e-03

large tegument protein UL36; Provisional


Pssm-ID: 223021 [Multi-domain]  Cd Length: 3151  Bit Score: 42.62  E-value: 1.36e-03
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039018156   17 PLEPNRPSPQPDRTpVPHSPPVvASPIPPRFPQPSFRPDqmsspsmkspsllSPANGIRTGSPIPRLSTPPGPPVfNTPV 96
Cdd:PHA03247  2554 PLPPAAPPAAPDRS-VPPPRPA-PRPSEPAVTSRARRPD-------------APPQSARPRAPVDDRGDPRGPAP-PSPL 2617
                           90       100       110       120       130       140
                   ....*....|....*....|....*....|....*....|....*....|....*....|..
gi 1039018156   97 KPAAVPFRTSPATPQPMAYSSAN-SSLPVSTPSFYSNGSSVGSQRDLPDVVRMEEPIAADSP 157
Cdd:PHA03247  2618 PPDTHAPDPPPPSPSPAANEPDPhPPPTVPPPERPRDDPAPGRVSRPRRARRLGRAAQASSP 2679
PHA03247 PHA03247
large tegument protein UL36; Provisional
13-138 1.37e-03

large tegument protein UL36; Provisional


Pssm-ID: 223021 [Multi-domain]  Cd Length: 3151  Bit Score: 42.62  E-value: 1.37e-03
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039018156   13 GTLTPLEPNRPS--PQPDRTPVPHSPPVVASPIPPRFPQPSfrpdqmsspsmkspslLSPANGIRTGSPIPRLSTPPGPP 90
Cdd:PHA03247  2747 GPATPGGPARPArpPTTAGPPAPAPPAAPAAGPPRRLTRPA----------------VASLSESRESLPSPWDPADPPAA 2810
                           90       100       110       120
                   ....*....|....*....|....*....|....*....|....*...
gi 1039018156   91 VfntPVKPAAVPFRTSPATPQPMAYSSANSSLPVSTPSFYSNGSSVGS 138
Cdd:PHA03247  2811 V---LAPAAALPPAASPAGPLPPPTSAQPTAPPPPPGPPPPSLPLGGS 2855
PHA03247 PHA03247
large tegument protein UL36; Provisional
13-128 1.53e-03

large tegument protein UL36; Provisional


Pssm-ID: 223021 [Multi-domain]  Cd Length: 3151  Bit Score: 42.62  E-value: 1.53e-03
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039018156   13 GTLTPL-EPNRPSPQPDRTPVPHSPPVVASPIPPRFPQPSFRPDQMSSPSMKSPSLLSPANGIRTGSPiprlSTPPGPPV 91
Cdd:PHA03247  2693 GSLTSLaDPPPPPPTPEPAPHALVSATPLPPGPAAARQASPALPAAPAPPAVPAGPATPGGPARPARP----PTTAGPPA 2768
                           90       100       110
                   ....*....|....*....|....*....|....*..
gi 1039018156   92 FNTPVKPAAVPFRTSPATPQPMAYSSANSSLPVSTPS 128
Cdd:PHA03247  2769 PAPPAAPAAGPPRRLTRPAVASLSESRESLPSPWDPA 2805
Atrophin-1 pfam03154
Atrophin-1 family; Atrophin-1 is the protein product of the dentatorubral-pallidoluysian ...
2-152 1.76e-03

Atrophin-1 family; Atrophin-1 is the protein product of the dentatorubral-pallidoluysian atrophy (DRPLA) gene. DRPLA OMIM:125370 is a progressive neurodegenerative disorder. It is caused by the expansion of a CAG repeat in the DRPLA gene on chromosome 12p. This results in an extended polyglutamine region in atrophin-1, that is thought to confer toxicity to the protein, possibly through altering its interactions with other proteins. The expansion of a CAG repeat is also the underlying defect in six other neurodegenerative disorders, including Huntington's disease. One interaction of expanded polyglutamine repeats that is thought to be pathogenic is that with the short glutamine repeat in the transcriptional coactivator CREB binding protein, CBP. This interaction draws CBP away from its usual nuclear location to the expanded polyglutamine repeat protein aggregates that are characteriztic of the polyglutamine neurodegenerative disorders. This interferes with CBP-mediated transcription and causes cytotoxicity.


Pssm-ID: 460830 [Multi-domain]  Cd Length: 991  Bit Score: 42.06  E-value: 1.76e-03
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039018156   2 ANLPKssvnyPGTLTPLEpNRPSPQPdrtPVPHSPPVVASPIPPRFPQPSFRPDQMSSPSMKspsllsPANGIRTGSPIP 81
Cdd:pfam03154 388 SNLPP-----PPALKPLS-SLSTHHP---PSAHPPPLQLMPQSQQLPPPPAQPPVLTQSQSL------PPPAASHPPTSG 452
                          90       100       110       120       130       140       150
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|..
gi 1039018156  82 RLSTPPGPPVFNTPVKPAAVPFRTSPATPQPMAYSSANSSLPVSTPSFYSNGSSVGS-QRDLPDVVRMEEPI 152
Cdd:pfam03154 453 LHQVPSQSPFPQHPFVPGGPPPITPPSGPPTSTSSAMPGIQPPSSASVSSSGPVPAAvSCPLPPVQIKEEAL 524
PTZ00449 PTZ00449
104 kDa microneme/rhoptry antigen; Provisional
12-127 2.58e-03

104 kDa microneme/rhoptry antigen; Provisional


Pssm-ID: 185628 [Multi-domain]  Cd Length: 943  Bit Score: 41.60  E-value: 2.58e-03
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039018156  12 PGtltPLEPNRPSPQPDRTPVPHSPPVVASPIPPRFPQPSFRPdqmsspsmkspslLSPANGIRTGSP-IPRLST-PPGP 89
Cdd:PTZ00449  561 PG---PAKEHKPSKIPTLSKKPEFPKDPKHPKDPEEPKKPKRP-------------RSAQRPTRPKSPkLPELLDiPKSP 624
                          90       100       110
                  ....*....|....*....|....*....|....*....
gi 1039018156  90 PVFNTPVKPAAVPFRTSPATPQ-PMAYSSANSSLPVSTP 127
Cdd:PTZ00449  625 KRPESPKSPKRPPPPQRPSSPErPEGPKIIKSPKPPKSP 663
PHA03247 PHA03247
large tegument protein UL36; Provisional
1-136 2.94e-03

large tegument protein UL36; Provisional


Pssm-ID: 223021 [Multi-domain]  Cd Length: 3151  Bit Score: 41.46  E-value: 2.94e-03
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039018156    1 MANLPKSSVNYPGTLTPLEPNRPsPQPDRTPVPHSPPvvaSPIPPRFPQPSFRPDQMSSPSMKSPSLLSPANGIRTGSPI 80
Cdd:PHA03247  2886 LARPAVSRSTESFALPPDQPERP-PQPQAPPPPQPQP---QPPPPPQPQPPPPPPPRPQPPLAPTTDPAGAGEPSGAVPQ 2961
                           90       100       110       120       130
                   ....*....|....*....|....*....|....*....|....*....|....*..
gi 1039018156   81 PRLSTPpgppvfnTPVKPAAVPFRT-SPATPQPMAYSSANSSLPVSTPSFYSNGSSV 136
Cdd:PHA03247  2962 PWLGAL-------VPGRVAVPRFRVpQPAPSREAPASSTPPLTGHSLSRVSSWASSL 3011
PTZ00449 PTZ00449
104 kDa microneme/rhoptry antigen; Provisional
1-146 4.02e-03

104 kDa microneme/rhoptry antigen; Provisional


Pssm-ID: 185628 [Multi-domain]  Cd Length: 943  Bit Score: 40.83  E-value: 4.02e-03
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039018156   1 MANLPKSSVNYPGTLTPLEPnrPSPQPDRTPV-PHSPPVVASPIPPRFPQPSFRP-------DQMSSPSMKSPSLLSPAN 72
Cdd:PTZ00449  617 LLDIPKSPKRPESPKSPKRP--PPPQRPSSPErPEGPKIIKSPKPPKSPKPPFDPkfkekfyDDYLDAAAKSKETKTTVV 694
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039018156  73 GIRTGSPIPRLS---TPPGPPVFNTPVKPAAVPFRTSPATP--QPMAYSSANSSL---PVSTPSFYsngSSVGSQRDLPD 144
Cdd:PTZ00449  695 LDESFESILKETlpeTPGTPFTTPRPLPPKLPRDEEFPFEPigDPDAEQPDDIEFftpPEEERTFF---HETPADTPLPD 771

                  ..
gi 1039018156 145 VV 146
Cdd:PTZ00449  772 IL 773
PHA02682 PHA02682
ORF080 virion core protein; Provisional
12-176 4.22e-03

ORF080 virion core protein; Provisional


Pssm-ID: 177464 [Multi-domain]  Cd Length: 280  Bit Score: 40.23  E-value: 4.22e-03
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039018156  12 PGTLTPLEPNRPSPQP-DRTPVPhSPPVVASPIPPRFPQPSFRPDQmsspsmkspsllSPANGIRTGSPIPRLSTPPGPP 90
Cdd:PHA02682   76 PSGQSPLAPSPACAAPaPACPAC-APAAPAPAVTCPAPAPACPPAT------------APTCPPPAVCPAPARPAPACPP 142
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039018156  91 vfNTPVKPAAVPFrtspATPQPmayssanssLPVSTPSFYSN--------GSSVGSQRDLPDVVRMEEPIAADSPYVLFS 162
Cdd:PHA02682  143 --STRQCPPAPPL----PTPKP---------APAAKPIFLHNqlpppdypAASCPTIETAPAASPVLEPRIPDKIIDADN 207
                         170
                  ....*....|....
gi 1039018156 163 ANKVLKQKKLANVA 176
Cdd:PHA02682  208 DDKDLIKKELADIA 221
PRK14959 PRK14959
DNA polymerase III subunits gamma and tau; Provisional
1-123 4.45e-03

DNA polymerase III subunits gamma and tau; Provisional


Pssm-ID: 184923 [Multi-domain]  Cd Length: 624  Bit Score: 40.82  E-value: 4.45e-03
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039018156   1 MANLPKssvnypgtLTPLEPNRPSPQPDRTPVPHSPPVVASPIPPRFPQP-SFRPDQMSSPSMKSPSLLSPANGIRTGSP 79
Cdd:PRK14959  359 LAMLPR--------LMPVESLRPSGGGASAPSGSAAEGPASGGAATIPTPgTQGPQGTAPAAGMTPSSAAPATPAPSAAP 430
                          90       100       110       120
                  ....*....|....*....|....*....|....*....|....*..
gi 1039018156  80 IPRL---STPPGPPVFNTPVKPAAVPFRTSPATPQPMAYSSANSSLP 123
Cdd:PRK14959  431 SPRVpwdDAPPAPPRSGIPPRPAPRMPEASPVPGAPDSVASASDAPP 477
PHA03247 PHA03247
large tegument protein UL36; Provisional
12-128 6.72e-03

large tegument protein UL36; Provisional


Pssm-ID: 223021 [Multi-domain]  Cd Length: 3151  Bit Score: 40.31  E-value: 6.72e-03
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039018156   12 PGTLTPLEPNRPSPQPDRTPVPhsPPVVASPIPPRFPQPSFRPDQMSSPSMKSPSLLSPANGIRTGSPIPRLSTPPGPPV 91
Cdd:PHA03247  2720 PLPPGPAAARQASPALPAAPAP--PAVPAGPATPGGPARPARPPTTAGPPAPAPPAAPAAGPPRRLTRPAVASLSESRES 2797
                           90       100       110
                   ....*....|....*....|....*....|....*..
gi 1039018156   92 FNTPVKPAAVPFRTSPATPQPMAYSSANSSLPVSTPS 128
Cdd:PHA03247  2798 LPSPWDPADPPAAVLAPAAALPPAASPAGPLPPPTSA 2834
PspC_subgroup_2 NF033839
pneumococcal surface protein PspC, LPXTG-anchored form; The pneumococcal surface protein PspC, ...
3-143 7.85e-03

pneumococcal surface protein PspC, LPXTG-anchored form; The pneumococcal surface protein PspC, as described in Streptococcus pneumoniae, is a repetitive and highly variable protein, recognized by a conserved N-terminal domain and also by genomic location. This form, subgroup 2, is anchored covalently after cleavage by sortase at a C-terminal LPXTG site. The other form, subgroup 1, has variable numbers of a choline-binding repeat in the C-terminal region, and is also known as choline-binding protein A.


Pssm-ID: 468202 [Multi-domain]  Cd Length: 557  Bit Score: 39.75  E-value: 7.85e-03
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039018156   3 NLPKSSVNyPGTLTPLEPNRPSPQPDRTPVPHSP--------PVVASPIP-----PRFPQPSFRPDQMSSPSMKSPSLLS 69
Cdd:NF033839  282 DTPKEPGN-KKPSAPKPGMQPSPQPEKKEVKPEPetpkpevkPQLEKPKPevkpqPEKPKPEVKPQLETPKPEVKPQPEK 360
                          90       100       110       120       130       140       150
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....
gi 1039018156  70 PANGIRTGSPIPRLSTPPGPPVFNTPVKPaavpfrtSPATPQPMAYSSANSSLPVSTPSFYSNGSSVGSQRDLP 143
Cdd:NF033839  361 PKPEVKPQPEKPKPEVKPQPETPKPEVKP-------QPEKPKPEVKPQPEKPKPEVKPQPEKPKPEVKPQPEKP 427
 
Blast search parameters
Data Source: Precalculated data, version = cdd.v.3.21
Preset Options:Database: CDSEARCH/cdd   Low complexity filter: no  Composition Based Adjustment: yes   E-value threshold: 0.01

References:

  • Wang J et al. (2023), "The conserved domain database in 2023", Nucleic Acids Res.51(D)384-8.
  • Lu S et al. (2020), "The conserved domain database in 2020", Nucleic Acids Res.48(D)265-8.
  • Marchler-Bauer A et al. (2017), "CDD/SPARCLE: functional classification of proteins via subfamily domain architectures.", Nucleic Acids Res.45(D)200-3.
Help | Disclaimer | Write to the Help Desk
NCBI | NLM | NIH