NCBI Home Page NCBI Site Search page NCBI Guide that lists and describes the NCBI resources
Conserved domains on  [gi|1907118297|ref|XP_036015882|]
View 

target of Nesh-SH3 isoform X4 [Mus musculus]

Protein Classification

fibronectin type III domain-containing protein( domain architecture ID 10440918)

fibronectin type III (FN3) domain-containing protein similar to human Target of Nesh-SH3 (Tarsh) and Drosophila melanogaster cytokine receptor (protein domeless)

Graphical summary

 Zoom to residue level

show extra options »

Show site features     Horizontal zoom: ×

List of domain hits

Name Accession Description Interval E-value
PTZ00449 super family cl33186
104 kDa microneme/rhoptry antigen; Provisional
509-764 1.15e-12

104 kDa microneme/rhoptry antigen; Provisional


The actual alignment was detected with superfamily member PTZ00449:

Pssm-ID: 185628 [Multi-domain]  Cd Length: 943  Bit Score: 73.19  E-value: 1.15e-12
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907118297  509 TPKRQSTPKPPRV-----KPAPEPETRPSA--QTTKAPRKTKKPGHHRlrRPKTTRSPEVPKS--KPALEPATVTPEILV 579
Cdd:PTZ00449   542 EPKEGGKPGETKEgevgkKPGPAKEHKPSKipTLSKKPEFPKDPKHPK--DPEEPKKPKRPRSaqRPTRPKSPKLPELLD 619
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907118297  580 PKIVPKPPQKPKATRRPEVPQvKPAHEPVTFGSEAPalaivtttdiepvitrtKASVTTLAPKPPRPRTHRQrtKYKTTQ 659
Cdd:PTZ00449   620 IPKSPKRPESPKSPKRPPPPQ-RPSSPERPEGPKII-----------------KSPKPPKSPKPPFDPKFKE--KFYDDY 679
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907118297  660 SPKIPHSKPASTTKKVRRPRPKPQTTPHPEVPHTilvPATSLEPfiiteapgttLVPKLPQQPDYPHPKPKTTRSPAASP 739
Cdd:PTZ00449   680 LDAAAKSKETKTTVVLDESFESILKETLPETPGT---PFTTPRP----------LPPKLPRDEEFPFEPIGDPDAEQPDD 746
                          250       260
                   ....*....|....*....|....*
gi 1907118297  740 TELVPTPVfEPVTPLKEDPVTTIVP 764
Cdd:PTZ00449   747 IEFFTPPE-EERTFFHETPADTPLP 770
PHA03247 super family cl33720
large tegument protein UL36; Provisional
456-1085 4.27e-12

large tegument protein UL36; Provisional


The actual alignment was detected with superfamily member PHA03247:

Pssm-ID: 223021 [Multi-domain]  Cd Length: 3151  Bit Score: 71.89  E-value: 4.27e-12
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907118297  456 DSVPPKTSRTAEQPRATLAPIEALFESRNveifTSPEVRPTTAAPQQTTSIPSTPKRQSTPKPPRVKPAPEPETRPSaqt 535
Cdd:PHA03247  2558 AAPPAAPDRSVPPPRPAPRPSEPAVTSRA----RRPDAPPQSARPRAPVDDRGDPRGPAPPSPLPPDTHAPDPPPPS--- 2630
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907118297  536 tKAPRKTKKPGHHRLRRPKttrsPEVPKSKPAlePATVTPEILVPKIvPKPPQKPKATRRPEVPQVKPAHEPVTFGSEAP 615
Cdd:PHA03247  2631 -PSPAANEPDPHPPPTVPP----PERPRDDPA--PGRVSRPRRARRL-GRAAQASSPPQRPRRRAARPTVGSLTSLADPP 2702
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907118297  616 alaivtttDIEPVITRTKASVTTLAPKPPRPRTHRQRTKyKTTQSPKIPHSKPASTTKKVRRPRPKPQTTPHPEVPHTIL 695
Cdd:PHA03247  2703 --------PPPPTPEPAPHALVSATPLPPGPAAARQASP-ALPAAPAPPAVPAGPATPGGPARPARPPTTAGPPAPAPPA 2773
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907118297  696 VPATSLEPfIITEAPGTTLVPKLPQQPD----YPHPKPKTTRSPAASPTELVPTPVFEPVTPLKEDPVTTIVPitdLERV 771
Cdd:PHA03247  2774 APAAGPPR-RLTRPAVASLSESRESLPSpwdpADPPAAVLAPAAALPPAASPAGPLPPPTSAQPTAPPPPPGP---PPPS 2849
                          330       340       350       360       370       380       390       400
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907118297  772 TDLETPVA----FRTEAPGTTLASKISQRTHRPRPRPRPRPRPRPRPKATLSPQAPETKTVPAVVLEPVTLRPEVQVTTL 847
Cdd:PHA03247  2850 LPLGGSVApggdVRRRPPSRSPAAKPAAPARPPVRRLARPAVSRSTESFALPPDQPERPPQPQAPPPPQPQPQPPPPPQP 2929
                          410       420       430       440       450       460       470       480
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907118297  848 APQKTQKKHRPSPKPKPVPSPEVTESKPVLPrvrepvtlrtETWVTTKAPKTPKRTRRPRPkpqttptpetpltkpvaat 927
Cdd:PHA03247  2930 QPPPPPPPRPQPPLAPTTDPAGAGEPSGAVP----------QPWLGALVPGRVAVPRFRVP------------------- 2980
                          490       500       510       520       530       540       550       560
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907118297  928 dlePSALSTEVPAtvvlataltpvtlrtkaPKTTTlapnvqRTRRPHPRPKTTASTGVSESKSAPtelqslvlkpvtsPS 1007
Cdd:PHA03247  2981 ---QPAPSREAPA-----------------SSTPP------LTGHSLSRVSSWASSLALHEETDP-------------PP 3021
                          570       580       590       600       610       620       630
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*...
gi 1907118297 1008 LEIIQSQSVSDDLElvafstespqKTIAPRQTTSMPPKLKTPhsrmpAKEPVPKEPLHTTSKPKMPPSPEvADTTSVP 1085
Cdd:PHA03247  3022 VSLKQTLWPPDDTE----------DSDADSLFDSDSERSDLE-----ALDPLPPEPHDPFAHEPDPATPE-AGARESP 3083
FN3 cd00063
Fibronectin type 3 domain; One of three types of internal repeats found in the plasma protein ...
1390-1481 3.30e-10

Fibronectin type 3 domain; One of three types of internal repeats found in the plasma protein fibronectin. Its tenth fibronectin type III repeat contains an RGD cell recognition sequence in a flexible loop between 2 strands. Approximately 2% of all animal proteins contain the FN3 repeat; including extracellular and intracellular proteins, membrane spanning cytokine receptors, growth hormone receptors, tyrosine phosphatase receptors, and adhesion molecules. FN3-like domains are also found in bacterial glycosyl hydrolases.


:

Pssm-ID: 238020 [Multi-domain]  Cd Length: 93  Bit Score: 58.28  E-value: 3.30e-10
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907118297 1390 NPPTNLTVVTVEgcPSFVILDWEKPLNDT--VTEYEVISRENGSFSGKNKSIQITNQTFSTVENLKPDTSYEFQVKPKNP 1467
Cdd:cd00063      2 SPPTNLRVTDVT--STSVTLSWTPPEDDGgpITGYVVEYREKGSGDWKEVEVTPGSETSYTLTGLKPGTEYEFRVRAVNG 79
                           90
                   ....*....|....
gi 1907118297 1468 LGEGPASNTVAFST 1481
Cdd:cd00063     80 GGESPPSESVTVTT 93
PHA03247 super family cl33720
large tegument protein UL36; Provisional
945-1336 2.90e-06

large tegument protein UL36; Provisional


The actual alignment was detected with superfamily member PHA03247:

Pssm-ID: 223021 [Multi-domain]  Cd Length: 3151  Bit Score: 52.63  E-value: 2.90e-06
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907118297  945 ATALTPVTLRTKAPKTTTLAPNVQRTRRPHPRPKTTASTGvSESKSAPTELQSLVLKPVTSPSLEIIQSQSVSDDLELVA 1024
Cdd:PHA03247  2596 ARPRAPVDDRGDPRGPAPPSPLPPDTHAPDPPPPSPSPAA-NEPDPHPPPTVPPPERPRDDPAPGRVSRPRRARRLGRAA 2674
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907118297 1025 FSTESPQK--------TIAPRQTTSMPP-KLKTPHSRMPAKEP-VPKEPLHTTSKPKMPPSPEVADTTSVPKDERLSLKP 1094
Cdd:PHA03247  2675 QASSPPQRprrraarpTVGSLTSLADPPpPPPTPEPAPHALVSaTPLPPGPAAARQASPALPAAPAPPAVPAGPATPGGP 2754
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907118297 1095 DPEVTHSETVLPPVTFRVEPPKTTIAPLETR--GIPLIPVISPRPSQEELQTAMEETDQSTQELFTTKIPRTTELAKTTQ 1172
Cdd:PHA03247  2755 ARPARPPTTAGPPAPAPPAAPAAGPPRRLTRpaVASLSESRESLPSPWDPADPPAAVLAPAAALPPAASPAGPLPPPTSA 2834
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907118297 1173 APhrlhTAPVRPRIPGRPH---------GRPALNKTTTRPDKTKPRGTSHKNGVGTGTKQAPKPPSPGRNASVDSHATRK 1243
Cdd:PHA03247  2835 QP----TAPPPPPGPPPPSlplggsvapGGDVRRRPPSRSPAAKPAAPARPPVRRLARPAVSRSTESFALPPDQPERPPQ 2910
                          330       340       350       360       370       380       390       400
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907118297 1244 PGSVSGTRRPPIPHRHSSTRPvSPERRPLPPNNVTGKPGRAGIVSSSRVTSPPLKATLHPIGTATARPGAEQKEPTAPAS 1323
Cdd:PHA03247  2911 PQAPPPPQPQPQPPPPPQPQP-PPPPPPRPQPPLAPTTDPAGAGEPSGAVPQPWLGALVPGRVAVPRFRVPQPAPSREAP 2989
                          410
                   ....*....|...
gi 1907118297 1324 EEEFGTTTDFSSS 1336
Cdd:PHA03247  2990 ASSTPPLTGHSLS 3002
fn3 pfam00041
Fibronectin type III domain;
116-195 1.94e-04

Fibronectin type III domain;


:

Pssm-ID: 394996 [Multi-domain]  Cd Length: 85  Bit Score: 41.63  E-value: 1.94e-04
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907118297  116 KPLQLVVGTLTPSSVFLSWgflinphhdwTLPSHCPSD-RFYTIRYREKDKEKKWIFQLCPATET--IVENLKPNTVYEF 192
Cdd:pfam00041    2 APSNLTVTDVTSTSLTVSW----------TPPPDGNGPiTGYEVEYRPKNSGEPWNEITVPGTTTsvTLTGLKPGTEYEV 71

                   ...
gi 1907118297  193 GVK 195
Cdd:pfam00041   72 RVQ 74
PHA03247 super family cl33720
large tegument protein UL36; Provisional
307-575 3.00e-03

large tegument protein UL36; Provisional


The actual alignment was detected with superfamily member PHA03247:

Pssm-ID: 223021 [Multi-domain]  Cd Length: 3151  Bit Score: 42.62  E-value: 3.00e-03
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907118297  307 TLALPAESKTPEVEKLAGQPVTVTPESVSRSTKPTLSSALDTAETALVLSEKTSE-TARSVLIPEFELPLSTLAPkrfpe 385
Cdd:PHA03247  2756 RPARPPTTAGPPAPAPPAAPAAGPPRRLTRPAVASLSESRESLPSPWDPADPPAAvLAPAAALPPAASPAGPLPP----- 2830
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907118297  386 fPEAKTAFPLEKPRGSWASSEEP--WVVPGAKtsedsrvvqpqtatydvISSSTTSDETEIEIHTATRDPILDSVPPKTS 463
Cdd:PHA03247  2831 -PTSAQPTAPPPPPGPPPPSLPLggSVAPGGD-----------------VRRRPPSRSPAAKPAAPARPPVRRLARPAVS 2892
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907118297  464 RTAE---QPRATLAPiealfeSRNVEIFTSPEVRPTTAAPQQTTSIPSTPKRQSTPKPPRVKPAPEPEtrPSAQTTKAPR 540
Cdd:PHA03247  2893 RSTEsfaLPPDQPER------PPQPQAPPPPQPQPQPPPPPQPQPPPPPPPRPQPPLAPTTDPAGAGE--PSGAVPQPWL 2964
                          250       260       270
                   ....*....|....*....|....*....|....*
gi 1907118297  541 KTKKPGHHRLRRpktTRSPEVPKSKPALEPATVTP 575
Cdd:PHA03247  2965 GALVPGRVAVPR---FRVPQPAPSREAPASSTPPL 2996
 
Name Accession Description Interval E-value
PTZ00449 PTZ00449
104 kDa microneme/rhoptry antigen; Provisional
509-764 1.15e-12

104 kDa microneme/rhoptry antigen; Provisional


Pssm-ID: 185628 [Multi-domain]  Cd Length: 943  Bit Score: 73.19  E-value: 1.15e-12
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907118297  509 TPKRQSTPKPPRV-----KPAPEPETRPSA--QTTKAPRKTKKPGHHRlrRPKTTRSPEVPKS--KPALEPATVTPEILV 579
Cdd:PTZ00449   542 EPKEGGKPGETKEgevgkKPGPAKEHKPSKipTLSKKPEFPKDPKHPK--DPEEPKKPKRPRSaqRPTRPKSPKLPELLD 619
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907118297  580 PKIVPKPPQKPKATRRPEVPQvKPAHEPVTFGSEAPalaivtttdiepvitrtKASVTTLAPKPPRPRTHRQrtKYKTTQ 659
Cdd:PTZ00449   620 IPKSPKRPESPKSPKRPPPPQ-RPSSPERPEGPKII-----------------KSPKPPKSPKPPFDPKFKE--KFYDDY 679
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907118297  660 SPKIPHSKPASTTKKVRRPRPKPQTTPHPEVPHTilvPATSLEPfiiteapgttLVPKLPQQPDYPHPKPKTTRSPAASP 739
Cdd:PTZ00449   680 LDAAAKSKETKTTVVLDESFESILKETLPETPGT---PFTTPRP----------LPPKLPRDEEFPFEPIGDPDAEQPDD 746
                          250       260
                   ....*....|....*....|....*
gi 1907118297  740 TELVPTPVfEPVTPLKEDPVTTIVP 764
Cdd:PTZ00449   747 IEFFTPPE-EERTFFHETPADTPLP 770
PHA03247 PHA03247
large tegument protein UL36; Provisional
456-1085 4.27e-12

large tegument protein UL36; Provisional


Pssm-ID: 223021 [Multi-domain]  Cd Length: 3151  Bit Score: 71.89  E-value: 4.27e-12
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907118297  456 DSVPPKTSRTAEQPRATLAPIEALFESRNveifTSPEVRPTTAAPQQTTSIPSTPKRQSTPKPPRVKPAPEPETRPSaqt 535
Cdd:PHA03247  2558 AAPPAAPDRSVPPPRPAPRPSEPAVTSRA----RRPDAPPQSARPRAPVDDRGDPRGPAPPSPLPPDTHAPDPPPPS--- 2630
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907118297  536 tKAPRKTKKPGHHRLRRPKttrsPEVPKSKPAlePATVTPEILVPKIvPKPPQKPKATRRPEVPQVKPAHEPVTFGSEAP 615
Cdd:PHA03247  2631 -PSPAANEPDPHPPPTVPP----PERPRDDPA--PGRVSRPRRARRL-GRAAQASSPPQRPRRRAARPTVGSLTSLADPP 2702
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907118297  616 alaivtttDIEPVITRTKASVTTLAPKPPRPRTHRQRTKyKTTQSPKIPHSKPASTTKKVRRPRPKPQTTPHPEVPHTIL 695
Cdd:PHA03247  2703 --------PPPPTPEPAPHALVSATPLPPGPAAARQASP-ALPAAPAPPAVPAGPATPGGPARPARPPTTAGPPAPAPPA 2773
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907118297  696 VPATSLEPfIITEAPGTTLVPKLPQQPD----YPHPKPKTTRSPAASPTELVPTPVFEPVTPLKEDPVTTIVPitdLERV 771
Cdd:PHA03247  2774 APAAGPPR-RLTRPAVASLSESRESLPSpwdpADPPAAVLAPAAALPPAASPAGPLPPPTSAQPTAPPPPPGP---PPPS 2849
                          330       340       350       360       370       380       390       400
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907118297  772 TDLETPVA----FRTEAPGTTLASKISQRTHRPRPRPRPRPRPRPRPKATLSPQAPETKTVPAVVLEPVTLRPEVQVTTL 847
Cdd:PHA03247  2850 LPLGGSVApggdVRRRPPSRSPAAKPAAPARPPVRRLARPAVSRSTESFALPPDQPERPPQPQAPPPPQPQPQPPPPPQP 2929
                          410       420       430       440       450       460       470       480
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907118297  848 APQKTQKKHRPSPKPKPVPSPEVTESKPVLPrvrepvtlrtETWVTTKAPKTPKRTRRPRPkpqttptpetpltkpvaat 927
Cdd:PHA03247  2930 QPPPPPPPRPQPPLAPTTDPAGAGEPSGAVP----------QPWLGALVPGRVAVPRFRVP------------------- 2980
                          490       500       510       520       530       540       550       560
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907118297  928 dlePSALSTEVPAtvvlataltpvtlrtkaPKTTTlapnvqRTRRPHPRPKTTASTGVSESKSAPtelqslvlkpvtsPS 1007
Cdd:PHA03247  2981 ---QPAPSREAPA-----------------SSTPP------LTGHSLSRVSSWASSLALHEETDP-------------PP 3021
                          570       580       590       600       610       620       630
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*...
gi 1907118297 1008 LEIIQSQSVSDDLElvafstespqKTIAPRQTTSMPPKLKTPhsrmpAKEPVPKEPLHTTSKPKMPPSPEvADTTSVP 1085
Cdd:PHA03247  3022 VSLKQTLWPPDDTE----------DSDADSLFDSDSERSDLE-----ALDPLPPEPHDPFAHEPDPATPE-AGARESP 3083
FN3 cd00063
Fibronectin type 3 domain; One of three types of internal repeats found in the plasma protein ...
1390-1481 3.30e-10

Fibronectin type 3 domain; One of three types of internal repeats found in the plasma protein fibronectin. Its tenth fibronectin type III repeat contains an RGD cell recognition sequence in a flexible loop between 2 strands. Approximately 2% of all animal proteins contain the FN3 repeat; including extracellular and intracellular proteins, membrane spanning cytokine receptors, growth hormone receptors, tyrosine phosphatase receptors, and adhesion molecules. FN3-like domains are also found in bacterial glycosyl hydrolases.


Pssm-ID: 238020 [Multi-domain]  Cd Length: 93  Bit Score: 58.28  E-value: 3.30e-10
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907118297 1390 NPPTNLTVVTVEgcPSFVILDWEKPLNDT--VTEYEVISRENGSFSGKNKSIQITNQTFSTVENLKPDTSYEFQVKPKNP 1467
Cdd:cd00063      2 SPPTNLRVTDVT--STSVTLSWTPPEDDGgpITGYVVEYREKGSGDWKEVEVTPGSETSYTLTGLKPGTEYEFRVRAVNG 79
                           90
                   ....*....|....
gi 1907118297 1468 LGEGPASNTVAFST 1481
Cdd:cd00063     80 GGESPPSESVTVTT 93
FN3 smart00060
Fibronectin type 3 domain; One of three types of internal repeat within the plasma protein, ...
1391-1471 9.88e-08

Fibronectin type 3 domain; One of three types of internal repeat within the plasma protein, fibronectin. The tenth fibronectin type III repeat contains a RGD cell recognition sequence in a flexible loop between 2 strands. Type III modules are present in both extracellular and intracellular proteins.


Pssm-ID: 214495 [Multi-domain]  Cd Length: 83  Bit Score: 51.08  E-value: 9.88e-08
                            10        20        30        40        50        60        70        80
                    ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907118297  1391 PPTNLTVVTVEgcPSFVILDWEKPLNDTVTEYEV-ISRENGSFSGKNKSIQITNQTFS-TVENLKPDTSYEFQVKPKNPL 1468
Cdd:smart00060    3 PPSNLRVTDVT--STSVTLSWEPPPDDGITGYIVgYRVEYREEGSEWKEVNVTPSSTSyTLTGLKPGTEYEFRVRAVNGA 80

                    ...
gi 1907118297  1469 GEG 1471
Cdd:smart00060   81 GEG 83
fn3 pfam00041
Fibronectin type III domain;
1391-1474 1.26e-06

Fibronectin type III domain;


Pssm-ID: 394996 [Multi-domain]  Cd Length: 85  Bit Score: 47.79  E-value: 1.26e-06
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907118297 1391 PPTNLTVVTVEgcPSFVILDWEKP--LNDTVTEYEVISRENGSFSGKNkSIQITNQTFS-TVENLKPDTSYEFQVKPKNP 1467
Cdd:pfam00041    2 APSNLTVTDVT--STSLTVSWTPPpdGNGPITGYEVEYRPKNSGEPWN-EITVPGTTTSvTLTGLKPGTEYEVRVQAVNG 78

                   ....*..
gi 1907118297 1468 LGEGPAS 1474
Cdd:pfam00041   79 GGEGPPS 85
PHA03247 PHA03247
large tegument protein UL36; Provisional
945-1336 2.90e-06

large tegument protein UL36; Provisional


Pssm-ID: 223021 [Multi-domain]  Cd Length: 3151  Bit Score: 52.63  E-value: 2.90e-06
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907118297  945 ATALTPVTLRTKAPKTTTLAPNVQRTRRPHPRPKTTASTGvSESKSAPTELQSLVLKPVTSPSLEIIQSQSVSDDLELVA 1024
Cdd:PHA03247  2596 ARPRAPVDDRGDPRGPAPPSPLPPDTHAPDPPPPSPSPAA-NEPDPHPPPTVPPPERPRDDPAPGRVSRPRRARRLGRAA 2674
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907118297 1025 FSTESPQK--------TIAPRQTTSMPP-KLKTPHSRMPAKEP-VPKEPLHTTSKPKMPPSPEVADTTSVPKDERLSLKP 1094
Cdd:PHA03247  2675 QASSPPQRprrraarpTVGSLTSLADPPpPPPTPEPAPHALVSaTPLPPGPAAARQASPALPAAPAPPAVPAGPATPGGP 2754
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907118297 1095 DPEVTHSETVLPPVTFRVEPPKTTIAPLETR--GIPLIPVISPRPSQEELQTAMEETDQSTQELFTTKIPRTTELAKTTQ 1172
Cdd:PHA03247  2755 ARPARPPTTAGPPAPAPPAAPAAGPPRRLTRpaVASLSESRESLPSPWDPADPPAAVLAPAAALPPAASPAGPLPPPTSA 2834
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907118297 1173 APhrlhTAPVRPRIPGRPH---------GRPALNKTTTRPDKTKPRGTSHKNGVGTGTKQAPKPPSPGRNASVDSHATRK 1243
Cdd:PHA03247  2835 QP----TAPPPPPGPPPPSlplggsvapGGDVRRRPPSRSPAAKPAAPARPPVRRLARPAVSRSTESFALPPDQPERPPQ 2910
                          330       340       350       360       370       380       390       400
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907118297 1244 PGSVSGTRRPPIPHRHSSTRPvSPERRPLPPNNVTGKPGRAGIVSSSRVTSPPLKATLHPIGTATARPGAEQKEPTAPAS 1323
Cdd:PHA03247  2911 PQAPPPPQPQPQPPPPPQPQP-PPPPPPRPQPPLAPTTDPAGAGEPSGAVPQPWLGALVPGRVAVPRFRVPQPAPSREAP 2989
                          410
                   ....*....|...
gi 1907118297 1324 EEEFGTTTDFSSS 1336
Cdd:PHA03247  2990 ASSTPPLTGHSLS 3002
Atrophin-1 pfam03154
Atrophin-1 family; Atrophin-1 is the protein product of the dentatorubral-pallidoluysian ...
491-754 1.02e-04

Atrophin-1 family; Atrophin-1 is the protein product of the dentatorubral-pallidoluysian atrophy (DRPLA) gene. DRPLA OMIM:125370 is a progressive neurodegenerative disorder. It is caused by the expansion of a CAG repeat in the DRPLA gene on chromosome 12p. This results in an extended polyglutamine region in atrophin-1, that is thought to confer toxicity to the protein, possibly through altering its interactions with other proteins. The expansion of a CAG repeat is also the underlying defect in six other neurodegenerative disorders, including Huntington's disease. One interaction of expanded polyglutamine repeats that is thought to be pathogenic is that with the short glutamine repeat in the transcriptional coactivator CREB binding protein, CBP. This interaction draws CBP away from its usual nuclear location to the expanded polyglutamine repeat protein aggregates that are characteriztic of the polyglutamine neurodegenerative disorders. This interferes with CBP-mediated transcription and causes cytotoxicity.


Pssm-ID: 460830 [Multi-domain]  Cd Length: 991  Bit Score: 47.07  E-value: 1.02e-04
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907118297  491 PEVRPTTAAPQQTTSIPSTPKRQSTPKPPRVKPAPEPETRPSAQTTKAPRKTKKPGHHRLRRPKTTRSPEVPKSKPALEP 570
Cdd:pfam03154  172 PVLQAQSGAASPPSPPPPGTTQAATAGPTPSAPSVPPQGSPATSQPPNQTQSTAAPHTLIQQTPTLHPQRLPSPHPPLQP 251
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907118297  571 ATVT--PEILVPKIVPKPPQKPKATRRPEVPQVKPAHEPVTFGSEAPALaivtttdiepvitrTKASVTTLAPKPPRPRT 648
Cdd:pfam03154  252 MTQPppPSQVSPQPLPQPSLHGQMPPMPHSLQTGPSHMQHPVPPQPFPL--------------TPQSSQSQVPPGPSPAA 317
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907118297  649 HRQrtkykTTQSPKIPHSKPASTTKKVRRPRPKPqttphpevphtilvPATSLEPFIitEAPGTTLVPKLPQQPDYPHPK 728
Cdd:pfam03154  318 PGQ-----SQQRIHTPPSQSQLQSQQPPREQPLP--------------PAPLSMPHI--KPPPTTPIPQLPNPQSHKHPP 376
                          250       260
                   ....*....|....*....|....*.
gi 1907118297  729 PKTTRSPAASPTELVPTPVFEPVTPL 754
Cdd:pfam03154  377 HLSGPSPFQMNSNLPPPPALKPLSSL 402
FN3 COG3401
Fibronectin type 3 domain [General function prediction only];
1375-1486 1.23e-04

Fibronectin type 3 domain [General function prediction only];


Pssm-ID: 442628 [Multi-domain]  Cd Length: 603  Bit Score: 46.92  E-value: 1.23e-04
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907118297 1375 PTEEATEGNATSPPqNPPTNLTVVTVEgcPSFVILDWEKPLNDTVTEYEvISRENGSfSGKNKSIQITNQTFSTVENLKP 1454
Cdd:COG3401    220 PSNEVSVTTPTTPP-SAPTGLTATADT--PGSVTLSWDPVTESDATGYR-VYRSNSG-DGPFTKVATVTTTSYTDTGLTN 294
                           90       100       110
                   ....*....|....*....|....*....|...
gi 1907118297 1455 DTSYEFQVKPKNPLG-EGPASNTVAFSTESADP 1486
Cdd:COG3401    295 GTTYYYRVTAVDAAGnESAPSNVVSVTTDLTPP 327
fn3 pfam00041
Fibronectin type III domain;
116-195 1.94e-04

Fibronectin type III domain;


Pssm-ID: 394996 [Multi-domain]  Cd Length: 85  Bit Score: 41.63  E-value: 1.94e-04
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907118297  116 KPLQLVVGTLTPSSVFLSWgflinphhdwTLPSHCPSD-RFYTIRYREKDKEKKWIFQLCPATET--IVENLKPNTVYEF 192
Cdd:pfam00041    2 APSNLTVTDVTSTSLTVSW----------TPPPDGNGPiTGYEVEYRPKNSGEPWNEITVPGTTTsvTLTGLKPGTEYEV 71

                   ...
gi 1907118297  193 GVK 195
Cdd:pfam00041   72 RVQ 74
FN3 smart00060
Fibronectin type 3 domain; One of three types of internal repeat within the plasma protein, ...
114-195 2.81e-04

Fibronectin type 3 domain; One of three types of internal repeat within the plasma protein, fibronectin. The tenth fibronectin type III repeat contains a RGD cell recognition sequence in a flexible loop between 2 strands. Type III modules are present in both extracellular and intracellular proteins.


Pssm-ID: 214495 [Multi-domain]  Cd Length: 83  Bit Score: 41.06  E-value: 2.81e-04
                            10        20        30        40        50        60        70        80
                    ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907118297   114 PRKPLQLVVGTLTPSSVFLSWgflinphhdwtLPSHCPSDRFYTIRYREKDKEKKWIFQLCPA----TETIVENLKPNTV 189
Cdd:smart00060    1 PSPPSNLRVTDVTSTSVTLSW-----------EPPPDDGITGYIVGYRVEYREEGSEWKEVNVtpssTSYTLTGLKPGTE 69

                    ....*.
gi 1907118297   190 YEFGVK 195
Cdd:smart00060   70 YEFRVR 75
FN3 cd00063
Fibronectin type 3 domain; One of three types of internal repeats found in the plasma protein ...
114-195 3.08e-04

Fibronectin type 3 domain; One of three types of internal repeats found in the plasma protein fibronectin. Its tenth fibronectin type III repeat contains an RGD cell recognition sequence in a flexible loop between 2 strands. Approximately 2% of all animal proteins contain the FN3 repeat; including extracellular and intracellular proteins, membrane spanning cytokine receptors, growth hormone receptors, tyrosine phosphatase receptors, and adhesion molecules. FN3-like domains are also found in bacterial glycosyl hydrolases.


Pssm-ID: 238020 [Multi-domain]  Cd Length: 93  Bit Score: 41.33  E-value: 3.08e-04
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907118297  114 PRKPLQLVVGTLTPSSVFLSWgfliNPHHDWTLPSHcpsdrFYTIRYREKDKE--KKWIFQLCPATETIVENLKPNTVYE 191
Cdd:cd00063      1 PSPPTNLRVTDVTSTSVTLSW----TPPEDDGGPIT-----GYVVEYREKGSGdwKEVEVTPGSETSYTLTGLKPGTEYE 71

                   ....
gi 1907118297  192 FGVK 195
Cdd:cd00063     72 FRVR 75
PspC_subgroup_2 NF033839
pneumococcal surface protein PspC, LPXTG-anchored form; The pneumococcal surface protein PspC, ...
491-691 6.33e-04

pneumococcal surface protein PspC, LPXTG-anchored form; The pneumococcal surface protein PspC, as described in Streptococcus pneumoniae, is a repetitive and highly variable protein, recognized by a conserved N-terminal domain and also by genomic location. This form, subgroup 2, is anchored covalently after cleavage by sortase at a C-terminal LPXTG site. The other form, subgroup 1, has variable numbers of a choline-binding repeat in the C-terminal region, and is also known as choline-binding protein A.


Pssm-ID: 468202 [Multi-domain]  Cd Length: 557  Bit Score: 44.37  E-value: 6.33e-04
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907118297  491 PEVRPTTAAPQ---QTTSIPSTPKRQSTPKPPRVKPAPEPETrPSAQTTKAPRKTKKPGHHRLRRPKTTRSPEVPKSKPA 567
Cdd:NF033839   286 EPGNKKPSAPKpgmQPSPQPEKKEVKPEPETPKPEVKPQLEK-PKPEVKPQPEKPKPEVKPQLETPKPEVKPQPEKPKPE 364
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907118297  568 LEPATVTPEilvPKIVPKPPQ-KPKATRRPEVPQVKPAHEPVTFGSEAPALAIVTTTDIEPVITRTKASVTTLAPKP-PR 645
Cdd:NF033839   365 VKPQPEKPK---PEVKPQPETpKPEVKPQPEKPKPEVKPQPEKPKPEVKPQPEKPKPEVKPQPEKPKPEVKPQPEKPkPE 441
                          170       180       190       200
                   ....*....|....*....|....*....|....*....|....*.
gi 1907118297  646 PRTHRQRTKYKTTQSPKIPHSKPASTTKKvrrprPKPQTTPHPEVP 691
Cdd:NF033839   442 VKPQPEKPKPEVKPQPETPKPEVKPQPEK-----PKPEVKPQPEKP 482
Not5 COG5665
CCR4-NOT transcriptional regulation complex, NOT5 subunit [Transcription];
428-758 2.37e-03

CCR4-NOT transcriptional regulation complex, NOT5 subunit [Transcription];


Pssm-ID: 444384 [Multi-domain]  Cd Length: 874  Bit Score: 42.73  E-value: 2.37e-03
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907118297  428 ATYDVISSSTTSDETEIEIHTATR-------DPILDSVPPKTSRTAEQPRATLAPIEALFESRNVEIFTSPEVR------ 494
Cdd:COG5665    208 STPQAFNASATSGRSQHIVQAAKRvgvewwgDPSLLATPPATPATEEKSSQQPKSQPTSPSGGTTPPSTNQLTTsntpts 287
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907118297  495 --------PTTAAPQQTTSIPSTPKRQSTPKPPRV--KPAPEPETRPSAQTTKAPRKTKKPGHhrlrRPKTTRSPEVPKS 564
Cdd:COG5665    288 takaqpqpPTKKQPAKEPPSDTASGNPSAPSVLINsdSPTSEDPATASVPTTEETTAFTTPSS----VPSTPAEKDTPAT 363
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907118297  565 KPALEPATVTPEILV-PKIVPKPPQKPKATrrpevpqvkpAHEPVTFGS-EAPALAIVTTTDIEPVITRTKASVTTLAPK 642
Cdd:COG5665    364 DLATPVSPTPPETSVdKKVSPDSATSSTKS----------EKEGGTASSpMPPNIAIGAKDDVDATDPSQEAKEYTKNAP 433
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907118297  643 PPRPRTHRQRTKYKTTQSPKIPHSK-PASTTKKVRRPRPKPQTTPHPEVPHTILVPATSLEPFIITEAPGTTLVPKLPQQ 721
Cdd:COG5665    434 MTPEADSAPESSVRTEASPSAGSDLePENTTLRDPAPNAIPPPEDPSTIGRLSSGDKLANETGPPVIRRDSTPSSTADQS 513
                          330       340       350       360
                   ....*....|....*....|....*....|....*....|..
gi 1907118297  722 --PDYPHPKPKTT---RSPAASPTELVPTPVFEPVTPLKEDP 758
Cdd:COG5665    514 ivGVLAFGLDQRTqaeISVEAASRSNPLLNSQVKSFPLGKRS 555
PHA03247 PHA03247
large tegument protein UL36; Provisional
307-575 3.00e-03

large tegument protein UL36; Provisional


Pssm-ID: 223021 [Multi-domain]  Cd Length: 3151  Bit Score: 42.62  E-value: 3.00e-03
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907118297  307 TLALPAESKTPEVEKLAGQPVTVTPESVSRSTKPTLSSALDTAETALVLSEKTSE-TARSVLIPEFELPLSTLAPkrfpe 385
Cdd:PHA03247  2756 RPARPPTTAGPPAPAPPAAPAAGPPRRLTRPAVASLSESRESLPSPWDPADPPAAvLAPAAALPPAASPAGPLPP----- 2830
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907118297  386 fPEAKTAFPLEKPRGSWASSEEP--WVVPGAKtsedsrvvqpqtatydvISSSTTSDETEIEIHTATRDPILDSVPPKTS 463
Cdd:PHA03247  2831 -PTSAQPTAPPPPPGPPPPSLPLggSVAPGGD-----------------VRRRPPSRSPAAKPAAPARPPVRRLARPAVS 2892
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907118297  464 RTAE---QPRATLAPiealfeSRNVEIFTSPEVRPTTAAPQQTTSIPSTPKRQSTPKPPRVKPAPEPEtrPSAQTTKAPR 540
Cdd:PHA03247  2893 RSTEsfaLPPDQPER------PPQPQAPPPPQPQPQPPPPPQPQPPPPPPPRPQPPLAPTTDPAGAGE--PSGAVPQPWL 2964
                          250       260       270
                   ....*....|....*....|....*....|....*
gi 1907118297  541 KTKKPGHHRLRRpktTRSPEVPKSKPALEPATVTP 575
Cdd:PHA03247  2965 GALVPGRVAVPR---FRVPQPAPSREAPASSTPPL 2996
 
Name Accession Description Interval E-value
PTZ00449 PTZ00449
104 kDa microneme/rhoptry antigen; Provisional
509-764 1.15e-12

104 kDa microneme/rhoptry antigen; Provisional


Pssm-ID: 185628 [Multi-domain]  Cd Length: 943  Bit Score: 73.19  E-value: 1.15e-12
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907118297  509 TPKRQSTPKPPRV-----KPAPEPETRPSA--QTTKAPRKTKKPGHHRlrRPKTTRSPEVPKS--KPALEPATVTPEILV 579
Cdd:PTZ00449   542 EPKEGGKPGETKEgevgkKPGPAKEHKPSKipTLSKKPEFPKDPKHPK--DPEEPKKPKRPRSaqRPTRPKSPKLPELLD 619
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907118297  580 PKIVPKPPQKPKATRRPEVPQvKPAHEPVTFGSEAPalaivtttdiepvitrtKASVTTLAPKPPRPRTHRQrtKYKTTQ 659
Cdd:PTZ00449   620 IPKSPKRPESPKSPKRPPPPQ-RPSSPERPEGPKII-----------------KSPKPPKSPKPPFDPKFKE--KFYDDY 679
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907118297  660 SPKIPHSKPASTTKKVRRPRPKPQTTPHPEVPHTilvPATSLEPfiiteapgttLVPKLPQQPDYPHPKPKTTRSPAASP 739
Cdd:PTZ00449   680 LDAAAKSKETKTTVVLDESFESILKETLPETPGT---PFTTPRP----------LPPKLPRDEEFPFEPIGDPDAEQPDD 746
                          250       260
                   ....*....|....*....|....*
gi 1907118297  740 TELVPTPVfEPVTPLKEDPVTTIVP 764
Cdd:PTZ00449   747 IEFFTPPE-EERTFFHETPADTPLP 770
PHA03247 PHA03247
large tegument protein UL36; Provisional
456-1085 4.27e-12

large tegument protein UL36; Provisional


Pssm-ID: 223021 [Multi-domain]  Cd Length: 3151  Bit Score: 71.89  E-value: 4.27e-12
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907118297  456 DSVPPKTSRTAEQPRATLAPIEALFESRNveifTSPEVRPTTAAPQQTTSIPSTPKRQSTPKPPRVKPAPEPETRPSaqt 535
Cdd:PHA03247  2558 AAPPAAPDRSVPPPRPAPRPSEPAVTSRA----RRPDAPPQSARPRAPVDDRGDPRGPAPPSPLPPDTHAPDPPPPS--- 2630
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907118297  536 tKAPRKTKKPGHHRLRRPKttrsPEVPKSKPAlePATVTPEILVPKIvPKPPQKPKATRRPEVPQVKPAHEPVTFGSEAP 615
Cdd:PHA03247  2631 -PSPAANEPDPHPPPTVPP----PERPRDDPA--PGRVSRPRRARRL-GRAAQASSPPQRPRRRAARPTVGSLTSLADPP 2702
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907118297  616 alaivtttDIEPVITRTKASVTTLAPKPPRPRTHRQRTKyKTTQSPKIPHSKPASTTKKVRRPRPKPQTTPHPEVPHTIL 695
Cdd:PHA03247  2703 --------PPPPTPEPAPHALVSATPLPPGPAAARQASP-ALPAAPAPPAVPAGPATPGGPARPARPPTTAGPPAPAPPA 2773
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907118297  696 VPATSLEPfIITEAPGTTLVPKLPQQPD----YPHPKPKTTRSPAASPTELVPTPVFEPVTPLKEDPVTTIVPitdLERV 771
Cdd:PHA03247  2774 APAAGPPR-RLTRPAVASLSESRESLPSpwdpADPPAAVLAPAAALPPAASPAGPLPPPTSAQPTAPPPPPGP---PPPS 2849
                          330       340       350       360       370       380       390       400
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907118297  772 TDLETPVA----FRTEAPGTTLASKISQRTHRPRPRPRPRPRPRPRPKATLSPQAPETKTVPAVVLEPVTLRPEVQVTTL 847
Cdd:PHA03247  2850 LPLGGSVApggdVRRRPPSRSPAAKPAAPARPPVRRLARPAVSRSTESFALPPDQPERPPQPQAPPPPQPQPQPPPPPQP 2929
                          410       420       430       440       450       460       470       480
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907118297  848 APQKTQKKHRPSPKPKPVPSPEVTESKPVLPrvrepvtlrtETWVTTKAPKTPKRTRRPRPkpqttptpetpltkpvaat 927
Cdd:PHA03247  2930 QPPPPPPPRPQPPLAPTTDPAGAGEPSGAVP----------QPWLGALVPGRVAVPRFRVP------------------- 2980
                          490       500       510       520       530       540       550       560
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907118297  928 dlePSALSTEVPAtvvlataltpvtlrtkaPKTTTlapnvqRTRRPHPRPKTTASTGVSESKSAPtelqslvlkpvtsPS 1007
Cdd:PHA03247  2981 ---QPAPSREAPA-----------------SSTPP------LTGHSLSRVSSWASSLALHEETDP-------------PP 3021
                          570       580       590       600       610       620       630
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*...
gi 1907118297 1008 LEIIQSQSVSDDLElvafstespqKTIAPRQTTSMPPKLKTPhsrmpAKEPVPKEPLHTTSKPKMPPSPEvADTTSVP 1085
Cdd:PHA03247  3022 VSLKQTLWPPDDTE----------DSDADSLFDSDSERSDLE-----ALDPLPPEPHDPFAHEPDPATPE-AGARESP 3083
PHA03247 PHA03247
large tegument protein UL36; Provisional
398-764 1.06e-11

large tegument protein UL36; Provisional


Pssm-ID: 223021 [Multi-domain]  Cd Length: 3151  Bit Score: 70.35  E-value: 1.06e-11
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907118297  398 PRGSWASSEEPWVVPGAKTSEDSRVVQPQTAtydvissSTTSDETEIEIHTATRDPILDSVPPKTSRtaeqPRATLAPIE 477
Cdd:PHA03247  2604 DRGDPRGPAPPSPLPPDTHAPDPPPPSPSPA-------ANEPDPHPPPTVPPPERPRDDPAPGRVSR----PRRARRLGR 2672
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907118297  478 ALFESRNVEIFTSPEVRPTTaAPQQTTSIPSTPKRQSTPKPPRVKPAPEPETRPSAQTTKAPRKTKKP------------ 545
Cdd:PHA03247  2673 AAQASSPPQRPRRRAARPTV-GSLTSLADPPPPPPTPEPAPHALVSATPLPPGPAAARQASPALPAAPappavpagpatp 2751
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907118297  546 -GHHRLRRPKTTRSPevPKSKPALEPATVTPEILvpkivPKPPQKPKATRRPEVPQvKPAHEPVTFGSEAPALAIVTTTD 624
Cdd:PHA03247  2752 gGPARPARPPTTAGP--PAPAPPAAPAAGPPRRL-----TRPAVASLSESRESLPS-PWDPADPPAAVLAPAAALPPAAS 2823
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907118297  625 IEPVITRTKASVTTLAPKPPRPRTHRQ------------RTKYKTTQSPKIPHSKPASTTKKVRRPRPKPQTTP------ 686
Cdd:PHA03247  2824 PAGPLPPPTSAQPTAPPPPPGPPPPSLplggsvapggdvRRRPPSRSPAAKPAAPARPPVRRLARPAVSRSTESfalppd 2903
                          330       340       350       360       370       380       390
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*...
gi 1907118297  687 HPEVPHTILVPATSLEPFIITEAPGTTLVPKLPQQPDYPhPKPKTTRSPAASPTELVPTPVFEPVTPLKEDPVTTIVP 764
Cdd:PHA03247  2904 QPERPPQPQAPPPPQPQPQPPPPPQPQPPPPPPPRPQPP-LAPTTDPAGAGEPSGAVPQPWLGALVPGRVAVPRFRVP 2980
FN3 cd00063
Fibronectin type 3 domain; One of three types of internal repeats found in the plasma protein ...
1390-1481 3.30e-10

Fibronectin type 3 domain; One of three types of internal repeats found in the plasma protein fibronectin. Its tenth fibronectin type III repeat contains an RGD cell recognition sequence in a flexible loop between 2 strands. Approximately 2% of all animal proteins contain the FN3 repeat; including extracellular and intracellular proteins, membrane spanning cytokine receptors, growth hormone receptors, tyrosine phosphatase receptors, and adhesion molecules. FN3-like domains are also found in bacterial glycosyl hydrolases.


Pssm-ID: 238020 [Multi-domain]  Cd Length: 93  Bit Score: 58.28  E-value: 3.30e-10
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907118297 1390 NPPTNLTVVTVEgcPSFVILDWEKPLNDT--VTEYEVISRENGSFSGKNKSIQITNQTFSTVENLKPDTSYEFQVKPKNP 1467
Cdd:cd00063      2 SPPTNLRVTDVT--STSVTLSWTPPEDDGgpITGYVVEYREKGSGDWKEVEVTPGSETSYTLTGLKPGTEYEFRVRAVNG 79
                           90
                   ....*....|....
gi 1907118297 1468 LGEGPASNTVAFST 1481
Cdd:cd00063     80 GGESPPSESVTVTT 93
PHA03247 PHA03247
large tegument protein UL36; Provisional
545-1195 6.93e-09

large tegument protein UL36; Provisional


Pssm-ID: 223021 [Multi-domain]  Cd Length: 3151  Bit Score: 61.11  E-value: 6.93e-09
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907118297  545 PGHHRLRRPKTTRSPEVPKSKPalEPATVTPEilvPKIVPKPPQKPKATRRPEVPQVKPAHEPV-------------TFG 611
Cdd:PHA03247  2475 PGAPVYRRPAEARFPFAAGAAP--DPGGGGPP---DPDAPPAPSRLAPAILPDEPVGEPVHPRMltwirgleelasdDAG 2549
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907118297  612 SEAPALAivttTDIEPVITRTKASVTTLAPKPPRP----RTHRQRTKYKTTqSPKIPHSKPASttkkvrrPRPKPQTTPH 687
Cdd:PHA03247  2550 DPPPPLP----PAAPPAAPDRSVPPPRPAPRPSEPavtsRARRPDAPPQSA-RPRAPVDDRGD-------PRGPAPPSPL 2617
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907118297  688 PEVPHTILVPATSLEPFIITEAPGTTLVPKLPQQPDYPHPKPKTTRSPAASPTELVPTPVFEPVTPLKEDPVTTIVPITD 767
Cdd:PHA03247  2618 PPDTHAPDPPPPSPSPAANEPDPHPPPTVPPPERPRDDPAPGRVSRPRRARRLGRAAQASSPPQRPRRRAARPTVGSLTS 2697
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907118297  768 LERvtdletpvafrTEAPGTTLASKISQRTHRPRPRPRPRPRPRPRPKATLSPQAPETKTVPAVVLEPVtlRPEVQVTTL 847
Cdd:PHA03247  2698 LAD-----------PPPPPPTPEPAPHALVSATPLPPGPAAARQASPALPAAPAPPAVPAGPATPGGPA--RPARPPTTA 2764
                          330       340       350       360       370       380       390       400
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907118297  848 APqktqkkhrpspkpkpvpspevteSKPVLPRVREPVTLRTETwVTTKAPKTPKRTRRPRPKPQTTPTPETPLTKPVAAT 927
Cdd:PHA03247  2765 GP-----------------------PAPAPPAAPAAGPPRRLT-RPAVASLSESRESLPSPWDPADPPAAVLAPAAALPP 2820
                          410       420       430       440       450       460       470       480
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907118297  928 DLEPSALSTEVPATVVLATALTPVTLRTKAPKTTTLAPNVQRTRRPHPRPKTTASTgvsesksAPTElqslvlkpvtsPS 1007
Cdd:PHA03247  2821 AASPAGPLPPPTSAQPTAPPPPPGPPPPSLPLGGSVAPGGDVRRRPPSRSPAAKPA-------APAR-----------PP 2882
                          490       500       510       520       530       540       550       560
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907118297 1008 LEIIQSQSVSDDLELVAFSTESPQKtiaPRQTTSMPPKLKTPhsrmpaKEPVPKEPLHTTSKPKMPPSPEVADTTSVPKD 1087
Cdd:PHA03247  2883 VRRLARPAVSRSTESFALPPDQPER---PPQPQAPPPPQPQP------QPPPPPQPQPPPPPPPRPQPPLAPTTDPAGAG 2953
                          570       580       590       600       610       620       630       640
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907118297 1088 ERLSLKPDPEVTHSETVLPPVT-FRVEPPKTTIAPLETRGIPLIPVISPRPSQEELQTAM-EETDQSTQELFTT-KIPRT 1164
Cdd:PHA03247  2954 EPSGAVPQPWLGALVPGRVAVPrFRVPQPAPSREAPASSTPPLTGHSLSRVSSWASSLALhEETDPPPVSLKQTlWPPDD 3033
                          650       660       670
                   ....*....|....*....|....*....|....*
gi 1907118297 1165 TELAKTTQA----PHRLHTAPVRPrIPGRPHGRPA 1195
Cdd:PHA03247  3034 TEDSDADSLfdsdSERSDLEALDP-LPPEPHDPFA 3067
PTZ00449 PTZ00449
104 kDa microneme/rhoptry antigen; Provisional
494-832 3.30e-08

104 kDa microneme/rhoptry antigen; Provisional


Pssm-ID: 185628 [Multi-domain]  Cd Length: 943  Bit Score: 58.55  E-value: 3.30e-08
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907118297  494 RPTTAAPQQTTSIPSTPKRQSTPKPPRVKPAPEpetrpSAQTTKAPRKTKKPGhhRLRRPKTTRSPEVPKSKPALEPATV 573
Cdd:PTZ00449   560 KPGPAKEHKPSKIPTLSKKPEFPKDPKHPKDPE-----EPKKPKRPRSAQRPT--RPKSPKLPELLDIPKSPKRPESPKS 632
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907118297  574 tpeilvPKiVPKPPQKPKATRRPEVPQV----KPAHEP-VTFG-----------SEAPALAIVTTTDIEPVITRTKASVT 637
Cdd:PTZ00449   633 ------PK-RPPPPQRPSSPERPEGPKIikspKPPKSPkPPFDpkfkekfyddyLDAAAKSKETKTTVVLDESFESILKE 705
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907118297  638 TLAPKPPRPRTHRQRTKYKTTQSPKIPHSKPASTTKkvrrPRPKPQTTPHPEVPHTILV---PATSLEPFIITEAPGTTL 714
Cdd:PTZ00449   706 TLPETPGTPFTTPRPLPPKLPRDEEFPFEPIGDPDA----EQPDDIEFFTPPEEERTFFhetPADTPLPDILAEEFKEED 781
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907118297  715 VPKLPQQPDYPHPKPKttrspaaSPTELVPTPVFE-PVTPLK----------------------EDPVTTIVPITDLERV 771
Cdd:PTZ00449   782 IHAETGEPDEAMKRPD-------SPSEHEDKPPGDhPSLPKKrhrldglalsttdlesdagriaKDASGKIVKLKRSKSF 854
                          330       340       350       360       370       380       390
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*..
gi 1907118297  772 TDLET--------PVAFR-------TEAPGT-TLASKISQRTHRPRPRPRPRPRPRPRPKATLSPQAPETKTVPAVV 832
Cdd:PTZ00449   855 DDLTTveeaeemgAEARKivvdddgTEADDEdTHPPEEKHKSEVRRRRPPKKPSKPKKPSKPKKPKKPDSAFIPSII 931
FN3 smart00060
Fibronectin type 3 domain; One of three types of internal repeat within the plasma protein, ...
1391-1471 9.88e-08

Fibronectin type 3 domain; One of three types of internal repeat within the plasma protein, fibronectin. The tenth fibronectin type III repeat contains a RGD cell recognition sequence in a flexible loop between 2 strands. Type III modules are present in both extracellular and intracellular proteins.


Pssm-ID: 214495 [Multi-domain]  Cd Length: 83  Bit Score: 51.08  E-value: 9.88e-08
                            10        20        30        40        50        60        70        80
                    ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907118297  1391 PPTNLTVVTVEgcPSFVILDWEKPLNDTVTEYEV-ISRENGSFSGKNKSIQITNQTFS-TVENLKPDTSYEFQVKPKNPL 1468
Cdd:smart00060    3 PPSNLRVTDVT--STSVTLSWEPPPDDGITGYIVgYRVEYREEGSEWKEVNVTPSSTSyTLTGLKPGTEYEFRVRAVNGA 80

                    ...
gi 1907118297  1469 GEG 1471
Cdd:smart00060   81 GEG 83
PHA03247 PHA03247
large tegument protein UL36; Provisional
507-1109 8.00e-07

large tegument protein UL36; Provisional


Pssm-ID: 223021 [Multi-domain]  Cd Length: 3151  Bit Score: 54.17  E-value: 8.00e-07
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907118297  507 PSTPKRQSTPKPPRVKPAPEPETRPS--AQTTKAPRKTKKPghhrlrRPKTTRSPEVPKSKPALEPAtvtpeilvpkivP 584
Cdd:PHA03247  2553 PPLPPAAPPAAPDRSVPPPRPAPRPSepAVTSRARRPDAPP------QSARPRAPVDDRGDPRGPAP------------P 2614
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907118297  585 KPPqkPKATRRPEVPQVKPAHEPVTFGSEAPALAIVTTTDIEpvitrtkasvttlAPKPPRPRTHRQRTKykttqspkip 664
Cdd:PHA03247  2615 SPL--PPDTHAPDPPPPSPSPAANEPDPHPPPTVPPPERPRD-------------DPAPGRVSRPRRARR---------- 2669
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907118297  665 HSKPASTTKKVRRPRPkpqttphPEVPHTILVPATSLEPfiiteapgttlvpklPQQPDYPHPKPKTTRSpaASPTELVP 744
Cdd:PHA03247  2670 LGRAAQASSPPQRPRR-------RAARPTVGSLTSLADP---------------PPPPPTPEPAPHALVS--ATPLPPGP 2725
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907118297  745 TPVFEPVTPLKEDPVTTIVPITDLERVTDLETPVAFRTEAPGTTLASKISQRTHRPRPRPRPRPRPRPRPKATLSPQAPE 824
Cdd:PHA03247  2726 AAARQASPALPAAPAPPAVPAGPATPGGPARPARPPTTAGPPAPAPPAAPAAGPPRRLTRPAVASLSESRESLPSPWDPA 2805
                          330       340       350       360       370       380       390       400
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907118297  825 TKTVPAVVLEPVTLRPEVQVTTLAPQKTQKKHRPSPKPKPVPSPEVTESKpVLPrvREPVTLRTETWVTTKAPKTPKRTR 904
Cdd:PHA03247  2806 DPPAAVLAPAAALPPAASPAGPLPPPTSAQPTAPPPPPGPPPPSLPLGGS-VAP--GGDVRRRPPSRSPAAKPAAPARPP 2882
                          410       420       430       440       450       460       470       480
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907118297  905 RPRPKPQTTPTPEtpltkpvaatdlEPSALSTEVPATVVLATALTPVTLRTKAPKTTTLAPNVQRTRRPHPRPKTTAST- 983
Cdd:PHA03247  2883 VRRLARPAVSRST------------ESFALPPDQPERPPQPQAPPPPQPQPQPPPPPQPQPPPPPPPRPQPPLAPTTDPa 2950
                          490       500       510       520       530       540       550       560
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907118297  984 GVSESKSAPTELQSLVLKPVTSPSLEIIQSQSvSDDLELVAFSTESPQKTIAPR------------QTTSMPPKLK---- 1047
Cdd:PHA03247  2951 GAGEPSGAVPQPWLGALVPGRVAVPRFRVPQP-APSREAPASSTPPLTGHSLSRvsswasslalheETDPPPVSLKqtlw 3029
                          570       580       590       600       610       620
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*.
gi 1907118297 1048 ----TPHSRMPAKEPVPKEPLHTTSKPKMPPSPEvaDTTSVPKDERLSLKPDPEVTHSETVLPPVT 1109
Cdd:PHA03247  3030 ppddTEDSDADSLFDSDSERSDLEALDPLPPEPH--DPFAHEPDPATPEAGARESPSSQFGPPPLS 3093
fn3 pfam00041
Fibronectin type III domain;
1391-1474 1.26e-06

Fibronectin type III domain;


Pssm-ID: 394996 [Multi-domain]  Cd Length: 85  Bit Score: 47.79  E-value: 1.26e-06
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907118297 1391 PPTNLTVVTVEgcPSFVILDWEKP--LNDTVTEYEVISRENGSFSGKNkSIQITNQTFS-TVENLKPDTSYEFQVKPKNP 1467
Cdd:pfam00041    2 APSNLTVTDVT--STSLTVSWTPPpdGNGPITGYEVEYRPKNSGEPWN-EITVPGTTTSvTLTGLKPGTEYEVRVQAVNG 78

                   ....*..
gi 1907118297 1468 LGEGPAS 1474
Cdd:pfam00041   79 GGEGPPS 85
PHA03247 PHA03247
large tegument protein UL36; Provisional
945-1336 2.90e-06

large tegument protein UL36; Provisional


Pssm-ID: 223021 [Multi-domain]  Cd Length: 3151  Bit Score: 52.63  E-value: 2.90e-06
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907118297  945 ATALTPVTLRTKAPKTTTLAPNVQRTRRPHPRPKTTASTGvSESKSAPTELQSLVLKPVTSPSLEIIQSQSVSDDLELVA 1024
Cdd:PHA03247  2596 ARPRAPVDDRGDPRGPAPPSPLPPDTHAPDPPPPSPSPAA-NEPDPHPPPTVPPPERPRDDPAPGRVSRPRRARRLGRAA 2674
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907118297 1025 FSTESPQK--------TIAPRQTTSMPP-KLKTPHSRMPAKEP-VPKEPLHTTSKPKMPPSPEVADTTSVPKDERLSLKP 1094
Cdd:PHA03247  2675 QASSPPQRprrraarpTVGSLTSLADPPpPPPTPEPAPHALVSaTPLPPGPAAARQASPALPAAPAPPAVPAGPATPGGP 2754
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907118297 1095 DPEVTHSETVLPPVTFRVEPPKTTIAPLETR--GIPLIPVISPRPSQEELQTAMEETDQSTQELFTTKIPRTTELAKTTQ 1172
Cdd:PHA03247  2755 ARPARPPTTAGPPAPAPPAAPAAGPPRRLTRpaVASLSESRESLPSPWDPADPPAAVLAPAAALPPAASPAGPLPPPTSA 2834
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907118297 1173 APhrlhTAPVRPRIPGRPH---------GRPALNKTTTRPDKTKPRGTSHKNGVGTGTKQAPKPPSPGRNASVDSHATRK 1243
Cdd:PHA03247  2835 QP----TAPPPPPGPPPPSlplggsvapGGDVRRRPPSRSPAAKPAAPARPPVRRLARPAVSRSTESFALPPDQPERPPQ 2910
                          330       340       350       360       370       380       390       400
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907118297 1244 PGSVSGTRRPPIPHRHSSTRPvSPERRPLPPNNVTGKPGRAGIVSSSRVTSPPLKATLHPIGTATARPGAEQKEPTAPAS 1323
Cdd:PHA03247  2911 PQAPPPPQPQPQPPPPPQPQP-PPPPPPRPQPPLAPTTDPAGAGEPSGAVPQPWLGALVPGRVAVPRFRVPQPAPSREAP 2989
                          410
                   ....*....|...
gi 1907118297 1324 EEEFGTTTDFSSS 1336
Cdd:PHA03247  2990 ASSTPPLTGHSLS 3002
PHA03247 PHA03247
large tegument protein UL36; Provisional
969-1412 5.49e-06

large tegument protein UL36; Provisional


Pssm-ID: 223021 [Multi-domain]  Cd Length: 3151  Bit Score: 51.48  E-value: 5.49e-06
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907118297  969 RTRRPHPRPK-TTASTGVSESKSAPTELQSLVLKPVTS-----PSLEIIQSQSVSDDLELVAFSTESPQKTIAPRQTTsm 1042
Cdd:PHA03247  2585 RARRPDAPPQsARPRAPVDDRGDPRGPAPPSPLPPDTHapdppPPSPSPAANEPDPHPPPTVPPPERPRDDPAPGRVS-- 2662
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907118297 1043 PPKLKTPHSRMPAKEPVPKEPLHTTSKPKMPPSPEVADttsvPKDERLSLKPDPEVTHSETVLPPVTF--RVEPPKTTIA 1120
Cdd:PHA03247  2663 RPRRARRLGRAAQASSPPQRPRRRAARPTVGSLTSLAD----PPPPPPTPEPAPHALVSATPLPPGPAaaRQASPALPAA 2738
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907118297 1121 PLeTRGIPLIPVI--SPRPSQEELQTAMEETDQSTQELFTTKIPRTTELAKTTQAPHRlhTAPVRPRIPGrPHGRPALNK 1198
Cdd:PHA03247  2739 PA-PPAVPAGPATpgGPARPARPPTTAGPPAPAPPAAPAAGPPRRLTRPAVASLSESR--ESLPSPWDPA-DPPAAVLAP 2814
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907118297 1199 TTTRPDKTKPRGTSHKNGvgTGTKQAPKPPSPGRNASVDSHATRKPGSvSGTRRPPiphrhSSTRPVSPERRPLPPNNVT 1278
Cdd:PHA03247  2815 AAALPPAASPAGPLPPPT--SAQPTAPPPPPGPPPPSLPLGGSVAPGG-DVRRRPP-----SRSPAAKPAAPARPPVRRL 2886
                          330       340       350       360       370       380       390       400
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907118297 1279 GKPGRAGIVSSSRVTSPPLKATLHPIGTATARPGAEQKEPTAPASEEEFGTTTDFSSSPTKETDPLGKPRFIGPHvryiP 1358
Cdd:PHA03247  2887 ARPAVSRSTESFALPPDQPERPPQPQAPPPPQPQPQPPPPPQPQPPPPPPPRPQPPLAPTTDPAGAGEPSGAVPQ----P 2962
                          410       420       430       440       450
                   ....*....|....*....|....*....|....*....|....*....|....
gi 1907118297 1359 KPENKPCSITDSVRRFPTEEATEGNATSPPQNPPTNLTVVTVEGCPSFVILDWE 1412
Cdd:PHA03247  2963 WLGALVPGRVAVPRFRVPQPAPSREAPASSTPPLTGHSLSRVSSWASSLALHEE 3016
PHA03377 PHA03377
EBNA-3C; Provisional
517-749 1.03e-05

EBNA-3C; Provisional


Pssm-ID: 177614 [Multi-domain]  Cd Length: 1000  Bit Score: 50.44  E-value: 1.03e-05
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907118297  517 KPPRVKPAPEPETRPSAQT---TKAPRKTKKPGHHRLRRPKTTRSPEVPkskpaLEPATVTPEILVPKIVPKPPQKPKAT 593
Cdd:PHA03377   414 RKPRTLPWPTPKTHPVKRTlvkTSGRSDEAEQAQSTPERPGPSDQPSVP-----VEPAHLTPVEHTTVILHQPPQSPPTV 488
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907118297  594 rrpevpQVKPAHEPVTFGSEApalAIVTTTDIEPVITRTKASVTTLAPKPPRPRTHRQRTKYKTTQSPKIPHSKPASTTK 673
Cdd:PHA03377   489 ------AIKPAPPPSRRRRGA---CVVYDDDIIEVIDVETTEEEESVTQPAKPHRKVQDGFQRSGRRQKRATPPKVSPSD 559
                          170       180       190       200       210       220       230
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*.
gi 1907118297  674 KvRRPRPKPQTTPHPEVPHTILVPATSLEPFIITEAPGTTLVPKLPQQPdyPHPKPKTTRSPAASPTELVPTPVFE 749
Cdd:PHA03377   560 R-GPPKASPPVMAPPSTGPRVMATPSTGPRDMAPPSTGPRQQAKCKDGP--PASGPHEKQPPSSAPRDMAPSVVRM 632
PHA03247 PHA03247
large tegument protein UL36; Provisional
1173-1393 3.79e-05

large tegument protein UL36; Provisional


Pssm-ID: 223021 [Multi-domain]  Cd Length: 3151  Bit Score: 48.78  E-value: 3.79e-05
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907118297 1173 APHRLHTAPVRPRIPGRPHGRP---ALNKTTTRPDKTKPRGTSHKNGVGTGTKQAPKPPSPgrnASVDSHATRKPGSVSG 1249
Cdd:PHA03247  2556 PPAAPPAAPDRSVPPPRPAPRPsepAVTSRARRPDAPPQSARPRAPVDDRGDPRGPAPPSP---LPPDTHAPDPPPPSPS 2632
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907118297 1250 TRRPPIPHRHSSTRPVSPERRPLPPNNVTGKPGRAGIVSSSRVTSPPLKATLHPIGTATARPGAEQKEPTAPASEEEFGT 1329
Cdd:PHA03247  2633 PAANEPDPHPPPTVPPPERPRDDPAPGRVSRPRRARRLGRAAQASSPPQRPRRRAARPTVGSLTSLADPPPPPPTPEPAP 2712
                          170       180       190       200       210       220
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*
gi 1907118297 1330 TTDFSSSPTKETDPLGKPRFIGPHVRYIPKPENKPCSITDSVRRFPTEEATEG-NATSPPQNPPT 1393
Cdd:PHA03247  2713 HALVSATPLPPGPAAARQASPALPAAPAPPAVPAGPATPGGPARPARPPTTAGpPAPAPPAAPAA 2777
PHA03247 PHA03247
large tegument protein UL36; Provisional
1042-1392 9.17e-05

large tegument protein UL36; Provisional


Pssm-ID: 223021 [Multi-domain]  Cd Length: 3151  Bit Score: 47.63  E-value: 9.17e-05
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907118297 1042 MPPKLK--TPHSRMPAKEPVPKEP-LHTTSKPKMPPSPEVADTTSVPKDERlslkPDPEVTHSETVLPPVTFRVEPPKTT 1118
Cdd:PHA03247  2555 LPPAAPpaAPDRSVPPPRPAPRPSePAVTSRARRPDAPPQSARPRAPVDDR----GDPRGPAPPSPLPPDTHAPDPPPPS 2630
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907118297 1119 IAPLETRGIPLIPVISPRPSQEELQTAMEETDQSTQelfTTKIPRTTELAKTTQAPHRLHTAP--------VRPRIPGR- 1189
Cdd:PHA03247  2631 PSPAANEPDPHPPPTVPPPERPRDDPAPGRVSRPRR---ARRLGRAAQASSPPQRPRRRAARPtvgsltslADPPPPPPt 2707
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907118297 1190 PHGRP---------------ALNKTTTRPDKTKPRGTSHKNGVGTGTKQAPKPPSPGRNASVDSHATRKPGSVSGTRRPP 1254
Cdd:PHA03247  2708 PEPAPhalvsatplppgpaaARQASPALPAAPAPPAVPAGPATPGGPARPARPPTTAGPPAPAPPAAPAAGPPRRLTRPA 2787
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907118297 1255 IPHRHSSTRPVSPERRPLPPNNVTgkPGRAGIVSSSRVTSPPLKATLHPIGTATARPGAEQKEPTAPASEEEFGttTDFS 1334
Cdd:PHA03247  2788 VASLSESRESLPSPWDPADPPAAV--LAPAAALPPAASPAGPLPPPTSAQPTAPPPPPGPPPPSLPLGGSVAPG--GDVR 2863
                          330       340       350       360       370       380
                   ....*....|....*....|....*....|....*....|....*....|....*....|...
gi 1907118297 1335 SSPTKETDPLGKPRFIGPHVRYIPKPENKPCSIT-----DSVRRFPTEEATEGNATSPPQNPP 1392
Cdd:PHA03247  2864 RRPPSRSPAAKPAAPARPPVRRLARPAVSRSTESfalppDQPERPPQPQAPPPPQPQPQPPPP 2926
Atrophin-1 pfam03154
Atrophin-1 family; Atrophin-1 is the protein product of the dentatorubral-pallidoluysian ...
491-754 1.02e-04

Atrophin-1 family; Atrophin-1 is the protein product of the dentatorubral-pallidoluysian atrophy (DRPLA) gene. DRPLA OMIM:125370 is a progressive neurodegenerative disorder. It is caused by the expansion of a CAG repeat in the DRPLA gene on chromosome 12p. This results in an extended polyglutamine region in atrophin-1, that is thought to confer toxicity to the protein, possibly through altering its interactions with other proteins. The expansion of a CAG repeat is also the underlying defect in six other neurodegenerative disorders, including Huntington's disease. One interaction of expanded polyglutamine repeats that is thought to be pathogenic is that with the short glutamine repeat in the transcriptional coactivator CREB binding protein, CBP. This interaction draws CBP away from its usual nuclear location to the expanded polyglutamine repeat protein aggregates that are characteriztic of the polyglutamine neurodegenerative disorders. This interferes with CBP-mediated transcription and causes cytotoxicity.


Pssm-ID: 460830 [Multi-domain]  Cd Length: 991  Bit Score: 47.07  E-value: 1.02e-04
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907118297  491 PEVRPTTAAPQQTTSIPSTPKRQSTPKPPRVKPAPEPETRPSAQTTKAPRKTKKPGHHRLRRPKTTRSPEVPKSKPALEP 570
Cdd:pfam03154  172 PVLQAQSGAASPPSPPPPGTTQAATAGPTPSAPSVPPQGSPATSQPPNQTQSTAAPHTLIQQTPTLHPQRLPSPHPPLQP 251
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907118297  571 ATVT--PEILVPKIVPKPPQKPKATRRPEVPQVKPAHEPVTFGSEAPALaivtttdiepvitrTKASVTTLAPKPPRPRT 648
Cdd:pfam03154  252 MTQPppPSQVSPQPLPQPSLHGQMPPMPHSLQTGPSHMQHPVPPQPFPL--------------TPQSSQSQVPPGPSPAA 317
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907118297  649 HRQrtkykTTQSPKIPHSKPASTTKKVRRPRPKPqttphpevphtilvPATSLEPFIitEAPGTTLVPKLPQQPDYPHPK 728
Cdd:pfam03154  318 PGQ-----SQQRIHTPPSQSQLQSQQPPREQPLP--------------PAPLSMPHI--KPPPTTPIPQLPNPQSHKHPP 376
                          250       260
                   ....*....|....*....|....*.
gi 1907118297  729 PKTTRSPAASPTELVPTPVFEPVTPL 754
Cdd:pfam03154  377 HLSGPSPFQMNSNLPPPPALKPLSSL 402
FN3 COG3401
Fibronectin type 3 domain [General function prediction only];
1375-1486 1.23e-04

Fibronectin type 3 domain [General function prediction only];


Pssm-ID: 442628 [Multi-domain]  Cd Length: 603  Bit Score: 46.92  E-value: 1.23e-04
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907118297 1375 PTEEATEGNATSPPqNPPTNLTVVTVEgcPSFVILDWEKPLNDTVTEYEvISRENGSfSGKNKSIQITNQTFSTVENLKP 1454
Cdd:COG3401    220 PSNEVSVTTPTTPP-SAPTGLTATADT--PGSVTLSWDPVTESDATGYR-VYRSNSG-DGPFTKVATVTTTSYTDTGLTN 294
                           90       100       110
                   ....*....|....*....|....*....|...
gi 1907118297 1455 DTSYEFQVKPKNPLG-EGPASNTVAFSTESADP 1486
Cdd:COG3401    295 GTTYYYRVTAVDAAGnESAPSNVVSVTTDLTPP 327
FN3 COG3401
Fibronectin type 3 domain [General function prediction only];
1375-1529 1.70e-04

Fibronectin type 3 domain [General function prediction only];


Pssm-ID: 442628 [Multi-domain]  Cd Length: 603  Bit Score: 46.15  E-value: 1.70e-04
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907118297 1375 PTEEATEGNATSPPQnPPTNLTVVTVEgcPSFVILDWEKPLNDTVTEYEV--ISRENGSFSGKNKSIqitNQTFSTVENL 1452
Cdd:COG3401    314 PSNVVSVTTDLTPPA-APSGLTATAVG--SSSITLSWTASSDADVTGYNVyrSTSGGGTYTKIAETV---TTTSYTDTGL 387
                           90       100       110       120       130       140       150
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*...
gi 1907118297 1453 KPDTSYEFQVKPKNPLG-EGPASNTVAFSTESADPRVSEPISAGRDAIWTERPFNSDSYSECKGKQYVKRTWYKKFVG 1529
Cdd:COG3401    388 TPGTTYYYKVTAVDAAGnESAPSEEVSATTASAASGESLTASVDAVPLTDVAGATAAASAASNPGVSAAVLADGGDTG 465
fn3 pfam00041
Fibronectin type III domain;
116-195 1.94e-04

Fibronectin type III domain;


Pssm-ID: 394996 [Multi-domain]  Cd Length: 85  Bit Score: 41.63  E-value: 1.94e-04
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907118297  116 KPLQLVVGTLTPSSVFLSWgflinphhdwTLPSHCPSD-RFYTIRYREKDKEKKWIFQLCPATET--IVENLKPNTVYEF 192
Cdd:pfam00041    2 APSNLTVTDVTSTSLTVSW----------TPPPDGNGPiTGYEVEYRPKNSGEPWNEITVPGTTTsvTLTGLKPGTEYEV 71

                   ...
gi 1907118297  193 GVK 195
Cdd:pfam00041   72 RVQ 74
FN3 smart00060
Fibronectin type 3 domain; One of three types of internal repeat within the plasma protein, ...
114-195 2.81e-04

Fibronectin type 3 domain; One of three types of internal repeat within the plasma protein, fibronectin. The tenth fibronectin type III repeat contains a RGD cell recognition sequence in a flexible loop between 2 strands. Type III modules are present in both extracellular and intracellular proteins.


Pssm-ID: 214495 [Multi-domain]  Cd Length: 83  Bit Score: 41.06  E-value: 2.81e-04
                            10        20        30        40        50        60        70        80
                    ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907118297   114 PRKPLQLVVGTLTPSSVFLSWgflinphhdwtLPSHCPSDRFYTIRYREKDKEKKWIFQLCPA----TETIVENLKPNTV 189
Cdd:smart00060    1 PSPPSNLRVTDVTSTSVTLSW-----------EPPPDDGITGYIVGYRVEYREEGSEWKEVNVtpssTSYTLTGLKPGTE 69

                    ....*.
gi 1907118297   190 YEFGVK 195
Cdd:smart00060   70 YEFRVR 75
FN3 cd00063
Fibronectin type 3 domain; One of three types of internal repeats found in the plasma protein ...
114-195 3.08e-04

Fibronectin type 3 domain; One of three types of internal repeats found in the plasma protein fibronectin. Its tenth fibronectin type III repeat contains an RGD cell recognition sequence in a flexible loop between 2 strands. Approximately 2% of all animal proteins contain the FN3 repeat; including extracellular and intracellular proteins, membrane spanning cytokine receptors, growth hormone receptors, tyrosine phosphatase receptors, and adhesion molecules. FN3-like domains are also found in bacterial glycosyl hydrolases.


Pssm-ID: 238020 [Multi-domain]  Cd Length: 93  Bit Score: 41.33  E-value: 3.08e-04
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907118297  114 PRKPLQLVVGTLTPSSVFLSWgfliNPHHDWTLPSHcpsdrFYTIRYREKDKE--KKWIFQLCPATETIVENLKPNTVYE 191
Cdd:cd00063      1 PSPPTNLRVTDVTSTSVTLSW----TPPEDDGGPIT-----GYVVEYREKGSGdwKEVEVTPGSETSYTLTGLKPGTEYE 71

                   ....
gi 1907118297  192 FGVK 195
Cdd:cd00063     72 FRVR 75
PRK14950 PRK14950
DNA polymerase III subunits gamma and tau; Provisional
515-626 5.05e-04

DNA polymerase III subunits gamma and tau; Provisional


Pssm-ID: 237864 [Multi-domain]  Cd Length: 585  Bit Score: 44.80  E-value: 5.05e-04
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907118297  515 TPKPPRVKPAPEPETRPSAQTTKAPRKTKKPGHHRLRRPKttrsPEVPKSKPalePATVTPEILVPKIVPKPPQKPKATR 594
Cdd:PRK14950   361 VPVPAPQPAKPTAAAPSPVRPTPAPSTRPKAAAAANIPPK----EPVRETAT---PPPVPPRPVAPPVPHTPESAPKLTR 433
                           90       100       110
                   ....*....|....*....|....*....|..
gi 1907118297  595 RPEVPQVKPAHEPVTFGSEAPALAIVTTTDIE 626
Cdd:PRK14950   434 AAIPVDEKPKYTPPAPPKEEEKALIADGDVLE 465
PspC_subgroup_2 NF033839
pneumococcal surface protein PspC, LPXTG-anchored form; The pneumococcal surface protein PspC, ...
491-691 6.33e-04

pneumococcal surface protein PspC, LPXTG-anchored form; The pneumococcal surface protein PspC, as described in Streptococcus pneumoniae, is a repetitive and highly variable protein, recognized by a conserved N-terminal domain and also by genomic location. This form, subgroup 2, is anchored covalently after cleavage by sortase at a C-terminal LPXTG site. The other form, subgroup 1, has variable numbers of a choline-binding repeat in the C-terminal region, and is also known as choline-binding protein A.


Pssm-ID: 468202 [Multi-domain]  Cd Length: 557  Bit Score: 44.37  E-value: 6.33e-04
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907118297  491 PEVRPTTAAPQ---QTTSIPSTPKRQSTPKPPRVKPAPEPETrPSAQTTKAPRKTKKPGHHRLRRPKTTRSPEVPKSKPA 567
Cdd:NF033839   286 EPGNKKPSAPKpgmQPSPQPEKKEVKPEPETPKPEVKPQLEK-PKPEVKPQPEKPKPEVKPQLETPKPEVKPQPEKPKPE 364
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907118297  568 LEPATVTPEilvPKIVPKPPQ-KPKATRRPEVPQVKPAHEPVTFGSEAPALAIVTTTDIEPVITRTKASVTTLAPKP-PR 645
Cdd:NF033839   365 VKPQPEKPK---PEVKPQPETpKPEVKPQPEKPKPEVKPQPEKPKPEVKPQPEKPKPEVKPQPEKPKPEVKPQPEKPkPE 441
                          170       180       190       200
                   ....*....|....*....|....*....|....*....|....*.
gi 1907118297  646 PRTHRQRTKYKTTQSPKIPHSKPASTTKKvrrprPKPQTTPHPEVP 691
Cdd:NF033839   442 VKPQPEKPKPEVKPQPETPKPEVKPQPEK-----PKPEVKPQPEKP 482
PRK07003 PRK07003
DNA polymerase III subunit gamma/tau;
458-759 6.75e-04

DNA polymerase III subunit gamma/tau;


Pssm-ID: 235906 [Multi-domain]  Cd Length: 830  Bit Score: 44.46  E-value: 6.75e-04
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907118297  458 VPPKTSRTAEQPRATLAP---IEALFESRNVEIFTSPEVRPTTAAPQQTTsIPSTPKRQSTPKP-------PRVKPAPEP 527
Cdd:PRK07003   372 VPARVAGAVPAPGARAAAavgASAVPAVTAVTGAAGAALAPKAAAAAAAT-RAEAPPAAPAPPAtadrgddAADGDAPVP 450
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907118297  528 ---ETRPSAQTTKAPRKTKKPGHHRLRRPKTTRSPEVPKSKPALEPATVTPEILVPKIVPKPPQ--------KPKATRRP 596
Cdd:PRK07003   451 akaNARASADSRCDERDAQPPADSGSASAPASDAPPDAAFEPAPRAAAPSAATPAAVPDARAPAaasredapAAAAPPAP 530
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907118297  597 EVPQVKPA--HEPVTFGSEAPALAIVTTTDIEPVITRTK--------ASVTTLAPKPPRPRTHRQrtkyktTQSPKIPHS 666
Cdd:PRK07003   531 EARPPTPAaaAPAARAGGAAAALDVLRNAGMRVSSDRGAraaaaakpAAAPAAAPKPAAPRVAVQ------VPTPRARAA 604
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907118297  667 KPASTTKKVRRPRPKPQTT----PHPEVPHTILVPATSLEPFIiteAPGTTLVPKLPQQPDYPHPKPKTTRSPAAsPTEL 742
Cdd:PRK07003   605 TGDAPPNGAARAEQAAESRgappPWEDIPPDDYVPLSADEGFG---GPDDGFVPVFDSGPDDVRVAPKPADAPAP-PVDT 680
                          330
                   ....*....|....*..
gi 1907118297  743 VPTPvfePVTPLkeDPV 759
Cdd:PRK07003   681 RPLP---PAIPL--DAI 692
PHA03377 PHA03377
EBNA-3C; Provisional
459-756 1.36e-03

EBNA-3C; Provisional


Pssm-ID: 177614 [Multi-domain]  Cd Length: 1000  Bit Score: 43.50  E-value: 1.36e-03
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907118297  459 PPKTSRTAEQPRATLAPIEA------LFESRNVEIFTSP------EVRPTTAAPQQTTSIPSTPKRQSTPKPPRVKPAPE 526
Cdd:PHA03377   607 PPASGPHEKQPPSSAPRDMApsvvrmFLRERLLEQSTGPkpksfwEMRAGRDGSGIQQEPSSRRQPATQSTPPRPSWLPS 686
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907118297  527 PETRPSAQTTKAprktkKPGHHRLRRPKTTRSPEVPKSKPALE-PATVTPEILVPKIVPKPPQKPKATRRPEvPQVKPAH 605
Cdd:PHA03377   687 VFVLPSVDAGRA-----QPSEESHLSSMSPTQPISHEEQPRYEdPDDPLDLSLHPDQAPPPSHQAPYSGHEE-PQAQQAP 760
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907118297  606 EPVTFGSEAPALAIVTTTdiEPVITRTKASVTTLAPKPPRPRTHRQRTKYKTTQSPKIPHSKPASTTKKVRRPRPKPQTT 685
Cdd:PHA03377   761 YPGYWEPRPPQAPYLGYQ--EPQAQGVQVSSYPGYAGPWGLRAQHPRYRHSWAYWSQYPGHGHPQGPWAPRPPHLPPQWD 838
                          250       260       270       280       290       300       310
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|..
gi 1907118297  686 PHPEVPHTILVPATSLEPFIITEAPGTTLVPKLP-QQPDYPHPKPKTTRSPAASPTELVPTPVFEPVTPLKE 756
Cdd:PHA03377   839 GSAGHGQDQVSQFPHLQSETGPPRLQLSQVPQLPySQTLVSSSAPSWSSPQPRAPIRPIPTRFPPPPMPLQD 910
PTZ00449 PTZ00449
104 kDa microneme/rhoptry antigen; Provisional
1030-1281 1.54e-03

104 kDa microneme/rhoptry antigen; Provisional


Pssm-ID: 185628 [Multi-domain]  Cd Length: 943  Bit Score: 43.52  E-value: 1.54e-03
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907118297 1030 PQKTIAPRQTTSMPPKLKTPhsRMPAKEPVPKEPLhTTSKPKMPPSPEVADTTSVPKDERLSLKPDPEVTHSETVLPPVT 1109
Cdd:PTZ00449   569 PSKIPTLSKKPEFPKDPKHP--KDPEEPKKPKRPR-SAQRPTRPKSPKLPELLDIPKSPKRPESPKSPKRPPPPQRPSSP 645
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907118297 1110 FRVEPPKT--TIAPLETRGIPLIPVISPRPSQEELQTAMEETDQST----QELFTTKIPRTTELAKTTQAPHRLHTAPVR 1183
Cdd:PTZ00449   646 ERPEGPKIikSPKPPKSPKPPFDPKFKEKFYDDYLDAAAKSKETKTtvvlDESFESILKETLPETPGTPFTTPRPLPPKL 725
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907118297 1184 PRIPGRPHGRPAlnktttRPDKTKPRGTSHKNGVGTGTKQAPKPPSPGRNASVDSHATRKPGSVSGTRRPPIPHRhsstR 1263
Cdd:PTZ00449   726 PRDEEFPFEPIG------DPDAEQPDDIEFFTPPEEERTFFHETPADTPLPDILAEEFKEEDIHAETGEPDEAMK----R 795
                          250
                   ....*....|....*....
gi 1907118297 1264 PVSP-ERRPLPPNNVTGKP 1281
Cdd:PTZ00449   796 PDSPsEHEDKPPGDHPSLP 814
PRK07994 PRK07994
DNA polymerase III subunits gamma and tau; Validated
490-618 1.91e-03

DNA polymerase III subunits gamma and tau; Validated


Pssm-ID: 236138 [Multi-domain]  Cd Length: 647  Bit Score: 42.93  E-value: 1.91e-03
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907118297  490 SPEVRPTTAAPQQTTSIPSTPKRQSTPKPPRVKPAPEPETRPSAQTTKAPRKTKKPGHHRLRRPKTTRSPEV-PKSKPAL 568
Cdd:PRK07994   370 VPPQSAAPAASAQATAAPTAAVAPPQAPAVPPPPASAPQQAPAVPLPETTSQLLAARQQLQRAQGATKAKKSePAAASRA 449
                           90       100       110       120       130
                   ....*....|....*....|....*....|....*....|....*....|....*.
gi 1907118297  569 EPATVTPEIL-----VPKIVPKPPQKPKATR-RPEVPQVKPAHEPVTFGSEAPALA 618
Cdd:PRK07994   450 RPVNSALERLasvrpAPSALEKAPAKKEAYRwKATNPVEVKKEPVATPKALKKALE 505
PRK14950 PRK14950
DNA polymerase III subunits gamma and tau; Provisional
480-628 2.22e-03

DNA polymerase III subunits gamma and tau; Provisional


Pssm-ID: 237864 [Multi-domain]  Cd Length: 585  Bit Score: 42.49  E-value: 2.22e-03
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907118297  480 FESRNVEIFTSPEVRPTTAAPQQTTSIPSTPKRQSTPKPPRVKPAPEPETRPSAQTTKAPRKTKKPghhrLRRPKTTRSP 559
Cdd:PRK14950   351 LELAVIEALLVPVPAPQPAKPTAAAPSPVRPTPAPSTRPKAAAAANIPPKEPVRETATPPPVPPRP----VAPPVPHTPE 426
                           90       100       110       120       130       140       150
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|...
gi 1907118297  560 EVPKSKPALEPATVTPEILVPkivPKPPQKPKATRRPEV--PQVKPAHEPVT--FGSEAPALAIVTTTDIEPV 628
Cdd:PRK14950   427 SAPKLTRAAIPVDEKPKYTPP---APPKEEEKALIADGDvlEQLEAIWKQILrdVPPRSPAVQALLSSGVRPV 496
Not5 COG5665
CCR4-NOT transcriptional regulation complex, NOT5 subunit [Transcription];
428-758 2.37e-03

CCR4-NOT transcriptional regulation complex, NOT5 subunit [Transcription];


Pssm-ID: 444384 [Multi-domain]  Cd Length: 874  Bit Score: 42.73  E-value: 2.37e-03
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907118297  428 ATYDVISSSTTSDETEIEIHTATR-------DPILDSVPPKTSRTAEQPRATLAPIEALFESRNVEIFTSPEVR------ 494
Cdd:COG5665    208 STPQAFNASATSGRSQHIVQAAKRvgvewwgDPSLLATPPATPATEEKSSQQPKSQPTSPSGGTTPPSTNQLTTsntpts 287
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907118297  495 --------PTTAAPQQTTSIPSTPKRQSTPKPPRV--KPAPEPETRPSAQTTKAPRKTKKPGHhrlrRPKTTRSPEVPKS 564
Cdd:COG5665    288 takaqpqpPTKKQPAKEPPSDTASGNPSAPSVLINsdSPTSEDPATASVPTTEETTAFTTPSS----VPSTPAEKDTPAT 363
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907118297  565 KPALEPATVTPEILV-PKIVPKPPQKPKATrrpevpqvkpAHEPVTFGS-EAPALAIVTTTDIEPVITRTKASVTTLAPK 642
Cdd:COG5665    364 DLATPVSPTPPETSVdKKVSPDSATSSTKS----------EKEGGTASSpMPPNIAIGAKDDVDATDPSQEAKEYTKNAP 433
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907118297  643 PPRPRTHRQRTKYKTTQSPKIPHSK-PASTTKKVRRPRPKPQTTPHPEVPHTILVPATSLEPFIITEAPGTTLVPKLPQQ 721
Cdd:COG5665    434 MTPEADSAPESSVRTEASPSAGSDLePENTTLRDPAPNAIPPPEDPSTIGRLSSGDKLANETGPPVIRRDSTPSSTADQS 513
                          330       340       350       360
                   ....*....|....*....|....*....|....*....|..
gi 1907118297  722 --PDYPHPKPKTT---RSPAASPTELVPTPVFEPVTPLKEDP 758
Cdd:COG5665    514 ivGVLAFGLDQRTqaeISVEAASRSNPLLNSQVKSFPLGKRS 555
PHA03247 PHA03247
large tegument protein UL36; Provisional
307-575 3.00e-03

large tegument protein UL36; Provisional


Pssm-ID: 223021 [Multi-domain]  Cd Length: 3151  Bit Score: 42.62  E-value: 3.00e-03
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907118297  307 TLALPAESKTPEVEKLAGQPVTVTPESVSRSTKPTLSSALDTAETALVLSEKTSE-TARSVLIPEFELPLSTLAPkrfpe 385
Cdd:PHA03247  2756 RPARPPTTAGPPAPAPPAAPAAGPPRRLTRPAVASLSESRESLPSPWDPADPPAAvLAPAAALPPAASPAGPLPP----- 2830
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907118297  386 fPEAKTAFPLEKPRGSWASSEEP--WVVPGAKtsedsrvvqpqtatydvISSSTTSDETEIEIHTATRDPILDSVPPKTS 463
Cdd:PHA03247  2831 -PTSAQPTAPPPPPGPPPPSLPLggSVAPGGD-----------------VRRRPPSRSPAAKPAAPARPPVRRLARPAVS 2892
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907118297  464 RTAE---QPRATLAPiealfeSRNVEIFTSPEVRPTTAAPQQTTSIPSTPKRQSTPKPPRVKPAPEPEtrPSAQTTKAPR 540
Cdd:PHA03247  2893 RSTEsfaLPPDQPER------PPQPQAPPPPQPQPQPPPPPQPQPPPPPPPRPQPPLAPTTDPAGAGE--PSGAVPQPWL 2964
                          250       260       270
                   ....*....|....*....|....*....|....*
gi 1907118297  541 KTKKPGHHRLRRpktTRSPEVPKSKPALEPATVTP 575
Cdd:PHA03247  2965 GALVPGRVAVPR---FRVPQPAPSREAPASSTPPL 2996
PRK07764 PRK07764
DNA polymerase III subunits gamma and tau; Validated
469-600 3.53e-03

DNA polymerase III subunits gamma and tau; Validated


Pssm-ID: 236090 [Multi-domain]  Cd Length: 824  Bit Score: 42.28  E-value: 3.53e-03
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907118297  469 PRATLAPIEALfESRNVEIFTSPEVRPT----TAAPQQTTSIPSTPKRQSTPKPPRVkPAPEPETRPSAQTTKAPRKTKK 544
Cdd:PRK07764   371 ERGLLARLERL-ERRLGVAGGAGAPAAAapsaAAAAPAAAPAPAAAAPAAAAAPAPA-AAPQPAPAPAPAPAPPSPAGNA 448
                           90       100       110       120       130
                   ....*....|....*....|....*....|....*....|....*....|....*.
gi 1907118297  545 PGHHRLRRPKTTRSPEVPKSKPALEPATVTPEILVPKIVPKPPQKPKATRRPEVPQ 600
Cdd:PRK07764   449 PAGGAPSPPPAAAPSAQPAPAPAAAPEPTAAPAPAPPAAPAPAAAPAAPAAPAAPA 504
PRK14954 PRK14954
DNA polymerase III subunits gamma and tau; Provisional
515-612 4.42e-03

DNA polymerase III subunits gamma and tau; Provisional


Pssm-ID: 184918 [Multi-domain]  Cd Length: 620  Bit Score: 41.85  E-value: 4.42e-03
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907118297  515 TPKPPRVKPAPEPETrPSAQTTKAPRKTKKPGhhrlRRPKTTRSPEvpkSKPAlePATVTPeilVPKIVPKPPqKPKATR 594
Cdd:PRK14954   385 AGSPDVKKKAPEPDL-PQPDRHPGPAKPEAPG----ARPAELPSPA---SAPT--PEQQPP---VARSAPLPP-SPQASA 450
                           90
                   ....*....|....*...
gi 1907118297  595 RPEVPQVKPAhepVTFGS 612
Cdd:PRK14954   451 PRNVASGKPG---VDLGS 465
COG3979 COG3979
Chitodextrinase [Carbohydrate transport and metabolism];
1387-1486 6.13e-03

Chitodextrinase [Carbohydrate transport and metabolism];


Pssm-ID: 443178 [Multi-domain]  Cd Length: 369  Bit Score: 40.91  E-value: 6.13e-03
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907118297 1387 PPQNPpTNLTVVTVEgcPSFVILDWEK-PLNDTVTEYEVisrengsFSGKNKSIQITNQTFSTVENLKPDTSYEFQVKPK 1465
Cdd:COG3979      2 APTAP-TGLTASNVT--SSSVSLSWDAsTDNVGVTGYDV-------YRGGDQVATVTGLTAWTVTGLTPGTEYTFTVGAC 71
                           90       100
                   ....*....|....*....|.
gi 1907118297 1466 nplgeGPASNTVAFSTESADP 1486
Cdd:COG3979     72 -----DAAGNVSAASGTSTAM 87
PRK12323 PRK12323
DNA polymerase III subunit gamma/tau;
491-664 7.86e-03

DNA polymerase III subunit gamma/tau;


Pssm-ID: 237057 [Multi-domain]  Cd Length: 700  Bit Score: 41.01  E-value: 7.86e-03
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907118297  491 PEVRPTTAAPQQTTSIPSTPKRQSTPKPPRVKPAPEPETRPSAQTTKAPRKTKKPGHHRLRRPKTTRSPEVPKSKPALEP 570
Cdd:PRK12323   383 AQPAPAAAAPAAAAPAPAAPPAAPAAAPAAAAAARAVAAAPARRSPAPEALAAARQASARGPGGAPAPAPAPAAAPAAAA 462
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907118297  571 ATVTPEIL---VPKIVPKPPQKPKATRRPEVPQVKPAHE-PVTFGSEAPA------LAIVTTTDIEPVITRTKASVTTLA 640
Cdd:PRK12323   463 RPAAAGPRpvaAAAAAAPARAAPAAAPAPADDDPPPWEElPPEFASPAPAqpdaapAGWVAESIPDPATADPDDAFETLA 542
                          170       180
                   ....*....|....*....|....
gi 1907118297  641 PKPPRPRTHRQRTKYKTTQSPKIP 664
Cdd:PRK12323   543 PAPAAAPAPRAAAATEPVVAPRPP 566
PRK10263 PRK10263
DNA translocase FtsK; Provisional
558-828 8.91e-03

DNA translocase FtsK; Provisional


Pssm-ID: 236669 [Multi-domain]  Cd Length: 1355  Bit Score: 40.84  E-value: 8.91e-03
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907118297  558 SPEVPKSKPALE----PATVTPEilvPKIVPKPPQKPKATR--RPEVPQVKPAHEPV--TFGSEAPALAIVTTTDIEPVI 629
Cdd:PRK10263   348 SVDVPPAQPTVAwqpvPGPQTGE---PVIAPAPEGYPQQSQyaQPAVQYNEPLQQPVqpQQPYYAPAAEQPAQQPYYAPA 424
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907118297  630 TRTKASVTTLAPKPPRPRTHRQrtkykTTQSPKIPHSKPASTtKKVRRPRPKPQTTPHPEVPHTILVPATSLEPFIITEA 709
Cdd:PRK10263   425 PEQPAQQPYYAPAPEQPVAGNA-----WQAEEQQSTFAPQST-YQTEQTYQQPAAQEPLYQQPQPVEQQPVVEPEPVVEE 498
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907118297  710 pgttLVPKLPQQPDYPHPKPKTTRS--PAASPTELVPTPVFEPVtplkedPVTTIVPITDLERVTDLETPVAFRTEAPGT 787
Cdd:PRK10263   499 ----TKPARPPLYYFEEVEEKRAREreQLAAWYQPIPEPVKEPE------PIKSSLKAPSVAAVPPVEAAAAVSPLASGV 568
                          250       260       270       280
                   ....*....|....*....|....*....|....*....|....
gi 1907118297  788 ---TLASKISQRTHRPRPRPRPRPRPRPRPKATLSPQAPETKTV 828
Cdd:PRK10263   569 kkaTLATGAAATVAAPVFSLANSGGPRPQVKEGIGPQLPRPKRI 612
 
Blast search parameters
Data Source: Precalculated data, version = cdd.v.3.21
Preset Options:Database: CDSEARCH/cdd   Low complexity filter: no  Composition Based Adjustment: yes   E-value threshold: 0.01

References:

  • Wang J et al. (2023), "The conserved domain database in 2023", Nucleic Acids Res.51(D)384-8.
  • Lu S et al. (2020), "The conserved domain database in 2020", Nucleic Acids Res.48(D)265-8.
  • Marchler-Bauer A et al. (2017), "CDD/SPARCLE: functional classification of proteins via subfamily domain architectures.", Nucleic Acids Res.45(D)200-3.
Help | Disclaimer | Write to the Help Desk
NCBI | NLM | NIH