Conserved domains on  [gi|1958650962|ref|XP_038950815|]

pecanex-like protein 3 isoform X3 [Rattus norvegicus]

Protein Classification

oligosaccharide repeat unit polymerase; pecanex family protein (domain architecture ID 10523572)

oligosaccharide repeat unit polymerase may act to polymerize the oligosaccharide repeat units of surface polysaccharides, including O-antigen in Gram-negative bacteria and capsular polysaccharide in Gram-positive bacteria; pecanex family protein similar to Drosophila melanogaster protein pecanex that is involved in neurogenesis

Gene Ontology:  GO:0016020

List of domain hits

Name Accession Description Interval E-value
Pecanex_C pfam05041
Pecanex protein (C-terminus); This family consists of C terminal region of the pecanex protein ...
1577-1803 2.02e-138

Pecanex protein (C-terminus); This family consists of the C-terminal region of pecanex protein homologs. The pecanex protein is encoded by a maternal-effect neurogenic gene found in Drosophila.


Pssm-ID: 461533  Cd Length: 227  Bit Score: 429.82  E-value: 2.02e-138
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958650962 1577 DQDWNSPLVTLCFGLCVLGRRALGTASHSMSASLEPFLYGLHALFKGDFRITSPRDEWVFADMDLLHRVVAPGVRMALKL 1656
Cdd:pfam05041    1 DSDSDSTLVTLCFALSLLGRRALGSASHSMSNSLESFLYGLHFLFKGDFRITSDKDEWVFMDLDLLRKVVAPAMRMALKL 80
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958650962 1657 HQDHFTSPDEYEEPAALYDAIAANEERLVISHEGDPAWRSAILSNTPSLLALRHVMDDASDEYKIIMLNRRHLSFRVIKV 1736
Cdd:pfam05041   81 HQDHFTDPDEYDENEVLYDAIHTYELVIVIEHESDPRWRVAVLSNNPSLLALRHVDDDGEDEYKIIMLNRRTLSFRVIKV 160
                          170       180       190       200       210       220
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*..
gi 1958650962 1737 NRECVRGLWAGQQQELVFLRNRNPERGSIQNAKQALRNMINSSCDQPLGYPIYVSPLTTSLAGSHPQ 1803
Cdd:pfam05041  161 NRECVRGLWAGQQQELIFLRNRNRERGSIQNAKQALRNIINSSCDQPIGYPIYVSPLTTSYSNTHLQ 227
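
The per-hit summary line (Pssm-ID, Cd Length, Bit Score, E-value) together with the model coordinates in the alignment rows is enough to work out how much of the domain model a hit covers. The following is a minimal Python sketch, assuming the summary line keeps the layout shown above; `parse_summary` and `model_coverage` are hypothetical helper names, not part of any NCBI tool.

```python
import re

# Matches lines such as:
#   Pssm-ID: 461533  Cd Length: 227  Bit Score: 429.82  E-value: 2.02e-138
#   Pssm-ID: 223021 [Multi-domain]  Cd Length: 3151  Bit Score: 59.57  E-value: 2.70e-08
SUMMARY_RE = re.compile(
    r"Pssm-ID:\s*(?P<pssm_id>\d+)"
    r"(?:\s*\[(?P<flag>[^\]]+)\])?"
    r"\s+Cd Length:\s*(?P<cd_length>\d+)"
    r"\s+Bit Score:\s*(?P<bit_score>[\d.]+)"
    r"\s+E-value:\s*(?P<e_value>[\d.eE+-]+)"
)


def parse_summary(line):
    """Turn one summary line into a dict of typed fields."""
    match = SUMMARY_RE.search(line)
    if match is None:
        raise ValueError(f"not a CD-Search summary line: {line!r}")
    return {
        "pssm_id": int(match["pssm_id"]),
        "multi_domain": match["flag"] == "Multi-domain",
        "cd_length": int(match["cd_length"]),
        "bit_score": float(match["bit_score"]),
        "e_value": float(match["e_value"]),
    }


def model_coverage(model_start, model_end, cd_length):
    """Fraction of the domain model spanned by the alignment (1-based, inclusive)."""
    return (model_end - model_start + 1) / cd_length


pecanex = parse_summary(
    "Pssm-ID: 461533  Cd Length: 227  Bit Score: 429.82  E-value: 2.02e-138"
)
# The Pecanex_C alignment runs over model positions 1 to 227.
pecanex_coverage = model_coverage(1, 227, pecanex["cd_length"])
```

For Pecanex_C the alignment spans model positions 1 to 227 of a 227-residue model, so the query contains the complete domain.
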
PHA03247 super family cl33720
large tegument protein UL36; Provisional
201-655 2.70e-08

large tegument protein UL36; Provisional


The actual alignment was detected with superfamily member PHA03247:

Pssm-ID: 223021 [Multi-domain]  Cd Length: 3151  Bit Score: 59.57  E-value: 2.70e-08
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958650962  201 PGVVPDPSLPSTDSSERSPMAGDAVPWGGSSVADTPMSPLLKGSLSQELSKSFLTLTRPDRALVRTSSrreqcrgTGGYQ 280
Cdd:PHA03247  2589 PDAPPQSARPRAPVDDRGDPRGPAPPSPLPPDTHAPDPPPPSPSPAANEPDPHPPPTVPPPERPRDDP-------APGRV 2661
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958650962  281 PLDRRGSGDPLPQKAGSSdscfsgTDRETLSSFRSEKTNSTHLDSPPgghaPEGSDTDPPSEAELPASPDAGVPSDDTLR 360
Cdd:PHA03247  2662 SRPRRARRLGRAAQASSP------PQRPRRRAARPTVGSLTSLADPP----PPPPTPEPAPHALVSATPLPPGPAAARQA 2731
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958650962  361 SFDTVIGAGTPPGQTEPLLVV----RPKDLALLRPNKRRPP-MRGHSPPGRTPRRPLLEGSGFFEDEDTSEGSELSPASS 435
Cdd:PHA03247  2732 SPALPAAPAPPAVPAGPATPGgparPARPPTTAGPPAPAPPaAPAAGPPRRLTRPAVASLSESRESLPSPWDPADPPAAV 2811
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958650962  436 LRSQRRYSTDSSSSTSCYSPESSQGAAGGPRKRRAPHGAEEGTAVPPKRPYG----TQRTPSTASAKTHARVLSMDGAGG 511
Cdd:PHA03247  2812 LAPAAALPPAASPAGPLPPPTSAQPTAPPPPPGPPPPSLPLGGSVAPGGDVRrrppSRSPAAKPAAPARPPVRRLARPAV 2891
                          330       340       350       360       370       380       390       400
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958650962  512 DVLRAPLAGSKAELEAQPGMELAA--GEPAMLPPEARRGPAANQPGW-RGELQEEGAVGGAPEETGQRECTSNVR----R 584
Cdd:PHA03247  2892 SRSTESFALPPDQPERPPQPQAPPppQPQPQPPPPPQPQPPPPPPPRpQPPLAPTTDPAGAGEPSGAVPQPWLGAlvpgR 2971
                          410       420       430       440       450       460       470
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|..
gi 1958650962  585 AQAIRRRhnaGSNPTPPASVMGSPPSSLQEAQRGRAASHSRALTL-PSALHFASSLLLTRAGPNVHEASNFD 655
Cdd:PHA03247  2972 VAVPRFR---VPQPAPSREAPASSTPPLTGHSLSRVSSWASSLALhEETDPPPVSLKQTLWPPDDTEDSDAD 3040
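
Identity over an alignment block can be counted directly from a query row and its model row. The sketch below is an illustration in Python, not the scoring that CD-Search itself applies. It assumes that dashes are gaps and that lowercase letters mark residues not aligned to the model, which is how they appear opposite gaps in the rows above. The function name `alignment_stats` is hypothetical.

```python
def alignment_stats(query_row, model_row):
    """Count aligned columns, identities and gap columns for two equal-length rows.

    Assumptions (not taken from NCBI documentation):
      * '-' in either row is a gap column
      * a lowercase letter is an unaligned residue and is skipped
    """
    if len(query_row) != len(model_row):
        raise ValueError("alignment rows must have the same length")

    aligned = identical = gaps = 0
    for q, m in zip(query_row, model_row):
        if q == "-" or m == "-":
            gaps += 1
            continue
        if q.islower() or m.islower():
            continue
        aligned += 1
        if q == m:
            identical += 1
    return {"aligned": aligned, "identical": identical, "gaps": gaps}


# First 20 columns of the last PHA03247 block (query residues 585 onward).
query_excerpt = "AQAIRRRhnaGSNPTPPASV"
model_excerpt = "VAVPRFR---VPQPAPSREA"
stats = alignment_stats(query_excerpt, model_excerpt)
```

For that 20-column excerpt the function counts 17 aligned columns, 4 identities and 3 gap columns, which is in line with the weak, proline-rich similarity that the E-value of 2.70e-08 and the low bit score suggest.
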
 
KLF9_13_N-like cd21975
Kruppel-like factor (KLF) 9, KLF13, KLF14, KLF16, and similar proteins; Kruppel/Krueppel-like ...
455-592 2.06e-03

Kruppel-like factor (KLF) 9, KLF13, KLF14, KLF16, and similar proteins; Kruppel/Krueppel-like transcription factors (KLFs) belong to a family of proteins, called the Specificity Protein (SP)/KLF family, characterized by a C-terminal DNA-binding domain of 81 amino acids consisting of three Kruppel-like C2H2 zinc fingers. These factors bind to a loose consensus motif, namely NNRCRCCYY (where N is any nucleotide, R is A/G, and Y is C/T), such as the recurring motifs in GC and GT boxes (5'-GGGGCGGGG-3' and 5'-GGTGTGGGG-3') that are present in promoters and more distal regulatory elements of mammalian genes. Members of the KLF family can act as activators or repressors of transcription depending on cell and promoter context. KLFs regulate various cellular functions, such as proliferation, differentiation, and apoptosis, as well as the development and homeostasis of several types of tissue. KLF9, KLF10, KLF11, KLF13, KLF14, and KLF16 share a conserved alpha-helical motif AA/VXXL that mediates their binding to Sin3A and their activities as transcriptional repressors. In addition to the C-terminal DNA-binding domain, each KLF also has a unique N-terminal activation/repression domain that confers specificity and allows it to bind specifically to a certain partner, leading to distinct activities in vivo. This model represents the related N-terminal domains of KLF9, KLF13, KLF14, KLF16, and similar proteins.


Pssm-ID: 409240 [Multi-domain]  Cd Length: 163  Bit Score: 40.83  E-value: 2.06e-03
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958650962  455 PESSQGAAGGPRKRRAPHGAEEGTAVPPKRPYGTQrTPSTASAKTHARVLSM--DGAGGDVLRAPLAGSKAELEAQPGME 532
Cdd:cd21975     25 PEGAGLAAGLDVRATREVAKGPGPPGPAWKPDGAD-SPGLVTAAPHLLAANVlaPLRGPSVEGSSLESGDADMGSDSDVA 103
                           90       100       110       120       130       140
                   ....*....|....*....|....*....|....*....|....*....|....*....|..
gi 1958650962  533 LAAGEPAMLPPEARRGPAAN-QPGWrgeLQEEGAVGGAPEETGQRECTSNVRR-AQAIRRRH 592
Cdd:cd21975    104 PASGAAASTSPESSSDAASSpSPLS---LLHPGEAGLEPERPRPRVRRGVRRRgVTPAAKRH 162
 
PHA03307 PHA03307
transcriptional regulator ICP4; Provisional
197-600 1.11e-06

transcriptional regulator ICP4; Provisional


Pssm-ID: 223039 [Multi-domain]  Cd Length: 1352  Bit Score: 54.02  E-value: 1.11e-06
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958650962  197 PQTPPGVVPDPSlPSTDSSERSPMAGDAVPWGGSSVADTPMSPLLKGSLSQELSKSFLTLTRPDRAlVRTSSRREQCRGT 276
Cdd:PHA03307   118 PPTPPPASPPPS-PAPDLSEMLRPVGSPGPPPAASPPAAGASPAAVASDAASSRQAALPLSSPEET-ARAPSSPPAEPPP 195
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958650962  277 GGYQPLDRRGSGDPLPQKAGSSDSCFSGTDRETLSSFRSEKTNSTHLDSPPGGHAPEGSDTDP-PSEAELPASPDAGVPS 355
Cdd:PHA03307   196 STPPAAASPRPPRRSSPISASASSPAPAPGRSAADDAGASSSDSSSSESSGCGWGPENECPLPrPAPITLPTRIWEASGW 275
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958650962  356 DDTLRSFDTVIGAGTPPGqtepllvvrpkdlallrPNKRRPPMRGHSPPGRTPRRPLLEGSGfFEDEDTSEGSELSPASS 435
Cdd:PHA03307   276 NGPSSRPGPASSSSSPRE-----------------RSPSPSPSSPGSGPAPSSPRASSSSSS-SRESSSSSTSSSSESSR 337
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958650962  436 LRSQRRYSTDSSSSTSCYSPESSQGAAggPRKRRAPHGAEEGTAVPPKRPygtqrTPSTASAKTHARVLSMDGAGgdvlr 515
Cdd:PHA03307   338 GAAVSPGPSPSRSPSPSRPPPPADPSS--PRKRPRPSRAPSSPAASAGRP-----TRRRARAAVAGRARRRDATG----- 405
                          330       340       350       360       370       380       390       400
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958650962  516 aPLAGSKaeleAQPGMELAAGEPAMLPPEARRGPAANQPGWRGELQEEGAV--GGApeeTGQRECTSNVRRAQAIRRRHN 593
Cdd:PHA03307   406 -RFPAGR----PRPSPLDAGAASGAFYARYPLLTPSGEPWPGSPPPPPGRVryGGL---GDSRPGLWDAPEVREAAARYE 477

                   ....*..
gi 1958650962  594 AGSNPTP 600
Cdd:PHA03307   478 ASPGPVP 484
PHA03307 PHA03307
transcriptional regulator ICP4; Provisional
286-620 6.39e-06

transcriptional regulator ICP4; Provisional


Pssm-ID: 223039 [Multi-domain]  Cd Length: 1352  Bit Score: 51.71  E-value: 6.39e-06
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958650962  286 GSGDPLPQKAGSSDSCFSGTDRETLSSFRSEKTNSTHLDSPPGGHAPEGSDTDPPSEA----ELPASPDAGVPSDDTLRS 361
Cdd:PHA03307    74 GPGTEAPANESRSTPTWSLSTLAPASPAREGSPTPPGPSSPDPPPPTPPPASPPPSPApdlsEMLRPVGSPGPPPAASPP 153
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958650962  362 FDTVIGAGTPPGQTEPLLVVRPkdLALLRPNKRRPPMRGHSPPGRTPrRPLLEGSGFFEDEDTSEGSELSPASSLRSQRR 441
Cdd:PHA03307   154 AAGASPAAVASDAASSRQAALP--LSSPEETARAPSSPPAEPPPSTP-PAAASPRPPRRSSPISASASSPAPAPGRSAAD 230
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958650962  442 YSTDSSSSTSCYSP----ESSQGAAGGP---------RKRRAPHGAEEGTAVPPKRPYGTQRTPSTASAKTHARVLSMDG 508
Cdd:PHA03307   231 DAGASSSDSSSSESsgcgWGPENECPLPrpapitlptRIWEASGWNGPSSRPGPASSSSSPRERSPSPSPSSPGSGPAPS 310
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958650962  509 ---AGGDVLRAPLAGSKAELEAQPGMELAAGEPAmlPPEAR-----RGPAANQPGWRGELQEEGAVGGAPEETGQRECTS 580
Cdd:PHA03307   311 sprASSSSSSSRESSSSSTSSSSESSRGAAVSPG--PSPSRspspsRPPPPADPSSPRKRPRPSRAPSSPAASAGRPTRR 388
                          330       340       350       360
                   ....*....|....*....|....*....|....*....|
gi 1958650962  581 NVRRAQAIRRRHNAGSNPTPPASVMGSPPSSLQEAQRGRA 620
Cdd:PHA03307   389 RARAAVAGRARRRDATGRFPAGRPRPSPLDAGAASGAFYA 428
PHA03247 PHA03247
large tegument protein UL36; Provisional
334-609 3.45e-05

large tegument protein UL36; Provisional


Pssm-ID: 223021 [Multi-domain]  Cd Length: 3151  Bit Score: 49.17  E-value: 3.45e-05
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958650962  334 GSDTDPPSEAELPASPDAGVP-SDDTLRSFDTVIGA-----GTPPGQTEPLLVVRPKDLALLRPNKRRPPMRGHSP---- 403
Cdd:PHA03247  2549 GDPPPPLPPAAPPAAPDRSVPpPRPAPRPSEPAVTSrarrpDAPPQSARPRAPVDDRGDPRGPAPPSPLPPDTHAPdppp 2628
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958650962  404 PGRTPRRPLLEGSGFFEDEDTSEGSELSPASSLRSQRRYSTDSSSSTSCYSPEssqgaagGPRKRRAPHGAEEGTAV--- 480
Cdd:PHA03247  2629 PSPSPAANEPDPHPPPTVPPPERPRDDPAPGRVSRPRRARRLGRAAQASSPPQ-------RPRRRAARPTVGSLTSLadp 2701
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958650962  481 -PPKRPYGTQRTPSTASAKTHARVLSMDGAGGDVLRAPLAGSKAELEAQPGME-------LAAGEPAMLPPEARRG--PA 550
Cdd:PHA03247  2702 pPPPPTPEPAPHALVSATPLPPGPAAARQASPALPAAPAPPAVPAGPATPGGParparppTTAGPPAPAPPAAPAAgpPR 2781
                          250       260       270       280       290
                   ....*....|....*....|....*....|....*....|....*....|....*....
gi 1958650962  551 ANQPGWRGELQEEGAVGGAPEETGQRECTSNVRRAQAIRRRHNAGSNPTPPASVMGSPP 609
Cdd:PHA03247  2782 RLTRPAVASLSESRESLPSPWDPADPPAAVLAPAAALPPAASPAGPLPPPTSAQPTAPP 2840
PRK13863 PRK13863
T-DNA border endonuclease VirD2;
419-622 7.59e-04

T-DNA border endonuclease VirD2;


Pssm-ID: 237533 [Multi-domain]  Cd Length: 446  Bit Score: 44.17  E-value: 7.59e-04
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958650962  419 FEDEDTSEGSelsPASSLRSQRRYSTDSSSSTSCYSPESSQGAAG----GPRKRRAPHGAEEGTAVPPKRPYGTQRTPST 494
Cdd:PRK13863   211 FEDADFEEFS---PGEDHREPSQSFDTSPGEAPQGEPESAERPEKlqneSEVRLQEPAGSSIKADARIRVSLESERRAQP 287
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958650962  495 ASAKTharvlSMDGAGGDVLRAPLAGSKAELEAQPGMELAAGEPAMLPPEAR-------RGPAANQPGWRGELQEEGAVG 567
Cdd:PRK13863   288 SASKI-----PVADDFGIETSYVAEGDVRKLEGNSGTPRLATEVATHTTSERqqrrkrpRDDEGEPSGAKRTRLNGIAVG 362
                          170       180       190       200       210
                   ....*....|....*....|....*....|....*....|....*....|....*.
gi 1958650962  568 gaPEET-GQRECTSNVRRAQAIRRRHNAGSNPTPPASVMGSPPSSLQEAQRGRAAS 622
Cdd:PRK13863   363 --PEANaGEQDGRDDPITSPAQPPRSNPLADPVRASIATDSLPATADRQQQREPSS 416
 
Blast search parameters
Data Source: Precalculated data, version = cdd.v.3.21
Preset Options:
Database: CDSEARCH/cdd
Low complexity filter: no
Composition Based Adjustment: yes
E-value threshold: 0.01
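
The E-value threshold in the preset options determines which hits appear in the list. As an illustration, the sketch below re-applies a threshold to the seven distinct hits reported on this page and checks which query intervals overlap. It is plain Python with hypothetical names (`Hit`, `passing`, `overlaps`), and the use of `<=` for the cutoff is an assumption rather than documented CD-Search behaviour.

```python
from typing import NamedTuple


class Hit(NamedTuple):
    name: str
    accession: str
    start: int      # first query residue in the hit (1-based)
    end: int        # last query residue in the hit (inclusive)
    e_value: float


# Intervals and E-values copied from the domain hit list on this page.
HITS = [
    Hit("Pecanex_C", "pfam05041", 1577, 1803, 2.02e-138),
    Hit("PHA03247", "PHA03247", 201, 655, 2.70e-08),
    Hit("PHA03307", "PHA03307", 197, 600, 1.11e-06),
    Hit("PHA03307", "PHA03307", 286, 620, 6.39e-06),
    Hit("PHA03247", "PHA03247", 334, 609, 3.45e-05),
    Hit("PRK13863", "PRK13863", 419, 622, 7.59e-04),
    Hit("KLF9_13_N-like", "cd21975", 455, 592, 2.06e-03),
]


def passing(hits, threshold):
    """Hits whose E-value is at or below the threshold (cutoff rule assumed)."""
    return [h for h in hits if h.e_value <= threshold]


def overlaps(a, b):
    """True if two hits share at least one query residue."""
    return a.start <= b.end and b.start <= a.end


reported = passing(HITS, 0.01)      # the page's preset threshold
stricter = passing(HITS, 1e-5)      # an arbitrary, tighter cutoff
pecanex = HITS[0]
isolated = not any(overlaps(pecanex, h) for h in HITS[1:])
```

With the preset threshold of 0.01 all seven hits are kept. A tighter cutoff of 1e-5 leaves four, and the Pecanex_C interval (1577-1803) overlaps none of the others, which all fall within residues 197-655.
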

References:

  • Wang J et al. (2023), "The conserved domain database in 2023", Nucleic Acids Res. 51(D):384-8.
  • Lu S et al. (2020), "The conserved domain database in 2020", Nucleic Acids Res. 48(D):265-8.
  • Marchler-Bauer A et al. (2017), "CDD/SPARCLE: functional classification of proteins via subfamily domain architectures", Nucleic Acids Res. 45(D):200-3.