Conserved domains on [gi|15237657|ref|NP_201232|]
splicing factor [Arabidopsis thaliana]

Protein Classification

splicing factor 3B subunit 1 family protein (domain architecture ID 12097887)

Splicing factor 3B subunit 1 (SF3B1) family protein; may be involved in pre-mRNA splicing as a component of the splicing factor SF3B complex.

List of domain hits

Name Accession Description Interval E-value
HSH155 super family cl26678
U2 snRNP spliceosome subunit [RNA processing and modification];
385-1262 0e+00

U2 snRNP spliceosome subunit [RNA processing and modification];


The actual alignment was detected with superfamily member COG5181:

Pssm-ID: 227508 [Multi-domain]  Cd Length: 975  Bit Score: 1195.15  E-value: 0e+00
                         10        20        30        40        50        60        70        80
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 15237657  385 YVPIRTPARKLQQTPTPMATPGYVIPEENRGQQYDV----PP----------------EVPG--GLPFMKPEDYQYFGSL 442
Cdd:COG5181   72 MDPGFQPDSDKKRELELNNTWDMEYPDSKRSSRWDEmgyePPqeinmclpargykaltDFHGyaDLGFFKVEDLKYFADD 151
                         90       100       110       120       130       140       150       160
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 15237657  443 LNEENEEELSPEEQKERKIMKLLLKVKNGTPPQRKTALRQLTDKARELGAGPLFNKILPLLMQPTLEDQERHLLVKVIDR 522
Cdd:COG5181  152 EKDFFMPLLEDREGDERDVYRLLLKVKNGGKRMRMEGLRILTDKAVNFGAAAVFNKVLPMLMSRELEDQERHLVVKLIDR 231
                        170       180       190       200       210       220       230       240
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 15237657  523 ILYKLDEMVRPYVHKILVVIEPLLIDEDYYARVEGREIISNLSKAAGLASMIAAMRPDIDNIDEYVRNTTARAFSVVASA 602
Cdd:COG5181  232 LLYGLDDLKVPYVHKILVVVGPLLIDEDLKRRCMGREIILNLVYRCGLGFSVSSMRPDITSKDEYVRNVTGRAVGVVADA 311
                        250       260       270       280       290       300       310       320
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 15237657  603 LGIPALLPFLKAVCQSKRSWQARHTGIKIVQQIAILIGCAVLPHLRSLVEIIEHGLSDENQKVRTITALSLAALAEAAAP 682
Cdd:COG5181  312 LGVEELLPFLEALCGSRKSWEARHTGIRIAQQICELLGRSRLSHLGPLLKCISKLLKDRSRFVRIDTANALSYLAELVGP 391
                        330       340       350       360       370       380       390       400
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 15237657  683 YGIESFDSVLKPLWKGIRSHRGKVLAAFLKAIGFIIPLMDAIYASYYTKEVMVILIREFQSPDEEMKKIVLKVVKQCVST 762
Cdd:COG5181  392 YGIEQFDEVLCPLWEGASQHRGKELVSFLKAMGFIIPLMSPEYACHDTREHMEIVFREFKSPDEEMKKDLLVVERICDKV 471
                        410       420       430       440       450       460       470       480
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 15237657  763 EGVEPEYIRSDILPEFFRNFWTRKMALERRNYKQLVETTVEVANKVGVADIVGRVVEDLKDESEQYRRMVMETIDKVVTN 842
Cdd:COG5181  472 GTDTPWKLRDQVSPEFFSPFWRRRSAGDRRSYKQVVLTTVILAKMGGDPRVSRKILEYYSDEPEPYRKMNAGLVSRIFSR 551
                        490       500       510       520       530       540       550       560
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 15237657  843 LGASDIDARLEELLIDGILYAFQEQTSDDAnVMLNGFGAVVNALGQRVKPYLPQICGTIKWRLNNKSAKVRQQAADLISR 922
Cdd:COG5181  552 LGRLGFDERLEERLYDSILNAFQEQDTTVG-LILPCFSTVLVSLEFRGKPHLSMIVSTILKLLRSKPPDVRIRAADLMGS 630
                        570       580       590       600       610       620       630       640
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 15237657  923 IAVVMKQCGEEQLMGHLGVVLYEYLGEEYPEVLGSILGALKAIVNVIGMTKMTPPIKDLLPRLTPILKNRHEKVQENCID 1002
Cdd:COG5181  631 LAKVLKACGETKELAKLGNILYENLGEDYPEVLGSILKAICSIYSVHRFRSMQPPISGILPSLTPILRNKHQKVVANTIA 710
                        650       660       670       680       690       700       710       720
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 15237657 1003 LVGRIADRGAEFVPAREWMRICFELLEMLKAHKKGIRRATVNTFGYIAKAIGPQDVLATLLNNLKVQERQNRVCTTVAIA 1082
Cdd:COG5181  711 LVGTICMNSPEYIGVREWMRICFELVDSLKSWNKEIRRNATETFGCISRAIGPQDVLDILLNNLKVQERQQRVCTSVAIS 790
                        730       740       750       760       770       780       790       800
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 15237657 1083 IVAETCSPFTVLPALMNEYRVPELNVQNGVLKSLSFLFEYIGEMGKDYIYAVTPLLEDALMDRDLVHRQTAASAVKHMAL 1162
Cdd:COG5181  791 IVAEYCGPFSVLPTLMSDYETPEANVQNGVLKAMCFMFEYIGQASLDYVYSITPLLEDALTDRDPVHRQTAMNVIRHLVL 870
                        810       820       830       840       850       860       870       880
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 15237657 1163 GVAGLGCEDALVHLLNFIWPNIFETSPHVINAVMEAIEGMRVALGAAVILNYCLQGLFHPARKVREVYWKIYNSLYIGAQ 1242
Cdd:COG5181  871 NCPGTGDEDAAIHLLNLLWPNILEPSPHVIQSFDEGMESFATVLGSGAMMKYVQQGLFHPSSTVRKRYWTVYNIMYVFDS 950
                        890       900
                 ....*....|....*....|
gi 15237657 1243 DTLVAAYPVlEDEQNNVYSR 1262
Cdd:COG5181  951 DAMVPCYPV-EEDLNPELAR 969
SF3b1 pfam08920
Splicing factor 3B subunit 1; This family consists of several eukaryotic splicing factor 3B ...
287-406 2.27e-62

Splicing factor 3B subunit 1; This family consists of several eukaryotic splicing factor 3B subunit 1 proteins, which associate with p14 through a C-terminus beta-strand that interacts with beta-3 of the p14 RNA recognition motif (RRM) beta-sheet, which is in turn connected to an alpha-helix by a loop that makes extensive contacts with both the shorter C-terminal helix and RRM of p14. This subunit is required for 'A' splicing complex assembly (formed by the stable binding of U2 snRNP to the branchpoint sequence in pre-mRNA) and 'E' splicing complex assembly.



Pssm-ID: 462634 [Multi-domain]  Cd Length: 114  Bit Score: 207.61  E-value: 2.27e-62
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 15237657    287 KRQRSRWDETPATMGSAtpMGGVTPGAayTPGV-TPIGGIDMATPTPGQLifrGPMTPEQLNMQRWEKDIEERNRPLSDE 365
Cdd:pfam08920    1 SKRRSRWDETPANAGSG--PGGATPGE--TPGRqTPVGAMGMATPTPGAL---GPMTPEQMQAFRWEKEIDERNRPLTDE 73
                           90       100       110       120
                   ....*....|....*....|....*....|....*....|.
gi 15237657    366 ELDAMFPKDGYKVLDPPATYVPIRTPARKLQQTPTPMATPG 406
Cdd:pfam08920   74 ELDAMLPGEGYKILDPPAGYVPIRTPARKLLATPTPMGGTG 114
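
The alignment rows above can be summarized numerically. The following is a minimal sketch, not part of the CD-Search output, that counts identical residues in one pair of rows. It assumes the display convention visible above: `-` marks a gap and lowercase letters mark residues outside the aligned columns. The function name `pair_identity` is hypothetical, and the two strings are copied from the last pfam08920 block (query residues 366-406, model positions 74-114).

```python
def pair_identity(query_row: str, subject_row: str) -> tuple[int, int]:
    """Return (identical, aligned) counts for one pair of alignment rows.

    Assumption: '-' is a gap and lowercase letters are unaligned residues,
    so only columns where both rows hold an uppercase letter are counted.
    """
    if len(query_row) != len(subject_row):
        raise ValueError("alignment rows must have equal length")
    identical = aligned = 0
    for q, s in zip(query_row, subject_row):
        if q.isupper() and s.isupper():
            aligned += 1
            if q == s:
                identical += 1
    return identical, aligned


# Rows copied from the pfam08920 alignment block above.
query = "ELDAMFPKDGYKVLDPPATYVPIRTPARKLQQTPTPMATPG"
subject = "ELDAMLPGEGYKILDPPAGYVPIRTPARKLLATPTPMGGTG"

identical, aligned = pair_identity(query, subject)
print(f"{identical}/{aligned} aligned columns are identical")
```

Applying the same function to every block of a hit and summing the counts gives a per-hit identity figure; CD-Search itself reports only the bit score and E-value.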
 
PHA03378 PHA03378
EBNA-3B; Provisional
216-429 1.05e-08

EBNA-3B; Provisional


Pssm-ID: 223065 [Multi-domain]  Cd Length: 991  Bit Score: 59.70  E-value: 1.05e-08
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 15237657   216 LPDAAPGIGRWD--APTPGRVSDATPSAGRRNRWDETP------TPGRVTDSDATPGGGVTPGATPSGVTWDGLA-TPTP 286
Cdd:PHA03378  686 PIQWAPGTMQPPprAPTPMRPPAAPPGRAQRPAAATGRarppaaAPGRARPPAAAPGRARPPAAAPGRARPPAAApGRAR 765
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 15237657   287 KRQRSRWDETPATMGSATPMGGVTPGAAYTPGVTPIGGidmatPTPGQLIFR---GPMTPEQLNMQRWEKDIEERNRPLS 363
Cdd:PHA03378  766 PPAAAPGAPTPQPPPQAPPAPQQRPRGAPTPQPPPQAG-----PTSMQLMPRaapGQQGPTKQILRQLLTGGVKRGRPSL 840
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 15237657   364 D-----EELDAMFPK------DGYKVLDPPATYVPIRTPARKLQQ--TPTPMATPGYV-IPEENRGQQYDVPPEVPGGLP 429
Cdd:PHA03378  841 KkpaalERQAAAGPTpspgsgTSDKIVQAPVFYPPVLQPIQVMRQlgSVRAAAASTVTqAPTEYTGERRGVGPMHPTDIP 920
CTD smart01104
Spt5 C-terminal nonapeptide repeat binding Spt4; The C-terminal domain of the transcription ...
230-343 1.37e-06

Spt5 C-terminal nonapeptide repeat binding Spt4; The C-terminal domain of the transcription elongation factor protein Spt5 is necessary for binding to Spt4 to form the functional complex that regulates early transcription elongation by RNA polymerase II. The complex may be involved in pre-mRNA processing through its association with mRNA capping enzymes. This CTD domain carries a regular nonapeptide repeat that can be present in up to 18 copies, as in S. pombe. The repeat has a characteristic TPA motif.


Pssm-ID: 215026 [Multi-domain]  Cd Length: 121  Bit Score: 48.67  E-value: 1.37e-06
                            10        20        30        40        50        60        70        80
                    ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 15237657     230 TPGRVSDA--TPSAGRRnrwdetpTPGRVTDSDATPGGGvTPGATPsgvTWDGLATPTPkRQRSRWDE---TPATMGSAT 304
Cdd:smart01104    4 TPAWGASGskTPAWGSR-------TPGTAAGGAPTARGG-SGSRTP---AWGGAGSRTP-AWGGAGPTgsrTPAWGGASA 71
                            90       100       110
                    ....*....|....*....|....*....|....*....
gi 15237657     305 PMGGVTPGAAYTPGVTPIGGidMATPTPGQLIFRGPMTP 343
Cdd:smart01104   72 WGNKSSEGSASSWAAGPGGA--YGAPTPGYGGTPSAYGP 108
PHA03247 PHA03247
large tegument protein UL36; Provisional
217-429 3.02e-05

large tegument protein UL36; Provisional


Pssm-ID: 223021 [Multi-domain]  Cd Length: 3151  Bit Score: 48.78  E-value: 3.02e-05
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 15237657   217 PDAAPGIGRWDAP----TPGRVSDATPSAGRRNRWDETPTPGRVTDSDATPGGGVTPGATPSGVTWDGLATPTPKRQRSR 292
Cdd:PHA03247 2652 PRDDPAPGRVSRPrrarRLGRAAQASSPPQRPRRRAARPTVGSLTSLADPPPPPPTPEPAPHALVSATPLPPGPAAARQA 2731
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 15237657   293 WDETPAT-MGSATPMGGVTPGAAYTPGvTPIGGIDMATPTPGqlifRGPMTPEQLNMQRwekdieERNRPLSDEELDAMF 371
Cdd:PHA03247 2732 SPALPAApAPPAVPAGPATPGGPARPA-RPPTTAGPPAPAPP----AAPAAGPPRRLTR------PAVASLSESRESLPS 2800
                         170       180       190       200       210
                  ....*....|....*....|....*....|....*....|....*....|....*....
gi 15237657   372 PKDGykvlDPPATYVPIRTPARKLQQTP-TPMATPGYVIPEEnrgqqydvPPEVPGGLP 429
Cdd:PHA03247 2801 PWDP----ADPPAAVLAPAAALPPAASPaGPLPPPTSAQPTA--------PPPPPGPPP 2847
HEAT_EZ pfam13513
HEAT-like repeat; The HEAT repeat family is related to armadillo/beta-catenin-like repeats ...
952-1007 2.07e-03

HEAT-like repeat; The HEAT repeat family is related to armadillo/beta-catenin-like repeats (see pfam00514). These EZ repeats are found in subunits of cyanobacterial phycocyanin lyase and other proteins and probably carry out a scaffolding role.


Pssm-ID: 463906 [Multi-domain]  Cd Length: 55  Bit Score: 37.35  E-value: 2.07e-03
                           10        20        30        40        50
                   ....*....|....*....|....*....|....*....|....*....|....*.
gi 15237657    952 PEVLGSILGALKAIVNViGMTKMTPPIKDLLPRLTPILKNRHEKVQENCIDLVGRI 1007
Cdd:pfam13513    1 WRVREAAALALGSLAEG-GPDLLAPAVPELLPALLPLLNDDSDLVREAAAWALGRL 55
PHA03377 PHA03377
EBNA-3C; Provisional
217-421 2.20e-03

EBNA-3C; Provisional


Pssm-ID: 177614 [Multi-domain]  Cd Length: 1000  Bit Score: 42.35  E-value: 2.20e-03
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 15237657   217 PDAAP---GIGRWDAPTPGRVSDATPSAGRRNRWDETPTPGRVTDSDATPGGGVTPGATPSGV---TWDGLAT-PTPKRQ 289
Cdd:PHA03377  568 PVMAPpstGPRVMATPSTGPRDMAPPSTGPRQQAKCKDGPPASGPHEKQPPSSAPRDMAPSVVrmfLRERLLEqSTGPKP 647
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 15237657   290 RSRWD-----------ETPAT-MGSATPmgGVTPGAAYTPGVTPIGGIDMATPTPGQLIFRGPMTP-------EQLNMQR 350
Cdd:PHA03377  648 KSFWEmragrdgsgiqQEPSSrRQPATQ--STPPRPSWLPSVFVLPSVDAGRAQPSEESHLSSMSPtqpisheEQPRYED 725
                         170       180       190       200       210       220       230
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|...
gi 15237657   351 WEKDIEERNRPLSDEELDAMFPKDGYKvlDPPATYVPIRTparklQQTPTPMATP--GYVIPEENRGQQYDVP 421
Cdd:PHA03377  726 PDDPLDLSLHPDQAPPPSHQAPYSGHE--EPQAQQAPYPG-----YWEPRPPQAPylGYQEPQAQGVQVSSYP 791
PHA03247 PHA03247
large tegument protein UL36; Provisional
219-429 8.72e-03

large tegument protein UL36; Provisional


Pssm-ID: 223021 [Multi-domain]  Cd Length: 3151  Bit Score: 40.69  E-value: 8.72e-03
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 15237657   219 AAPGIGRWDAPTPGRVSDATPSAGRRNRWDETPTPGRVTDSDATPGGGVTPGATPSGVTWDGLATPTPKrqrSRWDETPA 298
Cdd:PHA03247 2731 ASPALPAAPAPPAVPAGPATPGGPARPARPPTTAGPPAPAPPAAPAAGPPRRLTRPAVASLSESRESLP---SPWDPADP 2807
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 15237657   299 TMGSATPMGGVTPGAAYTPGV-TPIGGIDMATPTPgqlifRGPMtPEQLNMQRWEK---DIEERNRPLSDEELDAMFPKD 374
Cdd:PHA03247 2808 PAAVLAPAAALPPAASPAGPLpPPTSAQPTAPPPP-----PGPP-PPSLPLGGSVApggDVRRRPPSRSPAAKPAAPARP 2881
                         170       180       190       200       210
                  ....*....|....*....|....*....|....*....|....*....|....*...
gi 15237657   375 GYKVLDPPA---TYVPIRTPARKLQQTPTPMATPGYVIPEENRGQQYDVPPEVPGGLP 429
Cdd:PHA03247 2882 PVRRLARPAvsrSTESFALPPDQPERPPQPQAPPPPQPQPQPPPPPQPQPPPPPPPRP 2939
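
One way to compare the hits listed above is by how much of each domain model the alignment spans. The sketch below is an illustration only: `model_coverage` is a hypothetical helper, and the tuples are transcribed from the `Cdd:` row coordinates and `Cd Length` values shown in the alignments.

```python
# (accession, first model position, last model position, Cd Length),
# transcribed from the alignment blocks in this report.
HIT_SPANS = [
    ("COG5181", 72, 969, 975),
    ("pfam08920", 1, 114, 114),
    ("PHA03378", 686, 920, 991),
    ("smart01104", 4, 108, 121),
    ("PHA03247", 2652, 2847, 3151),
    ("pfam13513", 1, 55, 55),
    ("PHA03377", 568, 791, 1000),
    ("PHA03247", 2731, 2939, 3151),
]


def model_coverage(first: int, last: int, cd_length: int) -> float:
    """Fraction of the domain model spanned by the alignment (inclusive ends)."""
    return (last - first + 1) / cd_length


for accession, first, last, cd_length in HIT_SPANS:
    coverage = model_coverage(first, last, cd_length)
    print(f"{accession:<12} {first:>5}-{last:<5} of {cd_length:<5} {coverage:6.1%}")
```

The span includes any gaps inside the alignment, so it is an upper bound on the fraction of model positions that are actually matched.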
 
Blast search parameters
Data Source: Precalculated data, version = cdd.v.3.21
Preset Options: Database: CDSEARCH/cdd; Low complexity filter: no; Composition Based Adjustment: yes; E-value threshold: 0.01
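
The E-value threshold in the preset options determines which hits are reported. The sketch below shows how such a cutoff could be applied to the hits on this page. It is an assumption-laden illustration, not the CD-Search implementation: the `Hit` type and `filter_hits` function are hypothetical, the comparison is assumed to be inclusive, and the values are transcribed from the hit list in this report.

```python
from typing import NamedTuple


class Hit(NamedTuple):
    name: str
    accession: str
    start: int      # first query residue of the interval
    end: int        # last query residue of the interval
    evalue: float


# Values transcribed from the domain hit list in this report.
HITS = [
    Hit("HSH155", "COG5181", 385, 1262, 0.0),
    Hit("SF3b1", "pfam08920", 287, 406, 2.27e-62),
    Hit("PHA03378", "PHA03378", 216, 429, 1.05e-08),
    Hit("CTD", "smart01104", 230, 343, 1.37e-06),
    Hit("PHA03247", "PHA03247", 217, 429, 3.02e-05),
    Hit("HEAT_EZ", "pfam13513", 952, 1007, 2.07e-03),
    Hit("PHA03377", "PHA03377", 217, 421, 2.20e-03),
    Hit("PHA03247", "PHA03247", 219, 429, 8.72e-03),
]


def filter_hits(hits, threshold=0.01):
    """Keep hits with E-value <= threshold, most significant first."""
    return sorted((h for h in hits if h.evalue <= threshold),
                  key=lambda h: h.evalue)


for hit in filter_hits(HITS, threshold=1e-5):
    print(f"{hit.name:<10} {hit.accession:<12} "
          f"{hit.start}-{hit.end}  {hit.evalue:.2e}")
```

With the default threshold of 0.01 every hit on this page is retained; tightening it to `1e-5`, as in the usage loop, would leave only the first four.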
