Conserved domains on  [gi|319918875|ref|NP_001025246|]

serine/arginine repetitive matrix protein 2 [Danio rerio]


List of domain hits

Name Accession Description Interval E-value
cwf21_SRRM2-like cd21373
cwf21 domain found in serine/arginine repetitive matrix proteins, SRRM2, SRRM3 and similar ...
55-103 2.58e-20

cwf21 domain found in serine/arginine repetitive matrix proteins, SRRM2, SRRM3 and similar proteins; This subfamily includes SRRM2 and SRRM3, both of which contain a cwf21 domain at the N-terminus. SRRM2, also called 300 kDa nuclear matrix antigen, serine/arginine-rich splicing factor-related nuclear matrix protein of 300 kDa, SR-related nuclear matrix protein of 300 kDa, Ser/Arg-related nuclear matrix protein of 300 kDa, splicing coactivator subunit SRm300, or Tax-responsive enhancer element-binding protein 803 (TaxREB803), is required for pre-mRNA splicing as a component of the spliceosome. SRRM3 may play a role in regulating breast cancer cell invasiveness. It may be involved in RYBP-mediated breast cancer progression. The cwf21 domain is involved in mRNA splicing; it binds directly to the spliceosomal protein Prp8.



Pssm-ID: 410600 [Multi-domain]  Cd Length: 50  Bit Score: 85.32  E-value: 2.58e-20
                          10        20        30        40
                  ....*....|....*....|....*....|....*....|....*....
gi 319918875   55 PNADILEHQRKRQLEVKCAELQDMMEEQGYSAEEIEEKVNTFRLMLQER 103
Cdd:cd21373     1 PNKEILDHERKRKIEVKCLELEDLLEEQGYTEEEIQAKVDEYRALLLEK 49
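
The alignment above pairs the query row (gi 319918875, residues 55-103) with the cd21373 consensus row. As a rough illustration of how such a pair can be compared, the following Python sketch counts identical columns. The function name percent_identity is hypothetical and not part of any NCBI tool. Treating lowercase letters as ordinary residues and skipping columns that contain a gap are simplifying assumptions. The result is not the bit score or E-value reported on the page.

```python
def percent_identity(query: str, subject: str) -> float:
    """Percent of gap-free aligned columns in which both rows carry the same residue."""
    if len(query) != len(subject):
        raise ValueError("aligned rows must have equal length")
    # Assumption: columns with a gap ('-') in either row are skipped,
    # and lowercase residues are compared case-insensitively.
    columns = [
        (q.upper(), s.upper())
        for q, s in zip(query, subject)
        if q != "-" and s != "-"
    ]
    if not columns:
        return 0.0
    matches = sum(1 for q, s in columns if q == s)
    return 100.0 * matches / len(columns)


# Rows copied from the cd21373 alignment shown above.
query = "PNADILEHQRKRQLEVKCAELQDMMEEQGYSAEEIEEKVNTFRLMLQER"
subject = "PNKEILDHERKRKIEVKCLELEDLLEEQGYTEEEIQAKVDEYRALLLEK"
print(f"{percent_identity(query, subject):.1f}% identity over {len(query)} columns")
```

The same function can be applied to the gapped alignments further down by concatenating the row segments of each block before calling it.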
PRK12678 super family cl36163
transcription termination factor Rho; Provisional
289-516 1.49e-08

transcription termination factor Rho; Provisional


The actual alignment was detected with superfamily member PRK12678:

Pssm-ID: 237171 [Multi-domain]  Cd Length: 672  Bit Score: 59.15  E-value: 1.49e-08
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 319918875  289 TDSRPEEVRKGRSPDRRRRGRSQESPKRMEIRERSPRRSRTPEQNKRYKGREREREQLQEKPQRKRNDSSSRSPPPKQQL 368
Cdd:PRK12678   73 PAAAARRAARAAAAARQAEQPAAEAAAAKAEAAPAARAAAAAAAEAASAPEAAQARERRERGEAARRGAARKAGEGGEQP 152
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 319918875  369 DRRPRSEEREKPPQTRRHDSSSPSPPPNKQRQDRRQRSEERLKaplRKRPDSSPRSPSPKQQRDRRDDGKKNKVQSRHDS 448
Cdd:PRK12678  153 ATEARADAAERTEEEERDERRRRGDREDRQAEAERGERGRREE---RGRDGDDRDRRDRREQGDRREERGRRDGGDRRGR 229
                         170       180       190       200       210       220
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*...
gi 319918875  449 SSSSSTSSPSPSPSPRKDRMRRGhsgekarapspRDREKERARGGERisDRDKSRKEDRGRDGDKAKE 516
Cdd:PRK12678  230 RRRRDRRDARGDDNREDRGDRDG-----------DDGEGRGGRRGRR--FRDRDRRGRRGGDGGNERE 284
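
Each hit on this page carries a summary line of the form "Pssm-ID: ... Cd Length: ... Bit Score: ... E-value: ...". The sketch below shows one possible way to read those four values with a regular expression. The pattern and the function name parse_summary are assumptions based only on the lines shown here, not an official NCBI format specification.

```python
import re

# Assumed layout, inferred from the summary lines on this page.
SUMMARY_RE = re.compile(
    r"Pssm-ID:\s*(?P<pssm_id>\d+)"
    r".*?Cd Length:\s*(?P<cd_length>\d+)"
    r"\s+Bit Score:\s*(?P<bit_score>[\d.]+)"
    r"\s+E-value:\s*(?P<e_value>[\d.eE+-]+)"
)


def parse_summary(line: str) -> dict:
    """Extract Pssm-ID, Cd Length, Bit Score and E-value from one summary line."""
    match = SUMMARY_RE.search(line)
    if match is None:
        raise ValueError(f"not a hit summary line: {line!r}")
    return {
        "pssm_id": int(match["pssm_id"]),
        "cd_length": int(match["cd_length"]),
        "bit_score": float(match["bit_score"]),
        "e_value": float(match["e_value"]),
    }


# Summary line copied from the PRK12678 hit above.
hit = parse_summary(
    "Pssm-ID: 237171 [Multi-domain]  Cd Length: 672  Bit Score: 59.15  E-value: 1.49e-08"
)
print(hit)
```

Parsed this way, hits can be sorted by E-value or filtered by a threshold chosen by the reader.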
SF-CC1 super family cl36939
splicing factor, CC1-like family; This model represents a subfamily of RNA splicing factors ...
483-585 4.38e-04

splicing factor, CC1-like family; This model represents a subfamily of RNA splicing factors including the Pad-1 protein (N. crassa), CAPER (M. musculus) and CC1.3 (H. sapiens). These proteins are characterized by an N-terminal arginine-rich, low complexity domain followed by three (or in the case of 4 H. sapiens paralogs, two) RNA recognition domains (rrm: pfam00076). These splicing factors are closely related to the U2AF splicing factor family (TIGR01642). A homologous gene from Plasmodium falciparum was identified in the course of the analysis of that genome at TIGR and was included in the seed.


The actual alignment was detected with superfamily member TIGR01622:

Pssm-ID: 273721 [Multi-domain]  Cd Length: 494  Bit Score: 44.52  E-value: 4.38e-04
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 319918875   483 RDREKERARGGERISDRDKSRKEDRGRDGDKAKEQARNRSSDSVSPKRSPFANGRQKERELEKERQEQRDREKRREEEMR 562
Cdd:TIGR01622    3 RDRERERLRDSSSAGDRDRRRDKGRERSRDRSRDRERSRSRRRDRHRDRDYYRGRERRSRSRRPNRRYRPREKRRRRGDS 82
                           90       100
                   ....*....|....*....|...
gi 319918875   563 EEALRKARESDREKERNRSRREE 585
Cdd:TIGR01622   83 YRRRRDDRRSRREKPRARDGTPE 105
PHA03307 super family cl33723
transcriptional regulator ICP4; Provisional
744-918 8.17e-04

transcriptional regulator ICP4; Provisional


The actual alignment was detected with superfamily member PHA03307:

Pssm-ID: 223039 [Multi-domain]  Cd Length: 1352  Bit Score: 44.01  E-value: 8.17e-04
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 319918875  744 PPIQRQSSPPEPQSKRDDVEKKQRDPERERRPGQSSSSFTVMNDRERGKERYTPTETSSPPLSPPQRVLDRAAQVGERYM 823
Cdd:PHA03307  203 SPRPPRRSSPISASASSPAPAPGRSAADDAGASSSDSSSSESSGCGWGPENECPLPRPAPITLPTRIWEASGWNGPSSRP 282
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 319918875  824 PSGESQSQSRGRGGERYSPSE------------LEQESSRPSPSRGRDRQSDQRKEAPPLSPREKGRIDAPQVKQAASRP 891
Cdd:PHA03307  283 GPASSSSSPRERSPSPSPSSPgsgpapssprasSSSSSSRESSSSSTSSSSESSRGAAVSPGPSPSRSPSPSRPPPPADP 362
                         170       180
                  ....*....|....*....|....*..
gi 319918875  892 SPKRTPPRQYQDPQRSLSPSPRRGVRR 918
Cdd:PHA03307  363 SSPRKRPRPSRAPSSPAASAGRPTRRR 389
 
Name Accession Description Interval E-value
cwf21_SRRM2-like cd21373
cwf21 domain found in serine/arginine repetitive matrix proteins, SRRM2, SRRM3 and similar ...
55-103 2.58e-20

cwf21 domain found in serine/arginine repetitive matrix proteins, SRRM2, SRRM3 and similar proteins; This subfamily includes SRRM2 and SRRM3, both of which contain a cwf21 domain at the N-terminus. SRRM2, also called 300 kDa nuclear matrix antigen, serine/arginine-rich splicing factor-related nuclear matrix protein of 300 kDa, SR-related nuclear matrix protein of 300 kDa, Ser/Arg-related nuclear matrix protein of 300 kDa, splicing coactivator subunit SRm300, or Tax-responsive enhancer element-binding protein 803 (TaxREB803), is required for pre-mRNA splicing as a component of the spliceosome. SRRM3 may play a role in regulating breast cancer cell invasiveness. It may be involved in RYBP-mediated breast cancer progression. The cwf21 domain is involved in mRNA splicing; it binds directly to the spliceosomal protein Prp8.


Pssm-ID: 410600 [Multi-domain]  Cd Length: 50  Bit Score: 85.32  E-value: 2.58e-20
                          10        20        30        40
                  ....*....|....*....|....*....|....*....|....*....
gi 319918875   55 PNADILEHQRKRQLEVKCAELQDMMEEQGYSAEEIEEKVNTFRLMLQER 103
Cdd:cd21373     1 PNKEILDHERKRKIEVKCLELEDLLEEQGYTEEEIQAKVDEYRALLLEK 49
cwf21 pfam08312
cwf21 domain; The cwf21 family is involved in mRNA splicing. It has been isolated as a ...
60-103 8.25e-12

cwf21 domain; The cwf21 family is involved in mRNA splicing. It has been isolated as a subcomplex of the spliceosome in Schizosaccharomyces pombe. The function of the cwf21 domain is to bind directly to the spliceosomal protein Prp8. Mutations in the cwf21 domain prevent Prp8 from binding. The structure of this domain has recently been solved, showing that it is composed of two alpha helices.


Pssm-ID: 462421 [Multi-domain]  Cd Length: 44  Bit Score: 60.90  E-value: 8.25e-12
                           10        20        30        40
                   ....*....|....*....|....*....|....*....|....
gi 319918875    60 LEHQRKRQLEVKCAELQDMMEEQGYSAEEIEEKVNTFRLMLQER 103
Cdd:pfam08312    1 LEHERKREIEVKVLELRDELEEQGLSEEEIEEKVDELRKKLLAE 44
PRK12678 PRK12678
transcription termination factor Rho; Provisional
289-516 1.49e-08

transcription termination factor Rho; Provisional


Pssm-ID: 237171 [Multi-domain]  Cd Length: 672  Bit Score: 59.15  E-value: 1.49e-08
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 319918875  289 TDSRPEEVRKGRSPDRRRRGRSQESPKRMEIRERSPRRSRTPEQNKRYKGREREREQLQEKPQRKRNDSSSRSPPPKQQL 368
Cdd:PRK12678   73 PAAAARRAARAAAAARQAEQPAAEAAAAKAEAAPAARAAAAAAAEAASAPEAAQARERRERGEAARRGAARKAGEGGEQP 152
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 319918875  369 DRRPRSEEREKPPQTRRHDSSSPSPPPNKQRQDRRQRSEERLKaplRKRPDSSPRSPSPKQQRDRRDDGKKNKVQSRHDS 448
Cdd:PRK12678  153 ATEARADAAERTEEEERDERRRRGDREDRQAEAERGERGRREE---RGRDGDDRDRRDRREQGDRREERGRRDGGDRRGR 229
                         170       180       190       200       210       220
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*...
gi 319918875  449 SSSSSTSSPSPSPSPRKDRMRRGhsgekarapspRDREKERARGGERisDRDKSRKEDRGRDGDKAKE 516
Cdd:PRK12678  230 RRRRDRRDARGDDNREDRGDRDG-----------DDGEGRGGRRGRR--FRDRDRRGRRGGDGGNERE 284
SF-CC1 TIGR01622
splicing factor, CC1-like family; This model represents a subfamily of RNA splicing factors ...
483-585 4.38e-04

splicing factor, CC1-like family; This model represents a subfamily of RNA splicing factors including the Pad-1 protein (N. crassa), CAPER (M. musculus) and CC1.3 (H. sapiens). These proteins are characterized by an N-terminal arginine-rich, low complexity domain followed by three (or in the case of 4 H. sapiens paralogs, two) RNA recognition domains (rrm: pfam00076). These splicing factors are closely related to the U2AF splicing factor family (TIGR01642). A homologous gene from Plasmodium falciparum was identified in the course of the analysis of that genome at TIGR and was included in the seed.


Pssm-ID: 273721 [Multi-domain]  Cd Length: 494  Bit Score: 44.52  E-value: 4.38e-04
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 319918875   483 RDREKERARGGERISDRDKSRKEDRGRDGDKAKEQARNRSSDSVSPKRSPFANGRQKERELEKERQEQRDREKRREEEMR 562
Cdd:TIGR01622    3 RDRERERLRDSSSAGDRDRRRDKGRERSRDRSRDRERSRSRRRDRHRDRDYYRGRERRSRSRRPNRRYRPREKRRRRGDS 82
                           90       100
                   ....*....|....*....|...
gi 319918875   563 EEALRKARESDREKERNRSRREE 585
Cdd:TIGR01622   83 YRRRRDDRRSRREKPRARDGTPE 105
PHA03307 PHA03307
transcriptional regulator ICP4; Provisional
744-918 8.17e-04

transcriptional regulator ICP4; Provisional


Pssm-ID: 223039 [Multi-domain]  Cd Length: 1352  Bit Score: 44.01  E-value: 8.17e-04
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 319918875  744 PPIQRQSSPPEPQSKRDDVEKKQRDPERERRPGQSSSSFTVMNDRERGKERYTPTETSSPPLSPPQRVLDRAAQVGERYM 823
Cdd:PHA03307  203 SPRPPRRSSPISASASSPAPAPGRSAADDAGASSSDSSSSESSGCGWGPENECPLPRPAPITLPTRIWEASGWNGPSSRP 282
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 319918875  824 PSGESQSQSRGRGGERYSPSE------------LEQESSRPSPSRGRDRQSDQRKEAPPLSPREKGRIDAPQVKQAASRP 891
Cdd:PHA03307  283 GPASSSSSPRERSPSPSPSSPgsgpapssprasSSSSSSRESSSSSTSSSSESSRGAAVSPGPSPSRSPSPSRPPPPADP 362
                         170       180
                  ....*....|....*....|....*..
gi 319918875  892 SPKRTPPRQYQDPQRSLSPSPRRGVRR 918
Cdd:PHA03307  363 SSPRKRPRPSRAPSSPAASAGRPTRRR 389
PRK12678 PRK12678
transcription termination factor Rho; Provisional
431-626 8.87e-04

transcription termination factor Rho; Provisional


Pssm-ID: 237171 [Multi-domain]  Cd Length: 672  Bit Score: 43.74  E-value: 8.87e-04
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 319918875  431 RDRRDDGKKNKVQSRHDSSSSSSTSSPSPSPSPRKDRMRRGHSGEKARAPSPRDREKERARGGERISDRDKSRKEDRGRD 510
Cdd:PRK12678   77 ARRAARAAAAARQAEQPAAEAAAAKAEAAPAARAAAAAAAEAASAPEAAQARERRERGEAARRGAARKAGEGGEQPATEA 156
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 319918875  511 GDKAKEQARNRSSDSVSPKRSPFANGRQKERELEKERQEQRDREKRREEEMREEALRKARESDREKERNRSRREEVSHSD 590
Cdd:PRK12678  157 RADAAERTEEEERDERRRRGDREDRQAEAERGERGRREERGRDGDDRDRRDRREQGDRREERGRRDGGDRRGRRRRRDRR 236
                         170       180       190
                  ....*....|....*....|....*....|....*.
gi 319918875  591 RTSSSRCQPENRRGIRGSEAEQEVLKKDRRMEEDKR 626
Cdd:PRK12678  237 DARGDDNREDRGDRDGDDGEGRGGRRGRRFRDRDRR 272
PTZ00121 PTZ00121
MAEBL; Provisional
323-771 1.84e-03

MAEBL; Provisional


Pssm-ID: 173412 [Multi-domain]  Cd Length: 2084  Bit Score: 42.82  E-value: 1.84e-03
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 319918875  323 SPRRSRTPEQNKRYKGREREREQLQEKPQRKRNDSSSRSPPPKQQLDRRprSEEREKPPQTRRHDSSSPSPPPNKQRQDR 402
Cdd:PTZ00121 1068 QDEGLKPSYKDFDFDAKEDNRADEATEEAFGKAEEAKKTETGKAEEARK--AEEAKKKAEDARKAEEARKAEDARKAEEA 1145
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 319918875  403 RQRSEERLKAPLRKRPDSSPRSPSPKQQRDRRDDGKKNKVQSRHDSSSSSSTSSPSPSPSPRKDRMRRGhsgEKARAPSP 482
Cdd:PTZ00121 1146 RKAEDAKRVEIARKAEDARKAEEARKAEDAKKAEAARKAEEVRKAEELRKAEDARKAEAARKAEEERKA---EEARKAED 1222
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 319918875  483 RDREKERARGGERISDRDKSRKEDRGRDGDKAKEQARNRSSDSVSPKRSPFANGRQKERELEKERQEQRDREKRREEEMR 562
Cdd:PTZ00121 1223 AKKAEAVKKAEEAKKDAEEAKKAEEERNNEEIRKFEEARMAHFARRQAAIKAEEARKADELKKAEEKKKADEAKKAEEKK 1302
                         250       260       270       280       290       300       310       320
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 319918875  563 EEALRKARESDREKERNRSRREEVSHSDRTSSSRCQPENRRGIRGSEAEQEVLKKDRRMEEDKRQEKSPPHQKMEKLAQK 642
Cdd:PTZ00121 1303 KADEAKKKAEEAKKADEAKKKAEEAKKKADAAKKKAEEAKKAAEAAKAEAEAAADEAEAAEEKAEAAEKKKEEAKKKADA 1382
                         330       340       350       360       370       380       390       400
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 319918875  643 EKTGQKDKAKAVSSSSSSSSSSSSSNSESDSDSSSSSSSSSSSSSSSSSSSDDDKKKKQSSKDSTSASKSVPNAVIAQAI 722
Cdd:PTZ00121 1383 AKKKAEEKKKADEAKKKAEEDKKKADELKKAAAAKKKADEAKKKAEEKKKADEAKKKAEEAKKADEAKKKAEEAKKAEEA 1462
                         410       420       430       440
                  ....*....|....*....|....*....|....*....|....*....
gi 319918875  723 ARREKENRVRNGESDEGRKTYPPIQRQSSPPEPQSKRDDVEKKQRDPER 771
Cdd:PTZ00121 1463 KKKAEEAKKADEAKKKAEEAKKADEAKKKAEEAKKKADEAKKAAEAKKK 1511
 
Name Accession Description Interval E-value
cwf21_SRRM2-like cd21373
cwf21 domain found in serine/arginine repetitive matrix proteins, SRRM2, SRRM3 and similar ...
55-103 2.58e-20

cwf21 domain found in serine/arginine repetitive matrix proteins, SRRM2, SRRM3 and similar proteins; This subfamily includes SRRM2 and SRRM3, both of which contain a cwf21 domain at the N-terminus. SRRM2, also called 300 kDa nuclear matrix antigen, serine/arginine-rich splicing factor-related nuclear matrix protein of 300 kDa, SR-related nuclear matrix protein of 300 kDa, Ser/Arg-related nuclear matrix protein of 300 kDa, splicing coactivator subunit SRm300, or Tax-responsive enhancer element-binding protein 803 (TaxREB803), is required for pre-mRNA splicing as a component of the spliceosome. SRRM3 may play a role in regulating breast cancer cell invasiveness. It may be involved in RYBP-mediated breast cancer progression. The cwf21 domain is involved in mRNA splicing; it binds directly to the spliceosomal protein Prp8.


Pssm-ID: 410600 [Multi-domain]  Cd Length: 50  Bit Score: 85.32  E-value: 2.58e-20
                          10        20        30        40
                  ....*....|....*....|....*....|....*....|....*....
gi 319918875   55 PNADILEHQRKRQLEVKCAELQDMMEEQGYSAEEIEEKVNTFRLMLQER 103
Cdd:cd21373     1 PNKEILDHERKRKIEVKCLELEDLLEEQGYTEEEIQAKVDEYRALLLEK 49
cwf21_SRRM3 cd21376
cwf21 domain found in serine/arginine repetitive matrix protein 3 and similar proteins; Serine ...
39-105 9.04e-18

cwf21 domain found in serine/arginine repetitive matrix protein 3 and similar proteins; Serine/arginine repetitive matrix protein 3 (SRRM3) may play a role in regulating breast cancer cell invasiveness. It may also be involved in RYBP-mediated breast cancer progression. SRRM3 contains a cwf21 domain at the N-terminus. The cwf21 domain is involved in mRNA splicing; it binds directly to the spliceosomal protein Prp8.


Pssm-ID: 410602 [Multi-domain]  Cd Length: 68  Bit Score: 78.63  E-value: 9.04e-18
                          10        20        30        40        50        60
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*..
gi 319918875   39 RDEKDKERLESQLNRQPNADILEHQRKRQLEVKCAELQDMMEEQGYSAEEIEEKVNTFRLMLQERQE 105
Cdd:cd21376     1 KSEEEIKKLDAALVKKPNREILDHERKRKVELKCMEMQELMEEQGYTEEEIRQKVSTFRQMLMEKEG 67
cwf21_SRRM2 cd21375
cwf21 domain found in serine/arginine repetitive matrix protein 2; Serine/arginine repetitive ...
41-104 1.43e-17

cwf21 domain found in serine/arginine repetitive matrix protein 2; Serine/arginine repetitive matrix protein 2 (SRRM2) is also called 300 kDa nuclear matrix antigen, serine/arginine-rich splicing factor-related nuclear matrix protein of 300 kDa, SR-related nuclear matrix protein of 300 kDa, Ser/Arg-related nuclear matrix protein of 300 kDa, splicing coactivator subunit SRm300, or Tax-responsive enhancer element-binding protein 803 (TaxREB803). It is required for pre-mRNA splicing as a component of the spliceosome. It contains a cwf21 domain at the N-terminus. The cwf21 domain is involved in mRNA splicing; it binds directly to the spliceosomal protein Prp8.


Pssm-ID: 410601 [Multi-domain]  Cd Length: 64  Bit Score: 78.13  E-value: 1.43e-17
                          10        20        30        40        50        60
                  ....*....|....*....|....*....|....*....|....*....|....*....|....
gi 319918875   41 EKDKERLESQLNRQPNADILEHQRKRQLEVKCAELQDMMEEQGYSAEEIEEKVNTFRLMLQERQ 104
Cdd:cd21375     1 EEELRRLEAALVKKPNPDILDHERKRRVELKCLELEEMMEEQGYSEEEIQEKVATFRLMLLEKD 64
cwf21_CWC21-like cd21372
cwf21 domain found in fungal complexed with CEF1 protein 21 (CWC21) and similar proteins; This ...
56-103 2.14e-14

cwf21 domain found in fungal complexed with CEF1 protein 21 (CWC21) and similar proteins; This subfamily includes complexed with CEF1 protein 21 (CWC21) from budding yeast, complexed with cdc5 protein 21 (CWF21) from fission yeast, as well as their orthologs, serine/arginine repetitive matrix proteins (SRRM2 and SRRM3) from vertebrates. Both CWC21 and CWF21 are pre-mRNA-splicing factors that may function at or prior to the first catalytic step of splicing at the catalytic center of the spliceosome, together with ISY1. SRRM2 is required for pre-mRNA splicing as a component of the spliceosome. SRRM3 may play a role in regulating breast cancer cell invasiveness. It may be involved in RYBP-mediated breast cancer progression. Members of this family contain a cwf21 domain at the N-terminus. The cwf21 domain is involved in mRNA splicing; it binds directly to the spliceosomal protein Prp8.


Pssm-ID: 410599 [Multi-domain]  Cd Length: 49  Bit Score: 68.27  E-value: 2.14e-14
                          10        20        30        40
                  ....*....|....*....|....*....|....*....|....*...
gi 319918875   56 NADILEHQRKRQLEVKCAELQDMMEEQGYSAEEIEEKVNTFRLMLQER 103
Cdd:cd21372     1 DKEILEHERKRQIELKCLELRDELEDEGLSEEEIEEKVDELREKLLKE 48
cwf21 pfam08312
cwf21 domain; The cwf21 family is involved in mRNA splicing. It has been isolated as a ...
60-103 8.25e-12

cwf21 domain; The cwf21 family is involved in mRNA splicing. It has been isolated as a subcomplex of the spliceosome in Schizosaccharomyces pombe. The function of the cwf21 domain is to bind directly to the spliceosomal protein Prp8. Mutations in the cwf21 domain prevent Prp8 from binding. The structure of this domain has recently been solved, showing that it is composed of two alpha helices.


Pssm-ID: 462421 [Multi-domain]  Cd Length: 44  Bit Score: 60.90  E-value: 8.25e-12
                           10        20        30        40
                   ....*....|....*....|....*....|....*....|....
gi 319918875    60 LEHQRKRQLEVKCAELQDMMEEQGYSAEEIEEKVNTFRLMLQER 103
Cdd:pfam08312    1 LEHERKREIEVKVLELRDELEEQGLSEEEIEEKVDELRKKLLAE 44
PRK12678 PRK12678
transcription termination factor Rho; Provisional
289-516 1.49e-08

transcription termination factor Rho; Provisional


Pssm-ID: 237171 [Multi-domain]  Cd Length: 672  Bit Score: 59.15  E-value: 1.49e-08
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 319918875  289 TDSRPEEVRKGRSPDRRRRGRSQESPKRMEIRERSPRRSRTPEQNKRYKGREREREQLQEKPQRKRNDSSSRSPPPKQQL 368
Cdd:PRK12678   73 PAAAARRAARAAAAARQAEQPAAEAAAAKAEAAPAARAAAAAAAEAASAPEAAQARERRERGEAARRGAARKAGEGGEQP 152
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 319918875  369 DRRPRSEEREKPPQTRRHDSSSPSPPPNKQRQDRRQRSEERLKaplRKRPDSSPRSPSPKQQRDRRDDGKKNKVQSRHDS 448
Cdd:PRK12678  153 ATEARADAAERTEEEERDERRRRGDREDRQAEAERGERGRREE---RGRDGDDRDRRDRREQGDRREERGRRDGGDRRGR 229
                         170       180       190       200       210       220
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*...
gi 319918875  449 SSSSSTSSPSPSPSPRKDRMRRGhsgekarapspRDREKERARGGERisDRDKSRKEDRGRDGDKAKE 516
Cdd:PRK12678  230 RRRRDRRDARGDDNREDRGDRDG-----------DDGEGRGGRRGRR--FRDRDRRGRRGGDGGNERE 284
PTZ00121 PTZ00121
MAEBL; Provisional
228-653 7.63e-06

MAEBL; Provisional


Pssm-ID: 173412 [Multi-domain]  Cd Length: 2084  Bit Score: 50.52  E-value: 7.63e-06
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 319918875  228 REESVNSDTESSSSDEKETSKGKRKRSDNEAPPPSKSRRRQSASSSPARSQSPQRGKQQKKTDS---RPEEVRKGRSPDR 304
Cdd:PTZ00121 1062 AKAHVGQDEGLKPSYKDFDFDAKEDNRADEATEEAFGKAEEAKKTETGKAEEARKAEEAKKKAEdarKAEEARKAEDARK 1141
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 319918875  305 RRRGRSQESPKRMEIRERSPRRSRTPEQNkryKGREREREQLQEKPQRKRNDSSSRSPPPKQQLDRRPRSEEREKPPQTR 384
Cdd:PTZ00121 1142 AEEARKAEDAKRVEIARKAEDARKAEEAR---KAEDAKKAEAARKAEEVRKAEELRKAEDARKAEAARKAEEERKAEEAR 1218
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 319918875  385 RHDSSSPSPPPNKQRQDRRQRSEERLKAPLRKRPDSSPRSPSPKQQRDRRDDGKKNKVQSRHDSSSSSSTSSPSPSPSPR 464
Cdd:PTZ00121 1219 KAEDAKKAEAVKKAEEAKKDAEEAKKAEEERNNEEIRKFEEARMAHFARRQAAIKAEEARKADELKKAEEKKKADEAKKA 1298
                         250       260       270       280       290       300       310       320
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 319918875  465 KDRMRRGHSGEKARAPSPRDREKERARGGERISDRDKSRKEDRGRDGDKAKEQARNRSSDSVSPKRSPFANGRQKERE-- 542
Cdd:PTZ00121 1299 EEKKKADEAKKKAEEAKKADEAKKKAEEAKKKADAAKKKAEEAKKAAEAAKAEAEAAADEAEAAEEKAEAAEKKKEEAkk 1378
                         330       340       350       360       370       380       390       400
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 319918875  543 --------LEKERQEQRDREKRREEEMREEALRKARESDREKERNRSRREEVSHSDRTSSsrcQPENRRgiRGSEAEQEV 614
Cdd:PTZ00121 1379 kadaakkkAEEKKKADEAKKKAEEDKKKADELKKAAAAKKKADEAKKKAEEKKKADEAKK---KAEEAK--KADEAKKKA 1453
                         410       420       430
                  ....*....|....*....|....*....|....*....
gi 319918875  615 LKKDRRMEEDKRQEKSPPHQKMEKLAQKEKTGQKDKAKA 653
Cdd:PTZ00121 1454 EEAKKAEEAKKKAEEAKKADEAKKKAEEAKKADEAKKKA 1492
PRK12678 PRK12678
transcription termination factor Rho; Provisional
354-583 1.77e-04

transcription termination factor Rho; Provisional


Pssm-ID: 237171 [Multi-domain]  Cd Length: 672  Bit Score: 46.05  E-value: 1.77e-04
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 319918875  354 RNDSSSRSPPPKQQLDRRPRSEEREKPPQTRRHDSSSPSPPPNKQRQDRRQRSEERLKAPLRKRPDSSPRSPSPKQQRDR 433
Cdd:PRK12678   59 RGGGAAAAAATPAAPAAAARRAARAAAAARQAEQPAAEAAAAKAEAAPAARAAAAAAAEAASAPEAAQARERRERGEAAR 138
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 319918875  434 RDDGKKNKVQSRHDSSSSSSTSSPSPSPSPRKDRMRRGHSGEKARapspRDREKERARGGERISDRDKSRKEDRGRDGDK 513
Cdd:PRK12678  139 RGAARKAGEGGEQPATEARADAAERTEEEERDERRRRGDREDRQA----EAERGERGRREERGRDGDDRDRRDRREQGDR 214
                         170       180       190       200       210       220       230
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 319918875  514 AKEQARNRSSDSVSpkrspfaNGRQKERELEKERQEQRDREKRREEEMREEAlrkARESDREKERNRSRR 583
Cdd:PRK12678  215 REERGRRDGGDRRG-------RRRRRDRRDARGDDNREDRGDRDGDDGEGRG---GRRGRRFRDRDRRGR 274
cwf21 cd21369
cwf21 domain; The cwf21 domain is involved in mRNA splicing; it binds directly to the ...
58-103 1.91e-04

cwf21 domain; The cwf21 domain is involved in mRNA splicing; it binds directly to the spliceosomal protein Prp8. Mutations in the cwf21 domain prevent its binding to Prp8. The domain is composed of two alpha helices. Proteins containing the cwf21 domain include complexed with CEF1 protein 21 (CWC21) from budding yeast, complexed with cdc5 protein 21 (CWF21) from fission yeast, as well as their orthologs, serine/arginine repetitive matrix proteins (SRRM2 and SRRM3) from vertebrates. This domain family also includes U2-associated protein SR140 from Eumetazoa, protein RRC1, and similar proteins from plants.


Pssm-ID: 410596 [Multi-domain]  Cd Length: 48  Bit Score: 40.15  E-value: 1.91e-04
                          10        20        30        40
                  ....*....|....*....|....*....|....*....|....*..
gi 319918875   58 DILEHQRKRQLEVKCAELQDMMEEQG-YSAEEIEEKVNTFRLMLQER 103
Cdd:cd21369     2 DEEKRAKKREIELKVMELRDELEEQGrKPEQQIQEKVEHYRDKLLQR 48
SF-CC1 TIGR01622
splicing factor, CC1-like family; This model represents a subfamily of RNA splicing factors ...
483-585 4.38e-04

splicing factor, CC1-like family; This model represents a subfamily of RNA splicing factors including the Pad-1 protein (N. crassa), CAPER (M. musculus) and CC1.3 (H. sapiens). These proteins are characterized by an N-terminal arginine-rich, low complexity domain followed by three (or in the case of 4 H. sapiens paralogs, two) RNA recognition domains (rrm: pfam00076). These splicing factors are closely related to the U2AF splicing factor family (TIGR01642). A homologous gene from Plasmodium falciparum was identified in the course of the analysis of that genome at TIGR and was included in the seed.


Pssm-ID: 273721 [Multi-domain]  Cd Length: 494  Bit Score: 44.52  E-value: 4.38e-04
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 319918875   483 RDREKERARGGERISDRDKSRKEDRGRDGDKAKEQARNRSSDSVSPKRSPFANGRQKERELEKERQEQRDREKRREEEMR 562
Cdd:TIGR01622    3 RDRERERLRDSSSAGDRDRRRDKGRERSRDRSRDRERSRSRRRDRHRDRDYYRGRERRSRSRRPNRRYRPREKRRRRGDS 82
                           90       100
                   ....*....|....*....|...
gi 319918875   563 EEALRKARESDREKERNRSRREE 585
Cdd:TIGR01622   83 YRRRRDDRRSRREKPRARDGTPE 105
PRK12678 PRK12678
transcription termination factor Rho; Provisional
228-506 7.67e-04

transcription termination factor Rho; Provisional


Pssm-ID: 237171 [Multi-domain]  Cd Length: 672  Bit Score: 43.74  E-value: 7.67e-04
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 319918875  228 REESVNSDTESSSSDEKETSKGKRKRSDNEAPPPSKSRRRQSASSSPARSQSPQRGKQQKKTDSRPEEVRKGRSPDRRRR 307
Cdd:PRK12678   69 TPAAPAAAARRAARAAAAARQAEQPAAEAAAAKAEAAPAARAAAAAAAEAASAPEAAQARERRERGEAARRGAARKAGEG 148
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 319918875  308 GRSQESPKRMEIRERSPRRSRTPEQNKRYKGREREREQLQEKPQRKRNDSSSRSPPPKQQLDRRPRSEEREKPPQTRRHD 387
Cdd:PRK12678  149 GEQPATEARADAAERTEEEERDERRRRGDREDRQAEAERGERGRREERGRDGDDRDRRDRREQGDRREERGRRDGGDRRG 228
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 319918875  388 ssspspppnkQRQDRRQRSEerlkaplrkrpdssprspspkQQRDRRDDGKKNKVQSRHDsssssstsspspspsPRKDR 467
Cdd:PRK12678  229 ----------RRRRRDRRDA---------------------RGDDNREDRGDRDGDDGEG---------------RGGRR 262
                         250       260       270
                  ....*....|....*....|....*....|....*....
gi 319918875  468 MRRGhsgekarapspRDREKERARGGERISDRDKSRKED 506
Cdd:PRK12678  263 GRRF-----------RDRDRRGRRGGDGGNEREPELRED 290
PHA03307 PHA03307
transcriptional regulator ICP4; Provisional
744-918 8.17e-04

transcriptional regulator ICP4; Provisional


Pssm-ID: 223039 [Multi-domain]  Cd Length: 1352  Bit Score: 44.01  E-value: 8.17e-04
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 319918875  744 PPIQRQSSPPEPQSKRDDVEKKQRDPERERRPGQSSSSFTVMNDRERGKERYTPTETSSPPLSPPQRVLDRAAQVGERYM 823
Cdd:PHA03307  203 SPRPPRRSSPISASASSPAPAPGRSAADDAGASSSDSSSSESSGCGWGPENECPLPRPAPITLPTRIWEASGWNGPSSRP 282
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 319918875  824 PSGESQSQSRGRGGERYSPSE------------LEQESSRPSPSRGRDRQSDQRKEAPPLSPREKGRIDAPQVKQAASRP 891
Cdd:PHA03307  283 GPASSSSSPRERSPSPSPSSPgsgpapssprasSSSSSSRESSSSSTSSSSESSRGAAVSPGPSPSRSPSPSRPPPPADP 362
                         170       180
                  ....*....|....*....|....*..
gi 319918875  892 SPKRTPPRQYQDPQRSLSPSPRRGVRR 918
Cdd:PHA03307  363 SSPRKRPRPSRAPSSPAASAGRPTRRR 389
PRK12678 PRK12678
transcription termination factor Rho; Provisional
431-626 8.87e-04

Pssm-ID: 237171 [Multi-domain]  Cd Length: 672  Bit Score: 43.74  E-value: 8.87e-04
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 319918875  431 RDRRDDGKKNKVQSRHDSSSSSSTSSPSPSPSPRKDRMRRGHSGEKARAPSPRDREKERARGGERISDRDKSRKEDRGRD 510
Cdd:PRK12678   77 ARRAARAAAAARQAEQPAAEAAAAKAEAAPAARAAAAAAAEAASAPEAAQARERRERGEAARRGAARKAGEGGEQPATEA 156
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 319918875  511 GDKAKEQARNRSSDSVSPKRSPFANGRQKERELEKERQEQRDREKRREEEMREEALRKARESDREKERNRSRREEVSHSD 590
Cdd:PRK12678  157 RADAAERTEEEERDERRRRGDREDRQAEAERGERGRREERGRDGDDRDRRDRREQGDRREERGRRDGGDRRGRRRRRDRR 236
                         170       180       190
                  ....*....|....*....|....*....|....*.
gi 319918875  591 RTSSSRCQPENRRGIRGSEAEQEVLKKDRRMEEDKR 626
Cdd:PRK12678  237 DARGDDNREDRGDRDGDDGEGRGGRRGRRFRDRDRR 272
PRK12678 PRK12678
transcription termination factor Rho; Provisional
216-409 8.94e-04

Pssm-ID: 237171 [Multi-domain]  Cd Length: 672  Bit Score: 43.74  E-value: 8.94e-04
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 319918875  216 REKKKDKKKKKKREESVNSDTESSSSDEKETSKGKRKRSDNEAPPPSKSRRRQSASSSPARSQSPQRGKQQKKTDSRPEE 295
Cdd:PRK12678   78 RRAARAAAAARQAEQPAAEAAAAKAEAAPAARAAAAAAAEAASAPEAAQARERRERGEAARRGAARKAGEGGEQPATEAR 157
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 319918875  296 VRKGRSPDRRRRGRSQESPKRMEIRERSPRRSRTPEQNKRYKG----REREREQLQEKPQRKRNDSSSRSPPPKQQLDRR 371
Cdd:PRK12678  158 ADAAERTEEEERDERRRRGDREDRQAEAERGERGRREERGRDGddrdRRDRREQGDRREERGRRDGGDRRGRRRRRDRRD 237
                         170       180       190
                  ....*....|....*....|....*....|....*...
gi 319918875  372 PRSEEREKPPQTRRHDSSSPSPPPNKQRQDRRQRSEER 409
Cdd:PRK12678  238 ARGDDNREDRGDRDGDDGEGRGGRRGRRFRDRDRRGRR 275
PTZ00121 PTZ00121
MAEBL; Provisional
292-653 9.71e-04

Pssm-ID: 173412 [Multi-domain]  Cd Length: 2084  Bit Score: 43.59  E-value: 9.71e-04
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 319918875  292 RPEEVRKGRSPDRRRRGRSQESPKR-MEIRE----RSPRRSRTPEQNKRYKgREREREQLQEKPQRKRNDSSSRSPPPKQ 366
Cdd:PTZ00121 1159 KAEDARKAEEARKAEDAKKAEAARKaEEVRKaeelRKAEDARKAEAARKAE-EERKAEEARKAEDAKKAEAVKKAEEAKK 1237
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 319918875  367 QLDRRPRSEEREKPPQTRRHDSSSPSPPPnkQRQDRRQRSEERLKAPLRKRPDSSPRSPSPKQQRDRRDDGKKNKVQS-- 444
Cdd:PTZ00121 1238 DAEEAKKAEEERNNEEIRKFEEARMAHFA--RRQAAIKAEEARKADELKKAEEKKKADEAKKAEEKKKADEAKKKAEEak 1315
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 319918875  445 RHDSSSSSSTSSPSPSPSPRKDRMRRGHSGEKARAPSprDREKERARGGERISDRDKSRKEDRGRDGDKAKEQARNRSSD 524
Cdd:PTZ00121 1316 KADEAKKKAEEAKKKADAAKKKAEEAKKAAEAAKAEA--EAAADEAEAAEEKAEAAEKKKEEAKKKADAAKKKAEEKKKA 1393
                         250       260       270       280       290       300       310       320
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 319918875  525 SVSPKRSpfANGRQKERELEKERQEQRDREKRREEEMREEALRKARESDREKERNRSRREEVSHSDRTSSSRCQPENRRg 604
Cdd:PTZ00121 1394 DEAKKKA--EEDKKKADELKKAAAAKKKADEAKKKAEEKKKADEAKKKAEEAKKADEAKKKAEEAKKAEEAKKKAEEAK- 1470
                         330       340       350       360
                  ....*....|....*....|....*....|....*....|....*....
gi 319918875  605 iRGSEAEQEVLKKDRRMEEDKRQEKSPPHQKMEKLAQKEKTGQKDKAKA 653
Cdd:PTZ00121 1471 -KADEAKKKAEEAKKADEAKKKAEEAKKKADEAKKAAEAKKKADEAKKA 1518
PTZ00121 PTZ00121
MAEBL; Provisional
225-652 1.41e-03

Pssm-ID: 173412 [Multi-domain]  Cd Length: 2084  Bit Score: 43.21  E-value: 1.41e-03
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 319918875  225 KKKREESVNSDTESSSSDEKETSKGKRKRSDNEAPPPSKSRRRQSASSSPARSQSPQRGKQQKKtdsRPEEVRKGRSPDR 304
Cdd:PTZ00121 1342 KKAAEAAKAEAEAAADEAEAAEEKAEAAEKKKEEAKKKADAAKKKAEEKKKADEAKKKAEEDKK---KADELKKAAAAKK 1418
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 319918875  305 RRRGRSQ--ESPKRMEIRERSPRRSRTPEQNKRYKGREREREQLQEKPQRKRNDSSSRspppKQQLDRRPRSEEREKPPQ 382
Cdd:PTZ00121 1419 KADEAKKkaEEKKKADEAKKKAEEAKKADEAKKKAEEAKKAEEAKKKAEEAKKADEAK----KKAEEAKKADEAKKKAEE 1494
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 319918875  383 TRRHDSSSPSPPPNKQRQDRRQRSEERLKAPLRKRPDSSPRSPSPKQQRDRRDDGKKNKVQSRHDSSSSSSTSSPSPSPS 462
Cdd:PTZ00121 1495 AKKKADEAKKAAEAKKKADEAKKAEEAKKADEAKKAEEAKKADEAKKAEEKKKADELKKAEELKKAEEKKKAEEAKKAEE 1574
                         250       260       270       280       290       300       310       320
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 319918875  463 PRKDRMRRGHSGEKArapsprdrEKERARGGERISDRDKSRKEDRGRDGDKAKEQARN-RSSDSVSPKRSPFANGRQKE- 540
Cdd:PTZ00121 1575 DKNMALRKAEEAKKA--------EEARIEEVMKLYEEEKKMKAEEAKKAEEAKIKAEElKKAEEEKKKVEQLKKKEAEEk 1646
                         330       340       350       360       370       380       390       400
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 319918875  541 RELEKERQEQRDREKRREEEMREEALRKARESDREKERNRSRREEVSHSDRTSSSRCQPENRRGIRGSEAEQEVLKKDRR 620
Cdd:PTZ00121 1647 KKAEELKKAEEENKIKAAEEAKKAEEDKKKAEEAKKAEEDEKKAAEALKKEAEEAKKAEELKKKEAEEKKKAEELKKAEE 1726
                         410       420       430
                  ....*....|....*....|....*....|..
gi 319918875  621 MEEDKRQEKSPPHQKMEKLAQKEKTGQKDKAK 652
Cdd:PTZ00121 1727 ENKIKAEEAKKEAEEDKKKAEEAKKDEEEKKK 1758
PTZ00121 PTZ00121
MAEBL; Provisional
323-771 1.84e-03

Pssm-ID: 173412 [Multi-domain]  Cd Length: 2084  Bit Score: 42.82  E-value: 1.84e-03
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 319918875  323 SPRRSRTPEQNKRYKGREREREQLQEKPQRKRNDSSSRSPPPKQQLDRRprSEEREKPPQTRRHDSSSPSPPPNKQRQDR 402
Cdd:PTZ00121 1068 QDEGLKPSYKDFDFDAKEDNRADEATEEAFGKAEEAKKTETGKAEEARK--AEEAKKKAEDARKAEEARKAEDARKAEEA 1145
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 319918875  403 RQRSEERLKAPLRKRPDSSPRSPSPKQQRDRRDDGKKNKVQSRHDSSSSSSTSSPSPSPSPRKDRMRRGhsgEKARAPSP 482
Cdd:PTZ00121 1146 RKAEDAKRVEIARKAEDARKAEEARKAEDAKKAEAARKAEEVRKAEELRKAEDARKAEAARKAEEERKA---EEARKAED 1222
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 319918875  483 RDREKERARGGERISDRDKSRKEDRGRDGDKAKEQARNRSSDSVSPKRSPFANGRQKERELEKERQEQRDREKRREEEMR 562
Cdd:PTZ00121 1223 AKKAEAVKKAEEAKKDAEEAKKAEEERNNEEIRKFEEARMAHFARRQAAIKAEEARKADELKKAEEKKKADEAKKAEEKK 1302
                         250       260       270       280       290       300       310       320
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 319918875  563 EEALRKARESDREKERNRSRREEVSHSDRTSSSRCQPENRRGIRGSEAEQEVLKKDRRMEEDKRQEKSPPHQKMEKLAQK 642
Cdd:PTZ00121 1303 KADEAKKKAEEAKKADEAKKKAEEAKKKADAAKKKAEEAKKAAEAAKAEAEAAADEAEAAEEKAEAAEKKKEEAKKKADA 1382
                         330       340       350       360       370       380       390       400
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 319918875  643 EKTGQKDKAKAVSSSSSSSSSSSSSNSESDSDSSSSSSSSSSSSSSSSSSSDDDKKKKQSSKDSTSASKSVPNAVIAQAI 722
Cdd:PTZ00121 1383 AKKKAEEKKKADEAKKKAEEDKKKADELKKAAAAKKKADEAKKKAEEKKKADEAKKKAEEAKKADEAKKKAEEAKKAEEA 1462
                         410       420       430       440
                  ....*....|....*....|....*....|....*....|....*....
gi 319918875  723 ARREKENRVRNGESDEGRKTYPPIQRQSSPPEPQSKRDDVEKKQRDPER 771
Cdd:PTZ00121 1463 KKKAEEAKKADEAKKKAEEAKKADEAKKKAEEAKKKADEAKKAAEAKKK 1511
SF-CC1 TIGR01622
splicing factor, CC1-like family; This model represents a subfamily of RNA splicing factors ...
469-583 2.65e-03

splicing factor, CC1-like family; This model represents a subfamily of RNA splicing factors including the Pad-1 protein (N. crassa), CAPER (M. musculus) and CC1.3 (H. sapiens). These proteins are characterized by an N-terminal arginine-rich, low-complexity domain followed by three (or, in the case of 4 H. sapiens paralogs, two) RNA recognition domains (rrm: pfam00706). These splicing factors are closely related to the U2AF splicing factor family (TIGR01642). A homologous gene from Plasmodium falciparum was identified in the course of the analysis of that genome at TIGR and was included in the seed.


Pssm-ID: 273721 [Multi-domain]  Cd Length: 494  Bit Score: 41.83  E-value: 2.65e-03
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 319918875   469 RRGHSGEKARApSPRDREKERARGGERISDRDKSRKEDRGRDGDKAKEQARNRSSDSVSPKRSPFANGRQKERELEKERQ 548
Cdd:TIGR01622    2 YRDRERERLRD-SSSAGDRDRRRDKGRERSRDRSRDRERSRSRRRDRHRDRDYYRGRERRSRSRRPNRRYRPREKRRRRG 80
                           90       100       110
                   ....*....|....*....|....*....|....*
gi 319918875   549 EQRDREKRREEEMREEALRKARESDREKERNRSRR 583
Cdd:TIGR01622   81 DSYRRRRDDRRSRREKPRARDGTPEPLTEDERDRR 115
PRK12678 PRK12678
transcription termination factor Rho; Provisional
406-634 2.73e-03

Pssm-ID: 237171 [Multi-domain]  Cd Length: 672  Bit Score: 42.20  E-value: 2.73e-03
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 319918875  406 SEERLKAPLRKRPDSSPRSPSPKQQRDRRDDGKKNKVQSRHDSSSSSSTSSPSPSPSPRKDRMRRGHSGEKARAPSPRDR 485
Cdd:PRK12678   66 AAATPAAPAAAARRAARAAAAARQAEQPAAEAAAAKAEAAPAARAAAAAAAEAASAPEAAQARERRERGEAARRGAARKA 145
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 319918875  486 EKERARGGERISDRDKSRKEDRGRDGDKAKEQARNRSSDSvspkrspfANGRQKERELEKERQEQRDREKRREEEMREEA 565
Cdd:PRK12678  146 GEGGEQPATEARADAAERTEEEERDERRRRGDREDRQAEA--------ERGERGRREERGRDGDDRDRRDRREQGDRREE 217
                         170       180       190       200       210       220
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....
gi 319918875  566 LRKARESDREKERNRSRREEVSHSDRTSSSRCQPENRRGIRGSEAEQEVLKKDRRMEEDKRQEKSPPHQ 634
Cdd:PRK12678  218 RGRRDGGDRRGRRRRRDRRDARGDDNREDRGDRDGDDGEGRGGRRGRRFRDRDRRGRRGGDGGNEREPE 286
PRK12678 PRK12678
transcription termination factor Rho; Provisional
350-558 4.86e-03

Pssm-ID: 237171 [Multi-domain]  Cd Length: 672  Bit Score: 41.04  E-value: 4.86e-03
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 319918875  350 PQRKRNDSSSRSPPPKQQLDRRPRSEEREKPPQTRRHDSSSPSPPPNKQRQDRRQRSEERLKAPLRKRPDSSPRSPSPKQ 429
Cdd:PRK12678   76 AARRAARAAAAARQAEQPAAEAAAAKAEAAPAARAAAAAAAEAASAPEAAQARERRERGEAARRGAARKAGEGGEQPATE 155
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 319918875  430 QRDRRDDGKKNKVQSRHDSSSSSSTSSPSPSPSPRKDRMRRGHSGEKARAPSPRDREKERARGGERISDRDKSRKEDRGR 509
Cdd:PRK12678  156 ARADAAERTEEEERDERRRRGDREDRQAEAERGERGRREERGRDGDDRDRRDRREQGDRREERGRRDGGDRRGRRRRRDR 235
                         170       180       190       200
                  ....*....|....*....|....*....|....*....|....*....
gi 319918875  510 DGDKAKEQARNRSSDSVSPKRSPFANGRQKERELEKERQEQRDREKRRE 558
Cdd:PRK12678  236 RDARGDDNREDRGDRDGDDGEGRGGRRGRRFRDRDRRGRRGGDGGNERE 284
PTZ00449 PTZ00449
104 kDa microneme/rhoptry antigen; Provisional
244-383 9.52e-03

Pssm-ID: 185628 [Multi-domain]  Cd Length: 943  Bit Score: 40.44  E-value: 9.52e-03
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 319918875  244 KETSKGKrKRSDNEAPPPSKSRRRQSASSSPARSQSPQRGKQQKKTDSRPEEvrkgrspdrrrrgrsQESPKRMEiRERS 323
Cdd:PTZ00449  534 HEDSKES-DEPKEGGKPGETKEGEVGKKPGPAKEHKPSKIPTLSKKPEFPKD---------------PKHPKDPE-EPKK 596
                          90       100       110       120       130       140
                  ....*....|....*....|....*....|....*....|....*....|....*....|.
gi 319918875  324 PRRSRTPEQNKRYKGRER-EREQLQEKPQRKRNDSSSRSPPPKQqldrRPRSEEREKPPQT 383
Cdd:PTZ00449  597 PKRPRSAQRPTRPKSPKLpELLDIPKSPKRPESPKSPKRPPPPQ----RPSSPERPEGPKI 653
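The `Pssm-ID` summary line that precedes each alignment packs several fields into one row. The following is a rough sketch of pulling them out with a regular expression, assuming the line layout seen on this page; `SUMMARY_RE` and `parse_summary` are hypothetical names, not an NCBI API.

```python
import re

# Pattern modeled on the summary lines shown on this page (an assumption
# about the layout, not a documented format).
SUMMARY_RE = re.compile(
    r"Pssm-ID:\s*(?P<pssm_id>\d+)\s*\[(?P<model_type>[^\]]+)\]\s*"
    r"Cd Length:\s*(?P<cd_length>\d+)\s*"
    r"Bit Score:\s*(?P<bit_score>[\d.]+)\s*"
    r"E-value:\s*(?P<e_value>\S+)"
)


def parse_summary(line: str):
    """Return the fields of one summary line as a dict, or None if no match."""
    match = SUMMARY_RE.search(line)
    if match is None:
        return None
    return {
        "pssm_id": int(match.group("pssm_id")),
        "model_type": match.group("model_type"),
        "cd_length": int(match.group("cd_length")),
        "bit_score": float(match.group("bit_score")),
        "e_value": float(match.group("e_value")),
    }


line = "Pssm-ID: 185628 [Multi-domain]  Cd Length: 943  Bit Score: 40.44  E-value: 9.52e-03"
summary = parse_summary(line)
print(summary)
```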
 
Blast search parameters
Data Source: Precalculated data, version = cdd.v.3.21
Preset Options: Database: CDSEARCH/cdd; Low complexity filter: no; Composition Based Adjustment: yes; E-value threshold: 0.01
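To illustrate what the E-value threshold does, here is a small sketch that filters a list of hits by E-value. The accessions, intervals, and E-values are copied from the hits listed on this page; the `Hit` type, the `filter_hits` function, and the choice of an inclusive `<=` comparison are assumptions for illustration, not the CD-Search implementation.

```python
from typing import List, NamedTuple


class Hit(NamedTuple):
    accession: str
    start: int
    end: int
    e_value: float


# Hits as listed on this page (accession, query interval, E-value).
HITS = [
    Hit("PHA03307", 744, 918, 8.17e-04),
    Hit("PRK12678", 431, 626, 8.87e-04),
    Hit("PRK12678", 216, 409, 8.94e-04),
    Hit("PTZ00121", 292, 653, 9.71e-04),
    Hit("PTZ00121", 225, 652, 1.41e-03),
    Hit("PTZ00121", 323, 771, 1.84e-03),
    Hit("TIGR01622", 469, 583, 2.65e-03),
    Hit("PRK12678", 406, 634, 2.73e-03),
    Hit("PRK12678", 350, 558, 4.86e-03),
    Hit("PTZ00449", 244, 383, 9.52e-03),
]


def filter_hits(hits: List[Hit], threshold: float = 0.01) -> List[Hit]:
    """Keep hits at or below the threshold, most significant first."""
    kept = [h for h in hits if h.e_value <= threshold]
    return sorted(kept, key=lambda h: h.e_value)


default_hits = filter_hits(HITS)         # threshold used on this page
strict_hits = filter_hits(HITS, 1e-03)   # a stricter, hypothetical cutoff
print(len(default_hits), len(strict_hits))
```

With the page's threshold of 0.01 every listed hit is retained, which is expected since the page only reports hits that already passed that cutoff.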
