NCBI Home Page NCBI Site Search page NCBI Guide that lists and describes the NCBI resources
Conserved domains on  [gi|109150409|ref|NP_849254|]
View 

SR-related and CTD-associated factor 4 isoform 1 [Mus musculus]

Protein Classification

Graphical summary

 Zoom to residue level

show extra options »

Show site features     Horizontal zoom: ×

List of domain hits

Name Accession Description Interval E-value
CID_SFRS15_SCAF4 cd17005
CID (CTD-Interacting Domain) of Splicing factor arginine serine rich 15; Splicing factor ...
5-135 2.53e-88

CID (CTD-Interacting Domain) of Splicing factor arginine serine rich 15; Splicing factor arginine serine rich 15 (SFRS15) is also called CTD-binding SR-like protein RA4 or SR-related and CTD-associated factor 4 (SCAF4). It may act to physically and functionally link transcription and pre-mRNA processing. SFRS15/SCAF4 contains a CTD-interacting domain (CID) at the amino terminus and a Ser/Arg-rich domain followed by an RNA recognition motif. CID binds tightly to the carboxy-terminal domain (CTD) of RNA polymerase (Pol) II (RNAP II). During transcription, RNAP II synthesizes eukaryotic messenger RNA. Transcription is coupled to RNA processing through the CTD, which consists of up to 52 repeats of the sequence Tyr1-Ser2-Pro3-Thr4-Ser5-Pro6-Ser7. CID contains eight alpha-helices in a right-handed superhelical arrangement, which closely resembles that of the VHS domains and ARM (Armadillo) repeat proteins, except for its two amino-terminal helices.


:

Pssm-ID: 340802  Cd Length: 131  Bit Score: 281.08  E-value: 2.53e-88
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 109150409    5 NAFNQELFSLMDMKPPISRAKMILITKAAIKAIKLYKHVVQIVEKFIKKCKPEYKVPGLYVIDSIVRQSRHQFGTDKDVF 84
Cdd:cd17005     1 NAFNQELFSLMDMKPPISRAKMILITKAAIKAIKLYKHVVQIVEKFIKKCKPEYKVPGLYVIDSIVRQSRHQFGADKDVF 80
                          90       100       110       120       130
                  ....*....|....*....|....*....|....*....|....*....|.
gi 109150409   85 GPRFSKNITATFQYLYLCPSEDKSKIVRVLNLWQKNGVFKIEIIQPLLDMA 135
Cdd:cd17005    81 GPRFSKNITATFQYLYLCPSEDKSKIVRVLNLWQKNGVFKIEIIQPLLDMA 131
RRM_SCAF4 cd12461
RNA recognition motif (RRM) found in SR-related and CTD-associated factor 4 (SCAF4) and ...
569-649 9.30e-56

RNA recognition motif (RRM) found in SR-related and CTD-associated factor 4 (SCAF4) and similar proteins; The CD corresponds to the RRM of SCAF4 (also termed splicing factor, arginine/serine-rich 15 or SFR15, or CTD-binding SR-like protein RA4) that belongs to a new class of SCAFs (SR-like CTD-associated factors). Although its biological function remains unclear, SCAF4 shows high sequence similarity to SCAF8 that interacts specifically with a highly serine-phosphorylated form of the carboxy-terminal domain (CTD) of the largest subunit of RNA polymerase II (pol II) and may play a direct role in coupling with both, transcription and pre-mRNA processing, processes. SCAF4 and SCAF8 both contain a conserved N-terminal CTD-interacting domain (CID), an atypical RNA recognition motif (RRM), also termed RBD (RNA binding domain) or RNP (ribonucleoprotein domain), and serine/arginine-rich motifs.


:

Pssm-ID: 409894 [Multi-domain]  Cd Length: 81  Bit Score: 187.56  E-value: 9.30e-56
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 109150409  569 SVCSTTLWVGQLDKRTTQQDVASLLEEFGPIESINMIPPRGCAYIVMVHRQDAYRALQKLSRGNYKVNQKSIKIAWALNK 648
Cdd:cd12461     1 SVCSTTLWVGQLDKRTTQQDVASLLEEFGQIESINMIPPRGCAYIVMVHRQDAYRALQKLSRGNYKVNQKSIKIAWALNK 80

                  .
gi 109150409  649 G 649
Cdd:cd12461    81 G 81
PAT1 super family cl37801
Topoisomerase II-associated protein PAT1; Members of this family are necessary for accurate ...
303-476 9.60e-10

Topoisomerase II-associated protein PAT1; Members of this family are necessary for accurate chromosome transmission during cell division.


The actual alignment was detected with superfamily member pfam09770:

Pssm-ID: 401645 [Multi-domain]  Cd Length: 846  Bit Score: 63.13  E-value: 9.60e-10
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 109150409   303 SATSPPPPQTPfgyPGDGVQQPAYTQHQSMDQFQPRMMPIQQDTMHHQVPLPPngqmpgfgllsapppfppmpqpgmpqp 382
Cdd:pfam09770  204 RAQAKKPAQQP---APAPAQPPAAPPAQQAQQQQQFPPQIQQQQQPQQQPQQP--------------------------- 253
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 109150409   383 gMAQPGLAQPgMAQPTMPQPGMPQPGMPQPGMAQPGlaQPGMAQPGMPQPA--------MPQPAMPQPGMAQPGVSPAPP 454
Cdd:pfam09770  254 -QQHPGQGHP-VTILQRPQSPQPDPAQPSIQPQAQQ--FHQQPPPVPVQPTqilqnpnrLSAARVGYPQNPQPGVQPAPA 329
                          170       180
                   ....*....|....*....|..
gi 109150409   455 VQPTFQSTFQPQNEPHSQKPHQ 476
Cdd:pfam09770  330 HQAHRQQGSFGRQAPIITHPQQ 351
SF-CC1 super family cl36939
splicing factor, CC1-like family; This model represents a subfamily of RNA splicing factors ...
1059-1171 6.35e-04

splicing factor, CC1-like family; This model represents a subfamily of RNA splicing factors including the Pad-1 protein (N. crassa), CAPER (M. musculus) and CC1.3 (H.sapiens). These proteins are characterized by an N-terminal arginine-rich, low complexity domain followed by three (or in the case of 4 H. sapiens paralogs, two) RNA recognition domains (rrm: pfam00706). These splicing factors are closely related to the U2AF splicing factor family (TIGR01642). A homologous gene from Plasmodium falciparum was identified in the course of the analysis of that genome at TIGR and was included in the seed.


The actual alignment was detected with superfamily member TIGR01622:

Pssm-ID: 273721 [Multi-domain]  Cd Length: 494  Bit Score: 43.75  E-value: 6.35e-04
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 109150409  1059 RDQERfgRRSFGSRVENDRERYGSRNDDRDNSNRERREWGrRSPDRDRHRD--------LEERSRRSSGHrdrdrdsRDR 1130
Cdd:TIGR01622    3 RDRER--ERLRDSSSAGDRDRRRDKGRERSRDRSRDRERS-RSRRRDRHRDrdyyrgreRRSRSRRPNRR-------YRP 72
                           90       100       110       120
                   ....*....|....*....|....*....|....*....|.
gi 109150409  1131 ESRREKEENRKEKHEVADRAGGnkavEPPLSQVGTIDTVSE 1171
Cdd:TIGR01622   73 REKRRRRGDSYRRRRDDRRSRR----EKPRARDGTPEPLTE 109
 
Name Accession Description Interval E-value
CID_SFRS15_SCAF4 cd17005
CID (CTD-Interacting Domain) of Splicing factor arginine serine rich 15; Splicing factor ...
5-135 2.53e-88

CID (CTD-Interacting Domain) of Splicing factor arginine serine rich 15; Splicing factor arginine serine rich 15 (SFRS15) is also called CTD-binding SR-like protein RA4 or SR-related and CTD-associated factor 4 (SCAF4). It may act to physically and functionally link transcription and pre-mRNA processing. SFRS15/SCAF4 contains a CTD-interacting domain (CID) at the amino terminus and a Ser/Arg-rich domain followed by an RNA recognition motif. CID binds tightly to the carboxy-terminal domain (CTD) of RNA polymerase (Pol) II (RNAP II). During transcription, RNAP II synthesizes eukaryotic messenger RNA. Transcription is coupled to RNA processing through the CTD, which consists of up to 52 repeats of the sequence Tyr1-Ser2-Pro3-Thr4-Ser5-Pro6-Ser7. CID contains eight alpha-helices in a right-handed superhelical arrangement, which closely resembles that of the VHS domains and ARM (Armadillo) repeat proteins, except for its two amino-terminal helices.


Pssm-ID: 340802  Cd Length: 131  Bit Score: 281.08  E-value: 2.53e-88
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 109150409    5 NAFNQELFSLMDMKPPISRAKMILITKAAIKAIKLYKHVVQIVEKFIKKCKPEYKVPGLYVIDSIVRQSRHQFGTDKDVF 84
Cdd:cd17005     1 NAFNQELFSLMDMKPPISRAKMILITKAAIKAIKLYKHVVQIVEKFIKKCKPEYKVPGLYVIDSIVRQSRHQFGADKDVF 80
                          90       100       110       120       130
                  ....*....|....*....|....*....|....*....|....*....|.
gi 109150409   85 GPRFSKNITATFQYLYLCPSEDKSKIVRVLNLWQKNGVFKIEIIQPLLDMA 135
Cdd:cd17005    81 GPRFSKNITATFQYLYLCPSEDKSKIVRVLNLWQKNGVFKIEIIQPLLDMA 131
RRM_SCAF4 cd12461
RNA recognition motif (RRM) found in SR-related and CTD-associated factor 4 (SCAF4) and ...
569-649 9.30e-56

RNA recognition motif (RRM) found in SR-related and CTD-associated factor 4 (SCAF4) and similar proteins; The CD corresponds to the RRM of SCAF4 (also termed splicing factor, arginine/serine-rich 15 or SFR15, or CTD-binding SR-like protein RA4) that belongs to a new class of SCAFs (SR-like CTD-associated factors). Although its biological function remains unclear, SCAF4 shows high sequence similarity to SCAF8 that interacts specifically with a highly serine-phosphorylated form of the carboxy-terminal domain (CTD) of the largest subunit of RNA polymerase II (pol II) and may play a direct role in coupling with both, transcription and pre-mRNA processing, processes. SCAF4 and SCAF8 both contain a conserved N-terminal CTD-interacting domain (CID), an atypical RNA recognition motif (RRM), also termed RBD (RNA binding domain) or RNP (ribonucleoprotein domain), and serine/arginine-rich motifs.


Pssm-ID: 409894 [Multi-domain]  Cd Length: 81  Bit Score: 187.56  E-value: 9.30e-56
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 109150409  569 SVCSTTLWVGQLDKRTTQQDVASLLEEFGPIESINMIPPRGCAYIVMVHRQDAYRALQKLSRGNYKVNQKSIKIAWALNK 648
Cdd:cd12461     1 SVCSTTLWVGQLDKRTTQQDVASLLEEFGQIESINMIPPRGCAYIVMVHRQDAYRALQKLSRGNYKVNQKSIKIAWALNK 80

                  .
gi 109150409  649 G 649
Cdd:cd12461    81 G 81
RPR smart00582
domain present in proteins, which are involved in regulation of nuclear pre-mRNA;
6-134 1.17e-36

domain present in proteins, which are involved in regulation of nuclear pre-mRNA;


Pssm-ID: 214731  Cd Length: 124  Bit Score: 134.33  E-value: 1.17e-36
                            10        20        30        40        50        60        70        80
                    ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 109150409      6 AFNQELFSLMdmkppISRAKMILITKAAIKAIKLYKHVVQIVEKFIKKCKPEYKVPGLYVIDSIVRQSRHQFGTD-KDVF 84
Cdd:smart00582    1 AFEQKLESLN-----NSQESIQTLTKWAIEHASHAKEIVELWEKYIKKAPVPRKLPLLYLLDSIVQNSKRKYGSEfGDEL 75
                            90       100       110       120       130
                    ....*....|....*....|....*....|....*....|....*....|
gi 109150409     85 GPRFSKNITATFQYlylCPSEDKSKIVRVLNLWQKNGVFKIEIIQPLLDM 134
Cdd:smart00582   76 GPVFQDALRRVLGA---APEELKKKIRRLLNIWEERGIFPPEVLRPLREK 122
CID pfam04818
CID domain; This domain binds to the phosphorylated C-terminal domain (CTD) of RNA polymerase ...
6-128 4.19e-30

CID domain; This domain binds to the phosphorylated C-terminal domain (CTD) of RNA polymerase II. This domain is known as the CTD-interacting domain (CID).


Pssm-ID: 461442  Cd Length: 117  Bit Score: 115.38  E-value: 4.19e-30
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 109150409     6 AFNQELFSLMDmkppiSRAKMILITKAAIKAIKLYKHVVQIVEKFIKKCKPEYKVPGLYVIDSIVRQSRHQFGTDkdvFG 85
Cdd:pfam04818    2 ALEKKLSSLNN-----SQESIQTLSKWILFHRKHAKAIVEVWEKYLKKAKPEKKLHLLYLANDVLQNSRKKGKSE---FA 73
                           90       100       110       120
                   ....*....|....*....|....*....|....*....|....
gi 109150409    86 PRFSKNITATFQYLYL-CPSEDKSKIVRVLNLWQKNGVFKIEII 128
Cdd:pfam04818   74 DAFEPVLPEAFASAYKkCDEKLKKKLERLLNIWEERNVFSPEVI 117
RRM smart00360
RNA recognition motif;
574-642 8.43e-12

RNA recognition motif;


Pssm-ID: 214636 [Multi-domain]  Cd Length: 73  Bit Score: 61.84  E-value: 8.43e-12
                            10        20        30        40        50        60        70
                    ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*
gi 109150409    574 TLWVGQLDKRTTQQDVASLLEEFGPIESINMIP------PRGCAYIVMVHRQDAYRALQKLsrGNYKVNQKSIKI 642
Cdd:smart00360    1 TLFVGNLPPDTTEEELRELFSKFGKVESVRLVRdketgkSKGFAFVEFESEEDAEKALEAL--NGKELDGRPLKV 73
PAT1 pfam09770
Topoisomerase II-associated protein PAT1; Members of this family are necessary for accurate ...
303-476 9.60e-10

Topoisomerase II-associated protein PAT1; Members of this family are necessary for accurate chromosome transmission during cell division.


Pssm-ID: 401645 [Multi-domain]  Cd Length: 846  Bit Score: 63.13  E-value: 9.60e-10
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 109150409   303 SATSPPPPQTPfgyPGDGVQQPAYTQHQSMDQFQPRMMPIQQDTMHHQVPLPPngqmpgfgllsapppfppmpqpgmpqp 382
Cdd:pfam09770  204 RAQAKKPAQQP---APAPAQPPAAPPAQQAQQQQQFPPQIQQQQQPQQQPQQP--------------------------- 253
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 109150409   383 gMAQPGLAQPgMAQPTMPQPGMPQPGMPQPGMAQPGlaQPGMAQPGMPQPA--------MPQPAMPQPGMAQPGVSPAPP 454
Cdd:pfam09770  254 -QQHPGQGHP-VTILQRPQSPQPDPAQPSIQPQAQQ--FHQQPPPVPVQPTqilqnpnrLSAARVGYPQNPQPGVQPAPA 329
                          170       180
                   ....*....|....*....|..
gi 109150409   455 VQPTFQSTFQPQNEPHSQKPHQ 476
Cdd:pfam09770  330 HQAHRQQGSFGRQAPIITHPQQ 351
RRM_1 pfam00076
RNA recognition motif. (a.k.a. RRM, RBD, or RNP domain); The RRM motif is probably diagnostic ...
575-641 3.77e-08

RNA recognition motif. (a.k.a. RRM, RBD, or RNP domain); The RRM motif is probably diagnostic of an RNA binding protein. RRMs are found in a variety of RNA binding proteins, including various hnRNP proteins, proteins implicated in regulation of alternative splicing, and protein components of snRNPs. The motif also appears in a few single stranded DNA binding proteins. The RRM structure consists of four strands and two helices arranged in an alpha/beta sandwich, with a third helix present during RNA binding in some cases The C-terminal beta strand (4th strand) and final helix are hard to align and have been omitted in the SEED alignment The LA proteins have an N terminal rrm which is included in the seed. There is a second region towards the C terminus that has some features characteriztic of a rrm but does not appear to have the important structural core of a rrm. The LA proteins are one of the main autoantigens in Systemic lupus erythematosus (SLE), an autoimmune disease.


Pssm-ID: 425453 [Multi-domain]  Cd Length: 70  Bit Score: 51.46  E-value: 3.77e-08
                           10        20        30        40        50        60        70
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|..
gi 109150409   575 LWVGQLDKRTTQQDVASLLEEFGPIESINMIP-----PRGCAYIVMVHRQDAYRALQKLSrgNYKVNQKSIK 641
Cdd:pfam00076    1 LFVGNLPPDTTEEDLKDLFSKFGPIKSIRLVRdetgrSKGFAFVEFEDEEDAEKAIEALN--GKELGGRELK 70
PRK10263 PRK10263
DNA translocase FtsK; Provisional
322-459 6.28e-07

DNA translocase FtsK; Provisional


Pssm-ID: 236669 [Multi-domain]  Cd Length: 1355  Bit Score: 53.94  E-value: 6.28e-07
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 109150409  322 QQPAYTQHQSMDQFQPRMMPIQQDTMHHQvPLPPNGQMPgfgllsaPPPFPPMPQPGMPQPGMAQPGLAQPGMAQPTMPQ 401
Cdd:PRK10263  714 EQPAGANPFSLDDFEFSPMKALLDDGPHE-PLFTPIVEP-------VQQPQQPVAPQQQYQQPQQPVAPQPQYQQPQQPV 785
                          90       100       110       120       130       140
                  ....*....|....*....|....*....|....*....|....*....|....*....|..
gi 109150409  402 PGMPQPGMPQ-PGMAQPGLAQPgmAQPGMPQPAMPQP---AMPQPGMAQPGVSPAPPVQPTF 459
Cdd:PRK10263  786 APQPQYQQPQqPVAPQPQYQQP--QQPVAPQPQYQQPqqpVAPQPQYQQPQQPVAPQPQDTL 845
PABP-1234 TIGR01628
polyadenylate binding protein, human types 1, 2, 3, 4 family; These eukaryotic proteins ...
574-644 3.62e-05

polyadenylate binding protein, human types 1, 2, 3, 4 family; These eukaryotic proteins recognize the poly-A of mRNA and consists of four tandem RNA recognition domains at the N-terminus (rrm: pfam00076) followed by a PABP-specific domain (pfam00658) at the C-terminus. The protein is involved in the transport of mRNA's from the nucleus to the cytoplasm. There are four paralogs in Homo sapiens which are expressed in testis, platelets, broadly expressed and of unknown tissue range.


Pssm-ID: 130689 [Multi-domain]  Cd Length: 562  Bit Score: 47.88  E-value: 3.62e-05
                           10        20        30        40        50        60        70
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*...
gi 109150409   574 TLWVGQLDKRTTQQDVASLLEEFGPIESI----NMIPPR--GCAYIVMVHRQDAYRALQKLsrgNYK-VNQKSIKIAW 644
Cdd:TIGR01628    2 SLYVGDLDPDVTEAKLYDLFKPFGPVLSVrvcrDSVTRRslGYGYVNFQNPADAERALETM---NFKrLGGKPIRIMW 76
COG3416 COG3416
Uncharacterized conserved protein, DUF2076 domain [Function unknown];
393-443 3.92e-04

Uncharacterized conserved protein, DUF2076 domain [Function unknown];


Pssm-ID: 442642 [Multi-domain]  Cd Length: 237  Bit Score: 43.47  E-value: 3.92e-04
                          10        20        30        40        50
                  ....*....|....*....|....*....|....*....|....*....|.
gi 109150409  393 GMAQPTMPQPGMPQPGmPQPGMAQPGLAQPGMAQPGMPQPAMPQPAMPQPG 443
Cdd:COG3416    93 GQRPPPAPQPSQPGPQ-QQPAPPSGPWGQAAPQQPGYGQPQYGQPAAGPSG 142
SF-CC1 TIGR01622
splicing factor, CC1-like family; This model represents a subfamily of RNA splicing factors ...
1059-1171 6.35e-04

splicing factor, CC1-like family; This model represents a subfamily of RNA splicing factors including the Pad-1 protein (N. crassa), CAPER (M. musculus) and CC1.3 (H.sapiens). These proteins are characterized by an N-terminal arginine-rich, low complexity domain followed by three (or in the case of 4 H. sapiens paralogs, two) RNA recognition domains (rrm: pfam00706). These splicing factors are closely related to the U2AF splicing factor family (TIGR01642). A homologous gene from Plasmodium falciparum was identified in the course of the analysis of that genome at TIGR and was included in the seed.


Pssm-ID: 273721 [Multi-domain]  Cd Length: 494  Bit Score: 43.75  E-value: 6.35e-04
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 109150409  1059 RDQERfgRRSFGSRVENDRERYGSRNDDRDNSNRERREWGrRSPDRDRHRD--------LEERSRRSSGHrdrdrdsRDR 1130
Cdd:TIGR01622    3 RDRER--ERLRDSSSAGDRDRRRDKGRERSRDRSRDRERS-RSRRRDRHRDrdyyrgreRRSRSRRPNRR-------YRP 72
                           90       100       110       120
                   ....*....|....*....|....*....|....*....|.
gi 109150409  1131 ESRREKEENRKEKHEVADRAGGnkavEPPLSQVGTIDTVSE 1171
Cdd:TIGR01622   73 REKRRRRGDSYRRRRDDRRSRR----EKPRARDGTPEPLTE 109
PABP-1234 TIGR01628
polyadenylate binding protein, human types 1, 2, 3, 4 family; These eukaryotic proteins ...
391-474 9.25e-04

polyadenylate binding protein, human types 1, 2, 3, 4 family; These eukaryotic proteins recognize the poly-A of mRNA and consists of four tandem RNA recognition domains at the N-terminus (rrm: pfam00076) followed by a PABP-specific domain (pfam00658) at the C-terminus. The protein is involved in the transport of mRNA's from the nucleus to the cytoplasm. There are four paralogs in Homo sapiens which are expressed in testis, platelets, broadly expressed and of unknown tissue range.


Pssm-ID: 130689 [Multi-domain]  Cd Length: 562  Bit Score: 43.26  E-value: 9.25e-04
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 109150409   391 QPGMAQPTMPQPGMPQPGMPQPGMAQPGLAQPGmAQPGMPQPAMPQPAMPqPGMAQPGVSPAPPVQPtfqsTFQPQNEPH 470
Cdd:TIGR01628  412 QPLGWPRMSMMPTPMGPGGPLRPNGLAPMNAVR-APSRNAQNAAQKPPMQ-PVMYPPNYQSLPLSQD----LPQPQSTAS 485

                   ....
gi 109150409   471 SQKP 474
Cdd:TIGR01628  486 QGGQ 489
KLF1_2_4_N cd21972
N-terminal domain of Kruppel-like factor (KLF) 1, KLF2, KLF4, and similar proteins; Kruppel ...
307-457 8.13e-03

N-terminal domain of Kruppel-like factor (KLF) 1, KLF2, KLF4, and similar proteins; Kruppel/Krueppel-like transcription factors (KLFs) belong to a family of proteins called the Specificity Protein (SP)/KLF family, characterized by a C-terminal DNA-binding domain of 81 amino acids consisting of three Kruppel-like C2H2 zinc fingers. These factors bind to a loose consensus motif, namely NNRCRCCYY (where N is any nucleotide; R is A/G, and Y is C/T), such as the recurring motifs in GC and GT boxes (5'-GGGGCGGGG-3' and 5-GGTGTGGGG-3') that are present in promoters and more distal regulatory elements of mammalian genes. Members of the KLF family can act as activators or repressors of transcription depending on cell and promoter context. KLFs regulate various cellular functions, such as proliferation, differentiation, and apoptosis, as well as the development and homeostasis of several types of tissue. In addition to the C-terminal DNA-binding domain, each KLF also has a unique N-terminal activation/repression domain that confers specifity and allows it to bind specifically to a certain partner, leading to distinct activities in vivo. This model represents the related N-terminal domains of KLF1, KLF2, KLF4, and similar proteins.


Pssm-ID: 409230 [Multi-domain]  Cd Length: 194  Bit Score: 38.81  E-value: 8.13e-03
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 109150409  307 PPPPQTPFGYPGDGVQQPAYTQHQSMDQFQPRmmpiqqdtmhHQVPLPPNGQMPGFGLLSAPPPFPPMPQPGMPQPGMAQ 386
Cdd:cd21972    37 DNPPPPDPAYPPPESPESCSTVYDSDGCHPTP----------NAYCGPNGPGLPGHFLLAGNSPNLGPKIKTENQEQACM 106
                          90       100       110       120       130       140       150
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|.
gi 109150409  387 PGLAQPGMaqpTMPQPGMPQPGMPQPGMAQPGLAQPGMAQPGMPQPAMPQPAMPQPGMAQPGVSPAPPVQP 457
Cdd:cd21972   107 PVAGYSGH---YGPREPQRVPPAPPPPQYAGHFQYHGHFNMFSPPLRANHPGMSTVMLTPLSTPPLGFLSP 174
PRP38_assoc pfam12871
Pre-mRNA-splicing factor 38-associated hydrophilic C-term; This domain is a hydrophilic region ...
1057-1125 9.69e-03

Pre-mRNA-splicing factor 38-associated hydrophilic C-term; This domain is a hydrophilic region found at the C-terminus of plant and metazoan pre-mRNA-splicing factor 38 proteins. The function is not known.


Pssm-ID: 463734 [Multi-domain]  Cd Length: 98  Bit Score: 36.68  E-value: 9.69e-03
                           10        20        30        40        50        60        70
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 109150409  1057 SGRDQERFGRRSFGSRV-ENDRERYGSRNDDRDNSnRERREWGRRSPDRDRHRDLEERSRRSSGHRDRDR 1125
Cdd:pfam12871   25 SDESERASLSRKRRSRSrRRSSTRDRSRSRSRSRS-RDRRSRGTRDRRRDRDRDRYRSLRSRSRDRSRDR 93
PRK12678 PRK12678
transcription termination factor Rho; Provisional
1048-1117 9.87e-03

transcription termination factor Rho; Provisional


Pssm-ID: 237171 [Multi-domain]  Cd Length: 672  Bit Score: 40.27  E-value: 9.87e-03
                          10        20        30        40        50        60        70
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 109150409 1048 RNDSRQQFNSGRDQERFGRRSFGSRVENDRERYGSRNDDRDNSNRERREWGRRSPDRDRHRDLEERSRRS 1117
Cdd:PRK12678  180 DRQAEAERGERGRREERGRDGDDRDRRDRREQGDRREERGRRDGGDRRGRRRRRDRRDARGDDNREDRGD 249
 
Name Accession Description Interval E-value
CID_SFRS15_SCAF4 cd17005
CID (CTD-Interacting Domain) of Splicing factor arginine serine rich 15; Splicing factor ...
5-135 2.53e-88

CID (CTD-Interacting Domain) of Splicing factor arginine serine rich 15; Splicing factor arginine serine rich 15 (SFRS15) is also called CTD-binding SR-like protein RA4 or SR-related and CTD-associated factor 4 (SCAF4). It may act to physically and functionally link transcription and pre-mRNA processing. SFRS15/SCAF4 contains a CTD-interacting domain (CID) at the amino terminus and a Ser/Arg-rich domain followed by an RNA recognition motif. CID binds tightly to the carboxy-terminal domain (CTD) of RNA polymerase (Pol) II (RNAP II). During transcription, RNAP II synthesizes eukaryotic messenger RNA. Transcription is coupled to RNA processing through the CTD, which consists of up to 52 repeats of the sequence Tyr1-Ser2-Pro3-Thr4-Ser5-Pro6-Ser7. CID contains eight alpha-helices in a right-handed superhelical arrangement, which closely resembles that of the VHS domains and ARM (Armadillo) repeat proteins, except for its two amino-terminal helices.


Pssm-ID: 340802  Cd Length: 131  Bit Score: 281.08  E-value: 2.53e-88
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 109150409    5 NAFNQELFSLMDMKPPISRAKMILITKAAIKAIKLYKHVVQIVEKFIKKCKPEYKVPGLYVIDSIVRQSRHQFGTDKDVF 84
Cdd:cd17005     1 NAFNQELFSLMDMKPPISRAKMILITKAAIKAIKLYKHVVQIVEKFIKKCKPEYKVPGLYVIDSIVRQSRHQFGADKDVF 80
                          90       100       110       120       130
                  ....*....|....*....|....*....|....*....|....*....|.
gi 109150409   85 GPRFSKNITATFQYLYLCPSEDKSKIVRVLNLWQKNGVFKIEIIQPLLDMA 135
Cdd:cd17005    81 GPRFSKNITATFQYLYLCPSEDKSKIVRVLNLWQKNGVFKIEIIQPLLDMA 131
CID_SCAF8_like cd16983
CID (CTD-Interacting Domain) of SR-related and CTD-associated factor 8 and similar proteins; ...
6-135 3.96e-78

CID (CTD-Interacting Domain) of SR-related and CTD-associated factor 8 and similar proteins; This subfamily includes SR-related and CTD-associated factors 8 (SCAF8) and 4 (SCAF4), and similar proteins. SCAF4 is also called Splicing factor arginine serine rich 15 (SFRS15). Members may play roles in mRNA processing. Both SCAF4 and SCAF8 contains a CTD-interacting domain (CID) at the amino terminus and a Ser/Arg-rich domain followed by an RNA recognition motif. CID binds tightly to the carboxy-terminal domain (CTD) of RNA polymerase (Pol) II (RNAP II). During transcription, RNAP II synthesizes eukaryotic messenger RNA. Transcription is coupled to RNA processing through the CTD, which consists of up to 52 repeats of the sequence Tyr1-Ser2-Pro3-Thr4-Ser5-Pro6-Ser7. CID contains eight alpha-helices in a right-handed superhelical arrangement, which closely resembles that of the VHS domains and ARM (Armadillo) repeat proteins, except for its two amino-terminal helices.


Pssm-ID: 340780  Cd Length: 131  Bit Score: 252.53  E-value: 3.96e-78
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 109150409    6 AFNQELFSLMDMKPPISRAKMILITKAAIKAIKLYKHVVQIVEKFIKKCKPEYKVPGLYVIDSIVRQSRHQFGTDKDVFG 85
Cdd:cd16983     2 EFNKELSSLLDSKPPVSKSKINAITKLAIKAIKFYKHVVQSVEKFIQKCKPEYKLPGLYVIDSIIRQSRHQYGKEKDVYA 81
                          90       100       110       120       130
                  ....*....|....*....|....*....|....*....|....*....|
gi 109150409   86 PRFSKNITATFQYLYLCPSEDKSKIVRVLNLWQKNGVFKIEIIQPLLDMA 135
Cdd:cd16983    82 PRFAKNLSKTFLNLLKCPEKDKPKVKRVLNLWQKNGVFPKEIIQPLLDAA 131
CID_SCAF8 cd17004
CID (CTD-Interacting Domain) of SR-related and CTD-associated factor 8; SR-related and ...
6-135 1.85e-77

CID (CTD-Interacting Domain) of SR-related and CTD-associated factor 8; SR-related and CTD-associated factor 8 (SCAF8) is also called CDC5L complex-associated protein 7 (CCAP7) or RNA-binding motif protein 16 (RBM16). It may play a role in mRNA processing. SCAF8 contains a CTD-interacting domain (CID) at the amino terminus and a Ser/Arg-rich domain followed by an RNA recognition motif. CID binds tightly to the carboxy-terminal domain (CTD) of RNA polymerase (Pol) II (RNAP II). During transcription, RNAP II synthesizes eukaryotic messenger RNA. Transcription is coupled to RNA processing through the CTD, which consists of up to 52 repeats of the sequence Tyr1-Ser2-Pro3-Thr4-Ser5-Pro6-Ser7. CID contains eight alpha-helices in a right-handed superhelical arrangement, which closely resembles that of the VHS domains and ARM (Armadillo) repeat proteins, except for its two amino-terminal helices.


Pssm-ID: 340801  Cd Length: 131  Bit Score: 250.72  E-value: 1.85e-77
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 109150409    6 AFNQELFSLMDMKPPISRAKMILITKAAIKAIKLYKHVVQIVEKFIKKCKPEYKVPGLYVIDSIVRQSRHQFGTDKDVFG 85
Cdd:cd17004     2 TFNSELYSLNDYKPPISKAKMTQITKAAIKAIKFYKHVVQSVEKFIQKCKPEYKVPGLYVIDSIVRQSRHQFGQEKDVFA 81
                          90       100       110       120       130
                  ....*....|....*....|....*....|....*....|....*....|
gi 109150409   86 PRFSKNITATFQYLYLCPSEDKSKIVRVLNLWQKNGVFKIEIIQPLLDMA 135
Cdd:cd17004    82 PRFSNNIISTFQNLYRCPGDDKSKIVRVLNLWQKNNVFKSEIIQPLLDMA 131
RRM_SCAF4 cd12461
RNA recognition motif (RRM) found in SR-related and CTD-associated factor 4 (SCAF4) and ...
569-649 9.30e-56

RNA recognition motif (RRM) found in SR-related and CTD-associated factor 4 (SCAF4) and similar proteins; The CD corresponds to the RRM of SCAF4 (also termed splicing factor, arginine/serine-rich 15 or SFR15, or CTD-binding SR-like protein RA4) that belongs to a new class of SCAFs (SR-like CTD-associated factors). Although its biological function remains unclear, SCAF4 shows high sequence similarity to SCAF8 that interacts specifically with a highly serine-phosphorylated form of the carboxy-terminal domain (CTD) of the largest subunit of RNA polymerase II (pol II) and may play a direct role in coupling with both, transcription and pre-mRNA processing, processes. SCAF4 and SCAF8 both contain a conserved N-terminal CTD-interacting domain (CID), an atypical RNA recognition motif (RRM), also termed RBD (RNA binding domain) or RNP (ribonucleoprotein domain), and serine/arginine-rich motifs.


Pssm-ID: 409894 [Multi-domain]  Cd Length: 81  Bit Score: 187.56  E-value: 9.30e-56
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 109150409  569 SVCSTTLWVGQLDKRTTQQDVASLLEEFGPIESINMIPPRGCAYIVMVHRQDAYRALQKLSRGNYKVNQKSIKIAWALNK 648
Cdd:cd12461     1 SVCSTTLWVGQLDKRTTQQDVASLLEEFGQIESINMIPPRGCAYIVMVHRQDAYRALQKLSRGNYKVNQKSIKIAWALNK 80

                  .
gi 109150409  649 G 649
Cdd:cd12461    81 G 81
RRM_SCAF8 cd12462
RNA recognition motif (RRM) found in SR-related and CTD-associated factor 8 (SCAF8) and ...
571-649 1.01e-43

RNA recognition motif (RRM) found in SR-related and CTD-associated factor 8 (SCAF8) and similar proteins; This subgroup corresponds to the RRM of SCAF8 (also termed CDC5L complex-associated protein 7, or RNA-binding motif protein 16, or CTD-binding SR-like protein RA8), a nuclear matrix protein that interacts specifically with a highly serine-phosphorylated form of the carboxy-terminal domain (CTD) of the largest subunit of RNA polymerase II (pol II). The pol II CTD plays a role in coupling transcription and pre-mRNA processing. SCAF8 co-localizes primarily with transcription sites that are enriched in nuclear matrix fraction, which is known to contain proteins involved in pre-mRNA processing. Thus, SCAF8 may play a direct role in coupling with both, transcription and pre-mRNA processing, processes. SCAF8, together with SCAF4, represents a new class of SCAFs (SR-like CTD-associated factors). They contain a conserved N-terminal CTD-interacting domain (CID), an atypical RNA recognition motif (RRM), also termed RBD (RNA binding domain) or RNP (ribonucleoprotein domain), and serine/arginine-rich motifs.


Pssm-ID: 409895 [Multi-domain]  Cd Length: 79  Bit Score: 152.92  E-value: 1.01e-43
                          10        20        30        40        50        60        70
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....
gi 109150409  571 CSTTLWVGQLDKRTTQQDVASLLEEFGPIESINMIPPRGCAYIVMVHRQDAYRALQKLSRGNYKVNQKSIKIAWALNKG 649
Cdd:cd12462     1 CSTTLWVGQVDKKATQQDLTNLFEEFGQIESINMIPPRGCAYVCMVHRQDAYRALQKLSTGSFKIGSKIIKIAWALNKG 79
RRM_SCAF4_SCAF8 cd12227
RNA recognition motif (RRM) found in SR-related and CTD-associated factor 4 (SCAF4), ...
571-649 1.22e-43

RNA recognition motif (RRM) found in SR-related and CTD-associated factor 4 (SCAF4), SR-related and CTD-associated factor 8 (SCAF8) and similar proteins; This subfamily corresponds to the RRM in a new class of SCAFs (SR-like CTD-associated factors), including SCAF4, SCAF8 and similar proteins. The biological role of SCAF4 remains unclear, but it shows high sequence similarity to SCAF8 (also termed CDC5L complex-associated protein 7, or RNA-binding motif protein 16, or CTD-binding SR-like protein RA8). SCAF8 is a nuclear matrix protein that interacts specifically with a highly serine-phosphorylated form of the carboxy-terminal domain (CTD) of the largest subunit of RNA polymerase II (pol II). The pol II CTD plays a role in coupling transcription and pre-mRNA processing. In addition, SCAF8 co-localizes primarily with transcription sites that are enriched in nuclear matrix fraction, which is known to contain proteins involved in pre-mRNA processing. Thus, SCAF8 may play a direct role in coupling with both, transcription and pre-mRNA processing, processes. SCAF8 and SCAF4 both contain a conserved N-terminal CTD-interacting domain (CID), an atypical RNA recognition motif (RRM), also termed RBD (RNA binding domain) or RNPs (ribonucleoprotein domain), and serine/arginine-rich motifs.


Pssm-ID: 409674 [Multi-domain]  Cd Length: 77  Bit Score: 152.59  E-value: 1.22e-43
                          10        20        30        40        50        60        70
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....
gi 109150409  571 CSTTLWVGQLDKRTTQQDVASLLEEFGPIESINMIPPRGCAYIVMVHRQDAYRALQKLSrgNYKVNQKSIKIAWALNKG 649
Cdd:cd12227     1 CSTTLWVGHLSKKVTQEELKNLFEEYGEIQSIDMIPPRGCAYVCMKTRQDAHRALQKLK--NHKLRGKSIKIAWAPNKG 77
CID cd03562
CID (CTD-Interacting Domain) family; The CTD-Interacting Domain (CID) is present in several ...
6-134 2.23e-43

CID (CTD-Interacting Domain) family; The CTD-Interacting Domain (CID) is present in several eukaryotic RNA-processing factors including yeast proteins, Pcf11 and Nrd1, and vertebrate proteins, CTD-associated factors 8 (SCAF8) and Regulation of nuclear pre-mRNA domain-containing proteins (such as RPRD1 and RPRD2). Pcf11 is a conserved and essential subunit of the yeast cleavage factor IA, which is required for polyadenylation-dependent 3'-RNA processing and transcription termination. Nrd1 is implicated in polyadenylation-independent 3'-RNA processing. CID binds tightly to the carboxy-terminal domain (CTD) of RNA polymerase (Pol) II (RNAP II). During transcription, RNAP II synthesizes eukaryotic messenger RNA. Transcription is coupled to RNA processing through the CTD, which consists of up to 52 repeats of the sequence Tyr1-Ser2-Pro3-Thr4-Ser5-Pro6-Ser7. CID contains eight alpha-helices in a right-handed superhelical arrangement, which closely resembles that of the VHS domains and ARM (Armadillo) repeat proteins, except for its two amino-terminal helices.


Pssm-ID: 340766  Cd Length: 123  Bit Score: 153.83  E-value: 2.23e-43
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 109150409    6 AFNQELFSLMDMkppiSRAKMILITKAAIKAIKLYKHVVQIVEKFIKKCKPEYKVPGLYVIDSIVRQSRHQfgtdKDVFG 85
Cdd:cd03562     2 AFNSKLEELSDL----SQQSITTLTKWAIHHIKHSRPIVTVIEREIRKCKPNRKLTFLYLIDSIIRNSKRK----GPEFT 73
                          90       100       110       120       130
                  ....*....|....*....|....*....|....*....|....*....|
gi 109150409   86 PRFSKNITATFQYLY-LCPSEDKSKIVRVLNLWQKNGVFKIEIIQPLLDM 134
Cdd:cd03562    74 KDFSPVIVELFKHVYsETDEDCKKKLGRVLSIWEERNVFENSVLEQLKQA 123
RPR smart00582
domain present in proteins, which are involved in regulation of nuclear pre-mRNA;
6-134 1.17e-36

domain present in proteins, which are involved in regulation of nuclear pre-mRNA;


Pssm-ID: 214731  Cd Length: 124  Bit Score: 134.33  E-value: 1.17e-36
                            10        20        30        40        50        60        70        80
                    ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 109150409      6 AFNQELFSLMdmkppISRAKMILITKAAIKAIKLYKHVVQIVEKFIKKCKPEYKVPGLYVIDSIVRQSRHQFGTD-KDVF 84
Cdd:smart00582    1 AFEQKLESLN-----NSQESIQTLTKWAIEHASHAKEIVELWEKYIKKAPVPRKLPLLYLLDSIVQNSKRKYGSEfGDEL 75
                            90       100       110       120       130
                    ....*....|....*....|....*....|....*....|....*....|
gi 109150409     85 GPRFSKNITATFQYlylCPSEDKSKIVRVLNLWQKNGVFKIEIIQPLLDM 134
Cdd:smart00582   76 GPVFQDALRRVLGA---APEELKKKIRRLLNIWEERGIFPPEVLRPLREK 122
CID pfam04818
CID domain; This domain binds to the phosphorylated C-terminal domain (CTD) of RNA polymerase ...
6-128 4.19e-30

CID domain; This domain binds to the phosphorylated C-terminal domain (CTD) of RNA polymerase II. This domain is known as the CTD-interacting domain (CID).


Pssm-ID: 461442  Cd Length: 117  Bit Score: 115.38  E-value: 4.19e-30
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 109150409     6 AFNQELFSLMDmkppiSRAKMILITKAAIKAIKLYKHVVQIVEKFIKKCKPEYKVPGLYVIDSIVRQSRHQFGTDkdvFG 85
Cdd:pfam04818    2 ALEKKLSSLNN-----SQESIQTLSKWILFHRKHAKAIVEVWEKYLKKAKPEKKLHLLYLANDVLQNSRKKGKSE---FA 73
                           90       100       110       120
                   ....*....|....*....|....*....|....*....|....
gi 109150409    86 PRFSKNITATFQYLYL-CPSEDKSKIVRVLNLWQKNGVFKIEII 128
Cdd:pfam04818   74 DAFEPVLPEAFASAYKkCDEKLKKKLERLLNIWEERNVFSPEVI 117
CID_Nrd1_like cd16984
CID (CTD-Interacting Domain) of Nrd1 and similar proteins; This subfamily includes ...
7-131 1.06e-13

CID (CTD-Interacting Domain) of Nrd1 and similar proteins; This subfamily includes Saccharomyces cerevisiae protein Nrd1, Schizosaccharomyces pombe Rpb7-binding protein Seb1, and similar proteins. Nrd1 cooperates with Nab3 and Sen1, also called the Nrd1-Nab3-Sen1 (NNS) complex, to terminate the transcription by RNA polymerase (Pol) II (RNAPII) of many noncoding RNAs (ncRNAs), including small nuclear RNAs (snRNAs), small nucleolar RNAs (snoRNAs), and cryptic unstable transcripts (CUTs). Schizosaccharomyces pombe Seb1 does not function in an NNS-like termination pathway but promotes polyadenylation site selection of coding and noncoding genes. It cotranscriptionally controls alternative polyadenylation. CID binds tightly to the carboxy-terminal domain (CTD) of RNAP II. Nrd1 CID preferentially interacts with CTD phosphorylated at Ser5. During transcription, RNAP II synthesizes eukaryotic messenger RNA. Transcription is coupled to RNA processing through the CTD, which consists of up to 52 repeats of the sequence Tyr1-Ser2-Pro3-Thr4-Ser5-Pro6-Ser7. CID contains eight alpha-helices in a right-handed superhelical arrangement, which closely resembles that of the VHS domains and ARM (Armadillo) repeat proteins, except for its two amino-terminal helices.


Pssm-ID: 340781  Cd Length: 145  Bit Score: 69.56  E-value: 1.06e-13
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 109150409    7 FNQELFSLMDMKPP-ISRAKMILITKAAIKAIKLYKHVVQIVEKFIKKCKPEYKVPGLYVIDSIVRQSRHQ---FGTDKD 82
Cdd:cd16984     3 FEATLKSLQALKPPgVSGSKIKKLTDIAVDNVQSESQIVSKLYRYFKKAPPTHKLGVLYVVDSVVRAWIDQakkNGQSID 82
                          90       100       110       120       130
                  ....*....|....*....|....*....|....*....|....*....|....*....
gi 109150409   83 VFGP---------RFSKNITATFQ-YLYLCPSEDKSKIVRVLNLWQKNGVFKIEIIQPL 131
Cdd:cd16984    83 SSAPdgtfaagvyKITELIESLMNdAIQSAPEDQKEKIKKLVDIWEKGNTFPAEMLNSI 141
RRM smart00360
RNA recognition motif;
574-642 8.43e-12

RNA recognition motif;


Pssm-ID: 214636 [Multi-domain]  Cd Length: 73  Bit Score: 61.84  E-value: 8.43e-12
                            10        20        30        40        50        60        70
                    ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*
gi 109150409    574 TLWVGQLDKRTTQQDVASLLEEFGPIESINMIP------PRGCAYIVMVHRQDAYRALQKLsrGNYKVNQKSIKI 642
Cdd:smart00360    1 TLFVGNLPPDTTEEELRELFSKFGKVESVRLVRdketgkSKGFAFVEFESEEDAEKALEAL--NGKELDGRPLKV 73
CID_Pcf11 cd16982
CID (CTD-Interacting Domain) of Pcf11; Pcf11 is conserved across eukaryotes. The best studied ...
33-117 1.67e-11

CID (CTD-Interacting Domain) of Pcf11; Pcf11 is conserved across eukaryotes. The best studied protein is Saccharomyces cerevisiae Pcf11, also called protein 1 of CF I, an essential subunit of the cleavage factor IA (CFIA) complex which is required for polyadenylation-dependent pre-mRNA 3'-end processing and RNA polymerase (Pol) II (RNAP II) transcription termination. Human Pcf11, also referred to as pre-mRNA cleavage complex 2 protein Pcf11, has been shown to enhance degradation of RNAP II-associated nascent RNA and transcriptional termination. The family also includes plant PCFS4 (Pcf11-similar-4 protein or Polyadenylation and cleavage factor homolog 4) and Caenorhabditis elegans Polyadenylation and cleavage factor homolog 11. CID binds tightly to the carboxy-terminal domain (CTD) of RNAP II. Pcf11 CID preferentially interacts with CTD phosphorylated at Ser2. During transcription, RNAP II synthesizes eukaryotic messenger RNA. Transcription is coupled to RNA processing through the CTD, which consists of up to 52 repeats of the sequence Tyr1-Ser2-Pro3-Thr4-Ser5-Pro6-Ser7. CID contains eight alpha-helices in a right-handed superhelical arrangement, which closely resembles that of the VHS domains and ARM (Armadillo) repeat proteins, except for its two amino-terminal helices.


Pssm-ID: 340779  Cd Length: 127  Bit Score: 62.58  E-value: 1.67e-11
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 109150409   33 AIKAIKLYKHVVQIVEKFIKKCKPEYKVPGLYVIDSIVRqsrhqfgTDKDVFGPRFSKNITATFQYLY-LCPSEDKSKIV 111
Cdd:cd16982    26 AEENIQAAQAIVEAIEERIRKVPPEQKLPALYLLDSIVK-------NVGGPYTSLFSPNLVDLFLDAYrLVDEKTRKKLE 98

                  ....*.
gi 109150409  112 RVLNLW 117
Cdd:cd16982    99 KLLNTW 104
RRM_SF cd00590
RNA recognition motif (RRM) superfamily; RRM, also known as RBD (RNA binding domain) or RNP ...
575-642 2.76e-10

RNA recognition motif (RRM) superfamily; RRM, also known as RBD (RNA binding domain) or RNP (ribonucleoprotein domain), is a highly abundant domain in eukaryotes found in proteins involved in post-transcriptional gene expression processes including mRNA and rRNA processing, RNA export, and RNA stability. This domain is 90 amino acids in length and consists of a four-stranded beta-sheet packed against two alpha-helices. RRM usually interacts with ssRNA, but is also known to interact with ssDNA as well as proteins. RRM binds a variable number of nucleotides, ranging from two to eight. The active site includes three aromatic side-chains located within the conserved RNP1 and RNP2 motifs of the domain. The RRM domain is found in a variety heterogeneous nuclear ribonucleoproteins (hnRNPs), proteins implicated in regulation of alternative splicing, and protein components of small nuclear ribonucleoproteins (snRNPs).


Pssm-ID: 409669 [Multi-domain]  Cd Length: 72  Bit Score: 57.29  E-value: 2.76e-10
                          10        20        30        40        50        60        70
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|...
gi 109150409  575 LWVGQLDKRTTQQDVASLLEEFGPIESINMIP-----PRGCAYIVMVHRQDAYRALQKLsrGNYKVNQKSIKI 642
Cdd:cd00590     1 LFVGNLPPDTTEEDLRELFSKFGEVVSVRIVRdrdgkSKGFAFVEFESPEDAEKALEAL--NGTELGGRPLKV 71
RRM3_NGR1_NAM8_like cd12346
RNA recognition motif 3 (RRM3) found in yeast negative growth regulatory protein NGR1 (RBP1), ...
573-644 3.23e-10

RNA recognition motif 3 (RRM3) found in yeast negative growth regulatory protein NGR1 (RBP1), yeast protein NAM8 and similar proteins; This subfamily corresponds to the RRM3 of NGR1 and NAM8. NGR1, also termed RNA-binding protein RBP1, is a putative glucose-repressible protein that binds both RNA and single-stranded DNA (ssDNA) in yeast. It may function in regulating cell growth in early log phase, possibly through its participation in RNA metabolism. NGR1 contains two RNA recognition motifs (RRMs), also termed RBDs (RNA binding domains) or RNPs (ribonucleoprotein domains), followed by a glutamine-rich stretch that may be involved in transcriptional activity. In addition, NGR1 has an asparagine-rich region near the carboxyl terminus which also harbors a methionine-rich region. The family also includes protein NAM8, which is a putative RNA-binding protein that acts as a suppressor of mitochondrial splicing deficiencies when overexpressed in yeast. It may be a non-essential component of the mitochondrial splicing machinery. Like NGR1, NAM8 contains two RRMs.


Pssm-ID: 409782 [Multi-domain]  Cd Length: 72  Bit Score: 57.33  E-value: 3.23e-10
                          10        20        30        40        50        60        70
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|..
gi 109150409  573 TTLWVGQLDKRTTQQDVASLLEEFGPIESINMIPPRGCAYIVMVHRQDAYRALQKLSrgNYKVNQKSIKIAW 644
Cdd:cd12346     2 TTVFVGGLDPNVTEEDLRVLFGPFGEIVYVKIPPGKGCGFVQFVNRASAEAAIQKLQ--GTPIGGSRIRLSW 71
PAT1 pfam09770
Topoisomerase II-associated protein PAT1; Members of this family are necessary for accurate ...
303-476 9.60e-10

Topoisomerase II-associated protein PAT1; Members of this family are necessary for accurate chromosome transmission during cell division.


Pssm-ID: 401645 [Multi-domain]  Cd Length: 846  Bit Score: 63.13  E-value: 9.60e-10
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 109150409   303 SATSPPPPQTPfgyPGDGVQQPAYTQHQSMDQFQPRMMPIQQDTMHHQVPLPPngqmpgfgllsapppfppmpqpgmpqp 382
Cdd:pfam09770  204 RAQAKKPAQQP---APAPAQPPAAPPAQQAQQQQQFPPQIQQQQQPQQQPQQP--------------------------- 253
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 109150409   383 gMAQPGLAQPgMAQPTMPQPGMPQPGMPQPGMAQPGlaQPGMAQPGMPQPA--------MPQPAMPQPGMAQPGVSPAPP 454
Cdd:pfam09770  254 -QQHPGQGHP-VTILQRPQSPQPDPAQPSIQPQAQQ--FHQQPPPVPVQPTqilqnpnrLSAARVGYPQNPQPGVQPAPA 329
                          170       180
                   ....*....|....*....|..
gi 109150409   455 VQPTFQSTFQPQNEPHSQKPHQ 476
Cdd:pfam09770  330 HQAHRQQGSFGRQAPIITHPQQ 351
RRM_RBM22 cd12224
RNA recognition motif (RRM) found in Pre-mRNA-splicing factor RBM22 and similar proteins; This ...
573-645 1.02e-09

RNA recognition motif (RRM) found in Pre-mRNA-splicing factor RBM22 and similar proteins; This subgroup corresponds to the RRM of RBM22 (also known as RNA-binding motif protein 22, or Zinc finger CCCH domain-containing protein 16), a newly discovered RNA-binding motif protein which belongs to the SLT11 gene family. SLT11 gene encoding protein (Slt11p) is a splicing factor in yeast, which is required for spliceosome assembly. Slt11p has two distinct biochemical properties: RNA-annealing and RNA-binding activities. RBM22 is the homolog of SLT11 in vertebrate. It has been reported to be involved in pre-splicesome assembly and to interact with the Ca2+-signaling protein ALG-2. It also plays an important role in embryogenesis. RBM22 contains a conserved RNA recognition motif (RRM), also known as RBD (RNA binding domain) or RNP (ribonucleoprotein domain), a zinc finger of the unusual type C-x8-C-x5-C-x3-H, and a C-terminus that is unusually rich in the amino acids Gly and Pro, including sequences of tetraprolines.


Pssm-ID: 409671 [Multi-domain]  Cd Length: 74  Bit Score: 55.75  E-value: 1.02e-09
                          10        20        30        40        50        60        70
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|...
gi 109150409  573 TTLWVGQLDKRTTQQDVASLLEEFGPIESINMIPPRGCAYIVMVHRQDAYRALQKLSrGNYKVNQKSIKIAWA 645
Cdd:cd12224     2 TTLYVGGLGDKITEKDLRDHFYQFGEIRSITVVARQQCAFVQFTTRQAAERAAERTF-NKLIIKGRRLKVKWG 73
CID_Rtt103 cd17003
CID (CTD-Interacting Domain) of yeast transcription termination factor Rtt103 and similar ...
40-131 4.04e-09

CID (CTD-Interacting Domain) of yeast transcription termination factor Rtt103 and similar proteins; Yeast transcription termination factor Rtt103 is a CID (CTD-Interacting Domain) containing protein that functions in DNA damage response. It associates with sites of DNA breaks and is essential for recovery from DNA double strand breaks in the chromosome. CID binds tightly to the carboxy-terminal domain (CTD) of RNA polymerase (Pol) II (RNAP II). Rtt103 CID preferentially interacts with CTD phosphorylated at Ser2. During transcription, RNAP II synthesizes eukaryotic messenger RNA. Transcription is coupled to RNA processing through the CTD, which consists of up to 52 repeats of the sequence Tyr1-Ser2-Pro3-Thr4-Ser5-Pro6-Ser7. CID contains eight alpha-helices in a right-handed superhelical arrangement, which closely resembles that of the VHS domains and ARM (Armadillo) repeat proteins, except for its two amino-terminal helices.


Pssm-ID: 340800  Cd Length: 127  Bit Score: 55.69  E-value: 4.04e-09
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 109150409   40 YKH---VVQIVEKFIKK--CKPEYKVPGLYVIDSIVRQSRHQfgtDKDVFGPRFSKNITATFQYLYL-CPSEDKSKIVRV 113
Cdd:cd17003    29 YRHadeIAEIWSDYLLKssVNSRRKLLLIYLANDVVQQAKAK---KKTEFIDAFSKVLPEVLEKIYPsLPSDIKKKIKRV 105
                          90
                  ....*....|....*...
gi 109150409  114 LNLWQKNGVFKIEIIQPL 131
Cdd:cd17003   106 VNVWKQRQIFSKDVIDDI 123
RRM_1 pfam00076
RNA recognition motif. (a.k.a. RRM, RBD, or RNP domain); The RRM motif is probably diagnostic ...
575-641 3.77e-08

RNA recognition motif. (a.k.a. RRM, RBD, or RNP domain); The RRM motif is probably diagnostic of an RNA binding protein. RRMs are found in a variety of RNA binding proteins, including various hnRNP proteins, proteins implicated in regulation of alternative splicing, and protein components of snRNPs. The motif also appears in a few single stranded DNA binding proteins. The RRM structure consists of four strands and two helices arranged in an alpha/beta sandwich, with a third helix present during RNA binding in some cases The C-terminal beta strand (4th strand) and final helix are hard to align and have been omitted in the SEED alignment The LA proteins have an N terminal rrm which is included in the seed. There is a second region towards the C terminus that has some features characteriztic of a rrm but does not appear to have the important structural core of a rrm. The LA proteins are one of the main autoantigens in Systemic lupus erythematosus (SLE), an autoimmune disease.


Pssm-ID: 425453 [Multi-domain]  Cd Length: 70  Bit Score: 51.46  E-value: 3.77e-08
                           10        20        30        40        50        60        70
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|..
gi 109150409   575 LWVGQLDKRTTQQDVASLLEEFGPIESINMIP-----PRGCAYIVMVHRQDAYRALQKLSrgNYKVNQKSIK 641
Cdd:pfam00076    1 LFVGNLPPDTTEEDLKDLFSKFGPIKSIRLVRdetgrSKGFAFVEFEDEEDAEKAIEALN--GKELGGRELK 70
RRM2_4_MRN1 cd12262
RNA recognition motif 2 (RRM2) and 4 (RRM4) found in RNA-binding protein MRN1 and similar ...
572-640 3.78e-08

RNA recognition motif 2 (RRM2) and 4 (RRM4) found in RNA-binding protein MRN1 and similar proteins; This subgroup corresponds to the RRM2 and RRM4 of MRN1, also termed multicopy suppressor of RSC-NHP6 synthetic lethality protein 1, or post-transcriptional regulator of 69 kDa, and is an RNA-binding protein found in yeast. Although its specific biological role remains unclear, MRN1 might be involved in translational regulation. Members in this family contain four copies of conserved RNA recognition motif (RRM), also known as RBD (RNA binding domain) or RNP (ribonucleoprotein domain).


Pssm-ID: 409706 [Multi-domain]  Cd Length: 78  Bit Score: 51.63  E-value: 3.78e-08
                          10        20        30        40        50        60
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....
gi 109150409  572 STTLWVGQLDKRTTQQDVASLLEEFGPIESINMIPPRGCAYIVMVHRQDAYRALQKLSRGNYKVNQKSI 640
Cdd:cd12262     3 SRNVYVGNLDDSLTEEEIRGILEKYGEIESIKILKEKNCAFVNYLNIANAIKAVQELPIKNPKFKKVRI 71
RRM_SR140 cd12223
RNA recognition motif (RRM) found in U2-associated protein SR140 and similar proteins; This ...
572-645 7.26e-08

RNA recognition motif (RRM) found in U2-associated protein SR140 and similar proteins; This subgroup corresponds to the RRM of SR140 (also termed U2 snRNP-associated SURP motif-containing protein orU2SURP, or 140 kDa Ser/Arg-rich domain protein) which is a putative splicing factor mainly found in higher eukaryotes. Although it is initially identified as one of the 17S U2 snRNP-associated proteins, the molecular and physiological function of SR140 remains unclear. SR140 contains an N-terminal RNA recognition motif (RRM), also termed RBD (RNA binding domain) or RNP (ribonucleoprotein domain), a SWAP/SURP domain that is found in a number of pre-mRNA splicing factors in the middle region, and a C-terminal arginine/serine-rich domain (RS domain).


Pssm-ID: 409670 [Multi-domain]  Cd Length: 84  Bit Score: 51.14  E-value: 7.26e-08
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 109150409  572 STTLWVGQLDKRTTQQDVASLLEEFGPIESINMIPPRG---------CAYIVMVHRQDAYRALQKLSrgNYKVNQKSIKI 642
Cdd:cd12223     1 TTNLYVGNLPPSVTEEVLLREFGRFGPLASVKIMWPRTeeerrrnrnCGFVAFMSRADAERAMRELN--GKDVMGYELKL 78

                  ...
gi 109150409  643 AWA 645
Cdd:cd12223    79 GWG 81
RRM_RBM18 cd12355
RNA recognition motif (RRM) found in eukaryotic RNA-binding protein 18 and similar proteins; ...
575-645 9.06e-08

RNA recognition motif (RRM) found in eukaryotic RNA-binding protein 18 and similar proteins; This subfamily corresponds to the RRM of RBM18, a putative RNA-binding protein containing a well-conserved RNA recognition motif (RRM), also termed RBD (RNA binding domain) or RNP (ribonucleoprotein domain). The biological role of RBM18 remains unclear.


Pssm-ID: 409791 [Multi-domain]  Cd Length: 80  Bit Score: 50.38  E-value: 9.06e-08
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 109150409  575 LWVGQLDKRTTQQDVASLLEEFGPIESINMI---------PPRGCAYIVMVHRQDAYRALQKLsrgNYK-VNQKSIKIAW 644
Cdd:cd12355     2 LWIGNLDPRLTEYHLLKLLSKYGKIKKFDFLfhktgplkgQPRGYCFVTFETKEEAEKAIECL---NGKlALGKKLVVRW 78

                  .
gi 109150409  645 A 645
Cdd:cd12355    79 A 79
RRM3_PUB1 cd12622
RNA recognition motif 3 (RRM3) found in yeast nuclear and cytoplasmic polyadenylated ...
573-644 2.74e-07

RNA recognition motif 3 (RRM3) found in yeast nuclear and cytoplasmic polyadenylated RNA-binding protein PUB1 and similar proteins; This subfamily corresponds to the RRM3 of yeast protein PUB1, also termed ARS consensus-binding protein ACBP-60, or poly uridylate-binding protein, or poly(U)-binding protein. PUB1 has been identified as both, a heterogeneous nuclear RNA-binding protein (hnRNP) and a cytoplasmic mRNA-binding protein (mRNP), which may be stably bound to a translationally inactive subpopulation of mRNAs within the cytoplasm. PUB1 is distributed in both, the nucleus and the cytoplasm, and binds to poly(A)+ RNA (mRNA or pre-mRNA). Although it is one of the major cellular proteins cross-linked by UV light to polyadenylated RNAs in vivo, PUB1 is nonessential for cell growth in yeast. PUB1 also binds to T-rich single stranded DNA (ssDNA); however, there is no strong evidence implicating PUB1 in the mechanism of DNA replication. PUB1 contains three RNA recognition motifs (RRMs), also termed RBDs (RNA binding domains) or RNPs (ribonucleoprotein domains), and a GAR motif (glycine and arginine rich stretch) that is located between RRM2 and RRM3.


Pssm-ID: 410033 [Multi-domain]  Cd Length: 74  Bit Score: 48.99  E-value: 2.74e-07
                          10        20        30        40        50        60        70
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|..
gi 109150409  573 TTLWVGQLDKRTTQQDVASLLEEFGPIESINMIPPRGCAYIVMVHRQDAYRALQKLSrgNYKVNQKSIKIAW 644
Cdd:cd12622     1 TTVYVGNLPPEVTQADLIPLFQNFGVIEEVRVQRDKGFGFVKYDTHEEAALAIQQLN--GQPFLGRPIKCSW 70
RRM1_2_CELF1-6_like cd12361
RNA recognition motif 1 (RRM1) and 2 (RRM2) found in CELF/Bruno-like family of RNA binding ...
575-628 3.34e-07

RNA recognition motif 1 (RRM1) and 2 (RRM2) found in CELF/Bruno-like family of RNA binding proteins and plant flowering time control protein FCA; This subfamily corresponds to the RRM1 and RRM2 domains of the CUGBP1 and ETR-3-like factors (CELF) as well as plant flowering time control protein FCA. CELF, also termed BRUNOL (Bruno-like) proteins, is a family of structurally related RNA-binding proteins involved in regulation of pre-mRNA splicing in the nucleus, and control of mRNA translation and deadenylation in the cytoplasm. The family contains six members: CELF-1 (also known as BRUNOL-2, CUG-BP1, NAPOR, EDEN-BP), CELF-2 (also known as BRUNOL-3, ETR-3, CUG-BP2, NAPOR-2), CELF-3 (also known as BRUNOL-1, TNRC4, ETR-1, CAGH4, ER DA4), CELF-4 (BRUNOL-4), CELF-5 (BRUNOL-5) and CELF-6 (BRUNOL-6). They all contain three highly conserved RNA recognition motifs (RRMs), also known as RBDs (RNA binding domains) or RNPs (ribonucleoprotein domains): two consecutive RRMs (RRM1 and RRM2) situated in the N-terminal region followed by a linker region and the third RRM (RRM3) close to the C-terminus of the protein. The low sequence conservation of the linker region is highly suggestive of a large variety in the co-factors that associate with the various CELF family members. Based on both, sequence similarity and function, the CELF family can be divided into two subfamilies, the first containing CELFs 1 and 2, and the second containing CELFs 3, 4, 5, and 6. The different CELF proteins may act through different sites on at least some substrates. Furthermore, CELF proteins may interact with each other in varying combinations to influence alternative splicing in different contexts. This subfamily also includes plant flowering time control protein FCA that functions in the posttranscriptional regulation of transcripts involved in the flowering process. FCA contains two RRMs, and a WW protein interaction domain.


Pssm-ID: 409796 [Multi-domain]  Cd Length: 77  Bit Score: 48.77  E-value: 3.34e-07
                          10        20        30        40        50        60
                  ....*....|....*....|....*....|....*....|....*....|....*....|
gi 109150409  575 LWVGQLDKRTTQQDVASLLEEFGPIESINMI------PPRGCAYIVMVHRQDAYRALQKL 628
Cdd:cd12361     2 LFVGMIPKTASEEDVRPLFEQFGNIEEVQILrdkqtgQSKGCAFVTFSTREEALRAIEAL 61
PRK10263 PRK10263
DNA translocase FtsK; Provisional
322-459 6.28e-07

DNA translocase FtsK; Provisional


Pssm-ID: 236669 [Multi-domain]  Cd Length: 1355  Bit Score: 53.94  E-value: 6.28e-07
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 109150409  322 QQPAYTQHQSMDQFQPRMMPIQQDTMHHQvPLPPNGQMPgfgllsaPPPFPPMPQPGMPQPGMAQPGLAQPGMAQPTMPQ 401
Cdd:PRK10263  714 EQPAGANPFSLDDFEFSPMKALLDDGPHE-PLFTPIVEP-------VQQPQQPVAPQQQYQQPQQPVAPQPQYQQPQQPV 785
                          90       100       110       120       130       140
                  ....*....|....*....|....*....|....*....|....*....|....*....|..
gi 109150409  402 PGMPQPGMPQ-PGMAQPGLAQPgmAQPGMPQPAMPQP---AMPQPGMAQPGVSPAPPVQPTF 459
Cdd:PRK10263  786 APQPQYQQPQqPVAPQPQYQQP--QQPVAPQPQYQQPqqpVAPQPQYQQPQQPVAPQPQDTL 845
RRM1_TIA1_like cd12352
RNA recognition motif 1 (RRM1) found in granule-associated RNA binding proteins p40-TIA-1 and ...
575-645 1.45e-06

RNA recognition motif 1 (RRM1) found in granule-associated RNA binding proteins p40-TIA-1 and TIAR; This subfamily corresponds to the RRM1 of nucleolysin TIA-1 isoform p40 (p40-TIA-1 or TIA-1) and nucleolysin TIA-1-related protein (TIAR), both of which are granule-associated RNA binding proteins involved in inducing apoptosis in cytotoxic lymphocyte (CTL) target cells. TIA-1 and TIAR share high sequence similarity. They are expressed in a wide variety of cell types. TIA-1 can be phosphorylated by a serine/threonine kinase that is activated during Fas-mediated apoptosis.TIAR is mainly localized in the nucleus of hematopoietic and nonhematopoietic cells. It is translocated from the nucleus to the cytoplasm in response to exogenous triggers of apoptosis. Both, TIA-1 and TIAR, bind specifically to poly(A) but not to poly(C) homopolymers. They are composed of three N-terminal highly homologous RNA recognition motifs (RRMs), also termed RBDs (RNA binding domains) or RNPs (ribonucleoprotein domains), and a glutamine-rich C-terminal auxiliary domain containing a lysosome-targeting motif. TIA-1 and TIAR interact with RNAs containing short stretches of uridylates and their RRM2 can mediate the specific binding to uridylate-rich RNAs. The C-terminal auxiliary domain may be responsible for interacting with other proteins. In addition, TIA-1 and TIAR share a potential serine protease-cleavage site (Phe-Val-Arg) localized at the junction between their RNA binding domains and their C-terminal auxiliary domains.


Pssm-ID: 409788 [Multi-domain]  Cd Length: 73  Bit Score: 47.02  E-value: 1.45e-06
                          10        20        30        40        50        60        70
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*
gi 109150409  575 LWVGQLDKRTTQQDVASLLEEFGPIESINMIPPRGC----AYIVMVHRQDAYRALQKLSRgnYKVNQKSIKIAWA 645
Cdd:cd12352     1 LYVGNLDRQVTEDLILQLFSQIGPCKSCKMITEHGGndpyCFVEFYEHNHAAAALQAMNG--RKILGKEVKVNWA 73
RRM_snRNP70 cd12236
RNA recognition motif (RRM) found in U1 small nuclear ribonucleoprotein 70 kDa (U1-70K) and ...
574-624 3.81e-06

RNA recognition motif (RRM) found in U1 small nuclear ribonucleoprotein 70 kDa (U1-70K) and similar proteins; This subfamily corresponds to the RRM of U1-70K, also termed snRNP70, a key component of the U1 snRNP complex, which is one of the key factors facilitating the splicing of pre-mRNA via interaction at the 5' splice site, and is involved in regulation of polyadenylation of some viral and cellular genes, enhancing or inhibiting efficient poly(A) site usage. U1-70K plays an essential role in targeting the U1 snRNP to the 5' splice site through protein-protein interactions with regulatory RNA-binding splicing factors, such as the RS protein ASF/SF2. Moreover, U1-70K protein can specifically bind to stem-loop I of the U1 small nuclear RNA (U1 snRNA) contained in the U1 snRNP complex. It also mediates the binding of U1C, another U1-specific protein, to the U1 snRNP complex. U1-70K contains a conserved RNA recognition motif (RRM), also termed RBD (RNA binding domain) or RNP (ribonucleoprotein domain), followed by an adjacent glycine-rich region at the N-terminal half, and two serine/arginine-rich (SR) domains at the C-terminal half. The RRM is responsible for the binding of stem-loop I of U1 snRNA molecule. Additionally, the most prominent immunodominant region that can be recognized by auto-antibodies from autoimmune patients may be located within the RRM. The SR domains are involved in protein-protein interaction with SR proteins that mediate 5' splice site recognition. For instance, the first SR domain is necessary and sufficient for ASF/SF2 Binding. The family also includes Drosophila U1-70K that is an essential splicing factor required for viability in flies, but its SR domain is dispensable. The yeast U1-70k doesn't contain easily recognizable SR domains and shows low sequence similarity in the RRM region with other U1-70k proteins and therefore not included in this family. The RRM domain is dispensable for yeast U1-70K function.


Pssm-ID: 409682 [Multi-domain]  Cd Length: 91  Bit Score: 46.46  E-value: 3.81e-06
                          10        20        30        40        50        60
                  ....*....|....*....|....*....|....*....|....*....|....*....|
gi 109150409  574 TLWVGQLDKRTTQQDVASLLEEFGPIESINMI------PPRGCAYIVMVHRQD---AYRA 624
Cdd:cd12236     3 TLFVARLSYDTTESKLRREFEKYGPIKRVRLVrdkktgKSRGYAFIEFEHERDmkaAYKH 62
RRM3_TIA1_like cd12354
RNA recognition motif 2 (RRM2) found in granule-associated RNA binding proteins (p40-TIA-1 and ...
573-644 6.41e-06

RNA recognition motif 2 (RRM2) found in granule-associated RNA binding proteins (p40-TIA-1 and TIAR), and yeast nuclear and cytoplasmic polyadenylated RNA-binding protein PUB1; This subfamily corresponds to the RRM3 of TIA-1, TIAR, and PUB1. Nucleolysin TIA-1 isoform p40 (p40-TIA-1 or TIA-1) and nucleolysin TIA-1-related protein (TIAR) are granule-associated RNA binding proteins involved in inducing apoptosis in cytotoxic lymphocyte (CTL) target cells. They share high sequence similarity and are expressed in a wide variety of cell types. TIA-1 can be phosphorylated by a serine/threonine kinase that is activated during Fas-mediated apoptosis.TIAR is mainly localized in the nucleus of hematopoietic and nonhematopoietic cells. It is translocated from the nucleus to the cytoplasm in response to exogenous triggers of apoptosis. Both TIA-1 and TIAR bind specifically to poly(A) but not to poly(C) homopolymers. They are composed of three N-terminal highly homologous RNA recognition motifs (RRMs), also termed RBDs (RNA binding domains) or RNPs (ribonucleoprotein domains), and a glutamine-rich C-terminal auxiliary domain containing a lysosome-targeting motif. TIA-1 and TIAR interact with RNAs containing short stretches of uridylates and their RRM2 can mediate the specific binding to uridylate-rich RNAs. The C-terminal auxiliary domain may be responsible for interacting with other proteins. In addition, TIA-1 and TIAR share a potential serine protease-cleavage site (Phe-Val-Arg) localized at the junction between their RNA binding domains and their C-terminal auxiliary domains. This subfamily also includes a yeast nuclear and cytoplasmic polyadenylated RNA-binding protein PUB1, termed ARS consensus-binding protein ACBP-60, or poly uridylate-binding protein, or poly(U)-binding protein, which has been identified as both a heterogeneous nuclear RNA-binding protein (hnRNP) and a cytoplasmic mRNA-binding protein (mRNP). It may be stably bound to a translationally inactive subpopulation of mRNAs within the cytoplasm. PUB1 is distributed in both, the nucleus and the cytoplasm, and binds to poly(A)+ RNA (mRNA or pre-mRNA). Although it is one of the major cellular proteins cross-linked by UV light to polyadenylated RNAs in vivo, PUB1 is nonessential for cell growth in yeast. PUB1 also binds to T-rich single stranded DNA (ssDNA); however, there is no strong evidence implicating PUB1 in the mechanism of DNA replication. PUB1 contains three RRMs, and a GAR motif (glycine and arginine rich stretch) that is located between RRM2 and RRM3.


Pssm-ID: 409790 [Multi-domain]  Cd Length: 71  Bit Score: 44.97  E-value: 6.41e-06
                          10        20        30        40        50        60        70
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|..
gi 109150409  573 TTLWVGQLDKRTTQQDVASLLEEFGPIESINMIPPRGCAYIVMVHRQDAYRALQKLSrgNYKVNQKSIKIAW 644
Cdd:cd12354     1 TTVYVGNITKGLTEALLQQTFSPFGQILEVRVFPDKGYAFIRFDSHEAATHAIVSVN--GTIINGQAVKCSW 70
PRK14971 PRK14971
DNA polymerase III subunit gamma/tau;
391-489 8.01e-06

DNA polymerase III subunit gamma/tau;


Pssm-ID: 237874 [Multi-domain]  Cd Length: 614  Bit Score: 50.16  E-value: 8.01e-06
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 109150409  391 QPGMAQPTMPQPGM----PQPGMPQPGMAQPGLAQPGMAQPGMPQPAMPQPAMPQPGMAQPGVSPAPPvqPTFQstfQPQ 466
Cdd:PRK14971  391 QPSAAAAASPSPSQssaaAQPSAPQSATQPAGTPPTVSVDPPAAVPVNPPSTAPQAVRPAQFKEEKKI--PVSK---VSS 465
                          90       100
                  ....*....|....*....|...
gi 109150409  467 NEPHSQKPHQQEMEVEQPCVTEV 489
Cdd:PRK14971  466 LGPSTLRPIQEKAEQATGNIKEA 488
RRM_eIF3G_like cd12408
RNA recognition motif (RRM) found in eukaryotic translation initiation factor 3 subunit G ...
574-634 1.13e-05

RNA recognition motif (RRM) found in eukaryotic translation initiation factor 3 subunit G (eIF-3G) and similar proteins; This subfamily corresponds to the RRM of eIF-3G and similar proteins. eIF-3G, also termed eIF-3 subunit 4, or eIF-3-delta, or eIF3-p42, or eIF3-p44, is the RNA-binding subunit of eIF3, a large multisubunit complex that plays a central role in the initiation of translation by binding to the 40 S ribosomal subunit and promoting the binding of methionyl-tRNAi and mRNA. eIF-3G binds 18 S rRNA and beta-globin mRNA, and therefore appears to be a nonspecific RNA-binding protein. eIF-3G is one of the cytosolic targets and interacts with mature apoptosis-inducing factor (AIF). eIF-3G contains one RNA recognition motif (RRM), also termed RBD (RNA binding domain) or RNP (ribonucleoprotein domain). This family also includes yeast eIF3-p33, a homolog of vertebrate eIF-3G, plays an important role in the initiation phase of protein synthesis in yeast. It binds both, mRNA and rRNA, fragments due to an RRM near its C-terminus.


Pssm-ID: 409842 [Multi-domain]  Cd Length: 76  Bit Score: 44.42  E-value: 1.13e-05
                          10        20        30        40        50        60
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*...
gi 109150409  574 TLWVGQLDKRTTQQDVASLLEEFGPIESINmIP-------PRGCAYIVMVHRQDAYRALQKLSRGNYK 634
Cdd:cd12408     1 TIRVTNLSEDATEEDLRELFRPFGPISRVY-LAkdketgqSKGFAFVTFETREDAERAIEKLNGFGYD 67
PHA03247 PHA03247
large tegument protein UL36; Provisional
389-474 1.20e-05

large tegument protein UL36; Provisional


Pssm-ID: 223021 [Multi-domain]  Cd Length: 3151  Bit Score: 49.94  E-value: 1.20e-05
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 109150409  389 LAQPGMAQPTMPQPGMPQPGMPQPGMAQPGLAQPGMAQPGMPQPAMPQPAMPQPGMAQPGVSPAP------PVQPTFQST 462
Cdd:PHA03247 2898 FALPPDQPERPPQPQAPPPPQPQPQPPPPPQPQPPPPPPPRPQPPLAPTTDPAGAGEPSGAVPQPwlgalvPGRVAVPRF 2977
                          90
                  ....*....|..
gi 109150409  463 FQPQNEPHSQKP 474
Cdd:PHA03247 2978 RVPQPAPSREAP 2989
RRM1_Crp79 cd21619
RNA recognition motif 1 (RRM1) found in Schizosaccharomyces pombe mRNA export factor Crp79 and ...
572-643 1.29e-05

RNA recognition motif 1 (RRM1) found in Schizosaccharomyces pombe mRNA export factor Crp79 and similar proteins; Crp79, also called meiotic expression up-regulated protein 5 (Mug5), or polyadenylate-binding protein crp79, or PABP, or poly(A)-binding protein, is an auxiliary mRNA export factor that binds the poly(A) tail of mRNA and is involved in the export of mRNA from the nucleus to the cytoplasm. Members in this family contain three RNA recognition motifs (RRMs), also termed RBDs (RNA binding domains) or RNPs (ribonucleoprotein domains). The model corresponds to the first RRM motif.


Pssm-ID: 410198 [Multi-domain]  Cd Length: 78  Bit Score: 44.44  E-value: 1.29e-05
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 109150409  572 STTLWVGQLDKRTTQQDVASLLEEFGPIESInMIPP---------RGCAYIVMVHRQDAYRALQKLsrGNYKVNQKSIKI 642
Cdd:cd21619     1 SNTIYVGNIDMTINEDALEKIFSRYGQVESV-RRPPihtdkadrtTGFGFIKYTDAESAERAMQQA--DGILLGRRRLVV 77

                  .
gi 109150409  643 A 643
Cdd:cd21619    78 R 78
CID_RPRD_like cd16981
CID (CTD-Interacting Domain) of Regulation of nuclear pre-mRNA domain-containing proteins; ...
38-134 1.40e-05

CID (CTD-Interacting Domain) of Regulation of nuclear pre-mRNA domain-containing proteins; This family is composed of Regulation of nuclear pre-mRNA domain-containing proteins 1A (RPRD1A), 1B (RPRD1B), 2 (RPRD2), yeast Rtt103, and similar proteins. RPRD1A, RPRD1B, and RPRD2 are CID (CTD-Interacting Domain) containing proteins that co-purify with RNA polymerase (Pol) II (RNAP II) and three other RNAP II-associated proteins, RPAP2, GRINL1A and RECQL5, but not with the Mediator complex. Yeast transcription termination factor Rtt103 is a CID containing protein that functions in DNA damage response. CID binds tightly to the carboxy-terminal domain (CTD) of RNAP II. During transcription, RNAP II synthesizes eukaryotic messenger RNA. Transcription is coupled to RNA processing through the CTD, which consists of up to 52 repeats of the sequence Tyr1-Ser2-Pro3-Thr4-Ser5-Pro6-Ser7. CID contains eight alpha-helices in a right-handed superhelical arrangement, which closely resembles that of the VHS domains and ARM (Armadillo) repeat proteins, except for its two amino-terminal helices.


Pssm-ID: 340778  Cd Length: 125  Bit Score: 45.65  E-value: 1.40e-05
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 109150409   38 KLYKHVVQIVEKFIKKCKPEYKVPGLYVIDSIVRQSRHQFGTDkdvFGPRFSKNITATFQYLYLCPSED-KSKIVRVLNL 116
Cdd:cd16981    30 KHAKQIVKIWLKELKKAKPERKLTLLYLANDVLQNSRRKGAPE---FVEAFKKVLPEALALVRSEGDESvRKKVLRVLNI 106
                          90
                  ....*....|....*...
gi 109150409  117 WQKNGVFKIEIIQPLLDM 134
Cdd:cd16981   107 WEERNVFGSEFLAELRAI 124
RRM2_CELF3_4_5_6 cd12635
RNA recognition motif 2 (RRM2) found in CUGBP Elav-like family member CELF-3, CELF-4, CELF-5, ...
575-628 2.41e-05

RNA recognition motif 2 (RRM2) found in CUGBP Elav-like family member CELF-3, CELF-4, CELF-5, CELF-6 and similar proteins; This subgroup corresponds to the RRM2 of CELF-3, CELF-4, CELF-5, and CELF-6, all of which belong to the CUGBP1 and ETR-3-like factors (CELF) or BRUNOL (Bruno-like) family of RNA-binding proteins that display dual nuclear and cytoplasmic localizations and have been implicated in the regulation of pre-mRNA splicing and in the control of mRNA translation and deadenylation. CELF-3, expressed in brain and testis only, is also known as bruno-like protein 1 (BRUNOL-1), or CAG repeat protein 4, or CUG-BP- and ETR-3-like factor 3, or embryonic lethal abnormal vision (ELAV)-type RNA-binding protein 1 (ETR-1), or expanded repeat domain protein CAG/CTG 4, or trinucleotide repeat-containing gene 4 protein (TNRC4). It plays an important role in the pathogenesis of tauopathies. CELF-3 contains three highly conserved RNA recognition motifs (RRMs), also known as RBDs (RNA binding domains) or RNPs (ribonucleoprotein domains): two consecutive RRMs (RRM1 and RRM2) situated in the N-terminal region followed by a linker region and the third RRM (RRM3) close to the C-terminus of the protein. The effect of CELF-3 on tau splicing is mediated mainly by the RNA-binding activity of RRM2. The divergent linker region might mediate the interaction of CELF-3 with other proteins regulating its activity or involved in target recognition. CELF-4, being highly expressed throughout the brain and in glandular tissues, moderately expressed in heart, skeletal muscle, and liver, is also known as bruno-like protein 4 (BRUNOL-4), or CUG-BP- and ETR-3-like factor 4. Like CELF-3, CELF-4 also contain three highly conserved RRMs. The splicing activation or repression activity of CELF-4 on some specific substrates is mediated by its RRM1/RRM2. On the other hand, both RRM1 and RRM2 of CELF-4 can activate cardiac troponin T (cTNT) exon 5 inclusion. CELF-5, expressed in brain, is also known as bruno-like protein 5 (BRUNOL-5), or CUG-BP- and ETR-3-like factor 5. Although its biological role remains unclear, CELF-5 shares same domain architecture with CELF-3. CELF-6, being strongly expressed in kidney, brain, and testis, is also known as bruno-like protein 6 (BRUNOL-6), or CUG-BP- and ETR-3-like factor 6. It activates exon inclusion of a cardiac troponin T minigene in transient transfection assays in a muscle-specific splicing enhancer (MSE)-dependent manner and can activate inclusion via multiple copies of a single element, MSE2. CELF-6 also promotes skipping of exon 11 of insulin receptor, a known target of CELF activity that is expressed in kidney. In addition to three highly conserved RRMs, CELF-6 also possesses numerous potential phosphorylation sites, a potential nuclear localization signal (NLS) at the C terminus, and an alanine-rich region within the divergent linker region.


Pssm-ID: 410043 [Multi-domain]  Cd Length: 81  Bit Score: 43.56  E-value: 2.41e-05
                          10        20        30        40        50
                  ....*....|....*....|....*....|....*....|....*....|....*....
gi 109150409  575 LWVGQLDKRTTQQDVASLLEEFGPIESINMI-----PPRGCAYIVMVHRQDAYRALQKL 628
Cdd:cd12635     4 LFVGMLGKQQSEDDVRRLFEPFGSIEECTILrgpdgNSKGCAFVKFSSHAEAQAAINAL 62
Glutenin_hmw pfam03157
High molecular weight glutenin subunit; Members of this family include high molecular weight ...
305-483 2.99e-05

High molecular weight glutenin subunit; Members of this family include high molecular weight subunits of glutenin. This group of gluten proteins is thought to be largely responsible for the elastic properties of gluten, and hence, doughs. Indeed, glutenin high molecular weight subunits are classified as elastomeric proteins, because the glutenin network can withstand significant deformations without breaking, and return to the original conformation when the stress is removed. Elastomeric proteins differ considerably in amino acid sequence, but they are all polymers whose subunits consist of elastomeric domains, composed of repeated motifs, and non-elastic domains that mediate cross-linking between the subunits. The elastomeric domain motifs are all rich in glycine residues in addition to other hydrophobic residues. High molecular weight glutenin subunits have an extensive central elastomeric domain, flanked by two terminal non-elastic domains that form disulphide cross-links. The central elastomeric domain is characterized by the following three repeated motifs: PGQGQQ, GYYPTS[P/L]QQ, GQQ. It possesses overlapping beta-turns within and between the repeated motifs, and assumes a regular helical secondary structure with a diameter of approx. 1.9 nm and a pitch of approx. 1.5 nm.


Pssm-ID: 367362 [Multi-domain]  Cd Length: 786  Bit Score: 48.41  E-value: 2.99e-05
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 109150409   305 TSPPPPQTPfGYPGDGVQQPAYTQHQSMDQFQPRMMPIQQDTMHHQVPLPPNGQ-MPGFGLLSAPPPFPPMPQPGMPQPG 383
Cdd:pfam03157  372 TSQQQPQQG-QQPEQGQQGQQQGQGQQGQQPGQGQQPGQGQPGYYPTSPQQSGQgQPGYYPTSPQQSGQGQQPGQGQQPG 450
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 109150409   384 MAQPGLAQ-PGMAQPTMpQPGMP----QPGMPQPGMAQPGLAQPGMA-------QPGMPQPAMPQPAMPQPGMAQPGVSP 451
Cdd:pfam03157  451 QEQPGQGQqPGQGQQGQ-QPGQPeqgqQPGQGQPGYYPTSPQQSGQGqqlgqwqQQGQGQPGYYPTSPLQPGQGQPGYYP 529
                          170       180       190
                   ....*....|....*....|....*....|..
gi 109150409   452 APPVQPTFQSTFQPQNEPHSQKPHQQEMEVEQ 483
Cdd:pfam03157  530 TSPQQPGQGQQLGQLQQPTQGQQGQQSGQGQQ 561
CID_RPRD1A cd17011
CID (CTD-Interacting Domain) of Regulation of nuclear pre-mRNA domain-containing protein 1A; ...
41-131 3.27e-05

CID (CTD-Interacting Domain) of Regulation of nuclear pre-mRNA domain-containing protein 1A; Regulation of nuclear pre-mRNA domain-containing protein 1A (RPRD1A) is also called Cyclin-dependent kinase inhibitor 2B-related protein or p15INK4B-related protein (P15RS). RPRD1A is a CID (CTD-Interacting Domain) containing protein that co-purifies with RNA polymerase (Pol) II (RNAP II) and three other RNAP II-associated proteins, RPAP2, GRINL1A and RECQL5, but not with the Mediator complex. CID binds tightly to the carboxy-terminal domain (CTD) of RNAP II. During transcription, RNAP II synthesizes eukaryotic messenger RNA. Transcription is coupled to RNA processing through the CTD, which consists of up to 52 repeats of the sequence Tyr1-Ser2-Pro3-Thr4-Ser5-Pro6-Ser7. RPRD1A form homodimers and heterodimers with RPRD1B through their coiled-coil domains. Both RPRD1A and RPRD1B associate directly with RPAP2 phosphatase and serve as CTD scaffolds to coordinate the dephosphorylation of phospho-S5 by RPAP2. CID contains eight alpha-helices in a right-handed superhelical arrangement, which closely resembles that of the VHS domains and ARM (Armadillo) repeat proteins, except for its two amino-terminal helices.


Pssm-ID: 340808  Cd Length: 128  Bit Score: 44.65  E-value: 3.27e-05
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 109150409   41 KH---VVQIVEKFIKKCKPEYKVPGLYVIDSIVRQSRHQfgtdkdvfGPRFSKN----ITATFQYLYLCPSED-KSKIVR 112
Cdd:cd17011    32 KHsrpIVTVWERELRKAKPNRKLTFLYLANDVIQNSKRK--------GPEFTKDfapvIVEAFKHVSSETDEScKKHLGR 103
                          90
                  ....*....|....*....
gi 109150409  113 VLNLWQKNGVFKIEIIQPL 131
Cdd:cd17011   104 VLSIWEERSVYENDVLEQL 122
PABP-1234 TIGR01628
polyadenylate binding protein, human types 1, 2, 3, 4 family; These eukaryotic proteins ...
574-644 3.62e-05

polyadenylate binding protein, human types 1, 2, 3, 4 family; These eukaryotic proteins recognize the poly-A of mRNA and consists of four tandem RNA recognition domains at the N-terminus (rrm: pfam00076) followed by a PABP-specific domain (pfam00658) at the C-terminus. The protein is involved in the transport of mRNA's from the nucleus to the cytoplasm. There are four paralogs in Homo sapiens which are expressed in testis, platelets, broadly expressed and of unknown tissue range.


Pssm-ID: 130689 [Multi-domain]  Cd Length: 562  Bit Score: 47.88  E-value: 3.62e-05
                           10        20        30        40        50        60        70
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*...
gi 109150409   574 TLWVGQLDKRTTQQDVASLLEEFGPIESI----NMIPPR--GCAYIVMVHRQDAYRALQKLsrgNYK-VNQKSIKIAW 644
Cdd:TIGR01628    2 SLYVGDLDPDVTEAKLYDLFKPFGPVLSVrvcrDSVTRRslGYGYVNFQNPADAERALETM---NFKrLGGKPIRIMW 76
PRK10263 PRK10263
DNA translocase FtsK; Provisional
309-455 4.56e-05

DNA translocase FtsK; Provisional


Pssm-ID: 236669 [Multi-domain]  Cd Length: 1355  Bit Score: 47.77  E-value: 4.56e-05
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 109150409  309 PPQTPFGYPGDGVQQPAYTQHQSMDQFQPrmMPIQQDTMHHQVPLPPNGQMpgfgllsapppfppmpqPGMPQPGMAQPG 388
Cdd:PRK10263  743 PLFTPIVEPVQQPQQPVAPQQQYQQPQQP--VAPQPQYQQPQQPVAPQPQY-----------------QQPQQPVAPQPQ 803
                          90       100       110       120       130       140
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*..
gi 109150409  389 LAQPgmAQPTMPQPGMPQPgmPQPGMAQPGLAQPGMAQPGMPQPAMPQPAMPQPGMAQPGVSPAPPV 455
Cdd:PRK10263  804 YQQP--QQPVAPQPQYQQP--QQPVAPQPQYQQPQQPVAPQPQDTLLHPLLMRNGDSRPLHKPTTPL 866
PHA03378 PHA03378
EBNA-3B; Provisional
385-476 6.42e-05

EBNA-3B; Provisional


Pssm-ID: 223065 [Multi-domain]  Cd Length: 991  Bit Score: 47.37  E-value: 6.42e-05
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 109150409  385 AQPGLAQPGMAQPTMPQPGMPQPGMPQPGMAQPGLAQPGMAQPGMPQPAMPQPAMPQPGMAQPgvSPAPPVQPTFQStfQ 464
Cdd:PHA03378  714 AQRPAAATGRARPPAAAPGRARPPAAAPGRARPPAAAPGRARPPAAAPGRARPPAAAPGAPTP--QPPPQAPPAPQQ--R 789
                          90
                  ....*....|..
gi 109150409  465 PQNEPHSQKPHQ 476
Cdd:PHA03378  790 PRGAPTPQPPPQ 801
RRM_TRA2 cd12363
RNA recognition motif (RRM) found in transformer-2 protein homolog TRA2-alpha, TRA2-beta and ...
572-628 1.77e-04

RNA recognition motif (RRM) found in transformer-2 protein homolog TRA2-alpha, TRA2-beta and similar proteins; This subfamily corresponds to the RRM of two mammalian homologs of Drosophila transformer-2 (Tra2), TRA2-alpha, TRA2-beta (also termed SFRS10), and similar proteins found in eukaryotes. TRA2-alpha is a 40-kDa serine/arginine-rich (SR) protein that specifically binds to gonadotropin-releasing hormone (GnRH) exonic splicing enhancer on exon 4 (ESE4) and is necessary for enhanced GnRH pre-mRNA splicing. It strongly stimulates GnRH intron A excision in a dose-dependent manner. In addition, TRA2-alpha can interact with either 9G8 or SRp30c, which may also be crucial for ESE-dependent GnRH pre-mRNA splicing. TRA2-beta is a serine/arginine-rich (SR) protein that controls the pre-mRNA alternative splicing of the calcitonin/calcitonin gene-related peptide (CGRP), the survival motor neuron 1 (SMN1) protein and the tau protein. Both, TRA2-alpha and TRA2-beta, contains a well conserved RNA recognition motif (RRM), also termed RBD (RNA binding domain) or RNP (ribonucleoprotein domain), flanked by the N- and C-terminal arginine/serine (RS)-rich regions.


Pssm-ID: 409798 [Multi-domain]  Cd Length: 80  Bit Score: 41.06  E-value: 1.77e-04
                          10        20        30        40        50        60
                  ....*....|....*....|....*....|....*....|....*....|....*....|...
gi 109150409  572 STTLWVGQLDKRTTQQDVASLLEEFGPIESINMI------PPRGCAYIVMVHRQDAYRALQKL 628
Cdd:cd12363     1 SRCLGVFGLSLYTTERDLREVFSRYGPIEKVQVVydqqtgRSRGFGFVYFESVEDAKEAKERL 63
RRM1_3_MRN1 cd12261
RNA recognition motif 1 (RRM1) and 3 (RRM3) found in RNA-binding protein MRN1 and similar ...
574-645 1.83e-04

RNA recognition motif 1 (RRM1) and 3 (RRM3) found in RNA-binding protein MRN1 and similar proteins; This subfamily corresponds to the RRM1 and RRM3 of MRN1, also termed multicopy suppressor of RSC-NHP6 synthetic lethality protein 1, or post-transcriptional regulator of 69 kDa, which is an RNA-binding protein found in yeast. Although its specific biological role remains unclear, MRN1 might be involved in translational regulation. Members in this family contain four copies of conserved RNA recognition motif (RRM), also known as RBD (RNA binding domain) or RNP (ribonucleoprotein domain).


Pssm-ID: 240707 [Multi-domain]  Cd Length: 73  Bit Score: 41.05  E-value: 1.83e-04
                          10        20        30        40        50        60        70
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|..
gi 109150409  574 TLWVGQLDKRTTQQDVASLLEEfGPIESINMIPPRGCAYIVMVHRQDAYRALQKLSRGNYKVNQKSIKIAWA 645
Cdd:cd12261     2 TVYLGNLPEDTTIRDILSAIRG-GPLESIKLLPTKNSATVSFLDEAAAEAFYAYARNNGFYINGKRIKVGWG 72
Med15 pfam09606
ARC105 or Med15 subunit of Mediator complex non-fungal; The approx. 70 residue Med15 domain of ...
304-484 2.82e-04

ARC105 or Med15 subunit of Mediator complex non-fungal; The approx. 70 residue Med15 domain of the ARC-Mediator co-activator is a three-helix bundle with marked similarity to the KIX domain. The sterol regulatory element binding protein (SREBP) family of transcription activators use the ARC105 subunit to activate target genes in the regulation of cholesterol and fatty acid homeostasis. In addition, Med15 is a critical transducer of gene activation signals that control early metazoan development.


Pssm-ID: 312941 [Multi-domain]  Cd Length: 732  Bit Score: 45.00  E-value: 2.82e-04
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 109150409   304 ATSPPPPQTPFGYPGDGVQQPAYTQHQSMDQFQPRMMPIQQDTMHHQVPLPPNGQMPGfgllsapPPFPPMPQPGMPQPG 383
Cdd:pfam09606  153 GQAGGMMQPSSGQPGSGTPNQMGPNGGPGQGQAGGMNGGQQGPMGGQMPPQMGVPGMP-------GPADAGAQMGQQAQA 225
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 109150409   384 MAQPGLAQPGMAQPTMP---QPGMPQPGMPQPGMA-----QPGLAQPGMAQPGMpqPAMPQPAMPQ-PGMAQPGVSPAPP 454
Cdd:pfam09606  226 NGGMNPQQMGGAPNQVAmqqQQPQQQGQQSQLGMGinqmqQMPQGVGGGAGQGG--PGQPMGPPGQqPGAMPNVMSIGDQ 303
                          170       180       190
                   ....*....|....*....|....*....|
gi 109150409   455 VQPTFQSTFQPQNEPHSQKPHQQEMEVEQP 484
Cdd:pfam09606  304 NNYQQQQTRQQQQQQGGNHPAAHQQQMNQS 333
RRM2_MRN1 cd12523
RNA recognition motif 2 (RRM2) found in RNA-binding protein MRN1 and similar proteins; This ...
572-629 2.95e-04

RNA recognition motif 2 (RRM2) found in RNA-binding protein MRN1 and similar proteins; This subgroup corresponds to the RRM2 of MRN1, also termed multicopy suppressor of RSC-NHP6 synthetic lethality protein 1, or post-transcriptional regulator of 69 kDa, which is a RNA-binding protein found in yeast. Although its specific biological role remains unclear, MRN1 might be involved in translational regulation. Members in this family contain four copies of conserved RNA recognition motif (RRM), also known as RBD (RNA binding domain) or RNP (ribonucleoprotein domain).


Pssm-ID: 409943 [Multi-domain]  Cd Length: 78  Bit Score: 40.50  E-value: 2.95e-04
                          10        20        30        40        50
                  ....*....|....*....|....*....|....*....|....*....|....*...
gi 109150409  572 STTLWVGQLDKRTTQQDVASLLEEFGPIESINMIPPRGCAYIVMVHRQDAYRALQKLS 629
Cdd:cd12523     3 SRNVYLGNLPESITEEELREDLEKFGPIDQIKIVKEKNIAFVHFLSIANAIKVVTTLP 60
Med15 pfam09606
ARC105 or Med15 subunit of Mediator complex non-fungal; The approx. 70 residue Med15 domain of ...
310-483 3.26e-04

ARC105 or Med15 subunit of Mediator complex non-fungal; The approx. 70 residue Med15 domain of the ARC-Mediator co-activator is a three-helix bundle with marked similarity to the KIX domain. The sterol regulatory element binding protein (SREBP) family of transcription activators use the ARC105 subunit to activate target genes in the regulation of cholesterol and fatty acid homeostasis. In addition, Med15 is a critical transducer of gene activation signals that control early metazoan development.


Pssm-ID: 312941 [Multi-domain]  Cd Length: 732  Bit Score: 45.00  E-value: 3.26e-04
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 109150409   310 PQTPFGYPGDGVQQPAYTQHQSMDQFQPRMMPIQQDTMHHQVPLPPNGQMPGFGLLSAPPPFPPMPQPGMPQPGMAQPGL 389
Cdd:pfam09606  130 PQMPMGGAGFPSQMSRVGRMQPGGQAGGMMQPSSGQPGSGTPNQMGPNGGPGQGQAGGMNGGQQGPMGGQMPPQMGVPGM 209
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 109150409   390 AQPGMAQPTMPQPGMPQPGMP--QPGMAQPGLA---QPGMAQPGMPQPAMPQPAMPQPGMAQPGVSPAP-PVQPTFQSTF 463
Cdd:pfam09606  210 PGPADAGAQMGQQAQANGGMNpqQMGGAPNQVAmqqQQPQQQGQQSQLGMGINQMQQMPQGVGGGAGQGgPGQPMGPPGQ 289
                          170       180
                   ....*....|....*....|
gi 109150409   464 QPQNEPHSQKPHQQEMEVEQ 483
Cdd:pfam09606  290 QPGAMPNVMSIGDQNNYQQQ 309
COG3416 COG3416
Uncharacterized conserved protein, DUF2076 domain [Function unknown];
393-443 3.92e-04

Uncharacterized conserved protein, DUF2076 domain [Function unknown];


Pssm-ID: 442642 [Multi-domain]  Cd Length: 237  Bit Score: 43.47  E-value: 3.92e-04
                          10        20        30        40        50
                  ....*....|....*....|....*....|....*....|....*....|.
gi 109150409  393 GMAQPTMPQPGMPQPGmPQPGMAQPGLAQPGMAQPGMPQPAMPQPAMPQPG 443
Cdd:COG3416    93 GQRPPPAPQPSQPGPQ-QQPAPPSGPWGQAAPQQPGYGQPQYGQPAAGPSG 142
RRM1_Nop4p cd12674
RNA recognition motif 1 (RRM1) found in yeast nucleolar protein 4 (Nop4p) and similar proteins; ...
573-645 4.04e-04

RNA recognition motif 1 (RRM1) found in yeast nucleolar protein 4 (Nop4p) and similar proteins; This subgroup corresponds to the RRM1 of Nop4p (also known as Nop77p), encoded by YPL043W from Saccharomyces cerevisiae. It is an essential nucleolar protein involved in processing and maturation of 27S pre-rRNA and biogenesis of 60S ribosomal subunits. Nop4p has four RNA recognition motifs (RRMs), also termed RBDs (RNA binding domains) or RNPs (ribonucleoprotein domains).


Pssm-ID: 410075 [Multi-domain]  Cd Length: 80  Bit Score: 40.14  E-value: 4.04e-04
                          10        20        30        40        50        60        70
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....
gi 109150409  573 TTLWVGQLDKRTTQQDVASLLEEFGPIESINMI------PPRGCAYIVMVHRQDAYRALQKLSrgNYKVNQKSIKIAWA 645
Cdd:cd12674     1 TTLFVRNLPFDVTLESLTDFFSDIGPVKHAVVVtdpetkKSRGYGFVSFSTHDDAEEALAKLK--NRKLSGHILKLDFA 77
RRM1_MRN1 cd12520
RNA recognition motif 1 (RRM1) found in RNA-binding protein MRN1 and similar proteins; This ...
572-644 4.34e-04

RNA recognition motif 1 (RRM1) found in RNA-binding protein MRN1 and similar proteins; This subgroup corresponds to the RRM1 of MRN1, also termed multicopy suppressor of RSC-NHP6 synthetic lethality protein 1, or post-transcriptional regulator of 69 kDa,which is a RNA-binding protein found in yeast. Although its specific biological role remains unclear, MRN1 might be involved in translational regulation. Members in this family contain four copies of conserved RNA recognition motif (RRM), also known as RBD (RNA binding domain) or RNP (ribonucleoprotein domain).


Pssm-ID: 240964 [Multi-domain]  Cd Length: 74  Bit Score: 40.12  E-value: 4.34e-04
                          10        20        30        40        50        60        70
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|...
gi 109150409  572 STTLWVGQLDKRTTQQDVASLLEeFGPIESINMIPPRGCAYIVMVHRQDAYRALQKLSRGNYKVNQKSIKIAW 644
Cdd:cd12520     1 SRTVYLGNLPPNTTVKELLSHVR-SGPIENVRILPEKNCAFISFLDPSAATAFHSDAILKRLSIKGVELKIGW 72
RRM1_PUB1 cd12614
RNA recognition motif 1 (RRM1) found in yeast nuclear and cytoplasmic polyadenylated ...
575-645 4.62e-04

RNA recognition motif 1 (RRM1) found in yeast nuclear and cytoplasmic polyadenylated RNA-binding protein PUB1 and similar proteins; This subgroup corresponds to the RRM1 of yeast protein PUB1, also termed ARS consensus-binding protein ACBP-60, or poly uridylate-binding protein, or poly(U)-binding protein. PUB1 has been identified as both, a heterogeneous nuclear RNA-binding protein (hnRNP) and a cytoplasmic mRNA-binding protein (mRNP), which may be stably bound to a translationally inactive subpopulation of mRNAs within the cytoplasm. It is distributed in both, the nucleus and the cytoplasm, and binds to poly(A)+ RNA (mRNA or pre-mRNA). Although it is one of the major cellular proteins cross-linked by UV light to polyadenylated RNAs in vivo, PUB1 is nonessential for cell growth in yeast. PUB1 also binds to T-rich single stranded DNA (ssDNA); however, there is no strong evidence implicating PUB1 in the mechanism of DNA replication. PUB1 contains three RNA recognition motifs (RRMs), also termed RBDs (RNA binding domains) or RNPs (ribonucleoprotein domains), and a GAR motif (glycine and arginine rich stretch) that is located between RRM2 and RRM3.


Pssm-ID: 410026 [Multi-domain]  Cd Length: 74  Bit Score: 39.73  E-value: 4.62e-04
                          10        20        30        40        50        60        70
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*..
gi 109150409  575 LWVGQLDKRTTQQDVASLLEEFGPIESINMIPPR---GCAYiVMVH---RQDAYRALQKLsrGNYKVNQKSIKIAWA 645
Cdd:cd12614     1 LYVGNLDPRVTEDLLQEIFAVTGPVENCKIIPDKnskGVNY-GFVEyydRRSAEIAIQTL--NGRQIFGQEIKVNWA 74
RRM1_Hu_like cd12375
RNA recognition motif 1 (RRM1) found in the Hu proteins family, Drosophila sex-lethal (SXL), ...
575-645 4.95e-04

RNA recognition motif 1 (RRM1) found in the Hu proteins family, Drosophila sex-lethal (SXL), and similar proteins; This subfamily corresponds to the RRM1 of Hu proteins and SXL. The Hu proteins family represents a group of RNA-binding proteins involved in diverse biological processes. Since the Hu proteins share high homology with the Drosophila embryonic lethal abnormal vision (ELAV) protein, the Hu family is sometimes referred to as the ELAV family. Drosophila ELAV is exclusively expressed in neurons and is required for the correct differentiation and survival of neurons in flies. The neuronal members of the Hu family include Hu-antigen B (HuB or ELAV-2 or Hel-N1), Hu-antigen C (HuC or ELAV-3 or PLE21), and Hu-antigen D (HuD or ELAV-4), which play important roles in neuronal differentiation, plasticity and memory. HuB is also expressed in gonads. Hu-antigen R (HuR or ELAV-1 or HuA) is ubiquitously expressed Hu family member. It has a variety of biological functions mostly related to the regulation of cellular response to DNA damage and other types of stress. Hu proteins perform their cytoplasmic and nuclear molecular functions by coordinately regulating functionally related mRNAs. In the cytoplasm, Hu proteins recognize and bind to AU-rich RNA elements (AREs) in the 3' untranslated regions (UTRs) of certain target mRNAs, such as GAP-43, vascular epithelial growth factor (VEGF), the glucose transporter GLUT1, eotaxin and c-fos, and stabilize those ARE-containing mRNAs. They also bind and regulate the translation of some target mRNAs, such as neurofilament M, GLUT1, and p27. In the nucleus, Hu proteins function as regulators of polyadenylation and alternative splicing. Each Hu protein contains three RNA recognition motifs (RRMs), also termed RBDs (RNA binding domains) or RNPs (ribonucleoprotein domains). RRM1 and RRM2 may cooperate in binding to an ARE. RRM3 may help to maintain the stability of the RNA-protein complex, and might also bind to poly(A) tails or be involved in protein-protein interactions. This family also includes the sex-lethal protein (SXL) from Drosophila melanogaster. SXL governs sexual differentiation and X chromosome dosage compensation in flies. It induces female-specific alternative splicing of the transformer (tra) pre-mRNA by binding to the tra uridine-rich polypyrimidine tract at the non-sex-specific 3' splice site during the sex-determination process. SXL binds to its own pre-mRNA and promotes female-specific alternative splicing. It contains an N-terminal Gly/Asn-rich domain that may be responsible for the protein-protein interaction, and tandem RRMs that show high preference to bind single-stranded, uridine-rich target RNA transcripts.


Pssm-ID: 409810 [Multi-domain]  Cd Length: 76  Bit Score: 39.70  E-value: 4.95e-04
                          10        20        30        40        50        60        70
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*..
gi 109150409  575 LWVGQLDKRTTQQDVASLLEEFGPIESINMIPPR------GCAYIVMVHRQDAYRALQKLSrgNYKVNQKSIKIAWA 645
Cdd:cd12375     2 LIVNYLPQSMTQEELRSLFGAIGPIESCKLVRDKitgqslGYGFVNYRDPNDARKAINTLN--GLDLENKRLKVSYA 76
RRM1_CELF3_4_5_6 cd12632
RNA recognition motif 1 (RRM1) found in CUGBP Elav-like family member CELF-3, CELF-4, CELF-5, ...
572-624 6.21e-04

RNA recognition motif 1 (RRM1) found in CUGBP Elav-like family member CELF-3, CELF-4, CELF-5, CELF-6 and similar proteins; This subfamily corresponds to the RRM1 of CELF-3, CELF-4, CELF-5, CELF-6, all of which belong to the CUGBP1 and ETR-3-like factors (CELF) or BRUNOL (Bruno-like) family of RNA-binding proteins that display dual nuclear and cytoplasmic localizations and have been implicated in the regulation of pre-mRNA splicing and in the control of mRNA translation and deadenylation. CELF-3, expressed in brain and testis only, is also known as bruno-like protein 1 (BRUNOL-1), or CAG repeat protein 4, or CUG-BP- and ETR-3-like factor 3, or embryonic lethal abnormal vision (ELAV)-type RNA-binding protein 1 (ETR-1), or expanded repeat domain protein CAG/CTG 4, or trinucleotide repeat-containing gene 4 protein (TNRC4). It plays an important role in the pathogenesis of tauopathies. CELF-3 contains three highly conserved RNA recognition motifs (RRMs), also known as RBDs (RNA binding domains) or RNPs (ribonucleoprotein domains): two consecutive RRMs (RRM1 and RRM2) situated in the N-terminal region followed by a linker region and the third RRM (RRM3) close to the C-terminus of the protein.The effect of CELF-3 on tau splicing is mediated mainly by the RNA-binding activity of RRM2. The divergent linker region might mediate the interaction of CELF-3 with other proteins regulating its activity or involved in target recognition. CELF-4, highly expressed throughout the brain and in glandular tissues, moderately expressed in heart, skeletal muscle, and liver, is also known as bruno-like protein 4 (BRUNOL-4), or CUG-BP- and ETR-3-like factor 4. Like CELF-3, CELF-4 also contain three highly conserved RRMs. The splicing activation or repression activity of CELF-4 on some specific substrates is mediated by its RRM1/RRM2. On the other hand, both RRM1 and RRM2 of CELF-4 can activate cardiac troponin T (cTNT) exon 5 inclusion. CELF-5, expressed in brain, is also known as bruno-like protein 5 (BRUNOL-5), or CUG-BP- and ETR-3-like factor 5. Although its biological role remains unclear, CELF-5 shares same domain architecture with CELF-3. CELF-6, strongly expressed in kidney, brain, and testis, is also known as bruno-like protein 6 (BRUNOL-6), or CUG-BP- and ETR-3-like factor 6. It activates exon inclusion of a cardiac troponin T minigene in transient transfection assays in an muscle-specific splicing enhancer (MSE)-dependent manner and can activate inclusion via multiple copies of a single element, MSE2. CELF-6 also promotes skipping of exon 11 of insulin receptor, a known target of CELF activity that is expressed in kidney. In additiona to three highly conserved RRMs, CELF-6 also possesses numerous potential phosphorylation sites, a potential nuclear localization signal (NLS) at the C terminus, and an alanine-rich region within the divergent linker region.


Pssm-ID: 410041 [Multi-domain]  Cd Length: 87  Bit Score: 40.09  E-value: 6.21e-04
                          10        20        30        40        50
                  ....*....|....*....|....*....|....*....|....*....|....*....
gi 109150409  572 STTLWVGQLDKRTTQQDVASLLEEFGPIESINMIPPR------GCAYIVMVHRQDAYRA 624
Cdd:cd12632     5 AIKLFIGQIPRNLEEKDLRPLFEQFGKIYELTVLKDKytgmhkGCAFLTYCARESALKA 63
SF-CC1 TIGR01622
splicing factor, CC1-like family; This model represents a subfamily of RNA splicing factors ...
1059-1171 6.35e-04

splicing factor, CC1-like family; This model represents a subfamily of RNA splicing factors including the Pad-1 protein (N. crassa), CAPER (M. musculus) and CC1.3 (H.sapiens). These proteins are characterized by an N-terminal arginine-rich, low complexity domain followed by three (or in the case of 4 H. sapiens paralogs, two) RNA recognition domains (rrm: pfam00706). These splicing factors are closely related to the U2AF splicing factor family (TIGR01642). A homologous gene from Plasmodium falciparum was identified in the course of the analysis of that genome at TIGR and was included in the seed.


Pssm-ID: 273721 [Multi-domain]  Cd Length: 494  Bit Score: 43.75  E-value: 6.35e-04
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 109150409  1059 RDQERfgRRSFGSRVENDRERYGSRNDDRDNSNRERREWGrRSPDRDRHRD--------LEERSRRSSGHrdrdrdsRDR 1130
Cdd:TIGR01622    3 RDRER--ERLRDSSSAGDRDRRRDKGRERSRDRSRDRERS-RSRRRDRHRDrdyyrgreRRSRSRRPNRR-------YRP 72
                           90       100       110       120
                   ....*....|....*....|....*....|....*....|.
gi 109150409  1131 ESRREKEENRKEKHEVADRAGGnkavEPPLSQVGTIDTVSE 1171
Cdd:TIGR01622   73 REKRRRRGDSYRRRRDDRRSRR----EKPRARDGTPEPLTE 109
RRM4_RBM19_RRM3_MRD1 cd12317
RNA recognition motif 4 (RRM4) found in RNA-binding protein 19 (RBM19) and RNA recognition ...
573-634 6.73e-04

RNA recognition motif 4 (RRM4) found in RNA-binding protein 19 (RBM19) and RNA recognition motif 3 (RRM3) found in multiple RNA-binding domain-containing protein 1 (MRD1); This subfamily corresponds to the RRM4 of RBM19 and the RRM3 of MRD1. RBM19, also termed RNA-binding domain-1 (RBD-1), is a nucleolar protein conserved in eukaryotes involved in ribosome biogenesis by processing rRNA and is essential for preimplantation development. It has a unique domain organization containing 6 conserved RNA recognition motifs (RRMs), also termed RBDs (RNA binding domains) or RNPs (ribonucleoprotein domains). MRD1 is encoded by a novel yeast gene MRD1 (multiple RNA-binding domain). It is well conserved in yeast and its homologues exist in all eukaryotes. MRD1 is present in the nucleolus and the nucleoplasm. It interacts with the 35 S precursor rRNA (pre-rRNA) and U3 small nucleolar RNAs (snoRNAs). MRD1 is essential for the initial processing at the A0-A2 cleavage sites in the 35 S pre-rRNA. MRD1 contains 5 conserved RRMs, which may play an important structural role in organizing specific rRNA processing events.


Pssm-ID: 409756 [Multi-domain]  Cd Length: 72  Bit Score: 39.16  E-value: 6.73e-04
                          10        20        30        40        50        60
                  ....*....|....*....|....*....|....*....|....*....|....*....|..
gi 109150409  573 TTLWVGQLDKRTTQQDVASLLEEFGPIESINMIPPRGCAYIVMVHRQDAYRALQKLSRGNYK 634
Cdd:cd12317     1 TVILVKNLPFGATEEELRELFEKFGTLGRLLLPPSRTIALVEFLEPQDARRAFKKLAYKRFK 62
RRM_SNP1_like cd21615
RNA recognition motif (RRM) found in Saccharomyces cerevisiae U1 small nuclear ...
574-630 7.18e-04

RNA recognition motif (RRM) found in Saccharomyces cerevisiae U1 small nuclear ribonucleoprotein SNP1 and similar proteins; SNP1, also called U1 snRNP protein SNP1, or U1 small nuclear ribonucleoprotein 70 kDa homolog, or U1 70K, or U1 snRNP 70 kDa homolog, interacts with mRNA and is involved in nuclear mRNA splicing. It is a component of the spliceosome, where it is associated with snRNP U1 by binding stem loop I of U1 snRNA. Members in this family contain an N-terminal U1snRNP70 domain and an RNA recognition motif (RRM), also called RBD (RNA binding domain) or RNP (ribonucleoprotein domain).


Pssm-ID: 410194 [Multi-domain]  Cd Length: 118  Bit Score: 40.76  E-value: 7.18e-04
                          10        20        30        40        50        60
                  ....*....|....*....|....*....|....*....|....*....|....*....|...
gi 109150409  574 TLWVGQLDKRTTQQDVASLLEEFGPIESINMI------PPRGCAYIVMVHRQDAYRALQKLSR 630
Cdd:cd21615    20 TLFVGRLDYSLTELELQKKFSKFGEIEKIRIVrdketgKSRGYAFIVFKSESDAKNAFKEGNG 82
PAT1 pfam09770
Topoisomerase II-associated protein PAT1; Members of this family are necessary for accurate ...
395-486 8.22e-04

Topoisomerase II-associated protein PAT1; Members of this family are necessary for accurate chromosome transmission during cell division.


Pssm-ID: 401645 [Multi-domain]  Cd Length: 846  Bit Score: 43.87  E-value: 8.22e-04
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 109150409   395 AQPTMPQPGMPQPGMPQPGMAQPGLAQPGMAQPGMPQPAMPQPAMPQPGMAQPGVSPAPPVQPTFQSTFQPQNEPHSQkP 474
Cdd:pfam09770  205 AQAKKPAQQPAPAPAQPPAAPPAQQAQQQQQFPPQIQQQQQPQQQPQQPQQHPGQGHPVTILQRPQSPQPDPAQPSIQ-P 283
                           90
                   ....*....|..
gi 109150409   475 HQQEMEVEQPCV 486
Cdd:pfam09770  284 QAQQFHQQPPPV 295
Glutenin_hmw pfam03157
High molecular weight glutenin subunit; Members of this family include high molecular weight ...
309-484 8.80e-04

High molecular weight glutenin subunit; Members of this family include high molecular weight subunits of glutenin. This group of gluten proteins is thought to be largely responsible for the elastic properties of gluten, and hence, doughs. Indeed, glutenin high molecular weight subunits are classified as elastomeric proteins, because the glutenin network can withstand significant deformations without breaking, and return to the original conformation when the stress is removed. Elastomeric proteins differ considerably in amino acid sequence, but they are all polymers whose subunits consist of elastomeric domains, composed of repeated motifs, and non-elastic domains that mediate cross-linking between the subunits. The elastomeric domain motifs are all rich in glycine residues in addition to other hydrophobic residues. High molecular weight glutenin subunits have an extensive central elastomeric domain, flanked by two terminal non-elastic domains that form disulphide cross-links. The central elastomeric domain is characterized by the following three repeated motifs: PGQGQQ, GYYPTS[P/L]QQ, GQQ. It possesses overlapping beta-turns within and between the repeated motifs, and assumes a regular helical secondary structure with a diameter of approx. 1.9 nm and a pitch of approx. 1.5 nm.


Pssm-ID: 367362 [Multi-domain]  Cd Length: 786  Bit Score: 43.78  E-value: 8.80e-04
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 109150409   309 PPQTPFGYPGDGVQQPAYTQHQSMDQFQPRMMPIQQDTMHHQVPLPPNGQMPGFGLLSAPPPFPPMPQPGMPQPGMAQPG 388
Cdd:pfam03157  254 LGQGQQGYYPISPQQPRQWQQSGQGQQGYYPTSLQQPGQGQSGYYPTSQQQAGQLQQEQQLGQEQQDQQPGQGRQGQQPG 333
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 109150409   389 LAQPGMAQPTMPQPGMPQPGM-----PQPGMAQPG-----------LAQPGMAQPGMPQPAMPQPAMP----QPGMAQPG 448
Cdd:pfam03157  334 QGQQGQQPAQGQQPGQGQPGYyptspQQPGQGQPGyyptsqqqpqqGQQPEQGQQGQQQGQGQQGQQPgqgqQPGQGQPG 413
                          170       180       190
                   ....*....|....*....|....*....|....*.
gi 109150409   449 VSPAPPVQPTFQSTFQPQNEPHSQKPHQQEMEVEQP 484
Cdd:pfam03157  414 YYPTSPQQSGQGQPGYYPTSPQQSGQGQQPGQGQQP 449
PABP-1234 TIGR01628
polyadenylate binding protein, human types 1, 2, 3, 4 family; These eukaryotic proteins ...
391-474 9.25e-04

polyadenylate binding protein, human types 1, 2, 3, 4 family; These eukaryotic proteins recognize the poly-A of mRNA and consists of four tandem RNA recognition domains at the N-terminus (rrm: pfam00076) followed by a PABP-specific domain (pfam00658) at the C-terminus. The protein is involved in the transport of mRNA's from the nucleus to the cytoplasm. There are four paralogs in Homo sapiens which are expressed in testis, platelets, broadly expressed and of unknown tissue range.


Pssm-ID: 130689 [Multi-domain]  Cd Length: 562  Bit Score: 43.26  E-value: 9.25e-04
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 109150409   391 QPGMAQPTMPQPGMPQPGMPQPGMAQPGLAQPGmAQPGMPQPAMPQPAMPqPGMAQPGVSPAPPVQPtfqsTFQPQNEPH 470
Cdd:TIGR01628  412 QPLGWPRMSMMPTPMGPGGPLRPNGLAPMNAVR-APSRNAQNAAQKPPMQ-PVMYPPNYQSLPLSQD----LPQPQSTAS 485

                   ....
gi 109150409   471 SQKP 474
Cdd:TIGR01628  486 QGGQ 489
RRM2_Bruno_like cd12636
RNA recognition motif 2 (RRM2) found in Drosophila melanogaster Bruno protein and similar ...
575-628 9.88e-04

RNA recognition motif 2 (RRM2) found in Drosophila melanogaster Bruno protein and similar proteins; This subgroup corresponds to the RRM2 of Bruno, a Drosophila RNA recognition motif (RRM)-containing protein that plays a central role in regulation of Oskar (Osk) expression. It mediates repression by binding to regulatory Bruno response elements (BREs) in the Osk mRNA 3' UTR. The full-length Bruno protein contains three RRMs, two located in the N-terminal half of the protein and the third near the C-terminus, separated by a linker region.


Pssm-ID: 410044 [Multi-domain]  Cd Length: 81  Bit Score: 39.09  E-value: 9.88e-04
                          10        20        30        40        50
                  ....*....|....*....|....*....|....*....|....*....|....*....
gi 109150409  575 LWVGQLDKRTTQQDVASLLEEFGPIESINMI-----PPRGCAYIVMVHRQDAYRALQKL 628
Cdd:cd12636     4 LFVGMLSKKCNESDVRIMFSPYGSIEECTVLrdqngKSRGCAFVTFTSRQCAVNAIKAM 62
RRM1_I_PABPs cd12378
RNA recognition motif 1 (RRM1) found in type I polyadenylate-binding proteins; This subfamily ...
575-645 1.09e-03

RNA recognition motif 1 (RRM1) found in type I polyadenylate-binding proteins; This subfamily corresponds to the RRM1 of type I poly(A)-binding proteins (PABPs), highly conserved proteins that bind to the poly(A) tail present at the 3' ends of most eukaryotic mRNAs. They have been implicated in the regulation of poly(A) tail length during the polyadenylation reaction, translation initiation, mRNA stabilization by influencing the rate of deadenylation and inhibition of mRNA decapping. The family represents type I polyadenylate-binding proteins (PABPs), including polyadenylate-binding protein 1 (PABP-1 or PABPC1), polyadenylate-binding protein 3 (PABP-3 or PABPC3), polyadenylate-binding protein 4 (PABP-4 or APP-1 or iPABP), polyadenylate-binding protein 5 (PABP-5 or PABPC5), polyadenylate-binding protein 1-like (PABP-1-like or PABPC1L), polyadenylate-binding protein 1-like 2 (PABPC1L2 or RBM32), polyadenylate-binding protein 4-like (PABP-4-like or PABPC4L), yeast polyadenylate-binding protein, cytoplasmic and nuclear (PABP or ACBP-67), and similar proteins. PABP-1 is a ubiquitously expressed multifunctional protein that may play a role in 3' end formation of mRNA, translation initiation, mRNA stabilization, protection of poly(A) from nuclease activity, mRNA deadenylation, inhibition of mRNA decapping, and mRNP maturation. Although PABP-1 is thought to be a cytoplasmic protein, it is also found in the nucleus. PABP-1 may be involved in nucleocytoplasmic trafficking and utilization of mRNP particles. PABP-1 contains four copies of RNA recognition motifs (RRMs), also termed RBDs (RNA binding domains) or RNPs (ribonucleoprotein domains), a less well conserved linker region, and a proline-rich C-terminal conserved domain (CTD). PABP-3 is a testis-specific poly(A)-binding protein specifically expressed in round spermatids. It is mainly found in mammalian and may play an important role in the testis-specific regulation of mRNA homeostasis. PABP-3 shows significant sequence similarity to PABP-1. However, it binds to poly(A) with a lower affinity than PABP-1. Moreover, PABP-1 possesses an A-rich sequence in its 5'-UTR and allows binding of PABP and blockage of translation of its own mRNA. In contrast, PABP-3 lacks the A-rich sequence in its 5'-UTR. PABP-4 is an inducible poly(A)-binding protein (iPABP) that is primarily localized to the cytoplasm. It shows significant sequence similarity to PABP-1 as well. The RNA binding properties of PABP-1 and PABP-4 appear to be identical. PABP-5 is encoded by PABPC5 gene within the X-specific subinterval, and expressed in fetal brain and in a range of adult tissues in mammals, such as ovary and testis. It may play an important role in germ cell development. Moreover, unlike other PABPs, PABP-5 contains only four RRMs, but lacks both the linker region and the CTD. PABP-1-like and PABP-1-like 2 are the orthologs of PABP-1. PABP-4-like is the ortholog of PABP-5. Their cellular functions remain unclear. The family also includes yeast PABP, a conserved poly(A) binding protein containing poly(A) tails that can be attached to the 3'-ends of mRNAs. The yeast PABP and its homologs may play important roles in the initiation of translation and in mRNA decay. Like vertebrate PABP-1, the yeast PABP contains four RRMs, a linker region, and a proline-rich CTD as well. The first two RRMs are mainly responsible for specific binding to poly(A). The proline-rich region may be involved in protein-protein interactions.


Pssm-ID: 409812 [Multi-domain]  Cd Length: 80  Bit Score: 39.15  E-value: 1.09e-03
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 109150409  575 LWVGQLDKRTTQqdvASLLEEF---GPIESI----NMIPPR--GCAYIVMVHRQDAYRALQKLsrgNYK-VNQKSIKIAW 644
Cdd:cd12378     2 LYVGDLHPDVTE---AMLYEKFspaGPVLSIrvcrDAVTRRslGYAYVNFQQPADAERALDTL---NFDvIKGKPIRIMW 75

                  .
gi 109150409  645 A 645
Cdd:cd12378    76 S 76
Glutenin_hmw pfam03157
High molecular weight glutenin subunit; Members of this family include high molecular weight ...
322-476 1.31e-03

High molecular weight glutenin subunit; Members of this family include high molecular weight subunits of glutenin. This group of gluten proteins is thought to be largely responsible for the elastic properties of gluten, and hence, doughs. Indeed, glutenin high molecular weight subunits are classified as elastomeric proteins, because the glutenin network can withstand significant deformations without breaking, and return to the original conformation when the stress is removed. Elastomeric proteins differ considerably in amino acid sequence, but they are all polymers whose subunits consist of elastomeric domains, composed of repeated motifs, and non-elastic domains that mediate cross-linking between the subunits. The elastomeric domain motifs are all rich in glycine residues in addition to other hydrophobic residues. High molecular weight glutenin subunits have an extensive central elastomeric domain, flanked by two terminal non-elastic domains that form disulphide cross-links. The central elastomeric domain is characterized by the following three repeated motifs: PGQGQQ, GYYPTS[P/L]QQ, GQQ. It possesses overlapping beta-turns within and between the repeated motifs, and assumes a regular helical secondary structure with a diameter of approx. 1.9 nm and a pitch of approx. 1.5 nm.


Pssm-ID: 367362 [Multi-domain]  Cd Length: 786  Bit Score: 43.01  E-value: 1.31e-03
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 109150409   322 QQPAYTQHQSMDQFQPRMMPiQQDTMHHQVPLPPNGQMPGFGLLSAPPPFPPMPQPGMPQPGMAQPGLAQPGMAQPTMPQ 401
Cdd:pfam03157  441 QQPGQGQQPGQEQPGQGQQP-GQGQQGQQPGQPEQGQQPGQGQPGYYPTSPQQSGQGQQLGQWQQQGQGQPGYYPTSPLQ 519
                           90       100       110       120       130       140       150
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*
gi 109150409   402 PGMPQPGMPQPGMAQPGLAQpgmaQPGMPQPAMPQPAMPQPGMAQPGVSPAPPVQPTfqstfQPQNEPHSQKPHQ 476
Cdd:pfam03157  520 PGQGQPGYYPTSPQQPGQGQ----QLGQLQQPTQGQQGQQSGQGQQGQQPGQGQQGQ-----QPGQGQQGQQPGQ 585
RRM_cwf2 cd12360
RNA recognition motif (RRM) found in yeast pre-mRNA-splicing factor Cwc2 and similar proteins; ...
574-621 1.47e-03

RNA recognition motif (RRM) found in yeast pre-mRNA-splicing factor Cwc2 and similar proteins; This subfamily corresponds to the RRM of yeast protein Cwc2, also termed Complexed with CEF1 protein 2, or PRP19-associated complex protein 40 (Ntc40), or synthetic lethal with CLF1 protein 3, one of the components of the Prp19-associated complex [nineteen complex (NTC)] that can bind to RNA. NTC is composed of the scaffold protein Prp19 and a number of associated splicing factors, and plays a crucial role in intron removal during premature mRNA splicing in eukaryotes. Cwc2 functions as an RNA-binding protein that can bind both small nuclear RNAs (snRNAs) and pre-mRNA in vitro. It interacts directly with the U6 snRNA to link the NTC to the spliceosome during pre-mRNA splicing. In the N-terminal half, Cwc2 contains a CCCH-type zinc finger (ZnF domain), a RNA recognition motif (RRM), also termed RBD (RNA binding domain) or RNP (ribonucleoprotein domain), and an intervening loop, also termed RNA-binding loop or RB loop, between ZnF and RRM, all of which are necessary and sufficient for RNA binding. The ZnF is also responsible for mediating protein-protein interaction. The C-terminal flexible region of Cwc2 interacts with the WD40 domain of Prp19.


Pssm-ID: 409795 [Multi-domain]  Cd Length: 79  Bit Score: 38.78  E-value: 1.47e-03
                          10        20        30        40        50
                  ....*....|....*....|....*....|....*....|....*....|...
gi 109150409  574 TLWVGQLDKRTTQ-QDVASLLE----EFGPIESINMIPPRGCAYIVMVHRQDA 621
Cdd:cd12360     3 TLYVGGIKAASNKlAQIEEILRrhfgEWGEIERIRVLPSKGIAFVRYKNRANA 55
PRK10263 PRK10263
DNA translocase FtsK; Provisional
388-492 1.49e-03

DNA translocase FtsK; Provisional


Pssm-ID: 236669 [Multi-domain]  Cd Length: 1355  Bit Score: 43.15  E-value: 1.49e-03
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 109150409  388 GLAQPGMAQPTMPQPGMPQPGMPQPGMAQPglAQPGMAQPGMPQPAMPQPAMPQPGMAQPGVSPAP-------PVQPTFQ 460
Cdd:PRK10263  739 GPHEPLFTPIVEPVQQPQQPVAPQQQYQQP--QQPVAPQPQYQQPQQPVAPQPQYQQPQQPVAPQPqyqqpqqPVAPQPQ 816
                          90       100       110
                  ....*....|....*....|....*....|..
gi 109150409  461 STfQPQNEPHSQKPHQQEMEVEQPCVTEVKRH 492
Cdd:PRK10263  817 YQ-QPQQPVAPQPQYQQPQQPVAPQPQDTLLH 847
PRK07764 PRK07764
DNA polymerase III subunits gamma and tau; Validated
385-484 1.89e-03

DNA polymerase III subunits gamma and tau; Validated


Pssm-ID: 236090 [Multi-domain]  Cd Length: 824  Bit Score: 42.67  E-value: 1.89e-03
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 109150409  385 AQPGLAQPGMAQPTmPQPGMPQPGmPQPGMAQPGLAQPGMAQPGMPQPAMPQPAMPQPGMAQPGVSPAPPVQPTFQSTFQ 464
Cdd:PRK07764  392 GAPAAAAPSAAAAA-PAAAPAPAA-AAPAAAAAPAPAAAPQPAPAPAPAPAPPSPAGNAPAGGAPSPPPAAAPSAQPAPA 469
                          90       100
                  ....*....|....*....|
gi 109150409  465 PQNEPHSQKPHQQEMEVEQP 484
Cdd:PRK07764  470 PAAAPEPTAAPAPAPPAAPA 489
RRM2_SART3 cd12392
RNA recognition motif 2 (RRM2) found in squamous cell carcinoma antigen recognized by T-cells ...
574-643 1.98e-03

RNA recognition motif 2 (RRM2) found in squamous cell carcinoma antigen recognized by T-cells 3 (SART3) and similar proteins; This subfamily corresponds to the RRM2 of SART3, also termed Tat-interacting protein of 110 kDa (Tip110), is an RNA-binding protein expressed in the nucleus of the majority of proliferating cells, including normal cells and malignant cells, but not in normal tissues except for the testes and fetal liver. It is involved in the regulation of mRNA splicing probably via its complex formation with RNA-binding protein with a serine-rich domain (RNPS1), a pre-mRNA-splicing factor. SART3 has also been identified as a nuclear Tat-interacting protein that regulates Tat transactivation activity through direct interaction and functions as an important cellular factor for HIV-1 gene expression and viral replication. In addition, SART3 is required for U6 snRNP targeting to Cajal bodies. It binds specifically and directly to the U6 snRNA, interacts transiently with the U6 and U4/U6 snRNPs, and promotes the reassembly of U4/U6 snRNPs after splicing in vitro. SART3 contains an N-terminal half-a-tetratricopeptide repeat (HAT)-rich domain, a nuclearlocalization signal (NLS) domain, and two C-terminal RNA recognition motifs (RRMs), also termed RBDs (RNA binding domains) or RNPs (ribonucleoprotein domains).


Pssm-ID: 409826 [Multi-domain]  Cd Length: 81  Bit Score: 38.47  E-value: 1.98e-03
                          10        20        30        40        50        60        70
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*
gi 109150409  574 TLWVGQLDKRTTQQDVASLLEEFGPIESINMIP-----PRGCAYIVMVHRQDAYRALQKLSrgNYKVNQKSIKIA 643
Cdd:cd12392     4 KLFVKGLPFSCTKEELEELFKQHGTVKDVRLVTyrngkPKGLAYVEYENEADASQAVLKTD--GTEIKDHTISVA 76
PRK07764 PRK07764
DNA polymerase III subunits gamma and tau; Validated
387-469 2.04e-03

DNA polymerase III subunits gamma and tau; Validated


Pssm-ID: 236090 [Multi-domain]  Cd Length: 824  Bit Score: 42.28  E-value: 2.04e-03
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 109150409  387 PGLAQPGMAQPTMPQPGMPQPGMPQPGMAQPGLAQPGMAQPGMPQPAMPQPAMPQPGMAQPGVSPAPPVQPTFQSTFQPQ 466
Cdd:PRK07764  412 PAAAAPAAAAAPAPAAAPQPAPAPAPAPAPPSPAGNAPAGGAPSPPPAAAPSAQPAPAPAAAPEPTAAPAPAPPAAPAPA 491

                  ...
gi 109150409  467 NEP 469
Cdd:PRK07764  492 AAP 494
RRM1_AtRSp31_like cd12234
RNA recognition motif (RRM) found in Arabidopsis thaliana arginine/serine-rich-splicing factor ...
575-645 2.26e-03

RNA recognition motif (RRM) found in Arabidopsis thaliana arginine/serine-rich-splicing factor RSp31 and similar proteins from plants; This subfamily corresponds to the RRM1in a family that represents a novel group of arginine/serine (RS) or serine/arginine (SR) splicing factors existing in plants, such as A. thaliana RSp31, RSp35, RSp41 and similar proteins. Like vertebrate RS splicing factors, these proteins function as plant splicing factors and play crucial roles in constitutive and alternative splicing in plants. They all contain two RNA recognition motifs (RRMs), also termed RBDs (RNA binding domains) or RNPs (ribonucleoprotein domains), at their N-terminus, and an RS domain at their C-terminus.


Pssm-ID: 409680 [Multi-domain]  Cd Length: 72  Bit Score: 37.90  E-value: 2.26e-03
                          10        20        30        40        50        60        70
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|.
gi 109150409  575 LWVGQLDKRTTQQDVASLLEEFGPIESINMipPRGCAYIVMVHRQDAYRALQKLSRGNYKVNQKSIKIAWA 645
Cdd:cd12234     3 VFCGNFEYDARQSEIERLFGKYGRVDRVDM--KSGYAFVYMEDERDAEDAIRGLDNFEFGRQRRRLRVEWT 71
Cytadhesin_P30 pfam07271
Cytadhesin P30/P32; This family consists of several Mycoplasma species specific Cytadhesin P32 ...
385-474 2.30e-03

Cytadhesin P30/P32; This family consists of several Mycoplasma species specific Cytadhesin P32 and P30 proteins. P30 has been found to be membrane associated and localized on the tip organelle. It is thought that it is important in cytadherence and virulence. The N-terminus contains two predicted transmembrane helices followed by a long region of a short 6 residue proline rich repeat.


Pssm-ID: 429374 [Multi-domain]  Cd Length: 275  Bit Score: 41.49  E-value: 2.30e-03
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 109150409   385 AQPGLAQPGMAQPTMPQPGMPQPGM-----PQPGMAQPGLAQPGMA-QPGMPQ-PAMPQPAM-----PQPGMAQPGVSPA 452
Cdd:pfam07271  133 TAEPTQPAGVNVANQPQMGINQPQInpqfgPNPQQRIGFPMQPNMGmRPGFNQmPGMPPNQMrpgfnQMPGMPPRPGFPN 212
                           90       100
                   ....*....|....*....|..
gi 109150409   453 PPVQPTFQSTFQPQNEPHSQKP 474
Cdd:pfam07271  213 PMPNMQPRPGFRPQPGPMGNRP 234
CID_RPRD1B cd17012
CID (CTD-Interacting Domain) of Regulation of nuclear pre-mRNA domain-containing protein 1B; ...
41-131 2.31e-03

CID (CTD-Interacting Domain) of Regulation of nuclear pre-mRNA domain-containing protein 1B; Regulation of nuclear pre-mRNA domain-containing protein 1B (RPRD1B) is also called Cell cycle-related and expression-elevated protein in tumor (CREPT). RPRD1B is a CID (CTD-Interacting Domain) containing protein that co-purifies with RNA polymerase (Pol) II (RNAP II) and three other RNAP II-associated proteins, RPAP2, GRINL1A and RECQL5, but not with the Mediator complex. CID binds tightly to the carboxy-terminal domain (CTD) of RNAP II. During transcription, RNAP II synthesizes eukaryotic messenger RNA. Transcription is coupled to RNA processing through the CTD, which consists of up to 52 repeats of the sequence Tyr1-Ser2-Pro3-Thr4-Ser5-Pro6-Ser7. RPRD1B form homodimers and heterodimers with RPRD1A through their coiled-coil domains. Both RPRD1A and RPRD1B associate directly with RPAP2 phosphatase and serve as CTD scaffolds to coordinate the dephosphorylation of phospho-S5 by RPAP2. RPRD1B is highly expressed during tumorigenesis and in endometrial cancer, has been shown to promote tumor growth by accelerating the cell cycle. CID contains eight alpha-helices in a right-handed superhelical arrangement, which closely resembles that of the VHS domains and ARM (Armadillo) repeat proteins, except for its two amino-terminal helices.


Pssm-ID: 340809  Cd Length: 129  Bit Score: 39.60  E-value: 2.31e-03
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 109150409   41 KHVVQIV---EKFIKKCKPEYKVPGLYVIDSIVRQSRHQfgtdkdvfGPRFSKNITATFQYLYLCPSED-----KSKIVR 112
Cdd:cd17012    33 KHAGPIVsvwHRELRKAKSSRKLTFLYLANDVIQNSKRK--------GPEFTREFESVLVDAFSHVAREadegcKKPLER 104
                          90
                  ....*....|....*....
gi 109150409  113 VLNLWQKNGVFKIEIIQPL 131
Cdd:cd17012   105 LLNIWQERSVYGGDFIQQL 123
RRM_Aly_REF_like cd12418
RNA recognition motif (RRM) found in the Aly/REF family; This subfamily corresponds to the RRM ...
573-628 2.62e-03

RNA recognition motif (RRM) found in the Aly/REF family; This subfamily corresponds to the RRM of Aly/REF family which includes THO complex subunit 4 (THOC4, also termed Aly/REF), S6K1 Aly/REF-like target (SKAR, also termed PDIP3 or PDIP46) and similar proteins. THOC4 is an mRNA transporter protein with a well conserved RNA recognition motif (RRM), also termed RBD (RNA binding domain) or RNP (ribonucleoprotein domain). It is involved in RNA transportation from the nucleus, and was initially identified as a transcription coactivator of LEF-1 and AML-1 for the TCRalpha enhancer function. In addition, THOC4 specifically binds to rhesus (RH) promoter in erythroid, and might be a novel transcription cofactor for erythroid-specific genes. SKAR shows high sequence homology with THOC4 and possesses one RRM as well. SKAR is widely expressed and localizes to the nucleus. It may be a critical player in the function of S6K1 in cell and organism growth control by binding the activated, hyperphosphorylated form of S6K1 but not S6K2. Furthermore, SKAR functions as a protein partner of the p50 subunit of DNA polymerase delta. In addition, SKAR may have particular importance in pancreatic beta cell size determination and insulin secretion.


Pssm-ID: 409852 [Multi-domain]  Cd Length: 75  Bit Score: 37.94  E-value: 2.62e-03
                          10        20        30        40        50        60
                  ....*....|....*....|....*....|....*....|....*....|....*....|.
gi 109150409  573 TTLWVGQLDKRTTQQDVASLLEEFGPIESINMIPPR-----GCAYIVMVHRQDAYRALQKL 628
Cdd:cd12418     1 TRVRVSNLHPDVTEEDLRELFGRVGPVKSVKINYDRsgrstGTAYVVFERPEDAEKAIKQF 61
Atrophin-1 pfam03154
Atrophin-1 family; Atrophin-1 is the protein product of the dentatorubral-pallidoluysian ...
304-484 2.81e-03

Atrophin-1 family; Atrophin-1 is the protein product of the dentatorubral-pallidoluysian atrophy (DRPLA) gene. DRPLA OMIM:125370 is a progressive neurodegenerative disorder. It is caused by the expansion of a CAG repeat in the DRPLA gene on chromosome 12p. This results in an extended polyglutamine region in atrophin-1, that is thought to confer toxicity to the protein, possibly through altering its interactions with other proteins. The expansion of a CAG repeat is also the underlying defect in six other neurodegenerative disorders, including Huntington's disease. One interaction of expanded polyglutamine repeats that is thought to be pathogenic is that with the short glutamine repeat in the transcriptional coactivator CREB binding protein, CBP. This interaction draws CBP away from its usual nuclear location to the expanded polyglutamine repeat protein aggregates that are characteriztic of the polyglutamine neurodegenerative disorders. This interferes with CBP-mediated transcription and causes cytotoxicity.


Pssm-ID: 460830 [Multi-domain]  Cd Length: 991  Bit Score: 42.06  E-value: 2.81e-03
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 109150409   304 ATSPPPPQTPFGYPgdgvqQPAYTQHQSMDQFQPRMMP---IQQDTMHHQVPLP-PNGQMPGFGLLSAPPPFPPMPQPGM 379
Cdd:pfam03154  195 ATAGPTPSAPSVPP-----QGSPATSQPPNQTQSTAAPhtlIQQTPTLHPQRLPsPHPPLQPMTQPPPPSQVSPQPLPQP 269
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 109150409   380 PQPGMAQPGLAQPGMAQPTMPQPGMPQP-----------GMPQPGMAQPGLAQ------PGMAQPGMPQPAMPQPAMPQP 442
Cdd:pfam03154  270 SLHGQMPPMPHSLQTGPSHMQHPVPPQPfpltpqssqsqVPPGPSPAAPGQSQqrihtpPSQSQLQSQQPPREQPLPPAP 349
                          170       180       190       200
                   ....*....|....*....|....*....|....*....|....*.
gi 109150409   443 gMAQPGVSPAP----PVQPTFQSTFQPqnePHSQKPHQQEMEVEQP 484
Cdd:pfam03154  350 -LSMPHIKPPPttpiPQLPNPQSHKHP---PHLSGPSPFQMNSNLP 391
SF-CC1 TIGR01622
splicing factor, CC1-like family; This model represents a subfamily of RNA splicing factors ...
1048-1116 3.02e-03

splicing factor, CC1-like family; This model represents a subfamily of RNA splicing factors including the Pad-1 protein (N. crassa), CAPER (M. musculus) and CC1.3 (H.sapiens). These proteins are characterized by an N-terminal arginine-rich, low complexity domain followed by three (or in the case of 4 H. sapiens paralogs, two) RNA recognition domains (rrm: pfam00706). These splicing factors are closely related to the U2AF splicing factor family (TIGR01642). A homologous gene from Plasmodium falciparum was identified in the course of the analysis of that genome at TIGR and was included in the seed.


Pssm-ID: 273721 [Multi-domain]  Cd Length: 494  Bit Score: 41.44  E-value: 3.02e-03
                           10        20        30        40        50        60        70
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*..
gi 109150409  1048 RNDSRQQFNSGRDQERFGRRSFGSRVENDRERYGSRNDDRDNSNRERREWG--------RRSPDRDRHRDLEERSRR 1116
Cdd:TIGR01622   18 DRDRRRDKGRERSRDRSRDRERSRSRRRDRHRDRDYYRGRERRSRSRRPNRryrprekrRRRGDSYRRRRDDRRSRR 94
RRM2_FCA cd12637
RNA recognition motif 2 (RRM2) found in plant flowering time control protein FCA and similar ...
575-634 3.06e-03

RNA recognition motif 2 (RRM2) found in plant flowering time control protein FCA and similar proteins; This subgroup corresponds to the RRM2 of FCA, a gene controlling flowering time in Arabidopsis, which encodes a flowering time control protein that functions in the posttranscriptional regulation of transcripts involved in the flowering process. The flowering time control protein FCA contains two RNA recognition motifs (RRMs), also known as RBDs (RNA binding domains) or RNP (ribonucleoprotein domains), and a WW protein interaction domain.


Pssm-ID: 410045 [Multi-domain]  Cd Length: 81  Bit Score: 37.75  E-value: 3.06e-03
                          10        20        30        40        50        60
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*.
gi 109150409  575 LWVGQLDKRTTQQDVASLLEEFGPIESINMIPPR------GCAYIVMVHRQDAYRALQKLSrGNYK 634
Cdd:cd12637     2 LFVGSLPKTATEQEVRDLFEAYGEVEEVYLMKDPvtqqgtGCAFVKFAYKEEALAAIRSLN-GTVT 66
Med15 pfam09606
ARC105 or Med15 subunit of Mediator complex non-fungal; The approx. 70 residue Med15 domain of ...
309-468 3.38e-03

ARC105 or Med15 subunit of Mediator complex non-fungal; The approx. 70 residue Med15 domain of the ARC-Mediator co-activator is a three-helix bundle with marked similarity to the KIX domain. The sterol regulatory element binding protein (SREBP) family of transcription activators use the ARC105 subunit to activate target genes in the regulation of cholesterol and fatty acid homeostasis. In addition, Med15 is a critical transducer of gene activation signals that control early metazoan development.


Pssm-ID: 312941 [Multi-domain]  Cd Length: 732  Bit Score: 41.53  E-value: 3.38e-03
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 109150409   309 PPQTPFGYPGDGVQQPAYTQHQSMDQFQPRMMPIQQDTM---HHQVPLPPNGQMPGFGLLSAPPPFPPMPQPGMPQPGMA 385
Cdd:pfam09606  295 PNVMSIGDQNNYQQQQTRQQQQQQGGNHPAAHQQQMNQSvgqGGQVVALGGLNHLETWNPGNFGGLGANPMQRGQPGMMS 374
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 109150409   386 QPGLAQPGMAQPTMPQPGMPQPGMPQ-PGMAQPGLAQPGMAQPGM-PQPA---MPQPAMPQ-------PGMAQPGVSPAP 453
Cdd:pfam09606  375 SPSPVPGQQVRQVTPNQFMRQSPQPSvPSPQGPGSQPPQSHPGGMiPSPAlipSPSPQMSQqpaqqrtIGQDSPGGSLNT 454
                          170
                   ....*....|....*
gi 109150409   454 PVQPTFQSTFQPQNE 468
Cdd:pfam09606  455 PGQSAVNSPLNPQEE 469
PRK14950 PRK14950
DNA polymerase III subunits gamma and tau; Provisional
389-482 3.40e-03

DNA polymerase III subunits gamma and tau; Provisional


Pssm-ID: 237864 [Multi-domain]  Cd Length: 585  Bit Score: 41.72  E-value: 3.40e-03
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 109150409  389 LAQPGMAQPTMPQPGMPQPGMPQPGMAQ-PGLAQPGMAQPGMPQPAMPQPAMPQPGMAQPGVSPAPPVQPTFQSTFQPQN 467
Cdd:PRK14950  360 LVPVPAPQPAKPTAAAPSPVRPTPAPSTrPKAAAAANIPPKEPVRETATPPPVPPRPVAPPVPHTPESAPKLTRAAIPVD 439
                          90
                  ....*....|....*
gi 109150409  468 EPHSQKPHQQEMEVE 482
Cdd:PRK14950  440 EKPKYTPPAPPKEEE 454
RRM_HP0827_like cd12399
RNA recognition motif (RRM) found in Helicobacter pylori HP0827 protein and similar proteins; ...
575-628 3.51e-03

RNA recognition motif (RRM) found in Helicobacter pylori HP0827 protein and similar proteins; This subfamily corresponds to the RRM of H. pylori HP0827, a putative ssDNA-binding protein 12rnp2 precursor, containing one RNA recognition motif (RRM), also termed RBD (RNA binding domain) or RNP (ribonucleoprotein domain). The ssDNA binding may be important in activation of HP0827.


Pssm-ID: 409833 [Multi-domain]  Cd Length: 75  Bit Score: 37.50  E-value: 3.51e-03
                          10        20        30        40        50        60
                  ....*....|....*....|....*....|....*....|....*....|....*....|
gi 109150409  575 LWVGQLDKRTTQQDVASLLEEFGPIESINMI------PPRGCAYIVMVHRQDAYRALQKL 628
Cdd:cd12399     1 LYVGNLPYSASEEQLKSLFGQFGAVFDVKLPmdretkRPRGFGFVELQEEESAEKAIAKL 60
RRM_SRSF10_SRSF12 cd12312
RNA recognition motif (RRM) found in serine/arginine-rich splicing factor SRSF10, SRSF12 and ...
573-645 3.68e-03

RNA recognition motif (RRM) found in serine/arginine-rich splicing factor SRSF10, SRSF12 and similar proteins; This subfamily corresponds to the RRM of SRSF10 and SRSF12. SRSF10, also termed 40 kDa SR-repressor protein (SRrp40), or FUS-interacting serine-arginine-rich protein 1 (FUSIP1), or splicing factor SRp38, or splicing factor, arginine/serine-rich 13A (SFRS13A), or TLS-associated protein with Ser-Arg repeats (TASR). It is a serine-arginine (SR) protein that acts as a potent and general splicing repressor when dephosphorylated. It mediates global inhibition of splicing both in M phase of the cell cycle and in response to heat shock. SRSF10 emerges as a modulator of cholesterol homeostasis through the regulation of low-density lipoprotein receptor (LDLR) splicing efficiency. It also regulates cardiac-specific alternative splicing of triadin pre-mRNA and is required for proper Ca2+ handling during embryonic heart development. In contrast, the phosphorylated SRSF10 functions as a sequence-specific splicing activator in the presence of a nuclear cofactor. It activates distal alternative 5' splice site of adenovirus E1A pre-mRNA in vivo. Moreover, SRSF10 strengthens pre-mRNA recognition by U1 and U2 snRNPs. SRSF10 localizes to the nuclear speckles and can shuttle between nucleus and cytoplasm. SRSF12, also termed 35 kDa SR repressor protein (SRrp35), or splicing factor, arginine/serine-rich 13B (SFRS13B), or splicing factor, arginine/serine-rich 19 (SFRS19), is a serine/arginine (SR) protein-like alternative splicing regulator that antagonizes authentic SR proteins in the modulation of alternative 5' splice site choice. For instance, it activates distal alternative 5' splice site of the adenovirus E1A pre-mRNA in vivo. Both, SRSF10 and SRSF12, contain a single N-terminal RNA recognition motif (RRM), also termed RBD (RNA binding domain) or RNP (ribonucleoprotein domain), followed by a C-terminal RS domain rich in serine-arginine dipeptides.


Pssm-ID: 240758 [Multi-domain]  Cd Length: 84  Bit Score: 37.74  E-value: 3.68e-03
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 109150409  573 TTLWVGQLDKRTTQQDVASLLEEFGPIESInMIP-------PRGCAYIVMVHRQDAYRALQKLSRGNykVNQKSIKIAWA 645
Cdd:cd12312     1 TSLFVRNVADDTRPDDLRREFGRYGPIVDV-YIPldfytrrPRGFAYIQFEDVRDAEDALYYLDRTR--FLGREIEIQFA 77
PAT1 pfam09770
Topoisomerase II-associated protein PAT1; Members of this family are necessary for accurate ...
391-477 3.95e-03

Topoisomerase II-associated protein PAT1; Members of this family are necessary for accurate chromosome transmission during cell division.


Pssm-ID: 401645 [Multi-domain]  Cd Length: 846  Bit Score: 41.56  E-value: 3.95e-03
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 109150409   391 QPGMAQPTMPQPGMPQPGMPQPGMAQPGLAQPGMAQ-PGMPQPAMPQPAMPQPGMAQPGVSPAPPVQPT----------- 458
Cdd:pfam09770  232 QQQQFPPQIQQQQQPQQQPQQPQQHPGQGHPVTILQrPQSPQPDPAQPSIQPQAQQFHQQPPPVPVQPTqilqnpnrlsa 311
                           90       100
                   ....*....|....*....|.
gi 109150409   459 --FQSTFQPQNEPHSQKPHQQ 477
Cdd:pfam09770  312 arVGYPQNPQPGVQPAPAHQA 332
Cytadhesin_P30 pfam07271
Cytadhesin P30/P32; This family consists of several Mycoplasma species specific Cytadhesin P32 ...
307-460 4.55e-03

Cytadhesin P30/P32; This family consists of several Mycoplasma species specific Cytadhesin P32 and P30 proteins. P30 has been found to be membrane associated and localized on the tip organelle. It is thought that it is important in cytadherence and virulence. The N-terminus contains two predicted transmembrane helices followed by a long region of a short 6 residue proline rich repeat.


Pssm-ID: 429374 [Multi-domain]  Cd Length: 275  Bit Score: 40.34  E-value: 4.55e-03
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 109150409   307 PPPPQTPFGYPGDGVQQPAYTQHQSMdQFQPRMMPIQQdtmhhqvpLPPNGQMPGFgllsapppfppmpqpgmpqpgMAQ 386
Cdd:pfam07271  152 INQPQINPQFGPNPQQRIGFPMQPNM-GMRPGFNQMPG--------MPPNQMRPGF---------------------NQM 201
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 109150409   387 PGLAQ-PGMAQPT---MPQPGM-PQPG--MPQPGMAQPGLAQPgmAQPG-MPQPAMPQPampqPGMAQPgvSPAPPVQPT 458
Cdd:pfam07271  202 PGMPPrPGFPNPMpnmQPRPGFrPQPGpmGNRPGGGFPHPGTP--MGPNrMPNPGMNQR----PGMAPP--RPGFPPQNG 273

                   ..
gi 109150409   459 FQ 460
Cdd:pfam07271  274 PR 275
PRK07994 PRK07994
DNA polymerase III subunits gamma and tau; Validated
400-488 4.90e-03

DNA polymerase III subunits gamma and tau; Validated


Pssm-ID: 236138 [Multi-domain]  Cd Length: 647  Bit Score: 41.00  E-value: 4.90e-03
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 109150409  400 PQPGMPQPGMPQPGMAQPGLAQPGMAQPGMPQPAMPQPAMPQPGMAQPGVSPAPPVQPTFQSTF--------QPQNEPHS 471
Cdd:PRK07994  361 PAAPLPEPEVPPQSAAPAASAQATAAPTAAVAPPQAPAVPPPPASAPQQAPAVPLPETTSQLLAarqqlqraQGATKAKK 440
                          90
                  ....*....|....*..
gi 109150409  472 QKPHQQEMEVEQPCVTE 488
Cdd:PRK07994  441 SEPAAASRARPVNSALE 457
RRM4_MRN1 cd12522
RNA recognition motif 4 (RRM4) found in RNA-binding protein MRN1 and similar proteins; This ...
572-636 4.92e-03

RNA recognition motif 4 (RRM4) found in RNA-binding protein MRN1 and similar proteins; This subgroup corresponds to the RRM4 of MRN1, also termed multicopy suppressor of RSC-NHP6 synthetic lethality protein 1, or post-transcriptional regulator of 69 kDa, which is a RNA-binding protein found in yeast. Although its specific biological role remains unclear, MRN1 might be involved in translational regulation. Members in this family contain four copies of conserved RNA recognition motif (RRM), also known as RBD (RNA binding domain) or RNP (ribonucleoprotein domain).


Pssm-ID: 409942 [Multi-domain]  Cd Length: 81  Bit Score: 37.12  E-value: 4.92e-03
                          10        20        30        40        50        60        70
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|.
gi 109150409  572 STTLWVGQLDKRT--TQQDVASLLEEFGPIESINMIPPRGCAYIVMVHRQDAYRALQKLSRG----NYKVN 636
Cdd:cd12522     3 SRNVYIGNIDDVRvlTEERLRHDFSQYGEIEQVNFLREKNCAFVNFTNIANAIKAIDKIKSKpyykDLKIN 73
COG3416 COG3416
Uncharacterized conserved protein, DUF2076 domain [Function unknown];
400-450 4.96e-03

Uncharacterized conserved protein, DUF2076 domain [Function unknown];


Pssm-ID: 442642 [Multi-domain]  Cd Length: 237  Bit Score: 40.01  E-value: 4.96e-03
                          10        20        30        40        50
                  ....*....|....*....|....*....|....*....|....*....|.
gi 109150409  400 PQPGmPQPGMPQPGmAQPGLAQPGMAQPGMPQPAMPQPAMPQPGMAQPGVS 450
Cdd:COG3416    96 PPPA-PQPSQPGPQ-QQPAPPSGPWGQAAPQQPGYGQPQYGQPAAGPSGGG 144
RRM1_2_CoAA_like cd12343
RNA recognition motif 1 (RRM1) and 2 (RRM2) found in RRM-containing coactivator activator ...
575-642 5.03e-03

RNA recognition motif 1 (RRM1) and 2 (RRM2) found in RRM-containing coactivator activator/modulator (CoAA) and similar proteins; This subfamily corresponds to the RRM in CoAA (also known as RBM14 or PSP2) and RNA-binding protein 4 (RBM4). CoAA is a heterogeneous nuclear ribonucleoprotein (hnRNP)-like protein identified as a nuclear receptor coactivator. It mediates transcriptional coactivation and RNA splicing effects in a promoter-preferential manner, and is enhanced by thyroid hormone receptor-binding protein (TRBP). CoAA contains two N-terminal RNA recognition motifs (RRMs), also termed RBDs (RNA binding domains) or RNPs (ribonucleoprotein domains), and a TRBP-interacting domain. RBM4 is a ubiquitously expressed splicing factor with two isoforms, RBM4A (also known as Lark homolog) and RBM4B (also known as RBM30), which are very similar in structure and sequence. RBM4 may also function as a translational regulator of stress-associated mRNAs as well as play a role in micro-RNA-mediated gene regulation. RBM4 contains two N-terminal RRMs, a CCHC-type zinc finger, and three alanine-rich regions within their C-terminal regions. This family also includes Drosophila RNA-binding protein lark (Dlark), a homolog of human RBM4. It plays an important role in embryonic development and in the circadian regulation of adult eclosion. Dlark shares high sequence similarity with RBM4 at the N-terminal region. However, Dlark has three proline-rich segments instead of three alanine-rich segments within the C-terminal region.


Pssm-ID: 409779 [Multi-domain]  Cd Length: 66  Bit Score: 36.82  E-value: 5.03e-03
                          10        20        30        40        50        60
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*...
gi 109150409  575 LWVGQLDKRTTQQDVASLLEEFGPIESINMIppRGCAYIVMVHRQDAYRALQKLSrgNYKVNQKSIKI 642
Cdd:cd12343     2 IFVGNLPDAATSEELRALFEKYGKVTECDIV--KNYAFVHMEKEEDAEDAIKALN--GYEFMGSRINV 65
RRM3_Prp24 cd12298
RNA recognition motif 3 in fungal pre-messenger RNA splicing protein 24 (Prp24) and similar ...
580-627 5.14e-03

RNA recognition motif 3 in fungal pre-messenger RNA splicing protein 24 (Prp24) and similar proteins; This subfamily corresponds to the RRM3 of Prp24, also termed U4/U6 snRNA-associated-splicing factor PRP24 (U4/U6 snRNP), an RNA-binding protein with four well conserved RNA recognition motifs (RRMs), also termed RBDs (RNA binding domains) or RNPs (ribonucleoprotein domains). It facilitates U6 RNA base-pairing with U4 RNA during spliceosome assembly. Prp24 specifically binds free U6 RNA primarily with RRMs 1 and 2 and facilitates pairing of U6 RNA bases with U4 RNA bases. Additionally, it may also be involved in dissociation of the U4/U6 complex during spliceosome activation.


Pssm-ID: 409739 [Multi-domain]  Cd Length: 78  Bit Score: 36.86  E-value: 5.14e-03
                          10        20        30        40        50
                  ....*....|....*....|....*....|....*....|....*....|....*...
gi 109150409  580 LDKRTTQQDVASLLEEFGPIESINMiPP----------RGCAYIVMVHRQDAYRALQK 627
Cdd:cd12298     8 LDFELDEEALRGIFEKFGEIESINI-PKkqknrkgrhnNGFAFVTFEDADSAESALQL 64
RRM1_CELF1_2_Bruno cd12631
RNA recognition motif 1 (RRM1) found in CUGBP Elav-like family member CELF-1, CELF-2, ...
575-624 6.19e-03

RNA recognition motif 1 (RRM1) found in CUGBP Elav-like family member CELF-1, CELF-2, Drosophila melanogaster Bruno protein and similar proteins; This subgroup corresponds to the RRM1 of CELF-1, CELF-2 and Bruno protein. CELF-1 (also termed BRUNOL-2, or CUG-BP1, or EDEN-BP) and CELF-2 (also termed BRUNOL-3, or ETR-3, or CUG-BP2, or NAPOR) belong to the CUGBP1 and ETR-3-like factors (CELF) or BRUNOL (Bruno-like) family of RNA-binding proteins that have been implicated in regulation of pre-mRNA splicing, and control of mRNA translation and deadenylation. CELF-1 is strongly expressed in all adult and fetal tissues tested. The human CELF-1 is a nuclear and cytoplasmic RNA-binding protein that regulates multiple aspects of nuclear and cytoplasmic mRNA processing, with implications for onset of type 1 myotonic dystrophy (DM1), a neuromuscular disease associated with an unstable CUG triplet expansion in the 3'-UTR (3'-untranslated region) of the DMPK (myotonic dystrophy protein kinase) gene; it preferentially targets UGU-rich mRNA elements. It has been shown to bind to a Bruno response element, a cis-element involved in translational control of oskar mRNA in Drosophila, and share sequence similarity to Bruno, the Drosophila protein that mediates this process. The Xenopus homolog embryo deadenylation element-binding protein (EDEN-BP) mediates sequence-specific deadenylation of Eg5 mRNA. It binds specifically to the EDEN motif in the 3'-untranslated regions of maternal mRNAs and targets these mRNAs for deadenylation and translational repression. CELF-1 contain three highly conserved RNA recognition motifs (RRMs), also known as RBDs (RNA binding domains) or RNPs (ribonucleoprotein domains): two consecutive RRMs (RRM1 and RRM2) situated in the N-terminal region followed by a linker region and the third RRM (RRM3) close to the C-terminus of the protein. The two N-terminal RRMs of EDEN-BP are necessary for the interaction with EDEN as well as a part of the linker region (between RRM2 and RRM3). Oligomerization of EDEN-BP is required for specific mRNA deadenylation and binding. CELF-2 is expressed in all tissues at some level, but highest in brain, heart, and thymus. It has been implicated in the regulation of nuclear and cytoplasmic RNA processing events, including alternative splicing, RNA editing, stability and translation. CELF-2 shares high sequence identity with CELF-1, but shows different binding specificity; it binds preferentially to sequences with UG repeats and UGUU motifs. It has been shown to bind to a Bruno response element, a cis-element involved in translational control of oskar mRNA in Drosophila, and share sequence similarity to Bruno, the Drosophila protein that mediates this process. It also binds to the 3'-UTR of cyclooxygenase-2 messages, affecting both translation and mRNA stability, and binds to apoB mRNA, regulating its C to U editing. CELF-2 also contains three highly conserved RRMs. It binds to RNA via the first two RRMs, which are also important for localization in the cytoplasm. The splicing activation or repression activity of CELF-2 on some specific substrates is mediated by RRM1/RRM2. Both, RRM1 and RRM2 of CELF-2, can activate cardiac troponin T (cTNT) exon 5 inclusion. In addition, CELF-2 possesses a typical arginine and lysine-rich nuclear localization signal (NLS) in the C-terminus, within RRM3. This subgroup also includes Drosophila melanogaster Bruno protein, which plays a central role in regulation of Oskar (Osk) expression in flies. It mediates repression by binding to regulatory Bruno response elements (BREs) in the Osk mRNA 3' UTR. The full-length Bruno protein contains three RRMs, two located in the N-terminal half of the protein and the third near the C-terminus, separated by a linker region.


Pssm-ID: 410040 [Multi-domain]  Cd Length: 84  Bit Score: 37.11  E-value: 6.19e-03
                          10        20        30        40        50
                  ....*....|....*....|....*....|....*....|....*....|....*...
gi 109150409  575 LWVGQLDKRTTQQDVASLLEEFGPIESINMI------PP--RGCAYIVMVHRQDAYRA 624
Cdd:cd12631     4 MFVGQIPRSWSEKELRELFEQYGAVYQINVLrdrsqnPPqsKGCCFVTFYTRKAALEA 61
RRM2_RBM23_RBM39 cd12284
RNA recognition motif 2 (RRM2) found in vertebrate RNA-binding protein RBM23, RBM39 and ...
575-628 6.99e-03

RNA recognition motif 2 (RRM2) found in vertebrate RNA-binding protein RBM23, RBM39 and similar proteins; This subfamily corresponds to the RRM2 of RBM39 (also termed HCC1), a nuclear autoantigen that contains an N-terminal arginine/serine rich (RS) motif and three RNA recognition motifs (RRMs), also termed RBDs (RNA binding domains) or RNPs (ribonucleoprotein domains). An octapeptide sequence called the RS-ERK motif is repeated six times in the RS region of RBM39. Although the cellular function of RBM23 remains unclear, it shows high sequence homology to RBM39 and contains two RRMs. It may possibly function as a pre-mRNA splicing factor.


Pssm-ID: 409726 [Multi-domain]  Cd Length: 78  Bit Score: 36.84  E-value: 6.99e-03
                          10        20        30        40        50        60
                  ....*....|....*....|....*....|....*....|....*....|....*....|
gi 109150409  575 LWVGQLDKRTTQQDVASLLEEFGPIESINMIP------PRGCAYIVMVHRQDAYRALQKL 628
Cdd:cd12284     1 LYVGSLHFNITEDMLRGIFEPFGKIEFVQLQKdpetgrSKGYGFIQFRDAEDAKKALEQL 60
KLF1_2_4_N cd21972
N-terminal domain of Kruppel-like factor (KLF) 1, KLF2, KLF4, and similar proteins; Kruppel ...
307-457 8.13e-03

N-terminal domain of Kruppel-like factor (KLF) 1, KLF2, KLF4, and similar proteins; Kruppel/Krueppel-like transcription factors (KLFs) belong to a family of proteins called the Specificity Protein (SP)/KLF family, characterized by a C-terminal DNA-binding domain of 81 amino acids consisting of three Kruppel-like C2H2 zinc fingers. These factors bind to a loose consensus motif, namely NNRCRCCYY (where N is any nucleotide; R is A/G, and Y is C/T), such as the recurring motifs in GC and GT boxes (5'-GGGGCGGGG-3' and 5-GGTGTGGGG-3') that are present in promoters and more distal regulatory elements of mammalian genes. Members of the KLF family can act as activators or repressors of transcription depending on cell and promoter context. KLFs regulate various cellular functions, such as proliferation, differentiation, and apoptosis, as well as the development and homeostasis of several types of tissue. In addition to the C-terminal DNA-binding domain, each KLF also has a unique N-terminal activation/repression domain that confers specifity and allows it to bind specifically to a certain partner, leading to distinct activities in vivo. This model represents the related N-terminal domains of KLF1, KLF2, KLF4, and similar proteins.


Pssm-ID: 409230 [Multi-domain]  Cd Length: 194  Bit Score: 38.81  E-value: 8.13e-03
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 109150409  307 PPPPQTPFGYPGDGVQQPAYTQHQSMDQFQPRmmpiqqdtmhHQVPLPPNGQMPGFGLLSAPPPFPPMPQPGMPQPGMAQ 386
Cdd:cd21972    37 DNPPPPDPAYPPPESPESCSTVYDSDGCHPTP----------NAYCGPNGPGLPGHFLLAGNSPNLGPKIKTENQEQACM 106
                          90       100       110       120       130       140       150
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|.
gi 109150409  387 PGLAQPGMaqpTMPQPGMPQPGMPQPGMAQPGLAQPGMAQPGMPQPAMPQPAMPQPGMAQPGVSPAPPVQP 457
Cdd:cd21972   107 PVAGYSGH---YGPREPQRVPPAPPPPQYAGHFQYHGHFNMFSPPLRANHPGMSTVMLTPLSTPPLGFLSP 174
RRM1_2_MATR3_like cd12436
RNA recognition motif 1 (RRM1) and 2 (RRM2) found in the matrin 3 family of nuclear proteins; ...
585-643 9.46e-03

RNA recognition motif 1 (RRM1) and 2 (RRM2) found in the matrin 3 family of nuclear proteins; This subfamily corresponds to the RRM of the matrin 3 family of nuclear proteins consisting of Matrin 3 (MATR3), nuclear protein 220 (NP220) and similar proteins. MATR3 is a highly conserved inner nuclear matrix protein that has been implicated in various biological processes. NP220 is a large nucleoplasmic DNA-binding protein that binds to cytidine-rich sequences, such as CCCCC (G/C), in double-stranded DNA (dsDNA). Both, Matrin 3 and NP220, contain two RNA recognition motif (RRM), also termed RBDs (RNA binding domains) or RNPs (ribonucleoprotein domains), and a Cys2-His2 zinc finger-like motif at the C-terminal region.


Pssm-ID: 409870 [Multi-domain]  Cd Length: 76  Bit Score: 36.17  E-value: 9.46e-03
                          10        20        30        40        50
                  ....*....|....*....|....*....|....*....|....*....|....*....
gi 109150409  585 TQQDVASLLEEFGPIESINMIPPRGCAYIVMVHRQDAYRALQKLSRGNYKVNQKSIKIA 643
Cdd:cd12436    14 SEEDVLKLAEPFGKVNNVLLIRSKREAFIEMETAEDAQAMLSYCKTKPITIKGKKVKVS 72
RRM1_RBM34 cd12394
RNA recognition motif 1 (RRM1) found in RNA-binding protein 34 (RBM34) and similar proteins; ...
574-625 9.62e-03

RNA recognition motif 1 (RRM1) found in RNA-binding protein 34 (RBM34) and similar proteins; This subfamily corresponds to the RRM1 of RBM34, a putative RNA-binding protein containing two RNA recognition motifs (RRMs), also termed RBDs (RNA binding domains) or RNPs (ribonucleoprotein domains). Although the function of RBM34 remains unclear currently, its RRM domains may participate in mRNA processing. RBM34 may act as an mRNA processing-related protein.


Pssm-ID: 409828 [Multi-domain]  Cd Length: 91  Bit Score: 36.80  E-value: 9.62e-03
                          10        20        30        40        50        60        70
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*
gi 109150409  574 TLWVGQLDKRTTQQDVASLLEEFGPIESINM---------IPPRG--------------CAYIVMVHRQDAYRAL 625
Cdd:cd12394     2 TVFVGNLPVTVKKKALKKLFKEFGKIESVRFrsvavanpkLPKKVavikkkfhpkrdsmNAYVVFKEEESAQKAL 76
PRP38_assoc pfam12871
Pre-mRNA-splicing factor 38-associated hydrophilic C-term; This domain is a hydrophilic region ...
1057-1125 9.69e-03

Pre-mRNA-splicing factor 38-associated hydrophilic C-term; This domain is a hydrophilic region found at the C-terminus of plant and metazoan pre-mRNA-splicing factor 38 proteins. The function is not known.


Pssm-ID: 463734 [Multi-domain]  Cd Length: 98  Bit Score: 36.68  E-value: 9.69e-03
                           10        20        30        40        50        60        70
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 109150409  1057 SGRDQERFGRRSFGSRV-ENDRERYGSRNDDRDNSnRERREWGRRSPDRDRHRDLEERSRRSSGHRDRDR 1125
Cdd:pfam12871   25 SDESERASLSRKRRSRSrRRSSTRDRSRSRSRSRS-RDRRSRGTRDRRRDRDRDRYRSLRSRSRDRSRDR 93
PRK12678 PRK12678
transcription termination factor Rho; Provisional
1048-1117 9.87e-03

transcription termination factor Rho; Provisional


Pssm-ID: 237171 [Multi-domain]  Cd Length: 672  Bit Score: 40.27  E-value: 9.87e-03
                          10        20        30        40        50        60        70
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 109150409 1048 RNDSRQQFNSGRDQERFGRRSFGSRVENDRERYGSRNDDRDNSNRERREWGRRSPDRDRHRDLEERSRRS 1117
Cdd:PRK12678  180 DRQAEAERGERGRREERGRDGDDRDRRDRREQGDRREERGRRDGGDRRGRRRRRDRRDARGDDNREDRGD 249
 
Blast search parameters
Data Source: Precalculated data, version = cdd.v.3.21
Preset Options:Database: CDSEARCH/cdd   Low complexity filter: no  Composition Based Adjustment: yes   E-value threshold: 0.01

References:

  • Wang J et al. (2023), "The conserved domain database in 2023", Nucleic Acids Res.51(D)384-8.
  • Lu S et al. (2020), "The conserved domain database in 2020", Nucleic Acids Res.48(D)265-8.
  • Marchler-Bauer A et al. (2017), "CDD/SPARCLE: functional classification of proteins via subfamily domain architectures.", Nucleic Acids Res.45(D)200-3.
Help | Disclaimer | Write to the Help Desk
NCBI | NLM | NIH