Conserved domains on  [gi|1823900034|ref|XP_032960628|]

THO complex subunit 2 isoform X8 [Rhinolophus ferrumequinum]

Protein Classification

Thoc2 and Tho2 domain-containing protein (domain architecture ID 13162558)

protein containing domains THOC2_N, Thoc2, and Tho2

List of domain hits

Name Accession Description Interval E-value
Tho2 pfam11262
Transcription factor/nuclear export subunit protein 2; THO and TREX form a eukaryotic complex ...
874-1173 6.15e-129

Transcription factor/nuclear export subunit protein 2; THO and TREX form a eukaryotic complex which functions in messenger ribonucleoprotein metabolism and plays a role in preventing the transcription-associated genetic instability. Tho2, along with four other subunits forms THO


Pssm-ID: 463251  Cd Length: 304  Bit Score: 403.15  E-value: 6.15e-129
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1823900034  874 DDISPQFYATFWSLTMYDLAVPHTSYEREVNKLKVQMKAI----DDNQEMPPNKKKKEKERCTALQDKLLEEEKKQMEHV 949
Cdd:pfam11262    1 EYISPEFYVTFWQLSLYDIYVPTESYEAEIERLKKQIRELsrdrSDMSRAGASKKKKEKKRLEALIDKLKEELKEHIEHV 80
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1823900034  950 QRVLQRLKLEKDNWLLA-KSTKNETITKFLQLCIFPRCIFSAIDAVYCARFVELVHQQKTPNFSTLLCYDRVFSD-IIYT 1027
Cdd:pfam11262   81 EKTRKRLQKEKDSWFPGsKAKKNALIDAFLQHCILPRALLSPADALYCAKFIKLLHELGTPNFSTLLLYDRLFKDnLRSL 160
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1823900034 1028 VASCTENEASRYGRFLCCMLETVTRWHSDRATYEKEC---GNYPGFLTILRAtgfDGGNKADQLDYENFRHVVHKWHYKL 1104
Cdd:pfam11262  161 IFSCTEREAENLGRFLNEILKDLSRWHADEAVYEKEAlgkKNLPGFATKFND---DDGKPTDFLSYEDFRRLLYKWHKKL 237
                          250       260       270       280       290       300
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....
gi 1823900034 1105 TKASVHCLETGEYTHIRNILIVLTKILPWYPKVLNLGQALERRVHKICqeEKEKRPDLYALAMGYFGQL 1173
Cdd:pfam11262  238 TSALKSCLESGEYMHIRNAIIVLKKILPVFPAVDFMGEALLKAVEKLA--EREKREDLKVLANSYLGLL 304
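As a rough illustration of how an alignment block like the one above can be read, the sketch below counts identical residues over the first 80 columns of the Tho2 alignment. It assumes the usual CD-Search display convention that dashes are gaps and lowercase letters are unaligned residues; the function name `aligned_identity` is hypothetical and not part of any NCBI tool.

```python
def aligned_identity(query: str, subject: str) -> tuple[int, int]:
    """Return (identical, compared) over columns where both rows hold an
    uppercase residue. Gaps ('-') and lowercase (assumed unaligned) residues
    are skipped."""
    if len(query) != len(subject):
        raise ValueError("alignment rows must have equal length")
    identical = compared = 0
    for q, s in zip(query, subject):
        if q.isupper() and s.isupper():
            compared += 1
            if q == s:
                identical += 1
    return identical, compared


# First row pair of the Tho2 (pfam11262) alignment, copied from the block above.
query = "DDISPQFYATFWSLTMYDLAVPHTSYEREVNKLKVQMKAI----DDNQEMPPNKKKKEKERCTALQDKLLEEEKKQMEHV"
subject = "EYISPEFYVTFWQLSLYDIYVPTESYEAEIERLKKQIRELsrdrSDMSRAGASKKKKEKKRLEALIDKLKEELKEHIEHV"

identical, compared = aligned_identity(query, subject)
print(f"{identical}/{compared} identical aligned columns")
```

Skipping lowercase and gap columns keeps the count restricted to positions that the model actually aligns, which is one reasonable choice among several for a quick identity estimate.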
THOC2_N super family cl24644
THO complex subunit 2 N-terminus; This family represents the N-terminus of THO complex subunit ...
10-566 1.52e-87

THO complex subunit 2 N-terminus; This family represents the N-terminus of THO complex subunit 2.


The actual alignment was detected with superfamily member pfam16134:

Pssm-ID: 465032  Cd Length: 614  Bit Score: 299.15  E-value: 1.52e-87
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1823900034   10 VEWIKNWEKSGRGEflhLCRILSENKNHDSSTyrDFQQALYELSYHVIKGNLKHEQASNVLNDI-SEFREDMPSILADVF 88
Cdd:pfam16134    2 DERINNWGGSGRQE---LIEQLKLARNDEDED--ELSDLFQELIRSVLDGRLDPEDAGSFLKEIiKEEPTDSSEDVAKLF 76
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1823900034   89 CilDIEtNCLEEKSKRDYFTQLVLACLylVSDTVLKERLDPETLESLGLIKQsQQFNQKSVKIKTKLFYKQQKFNLLREE 168
Cdd:pfam16134   77 L--DVL-STFSDSEDMLALRDLLAATI--ISPSLMRLELDTKLLQELGLVRD-TTFHRMLIRKSTNLLYRQKKYNLLREE 150
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1823900034  169 NEGYAKLIAELgQDLSGNITSDLI---LENIKSLIGCFNLDPNRVLDVILEVFECR-PEHDDFFISLL------------ 232
Cdd:pfam16134  151 SEGYSKLITEL-FTTSDNDTFEKVdytFERVKALIGKFDLDPGRVLDVILDVFAAFlVKHYRFFVKFLrasswwprtees 229
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1823900034  233 ---ESYMSMCEP--QTLCHILGFKFKFYQ-EPNGETPSSLYRVAAVLLQFNLIDLDDLYVHLLPADNcIMDEHKREIVE- 305
Cdd:pfam16134  230 dwiSSTKTLPPGgnRVAAQLLGFKLRFYSsDADDELPENLIYLAALLIKEGFISFGDLYPHLSPDDE-EMEALKEEYKKe 308
                          330       340       350       360       370       380       390       400
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1823900034  306 -AKQIVRK----LTMV-VLS-------------SEKIDEREKEKEKEEEKVEKPPDNQKLGLLEALLKIGDWQHAQNIMD 366
Cdd:pfam16134  309 lEEESMEGganaLAMAgALPddddtlppakedeAAASKKAPTKEEEKKEKEPEPKDNQKIQLLKSLLAIGALPESLFILG 388
                          410       420       430       440       450       460       470       480
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1823900034  367 QMpPYYAASHKLIALAICKLIHITIEPLYR-----------RVGVPKGAK-----GSPVNALQNKRAPK----------- 419
Cdd:pfam16134  389 RY-PWLALVDPEIPELIHRILEHSIEPLYEstrsvplssrpESGLPKGNIvrldeNPPRRLLRWPKTDKpffdlgtkyrf 467
                          490       500       510       520       530       540       550       560
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1823900034  420 ----------QAESFEDLRRDVFNMFCYLGPHLSHDPILFAKVVRIGKSFMKEfqsDGSKQEDKEKTevilsclLSITDQ 489
Cdd:pfam16134  468 yydewkdnlpVCQTVDDLFTLSHEFLNLIGVNLGQDPSLLSKLCRIGVKDLEN---SDESEENRDRW-------IDYLRR 537
                          570       580       590       600       610       620       630
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*..
gi 1823900034  490 VLLPSLSLMDCNACMSEELWGMFKTFPYQHRYRLYGQWKNETYNSHPLLVKVKAQTIDRAKYIMKRLTKENVKPSGR 566
Cdd:pfam16134  538 FIFPALSLLEANPIVVDEVYELLKLFPFETRYFLYGEWYEKLTKRNPLIKIAFNKAEKETKDILKRLSKDNIRPMAR 614
Thoc2 pfam11732
Transcription- and export-related complex subunit; The THO/TREX complex is the transcription- ...
568-642 2.61e-39

Transcription- and export-related complex subunit; The THO/TREX complex is the transcription- and export-related complex associated with spliceosomes that preferentially deal with spliced mRNAs as opposed to unspliced mRNAs. Thoc2 plays a role in RNA polymerase II (RNA pol II)-dependent transcription and is required for the stability of DNA repeats. In humans, the TREX complex comprises the exon-junction-associated proteins Aly/REF and UAP56 together with the THO proteins THOC1 (hHpr1/p84), Thoc2 (hRlr1), THOC3 (hTex1), THOC5 (fSAP79), THOC6 (fSAP35), and THOC7 (fSAP24). Although much evidence indicates that the function of the TREX complex as an adaptor between the mRNA and components of the export machinery is conserved among eukaryotes, in Drosophila the majority of mRNAs can be exported from the nucleus independently of the THO complex.


Pssm-ID: 463334  Cd Length: 75  Bit Score: 140.69  E-value: 2.61e-39
                           10        20        30        40        50        60        70
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*
gi 1823900034  568 IGKLSHSNPTILFDYILSQIQKYDNLITPVVDSLKYLTSLNYDVLAYCIIEALANPEKERMKHDDTTISSWLQSL 642
Cdd:pfam11732    1 LAKLSHSNPLIVFEVALNQIESYDNLIEPVVDALKYFTDLGYDVLTYCLLERLTNPGRSRVKDDGTNISPWLQSL 75
U2AF_lg super family cl36941
U2 snRNP auxiliary factor, large subunit, splicing factor; These splicing factors consist of ...
1445-1556 1.21e-04

U2 snRNP auxiliary factor, large subunit, splicing factor; These splicing factors consist of an N-terminal arginine-rich low complexity domain followed by three tandem RNA recognition motifs (pfam00076). The well-characterized members of this family are auxiliary components of the U2 small nuclear ribonucleoprotein splicing factor (U2AF). These proteins are closely related to the CC1-like subfamily of splicing factors (TIGR01622). Members of this subfamily are found in plants, metazoa and fungi.


The actual alignment was detected with superfamily member TIGR01642:

Pssm-ID: 273727 [Multi-domain]  Cd Length: 509  Bit Score: 46.42  E-value: 1.21e-04
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1823900034 1445 PSLSKSKEREMDKkDLDKSRERSREREKKDEKDRKERKRDHSNNDREVPPDLTKRRKEENGTM-----GVSKHKSESPCE 1519
Cdd:TIGR01642    5 PDREREKSRGRDR-DRSSERPRRRSRDRSRFRDRHRRSRERSYREDSRPRDRRRYDSRSPRSLryssvRRSRDRPRRRSR 83
                           90       100       110
                   ....*....|....*....|....*....|....*..
gi 1823900034 1520 SPYPNEKDKEKNKSKSSGKEKGGDSFKSEKMDKISSG 1556
Cdd:TIGR01642   84 SVRSIEQHRRRLRDRSPSNQWRKDDKKRSLWDIKPPG 120
PTZ00121 super family cl31754
MAEBL; Provisional
1291-1593 3.50e-03

MAEBL; Provisional


The actual alignment was detected with superfamily member PTZ00121:

Pssm-ID: 173412 [Multi-domain]  Cd Length: 2084  Bit Score: 42.44  E-value: 3.50e-03
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1823900034 1291 EARILGKDGKEKPK-EERPNKDEKARETKERTPKSDKEKEKFKKEEKVKDEKFKTTvpnvESKSTQEKEREKE--PSRER 1367
Cdd:PTZ00121  1458 KAEEAKKKAEEAKKaDEAKKKAEEAKKADEAKKKAEEAKKKADEAKKAAEAKKKAD----EAKKAEEAKKADEakKAEEA 1533
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1823900034 1368 DIAKEMKSKENVKGGEKTPVSGSLKSPVPRSDIAEPEREQKRRKIDTHPSpSHSSTVKDSLIELKESSAKLYLNHTPPSL 1447
Cdd:PTZ00121  1534 KKADEAKKAEEKKKADELKKAEELKKAEEKKKAEEAKKAEEDKNMALRKA-EEAKKAEEARIEEVMKLYEEEKKMKAEEA 1612
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1823900034 1448 SKSKEREMDKKDLDKSRERSREREKKDEKDRKERKRDHSNNDREVPPDLTK---RRKEENgtmgvSKHKSESPcespypN 1524
Cdd:PTZ00121  1613 KKAEEAKIKAEELKKAEEEKKKVEQLKKKEAEEKKKAEELKKAEEENKIKAaeeAKKAEE-----DKKKAEEA------K 1681
                          250       260       270       280       290       300
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....
gi 1823900034 1525 EKDKEKNKSKSSGKEKGGDSFKSEKMDKISSGGKKESRHDKEKIEKKEKRDSSGGKEEKKHHKSSDKHR 1593
Cdd:PTZ00121  1682 KAEEDEKKAAEALKKEAEEAKKAEELKKKEAEEKKKAEELKKAEEENKIKAEEAKKEAEEDKKKAEEAK 1750
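The intervals reported for the hits above can be placed in N-to-C order and checked for overlap, which helps show why the two low-scoring multi-domain hits at the C-terminus are not independent regions. The following is a minimal sketch with the intervals and E-values copied from the list above; the `Hit` type and `overlapping_pairs` function are hypothetical helpers, not part of CD-Search.

```python
from typing import NamedTuple


class Hit(NamedTuple):
    name: str
    start: int   # first query residue of the hit interval
    end: int     # last query residue of the hit interval
    evalue: float


# Intervals and E-values as listed in the domain hits above.
HITS = [
    Hit("Tho2", 874, 1173, 6.15e-129),
    Hit("THOC2_N", 10, 566, 1.52e-87),
    Hit("Thoc2", 568, 642, 2.61e-39),
    Hit("U2AF_lg", 1445, 1556, 1.21e-04),
    Hit("PTZ00121", 1291, 1593, 3.50e-03),
]


def overlapping_pairs(hits: list[Hit]) -> list[tuple[str, str]]:
    """Return name pairs whose intervals share at least one residue."""
    ordered = sorted(hits, key=lambda h: h.start)
    pairs = []
    for i, a in enumerate(ordered):
        for b in ordered[i + 1:]:
            if b.start > a.end:
                break  # later hits start even further right
            pairs.append((a.name, b.name))
    return pairs


for hit in sorted(HITS, key=lambda h: h.start):
    print(f"{hit.start:>5}-{hit.end:<5} {hit.name:<9} E={hit.evalue:.2e}")
print("overlapping:", overlapping_pairs(HITS))
```

With these intervals, the three specific hits (THOC2_N, Thoc2, Tho2) do not overlap one another, while the U2AF_lg interval lies entirely inside the PTZ00121 interval.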
 
Caldesmon pfam02029
Caldesmon;
1401-1592 6.55e-03

Caldesmon;


Pssm-ID: 460421 [Multi-domain]  Cd Length: 495  Bit Score: 41.01  E-value: 6.55e-03
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1823900034 1401 AEPEREQK----RRKIDTHPSPSHSStvkDSLIELKESSAKLYLNHTPPSLSKSKEREMDKKDLDKSRERSREREKKDEK 1476
Cdd:pfam02029    7 AARERRRRareeRRRQKEEEEPSGQV---TESVEPNEHNSYEEDSELKPSGQGGLDEEEAFLDRTAKREERRQKRLQEAL 83
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1823900034 1477 DRkERKRDHSNNDREVPPDLTKRRKEENGTMGVSKHKSESPCESPYPNEKDKEKNKSKSSGKEKGGDSFKSEKMDKISSG 1556
Cdd:pfam02029   84 ER-QKEFDPTIADEKESVAERKENNEEEENSSWEKEEKRDSRLGRYKEEETEIREKEYQENKWSTEVRQAEEEGEEEEDK 162
                          170       180       190
                   ....*....|....*....|....*....|....*.
gi 1823900034 1557 GKKESRHDKEKIEKKEKRDSSGGKEEKKhhKSSDKH 1592
Cdd:pfam02029  163 SEEAEEVPTENFAKEEVKDEKIKKEKKV--KYESKV 196
 
SF-CC1 TIGR01622
splicing factor, CC1-like family; This model represents a subfamily of RNA splicing factors ...
1448-1540 7.30e-04

splicing factor, CC1-like family; This model represents a subfamily of RNA splicing factors including the Pad-1 protein (N. crassa), CAPER (M. musculus) and CC1.3 (H. sapiens). These proteins are characterized by an N-terminal arginine-rich, low complexity domain followed by three (or in the case of 4 H. sapiens paralogs, two) RNA recognition domains (rrm: pfam00076). These splicing factors are closely related to the U2AF splicing factor family (TIGR01642). A homologous gene from Plasmodium falciparum was identified in the course of the analysis of that genome at TIGR and was included in the seed.


Pssm-ID: 273721 [Multi-domain]  Cd Length: 494  Bit Score: 44.14  E-value: 7.30e-04
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1823900034 1448 SKSKEREMDK---------KDLDKSRERSREREKKDEKDRKERKRDHSNNDrevPPDLTKRRKEENGTMGVSKHKSESPC 1518
Cdd:TIGR01622    2 YRDRERERLRdsssagdrdRRRDKGRERSRDRSRDRERSRSRRRDRHRDRD---YYRGRERRSRSRRPNRRYRPREKRRR 78
                           90       100
                   ....*....|....*....|..
gi 1823900034 1519 EspyPNEKDKEKnKSKSSGKEK 1540
Cdd:TIGR01622   79 R---GDSYRRRR-DDRRSRREK 96
 
Blast search parameters
Data Source: Precalculated data, version = cdd.v.3.21
Preset Options: Database: CDSEARCH/cdd; Low complexity filter: no; Composition Based Adjustment: yes; E-value threshold: 0.01
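To make the E-value threshold concrete, the sketch below parses the per-hit statistics lines shown on this page and checks each hit against the 0.01 cutoff. The regular expression is written against the line layout as it appears here and is an assumption about that layout only; `parse_stats` and `passes_threshold` are hypothetical names, not an NCBI API.

```python
import re

# Matches lines such as:
#   Pssm-ID: 273727 [Multi-domain]  Cd Length: 509  Bit Score: 46.42  E-value: 1.21e-04
STATS = re.compile(
    r"Pssm-ID:\s*(?P<pssm>\d+)(?:\s*\[(?P<tag>[^\]]+)\])?\s+"
    r"Cd Length:\s*(?P<length>\d+)\s+"
    r"Bit Score:\s*(?P<bits>[\d.]+)\s+"
    r"E-value:\s*(?P<evalue>\S+)"
)


def parse_stats(line: str) -> dict:
    """Parse one statistics line into typed fields; raise if the layout differs."""
    match = STATS.search(line)
    if match is None:
        raise ValueError(f"unrecognized statistics line: {line!r}")
    return {
        "pssm_id": int(match["pssm"]),
        "tag": match["tag"],  # e.g. "Multi-domain", or None when absent
        "cd_length": int(match["length"]),
        "bit_score": float(match["bits"]),
        "evalue": float(match["evalue"]),
    }


def passes_threshold(stats: dict, threshold: float = 0.01) -> bool:
    """True when the hit's E-value is at or below the search threshold."""
    return stats["evalue"] <= threshold


lines = [
    "Pssm-ID: 463251  Cd Length: 304  Bit Score: 403.15  E-value: 6.15e-129",
    "Pssm-ID: 273727 [Multi-domain]  Cd Length: 509  Bit Score: 46.42  E-value: 1.21e-04",
    "Pssm-ID: 460421 [Multi-domain]  Cd Length: 495  Bit Score: 41.01  E-value: 6.55e-03",
]
for line in lines:
    stats = parse_stats(line)
    print(stats["pssm_id"], stats["evalue"], passes_threshold(stats))
```

Every hit listed on this page has an E-value below 0.01, which is consistent with the threshold given in the preset options.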

References:

  • Wang J et al. (2023), "The conserved domain database in 2023", Nucleic Acids Res. 51(D):384-8.
  • Lu S et al. (2020), "The conserved domain database in 2020", Nucleic Acids Res. 48(D):265-8.
  • Marchler-Bauer A et al. (2017), "CDD/SPARCLE: functional classification of proteins via subfamily domain architectures", Nucleic Acids Res. 45(D):200-3.