Conserved domains on [gi|255918233|ref|NP_001157645|]

cleavage and polyadenylation specificity factor subunit 1 isoform 1 [Mus musculus]

List of domain hits

Name Accession Description Interval E-value
SFT1 super family cl34923
Pre-mRNA cleavage and polyadenylation specificity factor [RNA processing and modification];
1-1421 3.79e-119


The actual alignment was detected with superfamily member COG5161:

Pssm-ID: 227490 [Multi-domain]  Cd Length: 1319  Bit Score: 405.50  E-value: 3.79e-119
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 255918233    1 MYAVYKQAHPPTGLEFTMYCNFFNNSERNLVVAGTSQLYVYRLNRDaealtkndgstegkahrEKLELVASFSFFGNVMS 80
Cdd:COG5161     1 MNYLYSDESDWTVTEGCSAGLFTPSRTCSLLVYNGNILAVRLWKYD-----------------SGLVLVDEHMLLEKVTQ 63
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 255918233   81 MASVQLAGAKRDALLLSFKDAKLSVVEYDPGTHDLKTLSLHYFE-----------------EPELRDG------------ 131
Cdd:COG5161    64 IEKYPQISSEQDGLLLLTHRAKISLLRFDSQANEFRTISLHYYEgkfkgkslvelakfstlEFDIRSScallfnedignf 143
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 255918233  132 FVQNVHTP-RVRVDPDGRCAAMLIYGTRLVVLPFRRESLAEEHEGLMGEGQRSSflPSYIIDVRALDEKLLNIIDLQFLH 210
Cdd:COG5161   144 LPFHVNKNdDDEVRIDVDLGMFQMSKRHFSIFPSQGTNTFNKRKRTLFPGKFSA--PSKVLKFSELDGKIKNIIDFVFLE 221
                         250       260       270       280       290       300       310       320
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 255918233  211 GYYEPTLLILFEPNQTWPGRVAVRQDTCSIVAISLNITQKVHPVIWSLTSLPFDCTQALAVPKpigGVVIFAVNSLLYLN 290
Cdd:COG5161   222 NYSIPTVALLYDPKLSLPRKYTILKNPYNAIVFTLDLGAGRSAVIDEFLVLPRDFRVTVAGPV---GALLFGSNELILID 298
                         330       340       350       360       370       380       390       400
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 255918233  291 QSVPPYGVALNSLTTGTTAFPlRTQEGVRITLDCAQAAFISY---------DKMVISLKGGEIYVLTLITDGmRSVRAFH 361
Cdd:COG5161   299 STGSSYTIPLNSMSEKYGGNK-IVEDISLSDVNCFSRGTTSIwipsskcliETLFLGDLNGDRYYLRISMDG-KRIIGFD 376
                         410       420       430       440       450       460       470       480
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 255918233  362 -----------FDKAAASVLTTSMVTMepgyLFLGSRLGNSLLLKYTEKLqepPASSVREAadkeeppskkKRVEPAVGW 430
Cdd:COG5161   377 iaslefegdllKKGSAVSCVGHVNNLL----FFGGVGDSNSRVLRIKSLL---PTIETRAS----------EGVGPLEGG 439
                         490       500       510       520       530       540       550       560
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 255918233  431 TGGKTvpQDEVDEIEV---YGSEAQSGTQLATYSFEVCDSMLNIGPCANAAVGEPAFLSEEFQNSPEPdLEIVVCSGYGK 507
Cdd:COG5161   440 NDEEM--DDEYSAPENklfGNKEQEVRRQDEPYDAELFNALSNAGPITDFAVGKVDVEKGLPIPNIGL-LNLVVTKGSDS 516
                         570       580       590       600       610       620       630       640
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 255918233  508 NGALSVLQKSIRPQVVTTFELPGCYDMWTViapvrkeeeetpkaesteqepSAPKAEEDGRRHGFLILSREDSTMILQTG 587
Cdd:COG5161   517 EAALAVEGTSLEPCICTVSSFIPLEIVWSQ---------------------KIRGYLRCSRALDFYILSRVSDSRIFRWS 575
                         650       660       670       680       690       700       710       720
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 255918233  588 QEIMELDTSGFATQGPTVFAGNIGDNRYIVQVSPLGIRLLEG-VNQLHFIPVDLGApIVQCAVADPYVVIMSAEGHVTMF 666
Cdd:COG5161   576 EEFLLEVSGEYTRDVNTLLFVEFGEENRVVQVTPSYLLRYDQdLRMLGRVEFASRA-VEARSVRDPLILVVRDSGKILTF 654
                         730       740       750       760       770       780       790       800
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 255918233  667 LLKSDSyggrhhrLALHKPPLhhqskVIALCLYRDVSGMFTTESRLGgarDELGGRSGSEAEGLgSETSPTVDDEeemly 746
Cdd:COG5161   655 YDREKN-------MRLFKIDL-----VTCLADAKNKSFVLSDSNSLG---IFDIGKRISQLEPC-LVKGLPYAIQ----- 713
                         810       820       830       840       850       860       870       880
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 255918233  747 gdssalFSPSkeearrsSQPPADRDPAPFKADPthwcllVRENGTMEIYQLPDwrLVFlvknfpvgqrvlvdssfgqptt 826
Cdd:COG5161   714 ------FSPE-------ASPAMDLAGEEDGDDQ------LTEISMSLTYNLID--MLF---------------------- 750
                         890       900       910       920       930       940       950       960
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 255918233  827 qgevrkeeatrqgELPLVKEVLLVALGSRQSRPYL-LVHVDQELLIYEAFPhdsqlgqgnlkvrfkkvPHNINFREKKPK 905
Cdd:COG5161   751 -------------RLPSIGNYMVAYLGLDLKEEYLfDNSLSSEIVFYKTHL-----------------PRHVSFNLNVTR 800
                         970       980       990      1000      1010      1020      1030      1040
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 255918233  906 PSKKKAEGCSTEEGSGGRGRVARFRYFEDIYGYSGVFICGPSPHWLLVTGRGALRLHPMGiDGPIDSFAPFHNvncpRGF 985
Cdd:COG5161   801 NDLAITGAPDNADIKAFSSVGRIDMVFIKAVGHSFMFVTGKGPFLCRSRYTSSSKAFHRG-NIPLVSVIPLSK----RGY 875
                        1050      1060      1070      1080      1090      1100      1110      1120
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 255918233  986 LYFNRQGELRISVLPAYLSYDA-PWPVRKIPLRCTAHYVAYHVESKVYAVATstntpCTRIPRMTGEEKEFEAIERDDRY 1064
Cdd:COG5161   876 LMVDNVLGVRASQYVFDNGYVGnKNPVKRTPKHKTLQKLVYHCAGRYMVVGS-----CEEAGFSPKGEDGESGIPVDTNV 950
                        1130      1140      1150      1160      1170      1180      1190      1200
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 255918233 1065 IHPQQEAFSIQLISPVSWEAIPnaRIELEEWEHVTCMKTVSLRSEETVSGLKGYVAAGTCLMQGEEVTCRGRILIMDVIE 1144
Cdd:COG5161   951 PHAEGYRFYVDLYSPKSWEVID--TYEFDENEYVFHIKYLILDDMQGTKGKSPYILVGTTFIEGEDRPARGRLHVLEIIS 1028
                        1210      1220      1230      1240      1250      1260      1270      1280
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 255918233 1145 VVPEPGQPLTKNKFKVLYEKEQKGPVTALCHCNGHLVSAIGQKIFLWSL-RASELTGMAFIDTQLYIHQMISVKNFILAA 1223
Cdd:COG5161  1029 VVPSPGSPFTDCKLKVLGIEETKGTVVRVCEVRGKIALCQGQKVMVRKIdRSSGIIPVGFYDLHIFTSSIKVVKNLLLAG 1108
                        1290      1300      1310      1320      1330      1340      1350      1360
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 255918233 1224 DVMKSISLLRYQEESKTLSLVSRDAKPLEVYSVDFMVDNAQLGFLVSDRDRNLMVYMYLPEAKESFGGMRLLRRADFHVG 1303
Cdd:COG5161  1109 DIYQGLSFFGFQSEPYRMHLISSSEPLRNATSTEFLVTGNELYFLCCDAKGNIHGLTYSPNNPISMSGARLVKRSSFTLH 1188
                        1370      1380      1390      1400      1410      1420      1430      1440
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 255918233 1304 A---HVNTFWRTPCRGAAEGPSKKSVvwenkhitwFATLDGGIGLLLPMQEKTYRRLLMLQNALTTMLPHHAGLNPRAFR 1380
Cdd:COG5161  1189 SaeiKMNLLPRNSEFGAGFKKNFIMV---------YSRSDGMLIHVVPISDAHYRRLLGIQTAIMARLKSVGGLNPRDYR 1259
                        1450      1460      1470      1480
                  ....*....|....*....|....*....|....*....|.
gi 255918233 1381 mLHVDRRILQNAVRNVLDGELLNRYLYLSTMERSELAKKIG 1421
Cdd:COG5161  1260 -LNSDIHLHSLSLRSPLDLHIINLFSYFDMSTRESVASKAG 1299
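Each alignment row above has the same layout: a label, a start coordinate, the aligned string, and an end coordinate. The following is a minimal Python sketch for reading one query/subject row pair and counting identities. It assumes that uppercase letters mark aligned residues, lowercase letters mark unaligned residues, and dashes mark gaps; that convention is an assumption here, not something stated on this page. The helpers `parse_row`, `residue_count`, and `identity` are hypothetical names introduced for illustration and are not part of any NCBI tool.

```python
from typing import NamedTuple, Tuple


class AlignedRow(NamedTuple):
    label: str   # e.g. "gi 255918233" or "Cdd:COG5161"
    start: int   # coordinate of the first residue in this row
    seq: str     # aligned string, including gaps and lowercase residues
    end: int     # coordinate of the last residue in this row


def parse_row(line: str) -> AlignedRow:
    """Split one alignment row into label, start, sequence, and end."""
    tokens = line.split()
    # The last three tokens are always start, sequence, end;
    # everything before them is the row label.
    return AlignedRow(" ".join(tokens[:-3]), int(tokens[-3]), tokens[-2], int(tokens[-1]))


def residue_count(row: AlignedRow) -> int:
    """Count residues (letters of either case); gap characters are ignored."""
    return sum(ch.isalpha() for ch in row.seq)


def identity(query: AlignedRow, subject: AlignedRow) -> Tuple[int, int]:
    """Return (identical, aligned) over columns where both rows are uppercase."""
    identical = 0
    aligned = 0
    for q, s in zip(query.seq, subject.seq):
        if q.isupper() and s.isupper():  # assumed meaning: an aligned column
            aligned += 1
            identical += int(q == s)
    return identical, aligned


if __name__ == "__main__":
    # First row pair of the COG5161 alignment, copied from the report.
    q = parse_row("gi 255918233    1 MYAVYKQAHPPTGLEFTMYCNFFNNSERNLVVAGTSQLYVYRLNRDaealtkndgstegkahrEKLELVASFSFFGNVMS 80")
    s = parse_row("Cdd:COG5161     1 MNYLYSDESDWTVTEGCSAGLFTPSRTCSLLVYNGNILAVRLWKYD-----------------SGLVLVDEHMLLEKVTQ 63")
    same, cols = identity(q, s)
    print(q.label, residue_count(q), "residues;", s.label, residue_count(s), "residues")
    print("identical columns:", same, "of", cols, "aligned columns")
```

A quick sanity check on any row is that `residue_count(row)` should equal `row.end - row.start + 1`, since gaps do not advance the coordinate.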
 
Name Accession Description Interval E-value
SFT1 COG5161
Pre-mRNA cleavage and polyadenylation specificity factor [RNA processing and modification];
1-1421 3.79e-119

CPSF_A pfam03178
CPSF A subunit region; This family includes a region that lies towards the C-terminus of the ...
1071-1406 4.48e-95

CPSF A subunit region; This family includes a region that lies towards the C-terminus of the cleavage and polyadenylation specificity factor (CPSF) A (160 kDa) subunit. CPSF is involved in mRNA polyadenylation and binds the AAUAAA conserved sequence in pre-mRNA. CPSF has also been found to be necessary for splicing of single-intron pre-mRNAs. The function of the aligned region is unknown but may be involved in RNA/DNA binding.


Pssm-ID: 427182  Cd Length: 319  Bit Score: 309.14  E-value: 4.48e-95
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 255918233  1071 AFSIQLISPVSWEAIPnaRIELEEWEHVTCMKTVSLRSEETVSGLKGYVAAGTCLMQGEEVTCR-GRILIMDVIEVvpep 1149
Cdd:pfam03178    1 ASCIRLVDPITKEVID--TLELEENEAVLSVKSVNLEDSSTTKGKEEYLVVGTAFDLGEDPAARsGRILVFEIIEV---- 74
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 255918233  1150 gqPLTKNKFKVLYEKEQKGPVTALCHCNGHLVSAIGQKIFLWSL-RASELTGMAFIDTQLYIHQMISVKNFILAADVMKS 1228
Cdd:pfam03178   75 --PETNRKLKLVHKTEVKGAVTALAEFQGRLLAGQGQKLRVYDLgEDKSLLPKAFLDTGVYVVDLKVFGNRIIVGDLMKS 152
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 255918233  1229 ISLLRYQEESKTLSLVSRDAKPLEVYSVDFMVDNaqlGFLVSDRDRNLMVYMYLPEAKESFGG-MRLLRRADFHVGAHVN 1307
Cdd:pfam03178  153 VTFVGYDEEPYRLIEFARDTQPRWVTAAEFLDGD---TVLVADKFGNLHVLRYDPDVPESLDGdPRLLVRAEFHLGETVT 229
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 255918233  1308 TFWRTPC-RGAAEGPSKKSVVWenkhitwfATLDGGIGLLLP-MQEKTYRRLLMLQNALTTMLPHHAGLNPRAFRMLHVD 1385
Cdd:pfam03178  230 SFRKGSLvPGGSESPSSPQLLY--------GTLDGSIGLLVPfISEEDYRFLQSLQQQLRDELPHLGGLDHRAFRSYYTP 301
                          330       340
                   ....*....|....*....|.
gi 255918233  1386 RRilqnAVRNVLDGELLNRYL 1406
Cdd:pfam03178  302 PR----TVKGVIDGDLLERFL 318
 
MMS1_N pfam10433
Mono-functional DNA-alkylating methyl methanesulfonate N-term; MMS1 is a protein that protects ...
92-670 1.90e-27

Mono-functional DNA-alkylating methyl methanesulfonate N-term; MMS1 is a protein that protects against replication-dependent DNA damage in Saccharomyces cerevisiae. MMS1 belongs to the DDB1 family of cullin 4 adaptors and the two proteins are homologous. MMS1 bridges the interaction of MMS22 and Crt10 with Cul8/Rtt101. Cul8/Rtt101 is a cullin protein involved in the regulation of DNA replication subsequent to DNA damage. The N-terminal region of MMS1 and the C-terminus of MMS22 are required for the MMS1-MMS22 interaction. The human HIV-1 virion-associated protein Vpr assembles with DDB1 through interaction with DCAF1 (chromatin assembly factor) to form an E3 ubiquitin ligase that targets cellular substrates for proteasome-mediated degradation and subsequent G2 arrest.


Pssm-ID: 463091  Cd Length: 486  Bit Score: 118.14  E-value: 1.90e-27
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 255918233    92 DALLLSFKDAKLSVVEYDPGTHDLKTLslHYFE---EPELRDGFVQNvhtpRVRVDPDGRCAAMLIYGTRLVVLPFRRES 168
Cdd:pfam10433    1 DHLVVGTDSGRLVFLSWDPEKNQFETI--HSREdlgKSGSRRSQPGQ----YLAVDPKGRAIAVSAYEGVFLVYPLKQPQ 74
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 255918233   169 LAEEHEglmgEGQRSSFLPSYIIDvraldeklLNIIDLQFLH-GYYEPTLLILFEPNQtwpGRVAVrqdTCSIVAISLNI 247
Cdd:pfam10433   75 KLNRNE----ALLLSSPLEARKSE--------GFILSMVFLDpGYDNPIFALLEQDRT---GKTHL---KLYEWDLGLNH 136
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 255918233   248 TQKvhPVIWSLTS--LPFDCTQA---LAVPKPIGGVVIFAVNSLLYLNQSVPPYgvalnsLTTGTTAFPLRTQegvritl 322
Cdd:pfam10433  137 VVR--GPKWSEPLdfLPKEDRGAnllIPVPKGPGGVLVCGETIITYKDILDQPD------IRCPPVARPLREN------- 201
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 255918233   323 dcaQAAFISYDKM-----VISLKGGEIYVLTLITD------GMRSVRAFHFDKAaasvltTSMVTMEPGYLFLGSRLGNS 391
Cdd:pfam10433  202 ---ATIFVAWHKLdnffiLLADEYGDLYLLTIENDednvvtSIKIGYFGTTSVA------SALVILDNGFLFVASEFGDS 272
                          330       340       350       360       370       380       390       400
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 255918233   392 LLLKYTEKLQEPPAssvreaadkeeppskkkrvepavgwtggktvpqdevdeievygseaqsgtqlatySFEVCDSMLNI 471
Cdd:pfam10433  273 QLYQIDARGDDDLS-------------------------------------------------------NLELVQTFSNW 297
                          410       420       430       440       450       460       470       480
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 255918233   472 GPCANAAVgepAFLSEefqnspEPDLEIVVCSGYGKNGALSVLQKSIRPQVVTTFELPG--CYDMWTViapvrkeeeetp 549
Cdd:pfam10433  298 APILDFVV---MDLGG------EDTARIYTCSGAGKRGSLRSLRHGVGAEELAVSEEPGspITGVWTL------------ 356
                          490       500       510       520       530       540       550       560
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 255918233   550 kaesteqePSAPKAEEDgrrhGFLILSREDSTMILQ-TGQEIMELDT-SGFATQGPTVFAGNIGDNRyIVQVSPLGIRLL 627
Cdd:pfam10433  357 --------KSSPEDEYD----DYLVVSFVNETRVLSiDGDGVEEVDEdSGFLLSVPTLAAGNLGDGR-LLQVTPNGIRLI 423
                          570       580       590       600
                   ....*....|....*....|....*....|....*....|...
gi 255918233   628 EGVNQLHFIPVDLGAPIVQCAVADPYVVIMSAEGHVTMFLLKS 670
Cdd:pfam10433  424 DSDKRISEWKPPGGKSITAAAANGRQVLLALSGGELVYFEIST 466
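The hit list above gives each domain as a query interval with an E-value. As a rough illustration of how those rows relate to each other, the Python sketch below records the three specific hits from this report, computes the length of each interval, and checks which intervals overlap. The `Hit` type and the helper functions are hypothetical names introduced for this example; the 0.01 cutoff is the E-value threshold listed in the page's search parameters.

```python
from typing import List, NamedTuple


class Hit(NamedTuple):
    name: str
    accession: str
    start: int      # first query residue covered by the hit
    end: int        # last query residue covered by the hit
    evalue: float


# Values copied from the domain hit list in this report.
HITS = [
    Hit("SFT1", "COG5161", 1, 1421, 3.79e-119),
    Hit("CPSF_A", "pfam03178", 1071, 1406, 4.48e-95),
    Hit("MMS1_N", "pfam10433", 92, 670, 1.90e-27),
]

E_VALUE_THRESHOLD = 0.01  # threshold reported under the search parameters


def span(hit: Hit) -> int:
    """Number of query residues covered by the hit interval (inclusive)."""
    return hit.end - hit.start + 1


def overlap(a: Hit, b: Hit) -> int:
    """Number of query residues shared by two hit intervals (0 if disjoint)."""
    return max(0, min(a.end, b.end) - max(a.start, b.start) + 1)


def significant(hits: List[Hit], threshold: float = E_VALUE_THRESHOLD) -> List[Hit]:
    """Hits at or below the E-value threshold, most significant first."""
    return sorted((h for h in hits if h.evalue <= threshold), key=lambda h: h.evalue)


if __name__ == "__main__":
    for h in significant(HITS):
        print(f"{h.name:8s} {h.accession:10s} {h.start}-{h.end} ({span(h)} residues) E={h.evalue:.2e}")
    sft1, cpsf_a, mms1_n = HITS
    print("CPSF_A / MMS1_N overlap:", overlap(cpsf_a, mms1_n))
    print("SFT1 / CPSF_A overlap:", overlap(sft1, cpsf_a))
```

Read this way, the two Pfam hits occupy separate parts of the query (an N-terminal MMS1_N region and a C-terminal CPSF_A region), and both fall inside the interval of the multi-domain SFT1 model.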
 
Blast search parameters
Data Source: Precalculated data, version = cdd.v.3.21
Preset Options: Database: CDSEARCH/cdd; Low complexity filter: no; Composition Based Adjustment: yes; E-value threshold: 0.01
