NCBI Home Page NCBI Site Search page NCBI Guide that lists and describes the NCBI resources
Conserved domains on  [gi|1720408070|ref|XP_030109247|]
View 

uncharacterized protein C9orf43 homolog isoform X8 [Mus musculus]

Protein Classification

Graphical summary

 Zoom to residue level

show extra options »

Show site features     Horizontal zoom: ×

List of domain hits

Name Accession Description Interval E-value
DUF4647 super family cl21314
Domain of unknown function (DUF4647); This family of proteins is found in eukaryotes. Proteins ...
1-323 1.96e-163

Domain of unknown function (DUF4647); This family of proteins is found in eukaryotes. Proteins in this family are typically between 282 and 480 amino acids in length.


The actual alignment was detected with superfamily member pfam15504:

Pssm-ID: 464752  Cd Length: 467  Bit Score: 464.34  E-value: 1.96e-163
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720408070   1 MVVTWVPEETEKDVsPVQKTDVSSWPGKKRRKKLRKKSKPSLYYPGRQY----SRSPAAIVPPPSPEHHLEQLSPEAIPL 76
Cdd:pfam15504 140 MVVIWIPEEPEKHV-AEEKPDVTSQDGKKKRKKSTVKSKSSLGLSGKQYretqLRSPGMIVPPPSPVHLLEQLSSESIPL 218
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720408070  77 WAQVGMLPQDLLEECILAHEKSIIGPEVKIELSKMRKSLPLERRRPESAISSKMYLTIQRLTLQRPSLRYPARLRKLCPN 156
Cdd:pfam15504 219 WAQFDMLPQDLLKDLLLDEGKTMPCPEMKIQLAMMKKSLPLEKSRPDSAISSKMFLSVHRLTLQRPSLRYPEHLKKLRHN 298
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720408070 157 LKQgEGLAGHGSSDSLMQQGKAKTFPPKQEPKKKAKRNVKGQYGEETTSGHFFHDSVGLRISGQEDQQTPWEEEDIEKTS 236
Cdd:pfam15504 299 LKT-EGLRKQQQWQQQQQQRKVKTPTKKQEAKKKAKSDPGSQYTSRKHSGHIFHDPVGLRTLRGQESDKKQQQEGKEKGP 377
                         250       260       270       280       290       300       310       320
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720408070 237 AETHVSLEEV-YEFDKYYTEYYATPESAVLYET--VYQNLDDDEETMVGIKASSKDRNLKNLSAMMDGIGWNPELKLLRI 313
Cdd:pfam15504 378 TLKQVSTERPqMDYAEKYLDYYHSPESPELYETesTYKDISTQVEAVLESQASSREETPKNLSASMDGISWNPELKLLRI 457
                         330
                  ....*....|
gi 1720408070 314 LQATEEEDEE 323
Cdd:pfam15504 458 LQATDDEDEE 467
 
Name Accession Description Interval E-value
DUF4647 pfam15504
Domain of unknown function (DUF4647); This family of proteins is found in eukaryotes. Proteins ...
1-323 1.96e-163

Domain of unknown function (DUF4647); This family of proteins is found in eukaryotes. Proteins in this family are typically between 282 and 480 amino acids in length.


Pssm-ID: 464752  Cd Length: 467  Bit Score: 464.34  E-value: 1.96e-163
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720408070   1 MVVTWVPEETEKDVsPVQKTDVSSWPGKKRRKKLRKKSKPSLYYPGRQY----SRSPAAIVPPPSPEHHLEQLSPEAIPL 76
Cdd:pfam15504 140 MVVIWIPEEPEKHV-AEEKPDVTSQDGKKKRKKSTVKSKSSLGLSGKQYretqLRSPGMIVPPPSPVHLLEQLSSESIPL 218
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720408070  77 WAQVGMLPQDLLEECILAHEKSIIGPEVKIELSKMRKSLPLERRRPESAISSKMYLTIQRLTLQRPSLRYPARLRKLCPN 156
Cdd:pfam15504 219 WAQFDMLPQDLLKDLLLDEGKTMPCPEMKIQLAMMKKSLPLEKSRPDSAISSKMFLSVHRLTLQRPSLRYPEHLKKLRHN 298
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720408070 157 LKQgEGLAGHGSSDSLMQQGKAKTFPPKQEPKKKAKRNVKGQYGEETTSGHFFHDSVGLRISGQEDQQTPWEEEDIEKTS 236
Cdd:pfam15504 299 LKT-EGLRKQQQWQQQQQQRKVKTPTKKQEAKKKAKSDPGSQYTSRKHSGHIFHDPVGLRTLRGQESDKKQQQEGKEKGP 377
                         250       260       270       280       290       300       310       320
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720408070 237 AETHVSLEEV-YEFDKYYTEYYATPESAVLYET--VYQNLDDDEETMVGIKASSKDRNLKNLSAMMDGIGWNPELKLLRI 313
Cdd:pfam15504 378 TLKQVSTERPqMDYAEKYLDYYHSPESPELYETesTYKDISTQVEAVLESQASSREETPKNLSASMDGISWNPELKLLRI 457
                         330
                  ....*....|
gi 1720408070 314 LQATEEEDEE 323
Cdd:pfam15504 458 LQATDDEDEE 467
 
Name Accession Description Interval E-value
DUF4647 pfam15504
Domain of unknown function (DUF4647); This family of proteins is found in eukaryotes. Proteins ...
1-323 1.96e-163

Domain of unknown function (DUF4647); This family of proteins is found in eukaryotes. Proteins in this family are typically between 282 and 480 amino acids in length.


Pssm-ID: 464752  Cd Length: 467  Bit Score: 464.34  E-value: 1.96e-163
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720408070   1 MVVTWVPEETEKDVsPVQKTDVSSWPGKKRRKKLRKKSKPSLYYPGRQY----SRSPAAIVPPPSPEHHLEQLSPEAIPL 76
Cdd:pfam15504 140 MVVIWIPEEPEKHV-AEEKPDVTSQDGKKKRKKSTVKSKSSLGLSGKQYretqLRSPGMIVPPPSPVHLLEQLSSESIPL 218
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720408070  77 WAQVGMLPQDLLEECILAHEKSIIGPEVKIELSKMRKSLPLERRRPESAISSKMYLTIQRLTLQRPSLRYPARLRKLCPN 156
Cdd:pfam15504 219 WAQFDMLPQDLLKDLLLDEGKTMPCPEMKIQLAMMKKSLPLEKSRPDSAISSKMFLSVHRLTLQRPSLRYPEHLKKLRHN 298
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720408070 157 LKQgEGLAGHGSSDSLMQQGKAKTFPPKQEPKKKAKRNVKGQYGEETTSGHFFHDSVGLRISGQEDQQTPWEEEDIEKTS 236
Cdd:pfam15504 299 LKT-EGLRKQQQWQQQQQQRKVKTPTKKQEAKKKAKSDPGSQYTSRKHSGHIFHDPVGLRTLRGQESDKKQQQEGKEKGP 377
                         250       260       270       280       290       300       310       320
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720408070 237 AETHVSLEEV-YEFDKYYTEYYATPESAVLYET--VYQNLDDDEETMVGIKASSKDRNLKNLSAMMDGIGWNPELKLLRI 313
Cdd:pfam15504 378 TLKQVSTERPqMDYAEKYLDYYHSPESPELYETesTYKDISTQVEAVLESQASSREETPKNLSASMDGISWNPELKLLRI 457
                         330
                  ....*....|
gi 1720408070 314 LQATEEEDEE 323
Cdd:pfam15504 458 LQATDDEDEE 467
 
Blast search parameters
Data Source: Precalculated data, version = cdd.v.3.21
Preset Options:Database: CDSEARCH/cdd   Low complexity filter: no  Composition Based Adjustment: yes   E-value threshold: 0.01

References:

  • Wang J et al. (2023), "The conserved domain database in 2023", Nucleic Acids Res.51(D)384-8.
  • Lu S et al. (2020), "The conserved domain database in 2020", Nucleic Acids Res.48(D)265-8.
  • Marchler-Bauer A et al. (2017), "CDD/SPARCLE: functional classification of proteins via subfamily domain architectures.", Nucleic Acids Res.45(D)200-3.
Help | Disclaimer | Write to the Help Desk
NCBI | NLM | NIH