Conserved domains on  [gi|42571627|ref|NP_973904|]

hypothetical protein (DUF626) [Arabidopsis thaliana]

List of domain hits

Name                      Accession  Interval  E-value
protein_MS5 super family  cl19902    18-344    1.39e-98

Protein MS5; Proteins are known only from species of Brassicaceae. Protein MS5 is essential for pairing of homologs during early prophase stage of meiosis but not necessary for the initiation of DNA double-strand breaks.


The actual alignment was detected with superfamily member TIGR01572:

Pssm-ID: 450408 [Multi-domain]  Cd Length: 265  Bit Score: 293.29  E-value: 1.39e-98
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 42571627    18 DFWRQAAKSDGFDLenislPPGTNGIVMGLIPYDCQRARHYPFPVLVKLYAKFGLHRYNMLKGTSFQLATLMKFNMLPNY 97
Cdd:TIGR01572   1 EYYRQIRESDGFDV-----PIGIPTSLTPLYHYDCANNSRYPDGDLVKIYARVGLHRYNFLEGTNLELDHVDKFNKRMCA 75
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 42571627    98 ISSFYMTLLAHDPDpaAGSSQKTFQVRVDEQQFGSLDINCSIARPKHegdllEVSTETPFMPHFhggalgdgifkvelpd 177
Cdd:TIGR01572  76 LSSYYITLLAVDPD--SRFLQQTFQVRVDEQKLETLDLTVEIARPKP-----KVTTNEPFLKPW---------------- 132
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 42571627   178 clSDTALNelagavlrgelpehvfddalyaragGIFQGELPDWPSDDVLN-DGKRFYMVKESEWQATDWISMYLELVITT 256
Cdd:TIGR01572 133 --SDSAID-------------------------DIYKGELPEWPSDDALMsDQKRFYRVKESELRENDWISLYLELALVS 185
                         250       260       270       280       290       300       310       320
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 42571627   257 TDKSiSIKPEVLSKLEIVKVAIETaTKDEEPSNERLKAYRAHVYITFKGLAEPRahervfeIGEHVERQAIVRRVM-GHR 335
Cdd:TIGR01572 186 HDRG-DMRLKSLSQLQIVKVVIET-REDFKPNKEKLNAKTAIVYITFKGLNKPR-------IGEPVDRKAIVRRIIdEIT 256

                  ....*....
gi 42571627   336 GDLTLKGKL 344
Cdd:TIGR01572 257 GHLCLEGKC 265
 
Name             Accession  Interval  E-value
A_thl_para_3677  TIGR01572  18-344    1.39e-98

Arabidopsis paralogous family TIGR01572; This model describes a paralogous family of hypothetical proteins in Arabidopsis thaliana. No homologs are detected from other species. Length heterogeneity within the family is attributable partly to a 21-residue repeat present in from zero to three tandem copies. The central region of the repeat resembles the pattern [VIF][FY][QK]GX[LM]P[DEK]XXXDDAL.
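The residue pattern quoted above can be read as a regular expression in which X stands for any residue. The following is a minimal sketch of that reading, not part of the CDD record: the function names and the example sequence are hypothetical, and only exact matches are reported.

```python
import re
from typing import List, Tuple

# Pattern as quoted in the TIGR01572 description; X is treated as "any residue".
PATTERN = "[VIF][FY][QK]GX[LM]P[DEK]XXXDDAL"


def pattern_to_regex(pattern: str) -> "re.Pattern[str]":
    """Compile the bracketed residue pattern, mapping each bare X to a wildcard."""
    # None of the bracketed classes contains X, so a plain replace is safe here.
    return re.compile(pattern.replace("X", "."))


def find_repeat_cores(sequence: str) -> List[Tuple[int, str]]:
    """Return (1-based start, matched text) for each non-overlapping exact match."""
    regex = pattern_to_regex(PATTERN)
    return [(m.start() + 1, m.group()) for m in regex.finditer(sequence.upper())]


# Hypothetical sequence constructed to contain one exact match (not from the record).
example = "MKT" + "VFQGALPDSTSDDAL" + "GHR"
print(find_repeat_cores(example))  # [(4, 'VFQGALPDSTSDDAL')]
```

An exact match is a strict test. The query segment IFQGELPDWPSDDVL in the alignment differs from the pattern at one position (V where the pattern has A), so this sketch does not report it, which is consistent with the description saying the repeat only "resembles" the pattern.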


Pssm-ID: 273698 [Multi-domain]  Cd Length: 265  Bit Score: 293.29  E-value: 1.39e-98
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 42571627    18 DFWRQAAKSDGFDLenislPPGTNGIVMGLIPYDCQRARHYPFPVLVKLYAKFGLHRYNMLKGTSFQLATLMKFNMLPNY 97
Cdd:TIGR01572   1 EYYRQIRESDGFDV-----PIGIPTSLTPLYHYDCANNSRYPDGDLVKIYARVGLHRYNFLEGTNLELDHVDKFNKRMCA 75
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 42571627    98 ISSFYMTLLAHDPDpaAGSSQKTFQVRVDEQQFGSLDINCSIARPKHegdllEVSTETPFMPHFhggalgdgifkvelpd 177
Cdd:TIGR01572  76 LSSYYITLLAVDPD--SRFLQQTFQVRVDEQKLETLDLTVEIARPKP-----KVTTNEPFLKPW---------------- 132
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 42571627   178 clSDTALNelagavlrgelpehvfddalyaragGIFQGELPDWPSDDVLN-DGKRFYMVKESEWQATDWISMYLELVITT 256
Cdd:TIGR01572 133 --SDSAID-------------------------DIYKGELPEWPSDDALMsDQKRFYRVKESELRENDWISLYLELALVS 185
                         250       260       270       280       290       300       310       320
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 42571627   257 TDKSiSIKPEVLSKLEIVKVAIETaTKDEEPSNERLKAYRAHVYITFKGLAEPRahervfeIGEHVERQAIVRRVM-GHR 335
Cdd:TIGR01572 186 HDRG-DMRLKSLSQLQIVKVVIET-REDFKPNKEKLNAKTAIVYITFKGLNKPR-------IGEPVDRKAIVRRIIdEIT 256

                  ....*....
gi 42571627   336 GDLTLKGKL 344
Cdd:TIGR01572 257 GHLCLEGKC 265
Name         Accession  Interval  E-value
protein_MS5  pfam04776  217-340   7.82e-54

Protein MS5; Proteins are known only from species of Brassicaceae. Protein MS5 is essential for pairing of homologs during early prophase stage of meiosis but not necessary for the initiation of DNA double-strand breaks.


Pssm-ID: 428116  Cd Length: 118  Bit Score: 173.58  E-value: 7.82e-54
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 42571627   217 LPDWPSDDVL-NDGKRFYMVKESEWQATDWISMYLELVITTTDKSISikPEVLSK-LEIVKVAIETATKDEEPSNERLKA 294
Cdd:pfam04776   1 MPDWPSDDAFaNDSKRFYVVKESELQENDWIRLYLELALYTKWRSIS--DTDLSKlLEIKKVVVETKEEDEPPSERKLKA 78
                          90       100       110       120
                  ....*....|....*....|....*....|....*....|....*..
gi 42571627   295 YRAHVYITFKGLAEPRahervfeIGEHVERQAIVRRVM-GHRGDLTL 340
Cdd:pfam04776  79 KNAIFYITFKGLAKAR-------VGEHVDRKAIVRRTMdGKPGHLSL 118
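The paired rows in the alignment blocks can be compared column by column to estimate how many aligned positions are identical. The sketch below assumes the usual display convention that uppercase letters are aligned residues, lowercase letters are unaligned insertions, and dashes are gaps; the function name is hypothetical, and the input is only the first eleven columns of the pfam04776 alignment above.

```python
from typing import Tuple


def aligned_identity(row_a: str, row_b: str) -> Tuple[int, int]:
    """Return (identical, compared) over columns where both rows hold an aligned residue."""
    if len(row_a) != len(row_b):
        raise ValueError("alignment rows must have the same number of columns")
    identical = compared = 0
    for a, b in zip(row_a, row_b):
        # Skip gaps ('-') and lowercase (unaligned) residues in either row.
        if a.isupper() and b.isupper():
            compared += 1
            if a == b:
                identical += 1
    return identical, compared


# First eleven columns of the query and pfam04776 rows (query residues 217-226 plus one gap).
query = "LPDWPSDDVL-"
model = "MPDWPSDDAFa"
identical, compared = aligned_identity(query, model)
print(f"{identical}/{compared} identical")  # 7/10 identical
```

The same function can be applied to full rows once the label, start, and end columns have been stripped from each line.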
 
Blast search parameters
Data Source: Precalculated data, version = cdd.v.3.21
Preset Options:
  Database: CDSEARCH/cdd
  Low complexity filter: no
  Composition Based Adjustment: yes
  E-value threshold: 0.01

References:

  • Wang J et al. (2023), "The conserved domain database in 2023", Nucleic Acids Res. 51(D):384-8.
  • Lu S et al. (2020), "The conserved domain database in 2020", Nucleic Acids Res. 48(D):265-8.
  • Marchler-Bauer A et al. (2017), "CDD/SPARCLE: functional classification of proteins via subfamily domain architectures", Nucleic Acids Res. 45(D):200-3.