Conserved domains on  [gi|3128209|gb|AAC26689|]
unknown protein [Arabidopsis thaliana]

Protein Classification

DUF642 domain-containing protein (domain architecture ID 11477412)

DUF642 domain-containing protein contains a conserved CGP sequence motif

List of domain hits

Name           Accession  Description                           Interval  E-value
PLN03089       PLN03089   hypothetical protein; Provisional     24-389    0e+00

Pssm-ID: 215569 [Multi-domain]  Cd Length: 373  Bit Score: 634.69  E-value: 0e+00
                         10        20        30        40        50        60        70        80
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 3128209    24 VAAADSAGKTSPVEDGLVVNGDFETPPSNGFPDDAIIEDTSEIPSWRSDGTVELIKSGQKQGGMILIVPEGRHAVRLGND 103
Cdd:PLN03089  13 LLLCAAAASAAPVTDGLLPNGDFETPPKKSQMNGTVVIGKNAIPGWEISGFVEYISSGQKQGGMLLVVPEGAHAVRLGNE 92
                         90       100       110       120       130       140       150       160
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 3128209   104 AEISQELTVEKGSIYSVTFSAARTCAQLESLNVSVASSdepiaSQTIDLQTVYSVQGWDPYAWAFEAVVDRVRLVFKNPG 183
Cdd:PLN03089  93 ASISQTLTVTKGSYYSLTFSAARTCAQDESLNVSVPPE-----SGVLPLQTLYSSSGWDSYAWAFKAESDVVNLVFHNPG 167
                        170       180       190       200       210       220       230       240
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 3128209   184 MEDDPTCGPIIDDIAVKKLFTPDKPKGNAVINGDFEEGPWMFRNTTLGVLLPTNLDEEISSLPGWTVESNRAVRFIDSDH 263
Cdd:PLN03089 168 VEEDPACGPLIDAVAIKTLFPPRPTKDNLLKNGGFEEGPYVFPNSSWGVLLPPNIEDDTSPLPGWMIESLKAVKYIDSAH 247
                        250       260       270       280       290       300       310       320
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 3128209   264 FSVPEGKRALELLSGKEGIISQMVETKANIPYKMSFSLGHAGDKCKEPLAVMAFAGDQAQNFHYMAQANSSFERSELNFT 343
Cdd:PLN03089 248 FSVPEGKRAVELVSGKESAIAQVVRTVPGKSYNLSFTVGDANNGCHGSMMVEAFAGKDTQKVPYESQGKGGFKRASLRFK 327
                        330       340       350       360
                 ....*....|....*....|....*....|....*....|....*.
gi 3128209   344 AKAERTRIAFYSIYYNTRTDDMTSLCGPVIDDVKVWFSGSSRIGFS 389
Cdd:PLN03089 328 AVSNRTRITFYSSFYHTKSDDFGSLCGPVVDDVRVVPVRAPRAGKP 373
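The paired rows above can be compared column by column. As a minimal sketch, assuming that lowercase letters mark unaligned query residues and dashes mark gaps (the usual reading of this display, though not stated on the page), the following counts identical residues over the aligned columns only. The function name `aligned_identity` is hypothetical.

```python
from typing import Tuple


def aligned_identity(query: str, subject: str) -> Tuple[int, int]:
    """Return (identical, aligned) column counts for two alignment rows.

    Assumption: a column counts as aligned only when both rows carry an
    uppercase residue; lowercase letters and '-' are skipped.
    """
    if len(query) != len(subject):
        raise ValueError("alignment rows must have the same length")
    identical = 0
    aligned = 0
    for q, s in zip(query, subject):
        if q.isupper() and s.isupper():
            aligned += 1
            if q == s:
                identical += 1
    return identical, aligned


# A slice of the second alignment block, including the unaligned insert.
matches, columns = aligned_identity("SVASSdepiaSQTIDL", "SVPPE-----SGVLPL")
print(f"{matches} identical residues over {columns} aligned columns")
```

Concatenating all query rows and all subject rows before calling the function would give a figure for the whole hit. Percent identity is not reported on this page, so the result is only an illustration.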
 
DUF642         pfam04862  Protein of unknown function (DUF642)  39-200    3.45e-80

Protein of unknown function (DUF642); This family represents a duplicated conserved region found in a number of uncharacterized plant proteins, potentially in the stem. There is a conserved CGP sequence motif.


Pssm-ID: 398500 [Multi-domain]  Cd Length: 157  Bit Score: 243.31  E-value: 3.45e-80
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 3128209     39 GLVVNGDFETPPSNGFPDDAIIEDTSEIPSWRSDGTVELIKSGQKQGGMILIVPEGRHAVRLGNDAEISQELTVEKGSIY 118
Cdd:pfam04862   1 GLLPNGDFETGPDPSNMKGTVLAGPNAIPGWTVTGFVEYIKSGQKQGDMYLQVPEGAHAVRLGNDASISQTFSVTPGSTY 80
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 3128209    119 SVTFSAARTCAQLESLNVSVASSdepiaSQTIDLQTVYSVQGWDPYAWAFEAVVDRVRLVFKNPGMEDDPTCGPIIDDIA 198
Cdd:pfam04862  81 SLTFSAARTCAQDESLNVSVAPD-----SGVFPFQTLYSSSGWDSYAWAFKATGSVVTLVFHNPGVEEDPACGPLIDNVA 155

                  ..
gi 3128209    199 VK 200
Cdd:pfam04862 156 IK 157
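The DUF642 description mentions a duplicated conserved region and a conserved CGP motif. As a rough illustration, the sketch below scans two ungapped fragments of the query, copied from the PLN03089 alignment rows on this page, and reports the 1-based query position of each CGP occurrence. The helper `motif_positions` and the `FRAGMENTS` list are hypothetical names introduced for the example.

```python
import re
from typing import List

# (query start residue, ungapped query fragment) taken from the alignment rows.
FRAGMENTS = [
    (184, "MEDDPTCGPIIDDIAVKKLF"),
    (344, "AKAERTRIAFYSIYYNTRTDDMTSLCGPVIDDVKVWFSGSSRIGFS"),
]


def motif_positions(start: int, seq: str, motif: str = "CGP") -> List[int]:
    """Return 1-based query positions where `motif` begins in `seq`.

    `start` is the query coordinate of the first residue of `seq`.
    Only non-overlapping matches are reported.
    """
    return [start + m.start() for m in re.finditer(re.escape(motif), seq)]


for start, seq in FRAGMENTS:
    print(start, motif_positions(start, seq))
```

The first fragment lies inside the 39-200 interval of the DUF642 hit; the second lies near the C-terminus and is covered only by the PLN03089 hit.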
choice_anch_C  TIGR04362  choice-of-anchor C domain             40-199    3.03e-07

choice-of-anchor C domain; This family describes an extracellular bacterial domain that occurs on a number of proteins with PEP-CTERM (exosortase recognition site) sequences at the C-terminus, as well as some with an apparent alternate anchor sequence. Note that related pfam04862 (DUF642), as of release 26, is double the length of this model because it has two tandem regions homologous to this domain. pfam04862, in turn, belongs to a Pfam clan called the galactose-binding domain-like superfamily.


Pssm-ID: 275156 [Multi-domain]  Cd Length: 157  Bit Score: 49.67  E-value: 3.03e-07
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 3128209     40 LVVNGDFETPPSNGFPDDAIIEDTSEIPSWR-SDGTVELIKSGQKQGgmilivpEGRHAVRL-GNDA--EISQELTVEKG 115
Cdd:TIGR04362   2 LITNGSFESGSDPGNGFSTLSAGSSAITGWTvGSGSVDLINGYWQAS-------EGSRSIDLnGTTGpgGISQTFNTVAG 74
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 3128209    116 SIYSVTFSAARTCAQ---LESLNVSVASSDEPIASQTIDLQTvySVQGWDPYAWAFEAVVDRVRLVFKNpgMEDDPTCGP 192
Cdd:TIGR04362  75 QTYRVTFDLAGNPDGgpgLKDLTVSVGGASQDFSFDTTGKTT--ANMGWTTKSFDFTATSTSTTLSFTS--LDNGGAWGP 150

                  ....*..
gi 3128209    193 IIDDIAV 199
Cdd:TIGR04362 151 ALDNVSV 157
 
 
Blast search parameters
Data Source: Precalculated data, version = cdd.v.3.21
Preset Options:
  Database: CDSEARCH/cdd
  Low complexity filter: no
  Composition Based Adjustment: yes
  E-value threshold: 0.01
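
To make the threshold concrete, the sketch below stores the three hits listed on this page as records, keeps those at or below an E-value cutoff, and checks whether one hit interval lies inside another. The `Hit` class and both helper functions are hypothetical, and treating the cutoff as inclusive is an assumption rather than documented CD-Search behavior.

```python
from dataclasses import dataclass
from typing import List


@dataclass(frozen=True)
class Hit:
    name: str
    accession: str
    start: int      # first query residue covered by the hit
    end: int        # last query residue covered by the hit
    evalue: float


# Values copied from the list of domain hits on this page.
HITS = [
    Hit("PLN03089", "PLN03089", 24, 389, 0.0),
    Hit("DUF642", "pfam04862", 39, 200, 3.45e-80),
    Hit("choice_anch_C", "TIGR04362", 40, 199, 3.03e-07),
]


def passing(hits: List[Hit], threshold: float = 0.01) -> List[Hit]:
    """Keep hits whose E-value is at or below the threshold (assumed inclusive)."""
    return [h for h in hits if h.evalue <= threshold]


def contains(outer: Hit, inner: Hit) -> bool:
    """True when the interval of `inner` lies entirely within that of `outer`."""
    return outer.start <= inner.start and inner.end <= outer.end


for hit in passing(HITS):
    print(hit.accession, f"{hit.start}-{hit.end}", hit.evalue)
```

All three hits pass the 0.01 preset. On the intervals reported here, the TIGR04362 hit is nested inside the pfam04862 hit, which is in turn nested inside the PLN03089 hit.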
