Conserved domains on  [gi|1063726322|ref|NP_001329630|]

choice-of-anchor C domain protein, putative (Protein of unknown function, DUF642) [Arabidopsis thaliana]

Protein Classification

DUF642 domain-containing protein (domain architecture ID 11477412)

DUF642 domain-containing protein contains a conserved CGP sequence motif


List of domain hits

Name      Accession  Description                           Interval  E-value
PLN03089  PLN03089   hypothetical protein; Provisional     1-325     0e+00

Pssm-ID: 215569 [Multi-domain]  Cd Length: 373  Bit Score: 652.41  E-value: 0e+00
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1063726322   1 MKGTQVINITAIPNWELSGFVEYIPSGHKQGDMILVVPKGAFAVRLGNEASIKQKISVKKGSYYSITFSAARTCAQDERL 80
Cdd:PLN03089   44 MNGTVVIGKNAIPGWEISGFVEYISSGQKQGGMLLVVPEGAHAVRLGNEASISQTLTVTKGSYYSLTFSAARTCAQDESL 123
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1063726322  81 NVSVAPHHAVMPIQTVYSSSGWDLYSWAFKAQSDYADIVIHNPGVEEDPACGPLIDGVAMRALFPPRPTNKNILKNGGFE 160
Cdd:PLN03089  124 NVSVPPESGVLPLQTLYSSSGWDSYAWAFKAESDVVNLVFHNPGVEEDPACGPLIDAVAIKTLFPPRPTKDNLLKNGGFE 203
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1063726322 161 EGPWVLPNISSGVLIPPNSIDDHSPLPGWMVESLKAVKYIDSDHFSVPQGRRAVELVAGKESAVAQVVRTIPGKTYVLSF 240
Cdd:PLN03089  204 EGPYVFPNSSWGVLLPPNIEDDTSPLPGWMIESLKAVKYIDSAHFSVPEGKRAVELVSGKESAIAQVVRTVPGKSYNLSF 283
                         250       260       270       280       290       300       310       320
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1063726322 241 SVGDASNACAGSMIVEAFAGKDTIKVPYESKGKGGFKRSSLRFVAVSSRTRVMFYSTFYAMRNDDFSSLCGPVIDDVKLL 320
Cdd:PLN03089  284 TVGDANNGCHGSMMVEAFAGKDTQKVPYESQGKGGFKRASLRFKAVSNRTRITFYSSFYHTKSDDFGSLCGPVVDDVRVV 363

                  ....*
gi 1063726322 321 SARRP 325
Cdd:PLN03089  364 PVRAP 368
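
Percent identity between the query and the PLN03089 consensus can be estimated from an alignment like the one above by counting matching columns. The following is a minimal Python sketch, assuming the two aligned rows have been copied out as equal-length strings. The function name `count_identities` is hypothetical and is not part of any NCBI tool.

```python
def count_identities(query_row: str, subject_row: str) -> tuple[int, int]:
    """Return (identical columns, compared columns) for two aligned rows.

    Columns where either row has a gap ('-') are skipped, and the
    comparison is case-insensitive.
    """
    if len(query_row) != len(subject_row):
        raise ValueError("aligned rows must have equal length")
    identical = 0
    compared = 0
    for q, s in zip(query_row.upper(), subject_row.upper()):
        if q == "-" or s == "-":
            continue  # gap column: nothing to compare
        compared += 1
        if q == s:
            identical += 1
    return identical, compared


# Final alignment block above: query 321-325 against PLN03089 364-368.
identical, compared = count_identities("SARRP", "PVRAP")
print(f"{identical}/{compared} identical")  # prints "2/5 identical"
```

Summing the two counts over every block of an alignment gives the overall identity for that hit.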
 
DUF642    pfam04862  Protein of unknown function (DUF642)  1-141     9.55e-83

Protein of unknown function (DUF642); This family represents a duplicated conserved region found in a number of uncharacterized plant proteins, potentially in the stem. There is a conserved CGP sequence motif.

Pssm-ID: 398500 [Multi-domain]  Cd Length: 157  Bit Score: 247.17  E-value: 9.55e-83
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1063726322   1 MKGTQVINITAIPNWELSGFVEYIPSGHKQGDMILVVPKGAFAVRLGNEASIKQKISVKKGSYYSITFSAARTCAQDERL 80
Cdd:pfam04862  17 MKGTVLAGPNAIPGWTVTGFVEYIKSGQKQGDMYLQVPEGAHAVRLGNDASISQTFSVTPGSTYSLTFSAARTCAQDESL 96
                          90       100       110       120       130       140
                  ....*....|....*....|....*....|....*....|....*....|....*....|.
gi 1063726322  81 NVSVAPHHAVMPIQTVYSSSGWDLYSWAFKAQSDYADIVIHNPGVEEDPACGPLIDGVAMR 141
Cdd:pfam04862  97 NVSVAPDSGVFPFQTLYSSSGWDSYAWAFKATGSVVTLVFHNPGVEEDPACGPLIDNVAIK 157
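
The conserved CGP motif mentioned in the DUF642 description can be located directly in the query sequence. The following is a minimal Python sketch, assuming the 325-residue query has been assembled from the ungapped PLN03089 alignment rows on this page. The names `QUERY` and `find_motif` are hypothetical.

```python
# Query sequence NP_001329630, residues 1-325, joined from the alignment rows.
QUERY = (
    "MKGTQVINITAIPNWELSGFVEYIPSGHKQGDMILVVPKGAFAVRLGNEASIKQKISVKKGSYYSITFSAARTCAQDERL"
    "NVSVAPHHAVMPIQTVYSSSGWDLYSWAFKAQSDYADIVIHNPGVEEDPACGPLIDGVAMRALFPPRPTNKNILKNGGFE"
    "EGPWVLPNISSGVLIPPNSIDDHSPLPGWMVESLKAVKYIDSDHFSVPQGRRAVELVAGKESAVAQVVRTIPGKTYVLSF"
    "SVGDASNACAGSMIVEAFAGKDTIKVPYESKGKGGFKRSSLRFVAVSSRTRVMFYSTFYAMRNDDFSSLCGPVIDDVKLL"
    "SARRP"
)


def find_motif(sequence: str, motif: str) -> list[int]:
    """Return the 1-based start position of every occurrence of motif."""
    positions = []
    index = sequence.find(motif)
    while index != -1:
        positions.append(index + 1)  # convert 0-based index to residue number
        index = sequence.find(motif, index + 1)
    return positions


print(find_motif(QUERY, "CGP"))  # prints [131, 310]
```

The first occurrence lies inside the 1-141 pfam04862 interval; the second lies near the C-terminus, which fits the description of a duplicated conserved region.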
 
PLN03089  PLN03089   hypothetical protein; Provisional     9-148     2.36e-05

Pssm-ID: 215569 [Multi-domain]  Cd Length: 373  Bit Score: 45.72  E-value: 2.36e-05
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1063726322   9 ITAIPNW--ELSGFVEYIPSGHkqgdmiLVVPKGAFAVRL--GNEASIKQKISVKKGSYYSITFS---AARTC------- 74
Cdd:PLN03089  226 TSPLPGWmiESLKAVKYIDSAH------FSVPEGKRAVELvsGKESAIAQVVRTVPGKSYNLSFTvgdANNGChgsmmve 299
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1063726322  75 --AQDERLNVSVAPHhavmpiqtvySSSGWDLYSWAFKAQSDYADIV---------IHNPGVeedpACGPLIDGVAMRAL 143
Cdd:PLN03089  300 afAGKDTQKVPYESQ----------GKGGFKRASLRFKAVSNRTRITfyssfyhtkSDDFGS----LCGPVVDDVRVVPV 365

                  ....*
gi 1063726322 144 FPPRP 148
Cdd:PLN03089  366 RAPRA 370
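
Rows in a gapped alignment such as the one above can be sanity-checked by comparing the number of residues in a row with its start and end coordinates. The following is a minimal Python sketch, assuming each row has the form label, start, residues, end as displayed on this page. The pattern `ROW` and the helper names are hypothetical, not an NCBI parser.

```python
import re

# Label is either "gi <number>" or "Cdd:<accession>", followed by
# start coordinate, aligned residues (with '-' gaps), and end coordinate.
ROW = re.compile(r"^(gi \d+|Cdd:\S+)\s+(\d+)\s+(\S+)\s+(\d+)$")


def parse_row(line: str) -> tuple[str, int, str, int]:
    """Split one alignment row into (label, start, residues, end)."""
    match = ROW.match(line.strip())
    if match is None:
        raise ValueError(f"not an alignment row: {line!r}")
    label, start, residues, end = match.groups()
    return label, int(start), residues, int(end)


def is_consistent(line: str) -> bool:
    """Check that the ungapped residue count equals end - start + 1."""
    _, start, residues, end = parse_row(line)
    ungapped = residues.replace("-", "")
    return len(ungapped) == end - start + 1


print(parse_row("gi 1063726322 144 FPPRP 148"))
print(is_consistent("Cdd:PLN03089  366 RAPRA 370"))
```

A row that fails this check usually indicates that residues were lost or wrapped when the page was copied.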
 
Blast search parameters
Data Source: Precalculated data, version = cdd.v.3.21
Preset Options: Database: CDSEARCH/cdd; Low complexity filter: no; Composition Based Adjustment: yes; E-value threshold: 0.01
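
The effect of the E-value threshold can be illustrated with the three hits listed on this page. The following is a minimal Python sketch, assuming a hit is reported when its E-value is at or below the threshold (the exact comparison used by the search is not stated here). The `DomainHit` type and the `passing` helper are hypothetical.

```python
from typing import NamedTuple


class DomainHit(NamedTuple):
    name: str
    accession: str
    start: int
    end: int
    evalue: float


# Hits transcribed from the "List of domain hits" table on this page.
HITS = [
    DomainHit("PLN03089", "PLN03089", 1, 325, 0.0),
    DomainHit("DUF642", "pfam04862", 1, 141, 9.55e-83),
    DomainHit("PLN03089", "PLN03089", 9, 148, 2.36e-05),
]


def passing(hits: list[DomainHit], threshold: float = 0.01) -> list[DomainHit]:
    """Keep hits whose E-value is at or below the threshold (assumed inclusive)."""
    return [hit for hit in hits if hit.evalue <= threshold]


for hit in passing(HITS):
    print(f"{hit.accession}\t{hit.start}-{hit.end}\t{hit.evalue:.2e}")
```

All three hits pass the preset threshold of 0.01; a stricter cutoff such as 1e-10 would drop the weaker 9-148 PLN03089 hit.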
