NCBI Home Page NCBI Site Search page NCBI Guide that lists and describes the NCBI resources
Conserved domains on  [gi|334183024|ref|NP_174678|]
View 

Tetratricopeptide repeat (TPR)-like superfamily protein [Arabidopsis thaliana]

Protein Classification

pentatricopeptide repeat-containing protein( domain architecture ID 1000225)

pentatricopeptide repeat (PPR)-containing protein may form anti-parallel alpha helices and bind single-stranded RNA in a sequence-specific and modular manner

Gene Ontology:  GO:0003723|GO:0009451
PubMed:  24471833|19004664

Graphical summary

 Zoom to residue level

show extra options »

Show site features     Horizontal zoom: ×

List of domain hits

Name Accession Description Interval E-value
PLN03081 super family cl33631
pentatricopeptide (PPR) repeat-containing protein; Provisional
2-581 3.63e-121

pentatricopeptide (PPR) repeat-containing protein; Provisional


The actual alignment was detected with superfamily member PLN03081:

Pssm-ID: 215563 [Multi-domain]  Cd Length: 697  Bit Score: 373.44  E-value: 3.63e-121
                         10        20        30        40        50        60        70        80
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 334183024   2 ARVYmETMIQKCVSFSQI---KQLQSHFLTAGHFQSSFLRSRLLE---RCaispfGDLSFAVQIFRYIPKPLTNDWNAII 75
Cdd:PLN03081 123 ASTY-DALVEACIALKSIrcvKAVYWHVESSGFEPDQYMMNRVLLmhvKC-----GMLIDARRLFDEMPERNLASWGTII 196
                         90       100       110       120       130       140       150       160
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 334183024  76 RGFAGSSHPSLAFSWYRSMLQQSSsssaicrvDALTCSFT--LKACARALCSSAMDQLHCQINRRGLSADSLLCTTLLDA 153
Cdd:PLN03081 197 GGLVDAGNYREAFALFREMWEDGS--------DAEPRTFVvmLRASAGLGSARAGQQLHCCVLKTGVVGDTFVSCALIDM 268
                        170       180       190       200       210       220       230       240
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 334183024 154 YSKNGDLISAYKLFDEMPVRDVASWNALIAGLVSGNRASEAMELYKRMETEGIRRSEVTVVAALGACSHLGDVKEGEN-- 231
Cdd:PLN03081 269 YSKCGDIEDARCVFDGMPEKTTVAWNSMLAGYALHGYSEEALCLYYEMRDSGVSIDQFTFSIMIRIFSRLALLEHAKQah 348
                        250       260       270       280       290       300       310       320
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 334183024 232 ---IFHGYSNDnvIVSNAA-IDMYSKCGFVDKAYQVFEQFTgKKSVVTWNTMITGFAVHGEAHRALEIFDKLEDNGIKPD 307
Cdd:PLN03081 349 aglIRTGFPLD--IVANTAlVDLYSKWGRMEDARNVFDRMP-RKNLISWNALIAGYGNHGRGTKAVEMFERMIAEGVAPN 425
                        330       340       350       360       370       380       390       400
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 334183024 308 DVSYLAALTACRHAGLVEYGLSVFNNMA-CKGVERNMKHYGCVVDLLSRAGRLREAHDIICSMSMIPDPVLWQSLLGASE 386
Cdd:PLN03081 426 HVTFLAVLSACRYSGLSEQGWEIFQSMSeNHRIKPRAMHYACMIELLGREGLLDEAYAMIRRAPFKPTVNMWAALLTACR 505
                        410       420       430       440       450       460       470       480
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 334183024 387 IYSDVEMAEIASREIKEMGVNNDGDFVLLSNVYAAQGRWKDVGRVRDDMESKQVKKIPGLSYIEAKGTIHEFYNSDKSHE 466
Cdd:PLN03081 506 IHKNLELGRLAAEKLYGMGPEKLNNYVVLLNLYNSSGRQAEAAKVVETLKRKGLSMHPACTWIEVKKQDHSFFSGDRLHP 585
                        490       500       510       520       530       540       550       560
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 334183024 467 QWREIYEKIDEIRFKIREDGYVAQTGLVLHDIGEEEKENALCYHSEKLAVAYGLMmmdGADEESPVRVINNLRICGDCHV 546
Cdd:PLN03081 586 QSREIYQKLDELMKEISEYGYVAEENELLPDVDEDEEKVSGRYHSEKLAIAFGLI---NTSEWTPLQITQSHRICKDCHK 662
                        570       580       590
                 ....*....|....*....|....*....|....*
gi 334183024 547 VFKHISKIYKREIIVRDRVRFHRFKDGSCSCRDFW 581
Cdd:PLN03081 663 VIKFIALVTKREIVVRDASRFHHFKLGKCSCGDYW 697
 
Name Accession Description Interval E-value
PLN03081 PLN03081
pentatricopeptide (PPR) repeat-containing protein; Provisional
2-581 3.63e-121

pentatricopeptide (PPR) repeat-containing protein; Provisional


Pssm-ID: 215563 [Multi-domain]  Cd Length: 697  Bit Score: 373.44  E-value: 3.63e-121
                         10        20        30        40        50        60        70        80
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 334183024   2 ARVYmETMIQKCVSFSQI---KQLQSHFLTAGHFQSSFLRSRLLE---RCaispfGDLSFAVQIFRYIPKPLTNDWNAII 75
Cdd:PLN03081 123 ASTY-DALVEACIALKSIrcvKAVYWHVESSGFEPDQYMMNRVLLmhvKC-----GMLIDARRLFDEMPERNLASWGTII 196
                         90       100       110       120       130       140       150       160
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 334183024  76 RGFAGSSHPSLAFSWYRSMLQQSSsssaicrvDALTCSFT--LKACARALCSSAMDQLHCQINRRGLSADSLLCTTLLDA 153
Cdd:PLN03081 197 GGLVDAGNYREAFALFREMWEDGS--------DAEPRTFVvmLRASAGLGSARAGQQLHCCVLKTGVVGDTFVSCALIDM 268
                        170       180       190       200       210       220       230       240
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 334183024 154 YSKNGDLISAYKLFDEMPVRDVASWNALIAGLVSGNRASEAMELYKRMETEGIRRSEVTVVAALGACSHLGDVKEGEN-- 231
Cdd:PLN03081 269 YSKCGDIEDARCVFDGMPEKTTVAWNSMLAGYALHGYSEEALCLYYEMRDSGVSIDQFTFSIMIRIFSRLALLEHAKQah 348
                        250       260       270       280       290       300       310       320
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 334183024 232 ---IFHGYSNDnvIVSNAA-IDMYSKCGFVDKAYQVFEQFTgKKSVVTWNTMITGFAVHGEAHRALEIFDKLEDNGIKPD 307
Cdd:PLN03081 349 aglIRTGFPLD--IVANTAlVDLYSKWGRMEDARNVFDRMP-RKNLISWNALIAGYGNHGRGTKAVEMFERMIAEGVAPN 425
                        330       340       350       360       370       380       390       400
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 334183024 308 DVSYLAALTACRHAGLVEYGLSVFNNMA-CKGVERNMKHYGCVVDLLSRAGRLREAHDIICSMSMIPDPVLWQSLLGASE 386
Cdd:PLN03081 426 HVTFLAVLSACRYSGLSEQGWEIFQSMSeNHRIKPRAMHYACMIELLGREGLLDEAYAMIRRAPFKPTVNMWAALLTACR 505
                        410       420       430       440       450       460       470       480
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 334183024 387 IYSDVEMAEIASREIKEMGVNNDGDFVLLSNVYAAQGRWKDVGRVRDDMESKQVKKIPGLSYIEAKGTIHEFYNSDKSHE 466
Cdd:PLN03081 506 IHKNLELGRLAAEKLYGMGPEKLNNYVVLLNLYNSSGRQAEAAKVVETLKRKGLSMHPACTWIEVKKQDHSFFSGDRLHP 585
                        490       500       510       520       530       540       550       560
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 334183024 467 QWREIYEKIDEIRFKIREDGYVAQTGLVLHDIGEEEKENALCYHSEKLAVAYGLMmmdGADEESPVRVINNLRICGDCHV 546
Cdd:PLN03081 586 QSREIYQKLDELMKEISEYGYVAEENELLPDVDEDEEKVSGRYHSEKLAIAFGLI---NTSEWTPLQITQSHRICKDCHK 662
                        570       580       590
                 ....*....|....*....|....*....|....*
gi 334183024 547 VFKHISKIYKREIIVRDRVRFHRFKDGSCSCRDFW 581
Cdd:PLN03081 663 VIKFIALVTKREIVVRDASRFHHFKLGKCSCGDYW 697
DYW_deaminase pfam14432
DYW family of nucleic acid deaminases; This is the DYW domain found in nucleic acid deaminases ...
486-581 4.70e-40

DYW family of nucleic acid deaminases; This is the DYW domain found in nucleic acid deaminases prototyped by the plant PPR DYW proteins that are implicated in chloroplast and mitochondrial RNA transcript maturation by numerous C to U editing events. The name derives from the DYW motif present at the C-terminus of the classical plant PPR DYW deaminases. Members containing this domain are present in bacteria, plants, Naegleria, and fungi. Plants and Naegleria show lineage-specific expansions of this family. This domain contains a characteriztic zinc-binding motif (CXXC, HXE) which has been shown to bind zinc ions. This domain is often fused to PPR repeats. Ascomycete versions, which are independent lateral transfers, contain a large insert within the domain and are often fused to ankyrin repeats. Bacterial versions are predicted to function as toxins in polymorphic toxin systems.


Pssm-ID: 464172 [Multi-domain]  Cd Length: 93  Bit Score: 140.64  E-value: 4.70e-40
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 334183024  486 GYVAQTGLVLHDIGEEEKENALCYHSEKLAVAYGLMMM-DGAdeesPVRVINNLRICGDCHVVFKHISKIYKREIIVRDR 564
Cdd:pfam14432   1 GYVPDLRFVLHDVDDEEKKQLLCGHSEKLALAYGLLTTpDGA----TIRITKNLRVCGDCHTAFKYISKIRGREIVVRDA 76
                          90
                  ....*....|....*..
gi 334183024  565 VRFHRFKDGSCSCRDFW 581
Cdd:pfam14432  77 SRFHHFKNGLCSCGDFW 93
PPR TIGR00756
pentatricopeptide repeat domain (PPR motif); This model describes a domain called the PPR ...
274-308 1.45e-05

pentatricopeptide repeat domain (PPR motif); This model describes a domain called the PPR motif, or pentatricopeptide repeat. Its consensus sequence is 35 positions long and typically is found in four or more tandem copies. This family is strongly represented in plant proteins, particularly those sorted to chloroplasts or mitochondria. The pfam01535, domain of unknown function DUF17, consists of 6 copies of this repeat. This family has a similar consensus to the TPR domain (tetratricopeptide), pfam00515, a 33-residue repeat. It is predicted to form a pair of antiparallel helices similar to that of TPR.


Pssm-ID: 273253 [Multi-domain]  Cd Length: 35  Bit Score: 42.06  E-value: 1.45e-05
                          10        20        30
                  ....*....|....*....|....*....|....*
gi 334183024  274 VTWNTMITGFAVHGEAHRALEIFDKLEDNGIKPDD 308
Cdd:TIGR00756   1 VTYNTLIDGLCKAGRVEEALELFKEMKERGIEPDV 35
 
Name Accession Description Interval E-value
PLN03081 PLN03081
pentatricopeptide (PPR) repeat-containing protein; Provisional
2-581 3.63e-121

pentatricopeptide (PPR) repeat-containing protein; Provisional


Pssm-ID: 215563 [Multi-domain]  Cd Length: 697  Bit Score: 373.44  E-value: 3.63e-121
                         10        20        30        40        50        60        70        80
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 334183024   2 ARVYmETMIQKCVSFSQI---KQLQSHFLTAGHFQSSFLRSRLLE---RCaispfGDLSFAVQIFRYIPKPLTNDWNAII 75
Cdd:PLN03081 123 ASTY-DALVEACIALKSIrcvKAVYWHVESSGFEPDQYMMNRVLLmhvKC-----GMLIDARRLFDEMPERNLASWGTII 196
                         90       100       110       120       130       140       150       160
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 334183024  76 RGFAGSSHPSLAFSWYRSMLQQSSsssaicrvDALTCSFT--LKACARALCSSAMDQLHCQINRRGLSADSLLCTTLLDA 153
Cdd:PLN03081 197 GGLVDAGNYREAFALFREMWEDGS--------DAEPRTFVvmLRASAGLGSARAGQQLHCCVLKTGVVGDTFVSCALIDM 268
                        170       180       190       200       210       220       230       240
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 334183024 154 YSKNGDLISAYKLFDEMPVRDVASWNALIAGLVSGNRASEAMELYKRMETEGIRRSEVTVVAALGACSHLGDVKEGEN-- 231
Cdd:PLN03081 269 YSKCGDIEDARCVFDGMPEKTTVAWNSMLAGYALHGYSEEALCLYYEMRDSGVSIDQFTFSIMIRIFSRLALLEHAKQah 348
                        250       260       270       280       290       300       310       320
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 334183024 232 ---IFHGYSNDnvIVSNAA-IDMYSKCGFVDKAYQVFEQFTgKKSVVTWNTMITGFAVHGEAHRALEIFDKLEDNGIKPD 307
Cdd:PLN03081 349 aglIRTGFPLD--IVANTAlVDLYSKWGRMEDARNVFDRMP-RKNLISWNALIAGYGNHGRGTKAVEMFERMIAEGVAPN 425
                        330       340       350       360       370       380       390       400
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 334183024 308 DVSYLAALTACRHAGLVEYGLSVFNNMA-CKGVERNMKHYGCVVDLLSRAGRLREAHDIICSMSMIPDPVLWQSLLGASE 386
Cdd:PLN03081 426 HVTFLAVLSACRYSGLSEQGWEIFQSMSeNHRIKPRAMHYACMIELLGREGLLDEAYAMIRRAPFKPTVNMWAALLTACR 505
                        410       420       430       440       450       460       470       480
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 334183024 387 IYSDVEMAEIASREIKEMGVNNDGDFVLLSNVYAAQGRWKDVGRVRDDMESKQVKKIPGLSYIEAKGTIHEFYNSDKSHE 466
Cdd:PLN03081 506 IHKNLELGRLAAEKLYGMGPEKLNNYVVLLNLYNSSGRQAEAAKVVETLKRKGLSMHPACTWIEVKKQDHSFFSGDRLHP 585
                        490       500       510       520       530       540       550       560
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 334183024 467 QWREIYEKIDEIRFKIREDGYVAQTGLVLHDIGEEEKENALCYHSEKLAVAYGLMmmdGADEESPVRVINNLRICGDCHV 546
Cdd:PLN03081 586 QSREIYQKLDELMKEISEYGYVAEENELLPDVDEDEEKVSGRYHSEKLAIAFGLI---NTSEWTPLQITQSHRICKDCHK 662
                        570       580       590
                 ....*....|....*....|....*....|....*
gi 334183024 547 VFKHISKIYKREIIVRDRVRFHRFKDGSCSCRDFW 581
Cdd:PLN03081 663 VIKFIALVTKREIVVRDASRFHHFKLGKCSCGDYW 697
PLN03077 PLN03077
Protein ECB2; Provisional
71-579 8.66e-112

Protein ECB2; Provisional


Pssm-ID: 215561 [Multi-domain]  Cd Length: 857  Bit Score: 353.39  E-value: 8.66e-112
                         10        20        30        40        50        60        70        80
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 334183024  71 WNAIIRGFAGSSHPSLAFSWYRSMLQQSSSSsaicrvDALTCSFTLKACAralCSSAMD---QLHCQINRRGLSADSLLC 147
Cdd:PLN03077 357 WTAMISGYEKNGLPDKALETYALMEQDNVSP------DEITIASVLSACA---CLGDLDvgvKLHELAERKGLISYVVVA 427
                         90       100       110       120       130       140       150       160
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 334183024 148 TTLLDAYSKNGDLISAYKLFDEMPVRDVASWNALIAGLVSGNRASEAMELYKRMETEgIRRSEVTVVAALGACSHLGDVK 227
Cdd:PLN03077 428 NALIEMYSKCKCIDKALEVFHNIPEKDVISWTSIIAGLRLNNRCFEALIFFRQMLLT-LKPNSVTLIAALSACARIGALM 506
                        170       180       190       200       210       220       230       240
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 334183024 228 EGENI----------FHGYsndnviVSNAAIDMYSKCGFVDKAYQVFEqfTGKKSVVTWNTMITGFAVHGEAHRALEIFD 297
Cdd:PLN03077 507 CGKEIhahvlrtgigFDGF------LPNALLDLYVRCGRMNYAWNQFN--SHEKDVVSWNILLTGYVAHGKGSMAVELFN 578
                        250       260       270       280       290       300       310       320
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 334183024 298 KLEDNGIKPDDVSYLAALTACRHAGLVEYGLSVFNNMACK-GVERNMKHYGCVVDLLSRAGRLREAHDIICSMSMIPDPV 376
Cdd:PLN03077 579 RMVESGVNPDEVTFISLLCACSRSGMVTQGLEYFHSMEEKySITPNLKHYACVVDLLGRAGKLTEAYNFINKMPITPDPA 658
                        330       340       350       360       370       380       390       400
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 334183024 377 LWQSLLGASEIYSDVEMAEIASREIKEMGVNNDGDFVLLSNVYAAQGRWKDVGRVRDDMESKQVKKIPGLSYIEAKGTIH 456
Cdd:PLN03077 659 VWGALLNACRIHRHVELGELAAQHIFELDPNSVGYYILLCNLYADAGKWDEVARVRKTMRENGLTVDPGCSWVEVKGKVH 738
                        410       420       430       440       450       460       470       480
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 334183024 457 EFYNSDKSHEQWREIYEKIDEIRFKIREDGYVAQTGLVLHDIgEEEKENALCYHSEKLAVAYGLMmmdGADEESPVRVIN 536
Cdd:PLN03077 739 AFLTDDESHPQIKEINTVLEGFYEKMKASGLAGSESSSMDEI-EVSKDDIFCGHSERLAIAFGLI---NTVPGMPIWVTK 814
                        490       500       510       520
                 ....*....|....*....|....*....|....*....|...
gi 334183024 537 NLRICGDCHVVFKHISKIYKREIIVRDRVRFHRFKDGSCSCRD 579
Cdd:PLN03077 815 NLYMCENCHNTVKFISKIVRREISVRDTEQFHHFKDGECSCGD 857
DYW_deaminase pfam14432
DYW family of nucleic acid deaminases; This is the DYW domain found in nucleic acid deaminases ...
486-581 4.70e-40

DYW family of nucleic acid deaminases; This is the DYW domain found in nucleic acid deaminases prototyped by the plant PPR DYW proteins that are implicated in chloroplast and mitochondrial RNA transcript maturation by numerous C to U editing events. The name derives from the DYW motif present at the C-terminus of the classical plant PPR DYW deaminases. Members containing this domain are present in bacteria, plants, Naegleria, and fungi. Plants and Naegleria show lineage-specific expansions of this family. This domain contains a characteriztic zinc-binding motif (CXXC, HXE) which has been shown to bind zinc ions. This domain is often fused to PPR repeats. Ascomycete versions, which are independent lateral transfers, contain a large insert within the domain and are often fused to ankyrin repeats. Bacterial versions are predicted to function as toxins in polymorphic toxin systems.


Pssm-ID: 464172 [Multi-domain]  Cd Length: 93  Bit Score: 140.64  E-value: 4.70e-40
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 334183024  486 GYVAQTGLVLHDIGEEEKENALCYHSEKLAVAYGLMMM-DGAdeesPVRVINNLRICGDCHVVFKHISKIYKREIIVRDR 564
Cdd:pfam14432   1 GYVPDLRFVLHDVDDEEKKQLLCGHSEKLALAYGLLTTpDGA----TIRITKNLRVCGDCHTAFKYISKIRGREIVVRDA 76
                          90
                  ....*....|....*..
gi 334183024  565 VRFHRFKDGSCSCRDFW 581
Cdd:pfam14432  77 SRFHHFKNGLCSCGDFW 93
PLN03077 PLN03077
Protein ECB2; Provisional
116-386 1.78e-25

Protein ECB2; Provisional


Pssm-ID: 215561 [Multi-domain]  Cd Length: 857  Bit Score: 111.48  E-value: 1.78e-25
                         10        20        30        40        50        60        70        80
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 334183024 116 LKACARALCSsamdqlhcqINRRGLSadslLCTTLLDAYSKNGDLISAYKLFDEMPVRDVASWNALIAGLVSGNRASEAM 195
Cdd:PLN03077 106 SRVCSRALSS---------HPSLGVR----LGNAMLSMFVRFGELVHAWYVFGKMPERDLFSWNVLVGGYAKAGYFDEAL 172
                         90       100       110       120       130       140       150       160
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 334183024 196 ELYKRMETEGIRRSEVTVVAALGACSHLGDVKEGENIF-----HGYSNDnVIVSNAAIDMYSKCGFVDKAYQVFEQFTgK 270
Cdd:PLN03077 173 CLYHRMLWAGVRPDVYTFPCVLRTCGGIPDLARGREVHahvvrFGFELD-VDVVNALITMYVKCGDVVSARLVFDRMP-R 250
                        170       180       190       200       210       220       230       240
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 334183024 271 KSVVTWNTMITGFAVHGEAHRALEIFDKLEDNGIKPDDVSYLAALTACRHAGLVEYGLSVFNNMACKGVERNMKHYGCVV 350
Cdd:PLN03077 251 RDCISWNAMISGYFENGECLEGLELFFTMRELSVDPDLMTITSVISACELLGDERLGREMHGYVVKTGFAVDVSVCNSLI 330
                        250       260       270
                 ....*....|....*....|....*....|....*.
gi 334183024 351 DLLSRAGRLREAHDIICSMSMiPDPVLWQSLLGASE 386
Cdd:PLN03077 331 QMYLSLGSWGEAEKVFSRMET-KDAVSWTAMISGYE 365
E_motif pfam20431
E motif; This entry represents the E motif found in plant pentatricopeptide repeat (PPR) ...
388-450 5.19e-20

E motif; This entry represents the E motif found in plant pentatricopeptide repeat (PPR) proteins which contain a DYW deaminase domain. The DYW domain is required for RNA editing, a process that deaminates specific cytidines to uridines. This motif, together with the E+ motif, precedes the DYW domain and, although their role is not clear, they are essential in the RNA editing reaction. The E/E+ motifs may contain two degenerate PPR motifs that could be involved in RNA or protein binding.


Pssm-ID: 466580 [Multi-domain]  Cd Length: 63  Bit Score: 83.75  E-value: 5.19e-20
                          10        20        30        40        50        60
                  ....*....|....*....|....*....|....*....|....*....|....*....|...
gi 334183024  388 YSDVEMAEIASREIKEMGVNNDGDFVLLSNVYAAQGRWKDVGRVRDDMESKQVKKIPGLSYIE 450
Cdd:pfam20431   1 YSNVELAEKAANILLELEKTNDGNYTLLSNIYAYAGRWKDVERIRKLMKSSGIKKRPGCSWIE 63
PLN03218 PLN03218
maturation of RBCL 1; Provisional
122-407 1.52e-18

maturation of RBCL 1; Provisional


Pssm-ID: 215636 [Multi-domain]  Cd Length: 1060  Bit Score: 89.94  E-value: 1.52e-18
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 334183024  122 ALCSSAMD-----QLHCQINRRGLSADSLLCTTLLDAYSKNGDLISAYKLFDEMPVR----DVASWNALIAGLVSGNRAS 192
Cdd:PLN03218  445 SVCASSQDidgalRVLRLVQEAGLKADCKLYTTLISTCAKSGKVDAMFEVFHEMVNAgveaNVHTFGALIDGCARAGQVA 524
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 334183024  193 EAMELYkrmeteGIRRSEvtvvaalgacshlgDVKEGENIFhgysndnvivsNAAIDMYSKCGFVDKAYQVFEQFTG--- 269
Cdd:PLN03218  525 KAFGAY------GIMRSK--------------NVKPDRVVF-----------NALISACGQSGAVDRAFDVLAEMKAeth 573
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 334183024  270 --KKSVVTWNTMITGFAVHGEAHRALEIFDKLEDNGIKPDDVSYLAALTACRHAGLVEYGLSVFNNMACKGVERNMKHYG 347
Cdd:PLN03218  574 piDPDHITVGALMKACANAGQVDRAKEVYQMIHEYNIKGTPEVYTIAVNSCSQKGDWDFALSIYDDMKKKGVKPDEVFFS 653
                         250       260       270       280       290       300
                  ....*....|....*....|....*....|....*....|....*....|....*....|...
gi 334183024  348 CVVDLLSRAGRLREAHDIICSM---SMIPDPVLWQSLLGASEIYSDVEMAEIASREIKEMGVN 407
Cdd:PLN03218  654 ALVDVAGHAGDLDKAFEILQDArkqGIKLGTVSYSSLMGACSNAKNWKKALELYEDIKSIKLR 716
PLN03218 PLN03218
maturation of RBCL 1; Provisional
108-355 1.86e-13

maturation of RBCL 1; Provisional


Pssm-ID: 215636 [Multi-domain]  Cd Length: 1060  Bit Score: 73.76  E-value: 1.86e-13
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 334183024  108 DALTCSFTLKACARALCSSAMDQLHCQINRRGLSADSLLCTTLLDAYSKNGDLISAYKLFDEMPVRDVAS----WNALIA 183
Cdd:PLN03218  578 DHITVGALMKACANAGQVDRAKEVYQMIHEYNIKGTPEVYTIAVNSCSQKGDWDFALSIYDDMKKKGVKPdevfFSALVD 657
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 334183024  184 GLVSGNRASEAMELYKRMETEGIRRSEVTVVAALGACSHLGDVKegenifhgysndnvivsnaaidmyskcgfvdKAYQV 263
Cdd:PLN03218  658 VAGHAGDLDKAFEILQDARKQGIKLGTVSYSSLMGACSNAKNWK-------------------------------KALEL 706
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 334183024  264 FEQFTGKK---SVVTWNTMITGFAVHGEAHRALEIFDKLEDNGIKPDDVSYLAALTACRHAGLVEYGLSVFNNMACKGVE 340
Cdd:PLN03218  707 YEDIKSIKlrpTVSTMNALITALCEGNQLPKALEVLSEMKRLGLCPNTITYSILLVASERKDDADVGLDLLSQAKEDGIK 786
                         250
                  ....*....|....*
gi 334183024  341 RNMKHYGCVVDLLSR 355
Cdd:PLN03218  787 PNLVMCRCITGLCLR 801
PPR_2 pfam13041
PPR repeat family; This repeat has no known function. It is about 35 amino acids long and is ...
271-318 8.39e-09

PPR repeat family; This repeat has no known function. It is about 35 amino acids long and is found in up to 18 copies in some proteins. The family appears to be greatly expanded in plants and fungi. The repeat has been called PPR.


Pssm-ID: 463778 [Multi-domain]  Cd Length: 50  Bit Score: 51.60  E-value: 8.39e-09
                          10        20        30        40
                  ....*....|....*....|....*....|....*....|....*...
gi 334183024  271 KSVVTWNTMITGFAVHGEAHRALEIFDKLEDNGIKPDDVSYLAALTAC 318
Cdd:pfam13041   1 PDVVTYNTLINGYCKKGKVEEAFKLFNEMKKRGVKPNVYTYTILINGL 48
PLN03077 PLN03077
Protein ECB2; Provisional
179-440 4.89e-08

Protein ECB2; Provisional


Pssm-ID: 215561 [Multi-domain]  Cd Length: 857  Bit Score: 56.01  E-value: 4.89e-08
                         10        20        30        40        50        60        70        80
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 334183024 179 NALIAGLVSGNRASEAMELYKRMETEGIRRSEVTVVAALGACSHLGDVKEGENIfHGYSNDN-----VIVSNAAIDMYSK 253
Cdd:PLN03077  55 NSQLRALCSHGQLEQALKLLESMQELRVPVDEDAYVALFRLCEWKRAVEEGSRV-CSRALSShpslgVRLGNAMLSMFVR 133
                         90       100       110       120       130       140       150       160
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 334183024 254 CGFVDKAYQVFEQFtGKKSVVTWNTMITGFAVHGEAHRALEIFDKLEDNGIKPDDVSYLAALTACRHAGLVEYGLSVFNN 333
Cdd:PLN03077 134 FGELVHAWYVFGKM-PERDLFSWNVLVGGYAKAGYFDEALCLYHRMLWAGVRPDVYTFPCVLRTCGGIPDLARGREVHAH 212
                        170       180       190       200       210       220       230       240
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 334183024 334 MACKGVERNMKHYGCVVDLLSRAGRLREAH---------DIICSMSMI-------------------------PDPVLWQ 379
Cdd:PLN03077 213 VVRFGFELDVDVVNALITMYVKCGDVVSARlvfdrmprrDCISWNAMIsgyfengecleglelfftmrelsvdPDLMTIT 292
                        250       260       270       280       290       300
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*..
gi 334183024 380 SLLGASEIYSDVEMA-EIASREIK-----EMGVNNDgdfvlLSNVYAAQGRWKDVGRVRDDMESKQV 440
Cdd:PLN03077 293 SVISACELLGDERLGrEMHGYVVKtgfavDVSVCNS-----LIQMYLSLGSWGEAEKVFSRMETKDA 354
PPR_3 pfam13812
Pentatricopeptide repeat domain; This family matches additional variants of the PPR repeat ...
270-318 4.16e-06

Pentatricopeptide repeat domain; This family matches additional variants of the PPR repeat that were not captured by the model for pfam01535. In the case of the Arabidopsis protein UniProtKB:Q66GI4, the repeated helices in this N-terminal region, of protein-only RNase P (PRORP) enzymes, form the pentatricopeptide repeat (PPR) domain which enhances pre-tRNA binding affinity. PROPRP enzymes process precursor tRNAs in human mitochondria and in all tRNA-using compartments of Arabidopsis thaliana.


Pssm-ID: 316342 [Multi-domain]  Cd Length: 63  Bit Score: 44.27  E-value: 4.16e-06
                          10        20        30        40
                  ....*....|....*....|....*....|....*....|....*....
gi 334183024  270 KKSVVTWNTMITGFAVHGEAHRALEIFDKLEDNGIKPDDVSYLAALTAC 318
Cdd:pfam13812  12 QLNVNTYTHLLHAYANVGNLKLALEIFERMKKKGIKPTLDTYNAILGVI 60
PPR TIGR00756
pentatricopeptide repeat domain (PPR motif); This model describes a domain called the PPR ...
274-308 1.45e-05

pentatricopeptide repeat domain (PPR motif); This model describes a domain called the PPR motif, or pentatricopeptide repeat. Its consensus sequence is 35 positions long and typically is found in four or more tandem copies. This family is strongly represented in plant proteins, particularly those sorted to chloroplasts or mitochondria. The pfam01535, domain of unknown function DUF17, consists of 6 copies of this repeat. This family has a similar consensus to the TPR domain (tetratricopeptide), pfam00515, a 33-residue repeat. It is predicted to form a pair of antiparallel helices similar to that of TPR.


Pssm-ID: 273253 [Multi-domain]  Cd Length: 35  Bit Score: 42.06  E-value: 1.45e-05
                          10        20        30
                  ....*....|....*....|....*....|....*
gi 334183024  274 VTWNTMITGFAVHGEAHRALEIFDKLEDNGIKPDD 308
Cdd:TIGR00756   1 VTYNTLIDGLCKAGRVEEALELFKEMKERGIEPDV 35
PPR pfam01535
PPR repeat; This repeat has no known function. It is about 35 amino acids long and found in up ...
274-304 2.97e-05

PPR repeat; This repeat has no known function. It is about 35 amino acids long and found in up to 18 copies in some proteins. This family appears to be greatly expanded in plants. This repeat occurs in PET309 that may be involved in RNA stabilization. This domain occurs in crp1 that is involved in RNA processing. This repeat is associated with a predicted plant protein Swiss:O49549 that has a domain organization similar to the human BRCA1 protein. The repeat has been called PPR.


Pssm-ID: 366695 [Multi-domain]  Cd Length: 31  Bit Score: 40.91  E-value: 2.97e-05
                          10        20        30
                  ....*....|....*....|....*....|.
gi 334183024  274 VTWNTMITGFAVHGEAHRALEIFDKLEDNGI 304
Cdd:pfam01535   1 VTYNSLISGYCKNGKLEEALELFKEMKEKGI 31
PPR_2 pfam13041
PPR repeat family; This repeat has no known function. It is about 35 amino acids long and is ...
143-185 6.57e-05

PPR repeat family; This repeat has no known function. It is about 35 amino acids long and is found in up to 18 copies in some proteins. The family appears to be greatly expanded in plants and fungi. The repeat has been called PPR.


Pssm-ID: 463778 [Multi-domain]  Cd Length: 50  Bit Score: 40.42  E-value: 6.57e-05
                          10        20        30        40
                  ....*....|....*....|....*....|....*....|....*..
gi 334183024  143 DSLLCTTLLDAYSKNGDLISAYKLFDEMPVR----DVASWNALIAGL 185
Cdd:pfam13041   2 DVVTYNTLINGYCKKGKVEEAFKLFNEMKKRgvkpNVYTYTILINGL 48
PPR_2 pfam13041
PPR repeat family; This repeat has no known function. It is about 35 amino acids long and is ...
173-212 1.12e-04

PPR repeat family; This repeat has no known function. It is about 35 amino acids long and is found in up to 18 copies in some proteins. The family appears to be greatly expanded in plants and fungi. The repeat has been called PPR.


Pssm-ID: 463778 [Multi-domain]  Cd Length: 50  Bit Score: 40.04  E-value: 1.12e-04
                          10        20        30        40
                  ....*....|....*....|....*....|....*....|
gi 334183024  173 RDVASWNALIAGLVSGNRASEAMELYKRMETEGIRRSEVT 212
Cdd:pfam13041   1 PDVVTYNTLINGYCKKGKVEEAFKLFNEMKKRGVKPNVYT 40
PPR pfam01535
PPR repeat; This repeat has no known function. It is about 35 amino acids long and found in up ...
177-206 5.82e-04

PPR repeat; This repeat has no known function. It is about 35 amino acids long and found in up to 18 copies in some proteins. This family appears to be greatly expanded in plants. This repeat occurs in PET309 that may be involved in RNA stabilization. This domain occurs in crp1 that is involved in RNA processing. This repeat is associated with a predicted plant protein Swiss:O49549 that has a domain organization similar to the human BRCA1 protein. The repeat has been called PPR.


Pssm-ID: 366695 [Multi-domain]  Cd Length: 31  Bit Score: 37.44  E-value: 5.82e-04
                          10        20        30
                  ....*....|....*....|....*....|
gi 334183024  177 SWNALIAGLVSGNRASEAMELYKRMETEGI 206
Cdd:pfam01535   2 TYNSLISGYCKNGKLEEALELFKEMKEKGI 31
PPR_long pfam17177
Pentacotripeptide-repeat region of PRORP; Pentatricopeptide repeat (PPR) proteins are a large ...
286-369 9.40e-04

Pentacotripeptide-repeat region of PRORP; Pentatricopeptide repeat (PPR) proteins are a large family of modular RNA-binding proteins which mediate several aspects of gene expression primarily in organelles but also in the nucleus. PPR_long is the region of Arabidopsis protein-only RNase P (PRORP) enzyme that consists of up to eleven alpha-helices. PRORPs are a class of RNA processing enzymes that catalyze maturation of the 5' end of precursor tRNAs in Eukaryotes. All PPR proteins contain tandemly repeated sequence motifs (the PPR motifs) which can vary in number. The series of helix-turn-helix motifs formed by PPR motifs throughout the protein produces a superheros with a central groove that allows the protein to bind RNA. Proteins containing PPR motifs are known to have roles in transcription, RNA processing, splicing, stability, editing, and translation. Over a decade after the discovery of PPR proteins, the super-helical structure was confirmed. The protein-only mitochondrial RNase P crystal structure from Arabidopsis thaliana (PRORP1) confirmed the role of its PPR motifs in pre-tRNA binding and suggest it has evolved independently from other RNase P proteins that rely on catalytic RNA.


Pssm-ID: 407303 [Multi-domain]  Cd Length: 212  Bit Score: 40.84  E-value: 9.40e-04
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 334183024  286 HGEAHRALEIFDKLEDNGIKPDDVSYLAALTACRHAGL---------VEYGLSVFNNMACKGVERNMKHYGCVVDLLSRA 356
Cdd:pfam17177  24 HADATGALALYDAAKAEGVRLAQYHYNVLLYLCSKAADatdlkpqlaADRGFEVFEAMKAQGVSPNEATYTAVARLAAAK 103
                          90
                  ....*....|...
gi 334183024  357 GRLREAHDIICSM 369
Cdd:pfam17177 104 GDGDLAFDLVKEM 116
PPR pfam01535
PPR repeat; This repeat has no known function. It is about 35 amino acids long and found in up ...
147-175 2.42e-03

PPR repeat; This repeat has no known function. It is about 35 amino acids long and found in up to 18 copies in some proteins. This family appears to be greatly expanded in plants. This repeat occurs in PET309 that may be involved in RNA stabilization. This domain occurs in crp1 that is involved in RNA processing. This repeat is associated with a predicted plant protein Swiss:O49549 that has a domain organization similar to the human BRCA1 protein. The repeat has been called PPR.


Pssm-ID: 366695 [Multi-domain]  Cd Length: 31  Bit Score: 35.52  E-value: 2.42e-03
                          10        20
                  ....*....|....*....|....*....
gi 334183024  147 CTTLLDAYSKNGDLISAYKLFDEMPVRDV 175
Cdd:pfam01535   3 YNSLISGYCKNGKLEEALELFKEMKEKGI 31
PPR_long pfam17177
Pentacotripeptide-repeat region of PRORP; Pentatricopeptide repeat (PPR) proteins are a large ...
151-265 2.60e-03

Pentacotripeptide-repeat region of PRORP; Pentatricopeptide repeat (PPR) proteins are a large family of modular RNA-binding proteins which mediate several aspects of gene expression primarily in organelles but also in the nucleus. PPR_long is the region of Arabidopsis protein-only RNase P (PRORP) enzyme that consists of up to eleven alpha-helices. PRORPs are a class of RNA processing enzymes that catalyze maturation of the 5' end of precursor tRNAs in Eukaryotes. All PPR proteins contain tandemly repeated sequence motifs (the PPR motifs) which can vary in number. The series of helix-turn-helix motifs formed by PPR motifs throughout the protein produces a superheros with a central groove that allows the protein to bind RNA. Proteins containing PPR motifs are known to have roles in transcription, RNA processing, splicing, stability, editing, and translation. Over a decade after the discovery of PPR proteins, the super-helical structure was confirmed. The protein-only mitochondrial RNase P crystal structure from Arabidopsis thaliana (PRORP1) confirmed the role of its PPR motifs in pre-tRNA binding and suggest it has evolved independently from other RNase P proteins that rely on catalytic RNA.


Pssm-ID: 407303 [Multi-domain]  Cd Length: 212  Bit Score: 39.69  E-value: 2.60e-03
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 334183024  151 LDAYSKNGDLISAYKLFDEMPVRDVA----SWNALIAGLVSGNRASEAM---------ELYKRMETEGIRRSEVTVVAAL 217
Cdd:pfam17177  18 LDKCSKHADATGALALYDAAKAEGVRlaqyHYNVLLYLCSKAADATDLKpqlaadrgfEVFEAMKAQGVSPNEATYTAVA 97
                          90       100       110       120       130
                  ....*....|....*....|....*....|....*....|....*....|..
gi 334183024  218 GACSHLGDVKEGENIFHGYSNDNV---IVSNA-AIDMYSKCGFVDKAYQVFE 265
Cdd:pfam17177  98 RLAAAKGDGDLAFDLVKEMEAAGVsprLRSYSpALHAYCEAGDADKAYEVEE 149
PPR TIGR00756
pentatricopeptide repeat domain (PPR motif); This model describes a domain called the PPR ...
177-210 3.21e-03

pentatricopeptide repeat domain (PPR motif); This model describes a domain called the PPR motif, or pentatricopeptide repeat. Its consensus sequence is 35 positions long and typically is found in four or more tandem copies. This family is strongly represented in plant proteins, particularly those sorted to chloroplasts or mitochondria. The pfam01535, domain of unknown function DUF17, consists of 6 copies of this repeat. This family has a similar consensus to the TPR domain (tetratricopeptide), pfam00515, a 33-residue repeat. It is predicted to form a pair of antiparallel helices similar to that of TPR.


Pssm-ID: 273253 [Multi-domain]  Cd Length: 35  Bit Score: 35.51  E-value: 3.21e-03
                          10        20        30
                  ....*....|....*....|....*....|....
gi 334183024  177 SWNALIAGLVSGNRASEAMELYKRMETEGIRRSE 210
Cdd:TIGR00756   2 TYNTLIDGLCKAGRVEEALELFKEMKERGIEPDV 35
Eplus_motif pfam20430
E+ motif; This is the E+ motif found in some plant pentatricopeptide repeat (PPR) proteins ...
455-478 3.98e-03

E+ motif; This is the E+ motif found in some plant pentatricopeptide repeat (PPR) proteins which contain a C-terminal DYW deaminase domain. The DYW domain is required for RNA editing, a process that deaminates specific cytidines to uridines. This motif, together with the E motif, precedes the DYW domain and, although their role is not clear, they are essential in th RNA editing reaction. The E/E+ motifs may contain two degenerate PPR motifs that could be involved in RNA or protein binding.


Pssm-ID: 466579 [Multi-domain]  Cd Length: 28  Bit Score: 34.94  E-value: 3.98e-03
                          10        20
                  ....*....|....*....|....
gi 334183024  455 IHEFYNSDKSHEQWREIYEKIDEI 478
Cdd:pfam20430   1 TYTFFAGDKSHPESKQIYEKLSDL 24
PPR_2 pfam13041
PPR repeat family; This repeat has no known function. It is about 35 amino acids long and is ...
240-284 6.33e-03

PPR repeat family; This repeat has no known function. It is about 35 amino acids long and is found in up to 18 copies in some proteins. The family appears to be greatly expanded in plants and fungi. The repeat has been called PPR.


Pssm-ID: 463778 [Multi-domain]  Cd Length: 50  Bit Score: 35.03  E-value: 6.33e-03
                          10        20        30        40
                  ....*....|....*....|....*....|....*....|....*...
gi 334183024  240 NVIVSNAAIDMYSKCGFVDKAYQVFEQFTGKK---SVVTWNTMITGFA 284
Cdd:pfam13041   2 DVVTYNTLINGYCKKGKVEEAFKLFNEMKKRGvkpNVYTYTILINGLC 49
PPR_1 pfam12854
PPR repeat; This family matches additional variants of the PPR repeat that were not captured ...
138-170 7.91e-03

PPR repeat; This family matches additional variants of the PPR repeat that were not captured by the model for pfam01535. The exact function is not known.


Pssm-ID: 403914 [Multi-domain]  Cd Length: 34  Bit Score: 34.24  E-value: 7.91e-03
                          10        20        30
                  ....*....|....*....|....*....|...
gi 334183024  138 RGLSADSLLCTTLLDAYSKNGDLISAYKLFDEM 170
Cdd:pfam12854   1 KGLKPDVVTYNTLINGLCRAGRVDEAFELLDEM 33
PPR_3 pfam13812
Pentatricopeptide repeat domain; This family matches additional variants of the PPR repeat ...
295-346 9.00e-03

Pentatricopeptide repeat domain; This family matches additional variants of the PPR repeat that were not captured by the model for pfam01535. In the case of the Arabidopsis protein UniProtKB:Q66GI4, the repeated helices in this N-terminal region, of protein-only RNase P (PRORP) enzymes, form the pentatricopeptide repeat (PPR) domain which enhances pre-tRNA binding affinity. PROPRP enzymes process precursor tRNAs in human mitochondria and in all tRNA-using compartments of Arabidopsis thaliana.


Pssm-ID: 316342 [Multi-domain]  Cd Length: 63  Bit Score: 35.03  E-value: 9.00e-03
                          10        20        30        40        50
                  ....*....|....*....|....*....|....*....|....*....|..
gi 334183024  295 IFDKLEDNGIKPDDVSYLAALTACRHAGLVEYGLSVFNNMACKGVERNMKHY 346
Cdd:pfam13812   2 ILREMVRDGIQLNVNTYTHLLHAYANVGNLKLALEIFERMKKKGIKPTLDTY 53
 
Blast search parameters
Data Source: Precalculated data, version = cdd.v.3.21
Preset Options:Database: CDSEARCH/cdd   Low complexity filter: no  Composition Based Adjustment: yes   E-value threshold: 0.01

References:

  • Wang J et al. (2023), "The conserved domain database in 2023", Nucleic Acids Res.51(D)384-8.
  • Lu S et al. (2020), "The conserved domain database in 2020", Nucleic Acids Res.48(D)265-8.
  • Marchler-Bauer A et al. (2017), "CDD/SPARCLE: functional classification of proteins via subfamily domain architectures.", Nucleic Acids Res.45(D)200-3.
Help | Disclaimer | Write to the Help Desk
NCBI | NLM | NIH