|
Name |
Accession |
Description |
Interval |
E-value |
| WD40 |
COG2319 |
WD40 repeat [General function prediction only]; |
234-599 |
3.58e-24 |
|
WD40 repeat [General function prediction only];
Pssm-ID: 441893 [Multi-domain] Cd Length: 403 Bit Score: 107.30 E-value: 3.58e-24
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 966925059 234 VNFNYSGTLLASVGSnpDYTLTIWNWKEEQPILRTKAFSQEVFKVTFNPDNEEQLTTSGSGHIKFWEMAftfTGLKLQGS 313
Cdd:COG2319 84 VAFSPDGRLLASASA--DGTVRLWDLATGLLLRTLTGHTGAVRSVAFSPDGKTLASGSADGTVRLWDLA---TGKLLRTL 158
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 966925059 314 LGRFGKTTTTDIegymeLPDGKVL-SGSEWGNMLLWE---GGLIKVelCRGtskscHNGPINQIML--DEGEVITVGSDG 387
Cdd:COG2319 159 TGHSGAVTSVAF-----SPDGKLLaSGSDDGTVRLWDlatGKLLRT--LTG-----HTGAVRSVAFspDGKLLASGSADG 226
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 966925059 388 CVRIWDFETidtadiidetGlleiEPINELQVDKNVNL---FSmtkmsetGNNFWLA---QDANGAIWKLDlsfsniTQD 461
Cdd:COG2319 227 TVRLWDLAT----------G----KLLRTLTGHSGSVRsvaFS-------PDGRLLAsgsADGTVRLWDLA------TGE 279
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 966925059 462 PECLFSFHSGAVEAVAVSPLTYLMATTALDCSVRIYDFASKTLLTQMKFKQGG-TALVWVPrmvnfTGAEIIVGFEDGVV 540
Cdd:COG2319 280 LLRTLTGHSGGVNSVAFSPDGKLLASGSDDGTVRLWDLATGKLLRTLTGHTGAvRSVAFSP-----DGKTLASGSDDGTV 354
|
330 340 350 360 370
....*....|....*....|....*....|....*....|....*....|....*....
gi 966925059 541 RVLELYDPKGLTIFVGrkkisdadirlkqvfkpHTARVTALAYERDGEILATGSKDQTV 599
Cdd:COG2319 355 RLWDLATGELLRTLTG-----------------HTGAVTSVAFSPDGRTLASGSADGTV 396
|
|
| WD40 |
cd00200 |
WD40 domain, found in a number of eukaryotic proteins that cover a wide variety of functions ... |
234-599 |
1.46e-21 |
|
WD40 domain, found in a number of eukaryotic proteins that cover a wide variety of functions including adaptor/regulatory modules in signal transduction, pre-mRNA processing and cytoskeleton assembly; typically contains a GH dipeptide 11-24 residues from its N-terminus and the WD dipeptide at its C-terminus and is 40 residues long, hence the name WD40; between GH and WD lies a conserved core; serves as a stable propeller-like platform to which proteins can bind either stably or reversibly; forms a propeller-like structure with several blades where each blade is composed of a four-stranded anti-parallel b-sheet; instances with few detectable copies are hypothesized to form larger structures by dimerization; each WD40 sequence repeat forms the first three strands of one blade and the last strand in the next blade; the last C-terminal WD40 repeat completes the blade structure of the first WD40 repeat to create the closed ring propeller-structure; residues on the top and bottom surface of the propeller are proposed to coordinate interactions with other proteins and/or small ligands; 7 copies of the repeat are present in this alignment.
Pssm-ID: 238121 [Multi-domain] Cd Length: 289 Bit Score: 97.02 E-value: 1.46e-21
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 966925059 234 VNFNYSGTLLASVGSnpDYTLTIWNWKEEQPILRTKAFSQEVFKVTFNPDNEEQLTTSGSGHIKFW-----EMAFTFTGL 308
Cdd:cd00200 15 VAFSPDGKLLATGSG--DGTIKVWDLETGELLRTLKGHTGPVRDVAASADGTYLASGSSDKTIRLWdletgECVRTLTGH 92
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 966925059 309 KLQGSLGRFgkttttdiegymeLPDGKVLSGSEW-GNMLLWEggLIKVELCrgTSKSCHNGPINQI-MLDEGEVITVGS- 385
Cdd:cd00200 93 TSYVSSVAF-------------SPDGRILSSSSRdKTIKVWD--VETGKCL--TTLRGHTDWVNSVaFSPDGTFVASSSq 155
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 966925059 386 DGCVRIWDFETIdtadiidetglleiepinelqvdknvnlfsmtkmsetgnnfwlaqdangaiwkldlsfsnitqdpECL 465
Cdd:cd00200 156 DGTIKLWDLRTG-----------------------------------------------------------------KCV 170
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 966925059 466 --FSFHSGAVEAVAVSPLTYLMATTALDCSVRIYDFASKTLLTQMKFKQGG-TALVWvprmvNFTGAEIIVGFEDGVVRV 542
Cdd:cd00200 171 atLTGHTGEVNSVAFSPDGEKLLSSSSDGTIKLWDLSTGKCLGTLRGHENGvNSVAF-----SPDGYLLASGSEDGTIRV 245
|
330 340 350 360 370
....*....|....*....|....*....|....*....|....*....|....*..
gi 966925059 543 lelYDpkgltifvGRKKisdadiRLKQVFKPHTARVTALAYERDGEILATGSKDQTV 599
Cdd:cd00200 246 ---WD--------LRTG------ECVQTLSGHTNSVTSLAWSPDGKRLASGSADGTI 285
|
|
| Smc |
COG1196 |
Chromosome segregation ATPase Smc [Cell cycle control, cell division, chromosome partitioning]; ... |
1550-1831 |
1.68e-07 |
|
Chromosome segregation ATPase Smc [Cell cycle control, cell division, chromosome partitioning];
Pssm-ID: 440809 [Multi-domain] Cd Length: 983 Bit Score: 56.48 E-value: 1.68e-07
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 966925059 1550 LTLQLREKRLDIEEALVEekkvVDNLKKEYDTLSKKVKIVATNLNAAEEALEAYQREKQQRLNELLVVIpLKLHQIeyvv 1629
Cdd:COG1196 216 RELKEELKELEAELLLLK----LRELEAELEELEAELEELEAELEELEAELAELEAELEELRLELEELE-LELEEA---- 286
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 966925059 1630 fgeipsdlsgtlvfsNHALRRLQERIRELQEENSKQQKLNKEWRERRKQLIREKREMTKTIHKMEETVRQLmiskfgrvv 1709
Cdd:COG1196 287 ---------------QAEEYELLAELARLEQDIARLEERRRELEERLEELEEELAELEEELEELEEELEEL--------- 342
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 966925059 1710 NLEALQTLSVNTTLEELKIRKLRKELANAKEMKMWEEKIAQMRWELMMKTKEHTRKLYQMNDLCIEKKKLDSRLNTLQNQ 1789
Cdd:COG1196 343 EEELEEAEEELEEAEAELAEAEEALLEAEAELAEAEEELEELAEELLEALRAAAELAAQLEELEEAEEALLERLERLEEE 422
|
250 260 270 280
....*....|....*....|....*....|....*....|..
gi 966925059 1790 QGNAFQSLREADVVAREEVTELIQLQAERISALKEEIALLRR 1831
Cdd:COG1196 423 LEELEEALAELEEEEEEEEEALEEAAEEEAELEEEEEALLEL 464
|
|
| SMC_prok_B |
TIGR02168 |
chromosome segregation protein SMC, common bacterial type; SMC (structural maintenance of ... |
1545-1832 |
4.44e-06 |
|
chromosome segregation protein SMC, common bacterial type; SMC (structural maintenance of chromosomes) proteins bind DNA and act in organizing and segregating chromosomes for partition. SMC proteins are found in bacteria, archaea, and eukaryotes. This family represents the SMC protein of most bacteria. The smc gene is often associated with scpB (TIGR00281) and scpA genes, where scp stands for segregation and condensation protein. SMC was shown (in Caulobacter crescentus) to be induced early in S phase but present and bound to DNA throughout the cell cycle. [Cellular processes, Cell division, DNA metabolism, Chromosome-associated proteins]
Pssm-ID: 274008 [Multi-domain] Cd Length: 1179 Bit Score: 51.98 E-value: 4.44e-06
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 966925059 1545 VALFELTLQLREKRLDIEEA---LVEEKKVVDNLKKEYDTLSKKVKIVATNLNAAEEALEAYQREKQQrLNELLVVIPLK 1621
Cdd:TIGR02168 705 KELEELEEELEQLRKELEELsrqISALRKDLARLEAEVEQLEERIAQLSKELTELEAEIEELEERLEE-AEEELAEAEAE 783
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 966925059 1622 LHQIEYVVfgeipSDLSGTLVFSNHALRRLQERIRELQEENSKQQklnkewrERRKQLIREKREMTKTIHKMEETVRQLm 1701
Cdd:TIGR02168 784 IEELEAQI-----EQLKEELKALREALDELRAELTLLNEEAANLR-------ERLESLERRIAATERRLEDLEEQIEEL- 850
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 966925059 1702 iskfgrvvnLEALQTLSVNTTLEELKIRKLRKELANA-KEMKMWEEKIAQMRWE---LMMKTKEHTRKLYQMNDLCIEKK 1777
Cdd:TIGR02168 851 ---------SEDIESLAAEIEELEELIEELESELEALlNERASLEEALALLRSEleeLSEELRELESKRSELRRELEELR 921
|
250 260 270 280 290
....*....|....*....|....*....|....*....|....*....|....*....
gi 966925059 1778 ----KLDSRLNTLQNQQGNAFQSLREADVVAREEVTELIQLQAERISALKEEIALLRRK 1832
Cdd:TIGR02168 922 eklaQLELRLEGLEVRIDNLQERLSEEYSLTLEEAEALENKIEDDEEEARRRLKRLENK 980
|
|
| SMC_N |
pfam02463 |
RecF/RecN/SMC N terminal domain; This domain is found at the N terminus of SMC proteins. The ... |
916-1832 |
2.38e-05 |
|
RecF/RecN/SMC N terminal domain; This domain is found at the N terminus of SMC proteins. The SMC (structural maintenance of chromosomes) superfamily proteins have ATP-binding domains at the N- and C-termini, and two extended coiled-coil domains separated by a hinge in the middle. The eukaryotic SMC proteins form two kind of heterodimers: the SMC1/SMC3 and the SMC2/SMC4 types. These heterodimers constitute an essential part of higher order complexes, which are involved in chromatin and DNA dynamics. This family also includes the RecF and RecN proteins that are involved in DNA metabolism and recombination.
Pssm-ID: 426784 [Multi-domain] Cd Length: 1161 Bit Score: 49.58 E-value: 2.38e-05
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 966925059 916 IEDPKAYSIENARRKREHDKLMKEVEEIKAQKREQIKALRTEFcKLLEMNEELPTHMQFKRTDFDVDSKIRAEmhrrtaf 995
Cdd:pfam02463 158 IEEEAAGSRLKRKKKEALKKLIEETENLAELIIDLEELKLQEL-KLKEQAKKALEYYQLKEKLELEEEYLLYL------- 229
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 966925059 996 KIQQVEKElaWEKEKHELGLKKLKnrfrdplesdtivvhailsdhKISSYRLVQPSKYSKFKRTSQSERKPSKLDRFEKE 1075
Cdd:pfam02463 230 DYLKLNEE--RIDLLQELLRDEQE---------------------EIESSKQEIEKEEEKLAQVLKENKEEEKEKKLQEE 286
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 966925059 1076 GTGRKDSQRDAGGSIAIQEESIIEKGKkfRPKTLSEIMVENQIEKTRKLILKAERAQ--LKIQQRKKEWEELYKSKPGDD 1153
Cdd:pfam02463 287 ELKLLAKEEEELKSELLKLERRKVDDE--EKLKESEKEKKKAEKELKKEKEEIEELEkeLKELEIKREAEEEEEEELEKL 364
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 966925059 1154 YE-DPKDLQAIKEAQVYMGDFNLKTAPDYKIPEHMRINAAKKEEELGHLDSlvhgNKRYMNKCILSLRDLKVAVVEEIQC 1232
Cdd:pfam02463 365 QEkLEQLEEELLAKKKLESERLSSAAKLKEEELELKSEEEKEAQLLLELAR----QLEDLLKEEKKEELEILEEEEESIE 440
|
330 340 350 360 370 380 390 400
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 966925059 1233 LVQELKNIQST-LHVSKHIpipQIPQIHPEEVPEKRFQYDEETLLNFKQQHMKSD-DEKSPGVEQTGSGGPGGGFLKLSS 1310
Cdd:pfam02463 441 LKQGKLTEEKEeLEKQELK---LLKDELELKKSEDLLKETQLVKLQEQLELLLSRqKLEERSQKESKARSGLKVLLALIK 517
|
410 420 430 440 450 460 470 480
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 966925059 1311 GKDGDLTTRDSMSRSSKASTLSLDIPKYMEFEKAEPTDVELEIMKRDEIKHVYMQQYLVNRVKELIVTFDAELRLLRHQK 1390
Cdd:pfam02463 518 DGVGGRIISAHGRLGDLGVAVENYKVAISTAVIVEVSATADEVEERQKLVRALTELPLGARKLRLLIPKLKLPLKSIAVL 597
|
490 500 510 520 530 540 550 560
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 966925059 1391 LKLDTQMKLSDLHHVTLFQEI----LLLKNFEKQENILQERVNSLDKEEQYMQWKINETLKEMEEKKNEITKLQEQEKAL 1466
Cdd:pfam02463 598 EIDPILNLAQLDKATLEADEDdkraKVVEGILKDTELTKLKESAKAKESGLRKGVSLEEGLAEKSEVKASLSELTKELLE 677
|
570 580 590 600 610 620 630 640
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 966925059 1467 YAGFQTAIGENNKFANFLMKVLKKKIKRVKKKEVEGDADEDEESEESSEEESSLESDEDESESEDEVFDDSICPTNCDVA 1546
Cdd:pfam02463 678 IQELQEKAESELAKEEILRRQLEIKKKEQREKEELKKLKLEAEELLADRVQEAQDKINEELKLLKQKIDEEEEEEEKSRL 757
|
650 660 670 680 690 700 710 720
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 966925059 1547 LFELTLQLREKRLDIEEALvEEKKVVDNLKKEYDTLSKKVKIVATNLNAAEEALEAYQREKQQRLNELLVVIPLKLHQIE 1626
Cdd:pfam02463 758 KKEEKEEEKSELSLKEKEL-AEEREKTEKLKVEEEKEEKLKAQEEELRALEEELKEEAELLEEEQLLIEQEEKIKEEELE 836
|
730 740 750 760 770 780 790 800
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 966925059 1627 YvvfgeipsdLSGTLVFSNHALRRLQERIRELQEENSKQQKLNKEWRERRKQLIREKREMTKTIHKMEETVRQlmiskfg 1706
Cdd:pfam02463 837 E---------LALELKEEQKLEKLAEEELERLEEEITKEELLQELLLKEEELEEQKLKDELESKEEKEKEEKK------- 900
|
810 820 830 840 850 860 870 880
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 966925059 1707 rvvnlEALQTLSVNTTLEELKIRKLRKELANAKEMKMWEEkiaqmrWELMMKTKEHTRKLYQMNDLCIEKKKLDSRLNTL 1786
Cdd:pfam02463 901 -----ELEEESQKLNLLEEKENEIEERIKEEAEILLKYEE------EPEELLLEEADEKEKEENNKEEEEERNKRLLLAK 969
|
890 900 910 920
....*....|....*....|....*....|....*....|....*....
gi 966925059 1787 Q---NQQGNAFQSLREADVVAREEVTELIQLQAERISALKEEIALLRRK 1832
Cdd:pfam02463 970 EelgKVNLMAIEEFEEKEERYNKDELEKERLEEEKKKLIRAIIEETCQR 1018
|
|
| WD40 |
smart00320 |
WD40 repeats; Note that these repeats are permuted with respect to the structural repeats ... |
566-603 |
5.02e-05 |
|
WD40 repeats; Note that these repeats are permuted with respect to the structural repeats (blades) of the beta propeller domain.
Pssm-ID: 197651 [Multi-domain] Cd Length: 40 Bit Score: 41.91 E-value: 5.02e-05
10 20 30
....*....|....*....|....*....|....*...
gi 966925059 566 RLKQVFKPHTARVTALAYERDGEILATGSKDQTVFFFE 603
Cdd:smart00320 3 ELLKTLKGHTGPVTSVAFSPDGKYLASGSDDGTIKLWD 40
|
|
| WD40 |
pfam00400 |
WD domain, G-beta repeat; |
566-603 |
1.37e-04 |
|
WD domain, G-beta repeat;
Pssm-ID: 459801 [Multi-domain] Cd Length: 39 Bit Score: 40.79 E-value: 1.37e-04
10 20 30
....*....|....*....|....*....|....*...
gi 966925059 566 RLKQVFKPHTARVTALAYERDGEILATGSKDQTVFFFE 603
Cdd:pfam00400 2 KLLKTLEGHTGSVTSLAFSPDGKLLASGSDDGTVKVWD 39
|
|
| WD40 |
cd00200 |
WD40 domain, found in a number of eukaryotic proteins that cover a wide variety of functions ... |
567-639 |
1.74e-04 |
|
WD40 domain, found in a number of eukaryotic proteins that cover a wide variety of functions including adaptor/regulatory modules in signal transduction, pre-mRNA processing and cytoskeleton assembly; typically contains a GH dipeptide 11-24 residues from its N-terminus and the WD dipeptide at its C-terminus and is 40 residues long, hence the name WD40; between GH and WD lies a conserved core; serves as a stable propeller-like platform to which proteins can bind either stably or reversibly; forms a propeller-like structure with several blades where each blade is composed of a four-stranded anti-parallel b-sheet; instances with few detectable copies are hypothesized to form larger structures by dimerization; each WD40 sequence repeat forms the first three strands of one blade and the last strand in the next blade; the last C-terminal WD40 repeat completes the blade structure of the first WD40 repeat to create the closed ring propeller-structure; residues on the top and bottom surface of the propeller are proposed to coordinate interactions with other proteins and/or small ligands; 7 copies of the repeat are present in this alignment.
Pssm-ID: 238121 [Multi-domain] Cd Length: 289 Bit Score: 45.40 E-value: 1.74e-04
10 20 30 40 50 60 70
....*....|....*....|....*....|....*....|....*....|....*....|....*....|...
gi 966925059 567 LKQVFKPHTARVTALAYERDGEILATGSKDQTVFFFEVERDYKPIGYINTPGPVCQLMWSPmshlESTLLIIC 639
Cdd:cd00200 1 LRRTLKGHTGGVTCVAFSPDGKLLATGSGDGTIKVWDLETGELLRTLKGHTGPVRDVAASA----DGTYLASG 69
|
|
| PRK03918 |
PRK03918 |
DNA double-strand break repair ATPase Rad50; |
1549-1827 |
6.91e-04 |
|
DNA double-strand break repair ATPase Rad50;
Pssm-ID: 235175 [Multi-domain] Cd Length: 880 Bit Score: 44.67 E-value: 6.91e-04
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 966925059 1549 ELTLQLREKRLDIEEALVEEKKVVDNLKKEYDTLSKKVKI-----VATNLNAAEEALEAYQREKQQRLNELLVVIPLKLH 1623
Cdd:PRK03918 456 EYTAELKRIEKELKEIEEKERKLRKELRELEKVLKKESELiklkeLAEQLKELEEKLKKYNLEELEKKAEEYEKLKEKLI 535
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 966925059 1624 QIEyvvfGEIpsdlsgtlvfsnhalRRLQERIRELQEENSKQQKLNKEWR---ERRKQLIREKREMT-KTIHKMEETVRQ 1699
Cdd:PRK03918 536 KLK----GEI---------------KSLKKELEKLEELKKKLAELEKKLDeleEELAELLKELEELGfESVEELEERLKE 596
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 966925059 1700 L--MISKFGRVVN----LEALQTL--SVNTTLEEL---------KIRKLRKELaNAKEMKMWEEKIAQMRWELMMKTKEH 1762
Cdd:PRK03918 597 LepFYNEYLELKDaekeLEREEKElkKLEEELDKAfeelaetekRLEELRKEL-EELEKKYSEEEYEELREEYLELSREL 675
|
250 260 270 280 290 300
....*....|....*....|....*....|....*....|....*....|....*....|....*..
gi 966925059 1763 TRKLYQMNDLCIEKKKLDSRLNTLQNQQGNAFQSLREADVV--AREEVTELIqlqaERISALKEEIA 1827
Cdd:PRK03918 676 AGLRAELEELEKRREEIKKTLEKLKEELEEREKAKKELEKLekALERVEELR----EKVKKYKALLK 738
|
|
| PTZ00121 |
PTZ00121 |
MAEBL; Provisional |
917-1229 |
5.36e-03 |
|
MAEBL; Provisional
Pssm-ID: 173412 [Multi-domain] Cd Length: 2084 Bit Score: 42.05 E-value: 5.36e-03
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 966925059 917 EDPKAYSIENARRKREHDKLMKEVEEIKAQkrEQIKALRTEFCKLLEMNEELPTHMQFKRTDFDVDSKIRAEMHRRTAFK 996
Cdd:PTZ00121 1508 AKKKADEAKKAEEAKKADEAKKAEEAKKAD--EAKKAEEKKKADELKKAEELKKAEEKKKAEEAKKAEEDKNMALRKAEE 1585
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 966925059 997 IQQVEKELAWEKEKHELGLKKLKNRFRDPLESDTIVVHAILSDHKISSYRLVQPSKYSKFKRTSQSERKPSKLDRFEKEG 1076
Cdd:PTZ00121 1586 AKKAEEARIEEVMKLYEEEKKMKAEEAKKAEEAKIKAEELKKAEEEKKKVEQLKKKEAEEKKKAEELKKAEEENKIKAAE 1665
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 966925059 1077 TGRKDSQRDAGGSIAIQEESiiEKGKKfrpktlseimvENQIEKTRKLILKAERAQLKIQQRKKEWEELYKSkpgDDYED 1156
Cdd:PTZ00121 1666 EAKKAEEDKKKAEEAKKAEE--DEKKA-----------AEALKKEAEEAKKAEELKKKEAEEKKKAEELKKA---EEENK 1729
|
250 260 270 280 290 300 310
....*....|....*....|....*....|....*....|....*....|....*....|....*....|...
gi 966925059 1157 PKDLQAIKEAQvymgdfnlktaPDYKIPEHMRINAAKKEEelghldslVHGNKRYMNKCILSLRDLKVAVVEE 1229
Cdd:PTZ00121 1730 IKAEEAKKEAE-----------EDKKKAEEAKKDEEEKKK--------IAHLKKEEEKKAEEIRKEKEAVIEE 1783
|
|
|
|
Name |
Accession |
Description |
Interval |
E-value |
| WD40 |
COG2319 |
WD40 repeat [General function prediction only]; |
234-599 |
3.58e-24 |
|
WD40 repeat [General function prediction only];
Pssm-ID: 441893 [Multi-domain] Cd Length: 403 Bit Score: 107.30 E-value: 3.58e-24
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 966925059 234 VNFNYSGTLLASVGSnpDYTLTIWNWKEEQPILRTKAFSQEVFKVTFNPDNEEQLTTSGSGHIKFWEMAftfTGLKLQGS 313
Cdd:COG2319 84 VAFSPDGRLLASASA--DGTVRLWDLATGLLLRTLTGHTGAVRSVAFSPDGKTLASGSADGTVRLWDLA---TGKLLRTL 158
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 966925059 314 LGRFGKTTTTDIegymeLPDGKVL-SGSEWGNMLLWE---GGLIKVelCRGtskscHNGPINQIML--DEGEVITVGSDG 387
Cdd:COG2319 159 TGHSGAVTSVAF-----SPDGKLLaSGSDDGTVRLWDlatGKLLRT--LTG-----HTGAVRSVAFspDGKLLASGSADG 226
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 966925059 388 CVRIWDFETidtadiidetGlleiEPINELQVDKNVNL---FSmtkmsetGNNFWLA---QDANGAIWKLDlsfsniTQD 461
Cdd:COG2319 227 TVRLWDLAT----------G----KLLRTLTGHSGSVRsvaFS-------PDGRLLAsgsADGTVRLWDLA------TGE 279
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 966925059 462 PECLFSFHSGAVEAVAVSPLTYLMATTALDCSVRIYDFASKTLLTQMKFKQGG-TALVWVPrmvnfTGAEIIVGFEDGVV 540
Cdd:COG2319 280 LLRTLTGHSGGVNSVAFSPDGKLLASGSDDGTVRLWDLATGKLLRTLTGHTGAvRSVAFSP-----DGKTLASGSDDGTV 354
|
330 340 350 360 370
....*....|....*....|....*....|....*....|....*....|....*....
gi 966925059 541 RVLELYDPKGLTIFVGrkkisdadirlkqvfkpHTARVTALAYERDGEILATGSKDQTV 599
Cdd:COG2319 355 RLWDLATGELLRTLTG-----------------HTGAVTSVAFSPDGRTLASGSADGTV 396
|
|
| WD40 |
cd00200 |
WD40 domain, found in a number of eukaryotic proteins that cover a wide variety of functions ... |
234-599 |
1.46e-21 |
|
WD40 domain, found in a number of eukaryotic proteins that cover a wide variety of functions including adaptor/regulatory modules in signal transduction, pre-mRNA processing and cytoskeleton assembly; typically contains a GH dipeptide 11-24 residues from its N-terminus and the WD dipeptide at its C-terminus and is 40 residues long, hence the name WD40; between GH and WD lies a conserved core; serves as a stable propeller-like platform to which proteins can bind either stably or reversibly; forms a propeller-like structure with several blades where each blade is composed of a four-stranded anti-parallel b-sheet; instances with few detectable copies are hypothesized to form larger structures by dimerization; each WD40 sequence repeat forms the first three strands of one blade and the last strand in the next blade; the last C-terminal WD40 repeat completes the blade structure of the first WD40 repeat to create the closed ring propeller-structure; residues on the top and bottom surface of the propeller are proposed to coordinate interactions with other proteins and/or small ligands; 7 copies of the repeat are present in this alignment.
Pssm-ID: 238121 [Multi-domain] Cd Length: 289 Bit Score: 97.02 E-value: 1.46e-21
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 966925059 234 VNFNYSGTLLASVGSnpDYTLTIWNWKEEQPILRTKAFSQEVFKVTFNPDNEEQLTTSGSGHIKFW-----EMAFTFTGL 308
Cdd:cd00200 15 VAFSPDGKLLATGSG--DGTIKVWDLETGELLRTLKGHTGPVRDVAASADGTYLASGSSDKTIRLWdletgECVRTLTGH 92
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 966925059 309 KLQGSLGRFgkttttdiegymeLPDGKVLSGSEW-GNMLLWEggLIKVELCrgTSKSCHNGPINQI-MLDEGEVITVGS- 385
Cdd:cd00200 93 TSYVSSVAF-------------SPDGRILSSSSRdKTIKVWD--VETGKCL--TTLRGHTDWVNSVaFSPDGTFVASSSq 155
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 966925059 386 DGCVRIWDFETIdtadiidetglleiepinelqvdknvnlfsmtkmsetgnnfwlaqdangaiwkldlsfsnitqdpECL 465
Cdd:cd00200 156 DGTIKLWDLRTG-----------------------------------------------------------------KCV 170
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 966925059 466 --FSFHSGAVEAVAVSPLTYLMATTALDCSVRIYDFASKTLLTQMKFKQGG-TALVWvprmvNFTGAEIIVGFEDGVVRV 542
Cdd:cd00200 171 atLTGHTGEVNSVAFSPDGEKLLSSSSDGTIKLWDLSTGKCLGTLRGHENGvNSVAF-----SPDGYLLASGSEDGTIRV 245
|
330 340 350 360 370
....*....|....*....|....*....|....*....|....*....|....*..
gi 966925059 543 lelYDpkgltifvGRKKisdadiRLKQVFKPHTARVTALAYERDGEILATGSKDQTV 599
Cdd:cd00200 246 ---WD--------LRTG------ECVQTLSGHTNSVTSLAWSPDGKRLASGSADGTI 285
|
|
| WD40 |
COG2319 |
WD40 repeat [General function prediction only]; |
238-627 |
1.73e-21 |
|
WD40 repeat [General function prediction only];
Pssm-ID: 441893 [Multi-domain] Cd Length: 403 Bit Score: 99.22 E-value: 1.73e-21
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 966925059 238 YSGTLLASVGSNPDYTLTIWNWKEEQPILRTKAFSQEVFKVTFNPDNEEQLTTSGSGHIKFWEMAftfTGLKLQGSLGRF 317
Cdd:COG2319 44 ASPDGARLAAGAGDLTLLLLDAAAGALLATLLGHTAAVLSVAFSPDGRLLASASADGTVRLWDLA---TGLLLRTLTGHT 120
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 966925059 318 GKTTTTDIEgymelPDGK-VLSGSEWGNMLLWE---GGLIKVelcrgtsKSCHNGPINQIML--DEGEVITVGSDGCVRI 391
Cdd:COG2319 121 GAVRSVAFS-----PDGKtLASGSADGTVRLWDlatGKLLRT-------LTGHSGAVTSVAFspDGKLLASGSDDGTVRL 188
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 966925059 392 WDFETidtadiidetGlleiEPINELQVDKNVnLFSMTkMSETGNnfWLA---QDANGAIWKLDlsfsniTQDPECLFSF 468
Cdd:COG2319 189 WDLAT----------G----KLLRTLTGHTGA-VRSVA-FSPDGK--LLAsgsADGTVRLWDLA------TGKLLRTLTG 244
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 966925059 469 HSGAVEAVAVSPLTYLMATTALDCSVRIYDFASKTLLTQMKFKQGG-TALVWVPrmvnfTGAEIIVGFEDGVVRVLELyd 547
Cdd:COG2319 245 HSGSVRSVAFSPDGRLLASGSADGTVRLWDLATGELLRTLTGHSGGvNSVAFSP-----DGKLLASGSDDGTVRLWDL-- 317
|
330 340 350 360 370 380 390 400
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 966925059 548 pkgltifvgrkkisdADIRLKQVFKPHTARVTALAYERDGEILATGSKDQTVFFFEVERDYKPIGYINTPGPVCQLMWSP 627
Cdd:COG2319 318 ---------------ATGKLLRTLTGHTGAVRSVAFSPDGKTLASGSDDGTVRLWDLATGELLRTLTGHTGAVTSVAFSP 382
|
|
| WD40 |
cd00200 |
WD40 domain, found in a number of eukaryotic proteins that cover a wide variety of functions ... |
209-498 |
3.33e-19 |
|
WD40 domain, found in a number of eukaryotic proteins that cover a wide variety of functions including adaptor/regulatory modules in signal transduction, pre-mRNA processing and cytoskeleton assembly; typically contains a GH dipeptide 11-24 residues from its N-terminus and the WD dipeptide at its C-terminus and is 40 residues long, hence the name WD40; between GH and WD lies a conserved core; serves as a stable propeller-like platform to which proteins can bind either stably or reversibly; forms a propeller-like structure with several blades where each blade is composed of a four-stranded anti-parallel b-sheet; instances with few detectable copies are hypothesized to form larger structures by dimerization; each WD40 sequence repeat forms the first three strands of one blade and the last strand in the next blade; the last C-terminal WD40 repeat completes the blade structure of the first WD40 repeat to create the closed ring propeller-structure; residues on the top and bottom surface of the propeller are proposed to coordinate interactions with other proteins and/or small ligands; 7 copies of the repeat are present in this alignment.
Pssm-ID: 238121 [Multi-domain] Cd Length: 289 Bit Score: 90.09 E-value: 3.33e-19
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 966925059 209 IIIYEYPSLRPYRILRdGTEKGYAYVNFNYSGTLLASVGSnpDYTLTIWNWKEEQPILRTKAFSQEVFKVTFNPDNEEQL 288
Cdd:cd00200 33 IKVWDLETGELLRTLK-GHTGPVRDVAASADGTYLASGSS--DKTIRLWDLETGECVRTLTGHTSYVSSVAFSPDGRILS 109
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 966925059 289 TTSGSGHIKFWEMA-----FTFTGLKLQGSLGRFgkttttdiegymeLPDGKVLSGSEW-GNMLLWEGGLIKvelCRGTS 362
Cdd:cd00200 110 SSSRDKTIKVWDVEtgkclTTLRGHTDWVNSVAF-------------SPDGTFVASSSQdGTIKLWDLRTGK---CVATL 173
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 966925059 363 KScHNGPINQIML--DEGEVITVGSDGCVRIWDFETIDTadiideTGLLEI--EPINELQVDKNVNLFSmtkmsetgnnf 438
Cdd:cd00200 174 TG-HTGEVNSVAFspDGEKLLSSSSDGTIKLWDLSTGKC------LGTLRGheNGVNSVAFSPDGYLLA----------- 235
|
250 260 270 280 290 300
....*....|....*....|....*....|....*....|....*....|....*....|....
gi 966925059 439 wlAQDANGAI--WKLDlsfsnitqDPECLFSF--HSGAVEAVAVSPLTYLMATTALDCSVRIYD 498
Cdd:cd00200 236 --SGSEDGTIrvWDLR--------TGECVQTLsgHTNSVTSLAWSPDGKRLASGSADGTIRIWD 289
|
|
| WD40 |
cd00200 |
WD40 domain, found in a number of eukaryotic proteins that cover a wide variety of functions ... |
332-627 |
9.71e-19 |
|
WD40 domain, found in a number of eukaryotic proteins that cover a wide variety of functions including adaptor/regulatory modules in signal transduction, pre-mRNA processing and cytoskeleton assembly; typically contains a GH dipeptide 11-24 residues from its N-terminus and the WD dipeptide at its C-terminus and is 40 residues long, hence the name WD40; between GH and WD lies a conserved core; serves as a stable propeller-like platform to which proteins can bind either stably or reversibly; forms a propeller-like structure with several blades where each blade is composed of a four-stranded anti-parallel b-sheet; instances with few detectable copies are hypothesized to form larger structures by dimerization; each WD40 sequence repeat forms the first three strands of one blade and the last strand in the next blade; the last C-terminal WD40 repeat completes the blade structure of the first WD40 repeat to create the closed ring propeller-structure; residues on the top and bottom surface of the propeller are proposed to coordinate interactions with other proteins and/or small ligands; 7 copies of the repeat are present in this alignment.
Pssm-ID: 238121 [Multi-domain] Cd Length: 289 Bit Score: 88.55 E-value: 9.71e-19
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 966925059 332 PDGKVL-SGSEWGNMLLW--EGGLIKVELCrGtskscHNGPINQIML--DEGEVITVGSDGCVRIWDFETIdtadiidet 406
Cdd:cd00200 19 PDGKLLaTGSGDGTIKVWdlETGELLRTLK-G-----HTGPVRDVAAsaDGTYLASGSSDKTIRLWDLETG--------- 83
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 966925059 407 glleiEPINELQVDKNvNLFSMTKMSetgNNFWLA---QDANGAIWKLDlsfsnitqDPECLFSF--HSGAVEAVAVSPL 481
Cdd:cd00200 84 -----ECVRTLTGHTS-YVSSVAFSP---DGRILSsssRDKTIKVWDVE--------TGKCLTTLrgHTDWVNSVAFSPD 146
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 966925059 482 TYLMATTALDCSVRIYDFASKTLLTQMKFKQGG-TALVWVPrmvnfTGAEIIVGFEDGVVRVLELYDPKgltifvgrkki 560
Cdd:cd00200 147 GTFVASSSQDGTIKLWDLRTGKCVATLTGHTGEvNSVAFSP-----DGEKLLSSSSDGTIKLWDLSTGK----------- 210
|
250 260 270 280 290 300
....*....|....*....|....*....|....*....|....*....|....*....|....*..
gi 966925059 561 sdadirLKQVFKPHTARVTALAYERDGEILATGSKDQTVFFFEVERDYKPIGYINTPGPVCQLMWSP 627
Cdd:cd00200 211 ------CLGTLRGHENGVNSVAFSPDGYLLASGSEDGTIRVWDLRTGECVQTLSGHTNSVTSLAWSP 271
|
|
| WD40 |
COG2319 |
WD40 repeat [General function prediction only]; |
239-627 |
1.51e-14 |
|
WD40 repeat [General function prediction only];
Pssm-ID: 441893 [Multi-domain] Cd Length: 403 Bit Score: 78.03 E-value: 1.51e-14
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 966925059 239 SGTLLASVGSNPDYTLTIWNWKEEQPILRTKAFSQEVFKVTFNPDNEEQLTTSGSGHIKFWEMAFTFTGLKLQGSLGRfg 318
Cdd:COG2319 3 SADGAALAAASADLALALLAAALGALLLLLLGLAAAVASLAASPDGARLAAGAGDLTLLLLDAAAGALLATLLGHTAA-- 80
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 966925059 319 kttttdIEGYMELPDGKVL-SGSEWGNMLLW--EGGLIKVELcrgtskSCHNGPINQIML--DEGEVITVGSDGCVRIWD 393
Cdd:COG2319 81 ------VLSVAFSPDGRLLaSASADGTVRLWdlATGLLLRTL------TGHTGAVRSVAFspDGKTLASGSADGTVRLWD 148
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 966925059 394 fetidtadiidetglleiepinelqvdknvnlfsmtkmsetgnnfwlaqdangaiwkldlsfsniTQDPECLFSF--HSG 471
Cdd:COG2319 149 -----------------------------------------------------------------LATGKLLRTLtgHSG 163
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 966925059 472 AVEAVAVSPLTYLMATTALDCSVRIYDFASKTLLTQMKfkqGGTALVWVprmVNFT--GAEIIVGFEDGVVRVLELYDPK 549
Cdd:COG2319 164 AVTSVAFSPDGKLLASGSDDGTVRLWDLATGKLLRTLT---GHTGAVRS---VAFSpdGKLLASGSADGTVRLWDLATGK 237
|
330 340 350 360 370 380 390
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....
gi 966925059 550 GLTIFVGrkkisdadirlkqvfkpHTARVTALAYERDGEILATGSKDQTVFFFEVErDYKPIGYINTP-GPVCQLMWSP 627
Cdd:COG2319 238 LLRTLTG-----------------HSGSVRSVAFSPDGRLLASGSADGTVRLWDLA-TGELLRTLTGHsGGVNSVAFSP 298
|
|
| WD40 |
cd00200 |
WD40 domain, found in a number of eukaryotic proteins that cover a wide variety of functions ... |
158-393 |
1.22e-11 |
|
WD40 domain, found in a number of eukaryotic proteins that cover a wide variety of functions including adaptor/regulatory modules in signal transduction, pre-mRNA processing and cytoskeleton assembly; typically contains a GH dipeptide 11-24 residues from its N-terminus and the WD dipeptide at its C-terminus and is 40 residues long, hence the name WD40; between GH and WD lies a conserved core; serves as a stable propeller-like platform to which proteins can bind either stably or reversibly; forms a propeller-like structure with several blades where each blade is composed of a four-stranded anti-parallel b-sheet; instances with few detectable copies are hypothesized to form larger structures by dimerization; each WD40 sequence repeat forms the first three strands of one blade and the last strand in the next blade; the last C-terminal WD40 repeat completes the blade structure of the first WD40 repeat to create the closed ring propeller-structure; residues on the top and bottom surface of the propeller are proposed to coordinate interactions with other proteins and/or small ligands; 7 copies of the repeat are present in this alignment.
Pssm-ID: 238121 [Multi-domain] Cd Length: 289 Bit Score: 67.36 E-value: 1.22e-11
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 966925059 158 IYIAG--NQLIFLNLKTKEQIylRSSSGEGIGVIGVHPHKTYFIVAekGSFPD--IIIYEYPSLRPYRILRdGTEKGYAY 233
Cdd:cd00200 66 LASGSsdKTIRLWDLETGECV--RTLTGHTSYVSSVAFSPDGRILS--SSSRDktIKVWDVETGKCLTTLR-GHTDWVNS 140
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 966925059 234 VNFNYSGTLLASVGSnpDYTLTIWNWKEEQPILRTKAFSQEVFKVTFNPDNEEQLTTSGSGHIKFWEMAFTFTGLKLQGS 313
Cdd:cd00200 141 VAFSPDGTFVASSSQ--DGTIKLWDLRTGKCVATLTGHTGEVNSVAFSPDGEKLLSSSSDGTIKLWDLSTGKCLGTLRGH 218
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 966925059 314 LGRFgktTTTDIegymeLPDGK-VLSGSEWGNMLLWegglikvELCRGTSKSC---HNGPINQIML--DEGEVITVGSDG 387
Cdd:cd00200 219 ENGV---NSVAF-----SPDGYlLASGSEDGTIRVW-------DLRTGECVQTlsgHTNSVTSLAWspDGKRLASGSADG 283
|
....*.
gi 966925059 388 CVRIWD 393
Cdd:cd00200 284 TIRIWD 289
|
|
| WD40 |
COG2319 |
WD40 repeat [General function prediction only]; |
234-396 |
2.78e-11 |
|
WD40 repeat [General function prediction only];
Pssm-ID: 441893 [Multi-domain] Cd Length: 403 Bit Score: 67.63 E-value: 2.78e-11
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 966925059 234 VNFNYSGTLLASVGSnpDYTLTIWNWKEEQPILRTKAFSQEVFKVTFNPDNEeQL-TTSGSGHIKFWEMAftfTGLKLQG 312
Cdd:COG2319 252 VAFSPDGRLLASGSA--DGTVRLWDLATGELLRTLTGHSGGVNSVAFSPDGK-LLaSGSDDGTVRLWDLA---TGKLLRT 325
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 966925059 313 SLGRFGKTTTTDIegymeLPDGKVL-SGSEWGNMLLW--EGGLIKVELcRGtskscHNGPINQIML--DEGEVITVGSDG 387
Cdd:COG2319 326 LTGHTGAVRSVAF-----SPDGKTLaSGSDDGTVRLWdlATGELLRTL-TG-----HTGAVTSVAFspDGRTLASGSADG 394
|
....*....
gi 966925059 388 CVRIWDFET 396
Cdd:COG2319 395 TVRLWDLAT 403
|
|
| WD40 |
COG2319 |
WD40 repeat [General function prediction only]; |
209-396 |
3.07e-11 |
|
WD40 repeat [General function prediction only];
Pssm-ID: 441893 [Multi-domain] Cd Length: 403 Bit Score: 67.63 E-value: 3.07e-11
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 966925059 209 IIIYEYPSLRPYRILRDGTEKGYAyVNFNYSGTLLASVGSnpDYTLTIWNWKEEQPILRTKAFSQEVFKVTFNPDNeEQL 288
Cdd:COG2319 186 VRLWDLATGKLLRTLTGHTGAVRS-VAFSPDGKLLASGSA--DGTVRLWDLATGKLLRTLTGHSGSVRSVAFSPDG-RLL 261
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 966925059 289 -TTSGSGHIKFWEMAftfTGLKLQGSLGRFGKTTTTDIegymeLPDGKVL-SGSEWGNMLLWEGGLIKvelCRGTSKScH 366
Cdd:COG2319 262 aSGSADGTVRLWDLA---TGELLRTLTGHSGGVNSVAF-----SPDGKLLaSGSDDGTVRLWDLATGK---LLRTLTG-H 329
|
170 180 190
....*....|....*....|....*....|..
gi 966925059 367 NGPINQIML--DEGEVITVGSDGCVRIWDFET 396
Cdd:COG2319 330 TGAVRSVAFspDGKTLASGSDDGTVRLWDLAT 361
|
|
| WD40 |
cd00200 |
WD40 domain, found in a number of eukaryotic proteins that cover a wide variety of functions ... |
469-631 |
4.88e-10 |
|
WD40 domain, found in a number of eukaryotic proteins that cover a wide variety of functions including adaptor/regulatory modules in signal transduction, pre-mRNA processing and cytoskeleton assembly; typically contains a GH dipeptide 11-24 residues from its N-terminus and the WD dipeptide at its C-terminus and is 40 residues long, hence the name WD40; between GH and WD lies a conserved core; serves as a stable propeller-like platform to which proteins can bind either stably or reversibly; forms a propeller-like structure with several blades where each blade is composed of a four-stranded anti-parallel b-sheet; instances with few detectable copies are hypothesized to form larger structures by dimerization; each WD40 sequence repeat forms the first three strands of one blade and the last strand in the next blade; the last C-terminal WD40 repeat completes the blade structure of the first WD40 repeat to create the closed ring propeller-structure; residues on the top and bottom surface of the propeller are proposed to coordinate interactions with other proteins and/or small ligands; 7 copies of the repeat are present in this alignment.
Pssm-ID: 238121 [Multi-domain] Cd Length: 289 Bit Score: 62.74 E-value: 4.88e-10
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 966925059 469 HSGAVEAVAVSPLTYLMATTALDCSVRIYDFASKTLLTQMKFKQGG-TALVWVPRmvnftGAEIIVGFEDGVVRVLELYD 547
Cdd:cd00200 8 HTGGVTCVAFSPDGKLLATGSGDGTIKVWDLETGELLRTLKGHTGPvRDVAASAD-----GTYLASGSSDKTIRLWDLET 82
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 966925059 548 PKGLTIFVGrkkisdadirlkqvfkpHTARVTALAYERDGEILATGSKDQTVFFFEVErDYKPIGYINT-PGPVCQLMWS 626
Cdd:cd00200 83 GECVRTLTG-----------------HTSYVSSVAFSPDGRILSSSSRDKTIKVWDVE-TGKCLTTLRGhTDWVNSVAFS 144
|
....*
gi 966925059 627 PMSHL 631
Cdd:cd00200 145 PDGTF 149
|
|
| Smc |
COG1196 |
Chromosome segregation ATPase Smc [Cell cycle control, cell division, chromosome partitioning]; ... |
1550-1831 |
1.68e-07 |
|
Chromosome segregation ATPase Smc [Cell cycle control, cell division, chromosome partitioning];
Pssm-ID: 440809 [Multi-domain] Cd Length: 983 Bit Score: 56.48 E-value: 1.68e-07
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 966925059 1550 LTLQLREKRLDIEEALVEekkvVDNLKKEYDTLSKKVKIVATNLNAAEEALEAYQREKQQRLNELLVVIpLKLHQIeyvv 1629
Cdd:COG1196 216 RELKEELKELEAELLLLK----LRELEAELEELEAELEELEAELEELEAELAELEAELEELRLELEELE-LELEEA---- 286
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 966925059 1630 fgeipsdlsgtlvfsNHALRRLQERIRELQEENSKQQKLNKEWRERRKQLIREKREMTKTIHKMEETVRQLmiskfgrvv 1709
Cdd:COG1196 287 ---------------QAEEYELLAELARLEQDIARLEERRRELEERLEELEEELAELEEELEELEEELEEL--------- 342
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 966925059 1710 NLEALQTLSVNTTLEELKIRKLRKELANAKEMKMWEEKIAQMRWELMMKTKEHTRKLYQMNDLCIEKKKLDSRLNTLQNQ 1789
Cdd:COG1196 343 EEELEEAEEELEEAEAELAEAEEALLEAEAELAEAEEELEELAEELLEALRAAAELAAQLEELEEAEEALLERLERLEEE 422
|
250 260 270 280
....*....|....*....|....*....|....*....|..
gi 966925059 1790 QGNAFQSLREADVVAREEVTELIQLQAERISALKEEIALLRR 1831
Cdd:COG1196 423 LEELEEALAELEEEEEEEEEALEEAAEEEAELEEEEEALLEL 464
|
|
| SMC_prok_B |
TIGR02168 |
chromosome segregation protein SMC, common bacterial type; SMC (structural maintenance of ... |
1545-1832 |
4.44e-06 |
|
chromosome segregation protein SMC, common bacterial type; SMC (structural maintenance of chromosomes) proteins bind DNA and act in organizing and segregating chromosomes for partition. SMC proteins are found in bacteria, archaea, and eukaryotes. This family represents the SMC protein of most bacteria. The smc gene is often associated with scpB (TIGR00281) and scpA genes, where scp stands for segregation and condensation protein. SMC was shown (in Caulobacter crescentus) to be induced early in S phase but present and bound to DNA throughout the cell cycle. [Cellular processes, Cell division, DNA metabolism, Chromosome-associated proteins]
Pssm-ID: 274008 [Multi-domain] Cd Length: 1179 Bit Score: 51.98 E-value: 4.44e-06
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 966925059 1545 VALFELTLQLREKRLDIEEA---LVEEKKVVDNLKKEYDTLSKKVKIVATNLNAAEEALEAYQREKQQrLNELLVVIPLK 1621
Cdd:TIGR02168 705 KELEELEEELEQLRKELEELsrqISALRKDLARLEAEVEQLEERIAQLSKELTELEAEIEELEERLEE-AEEELAEAEAE 783
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 966925059 1622 LHQIEYVVfgeipSDLSGTLVFSNHALRRLQERIRELQEENSKQQklnkewrERRKQLIREKREMTKTIHKMEETVRQLm 1701
Cdd:TIGR02168 784 IEELEAQI-----EQLKEELKALREALDELRAELTLLNEEAANLR-------ERLESLERRIAATERRLEDLEEQIEEL- 850
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 966925059 1702 iskfgrvvnLEALQTLSVNTTLEELKIRKLRKELANA-KEMKMWEEKIAQMRWE---LMMKTKEHTRKLYQMNDLCIEKK 1777
Cdd:TIGR02168 851 ---------SEDIESLAAEIEELEELIEELESELEALlNERASLEEALALLRSEleeLSEELRELESKRSELRRELEELR 921
|
250 260 270 280 290
....*....|....*....|....*....|....*....|....*....|....*....
gi 966925059 1778 ----KLDSRLNTLQNQQGNAFQSLREADVVAREEVTELIQLQAERISALKEEIALLRRK 1832
Cdd:TIGR02168 922 eklaQLELRLEGLEVRIDNLQERLSEEYSLTLEEAEALENKIEDDEEEARRRLKRLENK 980
|
|
| EnvC |
COG4942 |
Septal ring factor EnvC, activator of murein hydrolases AmiA and AmiB [Cell cycle control, ... |
1553-1838 |
1.02e-05 |
|
Septal ring factor EnvC, activator of murein hydrolases AmiA and AmiB [Cell cycle control, cell division, chromosome partitioning];
Pssm-ID: 443969 [Multi-domain] Cd Length: 377 Bit Score: 50.15 E-value: 1.02e-05
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 966925059 1553 QLREKRLDIEEALVEEKKVVDNLKKEYDTLSKKVKIVATNLNAAEEALEAYQREkqqrlnellvviplklhqieyvvfge 1632
Cdd:COG4942 24 EAEAELEQLQQEIAELEKELAALKKEEKALLKQLAALERRIAALARRIRALEQE-------------------------- 77
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 966925059 1633 ipsdlsgtlvfsnhaLRRLQERIRELQEEnskQQKLNKEWRERRKQLirekREMTKTIHKMEETVRQLMI---SKFGRVV 1709
Cdd:COG4942 78 ---------------LAALEAELAELEKE---IAELRAELEAQKEEL----AELLRALYRLGRQPPLALLlspEDFLDAV 135
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 966925059 1710 -NLEALQTLSVNTTLEELKIRKLRKELANAKEmkmweeKIAQMRWELMMKTKEHTRKLYQMNDLCIEKKKLDSRLNTLQN 1788
Cdd:COG4942 136 rRLQYLKYLAPARREQAEELRADLAELAALRA------ELEAERAELEALLAELEEERAALEALKAERQKLLARLEKELA 209
|
250 260 270 280 290
....*....|....*....|....*....|....*....|....*....|
gi 966925059 1789 QQGNAFQSLREADVVAREEVTELIQLQAERiSALKEEIALLRRKGGLILP 1838
Cdd:COG4942 210 ELAAELAELQQEAEELEALIARLEAEAAAA-AERTPAAGFAALKGKLPWP 258
|
|
| SMC_prok_B |
TIGR02168 |
chromosome segregation protein SMC, common bacterial type; SMC (structural maintenance of ... |
1553-1792 |
2.01e-05 |
|
chromosome segregation protein SMC, common bacterial type; SMC (structural maintenance of chromosomes) proteins bind DNA and act in organizing and segregating chromosomes for partition. SMC proteins are found in bacteria, archaea, and eukaryotes. This family represents the SMC protein of most bacteria. The smc gene is often associated with scpB (TIGR00281) and scpA genes, where scp stands for segregation and condensation protein. SMC was shown (in Caulobacter crescentus) to be induced early in S phase but present and bound to DNA throughout the cell cycle. [Cellular processes, Cell division, DNA metabolism, Chromosome-associated proteins]
Pssm-ID: 274008 [Multi-domain] Cd Length: 1179 Bit Score: 49.67 E-value: 2.01e-05
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 966925059 1553 QLREKRLDIEEALVEEKKVVDNLKKEYDTLSKKVKIVA---TNLNAAEEALEAYQREKQQRLNELLvvipLKLHQIEYvv 1629
Cdd:TIGR02168 271 ELRLEVSELEEEIEELQKELYALANEISRLEQQKQILRerlANLERQLEELEAQLEELESKLDELA----EELAELEE-- 344
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 966925059 1630 fgeipsDLSGTLVFSNHALRRLQERIRELQEENSKQQKLNKEWRERRKQLIREKREMT---KTIHKMEETVRQLMiskfG 1706
Cdd:TIGR02168 345 ------KLEELKEELESLEAELEELEAELEELESRLEELEEQLETLRSKVAQLELQIAslnNEIERLEARLERLE----D 414
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 966925059 1707 RVVNLEAlQTLSVNTTLEELKIRKLRKELANAKEM-KMWEEKIAQMRWELMMKTKEHTRKLYQMNDLCIEKKKLDSRLNT 1785
Cdd:TIGR02168 415 RRERLQQ-EIEELLKKLEEAELKELQAELEELEEElEELQEELERLEEALEELREELEEAEQALDAAERELAQLQARLDS 493
|
....*..
gi 966925059 1786 LQNQQGN 1792
Cdd:TIGR02168 494 LERLQEN 500
|
|
| SMC_N |
pfam02463 |
RecF/RecN/SMC N terminal domain; This domain is found at the N terminus of SMC proteins. The ... |
916-1832 |
2.38e-05 |
|
RecF/RecN/SMC N terminal domain; This domain is found at the N terminus of SMC proteins. The SMC (structural maintenance of chromosomes) superfamily proteins have ATP-binding domains at the N- and C-termini, and two extended coiled-coil domains separated by a hinge in the middle. The eukaryotic SMC proteins form two kind of heterodimers: the SMC1/SMC3 and the SMC2/SMC4 types. These heterodimers constitute an essential part of higher order complexes, which are involved in chromatin and DNA dynamics. This family also includes the RecF and RecN proteins that are involved in DNA metabolism and recombination.
Pssm-ID: 426784 [Multi-domain] Cd Length: 1161 Bit Score: 49.58 E-value: 2.38e-05
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 966925059 916 IEDPKAYSIENARRKREHDKLMKEVEEIKAQKREQIKALRTEFcKLLEMNEELPTHMQFKRTDFDVDSKIRAEmhrrtaf 995
Cdd:pfam02463 158 IEEEAAGSRLKRKKKEALKKLIEETENLAELIIDLEELKLQEL-KLKEQAKKALEYYQLKEKLELEEEYLLYL------- 229
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 966925059 996 KIQQVEKElaWEKEKHELGLKKLKnrfrdplesdtivvhailsdhKISSYRLVQPSKYSKFKRTSQSERKPSKLDRFEKE 1075
Cdd:pfam02463 230 DYLKLNEE--RIDLLQELLRDEQE---------------------EIESSKQEIEKEEEKLAQVLKENKEEEKEKKLQEE 286
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 966925059 1076 GTGRKDSQRDAGGSIAIQEESIIEKGKkfRPKTLSEIMVENQIEKTRKLILKAERAQ--LKIQQRKKEWEELYKSKPGDD 1153
Cdd:pfam02463 287 ELKLLAKEEEELKSELLKLERRKVDDE--EKLKESEKEKKKAEKELKKEKEEIEELEkeLKELEIKREAEEEEEEELEKL 364
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 966925059 1154 YE-DPKDLQAIKEAQVYMGDFNLKTAPDYKIPEHMRINAAKKEEELGHLDSlvhgNKRYMNKCILSLRDLKVAVVEEIQC 1232
Cdd:pfam02463 365 QEkLEQLEEELLAKKKLESERLSSAAKLKEEELELKSEEEKEAQLLLELAR----QLEDLLKEEKKEELEILEEEEESIE 440
|
330 340 350 360 370 380 390 400
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 966925059 1233 LVQELKNIQST-LHVSKHIpipQIPQIHPEEVPEKRFQYDEETLLNFKQQHMKSD-DEKSPGVEQTGSGGPGGGFLKLSS 1310
Cdd:pfam02463 441 LKQGKLTEEKEeLEKQELK---LLKDELELKKSEDLLKETQLVKLQEQLELLLSRqKLEERSQKESKARSGLKVLLALIK 517
|
410 420 430 440 450 460 470 480
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 966925059 1311 GKDGDLTTRDSMSRSSKASTLSLDIPKYMEFEKAEPTDVELEIMKRDEIKHVYMQQYLVNRVKELIVTFDAELRLLRHQK 1390
Cdd:pfam02463 518 DGVGGRIISAHGRLGDLGVAVENYKVAISTAVIVEVSATADEVEERQKLVRALTELPLGARKLRLLIPKLKLPLKSIAVL 597
|
490 500 510 520 530 540 550 560
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 966925059 1391 LKLDTQMKLSDLHHVTLFQEI----LLLKNFEKQENILQERVNSLDKEEQYMQWKINETLKEMEEKKNEITKLQEQEKAL 1466
Cdd:pfam02463 598 EIDPILNLAQLDKATLEADEDdkraKVVEGILKDTELTKLKESAKAKESGLRKGVSLEEGLAEKSEVKASLSELTKELLE 677
|
570 580 590 600 610 620 630 640
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 966925059 1467 YAGFQTAIGENNKFANFLMKVLKKKIKRVKKKEVEGDADEDEESEESSEEESSLESDEDESESEDEVFDDSICPTNCDVA 1546
Cdd:pfam02463 678 IQELQEKAESELAKEEILRRQLEIKKKEQREKEELKKLKLEAEELLADRVQEAQDKINEELKLLKQKIDEEEEEEEKSRL 757
|
650 660 670 680 690 700 710 720
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 966925059 1547 LFELTLQLREKRLDIEEALvEEKKVVDNLKKEYDTLSKKVKIVATNLNAAEEALEAYQREKQQRLNELLVVIPLKLHQIE 1626
Cdd:pfam02463 758 KKEEKEEEKSELSLKEKEL-AEEREKTEKLKVEEEKEEKLKAQEEELRALEEELKEEAELLEEEQLLIEQEEKIKEEELE 836
|
730 740 750 760 770 780 790 800
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 966925059 1627 YvvfgeipsdLSGTLVFSNHALRRLQERIRELQEENSKQQKLNKEWRERRKQLIREKREMTKTIHKMEETVRQlmiskfg 1706
Cdd:pfam02463 837 E---------LALELKEEQKLEKLAEEELERLEEEITKEELLQELLLKEEELEEQKLKDELESKEEKEKEEKK------- 900
|
810 820 830 840 850 860 870 880
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 966925059 1707 rvvnlEALQTLSVNTTLEELKIRKLRKELANAKEMKMWEEkiaqmrWELMMKTKEHTRKLYQMNDLCIEKKKLDSRLNTL 1786
Cdd:pfam02463 901 -----ELEEESQKLNLLEEKENEIEERIKEEAEILLKYEE------EPEELLLEEADEKEKEENNKEEEEERNKRLLLAK 969
|
890 900 910 920
....*....|....*....|....*....|....*....|....*....
gi 966925059 1787 Q---NQQGNAFQSLREADVVAREEVTELIQLQAERISALKEEIALLRRK 1832
Cdd:pfam02463 970 EelgKVNLMAIEEFEEKEERYNKDELEKERLEEEKKKLIRAIIEETCQR 1018
|
|
| WD40 |
COG2319 |
WD40 repeat [General function prediction only]; |
169-302 |
4.83e-05 |
|
WD40 repeat [General function prediction only];
Pssm-ID: 441893 [Multi-domain] Cd Length: 403 Bit Score: 47.98 E-value: 4.83e-05
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 966925059 169 NLKTKEQIYLRSSSGEGIGVIGVHPHKTYFIVAekGSFPDIIIYEYPSLRPYRILRDGTEKGYAyVNFNYSGTLLASVGS 248
Cdd:COG2319 274 DLATGELLRTLTGHSGGVNSVAFSPDGKLLASG--SDDGTVRLWDLATGKLLRTLTGHTGAVRS-VAFSPDGKTLASGSD 350
|
90 100 110 120 130
....*....|....*....|....*....|....*....|....*....|....
gi 966925059 249 npDYTLTIWNWKEEQPILRTKAFSQEVFKVTFNPDNEEQLTTSGSGHIKFWEMA 302
Cdd:COG2319 351 --DGTVRLWDLATGELLRTLTGHTGAVTSVAFSPDGRTLASGSADGTVRLWDLA 402
|
|
| WD40 |
smart00320 |
WD40 repeats; Note that these repeats are permuted with respect to the structural repeats ... |
566-603 |
5.02e-05 |
|
WD40 repeats; Note that these repeats are permuted with respect to the structural repeats (blades) of the beta propeller domain.
Pssm-ID: 197651 [Multi-domain] Cd Length: 40 Bit Score: 41.91 E-value: 5.02e-05
10 20 30
....*....|....*....|....*....|....*...
gi 966925059 566 RLKQVFKPHTARVTALAYERDGEILATGSKDQTVFFFE 603
Cdd:smart00320 3 ELLKTLKGHTGPVTSVAFSPDGKYLASGSDDGTIKLWD 40
|
|
| WD40 |
pfam00400 |
WD domain, G-beta repeat; |
566-603 |
1.37e-04 |
|
WD domain, G-beta repeat;
Pssm-ID: 459801 [Multi-domain] Cd Length: 39 Bit Score: 40.79 E-value: 1.37e-04
10 20 30
....*....|....*....|....*....|....*...
gi 966925059 566 RLKQVFKPHTARVTALAYERDGEILATGSKDQTVFFFE 603
Cdd:pfam00400 2 KLLKTLEGHTGSVTSLAFSPDGKLLASGSDDGTVKVWD 39
|
|
| WD40 |
cd00200 |
WD40 domain, found in a number of eukaryotic proteins that cover a wide variety of functions ... |
567-639 |
1.74e-04 |
|
WD40 domain, found in a number of eukaryotic proteins that cover a wide variety of functions including adaptor/regulatory modules in signal transduction, pre-mRNA processing and cytoskeleton assembly; typically contains a GH dipeptide 11-24 residues from its N-terminus and the WD dipeptide at its C-terminus and is 40 residues long, hence the name WD40; between GH and WD lies a conserved core; serves as a stable propeller-like platform to which proteins can bind either stably or reversibly; forms a propeller-like structure with several blades where each blade is composed of a four-stranded anti-parallel b-sheet; instances with few detectable copies are hypothesized to form larger structures by dimerization; each WD40 sequence repeat forms the first three strands of one blade and the last strand in the next blade; the last C-terminal WD40 repeat completes the blade structure of the first WD40 repeat to create the closed ring propeller-structure; residues on the top and bottom surface of the propeller are proposed to coordinate interactions with other proteins and/or small ligands; 7 copies of the repeat are present in this alignment.
Pssm-ID: 238121 [Multi-domain] Cd Length: 289 Bit Score: 45.40 E-value: 1.74e-04
10 20 30 40 50 60 70
....*....|....*....|....*....|....*....|....*....|....*....|....*....|...
gi 966925059 567 LKQVFKPHTARVTALAYERDGEILATGSKDQTVFFFEVERDYKPIGYINTPGPVCQLMWSPmshlESTLLIIC 639
Cdd:cd00200 1 LRRTLKGHTGGVTCVAFSPDGKLLATGSGDGTIKVWDLETGELLRTLKGHTGPVRDVAASA----DGTYLASG 69
|
|
| PRK03918 |
PRK03918 |
DNA double-strand break repair ATPase Rad50; |
1549-1827 |
6.91e-04 |
|
DNA double-strand break repair ATPase Rad50;
Pssm-ID: 235175 [Multi-domain] Cd Length: 880 Bit Score: 44.67 E-value: 6.91e-04
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 966925059 1549 ELTLQLREKRLDIEEALVEEKKVVDNLKKEYDTLSKKVKI-----VATNLNAAEEALEAYQREKQQRLNELLVVIPLKLH 1623
Cdd:PRK03918 456 EYTAELKRIEKELKEIEEKERKLRKELRELEKVLKKESELiklkeLAEQLKELEEKLKKYNLEELEKKAEEYEKLKEKLI 535
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 966925059 1624 QIEyvvfGEIpsdlsgtlvfsnhalRRLQERIRELQEENSKQQKLNKEWR---ERRKQLIREKREMT-KTIHKMEETVRQ 1699
Cdd:PRK03918 536 KLK----GEI---------------KSLKKELEKLEELKKKLAELEKKLDeleEELAELLKELEELGfESVEELEERLKE 596
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 966925059 1700 L--MISKFGRVVN----LEALQTL--SVNTTLEEL---------KIRKLRKELaNAKEMKMWEEKIAQMRWELMMKTKEH 1762
Cdd:PRK03918 597 LepFYNEYLELKDaekeLEREEKElkKLEEELDKAfeelaetekRLEELRKEL-EELEKKYSEEEYEELREEYLELSREL 675
|
250 260 270 280 290 300
....*....|....*....|....*....|....*....|....*....|....*....|....*..
gi 966925059 1763 TRKLYQMNDLCIEKKKLDSRLNTLQNQQGNAFQSLREADVV--AREEVTELIqlqaERISALKEEIA 1827
Cdd:PRK03918 676 AGLRAELEELEKRREEIKKTLEKLKEELEEREKAKKELEKLekALERVEELR----EKVKKYKALLK 738
|
|
| COG4913 |
COG4913 |
Uncharacterized conserved protein, contains a C-terminal ATPase domain [Function unknown]; |
1580-1800 |
9.34e-04 |
|
Uncharacterized conserved protein, contains a C-terminal ATPase domain [Function unknown];
Pssm-ID: 443941 [Multi-domain] Cd Length: 1089 Bit Score: 44.14 E-value: 9.34e-04
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 966925059 1580 DTLSKKVKIVATNLNAAEEALEAYQREKQQRLNELLVviplkLHQIEYVVFGEIpsDLSGTLvfsnHALRRLQERIRELQ 1659
Cdd:COG4913 613 AALEAELAELEEELAEAEERLEALEAELDALQERREA-----LQRLAEYSWDEI--DVASAE----REIAELEAELERLD 681
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 966925059 1660 EENSKQQKLNKEW---RERRKQLIREKREMTKTIHKMEETVRQLmiskfgrvvnlealqtlsvnttleELKIRKLRKELA 1736
Cdd:COG4913 682 ASSDDLAALEEQLeelEAELEELEEELDELKGEIGRLEKELEQA------------------------EEELDELQDRLE 737
|
170 180 190 200 210 220
....*....|....*....|....*....|....*....|....*....|....*....|....*.
gi 966925059 1737 NAKEMKMWE--EKIAQMRWELMmkTKEHTRKLyqmndlcieKKKLDSRLNTLQNQQGNAFQSLREA 1800
Cdd:COG4913 738 AAEDLARLElrALLEERFAAAL--GDAVEREL---------RENLEERIDALRARLNRAEEELERA 792
|
|
| WD40 |
smart00320 |
WD40 repeats; Note that these repeats are permuted with respect to the structural repeats ... |
459-498 |
1.73e-03 |
|
WD40 repeats; Note that these repeats are permuted with respect to the structural repeats (blades) of the beta propeller domain.
Pssm-ID: 197651 [Multi-domain] Cd Length: 40 Bit Score: 37.68 E-value: 1.73e-03
10 20 30 40
....*....|....*....|....*....|....*....|
gi 966925059 459 TQDPECLFSFHSGAVEAVAVSPLTYLMATTALDCSVRIYD 498
Cdd:smart00320 1 SGELLKTLKGHTGPVTSVAFSPDGKYLASGSDDGTIKLWD 40
|
|
| NBCH_WD40 |
pfam20426 |
Neurobeachin beta propeller domain; This entry represents the beta propeller domain found at ... |
560-606 |
1.76e-03 |
|
Neurobeachin beta propeller domain; This entry represents the beta propeller domain found at the C-terminus of neurobeachin-like proteins.
Pssm-ID: 466575 [Multi-domain] Cd Length: 350 Bit Score: 42.75 E-value: 1.76e-03
10 20 30 40
....*....|....*....|....*....|....*....|....*..
gi 966925059 560 ISDADIRLKQVFKPHTARVTALAYERDGEILATGSKDQTVFFFEVER 606
Cdd:pfam20426 109 ISLNDGRMVQSIRQHKDVVSCVAVTSDGSILATGSYDTTVMVWEVLR 155
|
|
| YhaN |
COG4717 |
Uncharacterized conserved protein YhaN, contains AAA domain [Function unknown]; |
1648-1832 |
2.62e-03 |
|
Uncharacterized conserved protein YhaN, contains AAA domain [Function unknown];
Pssm-ID: 443752 [Multi-domain] Cd Length: 641 Bit Score: 42.45 E-value: 2.62e-03
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 966925059 1648 LRRLQERIRELQEENSKQQKLNKEWRERRKQLIREKREMTKTIHKMEETVRQLmiskfgrvvnLEALQTLSVNTTLEELK 1727
Cdd:COG4717 48 LERLEKEADELFKPQGRKPELNLKELKELEEELKEAEEKEEEYAELQEELEEL----------EEELEELEAELEELREE 117
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 966925059 1728 IRKLRKELANAKEMKMWEEKIAQMRwELMMKTKEHTRKLYQMNDLCIEKKKLDSRLNTLQNQQGNAFQSLREADVVAREE 1807
Cdd:COG4717 118 LEKLEKLLQLLPLYQELEALEAELA-ELPERLEELEERLEELRELEEELEELEAELAELQEELEELLEQLSLATEEELQD 196
|
170 180
....*....|....*....|....*
gi 966925059 1808 VTELIQLQAERISALKEEIALLRRK 1832
Cdd:COG4717 197 LAEELEELQQRLAELEEELEEAQEE 221
|
|
| PRK03918 |
PRK03918 |
DNA double-strand break repair ATPase Rad50; |
1546-1755 |
3.17e-03 |
|
DNA double-strand break repair ATPase Rad50;
Pssm-ID: 235175 [Multi-domain] Cd Length: 880 Bit Score: 42.36 E-value: 3.17e-03
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 966925059 1546 ALFELTLQLRE-----KRLDIEEaLVEEKKVVDNLKKEYDTLSKKVKIVATNLNAAEE------ALEAYQREKQQRLNEL 1614
Cdd:PRK03918 497 KLKELAEQLKEleeklKKYNLEE-LEKKAEEYEKLKEKLIKLKGEIKSLKKELEKLEElkkklaELEKKLDELEEELAEL 575
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 966925059 1615 LVviplKLHQIEYVVFGEIPSDLSG---------TLVFSNHALRRLQERIRELQEENSKQ----QKLNKEWRERRKQLIR 1681
Cdd:PRK03918 576 LK----ELEELGFESVEELEERLKElepfyneylELKDAEKELEREEKELKKLEEELDKAfeelAETEKRLEELRKELEE 651
|
170 180 190 200 210 220 230
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*.
gi 966925059 1682 EKREMTKTIHkmeETVRQLMISKFGRVVNLEAlQTLSVNTTLEELK--IRKLRKELANAKEMKMWEEKIAQMRWEL 1755
Cdd:PRK03918 652 LEKKYSEEEY---EELREEYLELSRELAGLRA-ELEELEKRREEIKktLEKLKEELEEREKAKKELEKLEKALERV 723
|
|
| COG4913 |
COG4913 |
Uncharacterized conserved protein, contains a C-terminal ATPase domain [Function unknown]; |
1567-1836 |
3.37e-03 |
|
Uncharacterized conserved protein, contains a C-terminal ATPase domain [Function unknown];
Pssm-ID: 443941 [Multi-domain] Cd Length: 1089 Bit Score: 42.59 E-value: 3.37e-03
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 966925059 1567 EEKKV---VDNLKKEYDTLskkvkivatnlNAAEEALEayqREKQQRlnELLVVIPlKLHQ------IEYVVFGEIPSDL 1637
Cdd:COG4913 219 EEPDTfeaADALVEHFDDL-----------ERAHEALE---DAREQI--ELLEPIR-ELAEryaaarERLAELEYLRAAL 281
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 966925059 1638 sgTLVFSNHALRRLQERIRELQEENSKQQKLNKEWRERRKQLIREKREMTktihkmeetvRQLMISKFGRVVNLEAlqtl 1717
Cdd:COG4913 282 --RLWFAQRRLELLEAELEELRAELARLEAELERLEARLDALREELDELE----------AQIRGNGGDRLEQLER---- 345
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 966925059 1718 svnttleelKIRKLRKELAN-AKEMKMWEEKIAQMRWELmmktkEHTRKLYQMNdlcieKKKLDSRLNTLQNQQGNAFQS 1796
Cdd:COG4913 346 ---------EIERLERELEErERRRARLEALLAALGLPL-----PASAEEFAAL-----RAEAAALLEALEEELEALEEA 406
|
250 260 270 280
....*....|....*....|....*....|....*....|
gi 966925059 1797 LREADVvareevtELIQLQAERiSALKEEIALLRRKGGLI 1836
Cdd:COG4913 407 LAEAEA-------ALRDLRREL-RELEAEIASLERRKSNI 438
|
|
| PTZ00121 |
PTZ00121 |
MAEBL; Provisional |
917-1229 |
5.36e-03 |
|
MAEBL; Provisional
Pssm-ID: 173412 [Multi-domain] Cd Length: 2084 Bit Score: 42.05 E-value: 5.36e-03
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 966925059 917 EDPKAYSIENARRKREHDKLMKEVEEIKAQkrEQIKALRTEFCKLLEMNEELPTHMQFKRTDFDVDSKIRAEMHRRTAFK 996
Cdd:PTZ00121 1508 AKKKADEAKKAEEAKKADEAKKAEEAKKAD--EAKKAEEKKKADELKKAEELKKAEEKKKAEEAKKAEEDKNMALRKAEE 1585
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 966925059 997 IQQVEKELAWEKEKHELGLKKLKNRFRDPLESDTIVVHAILSDHKISSYRLVQPSKYSKFKRTSQSERKPSKLDRFEKEG 1076
Cdd:PTZ00121 1586 AKKAEEARIEEVMKLYEEEKKMKAEEAKKAEEAKIKAEELKKAEEEKKKVEQLKKKEAEEKKKAEELKKAEEENKIKAAE 1665
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 966925059 1077 TGRKDSQRDAGGSIAIQEESiiEKGKKfrpktlseimvENQIEKTRKLILKAERAQLKIQQRKKEWEELYKSkpgDDYED 1156
Cdd:PTZ00121 1666 EAKKAEEDKKKAEEAKKAEE--DEKKA-----------AEALKKEAEEAKKAEELKKKEAEEKKKAEELKKA---EEENK 1729
|
250 260 270 280 290 300 310
....*....|....*....|....*....|....*....|....*....|....*....|....*....|...
gi 966925059 1157 PKDLQAIKEAQvymgdfnlktaPDYKIPEHMRINAAKKEEelghldslVHGNKRYMNKCILSLRDLKVAVVEE 1229
Cdd:PTZ00121 1730 IKAEEAKKEAE-----------EDKKKAEEAKKDEEEKKK--------IAHLKKEEEKKAEEIRKEKEAVIEE 1783
|
|
| SMC_prok_A |
TIGR02169 |
chromosome segregation protein SMC, primarily archaeal type; SMC (structural maintenance of ... |
1593-1845 |
5.58e-03 |
|
chromosome segregation protein SMC, primarily archaeal type; SMC (structural maintenance of chromosomes) proteins bind DNA and act in organizing and segregating chromosomes for partition. SMC proteins are found in bacteria, archaea, and eukaryotes. It is found in a single copy and is homodimeric in prokaryotes, but six paralogs (excluded from this family) are found in eukarotes, where SMC proteins are heterodimeric. This family represents the SMC protein of archaea and a few bacteria (Aquifex, Synechocystis, etc); the SMC of other bacteria is described by TIGR02168. The N- and C-terminal domains of this protein are well conserved, but the central hinge region is skewed in composition and highly divergent. [Cellular processes, Cell division, DNA metabolism, Chromosome-associated proteins]
Pssm-ID: 274009 [Multi-domain] Cd Length: 1164 Bit Score: 41.59 E-value: 5.58e-03
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 966925059 1593 LNAAEEALEAYQREKQQRLNELLVVIPLKLHQIEyvvfgEIPSDLSGTlvfsNHALRRLQERIRELQEENSKQQKLNKEW 1672
Cdd:TIGR02169 686 LKRELSSLQSELRRIENRLDELSQELSDASRKIG-----EIEKEIEQL----EQEEEKLKERLEELEEDLSSLEQEIENV 756
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 966925059 1673 RERRKQLIREKREMTKTIHKMEETV----RQLMISKFGRVVNL------------EALQTLSVNTTLEELKIRKLRKELA 1736
Cdd:TIGR02169 757 KSELKELEARIEELEEDLHKLEEALndleARLSHSRIPEIQAElskleeevsrieARLREIEQKLNRLTLEKEYLEKEIQ 836
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 966925059 1737 NAKE-MKMWEEKIA--QMRWELMMKTKEHT-----RKLYQMNDLCIEKKKLDSRLNTLQNQQGNAFQSLREADvVAREEV 1808
Cdd:TIGR02169 837 ELQEqRIDLKEQIKsiEKEIENLNGKKEELeeeleELEAALRDLESRLGDLKKERDELEAQLRELERKIEELE-AQIEKK 915
|
250 260 270
....*....|....*....|....*....|....*..
gi 966925059 1809 TELIQLQAERISALKEEIAllrrkggLILPPIRSPQE 1845
Cdd:TIGR02169 916 RKRLSELKAKLEALEEELS-------EIEDPKGEDEE 945
|
|
| 235kDa-fam |
TIGR01612 |
reticulocyte binding/rhoptry protein; This model represents a group of paralogous families in ... |
1566-1788 |
6.02e-03 |
|
reticulocyte binding/rhoptry protein; This model represents a group of paralogous families in plasmodium species alternately annotated as reticulocyte binding protein, 235-kDa family protein and rhoptry protein. Rhoptry protein is localized on the cell surface and is extremely large (although apparently lacking in repeat structure) and is important for the process of invasion of the RBCs by the parasite. These proteins are found in P. falciparum, P. vivax and P. yoelii.
Pssm-ID: 130673 [Multi-domain] Cd Length: 2757 Bit Score: 41.96 E-value: 6.02e-03
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 966925059 1566 VEEKKVVDNLKKEYDTLSKKVKivatnlNAAEEALEAYQREKQQRLNELL-VVIPLKLHqieyvVFGEIPSDLSGTLV-- 1642
Cdd:TIGR01612 692 TEDKAKLDDLKSKIDKEYDKIQ------NMETATVELHLSNIENKKNELLdIIVEIKKH-----IHGEINKDLNKILEdf 760
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 966925059 1643 -------------FSNH--ALRRLQERIRELQEENSKQQKLNKEWRERRKQLIREKREMTKTIH-KMEETVRQL------ 1700
Cdd:TIGR01612 761 knkekelsnkindYAKEkdELNKYKSKISEIKNHYNDQINIDNIKDEDAKQNYDKSKEYIKTISiKEDEIFKIInemkfm 840
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 966925059 1701 ---MISKFGRVVNLEALQTLSVNTTLEELK--IRKLRKELANAKeMKMWEEKIAQMRwELMMKTKEHTRKLYQ-MNDLci 1774
Cdd:TIGR01612 841 kddFLNKVDKFINFENNCKEKIDSEHEQFAelTNKIKAEISDDK-LNDYEKKFNDSK-SLINEINKSIEEEYQnINTL-- 916
|
250
....*....|....
gi 966925059 1775 ekKKLDSRLNTLQN 1788
Cdd:TIGR01612 917 --KKVDEYIKICEN 928
|
|
| YhaN |
COG4717 |
Uncharacterized conserved protein YhaN, contains AAA domain [Function unknown]; |
1546-1756 |
8.68e-03 |
|
Uncharacterized conserved protein YhaN, contains AAA domain [Function unknown];
Pssm-ID: 443752 [Multi-domain] Cd Length: 641 Bit Score: 40.91 E-value: 8.68e-03
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 966925059 1546 ALFELTLQLREKRLDIEEALVEEKKVVDNLKKEYDTLSKKVKIVATNLNAAEEALEayqrEKQQRLNELLVVIPLKLHQI 1625
Cdd:COG4717 57 ELFKPQGRKPELNLKELKELEEELKEAEEKEEEYAELQEELEELEEELEELEAELE----ELREELEKLEKLLQLLPLYQ 132
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 966925059 1626 EYvvfgeipSDLSGTLVFSNHALRRLQERIRELQEENSKQQKLNKEWRERRKQLIREKREMT----KTIHKMEETVRQLM 1701
Cdd:COG4717 133 EL-------EALEAELAELPERLEELEERLEELRELEEELEELEAELAELQEELEELLEQLSlateEELQDLAEELEELQ 205
|
170 180 190 200 210
....*....|....*....|....*....|....*....|....*....|....*...
gi 966925059 1702 iskfgrvvnlEALQTLSVNTTLEELKIRKLRKELANAKEMKMW---EEKIAQMRWELM 1756
Cdd:COG4717 206 ----------QRLAELEEELEEAQEELEELEEELEQLENELEAaalEERLKEARLLLL 253
|
|
| WD40 |
pfam00400 |
WD domain, G-beta repeat; |
463-498 |
8.74e-03 |
|
WD domain, G-beta repeat;
Pssm-ID: 459801 [Multi-domain] Cd Length: 39 Bit Score: 35.78 E-value: 8.74e-03
10 20 30
....*....|....*....|....*....|....*...
gi 966925059 463 ECLFSF--HSGAVEAVAVSPLTYLMATTALDCSVRIYD 498
Cdd:pfam00400 2 KLLKTLegHTGSVTSLAFSPDGKLLASGSDDGTVKVWD 39
|
|
|