NCBI Home Page NCBI Site Search page NCBI Guide that lists and describes the NCBI resources
Conserved domains on  [gi|1907141596|ref|XP_036018293|]
View 

coiled-coil domain-containing protein 141 isoform X3 [Mus musculus]

Protein Classification

Graphical summary

 Zoom to residue level

show extra options »

Show site features     Horizontal zoom: ×

List of domain hits

Name Accession Description Interval E-value
I-set pfam07679
Immunoglobulin I-set domain;
1360-1449 1.94e-25

Immunoglobulin I-set domain;


:

Pssm-ID: 400151 [Multi-domain]  Cd Length: 90  Bit Score: 101.57  E-value: 1.94e-25
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907141596 1360 PHFSRLLSNVTVMEGSPVTLEVEVTGFPEPTLTWFKKGQKLCADGHLQVLHKDTKHSVFIPKVCEADAGLYVAQAQNSSG 1439
Cdd:pfam07679    1 PKFTQKPKDVEVQEGESARFTCTVTGTPDPEVSWFKDGQPLRSSDRFKVTYEGGTYTLTISNVQPDDSGKYTCVATNSAG 80
                           90
                   ....*....|
gi 1907141596 1440 TLSSKAILHV 1449
Cdd:pfam07679   81 EAEASAELTV 90
SPEC cd00176
Spectrin repeats, found in several proteins involved in cytoskeletal structure; family members ...
132-356 6.96e-10

Spectrin repeats, found in several proteins involved in cytoskeletal structure; family members include spectrin, alpha-actinin and dystrophin; the spectrin repeat forms a three helix bundle with the second helix interrupted by proline in some sequences; the repeats are independent folding units; tandem repeats are found in differing numbers and arrange in an antiparallel manner to form dimers; the repeats are defined by a characteristic tryptophan (W) residue in helix A and a leucine (L) at the carboxyl end of helix C and separated by a linker of 5 residues; two copies of the repeat are present here


:

Pssm-ID: 238103 [Multi-domain]  Cd Length: 213  Bit Score: 60.54  E-value: 6.96e-10
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907141596  132 EFFQSALEFAIKIDQAEAFLQNPHEFESTEALQSLLLLHDRHAKELLERSldllNKSQQLTDFIEKfkcegstmnseLIQ 211
Cdd:cd00176      4 QFLRDADELEAWLSEKEELLSSTDYGDDLESVEALLKKHEALEAELAAHE----ERVEALNELGEQ-----------LIE 68
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907141596  212 GAQSSCLKIDSLLELLQDRRRQLDKYLQQQRQELSQVLQLCLWDQQENQVSSWFQKAIRDLQEQSLGASLSDNRELICKH 291
Cdd:cd00176     69 EGHPDAEEIQERLEELNQRWEELRELAEERRQRLEEALDLQQFFRDADDLEQWLEEKEAALASEDLGKDLESVEELLKKH 148
                          170       180       190       200       210       220
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*
gi 1907141596  292 EDLIVKAKEWDSAVEKLKSQALGILLSKDLAGKEHLQLSNQKLNRLQEEFGRLMVERKAWLSMAN 356
Cdd:cd00176    149 KELEEELEAHEPRLKSLNELAEELLEEGHPDADEEIEEKLEELNERWEELLELAEERQKKLEEAL 213
SMC_prok_B super family cl37069
chromosome segregation protein SMC, common bacterial type; SMC (structural maintenance of ...
716-1067 3.80e-06

chromosome segregation protein SMC, common bacterial type; SMC (structural maintenance of chromosomes) proteins bind DNA and act in organizing and segregating chromosomes for partition. SMC proteins are found in bacteria, archaea, and eukaryotes. This family represents the SMC protein of most bacteria. The smc gene is often associated with scpB (TIGR00281) and scpA genes, where scp stands for segregation and condensation protein. SMC was shown (in Caulobacter crescentus) to be induced early in S phase but present and bound to DNA throughout the cell cycle. [Cellular processes, Cell division, DNA metabolism, Chromosome-associated proteins]


The actual alignment was detected with superfamily member TIGR02168:

Pssm-ID: 274008 [Multi-domain]  Cd Length: 1179  Bit Score: 51.98  E-value: 3.80e-06
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907141596  716 SLQSTSDLRRRWIAMKPQLQQLHEDVQqiTKEWEVLSSQgapLKEKAEQLKDLVHLHRRQRERIQEYEEILYKTVQFHQV 795
Cdd:TIGR02168  204 SLERQAEKAERYKELKAELRELELALL--VLRLEELREE---LEELQEELKEAEEELEELTAELQELEEKLEELRLEVSE 278
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907141596  796 KEELVHLIKPRELELLAQpmelasSEEVQMQLGRSQERRAhvdHLHQLALTLGVDIISSVQQP--LKESFKDIKKKFNNL 873
Cdd:TIGR02168  279 LEEEIEELQKELYALANE------ISRLEQQKQILRERLA---NLERQLEELEAQLEELESKLdeLAEELAELEEKLEEL 349
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907141596  874 KFNY-------SKKNEKSRNLKTLQYQIQ-QVDTYAEKIQALRKKMEKVNNKtsdsfLSYpsnkanmLSEAMEDLKKNVD 945
Cdd:TIGR02168  350 KEELesleaelEELEAELEELESRLEELEeQLETLRSKVAQLELQIASLNNE-----IER-------LEARLERLEDRRE 417
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907141596  946 DFDKVVTDYKMNLDLTE--HLQEVIEECNFWYEDASATVVRVgkysmecqtREAVDILHRQFNKFITP--SVPQQEERIQ 1021
Cdd:TIGR02168  418 RLQQEIEELLKKLEEAElkELQAELEELEEELEELQEELERL---------EEALEELREELEEAEQAldAAERELAQLQ 488
                          330       340       350       360
                   ....*....|....*....|....*....|....*....|....*.
gi 1907141596 1022 EVIDLAQRLYGLEEGQKYAEKIVTRHKEILESITELCGSLVELKEK 1067
Cdd:TIGR02168  489 ARLDSLERLQENLEGFSEGVKALLKNQSGLSGILGVLSELISVDEG 534
sbcc super family cl31020
exonuclease SbcC; All proteins in this family for which functions are known are part of an ...
219-790 1.87e-04

exonuclease SbcC; All proteins in this family for which functions are known are part of an exonuclease complex with sbcD homologs. This complex is involved in the initiation of recombination to regulate the levels of palindromic sequences in DNA. This family is based on the phylogenomic analysis of JA Eisen (1999, Ph.D. Thesis, Stanford University). [DNA metabolism, DNA replication, recombination, and repair]


The actual alignment was detected with superfamily member TIGR00618:

Pssm-ID: 129705 [Multi-domain]  Cd Length: 1042  Bit Score: 46.12  E-value: 1.87e-04
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907141596  219 KIDSLLELLQDRRRQLDKYLQQ------QRQELSQVLQLC-LWDQQENQVSSWF--QKAIRDLQEQslgaSLSDNRELIC 289
Cdd:TIGR00618  308 QAQRIHTELQSKMRSRAKLLMKraahvkQQSSIEEQRRLLqTLHSQEIHIRDAHevATSIREISCQ----QHTLTQHIHT 383
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907141596  290 KHEDL-IVKAKEWDSAVEKLKSQAL-----------GILLSKDLAGKEHLQLSNQKLNRLQEEFGRLMVERKAWLSMAND 357
Cdd:TIGR00618  384 LQQQKtTLTQKLQSLCKELDILQREqatidtrtsafRDLQGQLAHAKKQQELQQRYAELCAAAITCTAQCEKLEKIHLQE 463
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907141596  358 FFTSANKSFDVLGKVEAYLKLLKSEGLSLPVLAAKHEELHREIKDSTATALQKgRTLISQVDSCRSRVTGIHEMMGYIQN 437
Cdd:TIGR00618  464 SAQSLKEREQQLQTKEQIHLQETRKKAVVLARLLELQEEPCPLCGSCIHPNPA-RQDIDNPGPLTRRMQRGEQTYAQLET 542
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907141596  438 RVDCLTEQCTAHEEFARKRQQLATSVDDYLRKVEMSIQEIRPILATTLDVAS-----SPSESEKILNKYLELDIQVKETA 512
Cdd:TIGR00618  543 SEEDVYHQLTSERKQRASLKEQMQEIQQSFSILTQCDNRSKEDIPNLQNITVrlqdlTEKLSEAEDMLACEQHALLRKLQ 622
                          330       340       350       360       370       380       390       400
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907141596  513 HALEAAAKIMTEKN---ELELNEVALLPLKVKWLEEELstlgRSISCRSRILQT-YVAFRKSSEEAEEQLQSLKEFYLTE 588
Cdd:TIGR00618  623 PEQDLQDVRLHLQQcsqELALKLTALHALQLTLTQERV----REHALSIRVLPKeLLASRQLALQKMQSEKEQLTYWKEM 698
                          410       420       430       440       450       460       470       480
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907141596  589 IPWKDEddaVVKCQSNSAERKWQLFlkksfltQDLSLEFLNLI-NMAKENEILN--VKNEMHIMENIMEKQTNGREELSH 665
Cdd:TIGR00618  699 LAQCQT---LLRELETHIEEYDREF-------NEIENASSSLGsDLAAREDALNqsLKELMHQARTVLKARTEAHFNNNE 768
                          490       500       510       520       530       540       550       560
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907141596  666 LRVAWYLKAIEGKPAREQWEMFKEKLTKTTHSVKLLHEVLMPVSALDLGGSLQSTSDLRRRWIAMKPQLQQLHEDVQQIT 745
Cdd:TIGR00618  769 EVTAALQTGAELSHLAAEIQFFNRLREEDTHLLKTLEAEIGQEIPSDEDILNLQCETLVQEEEQFLSRLEEKSATLGEIT 848
                          570       580       590       600
                   ....*....|....*....|....*....|....*....|....*
gi 1907141596  746 KEwevlssqgapLKEKAEQLKDLVHLHRRQRERIQEYEEILYKTV 790
Cdd:TIGR00618  849 HQ----------LLKYEECSKQLAQLTQEQAKIIQLSDKLNGINQ 883
 
Name Accession Description Interval E-value
I-set pfam07679
Immunoglobulin I-set domain;
1360-1449 1.94e-25

Immunoglobulin I-set domain;


Pssm-ID: 400151 [Multi-domain]  Cd Length: 90  Bit Score: 101.57  E-value: 1.94e-25
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907141596 1360 PHFSRLLSNVTVMEGSPVTLEVEVTGFPEPTLTWFKKGQKLCADGHLQVLHKDTKHSVFIPKVCEADAGLYVAQAQNSSG 1439
Cdd:pfam07679    1 PKFTQKPKDVEVQEGESARFTCTVTGTPDPEVSWFKDGQPLRSSDRFKVTYEGGTYTLTISNVQPDDSGKYTCVATNSAG 80
                           90
                   ....*....|
gi 1907141596 1440 TLSSKAILHV 1449
Cdd:pfam07679   81 EAEASAELTV 90
IG_like smart00410
Immunoglobulin like; IG domains that cannot be classified into one of IGv1, IGc1, IGc2, IG.
1367-1449 5.84e-17

Immunoglobulin like; IG domains that cannot be classified into one of IGv1, IGc1, IGc2, IG.


Pssm-ID: 214653 [Multi-domain]  Cd Length: 85  Bit Score: 77.16  E-value: 5.84e-17
                            10        20        30        40        50        60        70        80
                    ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907141596  1367 SNVTVMEGSPVTLEVEVTGFPEPTLTWFKKGQK-LCADGHLQVLHKDTKHSVFIPKVCEADAGLYVAQAQNSSGTLSSKA 1445
Cdd:smart00410    2 PSVTVKEGESVTLSCEASGSPPPEVTWYKQGGKlLAESGRFSVSRSGSTSTLTISNVTPEDSGTYTCAATNSSGSASSGT 81

                    ....
gi 1907141596  1446 ILHV 1449
Cdd:smart00410   82 TLTV 85
IgI_titin_I1-like cd20951
Immunoglobulin domain I1 of the titin I-band and similar proteins; a member of the I-set of ...
1360-1449 5.93e-15

Immunoglobulin domain I1 of the titin I-band and similar proteins; a member of the I-set of IgSF domains; The members here are composed of the immunoglobulin domain I1 of the titin I-band and similar proteins. Titin is a key component in the assembly and functioning of vertebrate striated muscles. By providing connections at the level of individual microfilaments, it contributes to the fine balance of forces between the two halves of the sarcomere. The Ig superfamily (IgSF) is a heterogenous group of proteins, built on a common fold comprised of a sandwich of two beta sheets. The two sheets are linked together by a conserved disulfide bond between B strand and F strand. IgSF domains can be divided into 4 main classes based on their structures and sequences: the Variable (V), Constant 1 (C1), Constant 2 (C2), and Intermediate (I) sets. The Ig I1 domain of the titin I-band is a member of the I-set Ig domains, having A-B-E-D strands in one beta-sheet and A'-G-F-C-C' in the other. Like the V-set Ig domains, members of the I-set have a discontinuous A strand but lack a C" strand. I-set domains are found in several cell adhesion molecules (such as VCAM, ICAM, and MADCAM), and are also present in numerous other diverse protein families, including several tyrosine-protein kinase receptors, the hemolymph protein hemolin, the muscle proteins titin, telokin, and twitchin, the neuronal adhesion molecule axonin-1, and the signaling molecule semaphorin 4D that is involved in axonal guidance, immune function and angiogenesis.


Pssm-ID: 409543 [Multi-domain]  Cd Length: 94  Bit Score: 71.68  E-value: 5.93e-15
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907141596 1360 PHFSRLLSNVTVMEGSPVTLEVEVTGFPEPTLTWFKKGQKL---CADGHLQVLHKDTKHSVFIPKVCEADAGLYVAQAQN 1436
Cdd:cd20951      1 PEFIIRLQSHTVWEKSDAKLRVEVQGKPDPEVKWYKNGVPIdpsSIPGKYKIESEYGVHVLHIRRVTVEDSAVYSAVAKN 80
                           90
                   ....*....|...
gi 1907141596 1437 SSGTLSSKAILHV 1449
Cdd:cd20951     81 IHGEASSSASVVV 93
SPEC cd00176
Spectrin repeats, found in several proteins involved in cytoskeletal structure; family members ...
132-356 6.96e-10

Spectrin repeats, found in several proteins involved in cytoskeletal structure; family members include spectrin, alpha-actinin and dystrophin; the spectrin repeat forms a three helix bundle with the second helix interrupted by proline in some sequences; the repeats are independent folding units; tandem repeats are found in differing numbers and arrange in an antiparallel manner to form dimers; the repeats are defined by a characteristic tryptophan (W) residue in helix A and a leucine (L) at the carboxyl end of helix C and separated by a linker of 5 residues; two copies of the repeat are present here


Pssm-ID: 238103 [Multi-domain]  Cd Length: 213  Bit Score: 60.54  E-value: 6.96e-10
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907141596  132 EFFQSALEFAIKIDQAEAFLQNPHEFESTEALQSLLLLHDRHAKELLERSldllNKSQQLTDFIEKfkcegstmnseLIQ 211
Cdd:cd00176      4 QFLRDADELEAWLSEKEELLSSTDYGDDLESVEALLKKHEALEAELAAHE----ERVEALNELGEQ-----------LIE 68
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907141596  212 GAQSSCLKIDSLLELLQDRRRQLDKYLQQQRQELSQVLQLCLWDQQENQVSSWFQKAIRDLQEQSLGASLSDNRELICKH 291
Cdd:cd00176     69 EGHPDAEEIQERLEELNQRWEELRELAEERRQRLEEALDLQQFFRDADDLEQWLEEKEAALASEDLGKDLESVEELLKKH 148
                          170       180       190       200       210       220
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*
gi 1907141596  292 EDLIVKAKEWDSAVEKLKSQALGILLSKDLAGKEHLQLSNQKLNRLQEEFGRLMVERKAWLSMAN 356
Cdd:cd00176    149 KELEEELEAHEPRLKSLNELAEELLEEGHPDADEEIEEKLEELNERWEELLELAEERQKKLEEAL 213
SMC_prok_B TIGR02168
chromosome segregation protein SMC, common bacterial type; SMC (structural maintenance of ...
716-1067 3.80e-06

chromosome segregation protein SMC, common bacterial type; SMC (structural maintenance of chromosomes) proteins bind DNA and act in organizing and segregating chromosomes for partition. SMC proteins are found in bacteria, archaea, and eukaryotes. This family represents the SMC protein of most bacteria. The smc gene is often associated with scpB (TIGR00281) and scpA genes, where scp stands for segregation and condensation protein. SMC was shown (in Caulobacter crescentus) to be induced early in S phase but present and bound to DNA throughout the cell cycle. [Cellular processes, Cell division, DNA metabolism, Chromosome-associated proteins]


Pssm-ID: 274008 [Multi-domain]  Cd Length: 1179  Bit Score: 51.98  E-value: 3.80e-06
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907141596  716 SLQSTSDLRRRWIAMKPQLQQLHEDVQqiTKEWEVLSSQgapLKEKAEQLKDLVHLHRRQRERIQEYEEILYKTVQFHQV 795
Cdd:TIGR02168  204 SLERQAEKAERYKELKAELRELELALL--VLRLEELREE---LEELQEELKEAEEELEELTAELQELEEKLEELRLEVSE 278
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907141596  796 KEELVHLIKPRELELLAQpmelasSEEVQMQLGRSQERRAhvdHLHQLALTLGVDIISSVQQP--LKESFKDIKKKFNNL 873
Cdd:TIGR02168  279 LEEEIEELQKELYALANE------ISRLEQQKQILRERLA---NLERQLEELEAQLEELESKLdeLAEELAELEEKLEEL 349
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907141596  874 KFNY-------SKKNEKSRNLKTLQYQIQ-QVDTYAEKIQALRKKMEKVNNKtsdsfLSYpsnkanmLSEAMEDLKKNVD 945
Cdd:TIGR02168  350 KEELesleaelEELEAELEELESRLEELEeQLETLRSKVAQLELQIASLNNE-----IER-------LEARLERLEDRRE 417
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907141596  946 DFDKVVTDYKMNLDLTE--HLQEVIEECNFWYEDASATVVRVgkysmecqtREAVDILHRQFNKFITP--SVPQQEERIQ 1021
Cdd:TIGR02168  418 RLQQEIEELLKKLEEAElkELQAELEELEEELEELQEELERL---------EEALEELREELEEAEQAldAAERELAQLQ 488
                          330       340       350       360
                   ....*....|....*....|....*....|....*....|....*.
gi 1907141596 1022 EVIDLAQRLYGLEEGQKYAEKIVTRHKEILESITELCGSLVELKEK 1067
Cdd:TIGR02168  489 ARLDSLERLQENLEGFSEGVKALLKNQSGLSGILGVLSELISVDEG 534
sbcc TIGR00618
exonuclease SbcC; All proteins in this family for which functions are known are part of an ...
219-790 1.87e-04

exonuclease SbcC; All proteins in this family for which functions are known are part of an exonuclease complex with sbcD homologs. This complex is involved in the initiation of recombination to regulate the levels of palindromic sequences in DNA. This family is based on the phylogenomic analysis of JA Eisen (1999, Ph.D. Thesis, Stanford University). [DNA metabolism, DNA replication, recombination, and repair]


Pssm-ID: 129705 [Multi-domain]  Cd Length: 1042  Bit Score: 46.12  E-value: 1.87e-04
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907141596  219 KIDSLLELLQDRRRQLDKYLQQ------QRQELSQVLQLC-LWDQQENQVSSWF--QKAIRDLQEQslgaSLSDNRELIC 289
Cdd:TIGR00618  308 QAQRIHTELQSKMRSRAKLLMKraahvkQQSSIEEQRRLLqTLHSQEIHIRDAHevATSIREISCQ----QHTLTQHIHT 383
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907141596  290 KHEDL-IVKAKEWDSAVEKLKSQAL-----------GILLSKDLAGKEHLQLSNQKLNRLQEEFGRLMVERKAWLSMAND 357
Cdd:TIGR00618  384 LQQQKtTLTQKLQSLCKELDILQREqatidtrtsafRDLQGQLAHAKKQQELQQRYAELCAAAITCTAQCEKLEKIHLQE 463
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907141596  358 FFTSANKSFDVLGKVEAYLKLLKSEGLSLPVLAAKHEELHREIKDSTATALQKgRTLISQVDSCRSRVTGIHEMMGYIQN 437
Cdd:TIGR00618  464 SAQSLKEREQQLQTKEQIHLQETRKKAVVLARLLELQEEPCPLCGSCIHPNPA-RQDIDNPGPLTRRMQRGEQTYAQLET 542
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907141596  438 RVDCLTEQCTAHEEFARKRQQLATSVDDYLRKVEMSIQEIRPILATTLDVAS-----SPSESEKILNKYLELDIQVKETA 512
Cdd:TIGR00618  543 SEEDVYHQLTSERKQRASLKEQMQEIQQSFSILTQCDNRSKEDIPNLQNITVrlqdlTEKLSEAEDMLACEQHALLRKLQ 622
                          330       340       350       360       370       380       390       400
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907141596  513 HALEAAAKIMTEKN---ELELNEVALLPLKVKWLEEELstlgRSISCRSRILQT-YVAFRKSSEEAEEQLQSLKEFYLTE 588
Cdd:TIGR00618  623 PEQDLQDVRLHLQQcsqELALKLTALHALQLTLTQERV----REHALSIRVLPKeLLASRQLALQKMQSEKEQLTYWKEM 698
                          410       420       430       440       450       460       470       480
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907141596  589 IPWKDEddaVVKCQSNSAERKWQLFlkksfltQDLSLEFLNLI-NMAKENEILN--VKNEMHIMENIMEKQTNGREELSH 665
Cdd:TIGR00618  699 LAQCQT---LLRELETHIEEYDREF-------NEIENASSSLGsDLAAREDALNqsLKELMHQARTVLKARTEAHFNNNE 768
                          490       500       510       520       530       540       550       560
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907141596  666 LRVAWYLKAIEGKPAREQWEMFKEKLTKTTHSVKLLHEVLMPVSALDLGGSLQSTSDLRRRWIAMKPQLQQLHEDVQQIT 745
Cdd:TIGR00618  769 EVTAALQTGAELSHLAAEIQFFNRLREEDTHLLKTLEAEIGQEIPSDEDILNLQCETLVQEEEQFLSRLEEKSATLGEIT 848
                          570       580       590       600
                   ....*....|....*....|....*....|....*....|....*
gi 1907141596  746 KEwevlssqgapLKEKAEQLKDLVHLHRRQRERIQEYEEILYKTV 790
Cdd:TIGR00618  849 HQ----------LLKYEECSKQLAQLTQEQAKIIQLSDKLNGINQ 883
SCP-1 pfam05483
Synaptonemal complex protein 1 (SCP-1); Synaptonemal complex protein 1 (SCP-1) is the major ...
304-798 2.83e-03

Synaptonemal complex protein 1 (SCP-1); Synaptonemal complex protein 1 (SCP-1) is the major component of the transverse filaments of the synaptonemal complex. Synaptonemal complexes are structures that are formed between homologous chromosomes during meiotic prophase.


Pssm-ID: 114219 [Multi-domain]  Cd Length: 787  Bit Score: 42.40  E-value: 2.83e-03
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907141596  304 AVEKLKSQA------LGILLSKDLAGKEHLQLSNQK-LNRLQEEFGRLMVERKAWLSMANDFFTSANKSFDVLGKVEAYL 376
Cdd:pfam05483  198 AFEELRVQAenarleMHFKLKEDHEKIQHLEEEYKKeINDKEKQVSLLLIQITEKENKMKDLTFLLEESRDKANQLEEKT 277
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907141596  377 KLlKSEglSLPVLAAKHEELHREIKDSTATALQKGRTLISQVDSCRSRVTGIHEMMGYIQNRVDCLTEQCTAHEEFARKR 456
Cdd:pfam05483  278 KL-QDE--NLKELIEKKDHLTKELEDIKMSLQRSMSTQKALEEDLQIATKTICQLTEEKEAQMEELNKAKAAHSFVVTEF 354
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907141596  457 QQLATSVDDYLRKVEMSIQEIRPILAT-TLDVASSPSESEKILNKYLELDIQVKETAHALEAAAKIMTEKNELELneval 535
Cdd:pfam05483  355 EATTCSLEELLRTEQQRLEKNEDQLKIiTMELQKKSSELEEMTKFKNNKEVELEELKKILAEDEKLLDEKKQFEK----- 429
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907141596  536 LPLKVKWLEEELSTLGRSISCRSRILQTYVAFRKSSEEA-EEQLQSLKefylTEIPWKDEDDAVVKCQSNsaerkwQLFL 614
Cdd:pfam05483  430 IAEELKGKEQELIFLLQAREKEIHDLEIQLTAIKTSEEHyLKEVEDLK----TELEKEKLKNIELTAHCD------KLLL 499
                          330       340       350       360       370       380       390       400
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907141596  615 KKSFLTQDLSLEFLNLINmaKENEILNVKNEMHIM----ENIMEKQTNGREELSHLRvawylkaiegKPAREQWEMFKEK 690
Cdd:pfam05483  500 ENKELTQEASDMTLELKK--HQEDIINCKKQEERMlkqiENLEEKEMNLRDELESVR----------EEFIQKGDEVKCK 567
                          410       420       430       440       450       460       470       480
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907141596  691 LTKTTHSVKLLhEVLMPVSALDLGGSLQSTSDLRRRWIAMKPQLQQLHEDVQQITKEWEVLSSQGAPLKEKAEQLK-DLV 769
Cdd:pfam05483  568 LDKSEENARSI-EYEVLKKEKQMKILENKCNNLKKQIENKNKNIEELHQENKALKKKGSAENKQLNAYEIKVNKLElELA 646
                          490       500
                   ....*....|....*....|....*....
gi 1907141596  770 HLHRRQRERIQEYEeilyKTVQFHQVKEE 798
Cdd:pfam05483  647 SAKQKFEEIIDNYQ----KEIEDKKISEE 671
SPEC cd00176
Spectrin repeats, found in several proteins involved in cytoskeletal structure; family members ...
259-458 9.06e-03

Spectrin repeats, found in several proteins involved in cytoskeletal structure; family members include spectrin, alpha-actinin and dystrophin; the spectrin repeat forms a three helix bundle with the second helix interrupted by proline in some sequences; the repeats are independent folding units; tandem repeats are found in differing numbers and arrange in an antiparallel manner to form dimers; the repeats are defined by a characteristic tryptophan (W) residue in helix A and a leucine (L) at the carboxyl end of helix C and separated by a linker of 5 residues; two copies of the repeat are present here


Pssm-ID: 238103 [Multi-domain]  Cd Length: 213  Bit Score: 39.35  E-value: 9.06e-03
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907141596  259 NQVSSWFQKAIRDLQEQSLGASLSDNRELICKHEDLIVKAKEWDSAVEKLKSQALGiLLSKDLAGKEHLQLSNQKLNRLQ 338
Cdd:cd00176     10 DELEAWLSEKEELLSSTDYGDDLESVEALLKKHEALEAELAAHEERVEALNELGEQ-LIEEGHPDAEEIQERLEELNQRW 88
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907141596  339 EEFGRLMVERKAWLSMANDFFTSANKSFDVLGKVEAYLKLLKSE--GLSLPVLAAKHEElHREIKDSTATALQKGRTLIS 416
Cdd:cd00176     89 EELRELAEERRQRLEEALDLQQFFRDADDLEQWLEEKEAALASEdlGKDLESVEELLKK-HKELEEELEAHEPRLKSLNE 167
                          170       180       190       200
                   ....*....|....*....|....*....|....*....|..
gi 1907141596  417 QVDSCRSRvtGIHEMMGYIQNRVDCLTEQCTAHEEFARKRQQ 458
Cdd:cd00176    168 LAEELLEE--GHPDADEEIEEKLEELNERWEELLELAEERQK 207
 
Name Accession Description Interval E-value
I-set pfam07679
Immunoglobulin I-set domain;
1360-1449 1.94e-25

Immunoglobulin I-set domain;


Pssm-ID: 400151 [Multi-domain]  Cd Length: 90  Bit Score: 101.57  E-value: 1.94e-25
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907141596 1360 PHFSRLLSNVTVMEGSPVTLEVEVTGFPEPTLTWFKKGQKLCADGHLQVLHKDTKHSVFIPKVCEADAGLYVAQAQNSSG 1439
Cdd:pfam07679    1 PKFTQKPKDVEVQEGESARFTCTVTGTPDPEVSWFKDGQPLRSSDRFKVTYEGGTYTLTISNVQPDDSGKYTCVATNSAG 80
                           90
                   ....*....|
gi 1907141596 1440 TLSSKAILHV 1449
Cdd:pfam07679   81 EAEASAELTV 90
IG_like smart00410
Immunoglobulin like; IG domains that cannot be classified into one of IGv1, IGc1, IGc2, IG.
1367-1449 5.84e-17

Immunoglobulin like; IG domains that cannot be classified into one of IGv1, IGc1, IGc2, IG.


Pssm-ID: 214653 [Multi-domain]  Cd Length: 85  Bit Score: 77.16  E-value: 5.84e-17
                            10        20        30        40        50        60        70        80
                    ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907141596  1367 SNVTVMEGSPVTLEVEVTGFPEPTLTWFKKGQK-LCADGHLQVLHKDTKHSVFIPKVCEADAGLYVAQAQNSSGTLSSKA 1445
Cdd:smart00410    2 PSVTVKEGESVTLSCEASGSPPPEVTWYKQGGKlLAESGRFSVSRSGSTSTLTISNVTPEDSGTYTCAATNSSGSASSGT 81

                    ....
gi 1907141596  1446 ILHV 1449
Cdd:smart00410   82 TLTV 85
IgI_titin_I1-like cd20951
Immunoglobulin domain I1 of the titin I-band and similar proteins; a member of the I-set of ...
1360-1449 5.93e-15

Immunoglobulin domain I1 of the titin I-band and similar proteins; a member of the I-set of IgSF domains; The members here are composed of the immunoglobulin domain I1 of the titin I-band and similar proteins. Titin is a key component in the assembly and functioning of vertebrate striated muscles. By providing connections at the level of individual microfilaments, it contributes to the fine balance of forces between the two halves of the sarcomere. The Ig superfamily (IgSF) is a heterogenous group of proteins, built on a common fold comprised of a sandwich of two beta sheets. The two sheets are linked together by a conserved disulfide bond between B strand and F strand. IgSF domains can be divided into 4 main classes based on their structures and sequences: the Variable (V), Constant 1 (C1), Constant 2 (C2), and Intermediate (I) sets. The Ig I1 domain of the titin I-band is a member of the I-set Ig domains, having A-B-E-D strands in one beta-sheet and A'-G-F-C-C' in the other. Like the V-set Ig domains, members of the I-set have a discontinuous A strand but lack a C" strand. I-set domains are found in several cell adhesion molecules (such as VCAM, ICAM, and MADCAM), and are also present in numerous other diverse protein families, including several tyrosine-protein kinase receptors, the hemolymph protein hemolin, the muscle proteins titin, telokin, and twitchin, the neuronal adhesion molecule axonin-1, and the signaling molecule semaphorin 4D that is involved in axonal guidance, immune function and angiogenesis.


Pssm-ID: 409543 [Multi-domain]  Cd Length: 94  Bit Score: 71.68  E-value: 5.93e-15
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907141596 1360 PHFSRLLSNVTVMEGSPVTLEVEVTGFPEPTLTWFKKGQKL---CADGHLQVLHKDTKHSVFIPKVCEADAGLYVAQAQN 1436
Cdd:cd20951      1 PEFIIRLQSHTVWEKSDAKLRVEVQGKPDPEVKWYKNGVPIdpsSIPGKYKIESEYGVHVLHIRRVTVEDSAVYSAVAKN 80
                           90
                   ....*....|...
gi 1907141596 1437 SSGTLSSKAILHV 1449
Cdd:cd20951     81 IHGEASSSASVVV 93
IgI_1_Titin_Z1z2-like cd20974
First Ig-like domain of the giant muscle protein titin Z1z2 in the sarcomeric Z-disk and ...
1360-1450 2.89e-14

First Ig-like domain of the giant muscle protein titin Z1z2 in the sarcomeric Z-disk and similar proteins; a member of the I-set of IgSF domains; The members here are composed of the first immunoglobulin (Ig)-like domain of the giant muscle protein titin Z1z2 in the sarcomeric Z-disk and similar proteins. Titin is a key component in the assembly and functioning of vertebrate striated muscles. By providing connections at the level of individual microfilaments, it contributes to the fine balance of forces between the two halves of the sarcomere. The Ig superfamily (IgSF) is a heterogenous group of proteins, built on a common fold comprised of a sandwich of two beta sheets. IgSF domains can be divided into 4 main classes based on their structures and sequences: the Variable (V), Constant 1 (C1), Constant 2 (C2), and Intermediate (I) sets. Unlike the V-set, one of the distinctive features of I-set domains is the lack of a C" strand. The structure of the titin Z1z2 lacks this strand and thus it belongs to the I-set of the IgSF. I-set domains are found in several cell adhesion molecules (such as VCAM, ICAM, and MADCAM), and are also present in numerous other diverse protein families, including several tyrosine-protein kinase receptors, the hemolymph protein hemolin, the muscle proteins titin, telokin, and twitchin, the neuronal adhesion molecule axonin-1, and the signaling molecule semaphorin 4D that is involved in axonal guidance, immune function and angiogenesis.


Pssm-ID: 409566 [Multi-domain]  Cd Length: 93  Bit Score: 69.69  E-value: 2.89e-14
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907141596 1360 PHFSRLLSNVTVMEGSPVTLEVEVTGFPEPTLTWFKKGQ--KLCADGHLQVLHKDTKHSVFIPKVCEADAGLYVAQAQNS 1437
Cdd:cd20974      1 PVFTQPLQSVVVLEGSTATFEAHVSGKPVPEVSWFRDGQviSTSTLPGVQISFSDGRAKLSIPAVTKANSGRYSLTATNG 80
                           90
                   ....*....|...
gi 1907141596 1438 SGTLSSKAILHVT 1450
Cdd:cd20974     81 SGQATSTAELLVL 93
IgI_2_Titin_Z1z2-like cd20972
Second Ig-like domain of the giant muscle protein titin Z1z2 in the sarcomeric Z-disk, and ...
1359-1449 1.32e-13

Second Ig-like domain of the giant muscle protein titin Z1z2 in the sarcomeric Z-disk, and similar domains; a member of the I-set of IgSF domains; The members here are composed of the second immunoglobulin (Ig)-like domain of the giant muscle protein titin Z1z2 in the sarcomeric Z-disk and similar proteins. Titin is a key component in the assembly and functioning of vertebrate striated muscles. By providing connections at the level of individual microfilaments, it contributes to the fine balance of forces between the two halves of the sarcomere. The Ig superfamily (IgSF) is a heterogenous group of proteins, built on a common fold comprised of a sandwich of two beta sheets. IgSF domains can be divided into 4 main classes based on their structures and sequences: the Variable (V), Constant 1 (C1), Constant 2 (C2), and Intermediate (I) sets. Unlike the V-set, one of the distinctive features of I-set domains is the lack of a C" strand. The structure of the titin Z1z2 lacks this strand and thus it belongs to the I-set of the IgSF. I-set domains are found in several cell adhesion molecules (such as VCAM, ICAM, and MADCAM), and are also present in numerous other diverse protein families, including several tyrosine-protein kinase receptors, the hemolymph protein hemolin, the muscle proteins titin, telokin, and twitchin, the neuronal adhesion molecule axonin-1, and the signaling molecule semaphorin 4D that is involved in axonal guidance, immune function and angiogenesis.


Pssm-ID: 409564 [Multi-domain]  Cd Length: 91  Bit Score: 67.99  E-value: 1.32e-13
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907141596 1359 APHFSRLLSNVTVMEGSPVTLEVEVTGFPEPTLTWFKKGQKLCADGHLQVLHKDTKHSVFIPKVCEADAGLYVAQAQNSS 1438
Cdd:cd20972      1 PPQFIQKLRSQEVAEGSKVRLECRVTGNPTPVVRWFCEGKELQNSPDIQIHQEGDLHSLIIAEAFEEDTGRYSCLATNSV 80
                           90
                   ....*....|.
gi 1907141596 1439 GTLSSKAILHV 1449
Cdd:cd20972     81 GSDTTSAEIFV 91
Ig_Titin_like cd05748
Immunoglobulin (Ig)-like domain of titin and similar proteins; The members here are composed ...
1369-1449 2.72e-13

Immunoglobulin (Ig)-like domain of titin and similar proteins; The members here are composed of the immunoglobulin (Ig)-like domain found in titin-like proteins and similar proteins. Titin (also called connectin) is a fibrous sarcomeric protein specifically found in vertebrate striated muscle. Titin is a giant protein; depending on isoform composition, it ranges from 2970 to 3700 kDa, and is of a length that spans half a sarcomere. Titin largely consists of multiple repeats of Ig-like and fibronectin type 3 (FN-III)-like domains. Titin connects the ends of myosin thick filaments to Z disks and extends along the thick filament to the H zone. It appears to function similarly to an elastic band, keeping the myosin filaments centered in the sarcomere during muscle contraction or stretching. Within the sarcomere, titin is also attached to or is associated with myosin binding protein C (MyBP-C). MyBP-C appears to contribute to the generation of passive tension by titin and like titin has repeated Ig-like and FN-III domains. Also included in this group are worm twitchin and insect projectin, thick filament proteins of invertebrate muscle which also have repeated Ig-like and FN-III domains.


Pssm-ID: 409406 [Multi-domain]  Cd Length: 82  Bit Score: 66.46  E-value: 2.72e-13
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907141596 1369 VTVMEGSPVTLEVEVTGFPEPTLTWFKKGQKLCADGHLQVLHKDTKHSVFIPKVCEADAGLYVAQAQNSSGTlsSKAILH 1448
Cdd:cd05748      2 IVVRAGESLRLDIPIKGRPTPTVTWSKDGQPLKETGRVQIETTASSTSLVIKNAKRSDSGKYTLTLKNSAGE--KSATIN 79

                   .
gi 1907141596 1449 V 1449
Cdd:cd05748     80 V 80
Ig_3 pfam13927
Immunoglobulin domain; This family contains immunoglobulin-like domains.
1360-1436 3.21e-12

Immunoglobulin domain; This family contains immunoglobulin-like domains.


Pssm-ID: 464046 [Multi-domain]  Cd Length: 78  Bit Score: 63.35  E-value: 3.21e-12
                           10        20        30        40        50        60        70
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*..
gi 1907141596 1360 PHFSRLLSNVTVMEGSPVTLEVEVTGFPEPTLTWFKKGQKLCADGHLQVLHKDTKHSVFIPKVCEADAGLYVAQAQN 1436
Cdd:pfam13927    2 PVITVSPSSVTVREGETVTLTCEATGSPPPTITWYKNGEPISSGSTRSRSLSGSNSTLTISNVTRSDAGTYTCVASN 78
Ig cd00096
Immunoglobulin domain; The members here are composed of the immunoglobulin (Ig) domain found ...
1377-1446 1.43e-10

Immunoglobulin domain; The members here are composed of the immunoglobulin (Ig) domain found in the Ig superfamily. The Ig superfamily is a heterogenous group of proteins, built on a common fold comprised of a sandwich of two beta sheets. Members of this group are components of immunoglobulin, neuroglia, cell surface glycoproteins, including T-cell receptors, CD2, CD4, CD8, and membrane glycoproteins, including butyrophilin and chondroitin sulfate proteoglycan core protein. A predominant feature of most Ig domains is a disulfide bridge connecting the two beta-sheets with a tryptophan residue packed against the disulfide bond. Ig superfamily (IgSF) domains can be divided into 4 main classes based on their structures and sequences: the Variable (V), Constant 1 (C1), Constant 2 (C2), and Intermediate (I) sets. Typically, the V-set domains have A, B, E, and D strands in one sheet and A', G, F, C, C' and C" in the other. The structures in C1-set are smaller than those in the V-set; they have one beta sheet that is formed by strands A, B, E, and D and the other by strands G, F, C, and C'. Moreover, a C1-set Ig domain contains a short C' strand (three residues) and lacks A' and C" strand. Unlike other Ig domain sets, C2-set structures do not have a D strand. Like the V-set Ig domains, members of the I-set have a discontinuous A strand, but lack a C" strand.


Pssm-ID: 409353 [Multi-domain]  Cd Length: 70  Bit Score: 58.49  E-value: 1.43e-10
                           10        20        30        40        50        60        70
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907141596 1377 VTLEVEVTGFPEPTLTWFKKGQKLCADGHLQVLHKDTKHSVFIPKVCEADAGLYVAQAQNSSGTLSSKAI 1446
Cdd:cd00096      1 VTLTCSASGNPPPTITWYKNGKPLPPSSRDSRRSELGNGTLTISNVTLEDSGTYTCVASNSAGGSASASV 70
IgI_Myotilin_C_like cd05744
Immunoglobulin (Ig)-like domain of myotilin, palladin, and myopalladin; member of the I-set of ...
1360-1449 3.01e-10

Immunoglobulin (Ig)-like domain of myotilin, palladin, and myopalladin; member of the I-set of Ig superfamily (IgSF) domains; The members here are composed of the immunoglobulin (Ig)-like domain in myotilin, palladin, and myopalladin. Myotilin, palladin, and myopalladin function as scaffolds that regulate actin organization. Myotilin and myopalladin are most abundant in skeletal and cardiac muscle; palladin is ubiquitously expressed in the organs of developing vertebrates and plays a key role in cellular morphogenesis. The three family members each interact with specific molecular partners with all three binding to alpha-actinin; In addition, palladin also binds to vasodilator-stimulated phosphoprotein (VASP) and ezrin, myotilin binds to filamin and actin, and myopalladin also binds to nebulin and cardiac ankyrin repeat protein (CARP). This group belongs to the I-set of IgSF domains, having A-B-E-D strands in one beta-sheet and A'-G-F-C-C' in the other. Like the V-set Ig domains, members of the I-set have a discontinuous A strand, but lack a C" strand.


Pssm-ID: 409405 [Multi-domain]  Cd Length: 91  Bit Score: 58.28  E-value: 3.01e-10
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907141596 1360 PHFSRLLSNVTVMEGSPVTLEVEVTGFPEPTLTWFKKGQKLCADG-HLQVLHKDTKHSVFIPKVCEADAGLYVAQAQNSS 1438
Cdd:cd05744      1 PHFLQAPGDLEVQEGRLCRFDCKVSGLPTPDLFWQLNGKPVRPDSaHKMLVRENGRHSLIIEPVTKRDAGIYTCIARNRA 80
                           90
                   ....*....|.
gi 1907141596 1439 GTLSSKAILHV 1449
Cdd:cd05744     81 GENSFNAELVV 91
IgI_4_hemolin-like cd20978
Fourth immunoglobulin (Ig)-like domain of hemolin, and similar domains; a member of the I-set ...
1360-1449 3.07e-10

Fourth immunoglobulin (Ig)-like domain of hemolin, and similar domains; a member of the I-set of IgSF domains; The members here are composed of the fourth immunoglobulin (Ig)-like domain of hemolin and similar proteins. Hemolin, an insect immunoglobulin superfamily (IgSF) member containing four Ig-like domains, is a lipopolysaccharide-binding immune protein induced during bacterial infection. Hemolin shares significant sequence similarity with the first four Ig-like domains of the transmembrane cell adhesion molecules (CAMs) of the L1 family. IgSF domains can be divided into 4 main classes based on their structures and sequences: the Variable (V), Constant 1 (C1), Constant 2 (C2), and Intermediate (I) sets. The fourth Ig-like domain of hemolin is a member of the I-set Ig domains, having A-B-E-D strands in one beta-sheet and A'-G-F-C-C' in the other. Like the V-set domains, members of the I-set have a discontinuous A strand but lack a C" strand. I-set domains are found in several cell adhesion molecules (such as VCAM, ICAM, and MADCAM), and are also present in numerous other diverse protein families, including several tyrosine-protein kinase receptors, the muscle proteins titin, telokin, and twitchin, the neuronal adhesion molecule axonin-1, and the signaling molecule semaphorin 4D that is involved in axonal guidance, immune function and angiogenesis.


Pssm-ID: 409570 [Multi-domain]  Cd Length: 88  Bit Score: 58.17  E-value: 3.07e-10
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907141596 1360 PHF-SRLLSNVTVMEGSPVTLEVEVTGFPEPTLTWFKKGQKLCAD-GHLQVlhkdTKHSVFIPKVCEADAGLYVAQAQNS 1437
Cdd:cd20978      1 PKFiQKPEKNVVVKGGQDVTLPCQVTGVPQPKITWLHNGKPLQGPmERATV----EDGTLTIINVQPEDTGYYGCVATNE 76
                           90
                   ....*....|..
gi 1907141596 1438 SGTLSSKAILHV 1449
Cdd:cd20978     77 IGDIYTETLLHV 88
SPEC cd00176
Spectrin repeats, found in several proteins involved in cytoskeletal structure; family members ...
132-356 6.96e-10

Spectrin repeats, found in several proteins involved in cytoskeletal structure; family members include spectrin, alpha-actinin and dystrophin; the spectrin repeat forms a three helix bundle with the second helix interrupted by proline in some sequences; the repeats are independent folding units; tandem repeats are found in differing numbers and arrange in an antiparallel manner to form dimers; the repeats are defined by a characteristic tryptophan (W) residue in helix A and a leucine (L) at the carboxyl end of helix C and separated by a linker of 5 residues; two copies of the repeat are present here


Pssm-ID: 238103 [Multi-domain]  Cd Length: 213  Bit Score: 60.54  E-value: 6.96e-10
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907141596  132 EFFQSALEFAIKIDQAEAFLQNPHEFESTEALQSLLLLHDRHAKELLERSldllNKSQQLTDFIEKfkcegstmnseLIQ 211
Cdd:cd00176      4 QFLRDADELEAWLSEKEELLSSTDYGDDLESVEALLKKHEALEAELAAHE----ERVEALNELGEQ-----------LIE 68
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907141596  212 GAQSSCLKIDSLLELLQDRRRQLDKYLQQQRQELSQVLQLCLWDQQENQVSSWFQKAIRDLQEQSLGASLSDNRELICKH 291
Cdd:cd00176     69 EGHPDAEEIQERLEELNQRWEELRELAEERRQRLEEALDLQQFFRDADDLEQWLEEKEAALASEDLGKDLESVEELLKKH 148
                          170       180       190       200       210       220
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*
gi 1907141596  292 EDLIVKAKEWDSAVEKLKSQALGILLSKDLAGKEHLQLSNQKLNRLQEEFGRLMVERKAWLSMAN 356
Cdd:cd00176    149 KELEEELEAHEPRLKSLNELAEELLEEGHPDADEEIEEKLEELNERWEELLELAEERQKKLEEAL 213
IgI_3_Robo cd05725
Third immunoglobulin (Ig)-like domain in Robo (roundabout) receptors; member of the I-set of ...
1367-1449 7.68e-10

Third immunoglobulin (Ig)-like domain in Robo (roundabout) receptors; member of the I-set of Ig superfamily (IgSF) domains; The members here are composed of the third immunoglobulin (Ig)-like domain in Robo (roundabout) receptors. Robo receptors play a role in the development of the central nervous system (CNS), and are receptors of Slit protein. Slit is a repellant secreted by the neural cells in the midline. Slit acts through Robo to prevent most neurons from crossing the midline from either side. Three mammalian Robo homologs (Robo1, Robo2, Robo3), and three mammalian Slit homologs (Slit-1,Slit-2, Slit-3), have been identified. Commissural axons, which cross the midline, express low levels of Robo; longitudinal axons, which avoid the midline, express high levels of Robo. Robo1, Robo2, and Robo3 are expressed by commissural neurons in the vertebrate spinal cord and Slit-1, Slit-2, and Slit-3 are expressed at the ventral midline. Robo-3 is a divergent member of the Robo family which instead of being a positive regulator of Slit responsiveness, antagonizes Slit responsiveness in precrossing axons. The Slit-Robo interaction is mediated by the second leucine-rich repeat (LRR) domain of Slit and the two N-terminal Ig domains of Robo, Ig1 and Ig2. The primary Robo binding site for Slit2 has been shown by surface plasmon resonance experiments and mutational analysis to be the Ig1 domain, while the Ig2 domain has been proposed to harbor a weak secondary binding site. This group belongs to the I-set of IgSF domains, having A-B-E-D strands in one beta-sheet and A'-G-F-C-C' in the other. Like the V-set Ig domains, members of the I-set have a discontinuous A strand, but lack a C" strand.


Pssm-ID: 409390 [Multi-domain]  Cd Length: 83  Bit Score: 56.63  E-value: 7.68e-10
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907141596 1367 SNVTVMEGSPVTLEVEVTGFPEPTLTWFKKGQKLcADGHLQVLHKdtkHSVFIPKVCEADAGLYVAQAQNSSGTLSSKAI 1446
Cdd:cd05725      5 QNQVVLVDDSAEFQCEVGGDPVPTVRWRKEDGEL-PKGRYEILDD---HSLKIRKVTAGDMGSYTCVAENMVGKIEASAT 80

                   ...
gi 1907141596 1447 LHV 1449
Cdd:cd05725     81 LTV 83
IgC_1_Robo cd07693
First immunoglobulin (Ig)-like constant domain in Robo (roundabout) receptors, and similar ...
1367-1449 3.51e-09

First immunoglobulin (Ig)-like constant domain in Robo (roundabout) receptors, and similar domains; The members here are composed of the first immunoglobulin (Ig)-like domain in Roundabout (Robo) receptors. Robo receptors play a role in the development of the central nervous system (CNS), and are receptors of Slit protein. Slit is a repellant secreted by the neural cells in the midline. Slit acts through Robo to prevent most neurons from crossing the midline from either side. Three mammalian Robo homologs (Robo1, Robo2, and Robo3), and three mammalian Slit homologs (Slit1, Slit2, Slit3), have been identified. Commissural axons, which cross the midline, express low levels of Robo; longitudinal axons, which avoid the midline, express high levels of Robo. Robo1, Robo2, and Robo3 are expressed by commissural neurons in the vertebrate spinal cord and Slit1, Slit2,and Slit3 are expressed at the ventral midline. Robo3 is a divergent member of the Robo family which instead of being a positive regulator of Slit responsiveness, antagonizes Slit responsiveness in precrossing axons. The Slit-Robo interaction is mediated by the second leucine-rich repeat (LRR) domain of Slit and the two N-terminal Ig domains of Robo, Ig1 and Ig2. The primary Robo binding site for Slit2 has been shown by surface plasmon resonance experiments and mutational analysis to be is the Ig1 domain, while the Ig2 domain has been proposed to harbor a weak secondary binding site.


Pssm-ID: 409490 [Multi-domain]  Cd Length: 99  Bit Score: 55.64  E-value: 3.51e-09
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907141596 1367 SNVTVMEGSPVTLEVEVTGFPEPTLTWFKKGQKLCADghlqVLHKDTKH------SVFIPKVC-----EADAGLYVAQAQ 1435
Cdd:cd07693      8 SDLIVSKGDPATLNCKAEGRPTPTIQWLKNGQPLETD----KDDPRSHRivlpsgSLFFLRVVhgrkgRSDEGVYVCVAH 83
                           90
                   ....*....|....*
gi 1907141596 1436 NSSGTLSSK-AILHV 1449
Cdd:cd07693     84 NSLGEAVSRnASLEV 98
IgI_2_Palladin_C cd20990
Second C-terminal immunoglobulin (Ig)-like domain of palladin; member of the I-set of Ig ...
1360-1439 3.62e-09

Second C-terminal immunoglobulin (Ig)-like domain of palladin; member of the I-set of Ig superfamily (IgSF) domains; The members here are composed of the C-terminal immunoglobulin (Ig)-like domain of palladin. Palladin belongs to the palladin-myotilin-myopalladin family. Proteins belonging to this family contain multiple Ig-like domains and function as scaffolds, modulating actin cytoskeleton. Palladin binds to alpha-actinin ezrin, vasodilator-stimulated phosphoprotein VASP, SPIN90 (also known as DIP or mDia interacting protein), and Src. Palladin also binds F-actin directly, via its Ig3 domain. Palladin is expressed as several alternatively spliced isoforms, having various combinations of Ig-like domains, in a cell-type-specific manner. It has been suggested that palladin's different Ig-like domains may be specialized for distinct functions. This group belongs to the I-set of IgSF domains, having A-B-E-D strands in one beta-sheet and A'-G-F-C-C' in the other. Like the V-set Ig domains, members of the I-set have a discontinuous A strand, but lack a C" strand.


Pssm-ID: 409582  Cd Length: 91  Bit Score: 55.11  E-value: 3.62e-09
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907141596 1360 PHFSRLLSNVTVMEGSPVTLEVEVTGFPEPTLTWFKKGQKLCAD-GHLQVLHKDTKHSVFIPKVCEADAGLYVAQAQNSS 1438
Cdd:cd20990      1 PHFLQAPGDLTVQEGKLCRMDCKVSGLPTPDLSWQLDGKPIRPDsAHKMLVRENGVHSLIIEPVTSRDAGIYTCIATNRA 80

                   .
gi 1907141596 1439 G 1439
Cdd:cd20990     81 G 81
IgI_4_MYLK-like cd20976
Fourth Ig-like domain from smooth muscle myosin light chain kinase and similar domains ; a ...
1359-1449 4.21e-09

Fourth Ig-like domain from smooth muscle myosin light chain kinase and similar domains ; a member of the I-set of IgSF domains; The members here are composed of the fourth immunoglobulin (Ig)-like domain from smooth muscle myosin light chain kinase (MYLK) and similar domains. The Ig superfamily (IgSF) is a heterogenous group of proteins, built on a common fold comprised of a sandwich of two beta sheets. IgSF domains can be divided into 4 main classes based on their structures and sequences: the Variable (V), Constant 1 (C1), Constant 2 (C2), and Intermediate (I) sets. Unlike the V-set, one of the distinctive features of I-set domains is the lack of a C" strand. The structure of this group shows that the fourth Ig-like domain from myosin light chain kinase lacks this strand and thus belongs to the I-set of the IgSF. I-set domains are found in several cell adhesion molecules (such as VCAM, ICAM, and MADCAM), and are also present in numerous other diverse protein families, including several tyrosine-protein kinase receptors, the hemolymph protein hemolin, the muscle proteins titin, telokin, and twitchin, the neuronal adhesion molecule axonin-1, and the signaling molecule semaphorin 4D that is involved in axonal guidance, immune function and angiogenesis.


Pssm-ID: 409568 [Multi-domain]  Cd Length: 90  Bit Score: 54.95  E-value: 4.21e-09
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907141596 1359 APHFSRLLSNVTVMEGSPVTLEVEVTGFPEPTLTWFKKGQKLCADG-HLQVLHKDTKhsVFIPKVCEADAGLYVAQAQNS 1437
Cdd:cd20976      1 APSFSSVPKDLEAVEGQDFVAQCSARGKPVPRITWIRNAQPLQYAAdRSTCEAGVGE--LHIQDVLPEDHGTYTCLAKNA 78
                           90
                   ....*....|..
gi 1907141596 1438 SGTLSSKAILHV 1449
Cdd:cd20976     79 AGQVSCSAWVTV 90
IgI_telokin-like cd20973
immunoglobulin-like domain of telokin and similar proteins; a member of the I-set of IgSF ...
1363-1449 1.34e-08

immunoglobulin-like domain of telokin and similar proteins; a member of the I-set of IgSF domains; The members here are composed of the immunoglobulin (Ig) domain in telokin, the C-terminal domain of myosin light chain kinase which is identical to telokin, and similar proteins. The Ig superfamily (IgSF) is a heterogenous group of proteins, built on a common fold comprised of a sandwich of two beta sheets. IgSF domains can be divided into 4 main classes based on their structures and sequences: the Variable (V), Constant 1 (C1), Constant 2 (C2), and Intermediate (I) sets. Unlike the V-set, one of the distinctive features of I-set domains is the lack of a C" strand. The structure of the telokin Ig domain lacks this strand and thus it belongs to the I-set of the IgSF. I-set domains are found in several cell adhesion molecules (such as VCAM, ICAM, and MADCAM), and are also present in numerous other diverse protein families, including several tyrosine-protein kinase receptors, the hemolymph protein hemolin, the muscle proteins titin, telokin, and twitchin, the neuronal adhesion molecule axonin-1, and the signaling molecule semaphorin 4D that is involved in axonal guidance, immune function and angiogenesis.


Pssm-ID: 409565 [Multi-domain]  Cd Length: 88  Bit Score: 53.35  E-value: 1.34e-08
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907141596 1363 SRLLSNVTVMEGSPVTLEVEVTGFPEPTLTWFKKGQKLCADGHLQVLH-KDTKHSVFIPKVCEADAGLYVAQAQNSSGTL 1441
Cdd:cd20973      1 IQTLRDKEVVEGSAARFDCKVEGYPDPEVKWMKDDNPIVESRRFQIDQdEDGLCSLIISDVCGDDSGKYTCKAVNSLGEA 80

                   ....*...
gi 1907141596 1442 SSKAILHV 1449
Cdd:cd20973     81 TCSAELTV 88
IgI_Titin_like cd05747
Immunoglobulin (Ig)-like domain of human titin C terminus and similar proteins; member of the ...
1368-1439 3.36e-08

Immunoglobulin (Ig)-like domain of human titin C terminus and similar proteins; member of the I-set of Ig superfamily (IgSF) domains; The members here are composed of the fifth immunoglobulin (Ig)-like domain from the C-terminus of human titin x and similar proteins. Titin (also called connectin) is a fibrous sarcomeric protein specifically found in vertebrate striated muscle. Titin is gigantic; depending on isoform composition it ranges from 2970 to 3700 kDa, and is of a length that spans half a sarcomere. Titin largely consists of multiple repeats of Ig-like and fibronectin type 3 (FN-III)-like domains. Titin connects the ends of myosin thick filaments to Z disks and extends along the thick filament to the H zone and appears to function similar to an elastic band, keeping the myosin filaments centered in the sarcomere during muscle contraction or stretching. This group belongs to the I-set of IgSF domains, having A-B-E-D strands in one beta-sheet and A'-G-F-C-C' in the other. Like the V-set Ig domains, members of the I-set have a discontinuous A strand, but lack a C" strand.


Pssm-ID: 143224 [Multi-domain]  Cd Length: 92  Bit Score: 52.36  E-value: 3.36e-08
                           10        20        30        40        50        60        70
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|..
gi 1907141596 1368 NVTVMEGSPVTLEVEVTGFPEPTLTWFKKGQKLCADGHLQVLHKDTKHSVFIPKVCEADAGLYVAQAQNSSG 1439
Cdd:cd05747     12 SLTVSEGESARFSCDVDGEPAPTVTWMREGQIIVSSQRHQITSTEYKSTFEISKVQMSDEGNYTVVVENSEG 83
IgI_1_MuSK cd20970
agrin-responsive first immunoglobulin-like domains (Ig1) of the MuSK ectodomain; a member of ...
1366-1449 5.51e-08

agrin-responsive first immunoglobulin-like domains (Ig1) of the MuSK ectodomain; a member of the I-set of IgSF domains; The members here are composed of the first immunoglobulin-like domains (Ig1) of the Muscle-specific kinase (MuSK). MuSK is a receptor tyrosine kinase specifically expressed in skeletal muscle, where it plays a central role in the formation and maintenance of the neuromuscular junction (NMJ). MuSK is activated by agrin, a neuron-derived heparan sulfate proteoglycan. The activation of MUSK in myotubes regulates the formation of NMJs through the regulation of different processes including the specific expression of genes in subsynaptic nuclei, the reorganization of the actin cytoskeleton and the clustering of the acetylcholine receptors (AChR) in the postsynaptic membrane. The Ig superfamily (IgSF) is a heterogenous group of proteins, built on a common fold comprised of a sandwich of two beta sheets. IgSF domains can be divided into 4 main classes based on their structures and sequences: the Variable (V), Constant 1 (C1), Constant 2 (C2), and Intermediate (I) sets. Unlike the V-set, one of the distinctive features of I-set domains is the lack of a C" strand. The structure of the MuSK lacks this strand and thus it belongs to the I-set of the IgSF. I-set domains are found in several cell adhesion molecules (such as VCAM, ICAM, and MADCAM), and are also present in numerous other diverse protein families, including several tyrosine-protein kinase receptors, the hemolymph protein hemolin, the muscle proteins titin, telokin, and twitchin, the neuronal adhesion molecule axonin-1, and the signaling molecule semaphorin 4D that is involved in axonal guidance, immune function and angiogenesis.


Pssm-ID: 409562 [Multi-domain]  Cd Length: 92  Bit Score: 51.74  E-value: 5.51e-08
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907141596 1366 LSNVTVMEGSPVTLEVEVTGFPEPTLTWFKKGQklcadghlQVLHKDTKHSVF-------IPKVCEADAGLYVAQAQN-S 1437
Cdd:cd20970      9 SFTVTAREGENATFMCRAEGSPEPEISWTRNGN--------LIIEFNTRYIVRengttltIRNIRRSDMGIYLCIASNgV 80
                           90
                   ....*....|..
gi 1907141596 1438 SGTLSSKAILHV 1449
Cdd:cd20970     81 PGSVEKRITLQV 92
IgI_LRIG1-like cd05763
Immunoglobulin (Ig)-like ectodomain of the LRIG1 (Leucine-rich Repeats And Immunoglobulin-like ...
1362-1449 9.55e-08

Immunoglobulin (Ig)-like ectodomain of the LRIG1 (Leucine-rich Repeats And Immunoglobulin-like Domains Protein 1) and similar proteins; member of the I-set of IgSF domains; The members here are composed of subgroup of the immunoglobulin (Ig) domain found in the Ig superfamily. The Ig superfamily is a heterogenous group of proteins, built on a common fold comprised of a sandwich of two beta sheets. A predominant feature of most Ig domains is a disulfide bridge connecting the two beta-sheets with a tryptophan residue packed against the disulfide bond. The ectodomain of LRIG1 has two distinct regions: the proposed 15 LRRs and three Ig-like domains closer to the membrane. LRIG1 has been reported to interact with many receptor tyrosine kinases, GDNF/c-Ret, E-cadherin, JAK/STAT, c-Met, and the EGFR family signaling systems. Immunoglobulin Superfamily (IgSF) domains can be divided into 4 main classes based on their structures and sequences: the Variable (V), Constant 1 (C1), Constant 2 (C2), and Intermediate (I) sets. The structure of the LRIG1 extracellular Ig domain lacks a C" strand and thus is better described as a member of the I-set of IgSF domains.


Pssm-ID: 409420 [Multi-domain]  Cd Length: 91  Bit Score: 51.08  E-value: 9.55e-08
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907141596 1362 FSRLLSNVTVMEGSPVTLEVEVTGFPEPTLTWFKKGQKLCADGHLQVLHKDTKHSVF-IPKVCEADAGLYVAQAQNSSGT 1440
Cdd:cd05763      2 FTKTPHDITIRAGSTARLECAATGHPTPQIAWQKDGGTDFPAARERRMHVMPEDDVFfIVDVKIEDTGVYSCTAQNSAGS 81

                   ....*....
gi 1907141596 1441 LSSKAILHV 1449
Cdd:cd05763     82 ISANATLTV 90
IgI_Myotilin_C cd05892
C-terminal immunoglobulin (Ig)-like domain of myotilin; member of the I-set of Ig superfamily ...
1360-1449 1.52e-07

C-terminal immunoglobulin (Ig)-like domain of myotilin; member of the I-set of Ig superfamily (IgSF) domains; The members here are composed of the C-terminal immunoglobulin (Ig)-like domain of myotilin. Mytolin belongs to the palladin-myotilin-myopalladin family. Proteins belonging to the latter family contain multiple Ig-like domains and function as scaffolds, modulating the actin cytoskeleton. Myotilin is most abundant in skeletal and cardiac muscle and is involved in maintaining sarcomere integrity. It binds to alpha-actinin, filamin, and actin. Mutations in myotilin lead to muscle disorders. This group belongs to the I-set of IgSF domains, having A-B-E-D strands in one beta-sheet and A'-G-F-C-C' in the other. Like the V-set Ig domains, members of the I-set have a discontinuous A strand, but lack a C" strand.


Pssm-ID: 409473  Cd Length: 92  Bit Score: 50.54  E-value: 1.52e-07
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907141596 1360 PHFSRLLSNVTVMEGSPVTLEVEVTGFPEPTLTWFKKGQKLCADGHLQVLHKDT--KHSVFIPKVCEADAGLYVAQAQNS 1437
Cdd:cd05892      1 PMFIQKPQNKKVLEGDPVRLECQISAIPPPQIFWKKNNEMLQYNTDRISLYQDNcgRICLLIQNANKKDAGWYTVSAVNE 80
                           90
                   ....*....|..
gi 1907141596 1438 SGTLSSKAILHV 1449
Cdd:cd05892     81 AGVVSCNARLDV 92
IgI_4_Robo cd05726
Fourth immunoglobulin (Ig)-like domain in Robo (roundabout) receptors; member of the I-set of ...
1368-1451 5.71e-07

Fourth immunoglobulin (Ig)-like domain in Robo (roundabout) receptors; member of the I-set of Ig superfamily (IgSF) domains; Members here are composed the fourth immunoglobulin (Ig)-like domain in Robo (roundabout) receptors. Robo receptors play a role in the development of the central nervous system (CNS), and are receptors of Slit protein. Slit is a repellant secreted by the neural cells in the midline. Slit acts through Robo to prevent most neurons from crossing the midline from either side. Three mammalian Robo homologs (Robo1, Robo2, Robo3), and three mammalian Slit homologs (Slit-1, Slit-2, Slit-3), have been identified. Commissural axons, which cross the midline, express low levels of Robo; longitudinal axons, which avoid the midline, express high levels of Robo. Robo1, Robo2, and Robo3 are expressed by commissural neurons in the vertebrate spinal cord and Slit-1, Slit-2, and Slit-3 are expressed at the ventral midline. Robo-3 is a divergent member of the Robo family which instead of being a positive regulator of Slit responsiveness, antagonizes Slit responsiveness in precrossing axons. The Slit-Robo interaction is mediated by the second leucine-rich repeat (LRR) domain of Slit and the two N-terminal Ig domains of Robo, Ig1 and Ig2. The primary Robo binding site for Slit2 has been shown by surface plasmon resonance experiments and mutational analysis to be the Ig1 domain, while the Ig2 domain has been proposed to harbor a weak secondary binding site. This group belongs to the I-set of IgSF domains, having A-B-E-D strands in one beta-sheet and A'-G-F-C-C' in the other. Like the V-set Ig domains, members of the I-set have a discontinuous A strand, but lack a C" strand.


Pssm-ID: 409391 [Multi-domain]  Cd Length: 98  Bit Score: 49.18  E-value: 5.71e-07
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907141596 1368 NVTVMEGSPVTLEVEVTGFPEPTLTWFKKG-QKLCADGhlQVLHKDTKHSV------FIPKVCEADAGLYVAQAQNSSGT 1440
Cdd:cd05726      8 DQVVALGRTVTFQCETKGNPQPAIFWQKEGsQNLLFPY--QPPQPSSRFSVsptgdlTITNVQRSDVGYYICQALNVAGS 85
                           90
                   ....*....|.
gi 1907141596 1441 LSSKAILHVTG 1451
Cdd:cd05726     86 ILAKAQLEVTD 96
IgI_2_Robo cd05724
Second immunoglobulin (Ig)-like domain in Robo (roundabout) receptors; member of the I-set of ...
1367-1445 6.15e-07

Second immunoglobulin (Ig)-like domain in Robo (roundabout) receptors; member of the I-set of Ig superfamily (IgSF) domains; The members here are composed of the second immunoglobulin (Ig)-like domain in Robo (roundabout) receptors. Robo receptors play a role in the development of the central nervous system (CNS), and are receptors of the Slit protein. Slit is a repellant secreted by the neural cells in the midline. Slit acts through Robo to prevent most neurons from crossing the midline from either side. Three mammalian Robo homologs (Robo1, Robo2, and Robo3), and three mammalian Slit homologs (Slit-1,Slit-2, Slit-3), have been identified. Commissural axons, which cross the midline, express low levels of Robo; longitudinal axons, which avoid the midline, express high levels of Robo. Robo1, Robo2, and Robo3 are expressed by commissural neurons in the vertebrate spinal cord and Slit-1, Slit-2, Slit-3 are expressed at the ventral midline. Robo-3 is a divergent member of the Robo family which instead of being a positive regulator of Slit responsiveness, antagonizes Slit responsiveness in precrossing axons. The Slit-Robo interaction is mediated by the second leucine-rich repeat (LRR) domain of Slit and the two N-terminal Ig domains of Robo, Ig1 and Ig2. The primary Robo binding site for Slit-2 has been shown by surface plasmon resonance experiments and mutational analysis to be the Ig1 domain, while the Ig2 domain has been proposed to harbor a weak secondary binding site. This group belongs to the I-set of IgSF domains, having A-B-E-D strands in one beta-sheet and A'-G-F-C-C' in the other. Like the V-set Ig domains, members of the I-set have a discontinuous A strand, but lack a C" strand.


Pssm-ID: 409389 [Multi-domain]  Cd Length: 87  Bit Score: 48.55  E-value: 6.15e-07
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907141596 1367 SNVTVMEGSPVTLEVEV-TGFPEPTLTWFKKGQKLCADG-HLQVLHKDtkhSVFIPKVCEADAGLYVAQAQNSSGTLSSK 1444
Cdd:cd05724      5 SDTQVAVGEMAVLECSPpRGHPEPTVSWRKDGQPLNLDNeRVRIVDDG---NLLIAEARKSDEGTYKCVATNMVGERESR 81

                   .
gi 1907141596 1445 A 1445
Cdd:cd05724     82 A 82
IgI_Twitchin_like cd20949
C-terminal immunoglobulin-like domain of the myosin-associated giant protein kinase Twitchin, ...
1367-1449 1.53e-06

C-terminal immunoglobulin-like domain of the myosin-associated giant protein kinase Twitchin, and similar domains; member of the I-set IgSF domains; The members here are composed of the C-terminal immunoglobulin-like domain of the myosin-associated giant protein kinase Twitchin and similar proteins, including Caenorhabditis elegans and Aplysia californica Twitchin, Drosophila melanogaster Projectin, and similar proteins. These are very large muscle proteins containing multiple immunoglobulin (Ig)-like and fibronectin type III (FN3) domains and a single kinase domain near the C-terminus. In humans these proteins are called Titin. The Ig superfamily (IgSF) is a heterogenous group of proteins, built on a common fold comprised of a sandwich of two beta sheets. IgSF domains can be divided into 4 main classes based on their structures and sequences: the Variable (V), Constant 1 (C1), Constant 2 (C2), and Intermediate (I) sets. The Ig-like domain of the Twitchin is a member of the I-set IgSF domains, having A-B-E-D strands in one beta-sheet and A'-G-F-C-C' in the other. Like the V-set Ig domains, members of the I-set have a discontinuous A strand but lack a C" strand. I-set domains are found in several cell adhesion molecules (such as VCAM, ICAM, and MADCAM), and are also present in numerous other diverse protein families, including several tyrosine-protein kinase receptors, the hemolymph protein hemolin, the muscle proteins (titin, telokin, and twitchin), the neuronal adhesion molecule axonin-1, and the signaling molecule semaphorin 4D.


Pssm-ID: 409541 [Multi-domain]  Cd Length: 89  Bit Score: 47.71  E-value: 1.53e-06
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907141596 1367 SNVTVMEGSPVTLEVEVTGFPEPTLTWFKKGQKLCADGHLQVLHKDTKHSVFIPKVCEADAGLYVAQAQNSSGTLSSKAI 1446
Cdd:cd20949      7 YVTTVKEGQSATILCEVKGEPQPNVTWHFNGQPISASVADMSKYRILADGLLINKVTQDDTGEYTCRAYQVNSIASDMQE 86

                   ...
gi 1907141596 1447 LHV 1449
Cdd:cd20949     87 RTV 89
IgI_3_NCAM-1 cd05730
Third immunoglobulin (Ig)-like domain of Neural Cell Adhesion Molecule 1 (NCAM-1); member of ...
1368-1449 1.57e-06

Third immunoglobulin (Ig)-like domain of Neural Cell Adhesion Molecule 1 (NCAM-1); member of the I-set of IgSF domains; The members here are composed of the third immunoglobulin (Ig)-like domain of Neural Cell Adhesion Molecule (NCAM-1). NCAM plays important roles in the development and regeneration of the central nervous system, in synaptogenesis and neural migration. NCAM mediates cell-cell and cell-substratum recognition and adhesion via homophilic (NCAM-NCAM), and heterophilic (NCAM-non-NCAM), interactions. NCAM is expressed as three major isoforms having different intracellular extensions. The extracellular portion of NCAM has five N-terminal Ig-like domains and two fibronectin type III domains. The double zipper adhesion complex model for NCAM homophilic binding involves Ig1, Ig2, and Ig3. By this model, Ig1 and Ig2 mediate dimerization of NCAM molecules situated on the same cell surface (cis interactions), and Ig3 domains mediate interactions between NCAM molecules expressed on the surface of opposing cells (trans interactions) through binding to the Ig1 and Ig2 domains. The adhesive ability of NCAM is modulated by the addition of polysialic acid chains to the fifth Ig-like domain.


Pssm-ID: 143207 [Multi-domain]  Cd Length: 95  Bit Score: 48.00  E-value: 1.57e-06
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907141596 1368 NVTVMEGSPVTLEVEVTGFPEPTLTWFKKGQKLcADGHLQVLHKDTKHSVFIPKVCEADAGLYVAQAQNSSGtlSSKAIL 1447
Cdd:cd05730     12 NATANLGQSVTLACDADGFPEPTMTWTKDGEPI-ESGEEKYSFNEDGSEMTILDVDKLDEAEYTCIAENKAG--EQEAEI 88

                   ..
gi 1907141596 1448 HV 1449
Cdd:cd05730     89 HL 90
IgI_5_Dscam cd20958
Fifth immunoglobulin domain of the Drosophila melanogaster Dscam protein, and similar domains; ...
1360-1450 2.06e-06

Fifth immunoglobulin domain of the Drosophila melanogaster Dscam protein, and similar domains; a member of the I-set of IgSF domains; The members here are composed of the fifth immunoglobulin domain of the Drosophila melanogaster Down syndrome cell adhesion molecule (DSCAM) protein and similar proteins. Down syndrome cell adhesion molecule (DSCAM) is a cell adhesion molecule that plays critical roles in neural development, including axon guidance and branching, axon target recognition, self-avoidance and synaptic formation. DSCAM belongs to the immunoglobulin superfamily and contributes to defects in the central nervous system in Down syndrome patients. Vertebrate DSCAMs differ from Drosophila Dscam1 in that they lack the extensive alternative splicing that occurs in the insect gene. Drosophila melanogaster Dscam has 38,016 isoforms generated by the alternative splicing of four variable exon clusters, which allows every neuron in the fly to display a distinctive set of Dscam proteins on its cell surface. Drosophila Dscam1 is a cell-surface protein that plays important roles in neural development and axon tiling of neurons. It is shown that thousands of isoforms bind themselves through specific homophilic (self-binding) interactions, a process which mediates cellular self-recognition. Drosophila Dscam2 is also alternatively spliced and plays a key role in the development of two visual system neurons, monopolar cells L1 and L2. This group is a member of the I-set Ig domains, having A-B-E-D strands in one beta-sheet and A'-G-F-C-C' in the other. Like the V-set Ig domains, members of the I-set have a discontinuous A strand but lack a C" strand.


Pssm-ID: 409550 [Multi-domain]  Cd Length: 89  Bit Score: 47.18  E-value: 2.06e-06
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907141596 1360 PHFSRLLSNVTVMEGSPVTLEVEVTGFPEPTLTWFKKGQKLcADGHLQVLHKD---TKHSVFIpkvcEADAGLYVAQAQN 1436
Cdd:cd20958      1 PPFIRPMGNLTAVAGQTLRLHCPVAGYPISSITWEKDGRRL-PLNHRQRVFPNgtlVIENVQR----SSDEGEYTCTARN 75
                           90
                   ....*....|....
gi 1907141596 1437 SSGtLSSKAILHVT 1450
Cdd:cd20958     76 QQG-QSASRSVFVK 88
Ig5_Contactin cd04969
Fifth immunoglobulin (Ig) domain of contactin; The members here are composed of the fifth ...
1360-1449 2.30e-06

Fifth immunoglobulin (Ig) domain of contactin; The members here are composed of the fifth immunoglobulin (Ig) domain of contactins. Contactins are neural cell adhesion molecules and are comprised of six Ig domains followed by four fibronectin type III (FnIII) domains anchored to the membrane by glycosylphosphatidylinositol. The first four Ig domains form the intermolecular binding fragment, which arranges as a compact U-shaped module via contacts between Ig domains 1 and 4, and between Ig domains 2 and 3. Contactin-2 (TAG-1, axonin-1) may play a part in the neuronal processes of neurite outgrowth, axon guidance and fasciculation, and neuronal migration. This group also includes contactin-1 and contactin-5. The different contactins show different expression patterns in the central nervous system. During development and in adulthood, contactin-2 is transiently expressed in subsets of central and peripheral neurons. Contactin-5 is expressed specifically in the rat postnatal nervous system, peaking at about 3 weeks postnatal, and a lack of contactin-5 (NB-2) results in an impairment of neuronal activity in the rat auditory system. Contactin-5 is highly expressed in the adult human brain in the occipital lobe and in the amygdala. Contactin-1 is differentially expressed in tumor tissues and may, through a RhoA mechanism, facilitate invasion and metastasis of human lung adenocarcinoma.


Pssm-ID: 409358 [Multi-domain]  Cd Length: 89  Bit Score: 47.07  E-value: 2.30e-06
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907141596 1360 PHFSR--LLSNVTVMEGSPVTLEVEVTGFPEPTLTWFKKGQKLCADGHLQVLhkdTKHSVFIPKVCEADAGLYVAQAQNS 1437
Cdd:cd04969      1 PDFELnpVKKKILAAKGGDVIIECKPKASPKPTISWSKGTELLTNSSRICIL---PDGSLKIKNVTKSDEGKYTCFAVNF 77
                           90
                   ....*....|..
gi 1907141596 1438 SGTLSSKAILHV 1449
Cdd:cd04969     78 FGKANSTGSLSV 89
Ig_2 pfam13895
Immunoglobulin domain; This domain contains immunoglobulin-like domains.
1366-1449 3.09e-06

Immunoglobulin domain; This domain contains immunoglobulin-like domains.


Pssm-ID: 464026 [Multi-domain]  Cd Length: 79  Bit Score: 46.62  E-value: 3.09e-06
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907141596 1366 LSNVTVMEGSPVTLEVEVTGFPEPTLTWFKKGQKLcadghlqvlhkDTKHSVFIPKVCEADAGLYVAQAQNSSG-TLSSK 1444
Cdd:pfam13895    6 PSPTVVTEGEPVTLTCSAPGNPPPSYTWYKDGSAI-----------SSSPNFFTLSVSAEDSGTYTCVARNGRGgKVSNP 74

                   ....*
gi 1907141596 1445 AILHV 1449
Cdd:pfam13895   75 VELTV 79
IgI_1_Palladin_C cd05893
First C-terminal immunoglobulin (Ig)-like domain of palladin; member of the I-set of Ig ...
1360-1449 3.66e-06

First C-terminal immunoglobulin (Ig)-like domain of palladin; member of the I-set of Ig superfamily (IgSF) domains; The members here are composed of the C-terminal immunoglobulin (Ig)-like domain of palladin. Palladin belongs to the palladin-myotilin-myopalladin family. Proteins belonging to this family contain multiple Ig-like domains and function as scaffolds, modulating actin cytoskeleton. Palladin binds to alpha-actinin ezrin, vasodilator-stimulated phosphoprotein VASP, SPIN90 (also known as DIP or mDia interacting protein), and Src. Palladin also binds F-actin directly, via its Ig3 domain. Palladin is expressed as several alternatively spliced isoforms, having various combinations of Ig-like domains, in a cell-type-specific manner. It has been suggested that palladin's different Ig-like domains may be specialized for distinct functions. This group belongs to the I-set of IgSF domains, having A-B-E-D strands in one beta-sheet and A'-G-F-C-C' in the other. Like the V-set Ig domains, members of the I-set have a discontinuous A strand, but lack a C" strand.


Pssm-ID: 409474  Cd Length: 92  Bit Score: 46.63  E-value: 3.66e-06
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907141596 1360 PHFSRLLSNVTVMEGSPVTLEVEVTGFPEPTLTWFKKGQKLCADGHLQVLHKDTKHSVFIPKVCEA--DAGLYVAQAQNS 1437
Cdd:cd05893      1 PFFEMKLKHYKIFEGMPVTFTCRVAGNPKPKIYWFKDGKQISPKSDHYTIQRDLDGTCSLHTTASTldDDGNYTIMAANP 80
                           90
                   ....*....|..
gi 1907141596 1438 SGTLSSKAILHV 1449
Cdd:cd05893     81 QGRISCTGRLMV 92
SMC_prok_B TIGR02168
chromosome segregation protein SMC, common bacterial type; SMC (structural maintenance of ...
716-1067 3.80e-06

chromosome segregation protein SMC, common bacterial type; SMC (structural maintenance of chromosomes) proteins bind DNA and act in organizing and segregating chromosomes for partition. SMC proteins are found in bacteria, archaea, and eukaryotes. This family represents the SMC protein of most bacteria. The smc gene is often associated with scpB (TIGR00281) and scpA genes, where scp stands for segregation and condensation protein. SMC was shown (in Caulobacter crescentus) to be induced early in S phase but present and bound to DNA throughout the cell cycle. [Cellular processes, Cell division, DNA metabolism, Chromosome-associated proteins]


Pssm-ID: 274008 [Multi-domain]  Cd Length: 1179  Bit Score: 51.98  E-value: 3.80e-06
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907141596  716 SLQSTSDLRRRWIAMKPQLQQLHEDVQqiTKEWEVLSSQgapLKEKAEQLKDLVHLHRRQRERIQEYEEILYKTVQFHQV 795
Cdd:TIGR02168  204 SLERQAEKAERYKELKAELRELELALL--VLRLEELREE---LEELQEELKEAEEELEELTAELQELEEKLEELRLEVSE 278
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907141596  796 KEELVHLIKPRELELLAQpmelasSEEVQMQLGRSQERRAhvdHLHQLALTLGVDIISSVQQP--LKESFKDIKKKFNNL 873
Cdd:TIGR02168  279 LEEEIEELQKELYALANE------ISRLEQQKQILRERLA---NLERQLEELEAQLEELESKLdeLAEELAELEEKLEEL 349
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907141596  874 KFNY-------SKKNEKSRNLKTLQYQIQ-QVDTYAEKIQALRKKMEKVNNKtsdsfLSYpsnkanmLSEAMEDLKKNVD 945
Cdd:TIGR02168  350 KEELesleaelEELEAELEELESRLEELEeQLETLRSKVAQLELQIASLNNE-----IER-------LEARLERLEDRRE 417
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907141596  946 DFDKVVTDYKMNLDLTE--HLQEVIEECNFWYEDASATVVRVgkysmecqtREAVDILHRQFNKFITP--SVPQQEERIQ 1021
Cdd:TIGR02168  418 RLQQEIEELLKKLEEAElkELQAELEELEEELEELQEELERL---------EEALEELREELEEAEQAldAAERELAQLQ 488
                          330       340       350       360
                   ....*....|....*....|....*....|....*....|....*.
gi 1907141596 1022 EVIDLAQRLYGLEEGQKYAEKIVTRHKEILESITELCGSLVELKEK 1067
Cdd:TIGR02168  489 ARLDSLERLQENLEGFSEGVKALLKNQSGLSGILGVLSELISVDEG 534
Ig4_Contactin-2-like cd05728
Fourth Ig domain of the neural cell adhesion molecule contactin-2, and similar domains; The ...
1374-1449 1.23e-05

Fourth Ig domain of the neural cell adhesion molecule contactin-2, and similar domains; The members here are composed of the fourth Ig domain of the neural cell adhesion molecule contactin-2. Contactins are comprised of six Ig domains followed by four fibronectin type III (FnIII) domains anchored to the membrane by glycosylphosphatidylinositol. Contactin-2 (also called TAG-1, axonin-1) facilitates cell adhesion by homophilic binding between molecules in apposed membranes. The first four Ig domains form the intermolecular binding fragment which arranges as a compact U-shaped module by contacts between Ig domains 1 and 4, and domains 2 and 3. It has been proposed that a linear zipper-like array forms, from contactin-2 molecules alternatively provided by the two apposed membranes.


Pssm-ID: 143205 [Multi-domain]  Cd Length: 85  Bit Score: 44.90  E-value: 1.23e-05
                           10        20        30        40        50        60        70
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*.
gi 1907141596 1374 GSPVTLEVEVTGFPEPTLTWFKKGQKLCADGHLQVLHKDTKhsvfIPKVCEADAGLYVAQAQNSSGTLSSKAILHV 1449
Cdd:cd05728     14 GSSLRWECKASGNPRPAYRWLKNGQPLASENRIEVEAGDLR----ITKLSLSDSGMYQCVAENKHGTIYASAELAV 85
IgI_4_Neogenin_like cd05723
Fourth immunoglobulin (Ig)-like domain in neogenin, and similar domains; member of the I-set ...
1367-1447 1.75e-05

Fourth immunoglobulin (Ig)-like domain in neogenin, and similar domains; member of the I-set of Ig superfamily (IgSF) domains; The members here are composed of the fourth immunoglobulin (Ig)-like domain in neogenin and related proteins. Neogenin is a cell surface protein which is expressed in the developing nervous system of vertebrate embryos in the growing nerve cells. It is also expressed in other embryonic tissues, and may play a general role in developmental processes such as cell migration, cell-cell recognition, and tissue growth regulation. Included in this group is the tumor suppressor protein DCC which is deleted in colorectal carcinoma. DCC and neogenin each have four Ig-like domains followed by six fibronectin type III domains, a transmembrane domain, and an intracellular domain. This group belongs to the I-set of IgSF domains, having A-B-E-D strands in one beta-sheet and A'-G-F-C-C' in the other. Like the V-set Ig domains, members of the I-set have a discontinuous A strand, but lack a C" strand.


Pssm-ID: 409388  Cd Length: 84  Bit Score: 44.49  E-value: 1.75e-05
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907141596 1367 SNVTVMEGSPVTLEVEVTGFPEPTLTWFKKGQKLCADGHLQVLHkdtKHSVFIPKVCEADAGLYVAQAQNSSGTLSSKAI 1446
Cdd:cd05723      5 SNIYAHESMDIVFECEVTGKPTPTVKWVKNGDVVIPSDYFKIVK---EHNLQVLGLVKSDEGFYQCIAENDVGNAQASAQ 81

                   .
gi 1907141596 1447 L 1447
Cdd:cd05723     82 L 82
Ig3_Peroxidasin cd05745
Third immunoglobulin (Ig)-like domain of peroxidasin; The members here are composed of the ...
1373-1449 2.19e-05

Third immunoglobulin (Ig)-like domain of peroxidasin; The members here are composed of the third immunoglobulin (Ig)-like domain in peroxidasin. Peroxidasin has a peroxidase domain and interacting extracellular motifs containing four Ig-like domains. It has been suggested that peroxidasin is secreted and has functions related to the stabilization of the extracellular matrix. It may play a part in various other important processes such as removal and destruction of cells which have undergone programmed cell death and protection of the organism against non-self.


Pssm-ID: 143222 [Multi-domain]  Cd Length: 74  Bit Score: 43.77  E-value: 2.19e-05
                           10        20        30        40        50        60        70
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*..
gi 1907141596 1373 EGSPVTLEVEVTGFPEPTLTWFKKGQKLCADGHLQVLHKDTkhsVFIPKVCEADAGLYVAQAQNSSGTLSSKAILHV 1449
Cdd:cd05745      1 EGQTVDFLCEAQGYPQPVIAWTKGGSQLSVDRRHLVLSSGT---LRISRVALHDQGQYECQAVNIVGSQRTVAQLTV 74
IgI_M-protein_C cd05891
C-terminal immunoglobulin (Ig)-like domain of M-protein; member of the I-set of Ig superfamily ...
1369-1439 2.42e-05

C-terminal immunoglobulin (Ig)-like domain of M-protein; member of the I-set of Ig superfamily (IgSF) domains; The members here are composed of the C-terminal immunoglobulin (Ig)-like domain of M-protein (also known as myomesin-2). M-protein is a structural protein localized to the M-band, a transverse structure in the center of the sarcomere, and is a candidate for M-band bridges. M-protein is modular consisting mainly of repetitive IG-like and fibronectin type III (FnIII) domains and has a muscle-type specific expression pattern. M-protein is present in fast fibers.


Pssm-ID: 143299  Cd Length: 92  Bit Score: 44.52  E-value: 2.42e-05
                           10        20        30        40        50        60        70
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|..
gi 1907141596 1369 VTVMEGSPVTLEVEVTGFPEPTLTWFKKGQKLCADGHLQVLHKDTKH-SVFIPKVCEADAGLYVAQAQNSSG 1439
Cdd:cd05891     11 VTIMEGKTLNLTCTVFGNPDPEVIWFKNDQDIELSEHYSVKLEQGKYaSLTIKGVTSEDSGKYSINVKNKYG 82
IgI_2_MuSK cd20968
agrin-responsive second immunoglobulin-like domains (Ig2) of the Muscle-specific kinase (MuSK) ...
1368-1446 2.82e-05

agrin-responsive second immunoglobulin-like domains (Ig2) of the Muscle-specific kinase (MuSK) ectodomain; a member of the I-set of Ig superfamily domains; The members here are composed of the second immunoglobulin-like (Ig) domains of the Muscle-specific kinase (MuSK) ectodomain. MuSK is a receptor tyrosine kinase specifically expressed in skeletal muscle, where it plays a central role in the formation and maintenance of the neuromuscular junction (NMJ). MuSK is activated by agrin, a neuron-derived heparan sulfate proteoglycan. The activation of MUSK in myotubes regulates the formation of NMJs through the regulation of different processes including the specific expression of genes in subsynaptic nuclei, the reorganization of the actin cytoskeleton and the clustering of the acetylcholine receptors (AChR) in the postsynaptic membrane. The Ig superfamily (IgSF) is a heterogenous group of proteins, built on a common fold comprised of a sandwich of two beta sheets. IgSF domains can be divided into 4 main classes based on their structures and sequences: the Variable (V), Constant 1 (C1), Constant 2 (C2), and Intermediate (I) sets. Unlike the V-set, one of the distinctive features of I-set domains is the lack of a C" strand. The structure of the MuSK lacks this strand and thus it belongs to the I-set of the IgSF. I-set domains are found in several cell adhesion molecules (such as VCAM, ICAM, and MADCAM), and are also present in numerous other diverse protein families, including several tyrosine-protein kinase receptors, the hemolymph protein hemolin, the muscle proteins titin, telokin, and twitchin, the neuronal adhesion molecule axonin-1, and the signaling molecule semaphorin 4D that is involved in axonal guidance, immune function and angiogenesis.


Pssm-ID: 409560 [Multi-domain]  Cd Length: 88  Bit Score: 44.16  E-value: 2.82e-05
                           10        20        30        40        50        60        70
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....
gi 1907141596 1368 NVTVMEGSPVTLEVEVTGFPEPTLTWFKKGQKLCADGHLQVLHKDtkhSVFIPKVCEADAGLYVAQAQNSSGTLSSKAI 1446
Cdd:cd20968      8 NVTIIEGLKAVLPCTTMGNPKPSVSWIKGDDLIKENNRIAVLESG---SLRIHNVQKEDAGQYRCVAKNSLGIAYSKPV 83
IgC2_3_Dscam cd20957
Third immunoglobulin domain of the Drosophila melanogaster Dscam protein, and similar domains; ...
1370-1447 3.17e-05

Third immunoglobulin domain of the Drosophila melanogaster Dscam protein, and similar domains; a member of the Constant 2 (C2)-set of IgSF domains; The members here are composed of the third immunoglobulin domain of the Drosophila melanogaster Down syndrome cell adhesion molecule (DSCAM) protein and similar proteins. Down syndrome cell adhesion molecule (DSCAM) is a cell adhesion molecule that plays critical roles in neural development, including axon guidance and branching, axon target recognition, self-avoidance and synaptic formation. DSCAM belongs to the immunoglobulin superfamily and contributes to defects in the central nervous system in Down syndrome patients. Vertebrate DSCAMs differ from Drosophila Dscam1 in that they lack the extensive alternative splicing that occurs in the insect gene. Drosophila melanogaster Dscam has 38,016 isoforms generated by the alternative splicing of four variable exon clusters, which allows every neuron in the fly to display a distinctive set of Dscam proteins on its cell surface. Drosophila Dscam1 is a cell-surface protein that plays important roles in neural development and axon tiling of neurons. It is shown that thousands of isoforms bind themselves through specific homophilic (self-binding) interactions, a process which mediates cellular self-recognition. Drosophila Dscam2 is also alternatively spliced and plays a key role in the development of two visual system neurons, monopolar cells L1 and L2. IgSF domains can be divided into 4 main classes based on their structures and sequences: the Variable (V), Constant 1 (C1), Constant 2 (C2), and Intermediate (I) sets. This group belongs to the C2-set of IgSF domains, having A, B, and E strands in one beta-sheet and A', G, F, C, and C' in the other. Unlike other Ig domain sets, the C2-set lacks the D strand.


Pssm-ID: 409549 [Multi-domain]  Cd Length: 88  Bit Score: 44.06  E-value: 3.17e-05
                           10        20        30        40        50        60        70
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*...
gi 1907141596 1370 TVMEGSPVTLEVEVTGFPEPTLTWFKKGQKLCADGHLQVLHKDTKHsvfIPKVCEADAGLYVAQAQNSSGTLSSKAIL 1447
Cdd:cd20957     12 TVDFGRTAVFNCSVTGNPIHTVLWMKDGKPLGHSSRVQILSEDVLV---IPSVKREDKGMYQCFVRNDGDSAQATAEL 86
Ig3_L1-CAM_like cd05731
Third immunoglobulin (Ig)-like domain of the L1 cell adhesion molecule (CAM), and similar ...
1367-1450 4.04e-05

Third immunoglobulin (Ig)-like domain of the L1 cell adhesion molecule (CAM), and similar domains; The members here are composed of the third immunoglobulin (Ig)-like domain of the L1 cell adhesion molecule (CAM). L1 belongs to the L1 subfamily of cell adhesion molecules (CAMs) and is comprised of an extracellular region having six Ig-like domains and five fibronectin type III domains, a transmembrane region and an intracellular domain. L1 is primarily expressed in the nervous system and is involved in its development and function. L1 is associated with an X-linked recessive disorder, X-linked hydrocephalus, MASA syndrome, and spastic paraplegia type 1, that involves abnormalities of axonal growth. This group also contains the chicken neuron-glia cell adhesion molecule, Ng-CAM and human neurofascin.


Pssm-ID: 409394 [Multi-domain]  Cd Length: 83  Bit Score: 43.55  E-value: 4.04e-05
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907141596 1367 SNVTVMEGSPVTLEVEVTGFPEPTLTWFKKGQKLCADghlQVLHKDTKHSVFIPKVCEADAGLYVAQAQNSSGtlSSKAI 1446
Cdd:cd05731      3 SSTMVLRGGVLLLECIAEGLPTPDIRWIKLGGELPKG---RTKFENFNKTLKIENVSEADSGEYQCTASNTMG--SARHT 77

                   ....
gi 1907141596 1447 LHVT 1450
Cdd:cd05731     78 ISVT 81
IgI_5_Robo cd20952
Fifth Ig-like domain of Roundabout (Robo) homolog 1/2, and similar domains; a member of the ...
1368-1449 4.12e-05

Fifth Ig-like domain of Roundabout (Robo) homolog 1/2, and similar domains; a member of the I-set of IgSF domains; The members here are composed of the fifth Ig-like domain of Roundabout (Robo) homolog 1/2 and similar domains. Robo receptors play a role in the development of the central nervous system (CNS), and are receptors of Slit protein. Slit is a repellant secreted by the neural cells in the midline. Slit acts through Robo to prevent most neurons from crossing the midline from either side. Three mammalian Robo homologs (Robo1, -2, and -3), and three mammalian Slit homologs (Slit-1,-2, -3), have been identified. Commissural axons, which cross the midline, express low levels of Robo; longitudinal axons, which avoid the midline, express high levels of Robo. Robo1, -2, and -3 are expressed by commissural neurons in the vertebrate spinal cord and Slits 1, -2, -3 are expressed at the ventral midline. Robo-3 is a divergent member of the Robo family which instead of being a positive regulator of slit responsiveness, antagonizes slit responsiveness in precrossing axons. The Slit-Robo interaction is mediated by the second leucine-rich repeat (LRR) domain of Slit and the two N-terminal Ig domains of Robo, Ig1 and Ig2. The primary Robo binding site for Slit2 has been shown by surface plasmon resonance experiments and mutational analysis to be is the Ig1 domain, while the Ig2 domain has been proposed to harbor a weak secondary binding site. The fifth Ig-like domain of Robo 1 and 2 is a member of the I-set Ig domains, having A-B-E-D strands in one beta-sheet and A'-G-F-C-C' in the other. Like the V-set Ig domains, members of the I-set have a discontinuous A strand but lack a C" strand. I-set domains are found in several cell adhesion molecules (such as VCAM, ICAM, and MADCAM), and are also present in numerous other diverse protein families, including several tyrosine-protein kinase receptors


Pssm-ID: 409544 [Multi-domain]  Cd Length: 87  Bit Score: 43.64  E-value: 4.12e-05
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907141596 1368 NVTVMEGSPVTLEVEVTGFPEPTLTWFKKG-QKLCADGHLQVLHKDtkhSVFIPKVCEADAGLYVAQAQNSSGTLSSKAI 1446
Cdd:cd20952      8 NQTVAVGGTVVLNCQATGEPVPTISWLKDGvPLLGKDERITTLENG---SLQIKGAEKSDTGEYTCVALNLSGEATWSAV 84

                   ...
gi 1907141596 1447 LHV 1449
Cdd:cd20952     85 LDV 87
IgI_VEGFR-3 cd05863
Immunoglobulin (Ig)-like domain of vascular endothelial growth factor receptor 3 (VEGFR-3); ...
1377-1441 4.13e-05

Immunoglobulin (Ig)-like domain of vascular endothelial growth factor receptor 3 (VEGFR-3); member of the I-set of Ig superfamily (IgSF) domains; The members here are composed of the immunoglobulin (Ig)-like domain of vascular endothelial growth factor receptor 3 (VEGFR-3). The VEGFRs have an extracellular component with seven Ig-like domains, a transmembrane segment, and an intracellular tyrosine kinase domain interrupted by a kinase-insert domain. VEGFRs bind VEGFs with high affinity at the Ig-like domains. VEGFR-3 (Flt-4) binds two members of the VEGF family (VEGF-C and VEGF-D) and is involved in tumor angiogenesis and growth. This group belongs to the I-set of IgSF domains, having A-B-E-D strands in one beta-sheet and A'-G-F-C-C' in the other. Like the V-set Ig domains, members of the I-set have a discontinuous A strand, but lack a C" strand.


Pssm-ID: 409449  Cd Length: 88  Bit Score: 43.77  E-value: 4.13e-05
                           10        20        30        40        50        60
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*
gi 1907141596 1377 VTLEVEVTGFPEPTLTWFKKGQKLCAdghlqvlhKDTKHSVFIPKVCEADAGLYVAQAQNSSGTL 1441
Cdd:cd05863     22 VKLPVKVAAYPPPEFQWYKDGKLISG--------KHSPHSLQIKDVTEASAGTYTLVLWNSAAGL 78
Ig_DSCAM cd05734
Immunoglobulin (Ig)-like domain of Down Syndrome Cell Adhesion molecule (DSCAM); The members ...
1360-1449 4.31e-05

Immunoglobulin (Ig)-like domain of Down Syndrome Cell Adhesion molecule (DSCAM); The members here are composed of the immunoglobulin (Ig)-like domain of Down Syndrome Cell Adhesion molecule (DSCAM). DSCAM is a cell adhesion molecule expressed largely in the developing nervous system. The gene encoding DSCAM is located at human chromosome 21q22, the locus associated with the intellectual disability phenotype of Down Syndrome. DSCAM is predicted to be the largest member of the IG superfamily. It has been demonstrated that DSCAM can mediate cation-independent homophilic intercellular adhesion.


Pssm-ID: 409397 [Multi-domain]  Cd Length: 97  Bit Score: 43.64  E-value: 4.31e-05
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907141596 1360 PHFSRLLSNVTVMEGSPVTLEVEVTGFPEPTLTW-FKKGQK-------LCADGHLQVLhkdTKHSVFIPKVCEADAGLYV 1431
Cdd:cd05734      2 PRFVVQPNDQDGIYGKAVVLNCSADGYPPPTIVWkHSKGSGvpqfqhiVPLNGRIQLL---SNGSLLIKHVLEEDSGYYL 78
                           90
                   ....*....|....*....
gi 1907141596 1432 AQAQNSSGTLSSKAI-LHV 1449
Cdd:cd05734     79 CKVSNDVGADISKSMyLTV 97
IgI_NCAM-1_like cd05732
Immunoglobulin (Ig)-like I-set domain of Neural Cell Adhesion Molecule 1 (NCAM-1) and similar ...
1360-1439 4.36e-05

Immunoglobulin (Ig)-like I-set domain of Neural Cell Adhesion Molecule 1 (NCAM-1) and similar proteins; The members here are composed of the fourth immunoglobulin (Ig)-like domain of Neural Cell Adhesion Molecule (NCAM-1). NCAM plays important roles in the development and regeneration of the central nervous system, in synaptogenesis and neural migration. NCAM mediates cell-cell and cell-substratum recognition and adhesion via homophilic (NCAM-NCAM), and heterophilic (NCAM-non-NCAM), interactions. NCAM is expressed as three major isoforms having different intracellular extensions. The extracellular portion of NCAM has five N-terminal Ig-like domains and two fibronectin type III domains. The double zipper adhesion complex model for NCAM homophilic binding involves Ig1, Ig2, and Ig3. By this model, Ig1 and Ig2 mediate dimerization of NCAM molecules situated on the same cell surface (cis interactions), and Ig3 domains mediate interactions between NCAM molecules expressed on the surface of opposing cells (trans interactions), through binding to the Ig1 and Ig2 domains. The adhesive ability of NCAM is modulated by the addition of polysialic acid chains to the fifth Ig-like domain. Also included in this group is NCAM-2 (also known as OCAM/mamFas II and RNCAM) NCAM-2 is differentially expressed in the developing and mature olfactory epithelium (OE). One of the unique features of I-set domains is the lack of a C" strand. The structures of this group show that the Ig domain lacks this strand and thus is a member of the I-set of Ig domains.


Pssm-ID: 409395 [Multi-domain]  Cd Length: 96  Bit Score: 43.67  E-value: 4.36e-05
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907141596 1360 PHFSRLlSNVTVMEGSPVTLEVEVTGFPEPTLTW--------FKKGQklcADGHLQVLHKDTKHSVFIPKVCEADAGLYV 1431
Cdd:cd05732      3 PKITYL-ENQTAVELEQITLTCEAEGDPIPEITWrratrgisFEEGD---LDGRIVVRGHARVSSLTLKDVQLTDAGRYD 78

                   ....*...
gi 1907141596 1432 AQAQNSSG 1439
Cdd:cd05732     79 CEASNRIG 86
Ig3_L1-CAM cd05876
Third immunoglobulin (Ig)-like domain of the L1 cell adhesion molecule (CAM); The members here ...
1367-1440 5.59e-05

Third immunoglobulin (Ig)-like domain of the L1 cell adhesion molecule (CAM); The members here are composed of the third immunoglobulin (Ig)-like domain of the L1 cell adhesion molecule (CAM). L1 belongs to the L1 subfamily of cell adhesion molecules (CAMs) and is comprised of an extracellular region having six Ig-like domains, five fibronectin type III domains, a transmembrane region and an intracellular domain. L1 is primarily expressed in the nervous system and is involved in its development and function. L1 is associated with an X-linked recessive disorder, X-linked hydrocephalus, MASA syndrome, or spastic paraplegia type 1, that involves abnormalities of axonal growth. This group also contains the chicken neuron-glia cell adhesion molecule, Ng-CAM.


Pssm-ID: 409460 [Multi-domain]  Cd Length: 83  Bit Score: 42.98  E-value: 5.59e-05
                           10        20        30        40        50        60        70
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....
gi 1907141596 1367 SNVTVMEGSPVTLEVEVTGFPEPTLTWFKKGQKLCADghlQVLHKDTKHSVFIPKVCEADAGLYVAQAQNSSGT 1440
Cdd:cd05876      3 SSLVALRGQSLVLECIAEGLPTPTVKWLRPSGPLPPD---RVKYQNHNKTLQLLNVGESDDGEYVCLAENSLGS 73
Ig_Pro_neuregulin cd05750
Immunoglobulin (Ig)-like domain in neuregulins; The members here are composed of the ...
1368-1449 5.79e-05

Immunoglobulin (Ig)-like domain in neuregulins; The members here are composed of the immunoglobulin (Ig)-like domain in neuregulins (NRGs). NRGs are signaling molecules which participate in cell-cell interactions in the nervous system, breast, heart, and other organ systems, and are implicated in the pathology of diseases including schizophrenia, multiple sclerosis, and breast cancer. There are four members of the neuregulin gene family (NRG-1, NRG-2, NRG-3, and NRG-4). The NRG-1 protein, binds to and activates the tyrosine kinases receptors ErbB3 and ErbB4, initiating signaling cascades. The other NRGs proteins bind one or the other or both of these ErbBs. NRG-1 has multiple functions: in the brain it regulates various processes such as radial glia formation and neuronal migration, dendritic development, and expression of neurotransmitters receptors, while in the peripheral nervous system NRG-1 regulates processes such as target cell differentiation, and Schwann cell survival. There are many NRG-1 isoforms which arise from the alternative splicing of mRNA. Less is known of the functions of the other NRGs. NRG-2 and NRG-3 are expressed predominantly in the nervous system. NRG-2 is expressed by motor neurons and terminal Schwann cells, and is concentrated near synaptic sites and may be a signal that regulates synaptic differentiation. NRG-4 has been shown to direct pancreatic islet cell development towards the delta-cell lineage.


Pssm-ID: 409408 [Multi-domain]  Cd Length: 92  Bit Score: 43.27  E-value: 5.79e-05
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907141596 1368 NVTVMEGSPVTLEVEVTG-FPEPTLTWFKKGQKLCA--DGHLQVLHKDTKHSVFIPKVCEADAGLYVAQAQNSSGTLSSK 1444
Cdd:cd05750      8 SQTVQEGSKLVLKCEATSeNPSPRYRWFKDGKELNRkrPKNIKIRNKKKNSELQINKAKLEDSGEYTCVVENILGKDTVT 87

                   ....*
gi 1907141596 1445 AILHV 1449
Cdd:cd05750     88 GNVTV 92
sbcc TIGR00618
exonuclease SbcC; All proteins in this family for which functions are known are part of an ...
219-790 1.87e-04

exonuclease SbcC; All proteins in this family for which functions are known are part of an exonuclease complex with sbcD homologs. This complex is involved in the initiation of recombination to regulate the levels of palindromic sequences in DNA. This family is based on the phylogenomic analysis of JA Eisen (1999, Ph.D. Thesis, Stanford University). [DNA metabolism, DNA replication, recombination, and repair]


Pssm-ID: 129705 [Multi-domain]  Cd Length: 1042  Bit Score: 46.12  E-value: 1.87e-04
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907141596  219 KIDSLLELLQDRRRQLDKYLQQ------QRQELSQVLQLC-LWDQQENQVSSWF--QKAIRDLQEQslgaSLSDNRELIC 289
Cdd:TIGR00618  308 QAQRIHTELQSKMRSRAKLLMKraahvkQQSSIEEQRRLLqTLHSQEIHIRDAHevATSIREISCQ----QHTLTQHIHT 383
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907141596  290 KHEDL-IVKAKEWDSAVEKLKSQAL-----------GILLSKDLAGKEHLQLSNQKLNRLQEEFGRLMVERKAWLSMAND 357
Cdd:TIGR00618  384 LQQQKtTLTQKLQSLCKELDILQREqatidtrtsafRDLQGQLAHAKKQQELQQRYAELCAAAITCTAQCEKLEKIHLQE 463
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907141596  358 FFTSANKSFDVLGKVEAYLKLLKSEGLSLPVLAAKHEELHREIKDSTATALQKgRTLISQVDSCRSRVTGIHEMMGYIQN 437
Cdd:TIGR00618  464 SAQSLKEREQQLQTKEQIHLQETRKKAVVLARLLELQEEPCPLCGSCIHPNPA-RQDIDNPGPLTRRMQRGEQTYAQLET 542
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907141596  438 RVDCLTEQCTAHEEFARKRQQLATSVDDYLRKVEMSIQEIRPILATTLDVAS-----SPSESEKILNKYLELDIQVKETA 512
Cdd:TIGR00618  543 SEEDVYHQLTSERKQRASLKEQMQEIQQSFSILTQCDNRSKEDIPNLQNITVrlqdlTEKLSEAEDMLACEQHALLRKLQ 622
                          330       340       350       360       370       380       390       400
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907141596  513 HALEAAAKIMTEKN---ELELNEVALLPLKVKWLEEELstlgRSISCRSRILQT-YVAFRKSSEEAEEQLQSLKEFYLTE 588
Cdd:TIGR00618  623 PEQDLQDVRLHLQQcsqELALKLTALHALQLTLTQERV----REHALSIRVLPKeLLASRQLALQKMQSEKEQLTYWKEM 698
                          410       420       430       440       450       460       470       480
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907141596  589 IPWKDEddaVVKCQSNSAERKWQLFlkksfltQDLSLEFLNLI-NMAKENEILN--VKNEMHIMENIMEKQTNGREELSH 665
Cdd:TIGR00618  699 LAQCQT---LLRELETHIEEYDREF-------NEIENASSSLGsDLAAREDALNqsLKELMHQARTVLKARTEAHFNNNE 768
                          490       500       510       520       530       540       550       560
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907141596  666 LRVAWYLKAIEGKPAREQWEMFKEKLTKTTHSVKLLHEVLMPVSALDLGGSLQSTSDLRRRWIAMKPQLQQLHEDVQQIT 745
Cdd:TIGR00618  769 EVTAALQTGAELSHLAAEIQFFNRLREEDTHLLKTLEAEIGQEIPSDEDILNLQCETLVQEEEQFLSRLEEKSATLGEIT 848
                          570       580       590       600
                   ....*....|....*....|....*....|....*....|....*
gi 1907141596  746 KEwevlssqgapLKEKAEQLKDLVHLHRRQRERIQEYEEILYKTV 790
Cdd:TIGR00618  849 HQ----------LLKYEECSKQLAQLTQEQAKIIQLSDKLNGINQ 883
IgI_Myomesin_like_C cd05737
C-terminal immunoglobulin (Ig)-like domain of myomesin and M-protein; member of the I-set of ...
1369-1449 2.13e-04

C-terminal immunoglobulin (Ig)-like domain of myomesin and M-protein; member of the I-set of Ig superfamily (IgSF) domains; The members here are composed of the C-terminal immunoglobulin (Ig)-like domain of myomesin and M-protein (also known as myomesin-2). Myomesin and M-protein are both structural proteins localized to the M-band, a transverse structure in the center of the sarcomere, and are candidates for M-band bridges. Both proteins are modular, consisting mainly of repetitive Ig-like and fibronectin type III (FnIII) domains. Myomesin is expressed in all types of vertebrate striated muscle; M-protein has a muscle-type specific expression pattern. Myomesin is present in both slow and fast fibers; M-protein is present only in fast fibers. It has been suggested that myomesin acts as a molecular spring with alternative splicing as a means of modifying its elasticity.


Pssm-ID: 319300  Cd Length: 92  Bit Score: 41.81  E-value: 2.13e-04
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907141596 1369 VTVMEGSPVTLEVEVTGFPEPTLTWFKKGQKLCADGHLQVLHKDTKHSVF-IPKVCEADAGLYVAQAQNSSGTLSSKAIL 1447
Cdd:cd05737     11 VTIMEGKTLNLTCNVWGDPPPEVSWLKNDQALAFLDHCNLKVEAGRTVYFtINGVSSEDSGKYGLVVKNKYGSETSDVTV 90

                   ..
gi 1907141596 1448 HV 1449
Cdd:cd05737     91 SV 92
IgI_7_Dscam cd20954
Seventh immunoglobulin domain of the Drosophila melanogaster Dscam protein, and similar ...
1371-1447 3.35e-04

Seventh immunoglobulin domain of the Drosophila melanogaster Dscam protein, and similar domains; a member of the I-set of IgSF domains; The members here are composed of the seventh immunoglobulin domain of the Drosophila melanogaster Down syndrome cell adhesion molecule (DSCAM) protein and similar proteins. Down syndrome cell adhesion molecule (DSCAM) is a cell adhesion molecule that plays critical roles in neural development, including axon guidance and branching, axon target recognition, self-avoidance and synaptic formation. DSCAM belongs to the immunoglobulin superfamily and contributes to defects in the central nervous system in Down syndrome patients. Vertebrate DSCAMs differ from Drosophila Dscam1 in that they lack the extensive alternative splicing that occurs in the insect gene. Drosophila melanogaster Dscam has 38,016 isoforms generated by the alternative splicing of four variable exon clusters, which allows every neuron in the fly to display a distinctive set of Dscam proteins on its cell surface. Drosophila Dscam1 is a cell-surface protein that plays important roles in neural development and axon tiling of neurons. It is shown that thousands of isoforms bind themselves through specific homophilic (self-binding) interactions, a process which mediates cellular self-recognition. Drosophila Dscam2 is also alternatively spliced and plays a key role in the development of two visual system neurons, monopolar cells L1 and L2. This group is a member of the I-set Ig domains, having A-B-E-D strands in one beta-sheet and A'-G-F-C-C' in the other. Like the V-set Ig domains, members of the I-set have a discontinuous A strand but lack a C" strand.


Pssm-ID: 409546 [Multi-domain]  Cd Length: 96  Bit Score: 41.14  E-value: 3.35e-04
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907141596 1371 VMEGSPVTLEVEVTGFPEPTLTWfKKG--------QKLCADGHLQVLHKDTkhsVFIPKVCEADAGLYVAQAQNSSGTLS 1442
Cdd:cd20954     13 VAAGQDVMLHCQADGFPTPTVTW-KKAtgstpgeyKDLLYDPNVRILPNGT---LVFGHVQKENEGHYLCEAKNGIGSGL 88

                   ....*
gi 1907141596 1443 SKAIL 1447
Cdd:cd20954     89 SKVIF 93
Ig_C5_MyBP-C cd05894
C5 immunoglobulin (Ig) domain of cardiac myosin binding protein C (MyBP-C); The members here ...
1369-1449 3.58e-04

C5 immunoglobulin (Ig) domain of cardiac myosin binding protein C (MyBP-C); The members here are composed of the C5 immunoglobulin (Ig) domain of cardiac myosin binding protein C (MyBP-C). MyBP-C consists of repeated domains, Ig and fibronectin type 3, and various linkers. Three isoforms of MYBP-C exist: slow-skeletal (ssMyBP-C), fast-skeletal (fsMyBP-C), and cardiac (cMyBP-C). cMYBP-C has insertions between and inside domains and an additional cardiac-specific Ig domain at the N-terminus. For cMYBP_C an interaction has been demonstrated between this C5 domain and the Ig C8 domain.


Pssm-ID: 409475  Cd Length: 86  Bit Score: 40.98  E-value: 3.58e-04
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907141596 1369 VTVMEGSPVTLEVEVTGFPEPTLTWfKKGQKLCADGHLQVLHKDTKH-SVFIPKVCE-ADAGLYVAQAQNSSGtlSSKAI 1446
Cdd:cd05894      5 IVVVAGNKLRLDVPISGEPAPTVTW-SRGDKAFTATEGRVRVESYKDlSSFVIEGAErEDEGVYTITVTNPVG--EDHAS 81

                   ...
gi 1907141596 1447 LHV 1449
Cdd:cd05894     82 LFV 84
Ig4_L1-NrCAM_like cd04978
Fourth immunoglobulin (Ig)-like domain of L1, Ng-CAM (Neuron-glia CAM cell adhesion molecule), ...
1367-1449 3.89e-04

Fourth immunoglobulin (Ig)-like domain of L1, Ng-CAM (Neuron-glia CAM cell adhesion molecule), and NrCAM (Ng-CAM-related); The members here are composed of the fourth immunoglobulin (Ig)-like domain of L1, Ng-CAM (Neuron-glia CAM cell adhesion molecule), and NrCAM (Ng-CAM-related). These proteins belong to the L1 subfamily of cell adhesion molecules (CAMs) and are comprised of an extracellular region having six Ig-like domains and five fibronectin type III domains, a transmembrane region and an intracellular domain. These molecules are primarily expressed in the nervous system. L1 is associated with an X-linked recessive disorder, X-linked hydrocephalus, MASA syndrome, or spastic paraplegia type 1, that involves abnormalities of axonal growth.


Pssm-ID: 409367 [Multi-domain]  Cd Length: 89  Bit Score: 40.89  E-value: 3.89e-04
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907141596 1367 SNVTVMEGSPVTLEVEVTGFPEPTLTWFKKGQKLCADGHLQVLhKDTKHSVFIPKVCEADAGLYVAQAQNSSGTLSSKAI 1446
Cdd:cd04978      7 PSLVLSPGETGELICEAEGNPQPTITWRLNGVPIEPAPEDMRR-TVDGRTLIFSNLQPNDTAVYQCNASNVHGYLLANAF 85

                   ...
gi 1907141596 1447 LHV 1449
Cdd:cd04978     86 LHV 88
SMC_prok_B TIGR02168
chromosome segregation protein SMC, common bacterial type; SMC (structural maintenance of ...
219-946 5.28e-04

chromosome segregation protein SMC, common bacterial type; SMC (structural maintenance of chromosomes) proteins bind DNA and act in organizing and segregating chromosomes for partition. SMC proteins are found in bacteria, archaea, and eukaryotes. This family represents the SMC protein of most bacteria. The smc gene is often associated with scpB (TIGR00281) and scpA genes, where scp stands for segregation and condensation protein. SMC was shown (in Caulobacter crescentus) to be induced early in S phase but present and bound to DNA throughout the cell cycle. [Cellular processes, Cell division, DNA metabolism, Chromosome-associated proteins]


Pssm-ID: 274008 [Multi-domain]  Cd Length: 1179  Bit Score: 44.66  E-value: 5.28e-04
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907141596  219 KIDSLLELLQDRRRQLD---KYLQQQRQELSQVLQlclwDQQENQVSSwfQKAIRDLQEQSLGASlSDNRELICKHEDLI 295
Cdd:TIGR02168  299 RLEQQKQILRERLANLErqlEELEAQLEELESKLD----ELAEELAEL--EEKLEELKEELESLE-AELEELEAELEELE 371
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907141596  296 VKAKEWDSAVEKLKSQALGILLSKDLAGKEHLQLSNQkLNRLQEEFGRLMVERKAwlsmANDFFTSANKSfdvlgKVEAY 375
Cdd:TIGR02168  372 SRLEELEEQLETLRSKVAQLELQIASLNNEIERLEAR-LERLEDRRERLQQEIEE----LLKKLEEAELK-----ELQAE 441
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907141596  376 LKLLKSEGLSLPVLAAKHEELHREIKDSTATALQKGRTLISQVDSCRSRVTGIHEMMGyiqnrvdclteqctAHEEFARk 455
Cdd:TIGR02168  442 LEELEEELEELQEELERLEEALEELREELEEAEQALDAAERELAQLQARLDSLERLQE--------------NLEGFSE- 506
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907141596  456 rqqlatsvddYLRKVEMSIQEIRPILATTLDVASSPSESEKILNKYLELDIQ--VKETAHALEAAAKIMTEKNeleLNEV 533
Cdd:TIGR02168  507 ----------GVKALLKNQSGLSGILGVLSELISVDEGYEAAIEAALGGRLQavVVENLNAAKKAIAFLKQNE---LGRV 573
                          330       340       350       360       370       380       390       400
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907141596  534 ALLPLKVkWLEEELSTLGRSISCRSRilqTYVAFRKSSEEAEEQLQSLKEFYLTEIpwkdeddAVVKCQSNSAERKWQLF 613
Cdd:TIGR02168  574 TFLPLDS-IKGTEIQGNDREILKNIE---GFLGVAKDLVKFDPKLRKALSYLLGGV-------LVVDDLDNALELAKKLR 642
                          410       420       430       440       450       460       470       480
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907141596  614 LKKSFLTQDL------------SLEFLNLInMAKENEILNVKNEMHIMEN-----------IMEKQTNGREELSHLRVAW 670
Cdd:TIGR02168  643 PGYRIVTLDGdlvrpggvitggSAKTNSSI-LERRREIEELEEKIEELEEkiaelekalaeLRKELEELEEELEQLRKEL 721
                          490       500       510       520       530       540       550       560
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907141596  671 YLKAIEGKPAREQWEMFKEKLTKTTHSVKLLHEVLMPVSAlDLGGSLQSTSDLRRRWIAMKPQLQQLHEDVQQITKEWEV 750
Cdd:TIGR02168  722 EELSRQISALRKDLARLEAEVEQLEERIAQLSKELTELEA-EIEELEERLEEAEEELAEAEAEIEELEAQIEQLKEELKA 800
                          570       580       590       600       610       620       630       640
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907141596  751 LSSQGAPLKEKAEQLKDLVHlhrRQRERIQEYEEilyktvQFHQVKEELVHLIKprELELLAQPMELASSEEVQMQLGRS 830
Cdd:TIGR02168  801 LREALDELRAELTLLNEEAA---NLRERLESLER------RIAATERRLEDLEE--QIEELSEDIESLAAEIEELEELIE 869
                          650       660       670       680       690       700       710       720
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907141596  831 QERRAHVDHLhqlaltlgvDIISSVQQPLKEsfkdikkkfnnLKFNYSKKNEKSRNLKtlqyqiQQVDTYAEKIQALRKK 910
Cdd:TIGR02168  870 ELESELEALL---------NERASLEEALAL-----------LRSELEELSEELRELE------SKRSELRRELEELREK 923
                          730       740       750       760
                   ....*....|....*....|....*....|....*....|..
gi 1907141596  911 MEKVNNKTS------DSFLSYPSNKANMLSEAMEDLKKNVDD 946
Cdd:TIGR02168  924 LAQLELRLEglevriDNLQERLSEEYSLTLEEAEALENKIED 965
IgI_APEG-1_like cd20975
Immunoglobulin-like domain of human Aortic Preferentially Expressed Protein-1 (APEG-1) and ...
1360-1449 6.04e-04

Immunoglobulin-like domain of human Aortic Preferentially Expressed Protein-1 (APEG-1) and similar proteins; a member of the I-set of IgSF domains; The members here are composed of the immunoglobulin I-set (IgI) domain of the Human Aortic Preferentially Expressed Protein-1 (APEG-1) and similar proteins. APEG-1 is a novel specific smooth muscle differentiation marker predicted to play a role in the growth and differentiation of arterial smooth muscle cells (SMCs). The Ig superfamily (IgSF) is a heterogenous group of proteins, built on a common fold comprised of a sandwich of two beta sheets. IgSF domains can be divided into 4 main classes based on their structures and sequences: the Variable (V), Constant 1 (C1), Constant 2 (C2), and Intermediate (I) sets. Unlike the V-set, one of the distinctive features of I-set domains is the lack of a C" strand. The structure of the human APEG-1 lacks this strand and thus it belongs to the I-set of the IgSF. I-set domains are found in several cell adhesion molecules (such as VCAM, ICAM, and MADCAM), and are also present in numerous other diverse protein families, including several tyrosine-protein kinase receptors, the hemolymph protein hemolin, the muscle proteins titin, telokin, and twitchin, the neuronal adhesion molecule axonin-1, and the signaling molecule semaphorin 4D that is involved in axonal guidance, immune function and angiogenesis.


Pssm-ID: 409567  Cd Length: 91  Bit Score: 40.53  E-value: 6.04e-04
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907141596 1360 PHFSRLLSNVTVMEGSPVTLEVEVTGFPEPTLTWFKKGQKLCADG-HLQVLHKDTKHSVFIPKVCEADAGLYVAQAQNSS 1438
Cdd:cd20975      1 PTFKVSLMDQSVREGQDVIMSIRVQGEPKPVVSWLRNRQPVRPDQrRFAEEAEGGLCRLRILAAERGDAGFYTCKAVNEY 80
                           90
                   ....*....|.
gi 1907141596 1439 GTLSSKAILHV 1449
Cdd:cd20975     81 GARQCEARLEV 91
IgI_4_Dscam cd20956
Fourth immunoglobulin domain of the Drosophila melanogaster Dscam protein, and similar domains; ...
1359-1449 7.09e-04

Fourth immunoglobulin domain of the Drosophila melanogaster Dscam protein, and similar domains; a member of the I-set of IgSF domains; The members here are composed of the fourth immunoglobulin domain of the Drosophila melanogaster Down syndrome cell adhesion molecule (DSCAM) protein and similar proteins. Down syndrome cell adhesion molecule (DSCAM) is a cell adhesion molecule that plays critical roles in neural development, including axon guidance and branching, axon target recognition, self-avoidance and synaptic formation. DSCAM belongs to the immunoglobulin superfamily and contributes to defects in the central nervous system in Down syndrome patients. Vertebrate DSCAMs differ from Drosophila Dscam1 in that they lack the extensive alternative splicing that occurs in the insect gene. Drosophila melanogaster Dscam has 38,016 isoforms generated by the alternative splicing of four variable exon clusters, which allows every neuron in the fly to display a distinctive set of Dscam proteins on its cell surface. Drosophila Dscam1 is a cell-surface protein that plays important roles in neural development and axon tiling of neurons. It is shown that thousands of isoforms bind themselves through specific homophilic (self-binding) interactions, a process which mediates cellular self-recognition. Drosophila Dscam2 is also alternatively spliced and plays a key role in the development of two visual system neurons, monopolar cells L1 and L2. This group is a member of the I-set Ig domains, having A-B-E-D strands in one beta-sheet and A'-G-F-C-C' in the other. Like the V-set Ig domains, members of the I-set have a discontinuous A strand but lack a C" strand.


Pssm-ID: 409548 [Multi-domain]  Cd Length: 96  Bit Score: 40.24  E-value: 7.09e-04
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907141596 1359 APHFSRLLSNVTVMEGSPVTLEVEVTGFPEPTLTWFKKGQKLCADGHLQV-----LHKDTKHSVFIPKVCEADAGLYVAQ 1433
Cdd:cd20956      1 APVLLETFSEQTLQPGPSVSLKCVASGNPLPQITWTLDGFPIPESPRFRVgdyvtSDGDVVSYVNISSVRVEDGGEYTCT 80
                           90
                   ....*....|....*.
gi 1907141596 1434 AQNSSGTLSSKAILHV 1449
Cdd:cd20956     81 ATNDVGSVSHSARINV 96
IgI_NCAM-2 cd05870
Immunoglobulin (Ig)-like I-set domain of Neural Cell Adhesion Molecule 2 (NCAM-2); The members ...
1360-1439 1.17e-03

Immunoglobulin (Ig)-like I-set domain of Neural Cell Adhesion Molecule 2 (NCAM-2); The members here are composed of the fourth Ig domain of Neural Cell Adhesion Molecule NCAM-2 (also known as OCAM/mamFas II and RNCAM). NCAM-2 is organized similarly to NCAM, including five N-terminal Ig-like domains and two fibronectin type III domains. NCAM-2 is differentially expressed in the developing and mature olfactory epithelium (OE), and may function like NCAM, as an adhesion molecule. One of the unique features of I-set domains is the lack of a C" strand. The structures of this group show that the Ig domain lacks this strand and thus is a member of the I-set of Ig domains.


Pssm-ID: 143278 [Multi-domain]  Cd Length: 98  Bit Score: 39.57  E-value: 1.17e-03
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907141596 1360 PHFSRLlSNVTVMEGSPVTLEVEVTGFPEPTLTW--------FKKGQKlCADGHLQVLHKDTKHSVFIPKVCEADAGLYV 1431
Cdd:cd05870      3 PHIIQL-KNETTVENGAATLSCKAEGEPIPEITWkrasdghtFSEGDK-SPDGRIEVKGQHGESSLHIKDVKLSDSGRYD 80

                   ....*...
gi 1907141596 1432 AQAQNSSG 1439
Cdd:cd05870     81 CEAASRIG 88
sbcc TIGR00618
exonuclease SbcC; All proteins in this family for which functions are known are part of an ...
224-925 1.68e-03

exonuclease SbcC; All proteins in this family for which functions are known are part of an exonuclease complex with sbcD homologs. This complex is involved in the initiation of recombination to regulate the levels of palindromic sequences in DNA. This family is based on the phylogenomic analysis of JA Eisen (1999, Ph.D. Thesis, Stanford University). [DNA metabolism, DNA replication, recombination, and repair]


Pssm-ID: 129705 [Multi-domain]  Cd Length: 1042  Bit Score: 43.03  E-value: 1.68e-03
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907141596  224 LELLQDRRRQLDKYLQQQRQELSQVLQLCLWDQQENQVSSWFQKAIRDLQE-QSLGASLSDNRELI---CKHEDLIVKAK 299
Cdd:TIGR00618  221 KQVLEKELKHLREALQQTQQSHAYLTQKREAQEEQLKKQQLLKQLRARIEElRAQEAVLEETQERInraRKAAPLAAHIK 300
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907141596  300 ewdsAVEKLKSQALGIL--LSKDLAGKEHLQLSNQKLNRLQEEFGRLMVERKAWLSMANDFFTSANKsfdvlgkveaylk 377
Cdd:TIGR00618  301 ----AVTQIEQQAQRIHteLQSKMRSRAKLLMKRAAHVKQQSSIEEQRRLLQTLHSQEIHIRDAHEV------------- 363
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907141596  378 llksEGLSLPVLAAKHEELHR-----EIKDSTATALQKGRTLISQVDSCRSRVTGIHEMMGYIQNRVDCLTEQCTAHEEF 452
Cdd:TIGR00618  364 ----ATSIREISCQQHTLTQHihtlqQQKTTLTQKLQSLCKELDILQREQATIDTRTSAFRDLQGQLAHAKKQQELQQRY 439
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907141596  453 ARKRQQLATSVDDYLRKVEMSIQEIRpilattldvaSSPSESEKILNKYLELDIQVKETaHALEAAAKIMTEKNELELNE 532
Cdd:TIGR00618  440 AELCAAAITCTAQCEKLEKIHLQESA----------QSLKEREQQLQTKEQIHLQETRK-KAVVLARLLELQEEPCPLCG 508
                          330       340       350       360       370       380       390       400
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907141596  533 VALLPLKVKWLEEELSTLGRSIscrSRILQTYVAFRKSSEEAEEQLQSlkefyLTEIPWKDEDDAVVKCQSNSA-ERKWQ 611
Cdd:TIGR00618  509 SCIHPNPARQDIDNPGPLTRRM---QRGEQTYAQLETSEEDVYHQLTS-----ERKQRASLKEQMQEIQQSFSIlTQCDN 580
                          410       420       430       440       450       460       470       480
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907141596  612 LFLKKSFLTQDLSLEFLNLINMAKENEIlNVKNEMHIMENIMEKQTNGREELSHLR-VAWYLKAIEGKPAREQWEMFKEK 690
Cdd:TIGR00618  581 RSKEDIPNLQNITVRLQDLTEKLSEAED-MLACEQHALLRKLQPEQDLQDVRLHLQqCSQELALKLTALHALQLTLTQER 659
                          490       500       510       520       530       540       550       560
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907141596  691 LTKTTHSVKLLHEVLMPVSALDLgGSLQSTSDLRRRWIAMKPQ----LQQLHEDVQQITKEWE----VLSSQGAPLKEKA 762
Cdd:TIGR00618  660 VREHALSIRVLPKELLASRQLAL-QKMQSEKEQLTYWKEMLAQcqtlLRELETHIEEYDREFNeienASSSLGSDLAARE 738
                          570       580       590       600       610       620       630       640
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907141596  763 EQLKDLVH-LHRRQRERIQEYEEIlyktvQFHQVKEELVHLIKPRELELLAQPMEL---ASSEEVQMQLGRSQERRAHVD 838
Cdd:TIGR00618  739 DALNQSLKeLMHQARTVLKARTEA-----HFNNNEEVTAALQTGAELSHLAAEIQFfnrLREEDTHLLKTLEAEIGQEIP 813
                          650       660       670       680       690       700       710       720
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907141596  839 HlHQLALTLGVDIISSVQQPLKESFKDIKKKFNNLKFNYSKKNEKSRNLKTLQYQIQQVDTYAEKIQALRkkmEKVNNKT 918
Cdd:TIGR00618  814 S-DEDILNLQCETLVQEEEQFLSRLEEKSATLGEITHQLLKYEECSKQLAQLTQEQAKIIQLSDKLNGIN---QIKIQFD 889

                   ....*..
gi 1907141596  919 SDSFLSY 925
Cdd:TIGR00618  890 GDALIKF 896
SCP-1 pfam05483
Synaptonemal complex protein 1 (SCP-1); Synaptonemal complex protein 1 (SCP-1) is the major ...
304-798 2.83e-03

Synaptonemal complex protein 1 (SCP-1); Synaptonemal complex protein 1 (SCP-1) is the major component of the transverse filaments of the synaptonemal complex. Synaptonemal complexes are structures that are formed between homologous chromosomes during meiotic prophase.


Pssm-ID: 114219 [Multi-domain]  Cd Length: 787  Bit Score: 42.40  E-value: 2.83e-03
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907141596  304 AVEKLKSQA------LGILLSKDLAGKEHLQLSNQK-LNRLQEEFGRLMVERKAWLSMANDFFTSANKSFDVLGKVEAYL 376
Cdd:pfam05483  198 AFEELRVQAenarleMHFKLKEDHEKIQHLEEEYKKeINDKEKQVSLLLIQITEKENKMKDLTFLLEESRDKANQLEEKT 277
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907141596  377 KLlKSEglSLPVLAAKHEELHREIKDSTATALQKGRTLISQVDSCRSRVTGIHEMMGYIQNRVDCLTEQCTAHEEFARKR 456
Cdd:pfam05483  278 KL-QDE--NLKELIEKKDHLTKELEDIKMSLQRSMSTQKALEEDLQIATKTICQLTEEKEAQMEELNKAKAAHSFVVTEF 354
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907141596  457 QQLATSVDDYLRKVEMSIQEIRPILAT-TLDVASSPSESEKILNKYLELDIQVKETAHALEAAAKIMTEKNELELneval 535
Cdd:pfam05483  355 EATTCSLEELLRTEQQRLEKNEDQLKIiTMELQKKSSELEEMTKFKNNKEVELEELKKILAEDEKLLDEKKQFEK----- 429
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907141596  536 LPLKVKWLEEELSTLGRSISCRSRILQTYVAFRKSSEEA-EEQLQSLKefylTEIPWKDEDDAVVKCQSNsaerkwQLFL 614
Cdd:pfam05483  430 IAEELKGKEQELIFLLQAREKEIHDLEIQLTAIKTSEEHyLKEVEDLK----TELEKEKLKNIELTAHCD------KLLL 499
                          330       340       350       360       370       380       390       400
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907141596  615 KKSFLTQDLSLEFLNLINmaKENEILNVKNEMHIM----ENIMEKQTNGREELSHLRvawylkaiegKPAREQWEMFKEK 690
Cdd:pfam05483  500 ENKELTQEASDMTLELKK--HQEDIINCKKQEERMlkqiENLEEKEMNLRDELESVR----------EEFIQKGDEVKCK 567
                          410       420       430       440       450       460       470       480
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907141596  691 LTKTTHSVKLLhEVLMPVSALDLGGSLQSTSDLRRRWIAMKPQLQQLHEDVQQITKEWEVLSSQGAPLKEKAEQLK-DLV 769
Cdd:pfam05483  568 LDKSEENARSI-EYEVLKKEKQMKILENKCNNLKKQIENKNKNIEELHQENKALKKKGSAENKQLNAYEIKVNKLElELA 646
                          490       500
                   ....*....|....*....|....*....
gi 1907141596  770 HLHRRQRERIQEYEeilyKTVQFHQVKEE 798
Cdd:pfam05483  647 SAKQKFEEIIDNYQ----KEIEDKKISEE 671
SPEC cd00176
Spectrin repeats, found in several proteins involved in cytoskeletal structure; family members ...
52-238 8.73e-03

Spectrin repeats, found in several proteins involved in cytoskeletal structure; family members include spectrin, alpha-actinin and dystrophin; the spectrin repeat forms a three helix bundle with the second helix interrupted by proline in some sequences; the repeats are independent folding units; tandem repeats are found in differing numbers and arrange in an antiparallel manner to form dimers; the repeats are defined by a characteristic tryptophan (W) residue in helix A and a leucine (L) at the carboxyl end of helix C and separated by a linker of 5 residues; two copies of the repeat are present here


Pssm-ID: 238103 [Multi-domain]  Cd Length: 213  Bit Score: 39.35  E-value: 8.73e-03
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907141596   52 EIGSSQDETKKLLHDHELLLAKLKALEDRVWELLREADRTAEANKAQSQVYDAMAQTLGEAWATLVSMLERRRELLGLTS 131
Cdd:cd00176     27 DYGDDLESVEALLKKHEALEAELAAHEERVEALNELGEQLIEEGHPDAEEIQERLEELNQRWEELRELAEERRQRLEEAL 106
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907141596  132 E---FFQSALEFAIKIDQAEAFLQNPHEFESTEALQSLLLLHDRHAKELLERSLDLlnksQQLTDFIEKFKCEGSTMNSE 208
Cdd:cd00176    107 DlqqFFRDADDLEQWLEEKEAALASEDLGKDLESVEELLKKHKELEEELEAHEPRL----KSLNELAEELLEEGHPDADE 182
                          170       180       190
                   ....*....|....*....|....*....|.
gi 1907141596  209 LIQGAQSSCL-KIDSLLELLQDRRRQLDKYL 238
Cdd:cd00176    183 EIEEKLEELNeRWEELLELAEERQKKLEEAL 213
SPEC cd00176
Spectrin repeats, found in several proteins involved in cytoskeletal structure; family members ...
259-458 9.06e-03

Spectrin repeats, found in several proteins involved in cytoskeletal structure; family members include spectrin, alpha-actinin and dystrophin; the spectrin repeat forms a three helix bundle with the second helix interrupted by proline in some sequences; the repeats are independent folding units; tandem repeats are found in differing numbers and arrange in an antiparallel manner to form dimers; the repeats are defined by a characteristic tryptophan (W) residue in helix A and a leucine (L) at the carboxyl end of helix C and separated by a linker of 5 residues; two copies of the repeat are present here


Pssm-ID: 238103 [Multi-domain]  Cd Length: 213  Bit Score: 39.35  E-value: 9.06e-03
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907141596  259 NQVSSWFQKAIRDLQEQSLGASLSDNRELICKHEDLIVKAKEWDSAVEKLKSQALGiLLSKDLAGKEHLQLSNQKLNRLQ 338
Cdd:cd00176     10 DELEAWLSEKEELLSSTDYGDDLESVEALLKKHEALEAELAAHEERVEALNELGEQ-LIEEGHPDAEEIQERLEELNQRW 88
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907141596  339 EEFGRLMVERKAWLSMANDFFTSANKSFDVLGKVEAYLKLLKSE--GLSLPVLAAKHEElHREIKDSTATALQKGRTLIS 416
Cdd:cd00176     89 EELRELAEERRQRLEEALDLQQFFRDADDLEQWLEEKEAALASEdlGKDLESVEELLKK-HKELEEELEAHEPRLKSLNE 167
                          170       180       190       200
                   ....*....|....*....|....*....|....*....|..
gi 1907141596  417 QVDSCRSRvtGIHEMMGYIQNRVDCLTEQCTAHEEFARKRQQ 458
Cdd:cd00176    168 LAEELLEE--GHPDADEEIEEKLEELNERWEELLELAEERQK 207
 
Blast search parameters
Data Source: Precalculated data, version = cdd.v.3.21
Preset Options:Database: CDSEARCH/cdd   Low complexity filter: no  Composition Based Adjustment: yes   E-value threshold: 0.01

References:

  • Wang J et al. (2023), "The conserved domain database in 2023", Nucleic Acids Res.51(D)384-8.
  • Lu S et al. (2020), "The conserved domain database in 2020", Nucleic Acids Res.48(D)265-8.
  • Marchler-Bauer A et al. (2017), "CDD/SPARCLE: functional classification of proteins via subfamily domain architectures.", Nucleic Acids Res.45(D)200-3.
Help | Disclaimer | Write to the Help Desk
NCBI | NLM | NIH