MyHits has reached its end of life and no longer provides data or services. Thank you for your support and trust for more than 23 years!
However, the web server will remain online in its present form until at least the end of March 2025.
To ensure the future of MyHits, we would be happy if a person or community would take over the resource or parts of it. Interested? Please contact us (myhits [at] sib.swiss).

Pagni M, Ioannidis V, Cerutti L, Zahn-Zabal M, Jongeneel CV, Hau J, Martin O, Kuznetsov D, Falquet L.
MyHits: improvements to an interactive resource for analyzing protein sequences.
Nucleic Acids Res. 2007 Jul; 35(Web Server issue):W433-7

Description: RecName: Full=Major sperm protein 19/31/40/45/50/51/53/59/61/65/81/113/142; Short=MSP;
MyHits synonyms: MSP19_CAEEL, P53017, 69F69631BEA5B147
Match map segments: iprf:MSP, ipfam:Motile_Sperm
Legends: 1, INIT_MET Removed. {ECO:0000250}; 2, N-acetylalanine. {ECO:0000250}; 3, STRAND {ECO:0000244|PDB:1GRW}; 4, TURN {ECO:0000244|PDB:1GRW}; 5, HELIX {ECO:0000244|PDB:1GRW}.
ID   MSP19_CAEEL             Reviewed;         127 AA.
AC   P53017;
DT   01-OCT-1996, integrated into UniProtKB/Swiss-Prot.
DT   23-JAN-2007, sequence version 2.
DT   12-APR-2017, entry version 131.
DE   RecName: Full=Major sperm protein 19/31/40/45/50/51/53/59/61/65/81/113/142;
DE            Short=MSP;
GN   Name=msp-19; ORFNames=F36H12.7;
GN   and
GN   Name=msp-31; ORFNames=R05F9.13;
GN   and
GN   Name=msp-40; ORFNames=C33F10.9;
GN   and
GN   Name=msp-45; ORFNames=F58A6.8;
GN   and
GN   Name=msp-50; ORFNames=C34F11.4;
GN   and
GN   Name=msp-51; ORFNames=ZK354.5;
GN   and
GN   Name=msp-53; ORFNames=R13H9.4;
GN   and
GN   Name=msp-59; ORFNames=ZK354.11;
GN   and
GN   Name=msp-64; ORFNames=ZK1248.6;
GN   and
GN   Name=msp-65; ORFNames=ZK354.1;
GN   and
GN   Name=msp-81; ORFNames=K07F5.1;
GN   and
GN   Name=msp-113; ORFNames=ZK354.4;
GN   and
GN   Name=msp-142; ORFNames=K05F1.2;
OS   Caenorhabditis elegans.
OC   Eukaryota; Metazoa; Ecdysozoa; Nematoda; Chromadorea; Rhabditida;
OC   Rhabditoidea; Rhabditidae; Peloderinae; Caenorhabditis.
OX   NCBI_TaxID=6239;
RN   [1]
RP   NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
RC   STRAIN=Bristol N2;
RX   PubMed=9851916; DOI=10.1126/science.282.5396.2012;
RG   The C. elegans sequencing consortium;
RT   "Genome sequence of the nematode C. elegans: a platform for
RT   investigating biology.";
RL   Science 282:2012-2018(1998).
RN   [2]
RP   X-RAY CRYSTALLOGRAPHY (2.6 ANGSTROMS), AND SUBUNIT.
RX   PubMed=12051923; DOI=10.1016/S0022-2836(02)00294-2;
RA   Baker A.M., Roberts T.M., Stewart M.;
RT   "2.6 A resolution crystal structure of helices of the motile major
RT   sperm protein (MSP) of Caenorhabditis elegans.";
RL   J. Mol. Biol. 319:491-499(2002).
CC   -!- FUNCTION: Central component in molecular interactions underlying
CC       sperm crawling. Forms an extensive filament system that extends
CC       from sperm villipoda, along the leading edge of the pseudopod.
CC   -!- SUBUNIT: Helical subfilaments are built from MSP dimers; filaments
CC       are formed from two subfilaments coiling round one another; and
CC       filaments themselves supercoil to produce bundles.
CC       {ECO:0000269|PubMed:12051923}.
CC   -!- SUBCELLULAR LOCATION: Cell projection, pseudopodium. Cytoplasm,
CC       cytoskeleton.
CC   -!- TISSUE SPECIFICITY: Sperm.
CC   -!- MISCELLANEOUS: Around 30 MSP isoforms may exist in C.elegans.
CC   -----------------------------------------------------------------------
CC   Copyrighted by the UniProt Consortium, see http://www.uniprot.org/terms
CC   Distributed under the Creative Commons Attribution-NoDerivs License
CC   -----------------------------------------------------------------------
DR   EMBL; FO080275; CCD62525.1; -; Genomic_DNA.
DR   EMBL; FO080689; CCD65824.1; -; Genomic_DNA.
DR   EMBL; FO080690; CCD65838.1; -; Genomic_DNA.
DR   EMBL; FO080763; CCD66514.1; -; Genomic_DNA.
DR   EMBL; FO081106; CCD69142.1; -; Genomic_DNA.
DR   EMBL; FO081120; CCD69279.1; -; Genomic_DNA.
DR   EMBL; FO081309; CCD70664.1; -; Genomic_DNA.
DR   EMBL; FO081310; CCD70677.1; -; Genomic_DNA.
DR   EMBL; FO081310; CCD70681.1; -; Genomic_DNA.
DR   EMBL; FO081310; CCD70682.1; -; Genomic_DNA.
DR   EMBL; FO081310; CCD70687.1; -; Genomic_DNA.
DR   EMBL; FO081574; CCD72508.1; -; Genomic_DNA.
DR   PIR; A88165; A88165.
DR   PIR; B88689; B88689.
DR   PIR; C88688; C88688.
DR   PIR; C88689; C88689.
DR   PIR; D88164; D88164.
DR   PIR; E88134; E88134.
DR   PIR; F88138; F88138.
DR   PIR; G88145; G88145.
DR   PIR; G88686; G88686.
DR   PIR; H88146; H88146.
DR   PIR; H88688; H88688.
DR   PIR; H88792; H88792.
DR   RefSeq; NP_494865.1; NM_062464.6.
DR   RefSeq; NP_494898.1; NM_062497.6.
DR   RefSeq; NP_494959.1; NM_062558.8.
DR   RefSeq; NP_494972.1; NM_062571.8.
DR   RefSeq; NP_495144.1; NM_062743.4.
DR   RefSeq; NP_495149.1; NM_062748.3.
DR   RefSeq; NP_500714.1; NM_068313.5.
DR   RefSeq; NP_500760.1; NM_068359.5.
DR   RefSeq; NP_500771.1; NM_068370.7.
DR   RefSeq; NP_500773.1; NM_068372.6.
DR   RefSeq; NP_500778.1; NM_068377.7.
DR   RefSeq; NP_500780.1; NM_068379.4.
DR   RefSeq; NP_501759.1; NM_069358.4.
DR   UniGene; Cel.14684; -.
DR   UniGene; Cel.14687; -.
DR   UniGene; Cel.14690; -.
DR   UniGene; Cel.18187; -.
DR   UniGene; Cel.18261; -.
DR   UniGene; Cel.21338; -.
DR   UniGene; Cel.21355; -.
DR   UniGene; Cel.21393; -.
DR   UniGene; Cel.21394; -.
DR   UniGene; Cel.30510; -.
DR   UniGene; Cel.34224; -.
DR   UniGene; Cel.34229; -.
DR   UniGene; Cel.6144; -.
DR   PDB; 1GRW; X-ray; 2.60 A; A/B/C/D=2-127.
DR   PDBsum; 1GRW; -.
DR   ProteinModelPortal; P53017; -.
DR   SMR; P53017; -.
DR   BioGrid; 39322; 1.
DR   BioGrid; 39324; 1.
DR   BioGrid; 56828; 1.
DR   DIP; DIP-25397N; -.
DR   DIP; DIP-26837N; -.
DR   IntAct; P53017; 1.
DR   MINT; MINT-1056924; -.
DR   STRING; 6239.ZK354.5; -.
DR   EPD; P53017; -.
DR   PaxDb; P53017; -.
DR   PeptideAtlas; P53017; -.
DR   PRIDE; P53017; -.
DR   EnsemblMetazoa; C33F10.9; C33F10.9; WBGene00003435.
DR   EnsemblMetazoa; C34F11.4; C34F11.4; WBGene00003443.
DR   EnsemblMetazoa; F36H12.7; F36H12.7; WBGene00003426.
DR   EnsemblMetazoa; F58A6.8; F58A6.8; WBGene00003438.
DR   EnsemblMetazoa; K05F1.2; K05F1.2; WBGene00003469.
DR   EnsemblMetazoa; K07F5.1; K07F5.1; WBGene00003467.
DR   EnsemblMetazoa; R05F9.13; R05F9.13; WBGene00003429.
DR   EnsemblMetazoa; R13H9.4; R13H9.4; WBGene00003446.
DR   EnsemblMetazoa; ZK1248.6; ZK1248.6; WBGene00003457.
DR   EnsemblMetazoa; ZK354.1; ZK354.1; WBGene00003458.
DR   EnsemblMetazoa; ZK354.11; ZK354.11; WBGene00003452.
DR   EnsemblMetazoa; ZK354.4; ZK354.4; WBGene00003468.
DR   EnsemblMetazoa; ZK354.5; ZK354.5; WBGene00003444.
DR   GeneID; 173830; -.
DR   GeneID; 173849; -.
DR   GeneID; 173884; -.
DR   GeneID; 173890; -.
DR   GeneID; 173981; -.
DR   GeneID; 173983; -.
DR   GeneID; 177275; -.
DR   GeneID; 177305; -.
DR   GeneID; 177309; -.
DR   GeneID; 177311; -.
DR   GeneID; 177827; -.
DR   GeneID; 191292; -.
DR   GeneID; 259801; -.
DR   KEGG; cel:CELE_C33F10.9; -.
DR   KEGG; cel:CELE_C34F11.4; -.
DR   KEGG; cel:CELE_F36H12.7; -.
DR   KEGG; cel:CELE_F58A6.8; -.
DR   KEGG; cel:CELE_K05F1.2; -.
DR   KEGG; cel:CELE_K07F5.1; -.
DR   KEGG; cel:CELE_R05F9.13; -.
DR   KEGG; cel:CELE_R13H9.4; -.
DR   KEGG; cel:CELE_ZK1248.6; -.
DR   KEGG; cel:CELE_ZK354.1; -.
DR   KEGG; cel:CELE_ZK354.11; -.
DR   KEGG; cel:CELE_ZK354.4; -.
DR   KEGG; cel:CELE_ZK354.5; -.
DR   UCSC; C33F10.9; c. elegans.
DR   CTD; 173830; -.
DR   CTD; 173849; -.
DR   CTD; 173884; -.
DR   CTD; 173890; -.
DR   CTD; 173981; -.
DR   CTD; 173983; -.
DR   CTD; 177275; -.
DR   CTD; 177305; -.
DR   CTD; 177309; -.
DR   CTD; 177311; -.
DR   CTD; 177827; -.
DR   CTD; 191292; -.
DR   CTD; 259801; -.
DR   WormBase; C33F10.9; CE02806; WBGene00003435; msp-40.
DR   WormBase; C34F11.4; CE02806; WBGene00003443; msp-50.
DR   WormBase; F36H12.7; CE02806; WBGene00003426; msp-19.
DR   WormBase; F58A6.8; CE02806; WBGene00003438; msp-45.
DR   WormBase; K05F1.2; CE02806; WBGene00003469; msp-142.
DR   WormBase; K07F5.1; CE02806; WBGene00003467; msp-81.
DR   WormBase; R05F9.13; CE02806; WBGene00003429; msp-31.
DR   WormBase; R13H9.4; CE02806; WBGene00003446; msp-53.
DR   WormBase; ZK1248.6; CE02806; WBGene00003457; msp-64.
DR   WormBase; ZK354.1; CE02806; WBGene00003458; msp-65.
DR   WormBase; ZK354.11; CE02806; WBGene00003452; msp-59.
DR   WormBase; ZK354.4; CE02806; WBGene00003468; msp-113.
DR   WormBase; ZK354.5; CE02806; WBGene00003444; msp-51.
DR   eggNOG; ENOG410JBRH; Eukaryota.
DR   eggNOG; ENOG410ZFGP; LUCA.
DR   GeneTree; ENSGT00730000111368; -.
DR   HOGENOM; HOG000015566; -.
DR   InParanoid; P53017; -.
DR   OMA; RVFAMAQ; -.
DR   PhylomeDB; P53017; -.
DR   EvolutionaryTrace; P53017; -.
DR   PRO; PR:P53017; -.
DR   Proteomes; UP000001940; Chromosome II.
DR   Proteomes; UP000001940; Chromosome IV.
DR   Bgee; WBGene00003426; -.
DR   GO; GO:0005737; C:cytoplasm; IEA:UniProtKB-KW.
DR   GO; GO:0005856; C:cytoskeleton; IEA:UniProtKB-SubCell.
DR   GO; GO:0031143; C:pseudopodium; IEA:UniProtKB-SubCell.
DR   Gene3D; 2.60.40.10; -; 1.
DR   InterPro; IPR013783; Ig-like_fold.
DR   InterPro; IPR000535; MSP_dom.
DR   InterPro; IPR008962; PapD-like.
DR   Pfam; PF00635; Motile_Sperm; 1.
DR   SUPFAM; SSF49354; SSF49354; 1.
DR   PROSITE; PS50202; MSP; 1.
PE   1: Evidence at protein level;
KW   3D-structure; Acetylation; Cell projection; Complete proteome;
KW   Cytoplasm; Cytoskeleton; Reference proteome.
FT   INIT_MET      1      1       Removed. {ECO:0000250}.
FT   CHAIN         2    127       Major sperm protein
FT                                19/31/40/45/50/51/53/59/61/65/81/113/142.
FT                                /FTId=PRO_0000213437.
FT   DOMAIN        9    126       MSP. {ECO:0000255|PROSITE-
FT                                ProRule:PRU00132}.
FT   MOD_RES       2      2       N-acetylalanine. {ECO:0000250}.
FT   STRAND       10     16       {ECO:0000244|PDB:1GRW}.
FT   STRAND       18     22       {ECO:0000244|PDB:1GRW}.
FT   STRAND       28     36       {ECO:0000244|PDB:1GRW}.
FT   STRAND       38     40       {ECO:0000244|PDB:1GRW}.
FT   STRAND       42     49       {ECO:0000244|PDB:1GRW}.
FT   TURN         51     53       {ECO:0000244|PDB:1GRW}.
FT   STRAND       54     63       {ECO:0000244|PDB:1GRW}.
FT   STRAND       68     75       {ECO:0000244|PDB:1GRW}.
FT   HELIX        80     82       {ECO:0000244|PDB:1GRW}.
FT   STRAND       89     96       {ECO:0000244|PDB:1GRW}.
FT   HELIX       107    110       {ECO:0000244|PDB:1GRW}.
FT   STRAND      112    114       {ECO:0000244|PDB:1GRW}.
FT   STRAND      117    126       {ECO:0000244|PDB:1GRW}.
CC   --------------------------------------------------------------------------
CC   The following FT lines are automated annotations from the MyHits database.
CC   --------------------------------------------------------------------------
FT   MYHIT         9    126       iprf:MSP [T]
FT   MYHIT        10    113       ipfam:Motile_Sperm [T]
SQ   SEQUENCE   127 AA;  14209 MW;  69F69631BEA5B147 CRC64;
     MAQSVPPGDI QTQPGTKIVF NAPYDDKHTY HIKVINSSAR RIGYGIKTTN MKRLGVDPPC
     GVLDPKEAVL LAVSCDAFAF GQEDTNNDRI TVEWTNTPDG AAKQFRREWF QGDGMVRRKN
     LPIEYNP
//
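
The FT coordinates in the entry above are 1-based and inclusive, so mapping them onto the SQ block takes a small offset adjustment. The following is a minimal sketch, assuming the sequence block is pasted in as a string; the helper names `join_sequence` and `feature_slice` are hypothetical and not part of MyHits or UniProt tooling.

```python
# Sequence block copied from the SQ section of the entry above.
SQ_BLOCK = """
     MAQSVPPGDI QTQPGTKIVF NAPYDDKHTY HIKVINSSAR RIGYGIKTTN MKRLGVDPPC
     GVLDPKEAVL LAVSCDAFAF GQEDTNNDRI TVEWTNTPDG AAKQFRREWF QGDGMVRRKN
     LPIEYNP
"""


def join_sequence(block: str) -> str:
    """Remove the spaces and line breaks used in SQ blocks (hypothetical helper)."""
    return "".join(block.split())


def feature_slice(sequence: str, start: int, end: int) -> str:
    """Return residues start..end given 1-based inclusive FT coordinates (hypothetical helper)."""
    return sequence[start - 1:end]


sequence = join_sequence(SQ_BLOCK)

# Coordinates taken from the FT lines of the entry.
msp_domain = feature_slice(sequence, 9, 126)    # FT DOMAIN 9 126
mature_chain = feature_slice(sequence, 2, 127)  # FT CHAIN 2 127

print(len(sequence), len(msp_domain), len(mature_chain))
```

The total length can be checked against the `127 AA` stated on the ID and SQ lines, and the first residue of the mature chain against the N-acetylalanine annotated at position 2.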