Legends: 1, SIGNAL {ECO:0000255}; 2, REPEAT PSA 1; 3, REPEAT PSA 2; 4, REPEAT PSA 3; 5, REPEAT PSA 4; 6, REPEAT PSA 5; 7, REPEAT PSA 6; 8, REPEAT PSA 7; 9, REPEAT PSA 8; 10, REPEAT PSA 9; 11, REPEAT PSA 10; 12, REPEAT PSA 11; 13, REPEAT PSA 12; 14, REPEAT PSA 13; 15, REPEAT PSA 14; 16, REPEAT PSA 15; 17, REPEAT PSA 16; 18, REPEAT PSA 17; 19, REPEAT PSA 18; 20, REPEAT PSA 19; 21, REPEAT PSA 20; 22, REPEAT PSA 21; 23, REPEAT PSA 22; 24, REPEAT PSA 23; 25, REPEAT PSA 24; 26, REPEAT PSA 25; 27, REPEAT PSA 26; 28, REPEAT PSA 27; 29, REPEAT PSA 28; 30, REPEAT PSA 29; 31, REPEAT PSA 30; 32, REPEAT PSA 31; 33, REPEAT PSA 32; 34, REPEAT PSA 33; 35, REPEAT PSA 34; 36, ipfam:Paramecium_SA [T]; 37, ismart:PSA [T]; 38, ismart:PSI [T].
|
ID G156_PARPR Reviewed; 2715 AA.
AC P13837;
DT 01-JAN-1990, integrated into UniProtKB/Swiss-Prot.
DT 01-JAN-1990, sequence version 1.
DT 30-NOV-2016, entry version 72.
DE RecName: Full=G surface protein, allelic form 156;
DE Flags: Precursor;
GN Name=156G;
OS Paramecium primaurelia.
OC Eukaryota; Alveolata; Ciliophora; Intramacronucleata;
OC Oligohymenophorea; Peniculida; Parameciidae; Paramecium.
OX NCBI_TaxID=5886;
RN [1]
RP NUCLEOTIDE SEQUENCE [GENOMIC DNA].
RC STRAIN=156;
RX PubMed=3783679; DOI=10.1016/0022-2836(86)90380-3;
RA Prat A., Katinka M., Caron F., Meyer E.;
RT "Nucleotide sequence of the Paramecium primaurelia G surface protein.
RT A huge protein with a highly periodic structure.";
RL J. Mol. Biol. 189:47-60(1986).
CC -!- FUNCTION: This protein is the surface antigen or immobilization
CC antigen of Paramecium primaurelia.
CC -!- SUBCELLULAR LOCATION: Cell membrane; Lipid-anchor, GPI-anchor.
CC -!- INDUCTION: Expression of G protein occurs at low temperatures (14-
CC 32 degrees Celsius).
CC -!- DOMAIN: It has internal homologies and a highly periodic structure
CC with 34 periods of about 75 residues, each period containing 8
CC cysteines, except for four half periods. A variable part of 475
CC residues comprises 4 almost identical periods in the middle of the
CC protein.
CC -!- SIMILARITY: Contains 34 PSA repeats. {ECO:0000305}.
DR EMBL; X03882; CAA27514.1; -; Genomic_DNA.
DR PIR; A23475; A23475.
DR GO; GO:0031225; C:anchored component of membrane; IEA:UniProtKB-KW.
DR GO; GO:0005886; C:plasma membrane; IEA:UniProtKB-SubCell.
DR InterPro; IPR002895; Paramecium_SA.
DR InterPro; IPR016201; PSI.
DR Pfam; PF01508; Paramecium_SA; 31.
DR SMART; SM00639; PSA; 33.
DR SMART; SM00423; PSI; 10.
PE 2: Evidence at transcript level;
KW Cell membrane; Glycoprotein; GPI-anchor; Lipoprotein; Membrane;
KW Repeat; Signal.
FT SIGNAL 1 20 {ECO:0000255}.
FT CHAIN 21 2715 G surface protein, allelic form 156.
FT /FTId=PRO_0000021310.
FT REPEAT 111 171 PSA 1.
FT REPEAT 177 237 PSA 2.
FT REPEAT 243 303 PSA 3.
FT REPEAT 309 366 PSA 4.
FT REPEAT 372 404 PSA 5.
FT REPEAT 405 467 PSA 6.
FT REPEAT 473 530 PSA 7.
FT REPEAT 536 596 PSA 8.
FT REPEAT 602 673 PSA 9.
FT REPEAT 688 748 PSA 10.
FT REPEAT 752 812 PSA 11.
FT REPEAT 820 895 PSA 12.
FT REPEAT 934 1001 PSA 13.
FT REPEAT 1008 1067 PSA 14.
FT REPEAT 1073 1141 PSA 15.
FT REPEAT 1147 1215 PSA 16.
FT REPEAT 1221 1289 PSA 17.
FT REPEAT 1295 1363 PSA 18.
FT REPEAT 1369 1437 PSA 19.
FT REPEAT 1443 1507 PSA 20.
FT REPEAT 1513 1578 PSA 21.
FT REPEAT 1586 1652 PSA 22.
FT REPEAT 1693 1751 PSA 23.
FT REPEAT 1759 1819 PSA 24.
FT REPEAT 1827 1898 PSA 25.
FT REPEAT 1904 1976 PSA 26.
FT REPEAT 1984 2044 PSA 27.
FT REPEAT 2080 2149 PSA 28.
FT REPEAT 2155 2215 PSA 29.
FT REPEAT 2219 2286 PSA 30.
FT REPEAT 2290 2355 PSA 31.
FT REPEAT 2359 2430 PSA 32.
FT REPEAT 2434 2500 PSA 33.
FT REPEAT 2505 2573 PSA 34.
CC --------------------------------------------------------------------------
CC The following FT lines are automated annotations from the MyHits database.
CC --------------------------------------------------------------------------
FT MYHIT 689 748 ipfam:Paramecium_SA [T]
FT MYHIT 1008 1067 ismart:PSA [T]
FT MYHIT 309 366 ismart:PSA [T]
FT MYHIT 752 812 ismart:PSA [T]
FT MYHIT 1222 1288 ipfam:Paramecium_SA [T]
FT MYHIT 1759 1819 ismart:PSA [T]
FT MYHIT 2155 2215 ismart:PSA [T]
FT MYHIT 935 1000 ipfam:Paramecium_SA [T]
FT MYHIT 1148 1214 ipfam:Paramecium_SA [T]
FT MYHIT 1369 1437 ismart:PSA [T]
FT MYHIT 1904 1976 ipfam:Paramecium_SA [T]
FT MYHIT 77 118 ismart:PSI [T]
FT MYHIT 688 748 ismart:PSA [T]
FT MYHIT 1221 1289 ismart:PSA [T]
FT MYHIT 1904 1976 ismart:PSA [T]
FT MYHIT 433 487 ismart:PSI [T]
FT MYHIT 136 184 ismart:PSI [T]
FT MYHIT 2220 2285 ipfam:Paramecium_SA [T]
FT MYHIT 1009 1066 ipfam:Paramecium_SA [T]
FT MYHIT 406 466 ipfam:Paramecium_SA [T]
FT MYHIT 1828 1898 ipfam:Paramecium_SA [T]
FT MYHIT 271 316 ismart:PSI [T]
FT MYHIT 1295 1363 ismart:PSA [T]
FT MYHIT 1147 1215 ismart:PSA [T]
FT MYHIT 111 171 ismart:PSA [T]
FT MYHIT 1944 1991 ismart:PSI [T]
FT MYHIT 934 1001 ismart:PSA [T]
FT MYHIT 26 71 ismart:PSI [T]
FT MYHIT 405 467 ismart:PSA [T]
FT MYHIT 473 530 ismart:PSA [T]
FT MYHIT 536 596 ipfam:Paramecium_SA [T]
FT MYHIT 2181 2233 ismart:PSI [T]
FT MYHIT 1073 1141 ismart:PSA [T]
FT MYHIT 536 596 ismart:PSA [T]
FT MYHIT 243 303 ismart:PSA [T]
FT MYHIT 1586 1652 ismart:PSA [T]
FT MYHIT 178 237 ipfam:Paramecium_SA [T]
FT MYHIT 602 673 ismart:PSA [T]
FT MYHIT 244 303 ipfam:Paramecium_SA [T]
FT MYHIT 1444 1506 ipfam:Paramecium_SA [T]
FT MYHIT 2435 2500 ipfam:Paramecium_SA [T]
FT MYHIT 2359 2430 ismart:PSA [T]
FT MYHIT 820 895 ismart:PSA [T]
FT MYHIT 564 616 ismart:PSI [T]
FT MYHIT 2081 2148 ipfam:Paramecium_SA [T]
FT MYHIT 335 386 ismart:PSI [T]
FT MYHIT 2505 2573 ismart:PSA [T]
FT MYHIT 1761 1818 ipfam:Paramecium_SA [T]
FT MYHIT 753 812 ipfam:Paramecium_SA [T]
FT MYHIT 1694 1751 ipfam:Paramecium_SA [T]
FT MYHIT 1515 1577 ipfam:Paramecium_SA [T]
FT MYHIT 1827 1898 ismart:PSA [T]
FT MYHIT 1587 1651 ipfam:Paramecium_SA [T]
FT MYHIT 1985 2044 ipfam:Paramecium_SA [T]
FT MYHIT 2219 2286 ismart:PSA [T]
FT MYHIT 1513 1578 ismart:PSA [T]
FT MYHIT 2156 2215 ipfam:Paramecium_SA [T]
FT MYHIT 474 529 ipfam:Paramecium_SA [T]
FT MYHIT 602 673 ipfam:Paramecium_SA [T]
FT MYHIT 2434 2500 ismart:PSA [T]
FT MYHIT 1443 1507 ismart:PSA [T]
FT MYHIT 1296 1362 ipfam:Paramecium_SA [T]
FT MYHIT 1984 2044 ismart:PSA [T]
FT MYHIT 821 887 ipfam:Paramecium_SA [T]
FT MYHIT 1785 1834 ismart:PSI [T]
FT MYHIT 112 170 ipfam:Paramecium_SA [T]
FT MYHIT 310 366 ipfam:Paramecium_SA [T]
FT MYHIT 2290 2355 ismart:PSA [T]
FT MYHIT 2080 2149 ismart:PSA [T]
FT MYHIT 177 237 ismart:PSA [T]
FT MYHIT 1074 1134 ipfam:Paramecium_SA [T]
FT MYHIT 1693 1751 ismart:PSA [T]
FT MYHIT 2359 2430 ipfam:Paramecium_SA [T]
FT MYHIT 1370 1436 ipfam:Paramecium_SA [T]
SQ SEQUENCE 2715 AA; 279551 MW; 97BE359AB9C7C298 CRC64;
MNNKFIIFSL LLALVASQTY SLTSCTCAQL LSEGDCIKNV SLGCSWDTTK KTCGVSTTPV
TPTVTYAAYC DTFAETDCPK AKPCTDCGNY AACAWVESKC TFFTGCTPFA KTLDSECQAI
SNRCITDGTH CVEVDACSTY KKQLPCAKNA AGSLCYWDTT NNTCVDANTC DKLPATFATD
KDCRDVISTC TTKTGGGCVD SGNNCSDQTL EIQCVWNKLK TTSCYWDGAA CKDRICDNAP
TSLTTDDACK TFRTDGTCTT KANGGCVTRT TCAAATIQAS CIKNSSGGDC YWTGTACVDK
ACANTPTTIA TNSACAGFVT GCITKSGGGC VVNGACSVAN VQAACVKNPS NFDCIWDTTC
KEKTCANAST TNNTHDLCTS YLSTCTVKSG GGCQNRTCAN APTTMTTNDA CEAYFTGNNC
ITKSGGGCVT NTTCAAITLE AACVKNSSGS TCFWDTASSS CKDKTCVNAP ATNTTHDLCQ
PFLNTCTVNS TSAGCVEKTC ENSLVLAICD KDTSSRACIW KGKCYKKQCV LASSATTTHA
DCQTYHSTCT LSNSGTGCVP LPLKCEAITI EAACNLKANG QPCGWNGSQC IDKACSTASK
TFTTTSQCTG HISTCVANNP VTVNGSLTIQ GCQDLPTTCA RRKSSENCEI TRVGFPTCLW
VSSSTSCVEK SCATASTVGT TGALSAGGFT FSGCQTYLNT CISNNTADGC IAKPSSCSSL
VSSNCRDGSK ASGDCYWNGS SCVDKTCANI IQTTHNSCNT TFNQCTVNNG GTACQTLATA
CTSYSTQENC KFTSTNKNCV WTGLACRNAT CADAPDTTAY DSDTECLAYP TPSETCTVVY
KVGAQGCVSK SANCSDYMTS AQCHKTLTNL TANDDCKWIV DRCYALSSFA TGACTTFKGT
KTMCEGYRAG CTNTVGAASS ASCTLDCTLK TGSGLTFADC QALDSTCSVK KDGTGCIAIQ
STCAGYGSTA ANCFRSSASG TAGYCAMNTN CQSVTSAAEC AFVTGLTGLD HSKCQLYHSS
CTSLKDGTGC QEYKTTCSGY AATNNCATSG QGKCFFDVEC LRFSNCASIT GTGLTTAICG
TYDAGCVANV NGTACQEKLA TCDLYLTQNS CSTSAAAATA DKCAWSGTAC LAVTTVGTHC
PYVTGTGLTD LICAAYNANC TANKAGTACQ EKKATCNLYT TEATCSTSAA AATADKCAWS
GAACLAVTTV ATECAYVTGT GLTDLICAAY NANCTANKAG TACQEKKATC NLYTTEATCS
TSAAAATADK CAWSGAACLA VTTVATECAY VTGTGLTNAI CAAYNANCTA NKAGTACQEK
KATCNLYTTE ATCSTSAAAA TADKCAWSGA ACLAVTTVAT ECAYVTGTGL TNAICAAYNA
NCTANKAGTA CQEKKATCNL YTTEATCSTS AAAATADKCA WSGAACLAVT TVATECAYVT
GTGLTKAICA TYNAGCINLK DGTGCQEAKA NCKDYTTSNK CTAQTTSTLS CLWIDNSCYP
VTDLNCSVIT GLGFVHAQCQ AYSTGCTSVS DGSKCQDFKS TCEQYPGTTL GCTKTASTKC
YLQGSACITI SNVATDCAKI TGSAGTITFE ICQSYNTGCS VNRARSACVQ QQAQCSGYTS
AMTSCYKSGA GLCIASTNTD TACVAATAAT CDAVYLGAGN YSSANCNEMK AGCTNNGTTA
CVAKTCANAA GITFNHTNCN SYLNTCTVNS GNSACQTMAS KCADQTQASC LYSVEGECVV
VGTSCVRKTC DTAATDATRD DDTECSTYQQ SCTVARLGAC QARAACATYK SSLQCKFNTS
GGKCFWNPTN KTCVDLNCGN IEATTLYDTH NECVAVDATL ACTVRATNGA AAQGCMARGA
CASYTIEEQC KTNASNGVCV WNTNANLPAP ACQDKSCTSA PTSTTTHNDC YAYYNTATVK
CTVVATPSNS GGNPTLGGCQ QTAACSSYID KEQCQINANG DPCGWNGTQC ADKSCATASA
TADYDDDTKC RAYITNKCTV SDSGQGCVEI PATCETMTQK QCYYNKAGDP CYWTGTACIT
KSCDNAPDAT ATADECNTYL AGCTLNNVKC KTKVCEDFAF ATDALCKQAI STCTTNGTNC
VTRGTCFQAL SQAGCVTSST NQQCEWIPAV LNASNVITSP AYCTIKNCST APITLTSEAA
CAGYFTNCTT KNGGGCVTKS TCSAVTIDVA CTTALNGTVC AWDSAQNKCR DKDCQDFSGT
THAACQAQRA GCTAGAGGKC ARVQNCEQTS VRAACIEGTN GPCLWIDKYQ NTDGTKGACF
RYTSCKSLNW NNDSSCKWIS NKCTTNGSNC VGITLCSETN TDGGCVTGYD GACIQSVPDL
NSSDPKVCKP YTSCADAFYT THSDCQIASS KCTTNGTTGC IALGSCSSYT VQAGCYFNDK
GTLYTSGVIT STGICTWDTT SSSCRDQSCA DLTGTTHATC SSQLSTCTSD GTTCLLKGAC
TSYTTQTACT TAVGSDGACY WELASATNNN TAKCRLLTCA DIQNGTATNV CSVALSTCVS
NGTACIPKAN CSTYTSKVAC NSGGLDGICV FTQSTATGAA AGTGTCALMT ACTVANNDQT
ACQAARDRCS WTAASGTRAT AVASKCATHT CATNQATNGA CTRFLNWDKK TQQVCTLVSG
ACTATDPSSF SSNDCFLVSG YTYTCNASTS KCGVCTAVVV QPNTTDNNTN TTDNNTTTDS
GYILGLSIVL GYLMF
//
|