C2orf16 (англ. Chromosome 2 open reading frame 16) – білок, який кодується однойменним геном, розташованим у людей на 2-й хромосомі.[3] Довжина поліпептидного ланцюга білка становить 1 984 амінокислот, а молекулярна маса — 224 321[4].
Послідовність амінокислот
10 | | 20 | | 30 | | 40 | | 50 |
MELTPGAQQQ | | GINYQELTSG | | WQDVKSMMLV | | PEPTRKFPSG | | PLLTSVRFSN |
LSPESQQQDV | | KSLEFTVEPK | | LQSVKHVKLS | | SVSLQQTIKS | | VELAPGSLPQ |
RVKYGEQTPR | | TNYQIMESSE | | LIPRPGHQFA | | KYAEMIPQPK | | YQIPKSANLI |
SIPIYHATES | | SEMAQGLAYK | | GIDTVEKSVG | | LTPKLTGRAK | | ESLGMLLQPD |
LQVPKFVDLT | | PMVRDQGSKF | | LGLTPEKSYQ | | ILETMELLSQ | | SRPRVKDVGE |
LYMKPLQQTV | | EYEGITPELK | | HYFTEAMGLT | | AEARIQANEF | | FGMTPKPTSQ |
ATGFAERSPR | | LCPQNLECVE | | VISEKRLQGE | | ESVVLIPKSL | | HHVPDSASGM |
TPGLGHRVPE | | SVELTSKSGV | | QVEKTLQLTP | | KPQHHVGSPG | | IISGLGHQVP |
ESVNLTCKQW | | LQMEESLEVP | | LKQTSQVIGH | | EESVELTSEA | | RQHREVSMGL |
TKSKNQSMKS | | PGTTPGPLGR | | IVEFMRISPE | | PLDQVTESAR | | TQLQVAQSEE |
VILIDVPKVV | | QSVKVTPGPP | | FQIVKSVTIP | | RPTPQMVEYI | | ELTPKLQYVR |
PSEHHTGPCL | | QDVKSTKLIT | | KPKHQILETV | | ELTGFQIVKT | | MLIPGPSLQI |
VKSEELAPGP | | IPQVVEPIGV | | ALESGIEAIN | | CVDLLPRPHL | | QELIVPAELT |
PSPCTQVKSA | | ELTSPQTSPF | | EEHTILTHKQ | | GLQAVKSTVI | | KTEPPKVMET |
EDLNLGHVCQ | | NRDCQKLTSE | | ELQVGTDFSR | | FLQSSSTTLI | | SSSVRTASEL |
GGLWDSGIQE | | VSRALDIKNP | | GTDILQPEET | | YIDPTMIQSL | | TFPLALHNQS |
SDKTANIVEN | | PCPEILGVDV | | ISKETTKRKQ | | MEELENSLQR | | HLPQSWRSRS |
RTFQAESGVQ | | KGLIKSFPGR | | QHNVWESHAW | | RQRLPRKYLS | | TMLMLGNILG |
TTMERKLCSQ | | TSLAERATAD | | TCQSIQNLFG | | IPAELMEPSQ | | SLPEKGPVTI |
SQPSVVKNYI | | QRHTFYHGHK | | KRMALRIWTR | | GSTSSIIQQY | | SGTRVRIKKT |
NSTFNGISQE | | VIQHMPVSCA | | GGQLPVLVKS | | ESSLSIFYDR | | EDLVPMEESE |
DSQSDSQTRI | | SESQHSLKPN | | YLSQAKTDFS | | EQFQLLEDLQ | | LKIAAKLLRS |
QIPPDVPPPL | | ASGLVLKYPI | | CLQCGRCSGL | | NCHHKLQTTS | | GPYLLIYPQL |
HLVRTPEGHG | | EVRLHLGFRL | | RIGKRSQISK | | YRERDRPVIR | | RSPISPSQRK |
AKIYTQASKS | | PTSTIDLQSG | | PSQSPAPVQV | | YIRRGQRSRP | | DLVEKTKTRA |
PGHYEFTQVH | | NLPESDSEST | | QNEKRAKVRT | | KKTSDSKYPM | | KRITKRLRKH |
RKFYTNSRTT | | IESPSRELAA | | HLRRKRIGAT | | QTSTASLKRQ | | PKKPSQPKFM |
QLLFQSLKRA | | FQTAHRVIAS | | VGRKPVDGTR | | PDNLWASKNY | | YPKQNARDYC |
LPSSIKRDKR | | SADKLTPAGS | | TIKQEDILWG | | GTVQCRSAQQ | | PRRAYSFQPR |
PLRLPKPTDS | | QSGIAFQTAS | | VGQPLRTVQK | | DSSSRSKKNF | | YRNETSSQES |
KNLSTPGTRV | | QARGRILPGS | | PVKRTWHRHL | | KDKLTHKEHN | | HPSFYRERTP |
RGPSERTRHN | | PSWRNHRSPS | | ERSQRSSLER | | RHHSPSQRSH | | CSPSRKNHSS |
PSERSWRSPS | | QRNHCSPPER | | SCHSLSERGL | | HSPSQRSHRG | | PSQRRHHSPS |
ERSHRSPSER | | SHRSSSERRH | | RSPSQRSHRG | | PSERSHCSPS | | ERRHRSPSQR |
SHRGPSERRH | | HSPSKRSHRS | | PARRSHRSPS | | ERSHHSPSER | | SHHSPSERRH |
HSPSERSHCS | | PSERSHCSPS | | ERRHRSPSER | | RHHSPSEKSH | | HSPSERSHHS |
PSERRRHSPL | | ERSRHSLLER | | SHRSPSERRS | | HRSFERSHRR | | ISERSHSPSE |
KSHLSPLERS | | RCSPSERRGH | | SSSGKTCHSP | | SERSHRSPSG | | MRQGRTSERS |
HRSSCERTRH | | SPSEMRPGRP | | SGRNHCSPSE | | RSRRSPLKEG | | LKYSFPGERP |
SHSLSRDFKN | | QTTLLGTTHK | | NPKAGQVWRP | | EATR |
- The status, quality, and expansion of the NIH full-length cDNA project: the Mammalian Gene Collection (MGC). Genome Res. 14: 2121—2127. 2004. PMID 15489334 DOI:10.1101/gr.2596504