image source:

The MissingProteinPedia is a protein data and information sharing web system that aims to collate any relevant data ‘Missing proteins’ as defined by neXtProt. At its core is a schema-less database-driven web system allowing captures of all PE2-4 protein PubMed data, based on gene and protein including synonyms. The database also allows unpublished, preliminary or proprietary data (e.g., antibody, MS, cell biological and genetic studies) to be shared with collaborators via a protected interface.

MissingProteinPedia facilitates the Human Proteome Project (HPP) cross-disciplinary collaboration by providing a complimentary, unfiltered, lower stringency perspective to both the HPP metrics and guidelines approaches, enabling community evaluation and scrutiny. MissingProteinPedia incorporates text mining technology to fetch and search accumulated UniProt, GeneCards, GeneRifs, PubMed PE2-4 data. Besides, MissingProteinPedia summarizes publicly available MS data from PRIDE, GPMDB, ProteomicsDB and MaxQB for relevant PE2-4 proteins. It allows community administrators to curate information before web publication.

We encourage all visitors to browse and contribute to 'finding' these proteins by sharing any relevant information on them that we may have missed!


Showing 81-100 of 1,482 items.
Protein IDGene IDProtein NameChromosome IDGene NameTag(s) 
Q7Z713ANKRD37Ankyrin repeat domain-containing protein 37Chromosome-4Ankyrin repeat domain-containing protein 37
Q9BPW5RASL11BRas-like protein family member 11BChromosome-4Ras-like protein family member 11B
Q64ET8FRG2Protein FRG2Chromosome-4Protein FRG2
Q86SH2ZAR1Zygote arrest protein 1Chromosome-4Zygote arrest protein 1
Q5FYB0ARSJArylsulfatase JChromosome-4Arylsulfatase J
Q6V702C4orf22Uncharacterized protein C4orf22Chromosome-4Uncharacterized protein C4orf22
Q8WWX0ASB5Ankyrin repeat and SOCS box protein 5Chromosome-4Ankyrin repeat and SOCS box protein 5
Q3SXZ3ZNF718Zinc finger protein 718Chromosome-4Zinc finger protein 718
O95803NDST3Bifunctional heparan sulfate N-deacetylase/N-sulfotransferase 3Chromosome-4Bifunctional heparan sulfate N-deacetylase/N-sulfotransferase 3
Q9BZM3GSX2GS homeobox 2Chromosome-4GS homeobox 2
P78367NKX3-2Homeobox protein Nkx-3.2Chromosome-4Homeobox protein Nkx-3.2
A5PLN7FAM149AProtein FAM149AChromosome-4Protein FAM149A
Q7Z5S9TMEM144Transmembrane protein 144Chromosome-4Transmembrane protein 144
C9J302C4orf51Uncharacterized protein C4orf51Chromosome-4Uncharacterized protein C4orf51
Q6UXD7MFSD7Major facilitator superfamily domain-containing protein 7Chromosome-4Major facilitator superfamily domain-containing protein 7
Q8N614TMEM156Transmembrane protein 156Chromosome-4Transmembrane protein 156
Q495C1RNF212Probable E3 SUMO-protein ligase RNF212Chromosome-4Probable E3 SUMO-protein ligase RNF212

Related Publications

Publications to be cited for using MPP data and services

Islam, M.T. et al. Protannotator: A Semiautomated Pipeline for Chromosome-Wise Functional Annotation of the “Missing” Human Proteome. Journal of Proteome Research 13 (1), 76-83, doi: 10.1021/pr400794x (2014)
Islam, M.T. et al. A systematic bioinformatics approach to identify high quality MS data and functionally annotate proteins and proteomes. Methods Mol. Biol.1549, 163–176, doi: 10.1007/978-1-4939-6740-7_13 (2016)
Baker, M. S. et al. Accelerating the search for the missing proteins in the human proteome. Nat. Commun. 8, 14271 doi: 10.1038/ncomms14271 (2017).
Islam, M.T. et al. Missing ProteinPedia - under preparation.


Professor Mark S. Baker

  •  Department of Biomedical Sciences,

           Faculty of Medicine and Health Sciences,

           Level 1, 75 Talavera Rd, Macquarie University, NSW 2109, Australia

Bioinformatics, Database and Web Administration

Professor Shoba Ranganathan

  •  Department of Biomedical Sciences,

           Department of Chemistry and Biomolecular Science,

           Building F7B Room 121, Macquarie University, NSW 2109, Australia

Write to us