Automated analysis and reannotation of subcellular locations in confocal images from the Human Protein Atlas.

Li J, Newberg JY, Uhlén M, Lundberg E, Murphy RF

PLoS ONE 7 (11) e50514 [2012-11-30; online 2012-11-30]

The Human Protein Atlas contains immunofluorescence images showing subcellular locations for thousands of proteins. These are currently annotated by visual inspection. In this paper, we describe automated approaches to analyze the images and their use to improve annotation. We began by training classifiers to recognize the annotated patterns. By ranking proteins according to the confidence of the classifier, we generated a list of proteins that were strong candidates for reexamination. In parallel, we applied hierarchical clustering to group proteins and identified proteins whose annotations were inconsistent with the remainder of the proteins in their cluster. These proteins were reexamined by the original annotators, and a significant fraction had their annotations changed. The results demonstrate that automated approaches can provide an important complement to visual annotation.

Affiliated researcher

PubMed 23226299

DOI 10.1371/journal.pone.0050514

Crossref 10.1371/journal.pone.0050514

pii: PONE-D-12-11801
pmc: PMC3511558

Publications 9.5.0