Carnegie Mellon University
Automated analysis and reannotation of subcellular locations in c.pdf.pdf' (4.26 MB)

Automated analysis and reannotation of subcellular locations in confocal images from the Human Protein Atlas.

Download (4.26 MB)
journal contribution
posted on 2015-09-01, 00:00 authored by Jieyue Li, Justin Newberg, Mathias Uhlén, Emma Lundberg, Robert MurphyRobert Murphy

The Human Protein Atlas contains immunofluorescence images showing subcellular locations for thousands of proteins. These are currently annotated by visual inspection. In this paper, we describe automated approaches to analyze the images and their use to improve annotation. We began by training classifiers to recognize the annotated patterns. By ranking proteins according to the confidence of the classifier, we generated a list of proteins that were strong candidates for reexamination. In parallel, we applied hierarchical clustering to group proteins and identified proteins whose annotations were inconsistent with the remainder of the proteins in their cluster. These proteins were reexamined by the original annotators, and a significant fraction had their annotations changed. The results demonstrate that automated approaches can provide an important complement to visual annotation.


Publisher Statement

© The Author 2015. Published by Oxford University Press.