Sparse Output Coding for Large-Scale Visual Recognition

Zhou, Bin; P Xing, Eric

doi:10.1184/R1/6476345.v1

file.pdf (608.59 kB)

Sparse Output Coding for Large-Scale Visual Recognition

journal contribution

posted on 2013-06-01, 00:00 authored by Bin Zhou, Eric P Xing

Many vision tasks require a multi-class classifier to discriminate multiple categories, on the order of hundreds or thousands. In this paper, we propose sparse output coding, a principled way for large-scale multi-class classification, by turning high-cardinality multi-class categorization into a bit-by-bit decoding problem. Specifically, sparse output coding is composed of two steps: efficient coding matrix learning with scalability to thousands of classes, and probabilistic decoding. Empirical results on object recognition and scene classification demonstrate the effectiveness of our proposed approach.

History

Publisher Statement

© 2013 IEEE. Personal use of this material is permitted. Permission from IEEE must be obtained for all other uses, in any current or future media, including reprinting/republishing this material for advertising or promotional purposes, creating new collective works, for resale or redistribution to servers or lists, or reuse of any copyrighted component of this work in other works.

Date

2013-06-01

Usage metrics

Keywords

Machine Learning

Licence

In Copyright

Exports

RefWorks

BibTeX

Ref. manager

Endnote

DataCite

NLM

DC

Sparse Output Coding for Large-Scale Visual Recognition

History

Publisher Statement

Date

Usage metrics

Categories

Keywords

Licence

Exports