Carnegie Mellon University
Browse

MapReduce for Bayesian Network Parameter Learning using the EM Algorithm

Download (237.16 kB)
journal contribution
posted on 2012-12-01, 00:00 authored by Aniruddha BasakAniruddha Basak, Irina Brinster, Ole J Mengshoel
This work applies the distributed computing framework MapReduce to Bayesian network parameter learning from incomplete data. We formulate the classical Expectation Maximization (EM) algorithm within the MapReduce framework. Analytically and experimentally we analyze the speed-up that can be obtained by means of MapReduce. We present details of the MapReduce formulation of EM, report speed-ups versus the sequential case, and carefully compare various Hadoop cluster configurations in experiments with Bayesian networks of different sizes and structures.

History

Date

2012-12-01