Improving Language-Universal Feature Extraction with Deep Maxout and Convolutional Neural Networks

Miao, Yajie; Metze, Florian

doi:10.1184/R1/6473417.v1

file.pdf (133.01 kB)

Improving Language-Universal Feature Extraction with Deep Maxout and Convolutional Neural Networks

journal contribution

posted on 2014-09-01, 00:00 authored by Yajie Miao, Florian MetzeFlorian Metze

When deployed in automated speech recognition (ASR), deep neural networks (DNNs) can be treated as a complex feature extractor plus a simple linear classifier. Previous work has investigated the utility of multilingual DNNs acting as language-universal feature extractors (LUFEs). In this paper, we explore different strategies to further improve LUFEs. First, we replace the standard sigmoid nonlinearity with the recently proposed maxout units. The resulting maxout LUFEs have the nice property of generating sparse feature representations. Second, the convolutional neural network (CNN) architecture is applied to obtain more invariant feature space. We evaluate the performance of LUFEs on a cross-language ASR task. Each of the proposed techniques results in word error rate reduction compared with the existing DNN-based LUFEs. Combining the two methods together brings additional improvement on the target language.

History

Date

2014-09-01

Usage metrics

Keywords

language-universal feature extraction deep maxout networks deep convolutional networks

Licence

In Copyright

Exports

RefWorks

BibTeX

Ref. manager

Endnote

DataCite

NLM

DC

Improving Language-Universal Feature Extraction with Deep Maxout and Convolutional Neural Networks

History

Date

Usage metrics

Categories

Keywords

Licence

Exports