James Newsome, Brad Karp, Dawn Song
Defending a server against Internet worms and defending a user’s email inbox against spam bear certain similarities. In both cases, a stream of samples arrives,
and a classifier must automatically determine whether each sample falls into a malicious
target class (e.g., worm network traffic, or spam email). A learner typically generates
a classifier automatically by analyzing two labeled training pools: one of innocuous
samples, and one of samples that fall in the malicious target class.
Learning techniques have previously found success in settings where the content of
the labeled samples used in training is either random, or even constructed by a helpful
teacher, who aims to speed learning of an accurate classifier. In the case of learning classifiers for worms and spam, however, an adversary controls the content of the labeled
samples to a great extent. In this paper, we describe practical attacks against learning,
in which an adversary constructs labeled samples that, when used to train a learner, prevent or severely delay generation of an accurate classifier. We show that even a delusive
adversary, whose samples are all correctly labeled, can obstruct learning. We simulate
and implement highly effective instances of these attacks against the Polygraph [15]
automatic polymorphic worm signature generation algorithms.
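To make the threat concrete, the following Python sketch is an illustration only: it is not the Polygraph algorithms nor the attacks evaluated in the paper. It assumes a hypothetical toy "conjunction-signature" learner (a signature is the set of tokens present in every malicious training sample and absent from the innocuous pool) and shows how a delusive adversary, whose submitted samples are all genuine, correctly labeled worm instances, can pad them with spurious tokens that it later omits, so the learned signature fails to match subsequent worm traffic. All names and token values below are made up for the example.

```python
import random
import string

def learn_conjunction_signature(malicious_pool, innocuous_pool):
    """Toy learner: a signature is the set of tokens found in every
    malicious training sample but in no innocuous sample."""
    common = set(malicious_pool[0])
    for sample in malicious_pool[1:]:
        common &= set(sample)
    benign = {t for s in innocuous_pool for t in s}
    return common - benign

def matches(signature, sample):
    """The toy signature flags a sample only if ALL its tokens appear."""
    return signature <= set(sample)

# Tokens the worm genuinely needs to exploit the vulnerability
# (its invariant content); purely hypothetical values.
INVARIANT = {"GET_/vuln", "0xdeadbeef"}

# Spurious "red herring" tokens the adversary deliberately places in every
# sample it allows the defender to capture, intending to drop them later.
RED_HERRINGS = {"hdr_" + "".join(random.choices(string.ascii_lowercase, k=6))
                for _ in range(5)}

# Training pools: every malicious sample is a real worm, correctly labeled,
# but the adversary controls its non-invariant content.
training_worms = [list(INVARIANT | RED_HERRINGS) for _ in range(10)]
innocuous = [["GET_/index", "host_a"], ["GET_/about", "host_b"]]

sig = learn_conjunction_signature(training_worms, innocuous)
print(sig)  # contains the invariant tokens AND the red herrings

# A later worm instance carries only the invariant content, so the learned
# signature no longer matches it: a false negative caused entirely by
# correctly labeled training data.
later_worm = list(INVARIANT)
print(matches(sig, later_worm))  # False
```

A real learner and a real attack are of course more involved; the point of the sketch is only that when the adversary authors the training samples, even truthful labels leave room to steer what the learner generalizes.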