Scan Detection on Very Large Networks Using Logistic Regression Modeling

Gates, Carrie; McNutt, Joshua J.; Kadane, Joseph B.; Kellner, Marc I.

doi:10.1184/R1/6586880.v1

journal contribution

Posted on 2005-01-01; authored by Carrie Gates, Joshua J. McNutt, Joseph B. Kadane, Marc I. Kellner

Scanning is common on the Internet today and represents malicious activity such as information gathering by a motivated adversary or automated tools searching for vulnerable hosts (e.g., worms). Many scan detection techniques have been developed; however, they have focused on smaller networks where packet-level information is available or where internal characteristics of the network are known. For large networks, such as those of ISPs, large corporations, or government organizations, this information might not be available. This paper presents a model of scans that can be used given only unidirectional flow data. The model uses Bayesian logistic regression and was developed using a combination of expert opinion and manually classified training data. When tested against a set of 300 TCP events, it achieved an overall detection rate of 95.5% with a false positive rate of 0.4%.
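As a rough illustration of the ideas in the abstract, the sketch below scores an event with a logistic model and computes a detection rate and a false positive rate from labelled outcomes. It is not the paper's model: the feature names (`frac_no_ack`, `log_unique_dests`), the coefficient values, and the helper names are hypothetical. The coefficients are fixed by hand, so the Bayesian estimation step is not shown.

```python
import math

# Hypothetical coefficients and feature names, chosen only for illustration;
# they are NOT taken from the paper.
COEFFS = {"intercept": -3.0, "frac_no_ack": 4.0, "log_unique_dests": 1.5}


def scan_probability(features):
    """Logistic model: p = 1 / (1 + exp(-(b0 + sum(b_i * x_i))))."""
    z = COEFFS["intercept"] + sum(COEFFS[name] * x for name, x in features.items())
    return 1.0 / (1.0 + math.exp(-z))


def rates(labels, predictions):
    """Return (detection rate, false positive rate) for two boolean sequences.

    Detection rate      = true positives / actual scans.
    False positive rate = false positives / actual non-scans.
    A rate is NaN when its denominator is zero.
    """
    tp = sum(1 for y, p in zip(labels, predictions) if y and p)
    fp = sum(1 for y, p in zip(labels, predictions) if not y and p)
    pos = sum(1 for y in labels if y)
    neg = len(labels) - pos
    detection = tp / pos if pos else float("nan")
    false_positive = fp / neg if neg else float("nan")
    return detection, false_positive


# Usage with made-up inputs: classify an event as a scan when p >= 0.5.
event = {"frac_no_ack": 1.0, "log_unique_dests": 2.0}
is_scan = scan_probability(event) >= 0.5
```

The 0.5 threshold is likewise an assumption; in practice the cut-off would be chosen to trade detection rate against false positives.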

Date

2005-01-01

Keywords

Statistics Probability

Licence

In Copyright
