Semantic Anomaly Detection in Online Data Sources

Raz, Orna; Koopman, Philip; Shaw, Mary

doi:10.1184/R1/6625712.v1

File(s) stored somewhere else

http://marian.shaw-weil.com/displaypaper.php?PAPER_ID=12&header=http://spoke.compose.cs.cmu.edu/shaweb/p/srchtop.txt&trailer=http://spoke.compose.cs.cmu.edu/shaweb/p/pubsbot.txt&admin=yes

Please note: Linked content is NOT stored on Carnegie Mellon University and we can't guarantee its availability, quality, security or accept any liability.

Semantic Anomaly Detection in Online Data Sources

journal contribution

posted on 2002-01-01, 00:00 authored by Orna Raz, Philip Koopman, Mary Shaw

Much of the software we use for everyday purposes incorporates elements developed and maintained by someone other than the developer. These elements include not only code and databases but also dynamic data feeds from online data sources. Although everyday software is not mission critical, it must be dependable enough for practical use. This is limited by the dependability of the incorporated elements. It is particularly difficult to evaluate the dependability of dynamic data feeds, because they may be changed by their proprietors as they are used. Further, the specifications of these data feeds are often even sketchier than the specifications of software components. We demonstrate a method of inferring invariants about the normal behavior of dynamic data feeds. We use these invariants as proxies for specifications to perform on-going detection of anomalies in the data feed. We show the feasibility of our approach and demonstrate its usefulness for semantic anomaly detection: identifying occasions when a dynamic data feed is delivering unreasonable values, even though its behavior may be superficially acceptable (i.e., it is delivering parsable results in a timely fashion).

History

Date

2002-01-01

Usage metrics

Keywords

anomaly detection everyday dependability

Licence

In Copyright

Exports

RefWorks

BibTeX

Ref. manager

Endnote

DataCite

NLM

DC

File(s) stored somewhere else

Semantic Anomaly Detection in Online Data Sources

History

Date

Usage metrics

Categories

Keywords

Licence

Exports