Disclosure Risk vs. Data Utility through the R-U Confidentiality Map in Multivariate Settings

Duncan, George T.; Keller-McNulty, Sallie A.; Stokes, S. Lynne

doi:10.1184/R1/6471212.v1

File(s) stored somewhere else

http://www.heinz.cmu.edu/faculty-and-research/research/research-details/index.aspx?rid=270

Please note: Linked content is NOT stored on Carnegie Mellon University and we can't guarantee its availability, quality, security or accept any liability.

Disclosure Risk vs. Data Utility through the R-U Confidentiality Map in Multivariate Settings

journal contribution

posted on 2005-01-01, 00:00 authored by George T. Duncan, Sallie A. Keller-McNulty, S. Lynne Stokes

Information organizations, such as statistical agencies, must ensure that data access does not compromise the confidentiality afforded data providers, whether individuals or establishments. Recognizing that deidentification of data is generally inadequate to protect confidentiality against attack by a data snooper, information organizations (IOs)—such as statistical agencies, data archives, and trade associations—can implement a variety of disclosure limitation (DL) techniques—such as topcoding, noise addition and data swapping—in developing data products. Desirably, the resulting restricted data have both high data utility U to data users and low disclosure risk R from data snoopers. IOs lack a framework for examining tradeoffs between R and U under a specific DL procedure. They also lack systematic ways of comparing the performance of distinct DL procedures. To provide this framework and facilitate comparisons, the R-U confidentiality map is introduced to trace the joint impact on R and U to changes in the parameters of a DL procedure. Implementation of an R-U confidentiality map is illustrated in the case of multivariate noise addition. Analysis is provided for two important multivariate estimation problems: a data user seeks to estimate linear combinations of means and to estimate regression coefficients.

History

Date

2005-01-01

Usage metrics

Keywords

Multivariate Additive Noise Confidentiality Protection Disclosure Limitation R-U Confidentiality Map Regression

Licence

In Copyright

Exports

RefWorks

BibTeX

Ref. manager

Endnote

DataCite

NLM

DC

File(s) stored somewhere else

Disclosure Risk vs. Data Utility through the R-U Confidentiality Map in Multivariate Settings

History

Date

Usage metrics

Categories

Keywords

Licence

Exports