TY - GEN
T1 - A privacy-preserving data obfuscation scheme used in data statistics and data mining
AU - Yang, Pan
AU - Gui, Xiaolin
AU - Tian, Feng
AU - Yao, Jing
AU - Lin, Jiancai
PY - 2014
Y1 - 2014
N2 - Many applications are benefited from data sharing, especially data statistics and data mining. But as the shared data may contain private information of data owner, it has a high risk of revealing data owner's privacy. Data obfuscation is proposed to gain a balance between data privacy and data usability. But it is hard for the present obfuscation schemes to remain the usability of data in a fine-grained level. Besides, the original data can't be retrieved from the obfuscated data. To address the above issues, we proposed a data obfuscation scheme that adds an accurate 'noise' to the original data to protect the privacy while keeping the numeral characteristics of data unchanged in different levels. Besides, the scheme can also lower the impact on data mining. Furthermore, by allocating different keys to users, different users have different permissions to access to data. To achieve this, our scheme comes in four steps. Firstly, an improved cloud model is proposed to generate an accurate 'noise'. Next, an obfuscation algorithm is propose to add noise to the original data. Then, an initial scheme for dataset obfuscation is proposed, including the grouping and key allocating processes. In the final step, a fine-grained grouping scheme based on similarity is proposed. The experiments show that our scheme obfuscates date correctly, efficiently, and securely.
AB - Many applications are benefited from data sharing, especially data statistics and data mining. But as the shared data may contain private information of data owner, it has a high risk of revealing data owner's privacy. Data obfuscation is proposed to gain a balance between data privacy and data usability. But it is hard for the present obfuscation schemes to remain the usability of data in a fine-grained level. Besides, the original data can't be retrieved from the obfuscated data. To address the above issues, we proposed a data obfuscation scheme that adds an accurate 'noise' to the original data to protect the privacy while keeping the numeral characteristics of data unchanged in different levels. Besides, the scheme can also lower the impact on data mining. Furthermore, by allocating different keys to users, different users have different permissions to access to data. To achieve this, our scheme comes in four steps. Firstly, an improved cloud model is proposed to generate an accurate 'noise'. Next, an obfuscation algorithm is propose to add noise to the original data. Then, an initial scheme for dataset obfuscation is proposed, including the grouping and key allocating processes. In the final step, a fine-grained grouping scheme based on similarity is proposed. The experiments show that our scheme obfuscates date correctly, efficiently, and securely.
KW - data mining
KW - numeral characteristics
KW - obfuscation
KW - privacy-preserving
UR - https://www.scopus.com/pages/publications/84903997752
U2 - 10.1109/HPCC.and.EUC.2013.126
DO - 10.1109/HPCC.and.EUC.2013.126
M3 - 会议稿件
AN - SCOPUS:84903997752
SN - 9780769550886
T3 - Proceedings - 2013 IEEE International Conference on High Performance Computing and Communications, HPCC 2013 and 2013 IEEE International Conference on Embedded and Ubiquitous Computing, EUC 2013
SP - 881
EP - 887
BT - Proceedings - 2013 IEEE International Conference on High Performance Computing and Communications, HPCC 2013 and 2013 IEEE International Conference on Embedded and Ubiquitous Computing, EUC 2013
PB - IEEE Computer Society
T2 - 15th IEEE International Conference on High Performance Computing and Communications, HPCC 2013 and 11th IEEE/IFIP International Conference on Embedded and Ubiquitous Computing, EUC 2013
Y2 - 13 November 2013 through 15 November 2013
ER -