TY - JOUR
T1 - Privacy-Preserving K-Means Clustering for Vehicular Driving Behavior Analysis
AU - Chang, Yuan
AU - Li, Yinuo
AU - Luan, Tom H.
AU - Wang, Yuntao
AU - Zheng, Jinkai
AU - Su, Zhou
N1 - Publisher Copyright:
© 2013 IEEE.
PY - 2026
Y1 - 2026
N2 - The rapid development of intelligent transportation systems has led to the generation of massive vehicular data from onboard sensors, GPS devices, and driving logs, paving the way for advanced driver behavior profiling. To extract insights from such data, clustering techniques, particularly k-means, are widely employed. However, traditional centralized k-means clustering for driving behavior analysis poses potential privacy leakage, as raw features are transmitted to the cloud server in plaintext. Existing privacy-preserving k-means schemes either rely on interactive homomorphic encryption protocols, which incur significant communication overhead, or adopt differential privacy, which reduces accuracy due to noise injection. To address these challenges, we propose PPKC, a non-interactive privacy-preserving k-means clustering framework tailored for vehicular driving behavior analysis. PPKC leverages an improved Paillier cryptosystem with a three-modulus construction, enabling secure Euclidean distance computation and cluster center updates directly over ciphertexts without decryption or repeated interaction with data owners. Each vehicle only uploads a single round of encrypted feature vectors, after which the cloud server can independently execute the complete k-means clustering process. We rigorously analyze the security of PPKC, proving its semantic security and indistinguishability under the IND-CCA assumption with unforgeable digital signatures. Extensive experiments on the highD dataset demonstrate that PPKC achieves high clustering accuracy with significantly lower communication and computation overhead than existing protocols.
AB - The rapid development of intelligent transportation systems has led to the generation of massive vehicular data from onboard sensors, GPS devices, and driving logs, paving the way for advanced driver behavior profiling. To extract insights from such data, clustering techniques, particularly k-means, are widely employed. However, traditional centralized k-means clustering for driving behavior analysis poses potential privacy leakage, as raw features are transmitted to the cloud server in plaintext. Existing privacy-preserving k-means schemes either rely on interactive homomorphic encryption protocols, which incur significant communication overhead, or adopt differential privacy, which reduces accuracy due to noise injection. To address these challenges, we propose PPKC, a non-interactive privacy-preserving k-means clustering framework tailored for vehicular driving behavior analysis. PPKC leverages an improved Paillier cryptosystem with a three-modulus construction, enabling secure Euclidean distance computation and cluster center updates directly over ciphertexts without decryption or repeated interaction with data owners. Each vehicle only uploads a single round of encrypted feature vectors, after which the cloud server can independently execute the complete k-means clustering process. We rigorously analyze the security of PPKC, proving its semantic security and indistinguishability under the IND-CCA assumption with unforgeable digital signatures. Extensive experiments on the highD dataset demonstrate that PPKC achieves high clustering accuracy with significantly lower communication and computation overhead than existing protocols.
KW - Privacy preservation
KW - homomorphic encryption
KW - k-means clustering
UR - https://www.scopus.com/pages/publications/105025750771
U2 - 10.1109/TNSE.2025.3644926
DO - 10.1109/TNSE.2025.3644926
M3 - 文章
AN - SCOPUS:105025750771
SN - 2327-4697
VL - 13
SP - 4930
EP - 4945
JO - IEEE Transactions on Network Science and Engineering
JF - IEEE Transactions on Network Science and Engineering
ER -