Privacy-Preserving Outlier Detection with High Efficiency over Distributed Datasets

May 10, 2021·
Guanghong Lu
Chunhui Duan
Chunhui Duan
Corresponding author
,
Guohao Zhou
,
Xuan Ding
,
Yunhao Liu
· 2 min read
Type
Publication
IEEE International Conference on Computer Communications, 1-10
Status
Peer-reviewed
publications

Abstract

The ability to detect outliers is crucial in data mining, with widespread usage in many fields, including fraud detection, malicious behavior monitoring, health diagnosis, etc. With the tremendous volume of data becoming more distributed than ever, global outlier detection for a group of distributed datasets is particularly desirable. In this work, we propose PIF (Privacy-preserving Isolation Forest), which can detect outliers for multiple distributed data providers with high efficiency and accuracy while giving certain security guarantees. To achieve the goal, PIF makes an innovative improvement to the traditional iForest algorithm, enabling it in distributed environments. With a series of carefully-designed algorithms, each participating party collaborates to build an ensemble of isolation trees efficiently without disclosing sensitive information of data. Besides, to deal with complicated real-world scenarios where different kinds of partitioned data are involved, we propose a comprehensive schema that can work for both horizontally and vertically partitioned data models. We have implemented our method and evaluated it with extensive experiments. It is demonstrated that PIF can achieve comparable AUC to existing iForest on average and maintains a linear time complexity without privacy violation.

Citation

G. Lu, C. Duan, G. Zhou, X. Ding, and Y. Liu, “Privacy-Preserving Outlier Detection with High Efficiency over Distributed Datasets,” in Proceedings of the IEEE International Conference on Computer Communications(INFOCOM), 2021, pp. 1-10.

中文引用(GB/T 7714)

Lu G, Duan C, Zhou G, et al. Privacy-Preserving Outlier Detection with High Efficiency over Distributed Datasets[C]//Proceedings of the IEEE International Conference on Computer Communications(INFOCOM). 2021:1-10.

Chunhui Duan
Authors
Chunhui Duan (she/her)
Associate Professor
I am Chunhui Duan (段春晖), currently an Associate Professor (tenure-track) in School of Computer Science and Technology, Beijing Institute of Technology. Previously, I worked as a postdoc research fellow at Tsinghua University. I received the B.S. and Ph.D. degrees from the School of Software at Tsinghua University, in 2013 and 2018 respectively, supervised by Prof. Yunhao Liu.

My research interests include RFID, Internet-of-Things, indoor localization, wireless sensing, etc.