Identification of user application by an external eavesdropper using machine learning analysis on network traffic
Document Type
Conference Proceeding
Publication Date
5-1-2019
Abstract
An eavesdropper may infer the computer applications a person uses by collecting and analyzing the network traffic they generate. Such inference may be performed despite applying encryption on the generated packets. In this paper, we investigate the extent of the ability of several machine learning algorithms to perform this privacy breach on the network traffic generated by a user. We measure their accuracy in identifying different applications by analyzing several statistical properties of the generated traffic rather than looking into the encrypted content. We compare the performance of these algorithms and select the one with higher precision; random forest. We also evaluate the application of packet padding to modify the packet length to avoid identification by machine learning algorithms. We test the effect of packet padding on the identification ability of the various machine-learning algorithms. We investigate the performance of the random forest algorithm in detail when applied to intact and padded traffic. We show that padding may decrease the efficacy of a machine-learning algorithm when used for application classification.
Identifier
85070292415 (Scopus)
ISBN
[9781728123738]
Publication Title
2019 IEEE International Conference on Communications Workshops Icc Workshops 2019 Proceedings
External Full Text Location
https://doi.org/10.1109/ICCW.2019.8756709
Grant
CNS 1641033
Fund Ref
National Science Foundation
Recommended Citation
Fathi-Kazerooni, Sina; Kaymak, Yagiz; and Rojas-Cessa, Roberto, "Identification of user application by an external eavesdropper using machine learning analysis on network traffic" (2019). Faculty Publications. 7617.
https://digitalcommons.njit.edu/fac_pubs/7617
