Request Dependency Graph: A Model for Web Usage Mining in Large-Scale Web of Things
Document Type
Article
Publication Date
8-1-2016
Abstract
In the Web of Things (WoT) environment, Web traffic logs contain valuable information about how people interact with smart devices and Web servers. Mining the wealth of information available in Web access logs has theoretical and practical significance for many important applications, such as network optimization and security management. The first critical step of the mining task is modeling the relationships among HyperText Transfer Protocol (HTTP) requests for Web objects in order to investigate the behavior of Web clients. In this paper, we introduce the request dependency graph (RDG), a graph representation of the relationships among HTTP requests. Conceptually, a directed link from A to B in the graph means that the access to Web object B is caused by the access to A, i.e., B depends on A. We propose a methodology to establish such a graph by mining the temporal and causal information among aggregated HTTP requests. To demonstrate the value and effectiveness of the proposed model, we design and implement an RDG-based algorithm for primary request identification, which is a critical task of Web usage mining. Evaluation results from a large-scale real-world Web access log show that the RDG is a useful tool for Web usage mining.
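To make the idea of a directed link from A to B concrete, the following is a minimal Python sketch of one way such a graph could be derived from an access log. It is not the algorithm from the paper, whose details the abstract does not give. Everything here is an assumption made for illustration: the log format of `(client, timestamp, url)` tuples, the function names `build_rdg` and `primary_candidates`, the fixed time `window`, the edge weight (the fraction of requests for B that were preceded by a request for A from the same client within the window), and the `threshold` used to flag a request as dependent.

```python
from collections import defaultdict


def build_rdg(log, window=2.0):
    """Build an illustrative dependency graph from (client, timestamp, url) tuples.

    Returns a dict mapping (a, b) to a weight in (0, 1]: the share of requests
    for b that followed a request for a by the same client within `window` seconds.
    This weighting is an assumption for illustration, not the paper's definition.
    """
    by_client = defaultdict(list)
    for client, ts, url in log:
        by_client[client].append((ts, url))

    pair_counts = defaultdict(int)  # (a, b) -> times b followed a within the window
    url_counts = defaultdict(int)   # b -> total number of requests for b
    for requests in by_client.values():
        requests.sort()  # temporal order per client
        for i, (ts_b, b) in enumerate(requests):
            url_counts[b] += 1
            seen = set()
            # Walk backwards over earlier requests until the window is exceeded.
            for ts_a, a in reversed(requests[:i]):
                if ts_b - ts_a > window:
                    break
                if a != b and a not in seen:
                    seen.add(a)
                    pair_counts[(a, b)] += 1

    # Aggregate over all clients into one weight per directed edge.
    return {edge: n / url_counts[edge[1]] for edge, n in pair_counts.items()}


def primary_candidates(rdg, urls, threshold=0.8):
    """Return URLs with no incoming edge at or above `threshold` (hypothetical rule)."""
    dependent = {b for (a, b), w in rdg.items() if w >= threshold}
    return sorted(set(urls) - dependent)


# Hypothetical sample log: two clients, timestamps in seconds.
log = [
    ("c1", 0.0, "/index.html"), ("c1", 0.3, "/style.css"), ("c1", 0.5, "/logo.png"),
    ("c2", 10.0, "/index.html"), ("c2", 10.2, "/style.css"),
    ("c1", 60.0, "/about.html"), ("c1", 60.1, "/style.css"),
]
rdg = build_rdg(log, window=2.0)
urls = {url for _, _, url in log}
primary = primary_candidates(rdg, urls, threshold=0.6)
```

In this sample, the stylesheet and the image are almost always requested shortly after a page, so they acquire heavy incoming edges and are treated as dependent, while the two pages remain as candidate primary requests. Both the window and the threshold would need tuning on real traffic, and a time window alone cannot separate causation from coincidence.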
Identifier
84982952604 (Scopus)
Publication Title
IEEE Internet of Things Journal
External Full Text Location
https://doi.org/10.1109/JIOT.2015.2452964
e-ISSN
2327-4662
First Page
598
Last Page
608
Issue
4
Volume
3
Recommended Citation
Liu, Jun; Fang, Cheng; and Ansari, Nirwan, "Request Dependency Graph: A Model for Web Usage Mining in Large-Scale Web of Things" (2016). Faculty Publications. 10351.
https://digitalcommons.njit.edu/fac_pubs/10351
