Repository hosted by TU Delft Library


Longitudinal navigation log data on a large web domain

Publication files: not online

Author: Verberne, S. · Arends, B. · Kraaij, W. · Vries, A. de
Publisher: Association for Computing Machinery, Inc
Source: 39th International ACM SIGIR Conference on Research and Development in Information Retrieval, SIGIR 2016, 17 July 2016 through 21 July 2016, 697-700
Identifier: 546220
doi: 10.1145/2911451.2914667
ISBN: 9781450342902
Keywords: Access logs · Data collection · Graph clustering · Link analysis · Navigation behavior · Information retrieval · Navigation · Websites · World Wide Web · Behavioral research · ICT · DSC - Data Science · TS - Technical Sciences


We have collected the access logs for our university's web domain over a time span of 4.5 years. We now release the preprocessed data of a 3-month period for research into user navigation behavior. We preprocessed the data so that only successful GET requests of web pages by non-bot users are kept. The resulting 3-month collection comprises 9.6M page visits (190K unique URLs) by 744K unique visitors. © 2016 ACM.
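
The filtering step described in the abstract (keep only successful GET requests of web pages by non-bot users) could be sketched roughly as follows. This is an illustration under stated assumptions, not the authors' actual pipeline: the record does not specify the log format, so the sketch assumes Apache/NCSA Combined Log Format, treats any 2xx status as "successful", excludes a hypothetical list of static-asset suffixes to approximate "web pages", and detects bots by hypothetical substrings in the user-agent string. The names `LOG_PATTERN`, `BOT_MARKERS`, `ASSET_SUFFIXES`, and `keep_request` are invented for this example.

```python
import re

# Assumption: lines follow the Apache/NCSA Combined Log Format.
LOG_PATTERN = re.compile(
    r'^(?P<host>\S+) \S+ \S+ \[(?P<time>[^\]]+)\] '
    r'"(?P<method>\S+) (?P<path>\S+)(?: \S+)?" '
    r'(?P<status>\d{3}) \S+'
    r'(?: "(?P<referrer>[^"]*)" "(?P<agent>[^"]*)")?'
)

# Assumption: hypothetical heuristics, not the criteria used for the released data.
BOT_MARKERS = ("bot", "crawler", "spider", "slurp")
ASSET_SUFFIXES = (".css", ".js", ".png", ".jpg", ".jpeg", ".gif", ".ico", ".svg")


def keep_request(line):
    """Return True if the log line looks like a successful GET of a page by a non-bot."""
    match = LOG_PATTERN.match(line)
    if match is None:
        return False  # unparseable lines are dropped
    if match.group("method") != "GET":
        return False
    if not match.group("status").startswith("2"):
        return False  # assumption: "successful" means a 2xx response
    # Ignore the query string when checking for static-asset suffixes.
    path = match.group("path").split("?", 1)[0].lower()
    if path.endswith(ASSET_SUFFIXES):
        return False
    agent = (match.group("agent") or "").lower()
    return not any(marker in agent for marker in BOT_MARKERS)


def filter_log(lines):
    """Yield only the lines that pass keep_request."""
    return (line for line in lines if keep_request(line))
```

Applied to an iterable of raw log lines, `filter_log` yields the retained requests lazily, so a multi-year log can be streamed from disk without loading it into memory. Real bot detection usually needs more than user-agent matching, so the marker list above should be read as a placeholder.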