Sub-document timestamping of web documents
Conference Paper
(2015)
Research Group
Web Information Systems
DOI related publication
https://doi.org/10.1145/2766462.2767803
To reference this document use:
https://resolver.tudelft.nl/uuid:5ca6556d-d233-47fc-bee9-6a00be0ca015
Publication Year
2015
Language
English
Pages (from-to)
1023-1026
ISBN (print)
978-1-4503-3621-5
Abstract
Knowledge about a (Web) document's creation time has been shown to be an important factor in various temporal information retrieval settings. Commonly, it is assumed that such documents were created at a single point in time. While this assumption may hold for news articles and similar document types, it is a clear oversimplification for general Web documents. In this paper, we investigate to what extent (i) this simplifying assumption is violated for a corpus of Web documents, and (ii) it is possible to accurately estimate the creation time of individual Web documents' components (so-called sub-documents).
Metadata only record. There are no files for this record.