Sub-document timestamping of web documents

Conference Paper (2015)
Research Group
Web Information Systems
DOI related publication
https://doi.org/10.1145/2766462.2767803
More Info
expand_more
Publication Year
2015
Language
English
Research Group
Web Information Systems
Pages (from-to)
1023-1026
ISBN (print)
978-1-4503-3621-5

Abstract

Knowledge about a (Web) document's creation time has been shown to be an important factor in various temporal information retrieval settings. Commonly, it is assumed that such documents were created at a single point in time. While this assumption may hold for news articles and similar document types, it is a clear oversimplification for general Web documents. In this paper, we investigate to what extent (i) this simplifying assumption is violated for a corpus of Web documents, and, (ii) it is possible to accurately estimate the creation time of individual Web documents' components (so-called sub-documents).

No files available

Metadata only record. There are no files for this record.