Validation methodology for expert-annotated datasets