A framework for employing longitudinally collected multicenter electronic health records to stratify heterogeneous patient populations on disease history