A computer-implemented method for generating a 
longitudinal data profile from multiple disparate data sources is provided. The method includes storing, at a central 
data hub, first de-identified data received from a first 
data source, the first de-identified data including a plurality of 
data records having encrypted identifying data and an anonymous ID assigned to each 
record, wherein the anonymous ID is assigned based on a master 
list that includes a 
list of identifiers and corresponding anonymous IDs for each identifier. The method further includes storing second de-identified data received from a second 
data source, and storing third de-identified data received from a third 
data source. The method further includes 
processing the first, second, and third de-identified data to link the first, second, and third de-identified data using the anonymous ID, and generating the 
longitudinal data profile from the linked first, second, and third de-identified data.