Integrated Data Repository

The UF Health Integrated Data Repository (IDR), supported by the UF CTSI, is a large-scale database that collects and organizes information from across UF Health’s clinical and research enterprises. The UF Health IDR enables new research discoveries as well as improvements in the quality and safety of patient care.

The IDR consists of a secure, clinical data warehouse that aggregates data from the university’s various clinical and administrative information systems, including the EpicCare electronic health record system. As of August 2018, the IDR contains more than 1 billion observational facts pertaining to more than 1 million patients. The IDR also contains data about biospecimen availability through the CTSI Biorepository and patient consent for research contact through UF Consent2Share. IDR data is refreshed monthly.

Accessing IDR Data for Research

A simple application of an IDR is cohort identification: the ability to ask and answer straightforward questions regarding patient populations and occurrences of incidents over time. How many patients are in our catchment area that have particular ailments? How many have particular complications? How many have tissue in our biorepository?

Researchers can access IDR data in two ways:

  • Through the NIH-funded i2b2 tool, which provides researchers access to a HIPAA-compliant and IRB-approved “Limited Data Set.” Faculty researchers can query the i2b2 Limited Data Set to identify cohort counts as they prepare grant proposals, plan clinical trials, and write IRB protocols.
  • By applying for IRB approval to obtain more detailed data from the IDR for further research analysis.

Visit the IDR Services page for an overview, training and login.

Learn More

Related News

Project Background

Goals and Objectives

  1. Create a scalable infrastructure for the integrated data repository
  2. Create data flows from Epic and laboratory systems to the integrated data repository
  3. Create access control and governance processes for the integrated data repository
  4. Incorporate UF Consent2Share processes to track patient consent for UF to store excess tissue from their health-care visits for research purposes and/or for UF researchers to contact them about future studies
  5. Demonstrate cohort identification from the new integrated data repository


Supported by the UF CTSI and Shands HealthCare, the Integrated Data Repository is a joint effort of the colleges, the major institutes, and UF Health IT. Gigi Lipori is the project lead. For additional information, please contact Gigi Lipori; or Kari Cassel, UF Health CIO. Recombinant Data, a company focused on IDR work in academic health settings, was engaged to implement a proof of concept based on the i2b2 platform.


UF Health will have a new capability for cohort identification, providing counts of patients in clinics and outpatient settings with specified attributes. The project will create new data governance processes, data management processes, access control and consent processes.


The project kick-off meeting was held March 1, 2011. The proof of concept was operational in June 2011, with an initial focus on cohort identification (see the UF CTSI’s 2011 Cohort Identification RFA and Awardees). Researchers can now register for i2b2 access to query IDR data for cohort discovery.