Data Resources

We strongly encourage you consult with PHHP IT and UF IRB regarding any required policies and procedures prior to accessing these data resources.

All of Us Research Program

The National Institutes of Health’s (NIH) All of Us Research Program is building one of the largest biomedical data resources of its kind. The All of Us Research Hub stores health data from participants from over 1 million individuals across the United States. UF Contact: Zoe J Martusewicz, PHHP Department of Epidemiology

American Community Survey

The American Community Survey (ACS) is an ongoing survey that provides vital information on a yearly basis about our nation and its people. Information from the survey generates data that help inform how trillions of dollars in federal funds are distributed each year.

Behavioral Risk Factor Surveillance System

BRFSS is the nation’s premier system of health-related telephone surveys that collect state data about U.S. residents regarding their health-related risk behaviors, chronic health conditions, and use of preventive services. BRFSS collects data in all 50 states as well as the District of Columbia and three U.S. territories. BRFSS completes more than 400,000 adult interviews each year, making it the largest continuously conducted health survey system in the world.

Cancer Registry SEER

SEER began collecting data on cancer cases on January 1, 1973, in the states of Connecticut, Iowa, New Mexico, Utah, and Hawaii and the metropolitan areas of Detroit and San Francisco-Oakland. Since then, the SEER Program has been expanded to cover numerous additional areas.

ClinicalTrials.gov

ClinicalTrials.gov is a website and online database of clinical research studies and information about their results. The National Library of Medicine (NLM) maintains the website. The study sponsor or investigator submits information about their study to ClinicalTrials.gov and is responsible for the safety, science, and accuracy of any study the list.

Consumer Assessment of Healthcare Providers and Systems (CAHPS®) Database

The CAHPS Databases are AHRQ’s data repositories for selected CAHPS surveys. The primary purpose of the CAHPS Databases is to facilitate comparisons of CAHPS survey results by survey users. This compilation of survey results voluntarily submitted by a large pool of survey users enables healthcare organizations to compare their own results to aggregated data.

Curated Cancer Cell Atlas (3CA)

An online platform that compiles scRNA-seq data from global cancer research, aimed at mapping the diversity of cancer cell types and their microenvironments. Highlighted by its contribution to understanding tumor biology through meta-programs (Gavish et al. 2023, Nature), 3CA serves as a key resource for researchers exploring the cellular complexity of cancer, supporting advancements in targeted therapies and precision medicine.

Florida RDC

FLRDC provide secure environments supporting qualified researchers using restricted-access data while protecting respondent confidentiality. These restricted-access data come from censuses and surveys of businesses and households, linked employer-employee data, and administrative records from federal and state agencies and other sources.

Healthcare Cost and Utilization Project (HCUP)

HCUP’s Nationwide databases can be used to identify, track, and analyze national trends in healthcare utilization, access, charges, quality, and outcomes. There are state and national databases for inpatients, ambulatory survery and services, and emergency departments. There are national databases for pediatric inpatient and overall readmissions data.

Household Pulse Survey

In January 2025, the Household Pulse Survey (HPS) officially relaunched as an integral part of the Household Trends and Outlook Pulse Survey (HTOPS). In the longitudinal design of HTOPS, the survey content alternates every other month between HPS-focused content, which address timely issues affecting U.S. households, and content submitted by internal Census partners and stakeholders, allowing for flexible, targeted exploration of emergent topics.

Medical Expenditure Panel Survey

The Medical Expenditure Panel Survey, which began in 1996, is a set of large-scale surveys of families and individuals, their medical providers (doctors, hospitals, pharmacies, etc.), and employers across the United States. MEPS collects data on the specific health services that Americans use, how frequently they use them, the cost of these services, and how they are paid for, as well as data on the cost, scope, and breadth of health insurance held by and available to U.S. workers.

Merative™ MarketScan® Research Databases

The Merative™ Marketscan® Research Databases contain individual-level, de-identified healthcare claims information from employers, health plans, and Medicaid programs. UF Contact: Dr. Almut Winterstein, College of Pharmacy

Medicare Beneficiaries Survey

The Medicare Current Beneficiary Survey (MCBS) was implemented in 1991 to serve as a source of information for administering the Medicare program, estimating health care expenditures for beneficiaries, and providing a better understanding of the health and well-being of the Medicare population.

Medicare/Medicaid Claims Data

This site gives you direct access to public data released by the Centers for Medicare & Medicaid Services (CMS). 

MIMIC-IV

The Medical Information Mart for Intensive Care (MIMIC)-III database provided critical care data for over 40,000 patients admitted to intensive care units at the Beth Israel Deaconess Medical Center (BIDMC)

National Drug Early Warning System (NDEWS)

NDEWS incorporates real-time surveillance to detect early signals of potential drug epidemics. Ongoing data collection provides an integrated and comprehensive characterization of drug use and availability by synthesizing traditional, indirect sources with new, direct sources of data, as well as on-the-ground epidemiologic investigations within high-priority areas of concern.  The data is based off rapid street reporting surveys, online web monitoring, real time 911 dispatch data. Persons who would be interested in using the data should contact NDEWS.

National Health and Nutrition Examination Survey

The National Health and Nutrition Examination Survey (NHANES) collects data about the health of adults and children in the United States. They also collect data about what participants eat, drink, and take as supplements to determine how many nutrients are in their diet. These dietary interviews and blood tests help us measure the nutritional status of U.S. adults and children.

National Health Interview Survey

The National Health Interview Survey (NHIS) monitors the health of people across the country—the U.S. population—by collecting and analyzing data on a broad range of health topics. NHIS focuses on the health of U.S. children and adults.

CDC’s National Center for Health Statistics (NCHS) conducts NHIS. It is the nation’s largest and oldest national health survey. NHIS began collecting data in 1957.

National Longitudinal Survey of Adolescent to Adult Health

The National Longitudinal Study of Adolescent to Adult Health (Add Health) is a longitudinal study of a nationally representative sample of over 20,000 adolescents who were in grades 7-12 during the 1994-95 school year, and have been followed for five waves to date, most recently in 2016-18.

National Survey of Drug Use and Health

The National Survey on Drug Use and Health (NSDUH), conducted annually by the Substance Abuse and Mental Health Services Administration (SAMHSA), provides nationally representative data on the use of tobacco, alcohol, and drugs; substance use disorders; mental health issues; and receipt of substance use and mental health treatment among the civilian, noninstitutionalized population aged 12 or older in the United States.

OneFlorida+

The OneFlorida+ Clinical Research Network is a collaboration among researchers, clinicians and patients in Florida, Georgia, Alabama, Arkansas, California and Minnesota along with six affiliated health systems and practices to create an enduring infrastructure for a wide range of health research, including pragmatic clinical trials, comparative effectiveness research, implementation science studies, observational research, and cohort discovery.

PCORnet

PCORnet is a national resource, funded by PCORI, that enables insights from high quality health data, patient partnership, and research expertise deliver fast, trustworthy answers that advance health outcomes.

Surveys on Patient Safety Culture™ (SOPS™)

SOPS surveys enable healthcare organizations to assess how their providers and staff perceive various aspects of patient safety culture in five different settings: hospitals, medical offices, nursing homes, community pharmacies, and ambulatory surgery centers.

Survey of Women Across the Nation

The SWAN provides access to longitudinal data describing the physical, biological, psychological, and social changes that occur during the menopausal transition. Data collected from 3,302 SWAN participants from baseline through the 10th Annual Follow-Up visit are currently available to the public.

Trans-Omics for Precision Medicine

A comprehensive research program by the National Institutes of Health (NIH) aimed at enhancing our understanding of common and rare diseases through deep genomic and other “omic” data integration.

UK Biobank

The UK Biobank is a large-scale biomedical database and research resource, containing in-depth genetic and health information from half a million UK participants. The project aims to enable research into the prevention, diagnosis, and treatment of a wide range of serious and life-threatening illnesses. It includes EHR data sourced from the National Health Service (NHS), among other types of data.

UF Health Epic

Medical record data that is aggregated from throughout the UF Health System.

UF HealthStreet

As a national model for community engagement and translational research, HealthStreet data can be utilized for community health needs assessments, hot-spotting analyses, preliminary data for grants, and cohort identification. Our registry includes people across the State of Florida. UF Contact, Dr. Rober Baez, Department of Epidemiology.

UF Integrated Data Repository

The Integrated Data Repository is a large-scale database used to collect, store and report on information from across UF Health’s clinical and research enterprises. The IDR facilitates new research discoveries and enables patient care quality and safety. UF contact, Dr. Mattia Prosperi, Associate Dean for Artificial Intelligence.