Brown ArrowData Literacy & Management

The information on this page is intended to serve as an introduction to the topics of data science, data literacy, data management, data sharing, and research reproducibility. Though the emphasis is on health data, information from the broad data science community is included.



  • CDISC SHARE Details

    The CDISC SHARE API is a RESTful web service that allows end users to programmatically retrieve CDISC standards' metadata from CDISC SHARE to support process automation.

  • CTS-Personas Details

    An effort to create Persona profiles representing roles across the ecosystem of translational research: Basic Research, Pre-Clinical Research, Clinical Research, Clinical Implementation,Public Health. These profiles are intended for use for the CTSA community and beyond, to assist those developing software projects, educational and communication materials, and more.

  • Collecting and Using Cost Data Details

    Aligning Forces for Quality has developed this module which outlines their experiences in collecting and reporting cost data as a means of reducing health care costs.

  • ColorBrewer Details

    Online tool designed to help select good color schemes for maps and other graphics.

  • Data Collection Standards for Race, Ethnicity, Sex, Primary Language, and Disability Status Details

    Data collection standards for measures of race, ethnicity, sex, primary language, and disability status that are to be used in all national population health surveys.

  • Data Management & Curation Details

    Information on data quality, access, curation, confidentiality, citation, as well as links to tools and services for data management and curation.

  • EHR Toolkit: Toolkit for Planning an EHR-based Surveillance Program Details

    This toolkit is a set of field-tested tools designed to support planning for a public health surveillance program that will rely on data from EHR systems.

  • HRET Disparities Toolkit Details

    The Toolkit is a Web-based tool that provides hospitals, health systems, clinics, and health plans information and resources for systematically collecting race, ethnicity, and primary language data from patients.

  • How to Engage Your Community with Health Data: Hosting a 500 Cities Event (2017) Details

    This guide encourages community stakeholders to organize events around the 500 Cities data, focusing on engaging audiences, learning about the data, and generating ideas on how to use the data to advance health.

  • Improving and Using Electronic Health Records (EHRs) Data for Quality Improvement (QI) Details

    This learning guide explains how to improve electronic health record (EHR) data quality to stimulate practice quality improvement

  • MIDAS Models Details

    Computational and mathematical modeling of public health problems is a complex endeavor. The site provides a set of model profiles to aid in the process of model description, structure, assumptions and comparison.

  • Magpi Details

    Magpi is a free, cloud-based service enabling anyone to collect data on a wide variety of mobile phones and tablets, then upload and analyze the data in real-time.

  • Making Data Talk: A Workbook (2011) Details

    This workbook outlines communication concepts, a framework for communicating data, and the application of that framework to actual public health situations.

  • MeasureUp Details

    The Building Health Places Network's website contains tools aimed at measuring programs' impact on families and communities and on factors related to health.

  • OpenEpi: Open Source Epidemiologic Statistics for Public Health Details

    Open source software for epidemiologic statistics. It provides a number of epidemiologic and statistical tools for summary data.

  • OpenRefine Details

    A freely available, open-source tool for working with messy data.

  • Research Data Assistance Center (ResDAC) Details

    ResDAC provides free assistance to academic, government and non-profit researchers interested in using Medicare and/or Medicaid data for their research.

  • Spatialepidemiology.net Details

    Provides a map-based interface for the display and analysis of infectious disease epidemiological data, including molecular data, utilizing Google Maps and Google Earth.

  • State Health Practice Database for Research (SHPDR) Details

    SHPDR captures cross-sectional and longitudinal variation in states' statutes and laws to enable researchers to perform clinically oriented health economics research, and investigate the diffusion of medical technology and other health services research outcomes of interest.

  • TranStat Details

    The TranStat tool enables field personnel and researchers to enter and revise data from local outbreaks and to test for the presence of human-to-human (or animal-to-animal) transmission.

  • Visualizing Health Details

    This site contains examples of tested and evaluated graphic displays of health information. The health visualizations include graphs, charts, and images that effectively communicate risk information and help make sense of health data.

  • Weave: Web-based Analysis and Visualization Environment Details

    Web-based visualization platform that enables the visualization of available data as well as the ability to integrate, analyze and visualize data at nested levels of geography, and to disseminate the results in a web page.


  • DataScience@NIH Blog Details

    It is intended to serve as a news and discussion vehicle for the overall community and is an opportunity for the NIH Office of the Associate Director for Data Science to present timely information and receive important feedback.

  • HealthData.gov Blog Details

    Updates and commentary from the Department of Health and Human Services.