Filter
Reset all

Subjects

Content Types

Countries

AID systems

API

Certificates

Data access

Data access restrictions

Database access

Database access restrictions

Database licenses

Data licenses

Data upload

Data upload restrictions

Enhanced publication

Institution responsibility type

Institution type

Keywords

Metadata standards

PID systems

Provider types

Quality management

Repository languages

Software

Syndications

Repository types

Versioning

  • * at the end of a keyword allows wildcard searches
  • " quotes can be used for searching phrases
  • + represents an AND search (default)
  • | represents an OR search
  • - represents a NOT operation
  • ( and ) implies priority
  • ~N after a word specifies the desired edit distance (fuzziness)
  • ~N after a phrase specifies the desired slop amount
Found 187 result(s)
This site is dedicated to making high value health data more accessible to entrepreneurs, researchers, and policy makers in the hopes of better health outcomes for all. In a recent article, Todd Park, United States Chief Technology Officer, captured the essence of what the Health Data Initiative is all about and why our efforts here are so important.
The GHDx is our user-friendly and searchable data catalog for global health, demographic, and other health-related datasets. It provides detailed information about datasets ranging from censuses and surveys to health records and vital statistics, globally. It also serves as a platform for data owners to share their data with the public. The GDB Compare visualization, which allows the user to see rate of change in disease incidence, globally or by country, by age or across all ages, is especially powerful as a tool. Be sure to try adding a bottom chart, like the map, to augment the treemap that loads by default in the top chart.
Project Tycho is a repository for global health, particularly disease surveillance data. Project Tycho currently includes data for 92 notifiable disease conditions in the US, and up to three dengue-related conditions for 99 countries. Project Tycho has compiled data from reputable sources such as the US Centers for Disease Control, the World Health Organization, and National health agencies for countries around the world. Project Tycho datasets are highly standardized and have rich metadata to improve access, interoperability, and reuse of global health data for research and innovation.
The Health and Retirement Study (HRS) is a longitudinal panel study that surveys a representative sample of more than 26,000 Americans over the age of 50 every two years. The study has collected information about income, work, assets, pension plans, health insurance, disability, physical health and functioning, cognitive functioning, genetic information and health care expenditures.
The Health Data Research Innovation Gateway (the ‘Gateway’) provides a common entry point to discover and enquire about access to UK health datasets for research and innovation. It provides detailed information about the datasets, which are held by members of the UK Health Data Research Alliance, such as a description, size of the population, and the legal basis for access. The Gateway includes the ability to search for research projects, publications and health data tools, such as those related to COVID-19. New interactive features provide a community forum for researchers to collaborate and connect and the ability to add research projects. The Innovation Gateway does not hold or store any datasets or patient or health data but rather acts as a portal to allow discovery of datasets and to request access to them for health research. A dataset is a collection of related individual pieces of data but in the case of health data, identifiable information (e.g. name or NHS number) is removed and data is de-identified where possible. When you access the Gateway you will not be able to view or extract the data itself. Instead, you will be able to see information that describes what the different datasets are (e.g. where the dataset has come from, a description of the dataset, the time period and the geographical areas the dataset covers).
Country
The National Population Health Data Center (NPHDC) is one of the 20 national science data center approved by the Ministry of Science and Technology and the Ministry of Finance. The Population Health Data Archive (PHDA) is developed by NPHDC relying on the Institute of Medical Information, Chinese Academy of Medical Sciences. PHDA mainly receives scientific data from science and technology projects supported by the national budget, and also collects data from other multiple sources such as medical and health institutions, research institutions and social individuals, which is oriented to the national big data strategy and the healthy China strategy. The data resources cover basic medicine, clinical medicine, public health, traditional Chinese medicine and pharmacy, pharmacy, population and reproduction. PHDA supports data collection, archiving, processing, storage, curation, verification, certification and release in the field of population health. Provide multiple types of data sharing and application services for different hierarchy users and help them find, access, interoperate and reuse the data in a safe and controlled environment. PHDA provides important support for promoting the open sharing of scientific data of population health and domestic and foreign cooperation.
The CDHA assists researchers to create, document, and distribute public use microdata on health and aging for secondary analysis. Major research themes include: midlife development and aging; economics of population aging; inequalities in health and aging; international comparative studies of health and aging; and the investigation of linkages between social-demographic and biomedical research in population aging. The CDHA is one of fourteen demography centers on aging sponsored by the National Institute on Aging.
The Health and Medical Care Archive (HMCA) is the data archive of the Robert Wood Johnson Foundation (RWJF), the largest philanthropy devoted exclusively to health and health care in the United States. Operated by the Inter-university Consortium for Political and Social Research (ICPSR) at the University of Michigan, HMCA preserves and disseminates data collected by selected research projects funded by the Foundation and facilitates secondary analyses of the data. Our goal is to increase understanding of health and health care in the United States through secondary analysis of RWJF-supported data collections
diversitydata.org is an online tool for exploring quality of life data across metropolitan areas for people of different racial/ethnic groups in the United States. It provides values and rankings for the largest U.S. metropolitan areas on different indicators in 8 areas of life (domains), including demographics, education, economic opportunity, housing, neighborhoods, and health. It also provides a simple mapping utility, showing the range of indicator values for metros across the U.S. Data from 1999 indicators is archives in the companion Diversity Data Archive (https://diversitydata-archive.org/). For a wider selection of data on child wellbeing, visit our partner site, diversitydatakids.org (https://www.diversitydatakids.org/). diversitydata.org has been named a Health Data All Star by the Health Data Consortium. The list was compiled in consultation with leading health researchers, government officials, entrepreneurs, advocates and others to identify the health data resources that matter most.
CPES provides access to information that relates to mental disorders among the general population. Its primary goal is to collect data about the prevalence of mental disorders and their treatments in adult populations in the United States. It also allows for research related to cultural and ethnic influences on mental health. CPES combines the data collected in three different nationally representative surveys (National Comorbidity Survey Replication, National Survey of American Life, National Latino and Asian American Study).
Synapse is an open source software platform that clinical and biological data scientists can use to carry out, track, and communicate their research in real time. Synapse enables co-location of scientific content (data, code, results) and narrative descriptions of that work.
PSI is a global health organization dedicated to improving the health of people in the developing world by focusing on serious challenges like a lack of family planning, HIV and AIDS, barriers to maternal health, and the greatest threats to children under five, including malaria, diarrhea, pneumonia and malnutrition. A hallmark of PSI is a commitment to the principle that health services and products are most effective when they are accompanied by robust communications and distribution efforts that help ensure wide acceptance and proper use. PSI works in partnership with local governments, ministries of health and local organizations to create health solutions that are built to last. We use original data to monitor and evaluate our programs, generate consumer insight, estimate the impact of our solutions, and evaluate the health of the markets we work to strengthen.
Neuroimaging Tools and Resources Collaboratory (NITRC) is currently a free one-stop-shop environment for science researchers that need resources such as neuroimaging analysis software, publicly available data sets, and computing power. Since its debut in 2007, NITRC has helped the neuroscience community to use software and data produced from research that, before NITRC, was routinely lost or disregarded, to make further discoveries. NITRC provides free access to data and enables pay-per-use cloud-based access to unlimited computing power, enabling worldwide scientific collaboration with minimal startup and cost. With NITRC and its components—the Resources Registry (NITRC-R), Image Repository (NITRC-IR), and Computational Environment (NITRC-CE)—a researcher can obtain pilot or proof-of-concept data to validate a hypothesis for a few dollars.
Country
Policy-relevant observational studies for population health equity and responsible development. High-quality statistical information adult and children's health from the UN's Demographic and Health Surveys (DHS) program and UNICEF's Multiple Indicator Cluster Surveys (MICS). These datasets contain longitudinal information dating back to 1995 or 1999 for a series of social policies in up to 193 UN countries. DHS data variables include fertility, family planning and nutritional status for women aged 15-49 and young children, as well as demographic information on household structure, employment, education, wealth, and place of residence. MICS data includes information on nutritional status and child mortality, medical care during the antenatal and postnatal periods, and sibling maternal mortality, among others.
Gemma is a database for the meta-analysis, re-use and sharing of genomics data, currently primarily targeted at the analysis of gene expression profiles. Gemma contains data from thousands of public studies, referencing thousands of published papers. Users can search, access and visualize co-expression and differential expression results.
The Survey of Health, Ageing and Retirement in Europe (SHARE) is a multidisciplinary and cross-national panel database of micro data on health, socio-economic status and social and family networks of more than 140,000 individuals (approximately 530,000 interviews) aged 50 or over from 28 European countries and Israel.
<<<!!!<<< The repository is no longer available - Data previously on the site are now available at ftp://ftp.ncbi.nlm.nih.gov/pub/mhc/mhc/Final Archive. >>>!!!>>> The dbMHC database provides an open, publicly accessible platform for DNA and clinical data related to the human Major Histocompatibility Complex (MHC). The dbMHC provides access to human leukocyte antigen (HLA) sequences, HLA allele and haplotype frequencies, and clinical datasets.
Arca Data is Fiocruz's official repository for archiving, publishing, disseminating, preserving and sharing digital research data produced by the Fiocruz community or in partnership with other research institutes or bodies, with the aim of promoting new research, ensuring the reproducibility or replicability of existing research and promoting an Open and Citizen Science. Its objective is to stimulate the wide circulation of scientific knowledge, strengthening the institutional commitment to Open Science and free access to health information, in addition to providing transparency and fostering collaboration between researchers, educators, academics, managers and graduate students, to the advancement of knowledge and the creation of solutions that meet the demands of society.
Knoema is a knowledge platform. The basic idea is to connect data with analytical and presentation tools. As a result, we end with one uniformed platform for users to access, present and share data-driven content. Within Knoema, we capture most aspects of a typical data use cycle: accessing data from multiple sources, bringing relevant indicators into a common space, visualizing figures, applying analytical functions, creating a set of dashboards, and presenting the outcome.
A premier source for United States cancer statistics, SEER gathers information related to incidence, prevalence, and survival from specific geographic areas that represent 28 percent of the population, as well as compiles related reports and reports on the national cancer mortality rates. Their aim is to provide information related to cancer statistics and decrease the burden of cancer in the national population. SEER has been collecting data from cancer cases since 1973.
>>>!!!<<< caArray Retirement Announcement >>>!!!<<< The National Cancer Institute (NCI) Center for Biomedical Informatics and Information Technology (CBIIT) instance of the caArray database was retired on March 31st, 2015. All publicly-accessible caArray data and annotations will be archived and will remain available via FTP download https://wiki.nci.nih.gov/x/UYHeDQ and is also available at GEO http://www.ncbi.nlm.nih.gov/geo/ . >>>!!!<<< While NCI will not be able to provide technical support for the caArray software after the retirement, the source code is available on GitHub https://github.com/NCIP/caarray , and we encourage continued community development. Molecular Analysis of Brain Neoplasia (Rembrandt fine-00037) gene expression data has been loaded into ArrayExpress: http://www.ebi.ac.uk/arrayexpress/experiments/E-MTAB-3073 >>>!!!<<< caArray is an open-source, web and programmatically accessible microarray data management system that supports the annotation of microarray data using MAGE-TAB and web-based forms. Data and annotations may be kept private to the owner, shared with user-defined collaboration groups, or made public. The NCI instance of caArray hosts many cancer-related public datasets available for download.
Country
SAGE is a data and research platform that enables the secondary use of data related to child and youth development, health and well-being. It currently contains research data, and at a later stage we aim to also house administrative and community service delivery data. Technical infrastructure and governance processes are in place to ensure ethical use and the privacy of participants. This dataverse provides metadata for the various data holdings available in SAGE (Secondary Analysis to Generate Evidence), a research data repository based in Edmonton Alberta and an intiative of PolicyWise for Children & Families. In general, SAGE contains data holdings too sensitive for open access. Each study lists a security level which indicates the procedure required to access the data.
Country
The Health Atlas is an alliance of medical ontologists, medical systems biologists and clinical trials groups to design and implement a multi-functional and quality-assured atlas. It provides models, data and metadata on specific use cases from medical research projects from the partner institutions.
Country
With ARS - Antimicrobial Resistance Surveillance in Germany - the infrastructure for a nationwide surveillance of antimicrobial resistance has been established, which covers both the inpatient medical care and the ambulatory care sector. This is intended to reliable data on the epidemiology of antimicrobial resistance in Germany and differential statements provided by structural features of the health care and by region are possible. ARS is designed as a laboratory-based surveillance system for continuous collection of resistance data from routine for the full range of clinically relevant bacterial pathogens. Project participants and thus data suppliers are laboratories that analyze samples of medical facilities and doctors' offices microbiologically.