Search | re3data.org

ILC-CNR for CLARIN-IT repository

ILC4CLARIN

Subject(s)

Content type(s)

Country

ILC-CNR for CLARIN-IT repository is a library for linguistic data and tools. Including: Text Processing and Computational Philology; Natural Language Processing and Knowledge Extraction; Resources, Standards and Infrastructures; Computational Models of Language Usage. The studies carried out within each area are highly interdisciplinary and involve different professional skills and expertises that extend across the disciplines of Linguistics, Computational Linguistics, Computer Science and Bio-Engineering.

ChemSpider

Search and share chemistry

Subject(s)

Content type(s)

Country

ChemSpider is a free chemical structure database providing fast access to over 58 million structures, properties and associated information. By integrating and linking compounds from more than 400 data sources, ChemSpider enables researchers to discover the most comprehensive view of freely available chemical data from a single online search. It is owned by the Royal Society of Chemistry. ChemSpider builds on the collected sources by adding additional properties, related information and links back to original data sources. ChemSpider offers text and structure searching to find compounds of interest and provides unique services to improve this data by curation and annotation and to integrate it with users’ applications.

TROLLing

Tromsø Repository of Language and Linguistics

Subject(s)

Content type(s)

Country

The Tromsø Repository of Language and Linguistics (TROLLing) is a FAIR-aligned repository of linguistic data and statistical code. The archive is open access, which means that all information is available to everyone. All data are accompanied by searchable metadata that identify the researchers, the languages and linguistic phenomena involved, the statistical methods applied, and scholarly publications based on the data (where relevant). Linguists worldwide are invited to deposit data and statistical code used in their linguistic research. TROLLing is a special collection within DataverseNO (http://doi.org/10.17616/R3TV17), and C Centre within CLARIN (Common Language Resources and Technology Infrastructure, a networked federation of European data repositories; http://www.clarin.eu/), and harvested by their Virtual Language Observatory (VLO; https://vlo.clarin.eu/).

Gulf of Mexico Research Initiative Information and Data Cooperative

GRIIDC

Subject(s)

Content type(s)

Country

United States

The Gulf of Mexico Research Initiative Information and Data Cooperative (GRIIDC) is a team of researchers, data specialists and computer system developers who are supporting the development of a data management system to store scientific data generated by Gulf of Mexico researchers. The Master Research Agreement between BP and the Gulf of Mexico Alliance that established the Gulf of Mexico Research Initiative (GoMRI) included provisions that all data collected or generated through the agreement must be made available to the public. The Gulf of Mexico Research Initiative Information and Data Cooperative (GRIIDC) is the vehicle through which GoMRI is fulfilling this requirement. The mission of GRIIDC is to ensure a data and information legacy that promotes continual scientific discovery and public awareness of the Gulf of Mexico Ecosystem.

Cancer Cell Line Encyclopedia

CCLE

Subject(s)

Content type(s)

Country

United States

The Cancer Cell Line Encyclopedia project is a collaboration between the Broad Institute, and the Novartis Institutes for Biomedical Research and its Genomics Institute of the Novartis Research Foundation to conduct a detailed genetic and pharmacologic characterization of a large panel of human cancer models, to develop integrated computational analyses that link distinct pharmacologic vulnerabilities to genomic patterns and to translate cell line integrative genomics into cancer patient stratification. The CCLE provides public access to genomic data, analysis and visualization for about 1000 cell lines.

ISRIC Soil Metadata Catalogue

data.isric.org

Subject(s)

Content type(s)

Country

Netherlands

data.isric.org is our central location for searching and downloading soil data bases/layers from around the world. ISRIC - World Soil Infomation (WDC-Soils) is a regular member of the ICS World Data System (WDS). We support Open Data whenever possible, respecting inherited rights (licences).

Europeana collections

Subject(s)

Content type(s)

Country

European Union

Europeana is the trusted source of cultural heritage brought to you by the Europeana Foundation and a large number of European cultural institutions, projects and partners. It’s a real piece of team work. Ideas and inspiration can be found within the millions of items on Europeana. These objects include: Images - paintings, drawings, maps, photos and pictures of museum objects Texts - books, newspapers, letters, diaries and archival papers Sounds - music and spoken word from cylinders, tapes, discs and radio broadcasts Videos - films, newsreels and TV broadcasts All texts are CC BY-SA, images and media licensed individually.

megx.net

Marine Ecological GenomiX

Subject(s)

Content type(s)

Country

Germany

<<<!!!<<< This repository is no longer open to the public !!! >>>!!!>>>

LINCS Data Portal

Library of Integrated Network-based Signatures Data Portal

Subject(s)

Content type(s)

Country

United States

LINCS Data Portal provides access to LINCS data from various sources. The program has six Data and Signature Generation Centers: Drug Toxicity Signature Generation Center, HMS LINCS Center, LINCS Center for Transcriptomics, LINCS Proteomic Characterization Center for Signaling and Epigenetics, MEP LINCS Center, and NeuroLINCS Center.

ETH Travel Data Archive

ETHTDA

Subject(s)

Content type(s)

Country

Switzerland

<<<!!!<<< The documents are stored in the ETH Web archive and are no longer maintained >>>!!!>>>

IEDA

Interdisciplinary Earth Data Alliance

Subject(s)

Content type(s)

Country

United States

IEDA2 is currently undergoing a website reconstruction and will be back soon. IEDA is a community-based facility that serves to support, sustain, and advance the geosciences by providing data services for observational Geoscience data from the Ocean, Earth, and Polar Sciences. IEDA welcomes and encourages investigators to contribute their data to the IEDA collections so that the data can be discovered and reused by a diverse community now and in the future. The IEDA collections are: EarthChem, Geochron, System for Earth Sample Registration (SESAR), Marine Geoscience Data System (MGDS), and USAP Data Center. Meta-Search provided on the portal through IEDA Data Browser http://www.iedadata.org/databrowser .

DATA.GOV.UK

Opening up Government

Subject(s)

Content type(s)

Country

United Kingdom

The Government is releasing public data to help people understand how government works and how policies are made. Some of this data is already available, but data.gov.uk brings it together in one searchable website. Making this data easily available means it will be easier for people to make decisions and suggestions about government policies based on detailed information.

The Arabidopsis Information Resource

TAIR

Subject(s)

Content type(s)

Country

United States

The Arabidopsis Information Resource (TAIR) maintains a database of genetic and molecular biology data for the model higher plant Arabidopsis thaliana . Data available from TAIR includes the complete genome sequence along with gene structure, gene product information, metabolism, gene expression, DNA and seed stocks, genome maps, genetic and physical markers, publications, and information about the Arabidopsis research community. Gene product function data is updated every two weeks from the latest published research literature and community data submissions. Gene structures are updated 1-2 times per year using computational and manual methods as well as community submissions of new and updated genes. TAIR also provides extensive linkouts from our data pages to other Arabidopsis resources.

Integrated Climate Data Center

ICDC

Subject(s)

Content type(s)

Country

Germany

The CliSAP-Integrated Climate Data Center (ICDC) allows easy access to climate relevant data from satellite remote sensing and in situ and other measurements in Earth System Sciences. These data are important to determine the status and the changes in the climate system. Additionally some relevant re-analysis data are included, which are modeled on the basis of observational data. ICDC cooperates with the "Zentrum für Nachhaltiges Forschungsdatenmanagement "https://www.fdr.uni-hamburg.de/ to publish observational data with a doi.

ComBase

Subject(s)

Content type(s)

Country

ComBase is a resource for quantitative and predictive food microbiology. ComBase includes a database of observed microbial responses to a variety of food-related environments and a collection of predictive models.

VecNet

Vector-Borne Disease Network

Subject(s)

Content type(s)

Country

<<<!!!<<< The repository is no longer available. 2019-09-06: no more access to VecNet >>>!!!>>>

Centre National de Ressources Textuelles et Lexicales

CNRTL

Subject(s)

Content type(s)

Country

France

Created in 2005 by the CNRS, CNRTL unites in a single portal, a set of linguistic resources and tools for language processing. The CNRTL includes the identification, documentation (metadata), standardization, storage, enhancement and dissemination of resources. The sustainability of the service and the data is guaranteed by the backing of the UMR ATILF (CNRS - Université Nancy), support of the CNRS and its integration in the excellence equipment project ORTOLANG .

PhysioNet

Subject(s)

Content type(s)

Country

United States

Modern signal processing and machine learning methods have exciting potential to generate new knowledge that will impact both physiological understanding and clinical care. Access to data - particularly detailed clinical data - is often a bottleneck to progress. The overarching goal of PhysioNet is to accelerate research progress by freely providing rich archives of clinical and physiological data for analysis. The PhysioNet resource has three closely interdependent components: An extensive archive ("PhysioBank"), a large and growing library of software ("PhysioToolkit"), and a collection of popular tutorials and educational materials

OpenKIM

Knowledgebase of Interatomic Models

Subject(s)

Content type(s)

Country

United States

OpenKIM is an online suite of open source tools for molecular simulation of materials. These tools help to make molecular simulation more accessible and more reliable. Within OpenKIM, you will find an online resource for standardized testing and long-term warehousing of interatomic models and data, and an application programming interface (API) standard for coupling atomistic simulation codes and interatomic potential subroutines.

Registry of Open Data on AWS

Registry of Open Data on Amazon Web Services

Subject(s)

Content type(s)

Country

United States

The Registry of Open Data on AWS provides a centralized repository of public data sets that can be seamlessly integrated into AWS cloud-based applications. AWS is hosting the public data sets at no charge to their users. Anyone can access these data sets from their Amazon Elastic Compute Cloud (Amazon EC2) instances and start computing on the data within minutes. Users can also leverage the entire AWS ecosystem and easily collaborate with other AWS users.

CLARINO Bergen Center repository

Subject(s)

Content type(s)

Country

CLARINO Bergen Center repository is the repository of CLARINO, the Norwegian infrastructure project . Its goal is to implement the Norwegian part of CLARIN. The ultimate aim is to make existing and future language resources easily accessible for researchers and to bring eScience to humanities disciplines. The repository includes INESS the Norwegian Infrastructure for the Exploration of Syntax and Semantics. This infrastructure provides access to treebanks, which are databases of syntactically and semantically annotated sentences.

CLARIN-PL

Language Technology Centre

Subject(s)

Content type(s)

Country

Polish CLARIN node – CLARIN-PL Language Technology Centre – is being built at Wrocław University of Technology. The LTC is addressed to scholars in the humanities and social sciences. Registered users are granted free access to digital language resources and advanced tools to explore them. They can also archive and share their own language data (in written, spoken, video or multimodal form).

Synapse

Subject(s)

Content type(s)

Country

United States

Synapse is an open source software platform that clinical and biological data scientists can use to carry out, track, and communicate their research in real time. Synapse enables co-location of scientific content (data, code, results) and narrative descriptions of that work.

DesignSafe-CI Data Depot Repository

A Repository Infrastructure for Natural Hazards Datasets

Subject(s)

Content type(s)

Country

United States

The DesignSafe Data Depot Repository (DDR) is the platform for curation and publication of datasets generated in the course of natural hazards research. The DDR is an open access data repository that enables data producers to safely store, share, organize, and describe research data, towards permanent publication, distribution, and impact evaluation. The DDR allows data consumers to discover, search for, access, and reuse published data in an effort to accelerate research discovery. It is a component of the DesignSafe cyberinfrastructure, which represents a comprehensive research environment that provides cloud-based tools to manage, analyze, curate, and publish critical data for research to understand the impacts of natural hazards. DesignSafe is part of the NSF-supported Natural Hazards Engineering Research Infrastructure (NHERI), and aligns with its mission to provide the natural hazards research community with open access, shared-use scholarship, education, and community resources aimed at supporting civil and social infrastructure prior to, during, and following natural disasters. It serves a broad national and international audience of natural hazard researchers (both engineers and social scientists), students, practitioners, policy makers, as well as the general public. It has been in operation since 2016, and also provides access to legacy data dating from about 2005. These legacy data were generated as part of the NSF-supported Network for Earthquake Engineering Simulation (NEES), a predecessor to NHERI. Legacy data and metadata belonging to NEES were transferred to the DDR for continuous preservation and access.

Flanders Marine Institute

VLIZ

Subject(s)

Content type(s)

Country

Belgium

The Flanders Marine Institute (VLIZ) is a centre for marine and coastal research. As a partner in various projects and networks it promotes and supports the international image of Flemish marine scientific research and international marine education. In its capacity as a coordination and information platform, the Flanders Marine Institute (VLIZ) supports some thousand marine scientists in Flanders by disseminating their knowledge to policymakers, educators, the general public and scientists.

Subjects

Content Types

Countries

AID systems

API

Certificates

Data access

Data access restrictions

Database access

Database access restrictions

Database licenses

Data licenses

Data upload

Data upload restrictions

Enhanced publication

Institution responsibility type

Institution type

Keywords

Metadata standards

PID systems

Provider types

Quality management

Repository languages

Software

Syndications

Repository types

Versioning