  • * at the end of a keyword allows wildcard searches
  • " quotes can be used for searching phrases
  • + represents an AND search (default)
  • | represents an OR search
  • - represents a NOT operation
  • ( and ) imply priority
  • ~N after a word specifies the desired edit distance (fuzziness)
  • ~N after a phrase specifies the desired slop amount
Found 13 result(s)
OrthoMCL is a genome-scale algorithm for grouping orthologous protein sequences. It provides not only groups shared by two or more species/genomes, but also groups representing species-specific gene expansion families. It therefore serves as an important utility for automated eukaryotic genome annotation. OrthoMCL starts with reciprocal best hits within each genome as potential in-paralog/recent paralog pairs and reciprocal best hits across any two genomes as potential ortholog pairs. Related proteins are interlinked in a similarity graph. MCL (Markov Clustering algorithm, Van Dongen 2000; www.micans.org/mcl) is then invoked to split mega-clusters. This process is analogous to the manual review in COG construction. MCL clustering is based on weights between each pair of proteins, so the weights are normalized before running MCL to correct for differences in evolutionary distance.
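The reciprocal-best-hit step described above can be sketched as follows. This is an illustration only, not OrthoMCL's own code: the input layout (query protein, query genome, subject protein, subject genome, similarity score) and the function names are assumptions, and the weight normalization and MCL clustering steps are left out.

```python
# Hypothetical hit record:
# (query_protein, query_genome, subject_protein, subject_genome, score)
Hit = tuple[str, str, str, str, float]


def best_hits(hits: list[Hit]) -> dict[tuple[str, str], tuple[str, float]]:
    """Map (query protein, subject genome) to its highest-scoring subject."""
    best: dict[tuple[str, str], tuple[str, float]] = {}
    for query, _, subject, subject_genome, score in hits:
        if query == subject:
            continue  # ignore self-hits
        key = (query, subject_genome)
        if key not in best or score > best[key][1]:
            best[key] = (subject, score)
    return best


def reciprocal_best_hits(hits: list[Hit]) -> tuple[set, set]:
    """Return (cross-genome pairs, within-genome pairs) of reciprocal best hits."""
    genome_of: dict[str, str] = {}
    for query, query_genome, subject, subject_genome, _ in hits:
        genome_of[query] = query_genome
        genome_of[subject] = subject_genome

    best = best_hits(hits)
    ortholog_pairs, paralog_pairs = set(), set()
    for (query, subject_genome), (subject, _) in best.items():
        # The hit is reciprocal if the subject's best hit in the query's
        # genome is the query itself.
        back = best.get((subject, genome_of[query]))
        if back is None or back[0] != query:
            continue
        pair = tuple(sorted((query, subject)))
        if subject_genome == genome_of[query]:
            paralog_pairs.add(pair)   # potential in-paralog pair
        else:
            ortholog_pairs.add(pair)  # potential ortholog pair
    return ortholog_pairs, paralog_pairs
```

In this sketch the resulting pairs would become the edges of the similarity graph; a real pipeline would attach normalized weights to them before clustering.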
Rodare is the institutional research data repository at HZDR (Helmholtz-Zentrum Dresden-Rossendorf). Rodare allows HZDR researchers to upload their research software and data and enrich them with metadata to make them findable, accessible, interoperable and reusable (FAIR). Publishing all associated research software and data via Rodare can improve research reproducibility. Uploads receive a Digital Object Identifier (DOI) and can be harvested via an OAI-PMH interface.
The Maize Genetics and Genomics Database focuses on collecting data related to the crop plant and model organism Zea mays. The project's goals are to synthesize, display, and provide access to maize genomics and genetics data, prioritizing mutant and phenotype data and tools, structural and genetic map sets, and gene models. MaizeGDB also aims to make the Maize Newsletter available and to provide support services to the community of maize researchers. MaizeGDB is working with the Schnable lab, the Panzea project, the Genome Reference Consortium, and iPlant Collaborative to create a plan for archiving, disseminating, visualizing, and analyzing diversity data. MaizeGDB is short for Maize Genetics/Genomics Database. It is a USDA/ARS-funded project to integrate the data found in MaizeDB and ZmDB into a single schema, develop an effective interface to access this data, and develop additional tools to make data analysis easier. Our long-term goal is a true next-generation online maize database.
Brainlife promotes engagement and education in reproducible neuroscience. We do this by providing an online platform where users can publish code (Apps) and data, and make them "alive" by integrating various HPC and cloud computing resources to run those Apps. Brainlife also provides mechanisms to publish all research assets associated with a scientific project (data and analyses) embedded in a cloud computing environment and referenced by a single digital object identifier (DOI). The platform is unique because of its focus on supporting scientific reproducibility beyond open code and open data, by providing fundamental smart mechanisms for what we refer to as “Open Services.”
The Harvard Dataverse is open to all scientific data from all disciplines worldwide. It includes the world's largest collection of social science research data. It hosts data for projects, archives, researchers, journals, organizations, and institutions.
DisGeNET is a discovery platform containing one of the largest publicly available collections of genes and variants associated with human diseases. DisGeNET integrates data from expert-curated repositories, GWAS catalogues, animal models and the scientific literature. DisGeNET data are homogeneously annotated with controlled vocabularies and community-driven ontologies. Additionally, several original metrics are provided to assist the prioritization of genotype–phenotype relationships.
The Swedish Infrastructure for Ecosystem Science (SITES) is a national infrastructure for terrestrial and limnological field research. SITES aims to promote high-quality research through long-term field measurements and field experiments, and by making data available. Quality-controlled monitoring data from SITES is freely available on the SITES Data Portal from all participating stations and thematic programs. New datasets are continuously being uploaded.
The Human Metabolome Database (HMDB) is a freely available electronic database containing detailed information about small molecule metabolites found in the human body. It is intended to be used for applications in metabolomics, clinical chemistry, biomarker discovery and general education.
The South African Marine Information Management System (MIMS) is an Open Archival Information System (OAIS) repository that plays a multifaceted role in archiving, publishing, and preserving marine-related datasets. As an IODE-accredited Associate Data Unit (ADU), MIMS serves as a national node for the IODE of the IOC of UNESCO. It archives and publishes collections and subsets of marine-related datasets for the National Department of Forestry, Fisheries, and the Environment (DFFE) and its regional partners. As an IOC member organization, DFFE is committed to supporting the long-term preservation and archival of marine and coastal data for South Africa and its regional partners, promoting open access to data, and encouraging scientific collaboration. Tasked with the long-term preservation of South Africa's marine and coastal data, MIMS functions as an institutional data repository. It provides primary access to all data collected by the DFFE Oceans and Coastal Research Directorate and acts as a trusted broker of scientific marine data for a wide range of South African institutions. MIMS hosts the IODE AFROBIS Node, an OBIS Node that coordinates and collates data management activities within the sub-Saharan African region. As part of the OBIS Steering Group, MIMS represents sub-Saharan Africa on issues around biological (biodiversity) data standards. It also facilitates data and metadata publishing for the region through the GBIF and OBIS networks. Operating on the Findable, Accessible, Interoperable, and Reusable (FAIR) data principles, MIMS aligns its practices to maximize ocean data exchange and use while respecting the conditions stipulated by the Data Provider. By integrating various functions and commitments, MIMS stands as a vital component in the marine and coastal data landscape, fostering collaboration, standardization, and accessibility in alignment with international standards and regional needs.
ERIC/open is the institutional repository where Eawag scientists publish their research data. Research data is organized in Packages, which contain one or more Resources. Resources are usually files containing the research data proper or ancillary information such as a README file. A URL pointing to external information might also constitute a Resource.