  • * at the end of a keyword allows wildcard searches
  • " quotes can be used for searching phrases
  • + represents an AND search (default)
  • | represents an OR search
  • - represents a NOT operation
  • ( and ) imply priority
  • ~N after a word specifies the desired edit distance (fuzziness)
  • ~N after a phrase specifies the desired slop amount
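The operators listed above can be combined in a single query string. The following is a minimal sketch, assuming the search box accepts the operators exactly as described in the list; the helper names (prefix, phrase, fuzzy, all_of, any_of, exclude) and the example search terms are hypothetical and are not part of the site.

```python
from typing import Optional


def prefix(word: str) -> str:
    """Wildcard search: * at the end of a keyword."""
    return f"{word}*"


def phrase(text: str, slop: Optional[int] = None) -> str:
    """Phrase search in quotes; ~N after the phrase sets the slop amount."""
    quoted = f'"{text}"'
    return quoted if slop is None else f"{quoted}~{slop}"


def fuzzy(word: str, distance: int) -> str:
    """Fuzzy search: ~N after a word sets the edit distance."""
    return f"{word}~{distance}"


def all_of(*terms: str) -> str:
    """AND search: terms joined with + (the default operator)."""
    return " + ".join(terms)


def any_of(*terms: str) -> str:
    """OR search: terms joined with |, grouped with ( and ) for priority."""
    return "(" + " | ".join(terms) + ")"


def exclude(term: str) -> str:
    """NOT operation: - in front of a term."""
    return f"-{term}"


# Hypothetical example: a phrase with slop, AND either a wildcard or a
# fuzzy term, while excluding one keyword.
query = all_of(
    phrase("research data", slop=2),
    any_of(prefix("linguist"), fuzzy("corpus", 1)),
) + " " + exclude("genomics")

print(query)
```

The helpers only assemble text; how the resulting string is ranked or matched depends on the search service itself.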
Found 76 result(s)
The Buckeye Corpus of conversational speech contains high-quality recordings from 40 speakers in Columbus OH conversing freely with an interviewer. The speech has been orthographically transcribed and phonetically labeled. The audio and text files, together with time-aligned phonetic labels, are stored in a format for use with speech analysis software (Xwaves and Wavesurfer). Software for searching the transcription files is currently being written.
CHILDES is the child language component of the TalkBank system. TalkBank is a system for sharing and studying conversational interactions.
The World Data Center for Remote Sensing of the Atmosphere, WDC-RSAT, offers scientists and the general public free access (in the sense of a “one-stop shop”) to a continuously growing collection of atmosphere-related satellite-based data sets (ranging from raw to value-added data), information products and services. The focus is on atmospheric trace gases, aerosols, dynamics, radiation, and cloud physical parameters. Complementary information and data on surface parameters (e.g. vegetation index, surface temperatures) are also provided. This is achieved either by giving access to data stored at the data center or by acting as a portal containing links to other providers.
The Weizenbaum Library is the open access repository of the Weizenbaum Institute. It makes the open research results (publications and research data) of the Institute permanently accessible worldwide.
CERN, DESY, Fermilab and SLAC have built the next-generation High Energy Physics (HEP) information system, INSPIRE. It combines the successful SPIRES database content, curated at DESY, Fermilab and SLAC, with the Invenio digital library technology developed at CERN. INSPIRE is run by a collaboration of CERN, DESY, Fermilab, IHEP, IN2P3 and SLAC, and interacts closely with HEP publishers, arXiv.org, NASA-ADS, PDG, HEPDATA and other information resources. INSPIRE represents a natural evolution of scholarly communication, built on successful community-based information systems, and provides a vision for information management in other fields of science.
Humadoc is a digital service that collects, preserves and distributes digital material corresponding to the intellectual production of the Faculty of Humanities of the National University of Mar del Plata. Repositories are important tools for preserving an institution's legacy; they facilitate digital preservation and academic communication.
ClinVar is a freely accessible, public archive of reports of the relationships among human variations and phenotypes, with supporting evidence. ClinVar thus facilitates access to and communication about the relationships asserted between human variation and observed health status, and the history of that interpretation. ClinVar processes submissions reporting variants found in patient samples, assertions made regarding their clinical significance, information about the submitter, and other supporting data. The alleles described in submissions are mapped to reference sequences, and reported according to the HGVS standard. ClinVar then presents the data for interactive users as well as those wishing to use ClinVar in daily workflows and other local applications. ClinVar works in collaboration with interested organizations to meet the needs of the medical genetics community as efficiently and effectively as possible.
This data repository allows users to publish animal tracking datasets that have been uploaded to Movebank (https://www.movebank.org/ ). Published datasets have gone through a submission and review process, and are typically associated with a written study published in an academic journal. All animal tracking data in this repository are available to the public.
CERIC Data Portal allows users to consult and manage data related to experiments carried out at CERIC (Central European Research Infrastructure Consortium) partner facilities. Data made available includes scientific datasets collected during experiments, experiment proposals, samples used and publications, if any. Users can search for data based on related metadata (both their own data and other people's public data).
This is the KONECT project, a project in the area of network science with the goal to collect network datasets, analyse them, and make all analyses available online. KONECT stands for Koblenz Network Collection, as the project has roots at the University of Koblenz–Landau in Germany. All source code is made available as Free Software, and includes a network analysis toolbox for GNU Octave, a network extraction library, as well as code to generate these web pages, including all statistics and plots. KONECT contains over a hundred network datasets of various types, including directed, undirected, bipartite, weighted, unweighted, signed and rating networks. The networks of KONECT are collected from many diverse areas such as social networks, hyperlink networks, authorship networks, physical networks, interaction networks and communication networks. The KONECT project has developed network analysis tools which are used to compute network statistics, to draw plots and to implement various link prediction algorithms. The results of these analyses are presented on these pages. Whenever we are allowed to do so, we provide a download of the networks.
FDAT is a research data repository hosted by the University of Tübingen, designed to facilitate long-term archiving and publication of research data. Managed by the Information, Communication and Media Center (IKM), it primarily caters to the humanities and social sciences, while welcoming researchers from all scientific disciplines at the university. Committed to high-quality data management, FDAT emphasizes the importance of adhering to the FAIR Data Principles, promoting findability, accessibility, interoperability, and reusability of the research data it contains.
The UA Campus Repository is an institutional repository that facilitates access to the research, creative works, publications and teaching materials of the University by collecting, sharing and archiving content selected and deposited by faculty, researchers, staff and affiliated contributors.
Digital Case is Case Western Reserve University's digital library, institutional repository and digital archive. Digital Case stores, disseminates, and preserves the intellectual output of Case faculty, departments and research centers in digital formats (both "born digital" items as well as materials of historical interest that have been digitized). Kelvin Smith Library manages Digital Case on behalf of the university. With Digital Case, KSL assumes an active role in the scholarly communication process, providing expertise in the form of a set of services (metadata creation, secure environment, preservation over time) for access and distribution of the university’s collective intellectual product.
RU-Economicas is the repository of the Institute of Economic Research (IIEc) of the UNAM, created to manage, promote and preserve, in digital format, the intellectual production of the Institute of Economic Research. The objective of this repository is to promote scholarly communication and increase the visibility and use of the content produced at the Institute. It houses various materials, peer-reviewed or not, including books, journals, articles, lectures, presentations, databases and audiovisual materials. RU-Economicas provides the general public, students, teachers and researchers with a search and online consultation service for the digital resources produced by the academic community of the Institute of Economic Research. The repository is part of the university's Digital Archives Network (RAD-UNAM), which aims to create a network of university repositories to support university departments in the management and dissemination of their digital resources. Thus, the Institute of Economic Research adds to the efforts of the UNAM for better management of and access to the intellectual products of the university community in the digital environment.
The UCI Machine Learning Repository is a collection of databases, domain theories, and data generators that are used by the machine learning community for the empirical analysis of machine learning algorithms. It is used by students, educators, and researchers all over the world as a primary source of machine learning data sets. As an indication of the impact of the archive, it has been cited over 1000 times.
The American National Election Studies (ANES) conducts national surveys and pilot studies and provides large, multifaceted datasets. Time Series Studies are conducted during years of national elections, with pre-election and post-election surveys conducted in presidential election years and post-election surveys conducted during congressional election years. Pilot Studies are normally conducted in years when there is no national election and are designed to test new, or to refine existing, instrumentation and study designs. Other Major Data Collections includes panel studies and other special studies.
Social Computing Data Repository hosts data from a collection of many different social media sites, most of which have blogging capacity. Some of the prominent social media sites included in this repository are BlogCatalog, Twitter, MyBlogLog, Digg, StumbleUpon, del.icio.us, MySpace, LiveJournal, The Unofficial Apple Weblog (TUAW) and Reddit. The repository contains various facets of blog data, including blog site metadata such as user-defined tags, predefined categories and blog site description; blog post level metadata such as user-defined tags and date and time of posting; blog posts; blog post mood (defined as the blogger's emotions when (s)he wrote the blog post); blogger name; blog post comments; and blogger social network.
MD-SOAR is a shared digital repository platform for eleven colleges and universities in Maryland. It is currently funded by the University System of Maryland and Affiliated Institutions (USMAI) Library Consortium (usmai.org) and other participating partner institutions. MD-SOAR is jointly governed by all participating libraries, who have agreed to share policies and practices that are necessary and appropriate for the shared platform. Within this broad framework, each library provides customized repository services and collections that meet local institutional needs. Please follow the links below to learn more about each library's repository services and collections.
The Research Data Center Qualiservice provides services for archiving and reusing qualitative research data from the social sciences. We advise and accompany research projects in the process of long-term data archiving and data sharing. Data curation is conducted by experts for the social sciences. We also provide research data and relevant context information for reuse in scientific research and teaching. Internationally interoperable metadata ensure that data sets are searchable and findable. Persistent identifiers (DOI) ensure that data and study contexts are citable. Qualiservice was accredited by the German Data Forum (RatSWD) in 2019 and adheres to its quality assurance criteria. Qualiservice is committed to the German Research Foundation’s (DFG) Guidelines for Safeguarding Good Scientific Practice and takes into account the FAIR Guiding Principles for scientific data management and stewardship as well as the OECD Principles and Guidelines for Access to Research Data from Public Funding. Qualiservice coordinates the networking and further development of scientific infrastructures for archiving and secondary use of qualitative data from social research within the framework of the National Research Data Infrastructure.
Pathway Commons is a convenient point of access to biological pathway information collected from public pathway databases. Information is sourced from public pathway databases and is readily searched, visualized, and downloaded. The data is freely available under the license terms of each contributing database.