Sort by

Allgemeine Bevölkerungsumfrage der Sozialwissenschaften

The German General Social Survey (ALLBUS) collects up-to-date data on attitudes, behavior, and social structure in Germany. Every two years since 1980 a representative cross section of the population is surveyed using both constant and variable questions. The ALLBUS data become available to interested parties for research and teaching as soon as they are processed and documented.

The Antarctic Glaciological Data Center (AGDC) at NSIDC archives and distributes Antarctic glaciological and cryospheric data collected by the U.S. Antarctic Program. From this Web site, you can access the data, the metadata, and the guide documentation for each data set as well as submit your data for archival, find related data sets, and access a collection of Antarctica photographs and images from the NSIDC archive. AGDC developped and gives access to A-CAP: The Antarctic Cryosphere Access Portal.

The Archaeology Data Service supports research, learning and teaching with freely available, high quality and dependable digital resources. It does this by preserving digital data in the long term, and by promoting and disseminating a broad range of data in archaeology. The ADS promotes good practice in the use of digital data in archaeology, it provides technical advice to the research community, and supports the deployment of digital technologies.

Australian Antarctic Data Centre
Data management and spatial data services

The Australian Antarctic Data Centre (AADC) provides data collection and data management services in Australia's Antarctic Science Program. The AADC manages science data from Australia's Antarctic research, maps Australia's areas of interest in the Antarctic region, manages Australia's Antarctic state of the environment reporting, and provides advice and education and a range of other products.

Bibliothekarisches Archivierungs- und Bereitstellungssystem

BABS include digital reproductions from the digitization of the Munich Digitisation CenterMunich Digitization Center/Digital Library of the Bavarian State Library including digital reproductions from copyright-free works from the BSB collections created by cooperation partners or service providers, such as digital copies from the The google-ProjectGoogle project; official publications of authorities, departments and agencies of the State of Bavaria according to the "Bavarian State Promulgation 2 December 2008 (Az.: B II 2-480-30)" on the delivery of official publications to libraries, the Promulgation Platform Bavaria (Verkündungsplattform), as well as voluntary deliveries of electronic publications of different (mainly Bavarian scientific) publishing houses and other publishers; scientifically relevant literature (open access publications and websites) of national and international origin in the Areas of Collection Emphasis of the BSB (history including classical studies, Eastern Europe, history of France and Italy, music, library science, book studies and information science) as well as Bavarica; electronic publications produced by the BSB specialist departments, especially those of the Center for Electronic Publishing (ZEP); local/regional/national licensed or purchased electronic publications

The Information Bank for Applied Research in Social Sciences (BIIACS) Central aims to provide strategic information services for the investigation and resolution of social problems and conduct rigorous analysis of databases and provide advice to decision-makers. To provide efficient access to information, the BIIACS conducted five major activities: collects, protects, preserves, and disseminates cure databases. The databases are available online and for free. They are organized in communities and collections with diverse content ranging from information on various economic, political and social of Mexico in different periods of federal government until election survey results conducted by higher education institutions as the CIDE and pollsters as Mund Americas.

Bavarian Archive for Speech Signals
Bayerisches Archiv für Sprachsignale

The Bavarian Archive for Speech Signals (BAS) is a public institution hosted by the University of Munich. This institution was founded with the aim of making corpora of current spoken German available to both the basic research and the speech technology communities via a maximally comprehensive digital speech-signal database. The speech material will be structured in a manner allowing flexible and precise access, with acoustic-phonetic and linguistic-phonetic evaluation forming an integral part of it.

BADGIR is an on-line data archive at the University of Wisconsin - Madison. From this portal you can browse or search data documentation (e.g., metadata, codebooks) and univariate summary statistics (e.g., mean, frequency counts). The database contains documented survey results from various survey-research projects in the US, Central America, and South America. Some historical datasets also (e.g. of slave-trade records).

CfA Library Datasets Dataverse
Harvard Dataverse Network

The aim of CfA Library Datasets Dataverse is creating a better information system to respond to the changing needs of astronomers not only at the CfA, but worldwide as well. As part of this growing partnership with the ADS, the CfA Library is expanding its metadata and data curation services, and in the process, creating datasets that the astronomy community may find useful. The CfA Library Datasets Dataverse has been created to share these datasets with the greater community with the hope that some members may find it useful. Please remember to acknowledge the CfA Library and the ADS and cite the work using the "Data Citation" presented under each study's "Cataloging Information" section.

The CGIAR Research Program No. 6 (CRP6): Forests, Trees and Agroforestry: Livelihoods, Landscapes and Governance aims to enhance the management and use of forests, agroforestry and tree genetic resources across the landscape, from farms to forests.

CHILDES is the child language component of the TalkBank system. TalkBank is a system for sharing and studying conversational interactions.

CLARIN Centre Vienna (CCV) is Austria’s main connection point to the European network of CLARIN Centres. It is an Austrian contribution to CLARIN-ERIC and being hosted by the Institute for Corpus Linguistics and Text Technology (ICLTT) of the Austrian Academy of Sciences. It is jointly funded by the Academy and the Federal Ministry of Science, Research and Economy. If you have language resources, would like to share these with the scientifc community (and/or the public) and want to make sure that the data will be around in the future, contact us. We offer archiving and online availability of the resources. If needed, we will assist you in converting data and metadata into required formats. CCV is embedded in the Digital Humanities Austria (DHA) initiative which has started in January 2014. DHA represents the umbrella under which the DH infrastructure activities CLARIN and DARIAH are conducted in Austria.

CLARIN-D Centre Leipzig ASV Automatische Sprachverarbeitung
CLARIN-D repository at the University of Leipzig

The CLARIN­-D repository at the University of Leipzig offers long­term preservation of digital resources, along with their descriptive metadata. The mission of the repository is to ensure the availability and long­term preservation of resources, to preserve knowledge gained in research, to aid the transfer of knowledge into new contexts, and to integrate new methods and resources into university curricula. Among the resources currently available in the Leipzig repository are a set of corpora of the Leipzig Corpora Collection (LCC), based on newspaper, Wikipedia and Web text. Furthermore several REST-based webservices are provided for a variety of different NLP-relevant tasks

Virtual Language Observatory

CLARIN is the short name for the Common Language Resources and Technology Infrastructure, which aims at providing easy and sustainable access for scholars in the humanities and social sciences to digital language data (in written, spoken, video or multimodal form) and advanced tools to discover, explore, exploit, annotate, analyse or combine them, independent of where they are located. To this end CLARIN is in the process of building a networked federation of European data repositories, service centres and centres of expertise, with single sign-on access for all members of the academic community in all participating countries. Tools and data from different centres will be interoperable, so that data collections can be combined and tools from different sources can be chained to perform complex operations to support researchers in their work. At this moment the CLARIN infrastructure is still under construction, but a number of participating centres are already offering access services to data, tools and expertise. On the services page we show the services accessible at this moment and we explain how and by whom the various services can be accessed. The main tool is the 'Virtual Language Observatory' that provides access to the different national CLARIN centers and their data.

Clarin in Denmark
CLARIN i Danmark is a Danish IT infrastructure intended for use by humanities scholars. The infrastructure includes digitized research material in the form of written and spoken texts, audio and video records, lexical resources and tools. Part of the resources collected or converted to other formats as part of the project. Other resources developed or collected in other projects and made available to researchers through The website is regularly updated with new materials and tools. The vision is to create the humanities researcher's toolbox through the creation of resources with associated tools and integrating resources together in a web-based electronic research environment provided for humanities researchers. Such access to resources and tools will allow scientists unprecedented opportunities and will also help to enhance their ability to participate in European collaborative projects. The Danish CLARIN project will eventually generate better conditions for Danish language technology research and development.


The focus of CLARIN INL Portal is on resources that are relevant to the lexicological study of the Dutch language and on resources relevant for research in and development of language and speech technology. For Example: lexicons, lexical databases, text corpora, speech corpora, language and speech technology tools, etc. The resources are: Cornetto-LMF (Lexicon Markup Framework), Corpus of Contemporary Dutch (Corpus Hedendaags Nederlands), Corpus Gysseling, Corpus VU-DNC (VU University Diachronic News text Corpus), Dictionary of the Frisian Language (Woordenboek der Friese Taal), DuELME-LMF (Lexicon Markup Framework), Language Portal (Taalportaal), Namescape, NERD (Named Entity Recognition and Disambiguation) and TICCLops (Text-Induced Corpus Clean-up online processing system).

Language Technology Centre

Polish CLARIN node – CLARIN-PL Language Technology Centre – is being built at Wrocław University of Technology. The LTC is addressed to scholars in the humanities and social sciences. Registered users are granted free access to digital language resources and advanced tools to explore them. They can also archive and share their own language data (in written, spoken, video or multimodal form).

The repository is part of the eScience infrastructure of the University of Tübingen, which is a core facility that strongly cooperates with the library and computing center of the university. Integration of the repository into the national CLARIN-D and international CLARIN infrastructures gives it wide exposure, increasing the likelihood that the resources will be used and further developed beyond the lifetime of the projects in which they were developed. Among the resources currently available in the Tübingen Center Repository, researchers can find widely used treebanks of German (e.g. TüBa-D/Z), the German wordnet (GermaNet), the first manually annotated digital treebank (Index Thomisticus), as well as descriptions of the tools used by the WebLicht ecosystem for natural language processing.

World Data System for Cold and Arid Regions(CARD) is a new scientific data sharing system which is established on the basis of the former World Data Center for Glaciology and Geocryology, Lanzhou and other data centers hosted by Cold and Arid Regions Environmental and Engineering Research Institute, Chinese Academy of Sciences. World Data System for Cold and Arid Regions is one of the constituents of World Data System. The data sharing system's main goals are to collect, manage and store the scientific data of Cold and Arid Regions area in China and provide the services for the scientific research of Cold and Arid Regions.

CPES provides access to information that relates to mental disorders among the general population. Its primary goal is to collect data about the prevalence of mental disorders and their treatments in adult populations in the United States. It also allows for research related to cultural and ethnic influences on mental health. CPES combines the data collected in three different nationally representative surveys (National Comorbidity Survey Replication, National Survey of American Life, National Latino and Asian American Study).

CrystalEye (beta)

The repository "Crystallography Open Database" now is including data and software from CrystalEye, developed by Nick Day at the department of Chemistry, the University of Cambridge under supervision of Peter Murray-Rust. The aim of the CrystalEye project is to aggregate crystallography from web resources, and to provide methods to easily browse, search, and to keep up to date with the latest published information.At present we are aggregating the crystallography from the supplementary data to articles at publishers websites.

Danish Data Archive
Dansk Data Arkiv

The Danish Data Archive (DDA) is the national social science data archive. DDA is primaily used by researchers and students wanting access to data materials created by Danish researchers or about Denmark. DDA is dedicated to the acquisition, preservation and dissemination of (primarily) quantitative data created by researchers from social science, health science and history.

Das Sozio-oekonomische Panel
Research Data Center of the SOEP

The German Socio-Economic Panel Study (SOEP) is a wide-ranging representative longitudinal study of private households, located at the German Institute for Economic Research, DIW Berlin. Every year, there were nearly 11,000 households, and more than 20,000 persons sampled by the fieldwork organization TNS Infratest Sozialforschung. The data provide information on all household members, consisting of Germans living in the Old and New German States, Foreigners, and recent Immigrants to Germany. The Panel was started in 1984. Some of the many topics include household composition, occupational biographies, employment, earnings, health and satisfaction indicators.

DataFirst is a research unit at the University of Cape Town engaged in promoting the long term preservation and reuse of data from African Socioeconomic surveys and provides a secure setting for improved access to national census and survey microdata for research purposes.

CC0 To the extent possible under law, has waived all copyright and related or neighboring rights to the database entries of

Creative Commons License Except where otherwise noted, content on this site is licensed under a Creative Commons Attribution 4.0 International License.