Open Data

Most open data are either datasets created by an organisation for a specific purpose (e.g. census data) and data resulting from academic research.  Data from government organisations tend to have specific formats and supporting documentation that enable the data to be easily accessible and understandable. Academic research data on the other hand, is only now beginning to be required by funders and the standard of datasets produced by academic researchers vary in terms of quality and reproducibility.

“Open data and content can be freely used, modified, and shared by anyone for any purpose.”

The main issue with open data is not the amount of available data, but whether the data is in a useable format with sufficient documentation to enable it to be beneficial or useful to anyone else.

At the University of Northampton all our researchers are required to create a dataset on completion of a research project that is stored within our institutional repository, Pure.  Our research support staff work with our researchers to ensure that the data in the datasets is meaningful and in file formats that can be preserved and reused. Published datasets from the University of Northampton are stored in our Research Explorer.

  • Open data sets are available from an increasing variety of sources, including the UK government, who now release data under the Open Government Licence.

    Official statistics reflecting Britain’s economy, population and society at national and local level. Includes online versions of current statistical publications including Economic Trends, Labour Market Trends, Living in Britain, Social Trends and Travel Trends.

    The London Datastore is a free and open data-sharing portal where anyone can access data relating to the capital.

    Find data published by central government, local authorities and public bodies.

    NHS data dashboards provides a way to explore some of the NHS’ datasets.

    High quality data for social science research, teaching and learning

    Guide designed to help new researchers utilise some of the extensive survey data available through the UK Data Service. In particular, the guide is designed to support those starting small research projects, especially students doing dissertations.

    The Bank of England publish data and statistics that they use to help develop the banking system and understand economic trends, the section also includes a selection of research datasets.

  • Some research funding bodies require data to be deposited in specific discipline-based repositories. Examples include:

    The Natural Environment Research Council (NERC) has an Environmental Data Service (EDS) that provides a focal point for NERC’s scientific data and information.

    Subject specific data repositories include:

    • UK Polar Data Centre, based at the British Antarctic Survey
    • British Oceanographic Data Centre (BODC)
    • Environmental Information Data Centre (EIDC)
    • National Geoscience Data Centre (NGDC)
    • UK Data Service 

    For Economic and Social Research Council (ESRC) social science qualitative and quantitative data.

    Biotechnology and Biological Sciences Research Council (BBSRC) contributes funding to a number of international bioscience data repositories and resources

    Long-established centre of excellence for Arts & Humanities Research Council (AHRC) and NERC resources

  • Geo-spatial and mapping data is available in open format from sources such as Open Street Map or the Ordnance Survey Open data products.

    data.gov.uk logo

    data.gov.uk is a searchable source of data published by central government, local authorities and public bodies. National archives, libraries, museums and heritage institutions are increasingly digitising their collections and making them freely available online.

    re3data.org logo

    re3data is a global registry of over two thousand research data repositories from a diverse range of academic disciplines, managed by DataCite. It provides information on repositories for the permanent storage and access of data sets to researchers and can be browsed by multiple categories.

    Zenodo logo

    “a catch-all repository for European Commission funded research” hosted by CERN.

    Figshare logo

    Figshare is a repository “where users can make all of their research outputs available in a citable, shareable and discoverable manner”

    Dryad logo

    • Dryad

    Dryad is “a curated resource that makes research data discoverable, freely reusable, and citable. Dryad provides a general-purpose home for a wide diversity of data types”.

    GitHub logo

    GitHub is a vast, free repository of developmental software and code

    Open Science Framework (OSF) logo

    An increasingly popular free open platform of deposited academic research. Search public projects to build on the work of others and find new collaborators.

    Mendeley Data logo

    You can use Mendeley Data to create and deposit datasets or search the harvested catalogue of 27 million datasets from domain-specific and cross-domain repositories. Provided by Elsevier and integrated with Pure.

    Data Foundry logo

    Data Foundry, data collections from the National Library of Scotland, is an excellent example of open data made available in a variety of re-useable formats.