Introduction to HerdingCATs

HerdingCATs is a Python library designed to speed up how data analysts explore and interact with open data sources.

Purpose

The aim of this project is simple:

PyPi package coming soon.

Once available, you can install with:

pip install HerdCats

poetry add HerdCats

uv add HerdCats

Herding-CATs is currently under active development.

Features will change as the project evolves.

HerdingCATs follows a Session → Explorer → Loader pattern:

HerdingCATs supports multiple open data sources:

CKAN - Widely used for open data catalogues
OpenDataSoft - Popular in Europe, especially for energy related data catalogues
Bespoke APIs - Including French Government open data and ONS Nomis

See the Supported Catalogues page for a complete list.

More sources are being added all the time.

If you need a data source that is not listed, please raise an issue.