A high-performance library for compressed ndarrays, with a flexible computational engine

Python-Blosc2

A fast & compressed ndarray library with a flexible compute engine

Author: The Blosc development team
Contact: blosc@blosc.org
GitHub: https://github.com/Blosc/python-blosc2
Code of Conduct: Contributor Covenant

What is Python-Blosc2?

Python-Blosc2 is a high-performance compressed ndarray library with a flexible compute engine, using C-Blosc2 as its compression backend. It allows complex calculations on compressed data, whether stored in memory, on disk, or over the network (e.g., via Caterva2). It uses the C-Blosc2 simple and open format for storing compressed data.

More info: https://www.blosc.org/python-blosc2/getting_started/overview.html

Installing

Binary packages are available for the major operating systems (Windows, macOS, Linux) and platforms. Install from PyPI using pip:

pip install blosc2 --upgrade

Conda users can install from conda-forge:

conda install -c conda-forge python-blosc2
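
To confirm that the installation worked, a small check like the following can help. This is only a sketch: it assumes the installed distribution is registered under the name `blosc2` (the name used in the pip command above), and `installed_version` is a helper defined here for illustration, not part of Python-Blosc2.

```python
from importlib import metadata
from typing import Optional


def installed_version(dist_name: str) -> Optional[str]:
    """Return the installed version of a distribution, or None if it is absent."""
    try:
        return metadata.version(dist_name)
    except metadata.PackageNotFoundError:
        return None


# "blosc2" is assumed to be the distribution name in the current environment.
version = installed_version("blosc2")
print(f"blosc2 {version}" if version else "blosc2 is not installed")
```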

Windows note

When building from source on Windows, clang-cl is required (OpenZL depends on C11 support). Make sure LLVM is on PATH and use the Ninja generator, for example:

CMAKE_GENERATOR=Ninja
CC=clang-cl
CXX=clang-cl
pip install -e .
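
Before starting a source build, it may be worth checking that the toolchain is actually reachable. The sketch below looks up the tools on PATH; the executable names `clang-cl` and `ninja` are assumptions based on the note above, and `missing_tools` is a hypothetical helper, not something shipped with the project.

```python
import shutil

# Executables the Windows source build is assumed to need on PATH.
REQUIRED_TOOLS = ("clang-cl", "ninja")


def missing_tools(tools=REQUIRED_TOOLS):
    """Return the tools from `tools` that cannot be found on PATH."""
    return [tool for tool in tools if shutil.which(tool) is None]


missing = missing_tools()
if missing:
    print("Not found on PATH:", ", ".join(missing))
else:
    print("clang-cl and ninja are available; the build environment looks ready.")
```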

Documentation

The documentation is available here:

https://blosc.org/python-blosc2/python-blosc2.html

You can find examples at:

https://github.com/Blosc/python-blosc2/tree/main/examples

A tutorial from PyData Global 2025 is available at:

https://github.com/Blosc/PyData-Global-2025-Tutorial

(A video recording of the tutorial is also available.)

The tutorial repository contains Jupyter notebooks explaining the main features of Python-Blosc2.

License

This software is licensed under a 3-Clause BSD license. A copy of the python-blosc2 license can be found in LICENSE.txt.

Discussion forum

Discussion about this package is welcome at:

https://github.com/Blosc/python-blosc2/discussions

Social feeds

Stay informed about the latest developments by following us on Mastodon, Bluesky, or LinkedIn.

Thanks

Blosc2 is supported by the NumFOCUS foundation, the LEAPS-INNOV project and ironArray SLU, among many other donors. This support has allowed the following people to contribute in important ways to the core development of the Blosc2 library:

  • Francesc Alted
  • Marta Iborra
  • Luke Shaw
  • Aleix Alcacer
  • Oscar Guiñón
  • Juan David Ibáñez
  • Ivan Vilata i Balaguer
  • Oumaima Ech.Chdig
  • Ricardo Sales Piquer

In addition, other people have contributed to the project in different ways:

  • Jan Sellner contributed the mmap support for NDArray/SChunk objects.
  • Dimitri Papadopoulos contributed a large number of improvements to many aspects of the project. His attention to detail is remarkable.
  • Many others have contributed bug reports, suggestions, and improvements.

Developed using JetBrains IDEs.


Citing Blosc

You can cite our work on the various libraries under the Blosc umbrella as follows:

@ONLINE{blosc,
  author = {{Blosc Development Team}},
  title = "{A fast, compressed and persistent data store library}",
  year = {2009-2025},
  note = {https://blosc.org}
}

Support Blosc for a Sustainable Future

If you find Blosc useful and want to support its development, please consider making a donation or offering a contract to the Blosc Development Team. Thank you!

Compress Better, Compute Bigger
