You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
Hi @rouault -- you're absolutely right about your characterization of this issue. We are working to resolve it asap by fixing the extension mechanism and specification in V3. In the future state, all codecs (besides bytes) will be considered extensions. You can assume that a formal extension for zstd will exist soon.
all codecs (besides bytes) will be considered extensions
That's perhaps nice from a conceptual point of view, but from a practical one, for a reader outside of the Python ecosystem, practical interoperability might be miserable...
I believe it would benefit to the Zarr interoperability story if there was a minimum set of common core codecs that readers are encouraged to implement. And writers are made aware that using something outside of it could make datasets unreadable by some implementations.
I believe it would benefit to the Zarr interoperability story if there was a minimum set of common core codecs that readers are encouraged to implement. And writers are made aware that using something outside of it could make datasets unreadable by some implementations.
Agreed! This has been proposed before, and I think the zarr steering council is working on formalizing this.
Zarr version
v3.0.2
Numcodecs version
v0.15.0
Python Version
3.12
Operating System
Linux
Installation
pip install zarr
Description
Not really a zarr-python bug by itself, but more a bug of the zarr v3 ecosystem (ie the combination of zarr-python + https://github.com/zarr-developers/zarr-specs)
Following tutorial at https://zarr.readthedocs.io/en/stable/user-guide/groups.html#working-with-groups, I discovered that the following generates a Zarr V3 array
This codec is not documented at https://zarr-specs.readthedocs.io/en/latest/v3/codecs.html
This is a bit surprising.
Context: I'm updating the GDAL Zarr driver (https://gdal.org/en/stable/drivers/raster/zarr.html) , written in C++, and don't have access to numcodecs, and new codecs don't come "for free". So it would be nice that the defaults of zarr-python would match what is specified in the zarr v3 spec.
Steps to reproduce
Additional output
No response
The text was updated successfully, but these errors were encountered: