Decode predictor=2 on big-endian TIFFs by swapping to native order by brendancol · Pull Request #1507 · xarray-contrib/xarray-spatial

brendancol · 2026-05-07T14:12:53Z

Summary

PR #1498 reworked predictor=2 (horizontal differencing) to run sample-wise via a numpy view in the file's byte order. Numba's nopython mode rejects arrays with a non-native byte order, so reading a big-endian TIFF with uint16/uint32/uint64 + predictor=2 raised TypingError: Unsupported array dtype: >u2 instead of returning the pixel data.

predictor_decode and predictor_encode now byte-swap the buffer in place around the kernel call when the file byte order differs from native, then swap back so the on-disk representation stays intact for the chunk.view(file_dtype) step in _decode_strip_or_tile. uint8 is unaffected since the single-byte path skips the view.

Test plan

New test_predictor2_big_endian.py covers uint16/int16/uint32/int32 round-trips through tifffile-built BE predictor=2 deflate files.
Little-endian predictor=2 still round-trips (sanity).
uint8 BE+pred2 still works (single-byte byte-wise path).
Full geotiff test suite passes (552 passed; pre-existing matplotlib test_features recursion issue is unrelated).

PR #1498 reworked predictor=2 to run sample-wise via a numpy view at the file's byte order. Numba's nopython mode rejects arrays with a non-native byte order, so reading a big-endian TIFF with uint16/uint32/uint64 + predictor=2 raised "Unsupported array dtype: >u2" instead of returning the pixel data. predictor_decode and predictor_encode now byteswap the buffer in place around the kernel call for files whose byte order differs from native. Bytes on the way out stay in the file's order so the downstream chunk.view(file_dtype) step in _decode_strip_or_tile keeps working. uint8 is unaffected (single-byte path skips the view). Tests cover uint16/int16/uint32/int32 round-trips through tifffile-built big-endian predictor=2 files plus a little-endian sanity check.

…yteswap The earlier implementation, ``arr.view(arr.dtype.newbyteorder()).copy()``, left the result tagged with non-native byteorder (``>u2`` instead of ``<u2``). That's values-equivalent for arithmetic but breaks downstream consumers that expect native dtypes -- numba ``@ngjit`` rejects non-native arrays, which is the same class of bug PR xarray-contrib#1507 fixed for predictor=2 BE. The new implementation reverses bytes through a uint8 view: u8 = arr.view('u1').reshape(*arr.shape, arr.itemsize) return u8[..., ::-1].copy().view(arr.dtype).reshape(arr.shape) Result preserves ``arr.dtype`` and is native-endian, matching numpy's ``ndarray.byteswap()`` contract. 1-byte dtypes short-circuit to a no-op return. Tests now assert ``gpu_da.data.dtype.isnative`` and equality against the input native dtype, plus two pure-numpy tests of the helper itself. The module-level ``pytest.skip`` for missing CUDA was widened to cover both helper tests too; pulled it apart so the helper tests run without a GPU and only the GPU end-to-end tests gate on cupy+CUDA+tifffile.

* Fix read_geotiff_gpu byteswap on big-endian multi-byte TIFFs cupy.ndarray (13.x) does not expose .byteswap(), so any BE multi-byte TIFF hit AttributeError inside the GPU decode pipeline. The dispatcher in read_geotiff_gpu caught it and silently fell back to CPU, so output stayed correct but the GPU path was effectively dead for BE data. Replace both arr.byteswap() calls with a small helper that views the array as the swapped-order dtype and copies, which works on numpy and cupy arrays alike. Closes #1508 * Address PR #1515 review: preserve native dtype in _xp_byteswap The earlier implementation, ``arr.view(arr.dtype.newbyteorder()).copy()``, left the result tagged with non-native byteorder (``>u2`` instead of ``<u2``). That's values-equivalent for arithmetic but breaks downstream consumers that expect native dtypes -- numba ``@ngjit`` rejects non-native arrays, which is the same class of bug PR #1507 fixed for predictor=2 BE. The new implementation reverses bytes through a uint8 view: u8 = arr.view('u1').reshape(*arr.shape, arr.itemsize) return u8[..., ::-1].copy().view(arr.dtype).reshape(arr.shape) Result preserves ``arr.dtype`` and is native-endian, matching numpy's ``ndarray.byteswap()`` contract. 1-byte dtypes short-circuit to a no-op return. Tests now assert ``gpu_da.data.dtype.isnative`` and equality against the input native dtype, plus two pure-numpy tests of the helper itself. The module-level ``pytest.skip`` for missing CUDA was widened to cover both helper tests too; pulled it apart so the helper tests run without a GPU and only the GPU end-to-end tests gate on cupy+CUDA+tifffile.

github-actions Bot added the performance PR touches performance-sensitive code label May 7, 2026

brendancol mentioned this pull request May 7, 2026

GPU read of big-endian multi-byte TIFFs crashes on cupy.ndarray.byteswap() #1508

Closed

brendancol merged commit e387c29 into main May 7, 2026
11 checks passed

brendancol mentioned this pull request May 8, 2026

GPU predictor=2 produces wrong values on big-endian TIFFs #1517

Closed

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Decode predictor=2 on big-endian TIFFs by swapping to native order#1507

Decode predictor=2 on big-endian TIFFs by swapping to native order#1507
brendancol merged 1 commit intomainfrom
fix-predictor2-be-numba

brendancol commented May 7, 2026

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

1 participant

Conversation

brendancol commented May 7, 2026

Summary

Test plan

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

1 participant