Polars: py-0.20.14 Release

Release date: March 7, 2024
Previous version: py-0.20.13 (released February 29, 2024)
Magnitude: 1,470 Diff Delta
Contributors: 16 total committers

47 Commits in this Release

Top Contributors in py-0.20.14

ritchie46
orlp
stinodego
alexander-beedie
flisky
mickvangelderen
MarcoGorelli
dependabot-bot
petrosbar
deanm0000

Release Notes Published

πŸ† Highlights

  • Streaming outer joins (#14828)
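To illustrate what a streaming outer join has to do, here is a minimal sketch in plain Python. It is not Polars' implementation. It assumes the right side fits in memory as a hash table while the left side arrives in chunks; the function name `streaming_outer_join` and the `(key, value)` row layout are hypothetical.

```python
from typing import Iterable, Iterator, Optional

Row = tuple[int, str]  # hypothetical row layout: (key, value)
Joined = tuple[int, Optional[str], Optional[str]]


def streaming_outer_join(
    left_chunks: Iterable[list[Row]],
    right_rows: list[Row],
) -> Iterator[Joined]:
    """Yield (key, left_value, right_value) rows of a full outer join."""
    # Build phase: hash the in-memory (right) side by key.
    table: dict[int, list[str]] = {}
    for key, value in right_rows:
        table.setdefault(key, []).append(value)

    # Keys of the build side that matched at least one probe row.
    matched: set[int] = set()

    # Probe phase: only one left chunk is held in memory at a time.
    for chunk in left_chunks:
        for key, left_value in chunk:
            if key in table:
                matched.add(key)
                for right_value in table[key]:
                    yield (key, left_value, right_value)
            else:
                # Left row without a partner: pad the right side with None.
                yield (key, left_value, None)

    # Flush phase: emit build-side rows that never matched any probe row.
    for key, values in table.items():
        if key not in matched:
            for right_value in values:
                yield (key, None, right_value)
```

The extra bookkeeping (the `matched` set and the final flush) is what distinguishes an outer join from an inner or left join in a chunked setting: unmatched build-side rows can only be emitted once every probe chunk has been seen.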

⚠️ Deprecations

  • Deprecate overwrite_schema parameter for DataFrame.write_delta (#14879)
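A keyword deprecation like this one is commonly implemented as a small shim that warns and forwards the old flag to its replacement. The sketch below is plain Python, not Polars' code: `write_delta_sketch` is a hypothetical stand-in, and the mapping of `overwrite_schema=True` to `delta_write_options={"schema_mode": "overwrite"}` is an assumption about the documented replacement that should be verified against the `DataFrame.write_delta` reference.

```python
import warnings
from typing import Any, Optional


def write_delta_sketch(
    *,
    overwrite_schema: Optional[bool] = None,
    delta_write_options: Optional[dict[str, Any]] = None,
) -> dict[str, Any]:
    """Return the options that would be forwarded to the Delta writer."""
    # Copy so the caller's dictionary is never mutated.
    options = dict(delta_write_options or {})

    if overwrite_schema is not None:
        # The old keyword still works but warns on every use.
        warnings.warn(
            "`overwrite_schema` is deprecated; pass "
            "`delta_write_options={'schema_mode': 'overwrite'}` instead.",
            DeprecationWarning,
            stacklevel=2,
        )
        if overwrite_schema:
            # Assumed mapping; an explicit `schema_mode` from the caller wins.
            options.setdefault("schema_mode", "overwrite")

    return options
```

Calling `write_delta_sketch(overwrite_schema=True)` emits a `DeprecationWarning` and returns the forwarded options, while passing `delta_write_options` directly stays silent.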

🚀 Performance improvements

  • improve join-asof materialization (#14884)
  • Optimize chunked-id gather for binaryviews (#14878)
  • rework scalar filter kernels (#14865)
  • Reduce size of optional join-indexes (#14856)
  • Streaming outer joins (#14828)
  • Set sorted flag for cum_count on columns (#14849)

✨ Enhancements

  • Sort decimal fields (#14649)
  • Revert addition of __slots__ to Polars classes (#14857)
  • Add fastexcel to show_versions (#14869)
  • try-improve concurrency tuner (#14827)
  • Streaming outer joins (#14828)
  • Set sorted flag for cum_count on columns (#14849)
  • support use of KùzuDB via pl.read_database (#14822)
  • Ensure binview types are rle-encoded in parquet write (#14818)
  • Disable timeouts (#14809)
  • cleanup spill disks in process (#14807)
  • Implement compression and skipping for binview IPC (#14789)

🐞 Bug fixes

  • Fix DataFrame.min/max for decimals (#14890)
  • Assert chunks are equal after physical cast to prevent OOB (#14873)
  • not all cpu feature flag tests were mocked (#14864)
  • Remove custom __reduce__ implementation on DataType object (#14778)
  • Allow non-strict construction / initialization of Enum columns (#14728)
  • Fix streaming parquet limit (#14783)

📖 Documentation

  • Update contributing link (#14882)
  • update to use ambiguous instead of use_earliest (#14820)
  • Separate asof from join strategy, change parameter from strategy to how in user guide (#14793)

🛠️ Other improvements

  • Pin PyPI publish action to commit (#14896)
  • reactivate argminmax simd (#14679)
  • sort by 'idx' after outer join (#14867)
  • run and fix pext64_polyfill test (#14852)
  • add missing deprecation directive in groupby.count (#14817)
  • Fix Makefile build commands (#14806)
  • Bump ruff from 0.2.0 to 0.3.0 in /py-polars (#14800)
  • Rename utils module to _utils to explicitly mark it as private (#14772)
  • Add test coverage for _cpu_check module (#14768)

Thank you to all our contributors for making this release possible! @MarcoGorelli, @Sol-Hee, @alexander-beedie, @c-peters, @deanm0000, @dependabot, @dependabot[bot], @eitsupi, @flisky, @geekvest, @mcrumiller, @mickvangelderen, @nameexhaustion, @orlp, @petrosbar, @ritchie46 and @stinodego