Polars: py-0.20.5 Release

Release date:
January 24, 2024
Previous version:
py-0.20.4 (released January 24, 2024)
Magnitude:
8,906 Diff Delta
Contributors:
30 total committers
Data confidence:
Commits:

158 Commits in this Release

Ordered by the degree to which they evolved the repo in this version.

Authored January 20, 2024
Authored January 15, 2024
Authored January 17, 2024

Top Contributors in py-0.20.5

stinodego
r-brink
ritchie46
reswqa
alexander-beedie
Wainberg
petrosbar
c-peters
MarcoGorelli
taki-mekhalfa

Directory Browser for py-0.20.5

We haven't yet finished calculating and confirming the files and directories changed in this release. Please check back soon.

Release Notes Published

⚠️ Deprecations

  • Deprecate default delimiter value for str.concat (#13690)
  • Rename pl.count() to pl.len() (#13719)
  • Deprecate dt.with_time_unit in favor of cast(pl.Int64).cast(pl.Datetime(time_unit, time_zone)) (#13667)

πŸš€ Performance improvements

  • directly embed data ptr in Buffer (#13744)

✨ Enhancements

  • Impl count_matches for array namespace (#13675)
  • Add nulls_last for list/array.sort (#13795)
  • convert fixed-offset timezones to respective Etc timezone from time zone database (#13738)
  • allow read_excel to load from remote http locations (#13753)
  • Expressify str.slice (#13747)
  • implement binview for polars-row (#13736)
  • implement binview for polars-json (#13737)
  • add architecture for polars-flavored IPC (#13734)
  • implement binview comparison kernels (#13715)
  • raise default frame/series repr height from 8 to 10 (#13699)

🐞 Bug fixes

  • do not read data for zero-length compressed buffer (#13791)
  • Fix the non-null test of transpose (#13783)
  • Raise error instead of panic when joining on wildcard/nth (#13742)
  • str.concat correctly ignore single null value (#13751)
  • Selectors by_name and by_dtype should allow empty list as input (#11024)
  • Keep Series attributes docstrings when read by Sphinx (#13731)
  • fix error message when creating DataFrame from 0-dimensional NumPy array (#13729)
  • support corr() for single-column DataFrames (#13728)
  • Use NonZeroUsize for batch_size parameter in write_csv/sink_csv/scan_ndjson (#13726)
  • error instead of panicking in sql if empty function (#13691)

πŸ“– Documentation

  • Fix typo in deprecation message of with_row_count (#13793)
  • Fix incorrect "coming from pandas" syntax (#13767)
  • Improve streaming section of the user guide (#13750)
  • improve n_unique and approx_n_unique docs (#13752)
  • add missing Series.str.find reference (#13717)
  • Be more explicit about behaviour in str.strip_chars / strip_chars_start / strip_chars_end docstrings (#13697)
  • Add doc example for datetime_ranges (#13695)
  • document %A and %B to get day name and month name (#13678)

πŸ› οΈ Other improvements

  • Make pl.duration non-anonymous (#13762)
  • Add test for describe on Object types (#13689)
  • Only run bytecode parser CI workflow for Python 3.9/3.10 (#13664)

Thank you to all our contributors for making this release possible! @29antonioac, @MarcoGorelli, @NedJWestern, @Wainberg, @alexander-beedie, @cgevans, @henryharbeck, @langestefan, @orlp, @petrosbar, @r-brink, @reswqa, @ritchie46, @stinodego and @universalmind303