Revisions of python-dask
Dominique Leuenberger (dimstar_suse)
accepted
request 1063413
from
Daniel Garcia (dgarcia)
(revision 60)
- Update to 2023.1.1 ## Enhancements * Add to_backend method to Array and _Frame (GH#9758) Richard (Rick) Zamora * Small fix for timestamp index divisions in pandas 2.0 (GH#9872) Irina Truong * Add numeric_only to DataFrame.cov and DataFrame.corr (GH#9787) James Bourbeau * Fixes related to group_keys default change in pandas 2.0 (GH#9855) Irina Truong * infer_datetime_format compatibility for pandas 2.0 (GH#9783) James Bourbeau ## Bug Fixes * Fix serialization bug in BroadcastJoinLayer (GH#9871) Richard (Rick) Zamora * Satisfy broadcast argument in DataFrame.merge (GH#9852) Richard (Rick) Zamora * Fix pyarrow parquet columns statistics computation (GH#9772) aywandji ## Documentation * Fix “duplicate explicit target name” docs warning (GH#9863) Chiara Marmo * Fix code formatting issue in “Defining a new collection backend” docs (GH#9864) Chiara Marmo * Update dashboard documentation for memory plot (GH#9768) Jayesh Manani * Add docs section about no-worker tasks (GH#9839) Florian Jetter ## Maintenance * Additional updates for detecting a distributed scheduler (GH#9890) James Bourbeau * Update gpuCI RAPIDS_VER to 23.04 (GH#9876) * Reverse precedence between collection and distributed default (GH#9869) Florian Jetter * Update xarray-contrib/issue-from-pytest-log to version 1.2.6 (GH#9865) James Bourbeau * Dont require dask config shuffle default (GH#9826) Florian Jetter * Un-xfail datetime64 Parquet roundtripping tests for new fastparquet (GH#9811) James Bourbeau * Add option to manually run upstream CI build (GH#9853) James Bourbeau * Use custom timeout in CI builds (GH#9844) James Bourbeau * Remove kwargs from make_blockwise_graph (GH#9838) Florian Jetter * Ignore warnings on persist call in test_setitem_extended_API_2d_mask (GH#9843) Charles Blackmon-Luca * Fix running S3 tests locally (GH#9833) James Bourbeau - Release 2023.1.0 ## Enhancements * Use distributed default clients even if no config is set (GH#9808) Florian Jetter * Implement ma.where and ma.nonzero (GH#9760) Erik Holmgren * Update zarr store creation functions (GH#9790) Ryan Abernathey * iteritems compatibility for pandas 2.0 (GH#9785) James Bourbeau * Accurate sizeof for pandas string[python] dtype (GH#9781) crusaderky * Deflate sizeof() of duplicate references to pandas object types (GH#9776) crusaderky * GroupBy.__getitem__ compatibility for pandas 2.0 (GH#9779) James Bourbeau * append compatibility for pandas 2.0 (GH#9750) James Bourbeau * get_dummies compatibility for pandas 2.0 (GH#9752) James Bourbeau * is_monotonic compatibility for pandas 2.0 (GH#9751) James Bourbeau * numpy=1.24 compatability (GH#9777) James Bourbeau ## Documentation * Remove duplicated encoding kwarg in docstring for to_json (GH#9796) Sultan Orazbayev * Mention SubprocessCluster in LocalCluster documentation (GH#9784) Hendrik Makait * Move Prometheus docs to dask/distributed (GH#9761) crusaderky ## Maintenance * Temporarily ignore RuntimeWarning in test_setitem_extended_API_2d_mask (GH#9828) James Bourbeau * Fix flaky test_threaded.py::test_interrupt (GH#9827) Hendrik Makait * Update xarray-contrib/issue-from-pytest-log in upstream report (GH#9822) James Bourbeau * pip install dask on gpuCI builds (GH#9816) Charles Blackmon-Luca * Bump actions/checkout from 3.2.0 to 3.3.0 (GH#9815) * Resolve sqlalchemy import failures in mindeps testing (GH#9809) Charles Blackmon-Luca * Ignore sqlalchemy.exc.RemovedIn20Warning (GH#9801) Thomas Grainger * xfail datetime64 Parquet roundtripping tests for pandas 2.0 (GH#9786) James Bourbeau * Remove sqlachemy 1.3 compatibility (GH#9695) McToel * Reduce size of expected DoK sparse matrix (GH#9775) Elliott Sales de Andrade * Remove executable flag from dask/dataframe/io/orc/utils.py (GH#9774) Elliott Sales de Andrade - Drop dask-pr9777-np1.24.patch
Dominique Leuenberger (dimstar_suse)
accepted
request 1037296
from
Markéta Machová (mcalabkova)
(revision 58)
Dominique Leuenberger (dimstar_suse)
accepted
request 1036964
from
Dirk Mueller (dirkmueller)
(revision 57)
Dominique Leuenberger (dimstar_suse)
accepted
request 1030989
from
Matej Cepl (mcepl)
(revision 56)
HAS TO GO TOGETHER WITH SR#1030988. - Update to version 2022.10.0 * Backend library dispatching for IO in Dask-Array and Dask-DataFrame (GH#9475) Richard (Rick) Zamora * Add new CLI that is extensible (GH#9283) Doug Davis * Groupby median (GH#9516) Ian Rose * Fix array copy not being a no-op (GH#9555) David Hoese * Add support for string timedelta in map_overlap (GH#9559) Nicolas Grandemange * Shuffle-based groupby for single functions (GH#9504) Ian Rose * Make datetime.datetime tokenize idempotantly (GH#9532) Martin Durant * Support tokenizing datetime.time (GH#9528) Tim Paine * Avoid race condition in lazy dispatch registration (GH#9545) James Bourbeau * Do not allow setitem to np.nan for int dtype (GH#9531) Doug Davis * Stable demo column projection (GH#9538) Ian Rose * Ensure pickle-able binops in delayed (GH#9540) Ian Rose * Fix project CSV columns when selecting (GH#9534) Martin Durant * Update Parquet best practice (GH#9537) Matthew Rocklin - move -all metapackage to -complete, mirroring upstream's [complete] extra. - update to version 2022.9.2: * Enhancements + Remove factorization logic from array auto chunking (:pr:`9507`) `James Bourbeau`_ * Documentation + Add docs on running Dask in a standalone Python script (:pr:`9513`) `James Bourbeau`_ + Clarify custom-graph multiprocessing example (:pr:`9511`) `nouman`_ * Maintenance + Groupby sort upstream compatibility (:pr:`9486`) `Ian Rose`_ - update to version 2022.9.1: * New Features + Add "DataFrame" and "Series" "median" methods (:pr:`9483`) `James Bourbeau`_ * Enhancements + Shuffle "groupby" default (:pr:`9453`) `Ian Rose`_ + Filter by list (:pr:`9419`) `Greg Hayes`_ + Added "distributed.utils.key_split" functionality to "dask.utils.key_split" (:pr:`9464`) `Luke Conibear`_ * Bug Fixes + Fix overlap so that "set_index" doesn't drop rows (:pr:`9423`) `Julia Signell`_ + Fix assigning pandas "Series" to column when "ddf.columns.min()" raises (:pr:`9485`) `Erik Welch`_ + Fix metadata comparison "stack_partitions" (:pr:`9481`) `James Bourbeau`_ + Provide default for "split_out" (:pr:`9493`) `Lawrence Mitchell`_ * Deprecations + Allow "split_out" to be "None", which then defaults to "1" in "groupby().aggregate()" (:pr:`9491`) `Ian Rose`_ * Documentation + Fixing "enforce_metadata" documentation, not checking for dtypes (:pr:`9474`) `Nicolas Grandemange`_ + Fix "it's" --> "its" typo (:pr:`9484`) `Nat Tabris`_ * Maintenance + Workaround for parquet writing failure using some datetime series but not others (:pr:`9500`) `Ian Rose`_ + Filter out "numeric_only" warnings from "pandas" (:pr:`9496`) `James Bourbeau`_ + Avoid "set_index(..., inplace=True)" where not necessary (:pr:`9472`) `James Bourbeau`_ + Avoid passing groupby key list of length one (:pr:`9495`) `James Bourbeau`_ + Update "test_groupby_dropna_cudf" based on "cudf" support for "group_keys" (:pr:`9482`) `James Bourbeau`_ + Remove "dd.from_bcolz" (:pr:`9479`) `James Bourbeau`_ + Added "flake8-bugbear" to "pre-commit" hooks (:pr:`9457`) `Luke Conibear`_ + Bind loop variables in function definitions ("B023") (:pr:`9461`) `Luke Conibear`_ + Added assert for comparisons ("B015") (:pr:`9459`) `Luke Conibear`_ + Set top-level default shell in CI workflows (:pr:`9469`) `James Bourbeau`_ + Removed unused loop control variables ("B007") (:pr:`9458`) `Luke Conibear`_ + Replaced "getattr" calls for constant attributes ("B009") (:pr:`9460`) `Luke Conibear`_ + Pin "libprotobuf" to allow nightly "pyarrow" in the upstream CI build (:pr:`9465`) `Joris Van den Bossche`_ + Replaced mutable data structures for default arguments ("B006") (:pr:`9462`) `Luke Conibear`_ + Changed "flake8" mirror and updated version (:pr:`9456`) `Luke Conibear`_
Dominique Leuenberger (dimstar_suse)
accepted
request 1002732
from
Dirk Mueller (dirkmueller)
(revision 55)
Dominique Leuenberger (dimstar_suse)
accepted
request 1000576
from
Steve Kowalik (StevenK)
(revision 54)
- Update to 2022.8.1: * Implement ma.*_like functions (:pr:`9378`) `Ruth Comer`_ * Fuse compatible annotations (:pr:`9402`) `Ian Rose`_ * Shuffle-based groupby aggregation for high-cardinality groups (:pr:`9302`) `Richard (Rick) Zamora`_ * Unpack namedtuple (:pr:`9361`) `Hendrik Makait`_ * Fix SeriesGroupBy cumulative functions with axis=1 (:pr:`9377`) `Pavithra Eswaramoorthy`_ * Sparse array reductions (:pr:`9342`) `Ian Rose`_ * Fix make_meta while using categorical column with index (:pr:`9348`) `Pavithra Eswaramoorthy`_ * Don't allow incompatible keywords in DataFrame.dropna (:pr:`9366`) `Naty Clementi`_ * Make set_index handle entirely empty dataframes (:pr:`8896`) `Julia Signell`_ * Improve dataclass handling in unpack_collections (:pr:`9345`) `Hendrik Makait`_ * Fix bag sampling when there are some smaller partitions (:pr:`9349`) `Ian Rose`_ * Add support for empty partitions to da.min/da.max functions (:pr:`9268`) `geraninam`_ * Use entry_points utility in sizeof (:pr:`9390`) `James Bourbeau`_ * Add entry_points compatibility utility (:pr:`9388`) `Jacob Tomlinson`_ * Upload environment file artifact for each CI build (:pr:`9372`) `James Bourbeau`_ * Remove werkzeug pin in CI (:pr:`9371`) `James Bourbeau`_ * Fix type annotations for dd.from_pandas and dd.from_delayed (:pr:`9362`) `Jordan Yap`_ * Ensure make_meta doesn't hold ref to data (:pr:`9354`) `Jim Crist-Harif`_ * Revise divisions logic in from_pandas (:pr:`9221`) `Richard (Rick) Zamora`_
Dominique Leuenberger (dimstar_suse)
accepted
request 988240
from
Dirk Mueller (dirkmueller)
(revision 53)
Dominique Leuenberger (dimstar_suse)
accepted
request 986475
from
Dirk Mueller (dirkmueller)
(revision 52)
Dominique Leuenberger (dimstar_suse)
accepted
request 950417
from
Dirk Mueller (dirkmueller)
(revision 50)
Dominique Leuenberger (dimstar_suse)
accepted
request 884570
from
Markéta Machová (mcalabkova)
(revision 44)
Dominique Leuenberger (dimstar_suse)
accepted
request 877824
from
Dirk Mueller (dirkmueller)
(revision 42)
Displaying revisions 21 - 40 of 81