Revisions of python-fastparquet
buildservice-autocommit
accepted
request 1225996
from
Dirk Mueller (dirkmueller)
(revision 64)
baserev update by copy to link target
Dirk Mueller (dirkmueller)
committed
(revision 63)
- update to 2024.11.0:
  * feat: support for writing to buffers
  * fix(_dtypes): np.float_ was deprecated
  * update for py3.13
buildservice-autocommit
accepted
request 1179165
from
Markéta Machová (mcalabkova)
(revision 62)
baserev update by copy to link target
Markéta Machová (mcalabkova)
accepted
request 1179058
from
Benjamin Greiner (bnavigator)
(revision 61)
- Update to 2024.5.0
  * Allow zoneinfo objects (#916)
  * Use np.int64 type for day to nanosecond conversion (NEP50) (#922)
buildservice-autocommit
accepted
request 1154995
from
Dirk Mueller (dirkmueller)
(revision 60)
baserev update by copy to link target
Dirk Mueller (dirkmueller)
accepted
request 1154922
from
Benjamin Greiner (bnavigator)
(revision 59)
- Update to 2024.2.0
  * allow loading categoricals even if not so in the pandas metadata, when a column is dict-encoded and we only have one row-group (#863)
  * apply dtype to the columns names series, even when selecting no columns (#861, 859)
  * don’t make strings while estimating byte column size (#858)
  * handle upstream depr (#857, 856)
buildservice-autocommit
accepted
request 1144842
from
Daniel Garcia (dgarcia)
(revision 58)
baserev update by copy to link target
Daniel Garcia (dgarcia)
committed
(revision 57)
- Do not run tests in s390x, bsc#1218603
buildservice-autocommit
accepted
request 1130976
from
Dirk Mueller (dirkmueller)
(revision 56)
baserev update by copy to link target
Dirk Mueller (dirkmueller)
committed
(revision 55)
Dirk Mueller (dirkmueller)
committed
(revision 54)
- update to 2023.10.0:
  * Datetime units in empty() with tz (#893)
  * Fewer inplace decompressions for V2 pages (#890)
  * Allow writing categorical column with no categories (#888)
  * Fixes for new numpy (#886)
  * RLE bools and DELTA for v1 pages (#885, 883)
buildservice-autocommit
accepted
request 1130498
from
Dirk Mueller (dirkmueller)
(revision 53)
baserev update by copy to link target
Dirk Mueller (dirkmueller)
committed
(revision 52)
- update to 2023.8.0:
  * More general timestamp units (#874)
  * ReadTheDocs V2 (#871)
  * Better roundtrip dtypes (#861, 859)
  * No convert when computing bytes-per-item for str (#858)
- Add patch to fix the test test_delta_from_def_2 on
  * row-level filtering of the data. Whereas previously, only full row-groups could be excluded on the basis of their parquet metadata statistics (if present), filtering can now be done within row-groups too. The syntax is the same as before, allowing for multiple column expressions to be combined with AND|OR, depending on the list structure. This mechanism requires two passes: one to load the columns needed to create the boolean mask, and another to load the columns actually needed in the output. This will not be faster, and may be slower, but in some cases can save significant memory footprint, if a small fraction of rows are considered good and the columns for the filter expression are not in the output.
  * DELTA integer encoding (read-only): experimentally working, but we only have one test file to verify against, since it is not trivial to persuade Spark to produce files encoded this way. DELTA can be extremely compact a representation for
  * nanosecond resolution times: the new extended "logical" types system supports nanoseconds alongside the previous millis and micros. We now emit these for the default pandas time type, and produce full parquet schema including both "converted" and "logical" type information. Note that all output has isAdjustedToUTC=True, i.e., these are timestamps rather than local time. The time-zone is stored in the metadata, as before, and will be successfully recreated only in fastparquet
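The two-pass row-level filtering described in the entry above can be illustrated with a small sketch. This is not fastparquet's implementation: it models a row group as a plain dict of column lists, and the helper names (`row_mask`, `read_filtered`) are hypothetical. It also assumes the common convention that a flat list of `(column, op, value)` tuples is combined with AND, while a list of such lists is combined with OR.

```python
import operator

# Comparison operators supported by this toy filter syntax (an assumption
# made for the sketch, not a statement about fastparquet's full syntax).
OPS = {
    "==": operator.eq, "!=": operator.ne,
    "<": operator.lt, "<=": operator.le,
    ">": operator.gt, ">=": operator.ge,
}


def row_mask(columns, filters):
    """Pass 1: load only the filter columns and build a boolean row mask."""
    n_rows = len(next(iter(columns.values())))
    if not filters:
        return [True] * n_rows
    # A flat list of tuples is a single AND-group; a list of lists is an
    # OR over several AND-groups.
    groups = [filters] if isinstance(filters[0], tuple) else filters
    needed = {name for group in groups for name, _, _ in group}
    loaded = {name: columns[name] for name in needed}
    return [
        any(
            all(OPS[op](loaded[name][i], value) for name, op, value in group)
            for group in groups
        )
        for i in range(n_rows)
    ]


def read_filtered(columns, wanted, filters):
    """Pass 2: load only the output columns and keep the masked rows."""
    mask = row_mask(columns, filters)
    return {
        name: [v for v, keep in zip(columns[name], mask) if keep]
        for name in wanted
    }


# Toy "row group": the filter column (score) is not part of the output.
row_group = {
    "id": [1, 2, 3, 4],
    "score": [0.2, 0.9, 0.5, 0.95],
    "label": ["a", "b", "c", "d"],
}
print(read_filtered(row_group, ["label"], [("score", ">", 0.8)]))
```

The sketch shows why the approach can save memory without saving time: the filter column is read in full during the first pass, but only the surviving rows of the output columns are materialised in the second.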
buildservice-autocommit
accepted
request 1096412
from
Markéta Machová (mcalabkova)
(revision 51)
baserev update by copy to link target
Markéta Machová (mcalabkova)
accepted
request 1096315
from
Arun Persaud (apersaud)
(revision 50)
update to latest version
buildservice-autocommit
accepted
request 1083474
from
Dirk Mueller (dirkmueller)
(revision 49)
baserev update by copy to link target
Dirk Mueller (dirkmueller)
committed
(revision 48)
- update to 2023.4.0:
  * allow loading categoricals even if not so in the pandas metadata, when a column is dict-encoded and we only have one row-group (#863)
  * apply dtype to the columns names series, even when selecting no columns (#861, 859)
  * don't make strings while estimating byte column size (#858)
  * handle upstream depr (#857, 856)
buildservice-autocommit
accepted
request 1064743
from
Matej Cepl (mcepl)
(revision 47)
baserev update by copy to link target
Matej Cepl (mcepl)
accepted
request 1064736
from
Arun Persaud (apersaud)
(revision 46)
update to latest version
buildservice-autocommit
accepted
request 1046424
from
Dirk Mueller (dirkmueller)
(revision 45)
baserev update by copy to link target
Displaying revisions 1 - 20 of 64