GitHub / apache/datafusion-python / commits
Apache DataFusion Python Bindings
| SHA | Message | Author | Date | Stats |
|---|---|---|---|---|
| d7e137ee | Enable remaining pylints (#1298) |
Tim Saucer <t****r@g****m>
Committed by: GitHub <n****y@g****m> |
4 months ago | |
| 51dc78a6 |
chore: update datafusion minor version (#1297)
Co-authored-by: Kevin Liu <k****u@u****m> |
Tim Saucer <t****r@g****m>
Committed by: GitHub <n****y@g****m> |
4 months ago | |
| a605b619 | Update python minimum version to 3.10 (#1296) |
Tim Saucer <t****r@g****m>
Committed by: GitHub <n****y@g****m> |
4 months ago | |
| 694a5d8d |
feat: Add SQL expression support for `with_columns` (#1286)
Co-authored-by: Hendrik Makait <h****k@m****m> |
Marko Milenković <m****m@u****m>
Committed by: GitHub <n****y@g****m> |
4 months ago | |
| d9c90d27 | feat: Add SQL expression for `repartition_by_hash` (#1285) |
Marko Milenković <m****m@u****m>
Committed by: GitHub <n****y@g****m> |
5 months ago | |
| a93d614a | with_column supports SQL expression (#1284) |
Marko Milenković <m****m@u****m>
Committed by: GitHub <n****y@g****m> |
5 months ago | |
| 29bcb0f9 | Release 50.1 (#1281) |
Tim Saucer <t****r@g****m>
Committed by: GitHub <n****y@g****m> |
5 months ago | |
| 60123493 | feat: support session token parameter for AmazonS3 (#1275) |
GCHQDeveloper028 <5****8@u****m>
Committed by: GitHub <n****y@g****m> |
5 months ago | |
| 030873bb | Add temporary view option for into_view (#1267) |
Tim Saucer <t****r@g****m>
Committed by: GitHub <n****y@g****m> |
5 months ago | |
| fe0cf8cd | feat: allow DataFrame.filter to accept SQL strings (#1276) |
𝕂 <5****h@u****m>
Committed by: GitHub <n****y@g****m> |
5 months ago | |
| 6b162850 |
feat: expose `select_exprs` method on DataFrame (#1271)
Co-authored-by: Tim Saucer <t****r@g****m> |
Marko Milenković <m****m@u****m>
Committed by: GitHub <n****y@g****m> |
5 months ago | |
| 08901d53 |
feat: expose `DataFrame.parse_sql_expr` (#1274)
Co-authored-by: Tim Saucer <t****r@g****m> |
Marko Milenković <m****m@u****m>
Committed by: GitHub <n****y@g****m> |
5 months ago | |
| 16d4c038 | chore: update dependencies (#1269) |
Tim Saucer <t****r@g****m>
Committed by: GitHub <n****y@g****m> |
5 months ago | |
| f5fdf596 |
feat: expose DataFrame.write_table (#1264)
Co-authored-by: Copilot <1****t@u****m> |
Tim Saucer <t****r@g****m>
Committed by: GitHub <n****y@g****m> |
5 months ago | |
| 6f3b1cab |
Unify Table representations (#1256)
Co-authored-by: Siew Kam Onn <k****w@g****m> |
Tim Saucer <t****r@g****m>
Committed by: GitHub <n****y@g****m> |
5 months ago | |
| af1c07b7 |
Freeze PyO3 wrappers & introduce interior mutability to avoid PyO3 borrow err...
Co-authored-by: ntjohnson1 <2****1@u****m>, Tim Saucer <t****r@g****m> |
kosiew <k****w@g****m>
Committed by: GitHub <n****y@g****m> |
5 months ago | |
| e75addfa | macos-13 is deprecated (#1259) |
Kevin Liu <k****u@u****m>
Committed by: GitHub <n****y@g****m> |
5 months ago | |
| 709c918e | REVERT CHANGES to commit | Siew Kam Onn <k****w@g****m> | 5 months ago | |
| 0154c031 | CHANGES to review | Siew Kam Onn <k****w@g****m> | 5 months ago | |
| f08d5b04 | Make Session Context `pyclass` frozen so interior mutability is only managed ... |
Nick <2****1@u****m>
Committed by: GitHub <n****y@g****m> |
6 months ago | |
| 0ec01020 |
Fix drop() method to handle quoted column names consistently (#1242)
Co-authored-by: Tim Saucer <t****r@g****m> |
H0TB0X420 <2****0@u****m>
Committed by: GitHub <n****y@g****m> |
6 months ago | |
| 5f8d500a | Update version number, add changelog (#1249) |
Tim Saucer <t****r@g****m>
Committed by: GitHub <n****y@g****m> |
6 months ago | |
| bf22c1d4 | Prepare for DF50 (#1231) |
Tim Saucer <t****r@g****m>
Committed by: GitHub <n****y@g****m> |
6 months ago | |
| b7d3519d | docs: add apache iceberg as datafusion data source (#1240) |
Kevin Liu <k****u@u****m>
Committed by: GitHub <n****y@g****m> |
6 months ago | |
| d54dc4ae | Support string column identifiers for sort/aggregate/window and stricter Expr... |
kosiew <k****w@g****m>
Committed by: GitHub <n****y@g****m> |
6 months ago | |
| 9e976362 | chore: remove redundant error transformation (#1232) |
Daniel Mesejo <m****n@g****m>
Committed by: GitHub <n****y@g****m> |
6 months ago | |
| c609dfa3 | feat: allow passing a slice to an expression with the [] indexing (#1215) |
Tim Saucer <t****r@g****m>
Committed by: GitHub <n****y@g****m> |
6 months ago | |
| b325a385 | fix: update link to user example for custom table provider (#1224) |
Tyler White <5****k@u****m>
Committed by: GitHub <n****y@g****m> |
6 months ago | |
| 332c69b6 | docs: fix CaseBuilder documentation example (#1225) |
Tyler White <5****k@u****m>
Committed by: GitHub <n****y@g****m> |
6 months ago | |
| f0bbad75 | Set fail on warning for documentation generation (#1218) |
Tim Saucer <t****r@g****m>
Committed by: GitHub <n****y@g****m> |
6 months ago | |
| 200a249a | Fixing a few Typos (#1220) |
Nick <2****1@u****m>
Committed by: GitHub <n****y@g****m> |
6 months ago | |
| 61f981bc | Add benchmark script and documentation for maximizing CPU usage in DataFusion... |
kosiew <k****w@g****m>
Committed by: GitHub <n****y@g****m> |
6 months ago | |
| b76cd8fd | Update development guide in README.md (#1213) |
Koustubh Rao <8****o@u****m>
Committed by: GitHub <n****y@g****m> |
7 months ago | |
| c41c7f09 | 49.0.0 release (#1211) |
Tim Saucer <t****r@g****m>
Committed by: GitHub <n****y@g****m> |
7 months ago | |
| d6d6ea69 |
build(deps): bump actions/checkout from 4 to 5 (#1204)
Co-authored-by: dependabot[bot] <4****]@u****m> Signed-off-by: dependabot[bot] <s****t@g****m> |
dependabot[bot] <4****]@u****m>
Committed by: GitHub <n****y@g****m> |
7 months ago | |
| cb29176f |
build(deps): bump url from 2.5.4 to 2.5.7 (#1210)
Co-authored-by: dependabot[bot] <4****]@u****m> Signed-off-by: dependabot[bot] <s****t@g****m> |
dependabot[bot] <4****]@u****m>
Committed by: GitHub <n****y@g****m> |
7 months ago | |
| b5536435 |
build(deps): bump actions/download-artifact from 4 to 5 (#1201)
Co-authored-by: dependabot[bot] <4****]@u****m> Signed-off-by: dependabot[bot] <s****t@g****m> |
dependabot[bot] <4****]@u****m>
Committed by: GitHub <n****y@g****m> |
7 months ago | |
| 0806ce52 | fix(build): Include build.rs in published crates (#1199) |
Colin Marc <h****i@c****m>
Committed by: GitHub <n****y@g****m> |
7 months ago | |
| 1ba8807c | Improve `show()` output for empty DataFrames (#1208) |
kosiew <k****w@g****m>
Committed by: GitHub <n****y@g****m> |
7 months ago | |
| e28256cb | suggestions (#1183) |
Dewey Dunnington <d****y@d****a>
Committed by: GitHub <n****y@g****m> |
7 months ago | |
| 58925853 | Allow users to pass a single expression instead of a list of expressions for ... |
Tim Saucer <t****r@g****m>
Committed by: GitHub <n****y@g****m> |
7 months ago | |
| f9479412 |
build(deps): bump slab from 0.4.10 to 0.4.11 (#1205)
Co-authored-by: dependabot[bot] <4****]@u****m> Signed-off-by: dependabot[bot] <s****t@g****m> |
dependabot[bot] <4****]@u****m>
Committed by: GitHub <n****y@g****m> |
7 months ago | |
| 1a6260fc |
build(deps): bump async-trait from 0.1.88 to 0.1.89 (#1203)
Co-authored-by: dependabot[bot] <4****]@u****m> Signed-off-by: dependabot[bot] <s****t@g****m> |
dependabot[bot] <4****]@u****m>
Committed by: GitHub <n****y@g****m> |
7 months ago | |
| 4472c6cf |
build(deps): bump uuid from 1.17.0 to 1.18.0 (#1202)
Co-authored-by: dependabot[bot] <4****]@u****m> Signed-off-by: dependabot[bot] <s****t@g****m> |
dependabot[bot] <4****]@u****m>
Committed by: GitHub <n****y@g****m> |
7 months ago | |
| f0af0956 |
build(deps): bump object_store from 0.12.2 to 0.12.3 (#1189)
Co-authored-by: dependabot[bot] <4****]@u****m> Signed-off-by: dependabot[bot] <s****t@g****m> |
dependabot[bot] <4****]@u****m>
Committed by: GitHub <n****y@g****m> |
7 months ago | |
| c8ddb87b | Update how to dev instructions (#1179) |
Nick <2****1@u****m>
Committed by: GitHub <n****y@g****m> |
7 months ago | |
| fd845561 | chore: Upgrade datafusion to version 49 (#1200) |
Nuno Faria <n****a@g****m>
Committed by: GitHub <n****y@g****m> |
7 months ago | |
| be888377 | Fix Python UDAF Accumulator Interface example to Properly Handle State and Up... |
kosiew <k****w@g****m>
Committed by: GitHub <n****y@g****m> |
8 months ago | |
| 94687cd6 | Change merge rule to require branches up to date but do not require a committ... |
Tim Saucer <t****r@g****m>
Committed by: GitHub <n****y@g****m> |
8 months ago | |
| cbe845b1 | Prepare for 48 release (#1175) |
Tim Saucer <t****r@g****m>
Committed by: GitHub <n****y@g****m> |
8 months ago | |
| 2e1b7136 | refactor: style loading logic in DataFrameHtmlFormatter (#1177) |
kosiew <k****w@g****m>
Committed by: GitHub <n****y@g****m> |
8 months ago | |
| 98f4773b | feat: add FFI support for user defined functions (#1145) |
Tim Saucer <t****r@g****m>
Committed by: GitHub <n****y@g****m> |
8 months ago | |
| 9362f531 |
feat: python based catalog and schema provider (#1156)
Co-authored-by: renato2099 <r****n@g****m> |
Tim Saucer <t****r@g****m>
Committed by: GitHub <n****y@g****m> |
8 months ago | |
| 95456342 | feat: collect once during display() in jupyter notebooks (#1167) |
Tim Saucer <t****r@g****m>
Committed by: GitHub <n****y@g****m> |
9 months ago | |
| 03921d96 | feat: dataframe string formatter (#1170) |
Tim Saucer <t****r@g****m>
Committed by: GitHub <n****y@g****m> |
9 months ago | |
| ef62fa89 | Add compression_level support to ParquetWriterOptions and enhance write_parqu... |
kosiew <k****w@g****m>
Committed by: GitHub <n****y@g****m> |
9 months ago | |
| 0d3c37f9 |
Consolidate DataFrame Docs: Merge HTML Rendering Section as Subpage (#1161)
Co-authored-by: Tim Saucer <t****r@g****m> |
kosiew <k****w@g****m>
Committed by: GitHub <n****y@g****m> |
9 months ago | |
| 1812a0d3 | Fix signature of `__arrow_c_stream__` (#1168) |
Kyle Barron <k****e@d****g>
Committed by: GitHub <n****y@g****m> |
9 months ago | |
| 98dc06b5 | feat: Support Parquet writer options (#1123) |
Nuno Faria <n****a@g****m>
Committed by: GitHub <n****y@g****m> |
9 months ago | |
| 9b6acec0 |
Support types other than String and Int for partition columns (#1154)
Co-authored-by: michele gregori <m****s@g****m> |
Michele Gregori <m****r@g****m>
Committed by: GitHub <n****y@g****m> |
9 months ago | |
| 13910780 | feat: upgrade df48 dependency (#1143) |
Tim Saucer <t****r@g****m>
Committed by: GitHub <n****y@g****m> |
9 months ago | |
| dc0d35a2 | Add Interruptible Query Execution in Jupyter via KeyboardInterrupt Support (#... |
kosiew <k****w@g****m>
Committed by: GitHub <n****y@g****m> |
9 months ago | |
| d6ef9bcb | Add DataFrame API Documentation for DataFusion Python (#1132) |
kosiew <k****w@g****m>
Committed by: GitHub <n****y@g****m> |
9 months ago | |
| 24f0b1a1 | Add a documentation build step in CI (#1139) |
Crystal Zhou <4****z@u****m>
Committed by: GitHub <n****y@g****m> |
9 months ago | |
| 0cc9b0a5 | Release DataFusion 47.0.0 (#1130) |
Tim Saucer <t****r@g****m>
Committed by: GitHub <n****y@g****m> |
10 months ago | |
| 278a33ef | feat: add user defined table function support (#1113) |
Tim Saucer <t****r@g****m>
Committed by: GitHub <n****y@g****m> |
10 months ago | |
| 1e7494ba | add unit tests for expression functions (#1121) |
Tim Saucer <t****r@g****m>
Committed by: GitHub <n****y@g****m> |
10 months ago | |
| e8aa671b | Move the udf module to user_defined (#1112) |
Tim Saucer <t****r@g****m>
Committed by: GitHub <n****y@g****m> |
10 months ago | |
| 5a62c4b7 | With the update to rustc 1.87 we now fail clippy due to the size of the error... |
Tim Saucer <t****r@g****m>
Committed by: GitHub <n****y@g****m> |
10 months ago | |
| f3c98ec7 | Add fill_null method to DataFrame API for handling missing values (#1019) |
kosiew <k****w@g****m>
Committed by: GitHub <n****y@g****m> |
10 months ago | |
| 7d8bcd8d |
Partial fix for 1078: Enhance DataFrame Formatter Configuration with Memory a...
Co-authored-by: Tim Saucer <t****r@g****m> |
kosiew <k****w@g****m>
Committed by: GitHub <n****y@g****m> |
10 months ago | |
| 15b96c48 |
feat: add missing PyLogicalPlan to_variant (#1085)
Co-authored-by: Tim Saucer <t****r@g****m> |
Chen Chongchen <c****y@q****m>
Committed by: GitHub <n****y@g****m> |
10 months ago | |
| 6fbeceff | Copy over protected branch rule from datafusion repo (#1122) |
Tim Saucer <t****r@g****m>
Committed by: GitHub <n****y@g****m> |
10 months ago | |
| 10600fb8 | fix: recursive import (#1117) |
Chen Chongchen <c****y@q****m>
Committed by: GitHub <n****y@g****m> |
11 months ago | |
| 5a7f6382 | Add expression chaining of single parameter scalar functions |
deanm0000 <3****0@u****m>
Committed by: GitHub <n****y@g****m> |
11 months ago | |
| 00dea113 |
Improve col class access using __getattr__
Co-authored-by: Tim Saucer <t****r@g****m> |
deanm0000 <3****0@u****m>
Committed by: GitHub <n****y@g****m> |
11 months ago | |
| 91b66351 |
Add DataFrame usage guide with HTML rendering customization options (#1108)
Co-authored-by: Tim Saucer <t****r@g****m> |
kosiew <k****w@g****m>
Committed by: GitHub <n****y@g****m> |
11 months ago | |
| c9f15547 | feat: alias with metadata (#1111) |
Chen Chongchen <c****y@q****m>
Committed by: GitHub <n****y@g****m> |
11 months ago | |
| d0d14f6e | feat: update datafusion dependency 47 (#1107) |
Tim Saucer <t****r@g****m>
Committed by: GitHub <n****y@g****m> |
11 months ago | |
| 818975b5 | Add Configurable HTML Table Formatter for DataFusion DataFrames in Python (#1... |
kosiew <k****w@g****m>
Committed by: GitHub <n****y@g****m> |
11 months ago | |
| 09b929a6 |
Documentation updates: mention correct dataset on basics page (#1081)
Co-authored-by: Kevin Liu <k****u@u****m> |
Florian Schäfer <3****a@u****m>
Committed by: GitHub <n****y@g****m> |
12 months ago | |
| ffafb59e | feat: support unparser (#1088) |
Chen Chongchen <c****y@q****m>
Committed by: GitHub <n****y@g****m> |
12 months ago | |
| 583e1e94 | Update changelog and version number (#1089) |
Tim Saucer <t****r@g****m>
Committed by: GitHub <n****y@g****m> |
12 months ago | |
| d0315ffa | feat: Update DataFusion dependency to 46 (#1079) |
Tim Saucer <t****r@g****m>
Committed by: GitHub <n****y@g****m> |
12 months ago | |
| 42982dad | Improve collection during repr and repr_html (#1036) |
Tim Saucer <t****r@g****m>
Committed by: GitHub <n****y@g****m> |
12 months ago | |
| b8dd97bc | Add additional ruff suggestions (#1062) |
Spaarsh <6****h@u****m>
Committed by: GitHub <n****y@g****m> |
12 months ago | |
| 7c1c08f8 |
feat: expose regex_count function (#1066)
Co-authored-by: Nirnay Roy <n****2@g****m> |
Nirnay Roy <3****y@u****m>
Committed by: GitHub <n****y@g****m> |
12 months ago | |
| 2f52688d | Add decorator for udwf (#1061) |
kosiew <k****w@g****m>
Committed by: GitHub <n****y@g****m> |
12 months ago | |
| 4f457030 | added pytest asyncio tests (#1063) |
jsai28 <5****8@u****m>
Committed by: GitHub <n****y@g****m> |
12 months ago | |
| 55141bad |
Renaming Internal Structs (#1059)
Co-authored-by: Tim Saucer <t****r@g****m> |
Spaarsh <6****h@u****m>
Committed by: GitHub <n****y@g****m> |
12 months ago | |
| 3dcf7c7e | feat/making global context accessible for users (#1060) |
jsai28 <5****8@u****m>
Committed by: GitHub <n****y@g****m> |
12 months ago | |
| b194a877 | feat/improve ruff test coverage (#1055) |
Tim Saucer <t****r@g****m>
Committed by: GitHub <n****y@g****m> |
12 months ago | |
| 9d634de6 | Update python min version to 3.9 (#1043) |
Kevin Liu <k****u@u****m>
Committed by: GitHub <n****y@g****m> |
about 1 year ago | |
| 0002372c | Enable take comments to assign issues to users (#1058) |
Tim Saucer <t****r@g****m>
Committed by: GitHub <n****y@g****m> |
about 1 year ago | |
| d72f5605 | Enable FA ruff lint (#1052) |
Tim Saucer <t****r@g****m>
Committed by: GitHub <n****y@g****m> |
about 1 year ago | |
| 973d7ec4 |
feat: Implementation of udf and udaf decorator (#1040)
Co-authored-by: Tim Saucer <t****r@g****m> |
Crystal Zhou <4****9@u****m>
Committed by: GitHub <n****y@g****m> |
about 1 year ago | |
| acd70409 |
feat: reads using global ctx (#982)
Co-authored-by: Tim Saucer <t****r@g****m> |
Ion Koutsouris <1****o@u****m>
Committed by: GitHub <n****y@g****m> |
about 1 year ago | |
| 9027b4d7 | fix: type checking (#993) |
Chen Chongchen <c****y@q****m>
Committed by: GitHub <n****y@g****m> |
about 1 year ago | |
| a80a788f | Enable Dataframe to be converted into views which can be used in register_tab... |
kosiew <k****w@g****m>
Committed by: GitHub <n****y@g****m> |
about 1 year ago | |
| 69ebf70b | Chore: Release datafusion-python 45 (#1024) |
Tim Saucer <t****r@g****m>
Committed by: GitHub <n****y@g****m> |
about 1 year ago |