An open API service providing commit metadata for open source projects.

GitHub / apache/datafusion-python / commits

Apache DataFusion Python Bindings

SHA Message Author Date Stats
d7e137ee Enable remaining pylints (#1298) Tim Saucer <t****r@g****m>
Committed by: GitHub <n****y@g****m>
4 months ago
51dc78a6 chore: update datafusion minor version (#1297)
Co-authored-by: Kevin Liu <k****u@u****m>
Tim Saucer <t****r@g****m>
Committed by: GitHub <n****y@g****m>
4 months ago
a605b619 Update python minimum version to 3.10 (#1296) Tim Saucer <t****r@g****m>
Committed by: GitHub <n****y@g****m>
4 months ago
694a5d8d feat: Add SQL expression support for `with_columns` (#1286)
Co-authored-by: Hendrik Makait <h****k@m****m>
Marko Milenković <m****m@u****m>
Committed by: GitHub <n****y@g****m>
4 months ago
d9c90d27 feat: Add SQL expression for `repartition_by_hash` (#1285) Marko Milenković <m****m@u****m>
Committed by: GitHub <n****y@g****m>
5 months ago
a93d614a with_column supports SQL expression (#1284) Marko Milenković <m****m@u****m>
Committed by: GitHub <n****y@g****m>
5 months ago
29bcb0f9 Release 50.1 (#1281) Tim Saucer <t****r@g****m>
Committed by: GitHub <n****y@g****m>
5 months ago
60123493 feat: support session token parameter for AmaoznS3 (#1275) GCHQDeveloper028 <5****8@u****m>
Committed by: GitHub <n****y@g****m>
5 months ago
030873bb Add temporary view option for into_view (#1267) Tim Saucer <t****r@g****m>
Committed by: GitHub <n****y@g****m>
5 months ago
fe0cf8cd feat: allow DataFrame.filter to accept SQL strings (#1276) 𝕂 <5****h@u****m>
Committed by: GitHub <n****y@g****m>
5 months ago
6b162850 feat: expose `select_exprs` method on DataFrame (#1271)
Co-authored-by: Tim Saucer <t****r@g****m>
Marko Milenković <m****m@u****m>
Committed by: GitHub <n****y@g****m>
5 months ago
08901d53 feat: expose `DataFrame.parse_sql_expr` (#1274)
Co-authored-by: Tim Saucer <t****r@g****m>
Marko Milenković <m****m@u****m>
Committed by: GitHub <n****y@g****m>
5 months ago
16d4c038 chore: update dependencies (#1269) Tim Saucer <t****r@g****m>
Committed by: GitHub <n****y@g****m>
5 months ago
f5fdf596 feat: expose DataFrame.write_table (#1264)
Co-authored-by: Copilot <1****t@u****m>
Tim Saucer <t****r@g****m>
Committed by: GitHub <n****y@g****m>
5 months ago
6f3b1cab Unify Table representations (#1256)
Co-authored-by: Siew Kam Onn <k****w@g****m>
Tim Saucer <t****r@g****m>
Committed by: GitHub <n****y@g****m>
5 months ago
af1c07b7 Freeze PyO3 wrappers & introduce interior mutability to avoid PyO3 borrow err...
Co-authored-by: ntjohnson1 <2****1@u****m>, Tim Saucer <t****r@g****m>
kosiew <k****w@g****m>
Committed by: GitHub <n****y@g****m>
5 months ago
e75addfa macos-13 is deprecated (#1259) Kevin Liu <k****u@u****m>
Committed by: GitHub <n****y@g****m>
5 months ago
709c918e REVERT CHANGES to commit Siew Kam Onn <k****w@g****m> 5 months ago
0154c031 CHANGES to review Siew Kam Onn <k****w@g****m> 5 months ago
f08d5b04 Make Session Context `pyclass` frozen so interior mutability is only managed ... Nick <2****1@u****m>
Committed by: GitHub <n****y@g****m>
6 months ago
0ec01020 Fix drop() method to handle quoted column names consistently (#1242)
Co-authored-by: Tim Saucer <t****r@g****m>
H0TB0X420 <2****0@u****m>
Committed by: GitHub <n****y@g****m>
6 months ago
5f8d500a Update version number, add changelog (#1249) Tim Saucer <t****r@g****m>
Committed by: GitHub <n****y@g****m>
6 months ago
bf22c1d4 Prepare for DF50 (#1231) Tim Saucer <t****r@g****m>
Committed by: GitHub <n****y@g****m>
6 months ago
b7d3519d docs: add apache iceberg as datafusion data source (#1240) Kevin Liu <k****u@u****m>
Committed by: GitHub <n****y@g****m>
6 months ago
d54dc4ae Support string column identifiers for sort/aggregate/window and stricter Expr... kosiew <k****w@g****m>
Committed by: GitHub <n****y@g****m>
6 months ago
9e976362 chore: remove redundant error transformation (#1232) Daniel Mesejo <m****n@g****m>
Committed by: GitHub <n****y@g****m>
6 months ago
c609dfa3 feat: allow passing a slice to and expression with the [] indexing (#1215) Tim Saucer <t****r@g****m>
Committed by: GitHub <n****y@g****m>
6 months ago
b325a385 fix: update link to user example for custom table provider (#1224) Tyler White <5****k@u****m>
Committed by: GitHub <n****y@g****m>
6 months ago
332c69b6 docs: fix CaseBuilder documentation example (#1225) Tyler White <5****k@u****m>
Committed by: GitHub <n****y@g****m>
6 months ago
f0bbad75 Set fail on warning for documentation generation (#1218) Tim Saucer <t****r@g****m>
Committed by: GitHub <n****y@g****m>
6 months ago
200a249a Fixing a few Typos (#1220) Nick <2****1@u****m>
Committed by: GitHub <n****y@g****m>
6 months ago
61f981bc Add benchmark script and documentation for maximizing CPU usage in DataFusion... kosiew <k****w@g****m>
Committed by: GitHub <n****y@g****m>
6 months ago
b76cd8fd Update development guide in README.md (#1213) Koustubh Rao <8****o@u****m>
Committed by: GitHub <n****y@g****m>
7 months ago
c41c7f09 49.0.0 release (#1211) Tim Saucer <t****r@g****m>
Committed by: GitHub <n****y@g****m>
7 months ago
d6d6ea69 build(deps): bump actions/checkout from 4 to 5 (#1204)
Co-authored-by: dependabot[bot] <4****]@u****m>
Signed-off-by: dependabot[bot] <s****t@g****m>
dependabot[bot] <4****]@u****m>
Committed by: GitHub <n****y@g****m>
7 months ago
cb29176f build(deps): bump url from 2.5.4 to 2.5.7 (#1210)
Co-authored-by: dependabot[bot] <4****]@u****m>
Signed-off-by: dependabot[bot] <s****t@g****m>
dependabot[bot] <4****]@u****m>
Committed by: GitHub <n****y@g****m>
7 months ago
b5536435 build(deps): bump actions/download-artifact from 4 to 5 (#1201)
Co-authored-by: dependabot[bot] <4****]@u****m>
Signed-off-by: dependabot[bot] <s****t@g****m>
dependabot[bot] <4****]@u****m>
Committed by: GitHub <n****y@g****m>
7 months ago
0806ce52 fix(build): Include build.rs in published crates (#1199) Colin Marc <h****i@c****m>
Committed by: GitHub <n****y@g****m>
7 months ago
1ba8807c Improve `show()` output for empty DataFrames (#1208) kosiew <k****w@g****m>
Committed by: GitHub <n****y@g****m>
7 months ago
e28256cb suggestions (#1183) Dewey Dunnington <d****y@d****a>
Committed by: GitHub <n****y@g****m>
7 months ago
58925853 Allow users to pass a single expression instead of a list of expressions for ... Tim Saucer <t****r@g****m>
Committed by: GitHub <n****y@g****m>
7 months ago
f9479412 build(deps): bump slab from 0.4.10 to 0.4.11 (#1205)
Co-authored-by: dependabot[bot] <4****]@u****m>
Signed-off-by: dependabot[bot] <s****t@g****m>
dependabot[bot] <4****]@u****m>
Committed by: GitHub <n****y@g****m>
7 months ago
1a6260fc build(deps): bump async-trait from 0.1.88 to 0.1.89 (#1203)
Co-authored-by: dependabot[bot] <4****]@u****m>
Signed-off-by: dependabot[bot] <s****t@g****m>
dependabot[bot] <4****]@u****m>
Committed by: GitHub <n****y@g****m>
7 months ago
4472c6cf build(deps): bump uuid from 1.17.0 to 1.18.0 (#1202)
Co-authored-by: dependabot[bot] <4****]@u****m>
Signed-off-by: dependabot[bot] <s****t@g****m>
dependabot[bot] <4****]@u****m>
Committed by: GitHub <n****y@g****m>
7 months ago
f0af0956 build(deps): bump object_store from 0.12.2 to 0.12.3 (#1189)
Co-authored-by: dependabot[bot] <4****]@u****m>
Signed-off-by: dependabot[bot] <s****t@g****m>
dependabot[bot] <4****]@u****m>
Committed by: GitHub <n****y@g****m>
7 months ago
c8ddb87b Update how to dev instructions (#1179) Nick <2****1@u****m>
Committed by: GitHub <n****y@g****m>
7 months ago
fd845561 chore: Upgrade datafusion to version 49 (#1200) Nuno Faria <n****a@g****m>
Committed by: GitHub <n****y@g****m>
7 months ago
be888377 Fix Python UDAF Accumulator Interface example to Properly Handle State and Up... kosiew <k****w@g****m>
Committed by: GitHub <n****y@g****m>
8 months ago
94687cd6 Change merge rule to require branches up to date but do not require a committ... Tim Saucer <t****r@g****m>
Committed by: GitHub <n****y@g****m>
8 months ago
cbe845b1 Prepare for 48 release (#1175) Tim Saucer <t****r@g****m>
Committed by: GitHub <n****y@g****m>
8 months ago
2e1b7136 refactor: style loading logic in DataFrameHtmlFormatter (#1177) kosiew <k****w@g****m>
Committed by: GitHub <n****y@g****m>
8 months ago
98f4773b feat: add FFI support for user defined functions (#1145) Tim Saucer <t****r@g****m>
Committed by: GitHub <n****y@g****m>
8 months ago
9362f531 feat: python based catalog and schema provider (#1156)
Co-authored-by: renato2099 <r****n@g****m>
Tim Saucer <t****r@g****m>
Committed by: GitHub <n****y@g****m>
8 months ago
95456342 feat: collect once during display() in jupyter notebooks (#1167) Tim Saucer <t****r@g****m>
Committed by: GitHub <n****y@g****m>
9 months ago
03921d96 feat: dataframe string formatter (#1170) Tim Saucer <t****r@g****m>
Committed by: GitHub <n****y@g****m>
9 months ago
ef62fa89 Add compression_level support to ParquetWriterOptions and enhance write_parqu... kosiew <k****w@g****m>
Committed by: GitHub <n****y@g****m>
9 months ago
0d3c37f9 Consolidate DataFrame Docs: Merge HTML Rendering Section as Subpage (#1161)
Co-authored-by: Tim Saucer <t****r@g****m>
kosiew <k****w@g****m>
Committed by: GitHub <n****y@g****m>
9 months ago
1812a0d3 Fix signature of `__arrow_c_stream__` (#1168) Kyle Barron <k****e@d****g>
Committed by: GitHub <n****y@g****m>
9 months ago
98dc06b5 feat: Support Parquet writer options (#1123) Nuno Faria <n****a@g****m>
Committed by: GitHub <n****y@g****m>
9 months ago
9b6acec0 Support types other than String and Int for partition columns (#1154)
Co-authored-by: michele gregori <m****s@g****m>
Michele Gregori <m****r@g****m>
Committed by: GitHub <n****y@g****m>
9 months ago
13910780 feat: upgrade df48 dependency (#1143) Tim Saucer <t****r@g****m>
Committed by: GitHub <n****y@g****m>
9 months ago
dc0d35a2 Add Interruptible Query Execution in Jupyter via KeyboardInterrupt Support (#... kosiew <k****w@g****m>
Committed by: GitHub <n****y@g****m>
9 months ago
d6ef9bcb Add DataFrame API Documentation for DataFusion Python (#1132) kosiew <k****w@g****m>
Committed by: GitHub <n****y@g****m>
9 months ago
24f0b1a1 Add a documentation build step in CI (#1139) Crystal Zhou <4****z@u****m>
Committed by: GitHub <n****y@g****m>
9 months ago
0cc9b0a5 Release DataFusion 47.0.0 (#1130) Tim Saucer <t****r@g****m>
Committed by: GitHub <n****y@g****m>
10 months ago
278a33ef feat: add user defined table function support (#1113) Tim Saucer <t****r@g****m>
Committed by: GitHub <n****y@g****m>
10 months ago
1e7494ba add unit tests for expression functions (#1121) Tim Saucer <t****r@g****m>
Committed by: GitHub <n****y@g****m>
10 months ago
e8aa671b Move the udf module to user_defined (#1112) Tim Saucer <t****r@g****m>
Committed by: GitHub <n****y@g****m>
10 months ago
5a62c4b7 With the update to rustc 1.87 we now fail clippy due to the size of the error... Tim Saucer <t****r@g****m>
Committed by: GitHub <n****y@g****m>
10 months ago
f3c98ec7 Add fill_null method to DataFrame API for handling missing values (#1019) kosiew <k****w@g****m>
Committed by: GitHub <n****y@g****m>
10 months ago
7d8bcd8d Partial fix for 1078: Enhance DataFrame Formatter Configuration with Memory a...
Co-authored-by: Tim Saucer <t****r@g****m>
kosiew <k****w@g****m>
Committed by: GitHub <n****y@g****m>
10 months ago
15b96c48 feat: add missing PyLogicalPlan to_variant (#1085)
Co-authored-by: Tim Saucer <t****r@g****m>
Chen Chongchen <c****y@q****m>
Committed by: GitHub <n****y@g****m>
10 months ago
6fbeceff Copy over protected branch rule from datafusion repo (#1122) Tim Saucer <t****r@g****m>
Committed by: GitHub <n****y@g****m>
10 months ago
10600fb8 fix: recursive import (#1117) Chen Chongchen <c****y@q****m>
Committed by: GitHub <n****y@g****m>
11 months ago
5a7f6382 Add expression chaining of single parameter scalar functions deanm0000 <3****0@u****m>
Committed by: GitHub <n****y@g****m>
11 months ago
00dea113 Improve col class access using __getattr__
Co-authored-by: Tim Saucer <t****r@g****m>
deanm0000 <3****0@u****m>
Committed by: GitHub <n****y@g****m>
11 months ago
91b66351 Add DataFrame usage guide with HTML rendering customization options (#1108)
Co-authored-by: Tim Saucer <t****r@g****m>
kosiew <k****w@g****m>
Committed by: GitHub <n****y@g****m>
11 months ago
c9f15547 feat: alias with metadata (#1111) Chen Chongchen <c****y@q****m>
Committed by: GitHub <n****y@g****m>
11 months ago
d0d14f6e feat: update datafusion dependency 47 (#1107) Tim Saucer <t****r@g****m>
Committed by: GitHub <n****y@g****m>
11 months ago
818975b5 Add Configurable HTML Table Formatter for DataFusion DataFrames in Python (#1... kosiew <k****w@g****m>
Committed by: GitHub <n****y@g****m>
11 months ago
09b929a6 Documentation updates: mention correct dataset on basics page (#1081)
Co-authored-by: Kevin Liu <k****u@u****m>
Florian Schäfer <3****a@u****m>
Committed by: GitHub <n****y@g****m>
12 months ago
ffafb59e feat: support unparser (#1088) Chen Chongchen <c****y@q****m>
Committed by: GitHub <n****y@g****m>
12 months ago
583e1e94 Update changelog and version number (#1089) Tim Saucer <t****r@g****m>
Committed by: GitHub <n****y@g****m>
12 months ago
d0315ffa feat: Update DataFusion dependency to 46 (#1079) Tim Saucer <t****r@g****m>
Committed by: GitHub <n****y@g****m>
12 months ago
42982dad Improve collection during repr and repr_html (#1036) Tim Saucer <t****r@g****m>
Committed by: GitHub <n****y@g****m>
12 months ago
b8dd97bc Add additional ruff suggestions (#1062) Spaarsh <6****h@u****m>
Committed by: GitHub <n****y@g****m>
12 months ago
7c1c08f8 feat: expose regex_count function (#1066)
Co-authored-by: Nirnay Roy <n****2@g****m>
Nirnay Roy <3****y@u****m>
Committed by: GitHub <n****y@g****m>
12 months ago
2f52688d Add decorator for udwf (#1061) kosiew <k****w@g****m>
Committed by: GitHub <n****y@g****m>
12 months ago
4f457030 added pytest asyncio tests (#1063) jsai28 <5****8@u****m>
Committed by: GitHub <n****y@g****m>
12 months ago
55141bad Renaming Internal Structs (#1059)
Co-authored-by: Tim Saucer <t****r@g****m>
Spaarsh <6****h@u****m>
Committed by: GitHub <n****y@g****m>
12 months ago
3dcf7c7e feat/making global context accessible for users (#1060) jsai28 <5****8@u****m>
Committed by: GitHub <n****y@g****m>
12 months ago
b194a877 feat/improve ruff test coverage (#1055) Tim Saucer <t****r@g****m>
Committed by: GitHub <n****y@g****m>
12 months ago
9d634de6 Update python min version to 3.9 (#1043) Kevin Liu <k****u@u****m>
Committed by: GitHub <n****y@g****m>
about 1 year ago
0002372c Enable take comments to assign issues to users (#1058) Tim Saucer <t****r@g****m>
Committed by: GitHub <n****y@g****m>
about 1 year ago
d72f5605 Enable FA ruff lint (#1052) Tim Saucer <t****r@g****m>
Committed by: GitHub <n****y@g****m>
about 1 year ago
973d7ec4 feat: Implementation of udf and udaf decorator (#1040)
Co-authored-by: Tim Saucer <t****r@g****m>
Crystal Zhou <4****9@u****m>
Committed by: GitHub <n****y@g****m>
about 1 year ago
acd70409 feat: reads using global ctx (#982)
Co-authored-by: Tim Saucer <t****r@g****m>
Ion Koutsouris <1****o@u****m>
Committed by: GitHub <n****y@g****m>
about 1 year ago
9027b4d7 fix: type checking (#993) Chen Chongchen <c****y@q****m>
Committed by: GitHub <n****y@g****m>
about 1 year ago
a80a788f Enable Dataframe to be converted into views which can be used in register_tab... kosiew <k****w@g****m>
Committed by: GitHub <n****y@g****m>
about 1 year ago
69ebf70b Chore: Release datafusion-python 45 (#1024) Tim Saucer <t****r@g****m>
Committed by: GitHub <n****y@g****m>
about 1 year ago
