Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Bump getdaft from 0.2.33 to 0.3.0 #1080

Closed
wants to merge 1 commit into from

Conversation

dependabot[bot]
Copy link
Contributor

@dependabot dependabot bot commented on behalf of github Aug 20, 2024

Bumps getdaft from 0.2.33 to 0.3.0.

Release notes

Sourced from getdaft's releases.

v0.3.0

‼️ v0.2 → v0.3 Migration Guide ‼️

We're proud to release version 0.3.0 of Daft! Please note that with this minor version increment, v0.3 contains several breaking changes:

  • daft.read_delta_lake
    • This function was deprecated in favor of daft.read_deltalake in v0.2.26 and is now removed. (#2663)
  • daft.read_parquet / daft.read_csv / daft.read_json
    • Schema hints are deprecated in favor of infer_schema (whether to turn on schema inference) and schema (a definitive schema if infer_schema is False, otherwise it is used as a schema hint that is applied post inference). (#2326)
  • Expression.str.normalize()
    • Parameters are now all False by default, and need to individually be toggled on. (#2647)
  • DataFrame.agg / GroupedDataFrame.agg
    • Tuple syntax for aggregations was deprecated in v0.2.18 and is now no longer supported. Please use aggregation expressions instead. (#2663)
    • Ex: df.agg([(col("x"), "sum"), (col("y"), "mean")]) should be written instead as df.agg(col("x").sum(), col("y").mean())
  • DataFrame.count
    • Calling .count() with no arguments will now return a DataFrame with column “count” which contains the length of the entire DataFrame, instead of the count for each of the columns (#1996)
  • DataFrame.with_column
    • Resource requests should now be specified on UDF expressions (@udf(num_gpus=…)) instead of on Projections (through .with_column(..., resource_request=...) (#2654)
  • DataFrame.join
    • When joining two DataFrames, columns will now be merged only if they exactly match join keys. (#2631)
    • Ex:
df1 = daft.from_pydict({
	"a": ["x", "y"],
	"b": [1, 2]
})
df2 = daft.from_pydict({
"a": ["y", "z"],
"b": [20, 30]
})
result_df = df1.join(
df2,
left_on=[col("a"), col("b")],
right_on=[col("a"), col("b")/10], # NOTE THE "/10"
how="outer"
)
result_df.sort("a").collect()

# before
╭──────┬───────╮
│ a    ┆ b     │
│ ---  ┆ ---   │
│ Utf8 ┆ Int64 │
╞══════╪═══════╡
│ x    ┆ 1     │
├╌╌╌╌╌╌┼╌╌╌╌╌╌╌┤
</tr></table> 

... (truncated)

Commits
  • b3f5260 [CHORE] fix merge conflict in repr tests (#2700)
  • fbce3ac [BUG] Fix Parquet reads with chunk sizing (#2658)
  • dbcc4bb [PERF] Add ability to automatically choose broadcast for anti/semi joins (#2699)
  • 73c0742 [CHORE] Fix FOTW #001 images notebook (#2697)
  • 416a02d [FEAT] Ellipsize scan task sources if too many (#2695)
  • f34837d [FEAT] Allow user provided schema and schema inference length for read_sql (#...
  • 2127672 [BUG]: repr mermaid fix (#2688)
  • 7237a8a [BUG] Use Daft Pickle instead of Ray Pickle and use bincode for serializing (...
  • 6f87625 [DOCS] Add join types, renaming behavior, and example to join docs (#2691)
  • 5f5bdf9 [CHORE] Deprecate schema hints (#2655)
  • Additional commits viewable in compare view

Dependabot compatibility score

Dependabot will resolve any conflicts with this PR as long as you don't alter it yourself. You can also trigger a rebase manually by commenting @dependabot rebase.


Dependabot commands and options

You can trigger Dependabot actions by commenting on this PR:

  • @dependabot rebase will rebase this PR
  • @dependabot recreate will recreate this PR, overwriting any edits that have been made to it
  • @dependabot merge will merge this PR after your CI passes on it
  • @dependabot squash and merge will squash and merge this PR after your CI passes on it
  • @dependabot cancel merge will cancel a previously requested merge and block automerging
  • @dependabot reopen will reopen this PR if it is closed
  • @dependabot close will close this PR and stop Dependabot recreating it. You can achieve the same result by closing it manually
  • @dependabot show <dependency name> ignore conditions will show all of the ignore conditions of the specified dependency
  • @dependabot ignore this major version will close this PR and stop Dependabot creating any more for this major version (unless you reopen the PR or upgrade to it yourself)
  • @dependabot ignore this minor version will close this PR and stop Dependabot creating any more for this minor version (unless you reopen the PR or upgrade to it yourself)
  • @dependabot ignore this dependency will close this PR and stop Dependabot creating any more for this dependency (unless you reopen the PR or upgrade to it yourself)

Bumps [getdaft](https://github.com/Eventual-Inc/Daft) from 0.2.33 to 0.3.0.
- [Release notes](https://github.com/Eventual-Inc/Daft/releases)
- [Changelog](https://github.com/Eventual-Inc/Daft/blob/main/.history)
- [Commits](Eventual-Inc/Daft@v0.2.33...v0.3.0)

---
updated-dependencies:
- dependency-name: getdaft
  dependency-type: direct:production
  update-type: version-update:semver-minor
...

Signed-off-by: dependabot[bot] <[email protected]>
@dependabot dependabot bot added dependencies Pull requests that update a dependency file python Pull requests that update Python code labels Aug 20, 2024
Copy link
Contributor Author

dependabot bot commented on behalf of github Aug 24, 2024

Superseded by #1098.

@dependabot dependabot bot closed this Aug 24, 2024
@dependabot dependabot bot deleted the dependabot/pip/getdaft-0.3.0 branch August 24, 2024 01:53
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
dependencies Pull requests that update a dependency file python Pull requests that update Python code
Projects
None yet
Development

Successfully merging this pull request may close these issues.

0 participants