Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Polars course #40

Open
akshayka opened this issue Feb 27, 2025 · 15 comments
Open

Polars course #40

akshayka opened this issue Feb 27, 2025 · 15 comments
Labels
help wanted Extra attention is needed

Comments

@akshayka
Copy link
Contributor

akshayka commented Feb 27, 2025

Description

We're building a course on Polars, and are seeking contributing authors! Contributors will be credited as authors in the course's directory, and in the notebooks they contribute.

Claiming a notebook.

Any notebook without an assigned author needs a contributor. To get started, leave a comment to claim a notebook you'd like to contribute, and then create a pull request with your draft. Tutorials with a "🍃" are meant to highlight marimo features that give Polars superpowers.

Notebook Description Status Author
Why Polars An introduction to Polars and its advantages over other Dataframe libraries. @koushikkhan
Dataframes The central data structure for structured data 🚧
Loading data Loading data from various different sources
Basic operations Basic operations on data 🚧 @jorammutenge
Reactive plots 🍃 How to select data from Altair or Plotly plots and get them back as dataframes 🚧 @etrotta
Dataframe transformer 🍃 Using marimo's built in table viewer and dataframe transformer 🚧
Querying dataframes with SQL 🍃 Using marimo's SQL cells to query Polars dataframes 🚧
Working with columns Expression expansion and columns 🚧 @debajyotid2
Data types Basic types and casting 🚧 @debajyotid2
Strings Working with strings @peter-gy
Missing data Handling missing data 🚧
Aggregations Group bys and aggregations @jorammutenge
Window functions Advanced expressions with windowing functions ✍️ @ngriffiths13
User-defined functions Using custom Python operations @peter-gy
NumPy functions Applying NumPy functions to Dataframes 🚧
Lazy execution Lazily process queries with LazyFrames 🚧
@akshayka akshayka added the help wanted Extra attention is needed label Feb 27, 2025
@ngriffiths13
Copy link

I’d love to take some of these. Can I start with window functions?

@akshayka
Copy link
Contributor Author

@ngriffiths13 Yes, please do! Thank you!

@jorammutenge
Copy link
Contributor

I'm happy to take on Basic operations.

Can I get started on that?

@akshayka
Copy link
Contributor Author

I'm happy to take on Basic operations.

Can I get started on that?

Definitely! Assigned that to you in the table.

@etrotta
Copy link

etrotta commented Mar 1, 2025

I'll try to take on Reactive plots

@debajyotid2
Copy link

I can take a look at Working with columns and Data types.

@akshayka
Copy link
Contributor Author

akshayka commented Mar 1, 2025

Thank you @etrotta and @debajyotid2!

@akshayka akshayka mentioned this issue Mar 1, 2025
@peter-gy
Copy link
Contributor

peter-gy commented Mar 1, 2025

I would be glad to cover Strings and User-defined functions.

@jorammutenge
Copy link
Contributor

I sent in a PR for "basic operations"

Can I take on Aggregations too?

@peter-gy
Copy link
Contributor

Hey @akshayka 👋
The User-defined functions chapter has been merged with #57, could you please mark it with a ✅?

By the way, as the number of courses grows, it would be really cool to automate the chapter status reporting using GitHub workflows.

The Raycast Extensions repo could be a great source of inspiration, where each extension has assigned community members as maintainers. Structurally, this repo is relatively similar. Whenever a change is proposed to a particular course / chapter, the associated maintainers would be pinged under the PR to take a look. For now, this kind of tooling might be an overkill, but in the long term something similar could be interesting.

@akshayka
Copy link
Contributor Author

@peter-gy, thanks for the ping — done!

By the way, as the number of courses grows, it would be really cool to automate the chapter status reporting using GitHub workflows ... Whenever a change is proposed to a particular course / chapter, the associated maintainers would be pinged under the PR to take a look.

That's an excellent idea — I like the idea of courses having owners, and I think it makes sense to invest in scalability of this repo long-term.

@Haleshot
Copy link
Collaborator

@ngriffiths13, @etrotta, @debajyotid2 Any updates in regard to the notebooks? Totally understand if things have been busy (work/otherwise), no rush at all! Let me know if you need anything from my side.

@etrotta
Copy link

etrotta commented Mar 21, 2025

Any updates in regard to the notebooks? Totally understand if things have been busy (work/otherwise), no rush at all! Let me know if you need anything from my side.

I had some issues with marimo and didn't have time to debug it, e.g. Run All not working (freezing/getting stuck) despite running each cell individually working, then got pretty busy to top it off

I will try again this weekend and ask in Discord if it is still happening on the latest version

@ngriffiths13
Copy link

@ngriffiths13, @etrotta, @debajyotid2 Any updates in regard to the notebooks? Totally understand if things have been busy (work/otherwise), no rush at all! Let me know if you need anything from my side.

Sorry just got home yesterday from a trip to Hawaii. Should be able to finish this next week!

@debajyotid2
Copy link

@Haleshot I'll submit a PR by next week!

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
help wanted Extra attention is needed
Projects
None yet
Development

No branches or pull requests

7 participants