- Notifications
You must be signed in to change notification settings - Fork 76
Open
Labels
enhancementNew feature or requestNew feature or requestgood first issueGood issues to pick-up for newcomersGood issues to pick-up for newcomersperformanceSomething related to how fast the library can handle dataSomething related to how fast the library can handle data
Milestone
Description
Let's say you write
df.filter { someValue > myColumn.max() }This is way faster:
val myColumnMax = df.myColumn.max()
df.filter { someValue > myColumnMax }Maybe we could solve this by having lazily calculated stats stored inside ValueColumns. Columns are immutable after all, so it would be safe to do so and the performance gain should be significant!
Of course, this wouldn't work when you write:
df.filter { someValue > (myColumn + 1).max() } // or df.filter { someValue > myColumn.maxOf { it + 1 } }but that's okay I think. We can't have it all :)
cosmicethics
Metadata
Metadata
Assignees
Labels
enhancementNew feature or requestNew feature or requestgood first issueGood issues to pick-up for newcomersGood issues to pick-up for newcomersperformanceSomething related to how fast the library can handle dataSomething related to how fast the library can handle data