cume_dist {SparkR}    R Documentation
Window function: returns the cumulative distribution of values within a window partition, i.e. the fraction of rows in the partition that are at or below the current row.
## S4 method for signature 'missing'
cume_dist()

cume_dist(x = "missing")
x    Empty. Should be used with no argument.
N = total number of rows in the partition
cume_dist(x) = (number of values before (and including) x) / N

This is equivalent to the CUME_DIST function in SQL.
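The formula above can be checked on a small local vector without a Spark session. The following is a minimal base R sketch, not SparkR code: `cume_dist_local` is a hypothetical helper introduced only for illustration, and the `hp` values are made up. It treats the whole vector as a single partition ordered by value.

```r
# Hypothetical helper (not part of SparkR): applies
#   cume_dist(x) = (number of values <= x) / N
# to a plain numeric vector, treated as one partition.
cume_dist_local <- function(v) {
  n <- length(v)  # N: total number of rows in the partition
  vapply(v, function(x) sum(v <= x) / n, numeric(1))
}

hp <- c(90, 110, 110, 150)  # illustrative values with one tie
cume_dist_local(hp)
# 0.25 0.75 0.75 1.00
```

Tied values share the same result because each one counts every row at or below it, including its peers. Base R's `ecdf(hp)(hp)` gives the same numbers.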
cume_dist since 1.6.0
Other window_funcs:
dense_rank, dense_rank,missing-method;
lag, lag,characterOrColumn-method;
lead, lead,characterOrColumn,numeric-method;
ntile, ntile,numeric-method;
percent_rank, percent_rank,missing-method;
rank, rank,ANY-method, rank,missing-method;
row_number, row_number,missing-method
## Not run:
##D df <- createDataFrame(mtcars)
##D ws <- orderBy(windowPartitionBy("am"), "hp")
##D out <- select(df, over(cume_dist(), ws), df$hp, df$am)
## End(Not run)