column_misc_functions {SparkR} | R Documentation |
Miscellaneous functions defined for Column
.
crc32(x) hash(x, ...) md5(x) sha1(x) sha2(y, x) xxhash64(x, ...) ## S4 method for signature 'Column' crc32(x) ## S4 method for signature 'Column' hash(x, ...) ## S4 method for signature 'Column' xxhash64(x, ...) ## S4 method for signature 'Column' md5(x) ## S4 method for signature 'Column' sha1(x) ## S4 method for signature 'Column,numeric' sha2(y, x)
x |
Column to compute on. In |
... |
additional Columns. |
y |
Column to compute on. |
crc32
: Calculates the cyclic redundancy check value (CRC32) of a binary column
and returns the value as a bigint.
hash
: Calculates the hash code of given columns, and returns the result
as an int column.
xxhash64
: Calculates the hash code of given columns using the 64-bit
variant of the xxHash algorithm, and returns the result as a long
column.
md5
: Calculates the MD5 digest of a binary column and returns the value
as a 32 character hex string.
sha1
: Calculates the SHA-1 digest of a binary column and returns the value
as a 40 character hex string.
sha2
: Calculates the SHA-2 family of hash functions of a binary column and
returns the value as a hex string. The second argument x
specifies the number
of bits, and is one of 224, 256, 384, or 512.
crc32 since 1.5.0
hash since 2.0.0
xxhash64 since 3.0.0
md5 since 1.5.0
sha1 since 1.5.0
sha2 since 1.5.0
## Not run:
##D # Dataframe used throughout this doc
##D df <- createDataFrame(cbind(model = rownames(mtcars), mtcars)[, 1:2])
##D tmp <- mutate(df, v1 = crc32(df$model), v2 = hash(df$model),
##D v3 = hash(df$model, df$mpg), v4 = md5(df$model),
##D v5 = sha1(df$model), v6 = sha2(df$model, 256))
##D head(tmp)
## End(Not run)