Type Parameters: T - type of the data.
public final class Dataset<T>
extends org.apache.spark.sql.Dataset<T>
Modifier and Type | Method and Description |
---|---|
static <U> Dataset<U> | from(org.apache.spark.sql.Dataset<U> dataset): Converts (downcasts) a Spark dataset to this class. The conversion is safe because datasets are immutable. |
JavaRDD<T> | javaRDD(): Creates a JavaRDD from this dataset. |
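The summary above says that `from` relies on immutability to turn a base dataset into this subclass. The sketch below illustrates that general pattern with plain Java stand-ins. `BaseDataset`, `WrappedDataset`, `rows()`, and `asList()` are hypothetical names invented for this illustration; they are not Spark classes, and this is not necessarily how the documented `from` is implemented.

```java
import java.util.ArrayList;
import java.util.Arrays;
import java.util.Collections;
import java.util.List;

// Hypothetical stand-in for an immutable base dataset (NOT the Spark class).
class BaseDataset<T> {
    private final List<T> rows;

    BaseDataset(List<T> rows) {
        // Defensive copy: the rows can never change after construction.
        this.rows = Collections.unmodifiableList(new ArrayList<>(rows));
    }

    List<T> rows() {
        return rows;
    }
}

// Hypothetical stand-in for the documented subclass.
final class WrappedDataset<T> extends BaseDataset<T> {
    private WrappedDataset(List<T> rows) {
        super(rows);
    }

    // Same shape as the documented from(...): because the base instance is
    // immutable, a subclass instance built from its state is equivalent to it.
    static <U> WrappedDataset<U> from(BaseDataset<U> dataset) {
        if (dataset instanceof WrappedDataset) {
            // Already the subclass, so no new object is needed.
            return (WrappedDataset<U>) dataset;
        }
        return new WrappedDataset<>(dataset.rows());
    }

    // Extra method offered only by the subclass (loosely analogous to javaRDD()).
    List<T> asList() {
        return rows();
    }
}

class FromSketch {
    public static void main(String[] args) {
        BaseDataset<String> base = new BaseDataset<>(Arrays.asList("a", "b"));
        WrappedDataset<String> wrapped = WrappedDataset.from(base);
        System.out.println(wrapped.asList());
    }
}
```

With the real class, the equivalent call would presumably take an existing `org.apache.spark.sql.Dataset<U>` and return the subclass, after which `javaRDD()` is available alongside every inherited method.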
Methods inherited from class org.apache.spark.sql.Dataset:
agg, agg, agg, agg, agg, alias, alias, apply, as, as, as, cache, checkpoint, checkpoint, coalesce, col, collect, collectAsList, collectToPython, columns, count, createGlobalTempView, createOrReplaceGlobalTempView, createOrReplaceTempView, createTempView, crossJoin, cube, cube, cube, cube, describe, describe, distinct, drop, drop, drop, drop, dropDuplicates, dropDuplicates, dropDuplicates, dropDuplicates, dropDuplicates, dtypes, except, explain, explain, explode, explode, exprEnc, filter, filter, filter, filter, first, flatMap, flatMap, foreach, foreach, foreachPartition, foreachPartition, groupBy, groupBy, groupBy, groupBy, groupByKey, groupByKey, head, head, hint, hint, inputFiles, intersect, isLocal, isStreaming, javaToPython, join, join, join, join, join, join, joinWith, joinWith, limit, logicalPlan, map, map, mapPartitions, mapPartitions, mapPartitionsInR, na, numericColumns, ofRows, orderBy, orderBy, orderBy, orderBy, org$apache$spark$sql$Dataset$$aggregatableColumns, org$apache$spark$sql$Dataset$$boundEnc, org$apache$spark$sql$Dataset$$classTag, org$apache$spark$sql$Dataset$$collectFromPlan, org$apache$spark$sql$Dataset$$createTempViewCommand, org$apache$spark$sql$Dataset$$rowFunction$1, org$apache$spark$sql$Dataset$$timeZone$1, org$apache$spark$sql$Dataset$$withPlan, persist, persist, printSchema, queryExecution, randomSplit, randomSplit, randomSplit, randomSplitAsList, rdd, reduce, reduce, registerTempTable, repartition, repartition, repartition, repartition, repartition, resolve, rollup, rollup, rollup, rollup, sample, sample, schema, select, select, select, select, select, select, select, select, select, selectExpr, selectExpr, selectUntyped, show, show, show, show, show, showString, showString$default$2, sort, sort, sort, sort, sortWithinPartitions, sortWithinPartitions, sortWithinPartitions, sortWithinPartitions, sparkSession, sqlContext, stat, storageLevel, take, takeAsList, toDF, toDF, toDF, toJavaRDD, toJSON, toLocalIterator, toPythonIterator, toString, transform, union, unionAll, unpersist, unpersist, where, where, withColumn, withColumn, withColumnRenamed, withWatermark, write, writeStream
public static <U> Dataset<U> from(org.apache.spark.sql.Dataset<U> dataset)
Type Parameters: U - type of the dataset.
Parameters: dataset - the Spark dataset.

Copyright © 2018. All rights reserved.