Big Data/SparkStreaming

From Wikiversity
Jump to navigation Jump to search

Apache Spark Streaming is the streaming extension of Apache Spark.

The basic idea is buffer the data of a continuously arriving data stream for a certain amount of time. The resulting data stream segments are processed by Spark as usual. The resulting intermediate results are combined by applying specific operations like union.

References[edit | edit source]