Spark Streaming logo

Spark Streaming

Processes streaming data using the same language-integrated API as batch processing, with automatic recovery of lost work and operator state.

Made by The Apache Software Foundation

    What is Spark Streaming?

    Spark Streaming provides a language-integrated API for stream processing, enabling developers to write streaming jobs in a similar manner to batch jobs. It supports a range of programming languages, including Java, Scala, and Python. Notably, Spark Streaming offers built-in recovery mechanisms for lost work and operator state, such as sliding windows, without requiring additional code from the developer

    Highlights

    • Language-integrated API for stream processing
    • Supports Java, Scala, and Python
    • Automatic recovery of lost work and operator state
    • Enables writing streaming jobs similar to batch jobs

    Social