Use the Apache Spark Structured Streaming API with MongoDB

Overview

By the end of this project, you will have used the Apache Spark Structured Streaming API with Python to stream data from two different sources, store a dataset in a MongoDB database, and join two datasets. The Structured Streaming API continuously streams data from sources such as the file system or a TCP/IP socket. One application is continuously capturing data from weather stations to build a historical record.
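
The workflow described above (read a stream from a socket, join it with a second dataset, store the result in MongoDB) might look roughly like the following PySpark sketch. It is not the course's own code. The `station_id,temperature` line format, the helper names, the default host and port, and the use of the MongoDB Spark Connector 10.x (`format("mongodb")`) are all assumptions made for illustration.

```python
from typing import Dict


def socket_source_options(host: str = "localhost", port: int = 9999) -> Dict[str, str]:
    """Options for Spark's built-in socket source (host and port are assumed values)."""
    return {"host": host, "port": str(port)}


def mongo_sink_options(database: str, collection: str) -> Dict[str, str]:
    """Options for the MongoDB sink (assumes MongoDB Spark Connector 10.x)."""
    return {"database": database, "collection": collection}


def stream_readings(spark, host: str = "localhost", port: int = 9999):
    """Read hypothetical 'station_id,temperature' lines from a TCP socket."""
    # Imported lazily so the option helpers work without Spark installed.
    from pyspark.sql.functions import col, split

    lines = (
        spark.readStream.format("socket")
        .options(**socket_source_options(host, port))
        .load()
    )
    # The socket source yields a single string column named "value".
    fields = split(col("value"), ",")
    return lines.select(
        fields.getItem(0).alias("station_id"),
        fields.getItem(1).cast("double").alias("temperature"),
    )


def join_with_stations(readings, stations):
    """Join the streaming readings with a static DataFrame of station metadata."""
    return readings.join(stations, on="station_id", how="inner")


def write_to_mongo(joined, database: str, collection: str, checkpoint_dir: str):
    """Append each micro-batch to MongoDB; returns the running StreamingQuery."""

    def save_batch(batch_df, batch_id):
        # Assumes the connection URI is set in the Spark session configuration.
        (
            batch_df.write.format("mongodb")
            .mode("append")
            .options(**mongo_sink_options(database, collection))
            .save()
        )

    return (
        joined.writeStream.foreachBatch(save_batch)
        .option("checkpointLocation", checkpoint_dir)
        .start()
    )
```

To try it, you would create a `SparkSession` with the MongoDB connector on the classpath, pass it to `stream_readings`, and feed lines to the socket with a tool such as `nc -lk 9999`. Writing through `foreachBatch` is one option among several; it reuses the connector's batch writer for each micro-batch.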

English
Coursera