{"id":1904,"date":"2020-07-04T06:44:08","date_gmt":"2020-07-04T10:44:08","guid":{"rendered":"http:\/\/sys4seq.com\/?p=1904"},"modified":"2022-06-07T12:10:30","modified_gmt":"2022-06-07T16:10:30","slug":"building-a-real-time-big-data-pipeline-4-spark-streaming-kafka-scala","status":"publish","type":"post","link":"https:\/\/sys4seq.com\/index.php\/2020\/07\/04\/building-a-real-time-big-data-pipeline-4-spark-streaming-kafka-scala\/","title":{"rendered":"Building a real-time big data pipeline 4 : Spark Streaming, Kafka, Scala"},"content":{"rendered":"<p>Apache Kafka is a scalable, high-performance, low-latency platform for handling real-time data feeds. Written in Scala and Java, Kafka lets applications read and write streams of data much like a messaging system. Kafka requires Apache ZooKeeper to run. Kafka v2.5.0 (Scala v2.12 build) and ZooKeeper (v3.4.13) were installed using Docker.<\/p>\n<p><a href=\"https:\/\/adinasarapu.github.io\/posts\/2020\/07\/blog-post-kafka-spark-streaming\/\" target=\"_blank\" rel=\"noopener\">&gt;&gt;&gt;<\/a><\/p>\n","protected":false},"excerpt":{"rendered":"Apache Kafka is a scalable, high-performance, low-latency platform for handling real-time data feeds. Written in Scala and Java, Kafka lets applications read and write streams of data much like a messaging system. Kafka requires Apache ZooKeeper to run. Kafka v2.5.0 (Scala v2.12 build) and ZooKeeper (v3.4.13) were installed using Docker. 
&gt;&gt;&gt;","protected":false},"author":1,"featured_media":0,"comment_status":"closed","ping_status":"open","sticky":false,"template":"","format":"standard","meta":{"_mi_skip_tracking":false,"_monsterinsights_sitenote_active":false,"_monsterinsights_sitenote_note":"","_monsterinsights_sitenote_category":0},"categories":[44,43],"tags":[45,50,52],"_links":{"self":[{"href":"https:\/\/sys4seq.com\/index.php\/wp-json\/wp\/v2\/posts\/1904"}],"collection":[{"href":"https:\/\/sys4seq.com\/index.php\/wp-json\/wp\/v2\/posts"}],"about":[{"href":"https:\/\/sys4seq.com\/index.php\/wp-json\/wp\/v2\/types\/post"}],"author":[{"embeddable":true,"href":"https:\/\/sys4seq.com\/index.php\/wp-json\/wp\/v2\/users\/1"}],"replies":[{"embeddable":true,"href":"https:\/\/sys4seq.com\/index.php\/wp-json\/wp\/v2\/comments?post=1904"}],"version-history":[{"count":4,"href":"https:\/\/sys4seq.com\/index.php\/wp-json\/wp\/v2\/posts\/1904\/revisions"}],"predecessor-version":[{"id":1968,"href":"https:\/\/sys4seq.com\/index.php\/wp-json\/wp\/v2\/posts\/1904\/revisions\/1968"}],"wp:attachment":[{"href":"https:\/\/sys4seq.com\/index.php\/wp-json\/wp\/v2\/media?parent=1904"}],"wp:term":[{"taxonomy":"category","embeddable":true,"href":"https:\/\/sys4seq.com\/index.php\/wp-json\/wp\/v2\/categories?post=1904"},{"taxonomy":"post_tag","embeddable":true,"href":"https:\/\/sys4seq.com\/index.php\/wp-json\/wp\/v2\/tags?post=1904"}],"curies":[{"name":"wp","href":"https:\/\/api.w.org\/{rel}","templated":true}]}}