{"id":1901,"date":"2020-06-22T06:40:41","date_gmt":"2020-06-22T10:40:41","guid":{"rendered":"http:\/\/sys4seq.com\/?p=1901"},"modified":"2022-06-07T12:10:07","modified_gmt":"2022-06-07T16:10:07","slug":"building-a-real-time-big-data-pipeline-3-spark-sql-hadoop-scala","status":"publish","type":"post","link":"https:\/\/sys4seq.com\/index.php\/2020\/06\/22\/building-a-real-time-big-data-pipeline-3-spark-sql-hadoop-scala\/","title":{"rendered":"Building a real-time big data pipeline 3 : Spark SQL, Hadoop, Scala"},"content":{"rendered":"<p>Apache Spark is an open-source cluster computing system that provides high-level API in Java, Scala, Python and R.Spark also packaged with higher-level libraries for SQL, machine learning, streaming, and graphs. Spark SQL is Spark\u2019s package for working with structured data.<\/p>\n<p><a href=\"https:\/\/adinasarapu.github.io\/posts\/2020\/02\/blog-post-spark-sql\/\" target=\"_blank\" rel=\"noopener\">&gt;&gt;&gt;<\/a><\/p>\n","protected":false},"excerpt":{"rendered":"Apache Spark is an open-source cluster computing system that provides high-level API in Java, Scala, Python and R.Spark also packaged with higher-level libraries for SQL, machine learning, streaming, and graphs. Spark SQL is Spark\u2019s package for working with structured data. &gt;&gt;&gt;","protected":false},"author":1,"featured_media":0,"comment_status":"closed","ping_status":"closed","sticky":false,"template":"","format":"standard","meta":{"_mi_skip_tracking":false,"_monsterinsights_sitenote_active":false,"_monsterinsights_sitenote_note":"","_monsterinsights_sitenote_category":0},"categories":[44,43],"tags":[49,50,51],"_links":{"self":[{"href":"https:\/\/sys4seq.com\/index.php\/wp-json\/wp\/v2\/posts\/1901"}],"collection":[{"href":"https:\/\/sys4seq.com\/index.php\/wp-json\/wp\/v2\/posts"}],"about":[{"href":"https:\/\/sys4seq.com\/index.php\/wp-json\/wp\/v2\/types\/post"}],"author":[{"embeddable":true,"href":"https:\/\/sys4seq.com\/index.php\/wp-json\/wp\/v2\/users\/1"}],"replies":[{"embeddable":true,"href":"https:\/\/sys4seq.com\/index.php\/wp-json\/wp\/v2\/comments?post=1901"}],"version-history":[{"count":9,"href":"https:\/\/sys4seq.com\/index.php\/wp-json\/wp\/v2\/posts\/1901\/revisions"}],"predecessor-version":[{"id":1967,"href":"https:\/\/sys4seq.com\/index.php\/wp-json\/wp\/v2\/posts\/1901\/revisions\/1967"}],"wp:attachment":[{"href":"https:\/\/sys4seq.com\/index.php\/wp-json\/wp\/v2\/media?parent=1901"}],"wp:term":[{"taxonomy":"category","embeddable":true,"href":"https:\/\/sys4seq.com\/index.php\/wp-json\/wp\/v2\/categories?post=1901"},{"taxonomy":"post_tag","embeddable":true,"href":"https:\/\/sys4seq.com\/index.php\/wp-json\/wp\/v2\/tags?post=1901"}],"curies":[{"name":"wp","href":"https:\/\/api.w.org\/{rel}","templated":true}]}}