Apache Cassandra is a distributed NoSQL database (DB) which is used for handling Big data and real-time web applications. NoSQL stands for “Not Only SQL” or “Not SQL”. NoSQL database is a non-relational data management system, that does not require a fixed schema.
Related Posts
-
Building a real-time big data pipeline 9: Spark MLlib, Regression, Python
Apache Spark expresses parallelism by three sets of APIs – DataFrames, DataSets and RDDs (Resilient -
Building a real-time big data pipeline 10: Spark Streaming, Kafka, Java
Spark Streaming is an extension of the core Apache Spark platform that enables scalable, high-throughput, -
Building a real-time big data pipeline 8: Spark MLlib, Regression, R
Apache Spark MLlib is a distributed framework that provides many utilities useful for machine learning tasks,