My Web Markups - Mate W
Each service can use the type of database that is best suited to its needs.
Helps ensure that the services are loosely coupled. Changes to one service’s database does not impact any other services.
Different services have different data storage requirements. For some services, a relational database is the best choice. Other services might need a NoSQL database such as MongoDB, which is good at storing complex, unstructured data, or Neo4J, which is designed to efficiently store and query graph data.
Services must be loosely coupled so that they can be developed, deployed and scaled independently
Database per service
Services sometimes collaborate when handling requests.
consists of a set of worker machines
Kubernetes Components | Kubernetes
f a container goes down, another container needs to start. Wouldn't it be easier if this behavior was handled by a system?
What is Kubernetes? | Kubernetes
with the etcd store to read configuration details or write new values.
responsible for relaying information to and from the control plane services
An Introduction to Kubernetes | DigitalOcean
RDD can be created either from external storage or from another RDD and stores information about its parents to optimize execution (via pipelining of operations) and recompute partition in case of failure.
RDD could be thought as an immutable parallel data structure with failure recovery possibilities.
Spark Application (often referred to as Driver Program or Application Master) at high level consists of SparkContext and user code which interacts with it creating RDDs and performing series of transformations to achieve final result. These transformations of RDDs are then translated into DAG and submitted to Scheduler to be executed on set of worker nodes.
Spark is built around the concepts of Resilient Distributed Datasets and Direct Acyclic Graph representing transformations
Spark is a generalized framework for distributed data processing
Apache Spark: core concepts, architecture and internals
Kafka is a set of machines working together to be able to handle and process real-time infinite data.
A distributed streaming platform
Apache Kafka: What Is and How It Works | by João Guilherme Berti Sczip | The Startup | Mar, 2021 | Medium