Spark cluster in Kubernetes Playground

A setup for locally testing Apache Spark applications in a Kubernetes (K8S) cluster backed by an S3-based Hadoop filesystem.

This repo is a WIP.

Components

This setup has the following components:

- A Docker image with a Spark-Hadoop setup.
  - The image is kept lightweight by using several build stages and copying only the necessary files.
- The Kubernetes cluster is based on Minikube and runs in VirtualBox (check the Makefile).
- The S3 buckets are emulated by LocalStack and initialized in the docker-compose file (localstack-compose.yml).
- To monitor the Spark applications, you can start a Spark History Server by applying the K8S manifest (history-server.yaml).
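
As a rough sketch of how these components could be brought up by hand, the script below wraps each command in a small dry-run helper. It prints the commands by default and only executes them when `RUN=1` is set. The ordering, the `--driver=virtualbox` flag, and the `run` helper are assumptions for illustration; the Makefile remains the authoritative source for the actual targets and options.

```shell
#!/usr/bin/env bash
# Hypothetical bring-up sequence; not taken from the repo's Makefile.
# Prints each command unless RUN=1 is set, so it can be reviewed first.

COMPOSE_FILE="localstack-compose.yml"   # LocalStack S3 emulation
HISTORY_MANIFEST="history-server.yaml"  # Spark History Server manifest

# run: echo the command in dry-run mode, execute it when RUN=1.
run() {
  if [ "${RUN:-0}" = "1" ]; then
    "$@"
  else
    echo "+ $*"
  fi
}

# 1. Start the emulated S3 buckets in the background.
run docker-compose -f "$COMPOSE_FILE" up -d

# 2. Start a Minikube cluster on VirtualBox (driver flag is an assumption).
run minikube start --driver=virtualbox

# 3. Deploy the Spark History Server to monitor Spark applications.
run kubectl apply -f "$HISTORY_MANIFEST"
```

Running the script as-is only lists the three commands; `RUN=1 bash script.sh` would execute them, provided `docker-compose`, `minikube`, and `kubectl` are installed.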
