Skip to content

A setup to locally test Apache Spark applications in a Kubernetes cluster backed by a S3-based Hadoop.

Notifications You must be signed in to change notification settings

MLobo1997/spark-kubernetes-playground

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

11 Commits
 
 
 
 
 
 
 
 
 
 
 
 

Repository files navigation

Spark cluster in Kubernetes Playground

A setup to locally test Apache Spark applications in a Kubernetes (K8S) cluster backed by a S3-based Hadoop.

This repo is a WIP.

Components

This setup has the following components:

  • A Docker image with a Spark-hadoop setup.
    • Created a lightweight image by doing several build stages and copying only the necessary files.
  • The Kubernetes cluster is based in Minikube and runs in Virtualbox (check the Makefile).
  • The S3 buckets are emulated by Localstack and initialized in this docker-compose.
  • To monitor the Spark Applications, you can start a Spark History Server by applying this K8S manifest.

About

A setup to locally test Apache Spark applications in a Kubernetes cluster backed by a S3-based Hadoop.

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published