This project involves executing an end-to-end real-time data engineering pipeline using simulated bank transaction data. Leveraging technologies such as Python, Apache Kafka, PySpark, DigitalOcean, Elasticsearch, and Kibana, the project focuses on real-time data streaming and processing with Apache Spark. The pipeline enables real-time data ingestion, transformation, and visualization to deliver actionable insights and support critical decision-making processes.
- Programming Language - Python
- DigitalOcean
- Apache Kafka
- Apache Spark
- Kibana
- ElasticSearch