I am a final-year Computer Science undergraduate specializing in Big Data Analytics, with six months of experience as a Data Analyst Intern. I am passionate about building scalable data pipelines, optimizing ETL processes, and turning raw data into actionable business insights.
My hands-on experience covers data pipelines, ETL workflows, and cloud-based data systems. I enjoy working with structured and semi-structured data, optimizing data flows, and building scalable backend solutions.
- 🔭 I’m currently working on a Sales Data Visualization Dashboard using Python & SQL
- 👯 I’m looking to collaborate on open-source Big Data projects (using PySpark or Hadoop)
- 🤝 I’m looking for help with mastering advanced Cloud Data Engineering patterns on AWS
- 🌱 I’m currently learning advanced ETL processes and Data Warehousing architectures
- ⚡ Fun fact: I believe 80% of Data Engineering is just cleaning data (and I actually enjoy it!)
I have worked on projects involving:
- Large-scale data processing (50K+ records)
- Data validation & transformation using Python
- Cloud deployment using AWS
- SQL-based analytics and reporting
- A scalable backend built with Python & FastAPI
- Data pipelines for content processing
- Modular architecture designed for high availability
- AES-256 encryption implemented in Python
- Optimized handling of large files for security and performance
- ETL pipelines using Python & SQL with data cleaning and transformation
- CSV, JSON, and relational datasets