As a Data Engineer with 3 years of hands-on experience across Data Science, Data Engineering, Cloud Technologies, and Generative AI, I turn data into actionable insights. I am passionate about combining technical and business acumen in the Data+AI space to optimize processes and solve complex problems. I currently lead data analytics initiatives in a part-time capacity.
- Data Analysis & Visualization - Power BI, Python (NumPy, pandas, Matplotlib), R, SQL, MySQL, Oracle, Data Mining, Data Modelling
- Statistical Analysis - Multivariate Analysis, Pattern Recognition, Advanced Statistical Methods (Hypothesis Testing, ANOVA, Time Series)
- Machine Learning - scikit-learn, Regression, Classification, Clustering (k-medoids), Neural Networks, NLP, Random Forest
- Cloud & Data Engineering - ETL/ELT Pipelines, Azure Data Lake Storage (ADLS) Gen2, Azure Data Factory (ADF), Azure Synapse Analytics, Azure Databricks, Role-Based Data Access Control (RBAC, ACL), Data Governance, Data Cataloging, Alation, Apache Kafka
- Gen AI - LLM Architecture, Retrieval-Augmented Generation (RAG), Context-Augmented Generation (CAG), GPT, BERT, LangChain
- Other Relevant - Git, GitHub, Kubernetes, Docker, PyTorch, TensorFlow, Microsoft Word, Microsoft Excel, Microsoft PowerPoint, Requirements Gathering, Strategic Thinking, Stakeholder Communication, Business Communication
- MS in Business Analytics (STEM) - San Francisco State University (2023-2025)
- BE in Computer Science and Engineering - Visvesvaraya Technological University (2017-2021)
- Certifications: Microsoft Technology Associate, Google Certified Data Engineer