Passionate Software Developer | JAVA | Python & SQL Enthusiast | Distributed Systems | KAFKA | Spark | Bay Area Resident
π I enjoy building data pipelines and automating workflows.
π‘ Constantly learning and exploring Data + AI.
I am a versatile technologist with a strong foundation in Python, SQL, and software development principles, complemented by deep expertise in data engineering (Apache Spark, Hadoop, Snowflake, Airflow). I thrive on architecting and optimizing scalable systems, whether they are ETL pipelines, real-time data processing solutions, cloud-based architectures, or machine learning applications.
πΉ Technical Expertise: β Programming & Development: Python (incl. software design, testing), SQL, Java (Spring Boot), JavaScript (React/Next.js), APIs, System Design
β Big Data & Cloud: Apache Spark, Hadoop, Snowflake, AWS/GCP/Azure
β Data Engineering & Pipelines: ETL/ELT, Apache Airflow, Kafka, Docker
β Machine Learning & AI: Model Development & Deployment (TensorFlow, Keras, scikit-learn), Feature Engineering (TF-IDF), Exploratory Data Analysis (EDA), Data Preparation (Pandas), MLOps concepts
β Databases: MySQL, PostgreSQL, NoSQL, MongoDB
πΉ Projects & Experience:
β Led a team to win SJ Hacks 2025. by developing "SJ HOPES," a full-stack platform addressing homelessness in San Jose. Built in 24 hours, it features real-time shelter visibility, client support, and micro-opportunities using Spring Boot, React/Next.js, MySQL, and Google Maps API.
β Developed a machine learning model to predict Netflix content popularity, leveraging TF-IDF for feature extraction from textual data and performing comprehensive EDA to uncover key insights. Successfully built, trained, and evaluated various models to achieve robust predictive performance.
β Engineered an end-to-end real-time finance data pipeline using Kafka, Spark, and Snowflake for streaming data processing and analytics.
β Designed and automated a robust ETL workflow using Airflow and AWS Lambda for efficient data ingestion and transformation from diverse sources.
β Created a Big Data analytics solution using Hadoop, Spark, and Tableau to derive insights from large-scale datasets.
π‘ I am passionate about leveraging technology to solve complex problems and continuously explore new paradigms in software engineering, data science, and AI. My goal is to contribute to impactful projects by building scalable, high-performance software and data infrastructure, with a keen interest in tech for social good.
Here are some of the technologies I work with:
Let's connect! You can find me on:
π¦ Fun Fact: Some of my best code gets written after sunset... I code till night! ?
