Google Data Analytics Capstone Project
This project is part of the Google Data Analytics Professional Certificate Capstone.
The objective is to analyze Cyclistic (Divvy) bike-share data and identify behavioral differences between:
- Casual riders
- Annual members
The goal is to generate actionable insights to help convert casual riders into annual members.
- Dataset Used: July 2020 Divvy Trip Data
- Download Link:
https://divvy-tripdata.s3.amazonaws.com/202007-divvy-tripdata.zip
⚠️ Due to file size limitations, the raw dataset is not included in this repository.
- Google Sheets – Data cleaning, Pivot Tables, Visualization
- BigQuery (SQL) – Aggregation and query-based analysis
- Python (Pandas) – Data preprocessing and validation
Cyclistic-project/
│
├── sql/
│ └── cyclistic_queries.sql
│
├── python/
│ └── python.ipynb
│
├── visuals/
│ ├── avg_ride_duration_by_user_type.png
│ ├── ride_count_by_weekday.png
│ └── time_of_day_usage.png
│
└── README.md- Removed null values
- Removed records where
ride_length = 0 - Converted ride duration into proper time format
- Created derived columns:
- Day of Week
- Time of Day (Morning, Afternoon, Evening, Night)
Cleaned dataset and pivot tables available here:
👉 [https://docs.google.com/spreadsheets/d/1YhSgkbri5ox_12BcBMrfc4_Cm1w04lXFLUrugLcJh9s/edit?usp=sharing]
- Casual riders ride significantly longer (~1 hour)
- Members ride shorter trips (~18 minutes)
- Indicates leisure-oriented behavior among casual riders
- Casual riders are more active on weekends
- Members show consistent weekday usage
- Suggests members primarily use bikes for commuting
- Members peak in morning and evening hours
- Casual riders peak in afternoon and evening
- Further supports commuting vs leisure behavior pattern
- Introduce weekend membership conversion offers
- Offer targeted discounts after long-duration casual rides
- Run marketing campaigns during peak leisure hours
To reproduce this project:
- Download dataset from the link above
- Place CSV file inside a
data/folder - Execute SQL queries in BigQuery
- Run
python.ipynbfor preprocessing and validation
Shivam Kumar
Aspiring Data Analyst | SQL | Power BI | Python | Excel