This project is a simulated business case study based on real-world data from the New York City Taxi & Limousine Commission (TLC).
The objective is to analyze whether payment method (credit card vs. cash) influences taxi fare amounts, using exploratory data analysis (EDA) and statistical hypothesis testing (A/B testing). The results provide data-driven insights to help optimize pricing strategies and increase driver revenue.
In this fictional business scenario, the NYC Taxi & Limousine Commission (TLC) partnered with the data consulting firm Automatidata to develop a system that enables riders to estimate taxi fares before their trip.
As part of this initiative, the data team analyzed the relationship between fare amount and payment method, using descriptive statistics and hypothesis testing.
- Analyze whether payment method influences fare amount
- Apply descriptive statistics to summarize taxi trip data
- Conduct a hypothesis test (A/B testing)
- Generate business recommendations to increase driver revenue
For this project, we assume the dataset originates from a controlled experiment, enabling causal interpretation of results.
| Group | Description |
|---|---|
| Group A | Customers required to pay using credit card |
| Group B | Customers required to pay using cash |
This design allows us to determine whether payment method directly affects fare amount.
Source: NYC Taxi & Limousine Commission (TLC)
Key Variables:
fare_amountpayment_type
- Python
- Pandas — data manipulation
- NumPy — numerical computation
- Matplotlib & Seaborn — data visualization
- SciPy & Statsmodels — statistical hypothesis testing
- Summary statistics (mean, median, standard deviation, quartiles)
- Distribution visualization
- Outlier detection
- Comparison of fare distributions by payment type
Null Hypothesis (H₀): There is no difference in average fare amounts between credit card and cash payments.
Alternative Hypothesis (H₁): Customers who pay with credit cards have higher average fares than customers who pay with cash.
Statistical Test: Two-sample t-test Significance Level (α): 0.05
- Credit card users showed higher average fare amounts compared to cash users.
- The results were statistically significant, leading to rejection of the null hypothesis.
- This confirms that payment method influences spending behavior.
- Customers paying by credit card tend to spend more.
- Encouraging cashless payments can increase driver revenue.
-
Promote cashless payments through:
- In-app incentives
- Loyalty reward programs
- Discounts or cashback offers
-
Integrate digital payment promotions into the TLC rider app.
-
Optimize pricing strategies based on payment behavior patterns.
- Exploratory Data Analysis (EDA)
- A/B Testing
- Statistical Hypothesis Testing
- Data Visualization
- Business Analytics
- Data-Driven Decision Making