An end-to-end Machine Learning project that predicts house prices based on property features like area, location attributes, and amenities. This project covers everything from data preprocessing and model training to web deployment using Flask.
- Overview
- Demo
- Project Structure
- Features
- Tech Stack
- How to Run
- Model Performance
- Future Improvements
- Connect with Me
This project solves a regression problem where the goal is to estimate the price of a house based on features like:
- Area (sq ft)
- Number of bedrooms & bathrooms
- Road access (mainroad)
- Amenities like guest room, air conditioning, hot water, etc.
- Parking availability and furnishing status
Watch the full walkthrough video on YouTube
├── app.py # Flask backend
├── train_model.py # Model training script
├── housing.csv # Dataset
├── house_price_model.joblib # Trained model
├── scaler.joblib # Saved scaler
├── label_encoders.joblib # Saved label encoders
├── requirements.txt # Required packages
└── templates/
└── index.html # Web UI- Categorical variable encoding
- Feature scaling using
StandardScaler - Random Forest Regressor for price prediction
- Reusable model, scaler, and encoder artifacts
- Flask web interface and API endpoint
- Real-time predictions
- Languages: Python
- Libraries: pandas, numpy, scikit-learn, joblib, flask
- Tools: HTML/CSS, Flask
- Model: Random Forest Regressor
- Clone the repository
git clone https://github.com/yourusername/house-price-prediction.git
cd house-price-prediction- Install dependencies
pip install -r requirements.txt- Train the model
python train_model.py- Run the Flask app
python app.py- Open in your browser
http://127.0.0.1:5000/
Metric Score
Train R² 0.8783
Test R² 0.5917
Model Size 0.17 MB
-
Deploy the app on Render or Heroku
-
Add visual charts for better insights
-
Improve UI/UX with form validations
-
Use GridSearchCV for better hyperparameter tuning

