
Machine learning-based photometric redshift estimation using data from the Sloan Digital Sky Survey (SDSS) Main Galaxy Sample. This project explores the effects of photometric bands, magnitude errors, and morphological parameters on the accuracy of redshift predictions using RandomForestRegressor.


KarthikRaja-Selvan/Photo-z-Estimation-on-SDSS-MGS


Photometric Redshift Estimation using the Main Galaxy Sample (MGS) from the Sloan Digital Sky Survey (SDSS)

Project Overview

In this project, I explore photometric redshift (photo-z) estimation using machine learning, with data drawn from the Main Galaxy Sample (MGS) in the Sloan Digital Sky Survey (SDSS) Data Release 18. The focus is on improving photo-z estimation accuracy by experimenting with different input parameters and configurations.

Key research questions addressed in this analysis:

  1. Effect of Band Reduction: How does the accuracy of photometric redshift estimation change when the number of photometric bands is reduced?
  2. Inclusion of Band Magnitude Error: What is the impact on photo-z accuracy when band magnitude errors are included in the model input?
  3. Band Contribution: Which photometric band (among u, g, r, i, z) degrades the accuracy of photo-z estimation the most?
  4. Inclusion of Morphological Parameters: How does adding morphological information affect the accuracy of photo-z estimation?
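The four questions above can be read as a set of input-feature configurations to compare against a baseline. The following is a minimal sketch of how such configurations might be enumerated. The column names (`mag_u`, `magerr_u`, `morph_1`, ...) are hypothetical placeholders, not the catalogue columns used in this project, and leave-one-band-out is only one possible way to study band reduction.

```python
BANDS = ["u", "g", "r", "i", "z"]

# Placeholder morphology columns; the README does not name the actual parameters.
MORPH_COLS = ["morph_1", "morph_2"]


def mag_cols(bands):
    """Hypothetical magnitude column names for the given bands."""
    return [f"mag_{b}" for b in bands]


def err_cols(bands):
    """Hypothetical magnitude-error column names for the given bands."""
    return [f"magerr_{b}" for b in bands]


def build_feature_sets(bands=BANDS, morph_cols=MORPH_COLS):
    """Return a dict mapping an experiment name to its list of input columns."""
    sets = {"all_bands": mag_cols(bands)}

    # Questions 1 and 3: drop one band at a time and compare with the baseline.
    for dropped in bands:
        kept = [b for b in bands if b != dropped]
        sets[f"without_{dropped}"] = mag_cols(kept)

    # Question 2: add the magnitude errors to the baseline.
    sets["all_bands_plus_errors"] = mag_cols(bands) + err_cols(bands)

    # Question 4: add morphological parameters to the baseline.
    sets["all_bands_plus_morphology"] = mag_cols(bands) + list(morph_cols)
    return sets


feature_sets = build_feature_sets()
for name, cols in feature_sets.items():
    print(f"{name}: {cols}")
```

Each entry can then be used to select columns from the training table, so that every experiment trains the same model on a different input set.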

Dataset

The data come from SDSS Data Release 18 (DR18). The photometric measurements and corresponding morphological parameters of galaxies in the Main Galaxy Sample (MGS) serve as input for training and evaluating the machine learning models.

Machine Learning Model

To estimate photometric redshifts, I use the RandomForestRegressor from the scikit-learn library with its default configuration:

from sklearn.ensemble import RandomForestRegressor

# Default hyperparameters, written out explicitly for reference.
model = RandomForestRegressor(
    n_estimators=100,
    criterion='squared_error',
    max_depth=None,
    min_samples_split=2,
    min_samples_leaf=1,
    max_features=1.0,
    bootstrap=True
)
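As an illustration of how a default regressor could be trained and scored, here is a minimal sketch on synthetic data. The data generation, the train/test split, and the metrics (mean normalized residual, normalized median absolute deviation, and outlier fraction with a 0.15 cut) are assumptions. They are conventions often seen in photo-z work, not the evaluation procedure stated in this README.

```python
import numpy as np
from sklearn.ensemble import RandomForestRegressor
from sklearn.model_selection import train_test_split

rng = np.random.default_rng(0)

# Synthetic stand-in for a galaxy sample: 5 fake magnitudes that vary with redshift.
n = 2000
z_true = rng.uniform(0.0, 0.3, size=n)
slopes = np.array([4.0, 3.0, 2.0, 1.5, 1.0])  # arbitrary, for illustration only
X = 18.0 + z_true[:, None] * slopes + rng.normal(0.0, 0.05, size=(n, 5))

X_train, X_test, z_train, z_test = train_test_split(
    X, z_true, test_size=0.25, random_state=0
)

# Default hyperparameters; random_state is fixed only for reproducibility.
model = RandomForestRegressor(random_state=0)
model.fit(X_train, z_train)
z_pred = model.predict(X_test)


def photoz_metrics(z_spec, z_phot, outlier_cut=0.15):
    """Summary statistics of the normalized residual dz = (z_phot - z_spec) / (1 + z_spec)."""
    dz = (z_phot - z_spec) / (1.0 + z_spec)
    bias = np.mean(dz)
    sigma_nmad = 1.4826 * np.median(np.abs(dz - np.median(dz)))
    outlier_frac = np.mean(np.abs(dz) > outlier_cut)
    return {
        "bias": float(bias),
        "sigma_nmad": float(sigma_nmad),
        "outlier_frac": float(outlier_frac),
    }


metrics = photoz_metrics(z_test, z_pred)
print(metrics)
```

Running the same evaluation for each input configuration gives directly comparable numbers across experiments.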

Goals

  • To improve the accuracy of photometric redshift estimation by optimizing the input features.
  • To identify which photometric bands and parameters most significantly affect the estimation process.
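One possible way to approach the second goal is to inspect the impurity-based feature importances that a fitted random forest exposes. The sketch below uses synthetic data in which the target depends only on the first two columns. The feature names are hypothetical, and this is not necessarily the method used in this project.

```python
import numpy as np
from sklearn.ensemble import RandomForestRegressor

rng = np.random.default_rng(1)

# Hypothetical feature names; the real input columns are not listed in the README.
feature_names = ["mag_u", "mag_g", "mag_r", "mag_i", "mag_z"]

n = 1000
X = rng.normal(18.0, 1.0, size=(n, 5))
# Synthetic target: depends only on a "u - g" colour, plus a little noise.
z = 0.1 + 0.02 * (X[:, 0] - X[:, 1]) + rng.normal(0.0, 0.002, size=n)

model = RandomForestRegressor(random_state=0).fit(X, z)

# Rank features from most to least important.
ranking = sorted(
    zip(feature_names, model.feature_importances_),
    key=lambda pair: pair[1],
    reverse=True,
)
for name, importance in ranking:
    print(f"{name}: {importance:.3f}")
```

Impurity-based importances can be misleading when inputs are strongly correlated, as magnitudes in neighbouring bands usually are, so retraining with one band removed at a time is a useful cross-check.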
