- Introduction
- Getting Started
- System Requirements
- Getting Help
- Reference Guides
- Data Challenges
- Known Caveats and Limitations
- Releases
- Citing
Welcome to the third COSI Data Challenge (DC3)! The COSI Data Challenges are released on a yearly basis in preparation for the launch of the COSI Small Explorer (SMEX) class mission in 2027 (Tomsick+23). They are based on simulated data, which is intended to closely mimic the real flight data. Every year the Data Challenges have increasingly more realistic source and background models, and they are analyzed with increasingly complete and matured analysis tools. In general there are two main goals of the Data Challenges:
- Facilitate development of the COSI data pipeline and analysis tools
- With routine feedback from scientists
- Alongside development of the expected source models by the science team
- Provide resources to the astrophysics community to become familiar with COSI data
- Excellent training for science team in preparation for first analyses after launch
- Public releases help with community building before COSI data is released
The only software requirement for DC3 is cosipy. A general introduction into cosipy, including installation instructions, can be found in the cosipy-intro directory. For a general introduction into analyzing data from Compton telescopes see Compton-telescope-data-analysis-intro. Note that cosipy is part of the larger COSITools, which is a broad collection of COSI data analysis tools, documentation, and verification data sets. COSITools can be installed by following the installation instructions here. This also includes MEGAlib, which is the main software program used for running simulations. However, unless you need MEGAlib and/or COSITools for other reasons, it's highly recommended to just install cosipy.
This year's Data Challenge is based on 3 months of exposure time, for an equatorial orbit at an altitude of 530 km, with a pointing that rocks between
The input models and challenges for DC3 were provided by the COSI science teams. There are challenges for the different science groups: GRBs, Positrons, Nucleosynthesis, Galactic, Extragalactic, and Dark Matter. These are described in detail in the Data Challenges section below.
Users are encouraged to post feedback on the Discussions page. This can include solutions to specific challenges, questions, issues, etc.
In summary, to get started with DC3, install cosipy, familiarize yourself with the data-products, and then start working through the Data Challenges, as described below.
One of our goals in developing cosipy is to make it easily accesible to all users. All of the Data Challenges should be doable on a laptop with at least 16 GB of RAM. We are still working on optimizing the code, and so please let us know if you are running into memory issues.
Please submit a new issue in the cosipy git repository if you have issues with the code. If you have general feedback, or need further assistance, please reach out to the COSI Data Challenges team lead, Chris Karwin ([email protected]), the cosipy implementation lead, Israel Martinez-Castellanos ([email protected]), and the pipeline development lead Carolyn Kierans ([email protected]).
- Introduction to Compton telescope data analysis
- Introduction to cosipy
- Simplified 2D tutorials (from the Gamma-ray Toys repository)
- Summary of background simulations
- Dealing with Earth occultation
- Introduction to polarization
We have created example Jupyter notebooks demonstrating all of the tools that will be needed to complete this year's data challenges. They are available as part of the cosipy release, and listed below:
Example 1: dataIO
Example 2: detector response
Example 3: GRB localization
Example 4: GRB spectral fit
Example 5: Crab spectral fit
Example 6: 511 spectral fit
Example 7: Crab imaging
Example 8: 511 imaging
Example 9: Source injector
Example 10: TS maps
Example 11: Polarization (ASAD method)
Example 12: Continuum background estimation
Example 13: Line background estimation
If you haven't worked with Jupyter before, you can find some help here.
As a very first step, try working through some of the example notebooks. It is highly recommended to start with the dataIO notebook, as this describes the general handling of COSI data, and it is needed for almost all other notebooks. Specific challenges for the different science topics are described below. You can start with whichever topic you are most interested in. Each challenge will refer you to a specific example notebook that will demonstrate the basic tools needed to complete the respective challenge. If you have completed the main challenges and are interested in exploring other models, you can employ the source injector (see the Source injector example). If you are interested in getting more involved in the cosipy development, see the Known Caveats and Limitations section at the bottom of this page, as well as the bottom of the cosipy-intro, which outlines some of the priority areas for the next stages of development.
All input models used for the simulations can be found in the DC3 source library of the COSI simulation pipeline, available here. This includes all the information about the injected sources, and it can be used for checking the results of the data challenges.
Configuration Files:
The configuration for the data binning is specified via yaml files, as demonstrated in the dataIO tutorial (and others). The data binning must match the response binning, and so the yaml configuration files must be updated accordingly when working with DC3 data. The detector response tutorial shows how to determine the response binning. The main difference with the DC3 response files is that the phi binning is now 6 degrees compared to 5 degrees for DC2. We are currently working on refining how paramaters are configured in cosipy, and this will be updated starting with DC4.
Orientation Files:
Two orientation files are available:
DC3_final_530km_3_month_with_slew_15sbins_GalacticEarth_SAA.ori
DC3_final_530km_3_month_with_slew_1sbins_GalacticEarth_SAA.ori
The 1 second binning may be optimal for analyzing transients on short time scales, but generally the 15 second binning should be sufficient and is considered the default.
Background Modeling:
DC3 includes all of the background components. Compared to the background estimates from DC2, we have now included the full SAA passage, as well as the Galacic diffuse continuum emission. Further details of the background models and simulations be found in the backgrounds directory. The available background files are listed in the data-products directory. We provide a file with the total background, as well as separate files for the individual background components. For convenience, we have also provided the binned total background for each of the different response files.
For analyzing data in DC3, the starting point is to model the backgrounds using the actual simulated backgrounds themselves. This is the ideal case, where the backgrounds are perfectly known, which of course is not very realistic. To simplify the analysis, it is sometimes helpful to start with just a single background component (e.g. albedo photons), and then move on to the total background after everything is working. The next step is to estimate the backgrounds. With DC3 we have new methods to estimate the background for both line and continuum sources. The background estimation tool for line emission can be used for both point sources and extended sources. An example tutorial can be found in the line background estimation notebook. The background estimation tool for continuum emission is currently only available for point sources. An example tutorial is available in the continuum background estimation notebook. We stress that both of these background estimation algorithms are only first versions, and further development and testing is still needed. More details are provided in the respective example tutorials.
Data Challenges for the different science topics can be found below (click to expand):
GRBs
GRB Data Challenges
The Key Objectives for GRB science with COSI are:
- Detect short GRBs following neutron star mergers, which are a known source of gravitational waves
- Constrain geometries and emission processes with polarization
The tools needed to complete these challenges are demonstrated in the GRB spectral fit, GRB localization, and Polarization (ASAD method) examples.
DC3 Goals:
- Localize a GRB and fit the spectrum
- Measure polarization (fraction and angle) of a GRB
- Localize a MGF and fit the spectrum
- Measure the polarization (fraction and angle) of a MGF
- Check if the magnetar short burst is detectable, and if detectable localize and fit the spectrum
- Stretch goal: Develop a classification technique to distinguish a GRB or MGF
We have simulated 12 GRBs, 6 Magnetar Giant Flares (MGFs), and 1 magnetar short burst within the 3 months of observation time. GRB bn090424592 is the brightest of the bursts we’ve simulated and it may be easiest to start with that one. Access to the response files and the simulated source and background files, including the full Wasabi path, is detailed in the data-products page.
The challenges will use the following detector response files:
- ResponseContinuum.o3.e100_10000.b10log.s10396905069491.m2284.filtered.nonsparse.binnedimaging.imagingresponse_nside8.area.good_chunks.h5.gz
- ResponseContinuum.o3.pol.e200_10000.b4.p12.s10396905069491.m441.filtered.nonsparse.binnedpolarization.11D_nside8.area.good_chunks.h5.gz (for polarization)
Note: the second response file is used for polarization analysis and has fewer energy bins.
Data Files:
GRB_bn081207680_3months_unbinned_data_filtered_with_SAAcut.fits.gz
GRB_bn090424592_3months_unbinned_data_filtered_with_SAAcut.fits.gz
GRB_bn100612726_3months_unbinned_data_filtered_with_SAAcut.fits.gz
GRB_bn110605183_3months_unbinned_data_filtered_with_SAAcut.fits.gz
GRB_bn131122490_3months_unbinned_data_filtered_with_SAAcut.fits.gz
GRB_bn140329295_3months_unbinned_data_filtered_with_SAAcut.fits.gz
GRB_bn161004964_3months_unbinned_data_filtered_with_SAAcut.fits.gz
GRB_bn170405777_3months_unbinned_data_filtered_with_SAAcut.fits.gz
GRB_bn180504136_3months_unbinned_data_filtered_with_SAAcut.fits.gz
GRB_bn180703876_3months_unbinned_data_filtered_with_SAAcut.fits.gz
GRB_bn080802386_flux150_3months_unbinned_data_filtered_with_SAAcut.fits.gz
GRB_bn080802386_flux300_3months_unbinned_data_filtered_with_SAAcut.fits.gz
GRB_MGF051103_3months_unbinned_data_filtered_with_SAAcut.fits.gz
GRB_MGF070201_3months_unbinned_data_filtered_with_SAAcut.fits.gz
GRB_MGF070222_3months_unbinned_data_filtered_with_SAAcut.fits.gz
GRB_MGF180128A_3months_unbinned_data_filtered_with_SAAcut.fits.gz
GRB_MGF200415A_3months_unbinned_data_filtered_with_SAAcut.fits.gz
GRB_MGF231115A_3months_unbinned_data_filtered_with_SAAcut.fits.gz
MgtBurst_bright_complex_10x_3months_unbinned_data_filtered_with_SAAcut.fits.gz
Input Models:
All input models can be found here.
The simulated GRBs occur randomly within the 3-month orientation file, with their positions chosen such that they have incidence angles under
- bn081207680: PF = 0, PA =
$0^\circ$ , t = 1836496300.0 s - bn090424592: PF = 0.1, PA =
$110^\circ$ , t = 1837507002.0 s - bn100612726: PF = 0.2, PA =
$35^\circ$ , t = 1839617230.0 s - bn110605183: PF = 0.3, PA =
$50^\circ$ , t = 1841896404.0 s - bn131122490: PF = 0.4, PA =
$175^\circ$ , t = 1842123855.0 s - bn140329295: PF = 0.5, PA =
$95^\circ$ , t = 1837652924.0 s - bn161004964: PF = 0.6, PA =
$10^\circ$ , t = 1842948485.0 s - bn170405777: PF = 0.7, PA =
$160^\circ$ , t = 1841915858.0 s - bn180504136: PF = 0.8, PA =
$45^\circ$ , t = 1836985181.0 s - bn180703876: PF = 0.9, PA =
$25^\circ$ , t = 1838652949.0 s - bn080802386: PF = 0.8, PA =
$90^\circ$ , t = 1835493492.2 s
Note: The two short GRBs (bn080802386) are unrealistically bright and are intended to be used for the polarization tutorial.
Information for the MGFs, including reference papers (PF = 1, PA =
- MGF051103 (Svinkin+21): t = 1835533723.498 s
- MGF070201 (Ofek+08): t = 1835537496.0 s
- MGF070222 (Svinkin+16): t = 1835491681.0 s
- MGF180128A (Trigg+24): t = 1835488380.0 s
- MGF200415A (Svinkin+21): t = 1835488530.0 s
- MGF231115A (Trigg+25): t = 1835533696.0 s
Information for magnetar short burst:
- MgtBurst_bright_complex_10x: no polarization, t = 1835640345.022513 s
- SGR 1935+2154 (Li+21) - a bright burst with a complex light curve. Flux has been increased by a factor of 10.
Positrons
Positron Data Challenges
The Key Objectives for the COSI positron science goal are:
- Identify potential individual positron sources in the Galaxy
- Determine the scale-height of the Galactic disk
- Study the annihilation mechanism and the differences between the Galactic disk and the Galactic bulge
- Investigate the energy at which positrons are created
For DC3, we have challenges associated with 1–3 of the Key Objectives.
The tools needed to complete these challenges are demonstrated in the 511 imaging and 511 spectral fit notebooks.
DC3 Goals:
- Image individual positron sources in the Galaxy
- Generate a full sky image of the Galactic diffuse 511 keV emission
- Determine the scale height of the 511 keV disk
- Compare line width and ortho-Ps fraction in bulge and disk
- Measure the Doppler shift in the disk
- Stretch goal: cross-correlate the measured 511 keV emission with 26Al map (1809 keV emission) to constrain propagation distances
We have developed models for the 511 keV and ortho-positronium emission from 26Al and 44Ti
The challenges will use the following detector response files:
- Response511.o4.e509_513.s20881894470591.m2555.filtered.nonsparse.binnedimaging.imagingresponse_nside16.area.good_chunks.h5.gz
- ResponseContinuum.o3.e100_10000.b10log.s10396905069491.m2284.filtered.nonsparse.binnedimaging.imagingresponse_nside8.area.good_chunks.h5.gz
- extended_source_response_511_merged.h5.gz (precomputed 511 extended source response file)
- extended_source_response_continuum_merged.h5.gz (precomputed continuum extended source response file)
The line response is for analyzing the 511 keV line emission, and the continuum response is for analyzing the orthopositronium continuum. Currently, these two components cannot be analyzed simultaneously, as desribed in the Known Caveats and Limitations section. The pre-computed extended source responses should be used for imaging in Galactic coordinates and the 511 spectral fit notebook when analyzing extended sources (i.e. positrons from 26Al or 44Ti).
Positrons from 26Al and 44Ti
Data Files:
Positrons_from_26Al_line_3months_unbinned_data_filtered_with_SAAcut.fits.gz
Positrons_from_26Al_cont_3months_unbinned_data_filtered_with_SAAcut.fits.gz
Positrons_from_44Ti_line_3months_unbinned_data_filtered_with_SAAcut.fits.gz
Positrons_from_44Ti_cont_3months_unbinned_data_filtered_with_SAAcut.fits.gz
Input Models:
The spatial distribution of positrons from 26Al and 44Ti decay is derived from the propagation of positrons in our Galaxy, where the initial positions of positrons follows the NE2001 model from Cordes & Lazio 2002. The model fits the distribution of massive stars in our Galaxy, with an assumed fraction of 2% for both 26Al and 44Ti in the bulge. The positron propagation method is described in Alexis et al. 2014. The positron rate due to 26Al is set to
The spectrum for these diffuse models have two separate components: the 511 keV annihilation line and the ortho-positronium continuum (i.e. files containing “cont”). These were obtained from the model of Guessoum et al 2005 with a computed spectrum for each phase of the interstellar medium, based on the spatial distribution described above. The spectral model was corrected by the Galactic rotation using the model of Fich, Blitz and Stark 1989 for R > 3 kpc and a solid rotation model for R < 3 kpc.
The imaged spatial distribution should resemble the initial model, shown here for the 511 keV emission from 26Al decays, and the resulting measured full-Galaxy 511 keV flux should be
Extragalactic Sources
Data Files:
LMC_Gaussian_511_3months_unbinned_data_filtered_with_SAAcut.fits.gz
LMC_Gaussian_511_x100_3months_unbinned_data_filtered_with_SAAcut.fits.gz
M31_Gaussian_511_3months_unbinned_data_filtered_with_SAAcut.fits.gz
M31_Gaussian_511_x100_3months_unbinned_data_filtered_with_SAAcut.fits.gz
Virgo_Gaussian_511_3months_unbinned_data_filtered_with_SAAcut.fits.gz
Virgo_Gaussian_511_x100_3months_unbinned_data_filtered_with_SAAcut.fits.gz
Input Models:
The fluxes for extragalactic sources (LMC, M31, Virgo) are estimated by assuming that the 511 keV photon flux is proportional to the stellar mass of the source. Using a Milky Way 511 keV flux of
The stellar masses are:
LMC:
M31:
Virgo:
We also include each source at 100x the nominal flux, in order to ensure that they are above COSI's 511 keV line sensitivity.
Globular Clusters
Data Files:
Globular_Cluster_Tuc_47_3months_unbinned_data_filtered_with_SAAcut.fits.gz
Globular_Cluster_Omega_Cen_3months_unbinned_data_filtered_with_SAAcut.fits.gz
Globular_Cluster_NGC_6397_3months_unbinned_data_filtered_with_SAAcut.fits.gz
Globular_Cluster_NGC_6121_3months_unbinned_data_filtered_with_SAAcut.fits.gz
Input Models:
The 511 keV photon flux for the brightest 4 globular clusters is estimated based on the 0.1 - 100 GeV luminosity from the Globular clusters, and the nuclear bulge and boxy bulge emission in the Milky Way Bartels et al 2018a. The GeV luminosity of the Bulge is compared to the 511 keV luminosity of the bulge Bartels et al 2018b. This GeV luminosity vs 511 keV luminosity correlation is applied to the GeV excess observed from globular clusters Zhang et al 2016 in the Galaxy to estimate the 511 keV flux of the globular clusters. The flux values shown here are 3x the values estimated from the correlation in order to make them within COSI's sensitivity limit within the 3 month observation period.
The simulated 511 keV flux for the GCs are:
Nucleosynthesis
Nucleosynthesis Data Challenges
The Key Objectives for the COSI nucleosynthesis science goal are:
- Reveal the history of core collapse supernova activity in the Galaxy
- Determine the role of the evolution of massive stars in creating the elements
- Detect nuclear line emission from young supernova remnants in the Galaxy
- Probe explosion physics in the core of supernovae
The tools needed to complete most of these challenges are demonstrated in the 511 imaging and 511 spectral fit notebooks. For the 44Ti data challenge you will need the Crab spectral fit, Crab imaging, and GRB localization notebooks since Cas A is modeled as a point source.
DC3 Goals:
- Image the 26Al (1.8 MeV) diffuse emission
- Fit the spectral shape of the Galactic 1.8 MeV line and detect the Doppler shift of the line in the disk
- Image the 60Fe diffuse emission at 1173 keV and 1332 keV
- Fit the two 60Fe decay lines to find the total mass of 60Fe in the Galaxy
- Extract the 26Al/60Fe ratio and its uncertainty
- Detect the 26Al and 60Fe emission from the Cygnus region and measure widths of the gamma-ray line emissions
- Determine the mass of 44Ti in Cas A from the 1157 keV line emission
- Stretch goal: Cross-correlate the 26Al 1.8 MeV spatial distribution with the 511 keV emission from Galactic positron annihilation
- Stretch goal: Detect and characterize the different non-gaussian line profiles of the 44Ti 1157 keV line for Cas A
We have modeled the diffuse emission from 26Al and 60Fe in our Galaxy assuming they follow the distribution of massive stars and are in agreement with INTEGRAL/SPI observations. We have also included a model for the 26Al and 60Fe emission from the Cygnus region. We have simulated the 44Ti decay from Cas A with 5 different spectral line profiles to test the resolving power of COSI. Access to the response files and the simulated source and background files, including the full Wasabi path, is detailed in the data-products page
The challenges will use the following detector response files:
- Response26Al.o4.e1805_1812.s10036231691364.m1045.filtered.nonsparse.binnedimaging.imagingresponse_nside16.area.good_chunks.h5.gz
- extended_source_response_Al26_merged.h5.gz
- Response60FeHigh.o4.e1329_1336.s10201526728102.m1287.filtered.nonsparse.binnedimaging.imagingresponse_nside16.area.good_chunks.h5.gz
- Response60FeLow.o4.e1170_1176.s9552269354945.m1188.filtered.nonsparse.binnedimaging.imagingresponse_nside16.area.good_chunks.h5.gz
- extended_source_response_Fe60_low_merged.h5.gz
- extended_source_response_Fe60_high_merged.h5.gz
- Response44Ti.o4.e1154_1160.s9607532021290.m1215.filtered.nonsparse.binnedimaging.imagingresponse_nside16.area.good_chunks.h5.gz
- extended_source_response_Ti44_merged.h5.gz
Each gamma-ray line has a different response, and the response files are labeled with each isotope. Note that 60Fe has two separate response files for the two decay lines at 1173 keV and 1332 keV. Currently, these two line components cannot be analyzed simultaneously, as desribed in the Known Caveats and Limitations section. The pre-computed extended source responses should be used for imaging in Galactic coordinates and the spectral fit notebook when analyzing extended sources (i.e. 26Al or 60Fe).
26Al and 60Fe diffuse emission
Data Files:
26Al_NE2001_3months_unbinned_data_filtered_with_SAAcut.fits.gz
60Fe_NE2001_3months_unbinned_data_filtered_with_SAAcut.fits.gz
Input Models:
The spatial distribution of the 1809 keV line from 26Al decay and the 1173 keV and 1332 keV lines from 60Fe decay in our Galaxy follow the NE2001 model from Cordes & Lazio 2002. The model fits the distribution of massive stars in our Galaxy, with an assumed fraction of 2% for both 26Al and 60Fe in the Galactic bulge. This value is a compromise between SPI observations and results of studies of star formation rate in this region. The flux of 26Al is consistent with measurements from SPI with a total mass of 2.7
Cygnus Region
Data Files:
26Al_Cyg_Region_3months_unbinned_data_filtered_with_SAAcut.fits.gz
60Fe_Cyg_Region_3months_unbinned_data_filtered_with_SAAcut.fits.gz
Input Models:
The characteristics of the 26Al emission from the Cygnus region are obtained from INTEGRAL/SPI observations described in Martin+09, while predictions for the 60Fe lines are derived from the calculation of Martin+10. The spatial distribution, for both 26Al and 60Fe, is modeled with a
44Ti line profiles from Cas A
Data Files:
CasAsymmetric_3months_unbinned_data_filtered_with_SAAcut.fits.gz
CasAunresolved_3months_unbinned_data_filtered_with_SAAcut.fits.gz
CasApartiallyresolved_3months_unbinned_data_filtered_with_SAAcut.fits.gz
CasAfullyresolved_3months_unbinned_data_filtered_with_SAAcut.fits.gz
CasAG16distribution_3months_unbinned_data_filtered_with_SAAcut.fits.gz
Input Models:
The five Cas A simulations all have slightly different line profiles and are meant to test the spectral features recovery capabilities of COSI with a low-resolution response file. The source is modeled as a point source at
Additionally, the G16 model simulates a complex distribution of clumps as detected in Grefenstette+16, where 11 spectral features from 11 different clumps are simulated with varying fluxes and doppler shifts. The 44Ti flux for the G16 is set to 1.66x the measured flux at 68 and 78 keV, matching the SPI observations of Cas A. The goal remains the same: to detect a statistically significant deviation from the Gaussian model.
Galactic
Galactic Data Challenges
There is one Key Objective for Galactic science with COSI:
- Constrain geometries and emission processes in Galactic black holes
While the main science goals for COSI are not directly related to many Galactic sources, there is no doubt that COSI’s unique energy range will bring compelling new observations. We start exploring COSI’s Galactic science capabilities in DC3.
The tools needed to complete the Galactic challenges are demonstrated in the Crab spectral fit, Crab imaging, GRB localization, and Polarization (ASAD method) notebooks. The 511 spectral fit notebook can be used to analyze the Galactic diffuse continuum extended source.
DC3 Goals:
- Perform the image and spectral fit of the Crab
- Perform the image and spectral fit for GRS 1758-258
- Test whether COSI would be able to distinguish between the two models of 1E1740.7-2942
- Measure MeV gamma-ray flux during the Gamma-ray binary flare event from PSR B1259-63 and determine the duration of flare periods
- Fit for the average Cyg X3 spectrum over the 3 months of observation and attempt to detect the spectral changes on shorter time scales
- Check the detection sensitivity in the Cyg X1 soft state and hard state
- Measure the polarization of Cyg X1 in the hard state
- Measure the spectra and polarization of transient sources MAXI J1820+070 and J1348-063
- Characterize how the the Galactic diffuse continuum emission as a background component impacts the sensitivity for point sources in the Galactic plane
- Image the Galactic diffuse continuum emission in the COSI energy band
- Measure the spectrum of the Galactic diffuse continuum emission, extracting it from the rest of the background
- Stretch goal: Make a time cut for the Cyg X1 soft and hard states and combine the two simulations to create a spectral transition, and determine COSI’s ability to detect this transition.
We have simulated 8 Galactic point sources: 3 steady-state sources (Crab, GRS 1758-258, 1E1740.7-2942) for spectral analysis, 2 transient sources (PSR B1259-63 has a single flare, and Cyg X3 has 6 state transitions), and 3 sources with polarization (Cyg-X1, MAXI J1820+070 and J1348-630). We also have a simulation of the Galactic diffuse emission, which can be included as an additional background component, or directly analysed as a source. Access to the response files and the simulated source and background files, including the full Wasabi path, is detailed in the data-products page.
The challenges will use the following detector response files:
- ResponseContinuum.o3.e100_10000.b10log.s10396905069491.m2284.filtered.nonsparse.binnedimaging.imagingresponse_nside8.area.good_chunks.h5.gz
- ResponseContinuum.o3.pol.e200_10000.b4.p12.s10396905069491.m441.filtered.nonsparse.binnedpolarization.11D_nside8.area.good_chunks.h5.gz
- extended_source_response_continuum_merged.h5.gz
Note: the second response file is used for polarization analysis and the extended source response should be used to analyze the Galactic diffuse continuum when imaging or performing the spectral fit.
Steady-state sources
Data Files:
crab_standard_3months_unbinned_data_filtered_with_SAAcut.fits.gz
Crab_Flat_Spectrum_3months_unbinned_data_filtered_with_SAAcut.fits.gz
GRS175_3months_unbinned_data_filtered_with_SAAcut.fits.gz
1E1740_compow_3months_unbinned_data_filtered_with_SAAcut.fits.gz
1E1740_twocompt_3months_unbinned_data_filtered_with_SAAcut.fits.gz
Crab Input Models:
The standard Crab model is the same as DC1 and DC2. The point source is located at
GRS 1758-258 Input Models:
The spectral model for the microquasar GRS 1758-258 near the Galactic center is based on the best fit comptonization model of epoch2 from Pottschmidt+06. This is the same model as used in DC2. The source is located at
1E1740.7-2942 Input Models:
The 2 spectral models for microquasar 1E1740.7-2942 (also known as the Great Annihilator) are the best fit models of INTEGRAL data obtained by Bouchet+09. The "compow" simulation is a thermal comptonization + powerlaw model, while the "twocompt" simulation has two components of thermal comptonization with different temperatures. Both models represent the INTEGRAL data well but strongly differ at the highest energies. The source is located close to the Galactic center at
Transient sources
Data Files:
PSRB1259_3months_unbinned_data_filtered_with_SAAcut.fits.gz
PSRB1259_10x_3months_unbinned_data_filtered_with_SAAcut.fits.gz
cygX3_3months_unbinned_data_filtered_with_SAAcut.fits.gz
PSR B1259-63 Input Models:
PSR B1259-63 is a binary gamma-ray system consisting of a radio pulsar and a massive Be-type star. The next periastron passage will be November 19th, 2027, making this a prime target for COSI. The model for DC3 is based on the work in Abdo+11 (see Model 1 in the bottom of Figure 5). The flare is simulated as a constant emission for 30 days (T > 1839487305.0). We have included two flux scenarios: the nominal value from Abdo+11 is
Cyg X3 Input Models:
In DC3 we have simulated 6 different spectral models for Cyg X3 (
Percent of time of observed state -> time in days (2 year mission) -> time in days (DC3) Quiescent: 6. 47 5.4 Transition 46. 335 41.4 FHXR: 10. 73 9.0 FIM: 22. 161 19.8 FSXR: 5.4 28 4.8 Hypersoft: 12. 86 10.8
Since DC3 simulates 3 months of exposure, we reduced the time spent in each state accordingly and increased the fluxes by a factor of 7.9 to obtain the same total number of photons as we would in 2 yrs of observations. Note: The background rates should also be scaled up in the same way to preserve the expected S/N ratio!
Polarized Sources
Data Files:
cygX1_soft_3months_unbinned_data_filtered_with_SAAcut.fits.gz
cygX1_hard_3months_unbinned_data_filtered_with_SAAcut.fits.gz
MAXI1820_3months_unbinned_data_filtered_with_SAAcut.fits.gz
MAXI1348_3months_unbinned_data_filtered_with_SAAcut.fits.gz
Cyg X1 Input Models:
The two spectral models for Cyg X1 (
MAXI J1820+070 and J1348-630 Input Models:
The spectral models for two black hole X-ray binaries, MAXI J1820+070 and MAXI J1348-630, are based on INTEGRAL data (Fig 3 of Cangemi+23), in the hard state. The polarimetric models corresponds to the measurements shown in Table 3 of the same paper. The input polarization models are divided into a low energy component (0.1 - 0.4 MeV) and a high energy component (0.4 - 10 MeV). MAXI J1820+070 and MAXI J1348-630 remained in the hard state for 60 days and 7 days, respectively, and for DC3 we have the sources 'on' for these respective times and off for the rest of the time. Therefore, MAXI J1820+070 has a nominal flux of
Galactic diffuse continuum
Data Files:
GalTotal_SA100_F98_3months_unbinned_data_filtered_with_SAAcut.fits.gz
Input Models:
The Galactic diffuse continuum emission is modeled using the v57 release of the GALPROP cosmic ray (CR) propagation and interstellar emissions framework (Porter+22). GALPROP self-consistently calculates spectra and abundances of Galactic CR species and associated diffuse emissions (gamma rays, X-rays, and radio) in 2D and 3D. The v57 release includes a set of steady-state emission model examples that reproduce the latest CR data. There are six models in total, categorized according to the CR source and interstellar radiation field (ISRF) model used for the prediction. There are 3 CR source models (SA0, SA50, SA100) and two ISRF models (R12, F98). The CR source density models are based on the distribution of injected CR power, with SA0 describing an axisymmetric disk (following the radial distribution of pulsars), SA50 describing a 50/50% split of the injected CR luminosity between disk-like and spiral arms, and SA100 describing pure spiral arms. All models have the same exponential scale height of 200 pc. The two ISRF models employ different spatial densities for both the stars and the dust but produce intensities very similar to those of the data for near- to far-infrared wavelengths at the location of the solar system (see Porter+17 and references therein). For the neutral gas distributions (atomic and molecular), a 3D model from Johannesson+18 is employed. These GALPROP models include the total emission, which is dominated by inverse Compton radiation, but also has a small contribution from Bremsstrahlung towards the upper energy bound. As our representative case, for DC3 we simulate the SA100-F98 model.
This is the first data challenge to include the Galactic diffuse continuum, and our corresponding goals this year are straight forward. Future data challenges will look more into the different model variations and key parameters.
Note that the Galactic diffuse continuum emission is also part of the standard background model for COSI, which will be employed for most analyses.
Extragalactic
Extragalactic Data Challenges
There is one Key Objective for Extragalactic science with COSI:
- Constrain geometries and emission processes in Active Galactic Nuclei (AGN)
While the main science goals for COSI are not directly related to many extragalactic sources, there is no doubt that COSI’s unique energy range will bring compelling new observations. We start exploring COSI’s extragalactic science capabilities in DC3.
The tools needed to complete the Extragalactic challenges are demonstrated in the Crab spectral fit and Polarization (ASAD method) notebooks.
DC3 Goals:
- Perform the image and spectral fit of NGC 1068 to determine the flux in the COSI band and the coronal cut-off energy
- Perform the image and spectral fit of NGC 4151 to determine the flux in the COSI band and the coronal cut-off energy
- Test whether COSI would be able to distinguish between the different NGC 4151 spectral models
- Measure MeV gamma-ray flux during the 4C+21.35 flare and determine the duration of the flaring period
- Fit the 4C+21.35 spectrum in the flaring and non-flaring state
- Perform the image and spectral fit of 3C 279
- Measure the polarization fraction and angle of 3C 279 in the hard state
We have simulated 4 extragalactic sources: 2 are steady state sources (NGC 1068 and NGC 4151), 1 transient source (4C+21.35), and 1 polarized source (3C 279). Access to the response files and the simulated source and background files, including the full Wasabi path, is detailed in the data-products page.
All challenges should use the same detector response files:
- ResponseContinuum.o3.e100_10000.b10log.s10396905069491.m2284.filtered.nonsparse.binnedimaging.imagingresponse_nside8.area.good_chunks.h5.gz
- ResponseContinuum.o3.pol.e200_10000.b4.p12.s10396905069491.m441.filtered.nonsparse.binnedpolarization.11D_nside8.area.good_chunks.h5.gz
Note: the second response file is used for polarization analysis.
Steady state sources
Data Files:
NGC_1068_3months_unbinned_data_filtered_with_SAAcut.fits.gz
NGC_4151_bright_3months_unbinned_data_filtered_with_SAAcut.fits.gz
NGC_4151_EC200_3months_unbinned_data_filtered_with_SAAcut.fits.gz
NGC_4151_EC1000_3months_unbinned_data_filtered_with_SAAcut.fits.gz
NGC_4151_faint_3months_unbinned_data_filtered_with_SAAcut.fits.gz
NGC 1068 Input Models:
The simulation for NGC 1068 is based on the spectral analysis in Bauer+2015. The source is located at
NGC 4151 Input Models:
NGC 4151 is a Seyfert galaxy which hosts one of the brightest AGN. For DC3, we have included 4 different spectral models for NGC 4151 which are all defined as a powerlaw with exponential cut-off. From NuSTAR observations, with the flux calibrated in the 20-30 keV range, we have simulated two different cut-off energies based on the analysis presented in Keck+ 2015:
- NGC_4151_ec200: Gamma=1.75, Ecut=200 keV
- NGC_4151_ec1000: Gamma=1.75, Ecut=1000 keV
From INTEGRAL observations there are two cut-off energies and two power-law gamma factors to model two different states as presented in Lubinski+ 2010:
- NGC_4151_bright: Gamma=1.71, Ecut=264 keV
- NGC_4151_faint: Gamma=1.81, Ecut=1000 keV
The source is located at
Transient source
Data Files:
4C21p35_noflare_3months_unbinned_data_filtered_with_SAAcut.fits.gz
4C21p35_flare_3months_unbinned_data_filtered_with_SAAcut.fits.gz
4C+21.35 Input Models:
We have included a simulation of quasar 4C+21.35 which has both a quiescent state simulation (noflare) and a flaring state in separate files. The lightcurve describing the quiescent flux and the flare is based on Fermi-LAT data, where the flaring state is initiated when the reported flux is 3x greater than the 16-years average flux. The normalization is derived from the integrated flux in the COSI energy band, based on an extrapolation of the Fermi-LAT log parabola function. The two states come with two different spectra: both powerlaws with different indices, where the non-flaring state has and index of 1.6, and the flaring state has an index of 2.5. The flaring state has variable flux between 1838089680 < T < 1840422465, and is zero elsewhere, and the non-flaring state is the opposite, as seen in the light curve below. Combine these files to represent a 4C+21.35 flare with a different spectral model in the flaring state.
Polarized source
Data Files:
3C279_3months_unbinned_data_filtered_with_SAAcut.fits.gz
3C 279 Input Models:
The spectral model for the DC3 simulation of quasar 3C 279 represents the high state of the source, where we have used a flux 100x its nominal value. The spectra is described is based on the model in Zhang+15, with a total integrated flux of
Dark Matter
Dark Matter Data Challenges
While the main science goals for COSI are not directly related to dark matter, there is no doubt that COSI’s unique energy range will bring compelling new observations. We start exploring COSI’s dark matter science capabilities in DC3.
The tools needed to complete these challenges are demonstrated in the 511 imaging and 511 spectral fit notebooks.
DC3 Goals:
- Image the two photon (gg) spatial distribution (either NFW or Burkert models)
- Fit for the spectra of the gg or eeg annihilation model
- Confirm that COSI is sensitive to the different spatial distributions modeled in the Burkert and NFW profiles, for either the gg or eeg model
- Estimate COSI’s sensitivity to annihilating WIMPs
Data Files:
eeg_Bur_3months_unbinned_data_filtered_with_SAAcut.fits.gz
eeg_NFW_3months_unbinned_data_filtered_with_SAAcut.fits.gz
gg_Bur_3months_unbinned_data_filtered_with_SAAcut.fits.gz
gg_NFW_3months_unbinned_data_filtered_with_SAAcut.fits.gz
Input Models:
We consider cases where two dark matter (DM) particles annihilate into either two photons (gg) or an e+e− pair with a FSR (final state radiation) photon (eeg). We use profile parameters describing the NFW or Burkert profile taken from Cirelli+10, and the fragmentation function for the eeg case is from Coogan+19, assuming a scalar mediator. For each of these models, we assume a DM mass of 3 MeV and an annihilation cross section of
The items listed here are some of the priorities for DC4 development. These can be considered as extra/advanced challenges, and anybody is welcomed to work on them, with the ultimate goal of implementing the software solutions into cosipy. Also see the bottom of the cosipy-intro for a related discussion on the next steps of development.
- Image deconvolution does not consider Earth occultation or SAA passage when computing the point source response on the fly. To include these effects, use a precomputed point source response file instead.
- The polarization analysis is currently only working for short transients that are >15 degrees off-axis. The polarization analysis tools are currently limited to short transients because they do not yet account for the changing aspect information. This will be updated soon. Additionally, the issue relating to sources that are close to on-axis stems from the polarization response, which is defined with respect to the instrument's z-axis, leading to an undefined polarization angle for on-axis sources. The problem actually pertains to all sources within ~15 degrees due to the coarse response binning. We are working on providing a polarization response defined with respect to the instrument's x-axis (where the effective area is much smaller), which will help to alleviate this issue.
- It is not currently possible to simultaneously fit continuum and line components. We have separate response files for different emission components (i.e. continuum, 511 keV, Aluminum-26, etc.), and with the current binned analysis setup in cosipy, the data binning needs to match the response binning, and thus only a single component can be analyzed at a time. Possible solutions to this include:
- Creating a single response for all components
- Creating a class that automatically matches an input model with the corresponding response
- Reparameterizing the response in such a way that prevents this issue
- Methods need to be developed to determine the response for broadened and offset line emission. These methods should utilize the baseline response files (e.g. 511 keV, Aluminum-26, Iron-60, etc.), and allow for analyzing any arbitrary line emission.
- The background estimation tools need to be further tested and developed. With DC3 we have provided first versions for estimating continuum and line backgrounds. These methods need to be tested, stressed, and further developed. Additionally, we still need background estimation tools for transient sources.
- The way in which parameters are configured needs to be refined, and callable scripts need to be added. By callable scripts we are referring to command-line options that will perform common task, such as producing light curves and spectra.
- The tools still need to be stressed to find limitations. The COSI pipeline team has been rapidly developing the cosipy library in preparation for the satellite mission. Our aim is to make this library robust, sustainable, and highly user-friendly. Through more and more user interactions and feedback, we can better learn where the code is working well, and where it breaks down.
Previous, current, and planned releases are summarized below (click to expand):
Data Challenge 1 (link): March 2023
- Focused on the 2016 COSI Balloon flight.
- Release includes real flight data for the Crab.
- Main goal is to learn the fundamentals of analyzing Compton data with COSI.
- The analysis tools used for DC1 are only preliminary (referred to as cosipy classic).
- Developed by Thomas Siegert for analysis of the 2016 balloon data.
- Contains 3 straightforward examples of COSI’s science goals:
- Extracting energy spectra from the Crab, Cen A, Cygnus X-1, and Vela.
- Imaging bright point sources, such as the Crab and Cygnus X-1.
- Imaging diffuse emission from the positron-electron annihilation 511 keV and Al-26 1.8 MeV gamma-ray lines.
Data Challenge 2 (link): March 2024
- Focused on COSI SMEX mission.
- First (alpha) release of cosipy.
- Data challenges for all the main science groups (none for dark matter and solar).
- All models and challenges provided by respective COSI science teams.
- Uses 3 months of observations, for an equatorial orbit at 550 km, with a zenith pointing.
- All background components are included, except for SAA passage (i.e. trapped cosmic rays).
- Background also includes time variability from changing geomagnetic cutoff.
- We simulated 12 background components, and 30 unique sources, running 49 different source simulations in total (using multiple models for some of the sources).
- Contains 8 main tutorials demonstrating all the tools/methods needed for completing the challenges, included as part of the cosipy release:
- dataIO
- detector response
- GRB localization
- GRB spectral fit
- Crab spectral fit
- 511 spectral fit
- Crab imaging
- 511 imaging
Data Challenge 3 (current release): April 2025
- Focused on COSI SMEX mission.
- Data challenges for all the main science groups (including dark matter), covering all of COSI's primary science objectives.
- All models and challenges provided by respective COSI science teams.
- Used 3 months of observations, for an equatorial orbit at 530 km.
- Simulations include rocking of instrument:
- Pointing changes between +/- 22 degrees every 12 hrs, with 8 minute transition time.
- Used detailed COSI SMEX mass model.
- Simulated all background components in low-Earth orbit, including variability from geomagnetic cutoff, long-term buildup, and full SAA passage.
- Background also includes the Galactic diffuse continuum for the first time.
- New methods in both MEGAlib and cosipy to account for Earth occultation with a non-zenith pointing.
- First time including polarization.
- Restructuring and refinement of the cosi-data-challenges repository.
- Simulated 43 unique sources, running 71 different source simulations in total (using multiple models for some of the sources).
- Numerous improvements to cosipy:
- First version of source injector.
- New implementation of Earth occultation in point source response.
- First polarization tools.
- New methods to estimate the background for continuum sources and line sources.
- Refinements and further developments of imaging class.
- New extended source response class.
- Contains 5 new tutorials (in addition to the ones released with DC2) demonstrating the tools/methods needed for completing the challenges, included as part of the cosipy release:
- Source injector
- TS maps
- Polarization (ASAD method)
- Continuum background estimation
- Line background estimation
Data Challenge 4: Planned for March 2026
- Currently under development following the release of DC3.
Data Challenge 5: Planned for March 2027
- Final release before launch 🚀!
If you make use of any of the data products from the COSI Data Challenges in a publication, please provide a link to this page and cite Zoglauer+23 and Martinez-Castellanos+23.