Simulating from Multiple Surveys

LightCurveLynx can simulate observations from multiple surveys in a single run. In this notebook, we show how to perform these simulations and discuss how it works.

[1]:

import numpy as np

from lightcurvelynx.astro_utils.passbands import PassbandGroup
from lightcurvelynx.graph_state import GraphState
from lightcurvelynx.models.basic_models import ConstantSEDModel
from lightcurvelynx.obstable.opsim import OpSim
from lightcurvelynx.simulate import simulate_lightcurves
from lightcurvelynx.survey_info import SurveyInfo

# Usually we would not hardcode the path to the passband files, but for this demo we will use a relative path
# to the test data directory so that we do not have to download the files.
table_dir = "../../tests/lightcurvelynx/data/passbands"

/home/docs/checkouts/readthedocs.org/user_builds/lightcurvelynx/envs/latest/lib/python3.12/site-packages/tqdm/auto.py:21: TqdmWarning: IProgress not found. Please update jupyter and ipywidgets. See https://ipywidgets.readthedocs.io/en/stable/user_install.html
  from .autonotebook import tqdm as notebook_tqdm

Prerequisite Data

For each survey that we would like to include in the simulation, we need two pieces of information:

An ObsTable that includes the survey’s pointing and noise information.
A PassbandGroup for that survey.

Let’s start by creating toy data for each survey.

The first survey will include pointings at two locations (0.0, 10.0) and (180.0, -10.0) in the “g” and “r” bands.

[2]:

obsdata1 = {
    "time": [0.0, 1.0, 2.0, 3.0],
    "ra": [0.0, 0.0, 180.0, 180.0],
    "dec": [10.0, 10.0, -10.0, -10.0],
    "filter": ["g", "r", "g", "r"],
    "zp": [5.0, 6.0, 7.0, 8.0],
    "seeing": [1.12, 1.12, 1.12, 1.12],
    "skybrightness": [20.0, 20.0, 20.0, 20.0],
    "exptime": [29.2, 29.2, 29.2, 29.2],
    "nexposure": [2, 2, 2, 2],
}
obstable1 = OpSim(obsdata1)

passband_group1 = PassbandGroup.from_preset(
    preset="LSST",
    table_dir=table_dir,
    filters=["g", "r", "i"],
)

survey_info1 = SurveyInfo(obstable=obstable1, passbands=passband_group1, survey_name="LSST")

The second survey will include pointings at two locations (0.0, 10.0) and (90.0, -10.0) in the “r” and “z” bands.

[3]:

obsdata2 = {
    "time": [0.5, 1.5, 2.5, 3.5],
    "ra": [0.0, 90.0, 0.0, 90.0],
    "dec": [10.0, -10.0, 10.0, -10.0],
    "filter": ["r", "i", "r", "i"],
    "zp": [1.0, 2.0, 3.0, 4.0],
    "seeing": [1.12, 1.12, 1.12, 1.12],
    "skybrightness": [20.0, 20.0, 20.0, 20.0],
    "exptime": [29.2, 29.2, 29.2, 29.2],
    "nexposure": [2, 2, 2, 2],
}
obstable2 = OpSim(obsdata2)

passband_group2 = PassbandGroup.from_preset(
    preset="LSST",
    table_dir=table_dir,
    filters=["r", "i"],
)

survey_info2 = SurveyInfo(obstable=obstable2, passbands=passband_group2, survey_name="LSST")

Note that, while we use the OpSim objects for both ObsTables and we use LSST preset for both passbands, this is only for simplicity of the example (only some of the data files are installed by default).

In most cases users will be interested in combining data from multiple surveys. For example we could simulate a trio of surveys: Rubin ( OpSim and LSST passbands), ZTF (ZTFObsTable with ZTF passbands) and Roman (data coming soon).

Model Creation

Nothing changes in the model creation step. We define a model and its parameters as we would with any other simulation. The model is just a recipe for generating the noise-free fluxes given parameter values. It doesn’t need to know anything about the survey.

Here we use a constant SED model (same value for all times and wavelengths). We place the object at (0.0, 10.0) so it is observed by some of the pointings from each survey.

[4]:

model = ConstantSEDModel(brightness=100.0, t0=0.0, ra=0.0, dec=10.0, redshift=0.0, node_label="my_star")

If we want to sample the positions (RA, dec) from the footprints of the combined surveys, we can use the ApproximateMOCSampler. That class contains built-in constructor methods for creating a sampling footprint from either the union or intersection of footprints of the individual surveys. See the sampling_positions notebook for details.

Simulation

We run the simulation by passing in a list of SurveyInfo (note this changed in v0.4.0. In previous versions, users passed lists of ObsTable and PassbandGroup). The parameter space is only sampled once for each object, so the observations in each survey are consistent with respect to the parameterization. The times of observation and filters used are determined by each survey. And the bandflux is computed using that survey’s passbands.

We can simulate a single object and look at its light curve.

[5]:

results = simulate_lightcurves(
    model,
    1,
    [survey_info1, survey_info2],
    obstable_save_cols=["zp"],
)
print(results["lightcurve"][0])

Simulating: 100%|██████████| 1/1 [00:00<00:00, 237.75obj/s]

   mjd filter        flux      fluxerr  flux_perfect  survey_idx  obs_idx  \
0  0.0      g  753.159185   895.605471         100.0           0        0
1  0.5      r  136.931841   338.724102         100.0           1        0
2  1.0      r -354.556054  1015.412583         100.0           0        1
3  2.5      r   60.708472   642.443596         100.0           1        2

   is_saturated   zp
0         False  5.0
1         False  1.0
2         False  6.0
3         False  3.0

In addition to the standard information like mjd and flux, we capture the survey index to help us trace which survey the data came from. This is stored in the survey_idx column of the nested light curve table. We also can save per-survey, per-observation data, such as the zero point for that observation, using the obstable_save_cols parameter as shown above.

Adding Spectra

The list of “PassbandGroups” can also include Spectrograph objects in order to evalue spectra. Let’s add a third survey with a spectrograph. Since spectographic surveys use different information, we can use the simpler SpectrographObsTable.

[6]:

from lightcurvelynx.astro_utils.spectrograph import Spectrograph
from lightcurvelynx.obstable.spectrograph_table import SpectrographObsTable

obsdata3 = {
    "time": [0.0, 1.0, 2.0, 3.0],
    "ra": [0.0, 0.0, 180.0, 180.0],
    "dec": [10.0, 10.0, -10.0, -10.0],
}
obstable3 = SpectrographObsTable(obsdata3)
spectograph = Spectrograph(np.arange(4000, 8000, 100))

survey_info3 = SurveyInfo(obstable=obstable3, passbands=spectograph, survey_name="spectrograph")

We add the spectrograph information as another survey and “passbandgroup”.

[7]:

results2 = simulate_lightcurves(
    model,
    1,
    [survey_info1, survey_info2, survey_info3],
    obstable_save_cols=["zp"],
)
results2.head()

Simulating: 100%|██████████| 1/1 [00:00<00:00, 225.67obj/s]

[7]:

id

ra

dec

nobs

t0

z

lightcurve

params

spectra

0

0.000000

10.000000

6

0.000000

mjd	filter	...	is_saturated	zp
0.0	g	...	False	5.0
+3 rows	...	...	...	...

{'my_star.ra': 0.0, 'my_star.dec': 10.0, 'my_star.redshift': 0.0, 'my_star.t0': 0.0, 'my_star.distance': None, 'my_star.brightness': 100.0}

mjd	waves	measured_flux	instrument
0.0	[4000, 4100, 4200, 4300, 4400, 4500, 4600, 4700, 4800, 4900, 5000, 5100, 5200, 5300, 5400, 5500, 5600, 5700, 5800, 5900, 6000, 6100, 6200, 6300, 6400, 6500, 6600, 6700, 6800, 6900, 7000, 7100, 7200, 7300, 7400, 7500, 7600, 7700, 7800, 7900]	[100.0, 100.0, 100.0, 100.0, 100.0, 100.0, 100.0, 100.0, 100.0, 100.0, 100.0, 100.0, 100.0, 100.0, 100.0, 100.0, 100.0, 100.0, 100.0, 100.0, 100.0, 100.0, 100.0, 100.0, 100.0, 100.0, 100.0, 100.0, 100.0, 100.0, 100.0, 100.0, 100.0, 100.0, 100.0, 100.0, 100.0, 100.0, 100.0, 100.0]	Spectrograph
1.0	[4000, 4100, 4200, 4300, 4400, 4500, 4600, 4700, 4800, 4900, 5000, 5100, 5200, 5300, 5400, 5500, 5600, 5700, 5800, 5900, 6000, 6100, 6200, 6300, 6400, 6500, 6600, 6700, 6800, 6900, 7000, 7100, 7200, 7300, 7400, 7500, 7600, 7700, 7800, 7900]	[100.0, 100.0, 100.0, 100.0, 100.0, 100.0, 100.0, 100.0, 100.0, 100.0, 100.0, 100.0, 100.0, 100.0, 100.0, 100.0, 100.0, 100.0, 100.0, 100.0, 100.0, 100.0, 100.0, 100.0, 100.0, 100.0, 100.0, 100.0, 100.0, 100.0, 100.0, 100.0, 100.0, 100.0, 100.0, 100.0, 100.0, 100.0, 100.0, 100.0]	Spectrograph

1 rows x 9 columns

The results now have two nested columns. As before, the ‘lightcurve’ column contains all of the light curve points from all of the (non-spectrographic) surveys. A new ‘spectrograph’ column contains each recorded spectra.

Running Separate Simulations

In some cases, we may want to try a set of objects on different survey conditions independently. For example, we might want to look at the exact same set of items under ZTF and Rubin independently.

LightCurveLynx provides the ability to replay a previous simulation by extracting the first simulation’s GraphState and manually providing it to a simulation. In this case, the simulation uses the given GraphState instead of sampling a new one.

Let’s look at re-running the first example (of two LSST-based surveys) with the survey’s evaluated independently on the exact same objects.

[8]:

# Run the first simulation.
results1 = simulate_lightcurves(model, 10, survey_info1)
print("\nLight Curve from Survey 1:\n", results1["lightcurve"][0])

# Extract the GraphState (parameter values) from the first simulation.
previous_state = GraphState.from_list(results1["params"].values)

# Run the second simulation, using the previous state as the initial state.
results2 = simulate_lightcurves(
    model,
    10,
    survey_info2,
    graph_state=previous_state,
)
print("\nLight Curve from Survey 2:\n", results2["lightcurve"][0])

Simulating: 100%|██████████| 10/10 [00:00<00:00, 684.15obj/s]


Light Curve from Survey 1:
    mjd filter         flux      fluxerr  flux_perfect  survey_idx  obs_idx  \
0  0.0      g  2479.544900   895.605471         100.0           0        0
1  1.0      r  -612.910928  1015.412583         100.0           0        1

   is_saturated
0         False
1         False

Simulating: 100%|██████████| 10/10 [00:00<00:00, 688.90obj/s]


Light Curve from Survey 2:
    mjd filter        flux     fluxerr  flux_perfect  survey_idx  obs_idx  \
0  0.5      r  179.358165  338.724102         100.0           0        0
1  2.5      r  377.187588  642.443596         100.0           0        2

   is_saturated
0         False
1         False

It is possible to change the values within the GraphState object before passing it to the simulate_lightcurves() function. For example, you might want to change the objects’ t0 values to correspond to the new survey’s time range. However, care should be taken when changing the values within the GraphState object. If other parameters depend on the values you change, they will not be updated automatically and you can end up with inconsistent results.

FAQ

Can we use multiple surveys and parallelization?

Yes.

Can we re-simulate objects with parallelization?

Yes.

What happens if the filter names overlap?

For the simulation, everything will just work. Since you are passing one PassbandGroup for each ObsTable it will only look for the matching filter names in that group. So if you are using LSST data with observations in r and then ZTF data with observations in r, the SEDs will be integrated by the correct filter transmission at each step.

However the name of the filter in the result column will match what appears in the ObsTable columns. So you may only see ‘r’ for each entry. If you are interested in plotting the light curves or performing any other per-filter analysis, you can expand the filter names with the survey prefix using test_results_use_full_filter_names()

[9]:

from lightcurvelynx.utils.post_process_results import results_use_full_filter_names

print("Filters (before):", list(results["lightcurve.filter"].values))

results_use_full_filter_names(results, [passband_group1, passband_group2])
print("Filters (after):", list(results["lightcurve.filter"].values))

Filters (before): ['g', 'r', 'r', 'r']
Filters (after): ['LSST_g', 'LSST_r', 'LSST_r', 'LSST_r']