Building Simple Models

In this tutorial we look at how to build a simple model and sample the parameters from a variety of basic sources. For more complex sampling see the notebooks: sampling.ipynb, adding_models.ipynb, and using_pzflow.ipynb.

Parameterized Nodes

All sources of information in LightCurveLynx live as ParameterizedNode objects. This allows us to link the nodes (and their variables) together and sample them as a single block. As you will see in this tutorial, most of the nodes are specific to the object that you want to simulate. For example if we wanted to create a model with constant brightness in the night sky with a brightness of 10, we could use:

[1]:

import matplotlib.pyplot as plt
import numpy as np

from lightcurvelynx.models.basic_models import ConstantSEDModel

model = ConstantSEDModel(brightness=10.0, node_label="a_star")

/home/docs/checkouts/readthedocs.org/user_builds/lightcurvelynx/envs/latest/lib/python3.12/site-packages/tqdm/auto.py:21: TqdmWarning: IProgress not found. Please update jupyter and ipywidgets. See https://ipywidgets.readthedocs.io/en/stable/user_install.html
  from .autonotebook import tqdm as notebook_tqdm

Users can get information about the parameterized node using Python’s builtin help() function or display a list of the parameters with the describe_params() function.

[2]:

model.describe_params()

Parameters in a_star:

ra:
    Description: The object's right ascension (in degrees)
    Source: CONSTANT with value = None

dec:
    Description: The object's declination (in degrees)
    Source: CONSTANT with value = None

redshift:
    Description: The object's redshift.
    Source: CONSTANT with value = None

t0:
    Description: The phase offset in MJD.
    Source: CONSTANT with value = None

distance:
    Description: The object's luminosity distance (in pc)
    Source: CONSTANT with value = None

brightness:
    Description: The inherent brightness of the object.
    Source: CONSTANT with value = 10.0

A ParameterizedNode can then be sampled with the sample_parameters() function. This samples the internal parameters of the model. For example we might be sampling an object’s brightness (as with the constant SED model), a host galaxy’s mass, or even the (RA, dec) position of the observations. The sample_parameters() function returns a special data structure, the GraphState, that captures this state and can later be fed into the models to generate fluxes.

Note: Users do not need to know the details of the GraphState storage, only that it can be accessed like a dictionary using the node’s label and the variable name.

[3]:

state = model.sample_parameters(num_samples=10)
state["a_star"]["brightness"]

[3]:

array([10., 10., 10., 10., 10., 10., 10., 10., 10., 10.])

The sample function produced 10 independent samples of our system’s state.

The brightness values of these samples are not particularly interesting because we were sampling from a fixed parameter. The brightness is always 10.0. However LightCurveLynx allows the user to set a node’s parameter from a variety of sources including constants (as with 10.0 above), the values stored in other nodes, or even the results of a “functional” or “computation” type node (more about that later).

LightCurveLynx includes the built-in NumpyRandomFunc which will sample from a given numpy function and use the results to set a given parameter.

[4]:

from lightcurvelynx.math_nodes.np_random import NumpyRandomFunc

brightness_func = NumpyRandomFunc("uniform", low=11.0, high=15.5)
model2 = ConstantSEDModel(brightness=brightness_func, node_label="a_star_2")
state = model2.sample_parameters(num_samples=10)

state["a_star_2"]["brightness"]

[4]:

array([11.91654308, 14.61889382, 13.73991486, 12.56756869, 12.48179135,
       12.16865578, 14.11436686, 13.58573807, 11.57720639, 15.15130824])

Now each of our 10 samples use different a different brightness value.

We can make the distributions of objects more interesting by using combinations of randomly generated parameters. Different random generators can be specified for different parameters such as brightness and redshift. For example we can sample the brightness from a Gaussian distribution and sample the redshift from a uniform distribution.

[5]:

model3 = ConstantSEDModel(
    brightness=NumpyRandomFunc("normal", loc=20.0, scale=2.0),
    redshift=NumpyRandomFunc("uniform", low=0.1, high=0.5),
    node_label="test",
)

num_samples = 10
state = model3.sample_parameters(num_samples=num_samples)
for i in range(num_samples):
    print(f"{i}: brightness={state['test']['brightness'][i]} redshift={state['test']['redshift'][i]}")

0: brightness=18.938014856944918 redshift=0.3848799914202168
1: brightness=21.72440418574937 redshift=0.1387807639547574
2: brightness=21.054432914547963 redshift=0.19796760982214892
3: brightness=18.56675119682633 redshift=0.11734872518392572
4: brightness=18.312534668065663 redshift=0.3459114936846488
5: brightness=18.964249784606434 redshift=0.21763114861413319
6: brightness=18.9921979298807 redshift=0.26483083902020865
7: brightness=20.717152249777158 redshift=0.26949007060979724
8: brightness=17.5244327382566 redshift=0.12616883133280893
9: brightness=21.85111473244421 redshift=0.14832430065929725

The sampling process creates vectors of samples for each parameter such that the i-th value of each parameter is from the same sampling run. So in the output above, sample 0 consists of all the parameter values for that sample (everything at index=0), sample 1 consists of all parameter values for that sample (everything at index=1), and so forth. This is critically important once we start dealing with parameters that are not independent. We might want to choose the host galaxy’s mass and redshift from a joint distribution. Alternatively, as we will see below, we will often want to compute one parameter as a mathematical transform of another or sample one parameter based on the value of another. In all of these cases it is important that we can access the parameters that were generated together and that these parameters stay consistent.

We can slice out a single sample using extract_single_sample() and display it.

[6]:

single_sample = state.extract_single_sample(0)
print(str(single_sample))

test:
    ra: None
    dec: None
    redshift: 0.3848799914202168
    t0: None
    distance: None
    brightness: 18.938014856944918
NumpyRandomFunc:uniform_1:
    low: 0.1
    high: 0.5
    function_node_result: 0.3848799914202168
NumpyRandomFunc:normal_2:
    loc: 20.0
    scale: 2.0
    function_node_result: 18.938014856944918

You’ll notice that there are more parameters than we manually set. Some of these are for internal bookkeeping. Parameters are created automatically by the nodes if needed. In general the user should not need to worry about these extra parameters. They can access the ones of interest with the dictionary notation.

Function Nodes

Sampling functions, such as those provided by numpy, are only one type of function that we might want to use to generate parameters. We might want to sample from other functions or apply a mathematical transform to multiple input parameters to compute a new parameter. For example consider the case of computing the distmod parameter from redshift. We can do this using the information about the cosmology, such as provided by astropy’s FlatLambdaCDM class:

[7]:

from astropy.cosmology import FlatLambdaCDM

cosmo_obj = FlatLambdaCDM(H0=73.0, Om0=0.3)
redshifts = np.array([0.1, 0.2, 0.3])
distmods = cosmo_obj.distmod(redshifts).value
print(distmods)

[38.22408047 39.8651567  40.86433594]

Importantly, you can use a FunctionNode that takes input parameter(s) and produces output parameter(s) during the generation. FunctionNode is a subclass of ParameterizedNode that wraps the functionality of collecting the inputs, running the computations, and storing the output. The user simply needs to provide the function node with the function it will use and the parameters. For example, LightCurveLynx has a DistModFromRedshift class that wraps the previous operation.

The below code samples a redshift uniformly from [0.1, 0.5], uses it to compute the distmod parameter, and sets that.

[8]:

from lightcurvelynx.astro_utils.snia_utils import DistModFromRedshift

distmod_obj = DistModFromRedshift(
    H0=73.0, Omega_m=0.3, redshift=NumpyRandomFunc("uniform", low=0.1, high=0.5)
)

or more concretely, we can create our own FunctionNode that computes y = m * x + b.

[9]:

from lightcurvelynx.base_models import FunctionNode


def _linear_eq(x, m, b):
    """Compute y = m * x + b"""
    return m * x + b


func_node = FunctionNode(
    _linear_eq,  # First parameter is the function to call.
    x=NumpyRandomFunc("uniform", low=0.0, high=10.0),
    m=5.0,
    b=-2.0,
)
print(func_node.sample_parameters(num_samples=1))

NumpyRandomFunc:uniform_1:
    low: 0.0
    high: 10.0
    function_node_result: 7.186434988454487
FunctionNode:_linear_eq_0:
    x: 7.186434988454487
    m: 5.0
    b: -2.0
    function_node_result: 33.93217494227244

The first parameter of the function node is the input function, such as our linear equation above. Each input into the function must be included as a named parameter, such as x, m, and b above. If any of the input parameters are missing the code will give an error.

Here we use constants for m and b so we use the same linear formulation for each sample. Only the value of x changes. However we could have also used function nodes, including sampling functions, to set m and b. In that case it is important to remember that each of our results sample will be the result of a sampling of all the variable parameters.

It would be tiresome to manually generate a FunctionNode object or class for every small mathematical function we need to use. As such LightCurveLynx also provides the BasicMathNode, which will take a string and (safely) compile the mathematical expression into a function.

[10]:

from lightcurvelynx.math_nodes.basic_math_node import BasicMathNode

math_node = BasicMathNode("a + b", a=5.0, b=10.0)
print(math_node.sample_parameters(num_samples=1))

BasicMathNode:eval_0:
    a: 5.0
    b: 10.0
    function_node_result: 15.0

Linked Parameters / Nodes

Often the values of one node might depend on the values of another. A great case of this is a source/host pair where the location of the object depends on that of the host. We can access another node’s sampled parameters using a . notation: {model_object}.{parameter_name}

[11]:

host = ConstantSEDModel(brightness=15.0, ra=1.0, dec=2.0, node_label="host")
source = ConstantSEDModel(brightness=10.0, ra=host.ra, dec=host.dec, node_label="source")
state = source.sample_parameters(num_samples=5)

for i in range(5):
    print(
        f"{i}: Host=({state['host']['ra'][i]}, {state['host']['dec'][i]})"
        f"Source=({state['source']['ra'][i]}, {state['source']['dec'][i]})"
    )

0: Host=(1.0, 2.0)Source=(1.0, 2.0)
1: Host=(1.0, 2.0)Source=(1.0, 2.0)
2: Host=(1.0, 2.0)Source=(1.0, 2.0)
3: Host=(1.0, 2.0)Source=(1.0, 2.0)
4: Host=(1.0, 2.0)Source=(1.0, 2.0)

We can combine the node-parameter references with functional nodes to perform actions such as sampling with noise.

Here we generate the host’s (RA, dec) from a uniform patch of the sky and then generate the source’s (RA, dec) using a Gaussian distribution centered on the host’s position. For each sample the host and source should be close, but not necessarily identical.

[12]:

host = ConstantSEDModel(
    brightness=15.0,
    ra=NumpyRandomFunc("uniform", low=10.0, high=15.0),
    dec=NumpyRandomFunc("uniform", low=-10.0, high=10.0),
    node_label="host",
)

source = ConstantSEDModel(
    brightness=100.0,
    ra=NumpyRandomFunc("normal", loc=host.ra, scale=0.1),
    dec=NumpyRandomFunc("normal", loc=host.dec, scale=0.1),
    node_label="source",
)
state = source.sample_parameters(num_samples=10)

ax = plt.figure().add_subplot()
ax.plot(state["host"]["ra"], state["host"]["dec"], "b.")
ax.plot(state["source"]["ra"], state["source"]["dec"], "r.")

for i in range(5):
    print(
        f"{i}: Host=({state['host']['ra'][i]}, {state['host']['dec'][i]})    "
        f"Source=({state['source']['ra'][i]}, {state['source']['dec'][i]})"
    )

0: Host=(13.71179434214788, -9.058972230423667)    Source=(13.732981943161459, -9.121678619284065)
1: Host=(13.579998188210888, -7.1577793345867295)    Source=(13.479316492325582, -7.128956535647843)
2: Host=(14.482918456653124, 5.167469438022529)    Source=(14.53631476100714, 5.185095661827862)
3: Host=(13.30213858157423, -9.932430941201174)    Source=(13.324104183208345, -9.865963264568824)
4: Host=(13.724065263974204, 4.734016329421502)    Source=(13.774545781951444, 4.8382236251112385)

../_images/notebooks_technical_overview_25_1.png

Again we can access all the information for a single sample. Here we see the full state tracked by the system. In addition to the host and source nodes we created, the information for the functional nodes is tracked.

[13]:

single_sample = state.extract_single_sample(0)
print(str(single_sample))

NumpyRandomFunc:uniform_3:
    low: 10.0
    high: 15.0
    function_node_result: 13.71179434214788
host:
    ra: 13.71179434214788
    dec: -9.058972230423667
    redshift: None
    t0: None
    distance: None
    brightness: 15.0
NumpyRandomFunc:uniform_4:
    low: -10.0
    high: 10.0
    function_node_result: -9.058972230423667
NumpyRandomFunc:normal_1:
    loc: 13.71179434214788
    scale: 0.1
    function_node_result: 13.732981943161459
source:
    ra: 13.732981943161459
    dec: -9.121678619284065
    redshift: None
    t0: None
    distance: None
    brightness: 100.0
NumpyRandomFunc:normal_5:
    loc: -9.058972230423667
    scale: 0.1
    function_node_result: -9.121678619284065

It is interesting to note that functional nodes themselves are parameterized nodes, allowing for more complex forms of chaining. For example we could set the low parameter from one of the NumpyRandomFuncs from another function node. This allows us to specify priors and comlex distributions.

Similarly we can now make the input parameters of one node depend on a function of parameters from other nodes. We can arbitrarily chain the computations.

[14]:

# Create a host galaxy with a random brightness.
host = ConstantSEDModel(
    brightness=NumpyRandomFunc("uniform", low=1.0, high=5.0),
    node_label="host",
)

# Create the brightness of the source as a uniformly sampled foreground brightness
# added to the 80% of the host's brightness (background).
source_brightness = BasicMathNode(
    "0.8 * val1 + val2",
    val1=host.brightness,
    val2=NumpyRandomFunc("uniform", low=1.0, high=2.0),
    node_label="plus_80percent",
)

source = ConstantSEDModel(
    brightness=source_brightness,
    node_label="source",
)
state = source.sample_parameters(num_samples=10)
print(f"Host Brightness: {state['host']['brightness']}")
print(f"Source Brightness: {state['source']['brightness']}")

Host Brightness: [4.39981629 3.67511612 4.71728626 4.84523114 4.29572223 4.25754043
 4.85247102 2.90676787 3.34343133 2.34668212]
Source Brightness: [4.76358437 4.51429876 5.64842244 5.53204467 5.13946595 4.99936858
 5.7406846  3.5269241  4.4897315  3.17000498]

The state is used within the evaluate_sed() function to generate the flux densities.

[15]:

time = np.arange(0, 10, 0.1)
waves = np.array([1000.0, 2000.0])
fluxes = source.evaluate_sed(time, waves, state)
print(f"Generated {fluxes.shape} fluxes (samples x times x wavelengths).")

# Plot the the flux for sample=0 and wavelength=1000.0.
plt.plot(time, fluxes[0, :, 0], "k-")
plt.xlabel("Time (days)")
plt.ylabel("Flux (nJy)")
plt.title("Flux for sample=0 and wavelength=1000.0")
plt.show()

Generated (10, 100, 2) fluxes (samples x times x wavelengths).

../_images/notebooks_technical_overview_32_1.png

MultiObjectModels

We expect that many users will want to simulate fluxes produced by a combination of objects, such as a supernova and its host galaxy. LightCurveLynx provides the AdditiveMultiObjectModel for computing such combinations. The flux produced by the model is the (weighted) sum of fluxes from the individual sources.

Each underlying model in the AdditiveMultiObjectModel separately applies rest frame effects to its component flux. This allows users to model unresolved objects at different distances (with different redshifts or dust extinctions). All observer frame effects are applied to the summed fluxes for consistency.

[16]:

from lightcurvelynx.models.basic_models import SinWaveModel
from lightcurvelynx.models.multi_object_model import AdditiveMultiObjectModel

# We model the host as a galaxy with a random brightness and position.
host = ConstantSEDModel(
    brightness=NumpyRandomFunc("normal", loc=10.0, scale=1.0),
    ra=NumpyRandomFunc("uniform", low=10.0, high=15.0),
    dec=NumpyRandomFunc("uniform", low=-10.0, high=10.0),
    node_label="host",
)

# We model the source as a sine wave with a given frequency and amplitude.
source = SinWaveModel(
    brightness=1.0,
    amplitude=1.0,
    frequency=0.5,
    ra=NumpyRandomFunc("normal", loc=host.ra, scale=0.1),
    dec=NumpyRandomFunc("normal", loc=host.dec, scale=0.1),
    t0=0.0,
    node_label="sin_wave_source",
)

# We combine the host and source into a multi-source model and sample it.
combined = AdditiveMultiObjectModel(
    objects=[host, source],
    node_label="combined_model",
)
state = combined.sample_parameters(num_samples=1)
print(state)

NumpyRandomFunc:uniform_2:
    low: 10.0
    high: 15.0
    function_node_result: 13.834404524808669
host:
    ra: 13.834404524808669
    dec: 7.3497806556606875
    redshift: None
    t0: None
    distance: None
    brightness: 10.677497045330453
NumpyRandomFunc:uniform_3:
    low: -10.0
    high: 10.0
    function_node_result: 7.3497806556606875
NumpyRandomFunc:normal_4:
    loc: 10.0
    scale: 1.0
    function_node_result: 10.677497045330453
NumpyRandomFunc:normal_6:
    loc: 13.834404524808669
    scale: 0.1
    function_node_result: 13.815659292820841
sin_wave_source:
    ra: 13.815659292820841
    dec: 7.411457809514209
    redshift: None
    t0: 0.0
    distance: None
    brightness: 1.0
    amplitude: 1.0
    frequency: 0.5
NumpyRandomFunc:normal_7:
    loc: 7.3497806556606875
    scale: 0.1
    function_node_result: 7.411457809514209
combined_model:
    ra: None
    dec: None
    redshift: None
    t0: None
    distance: None

[17]:

# Evaluate and generate the fluxes from the combined model for a
# single wavelength parameter sample.
time = np.arange(0, 10, 0.1)
fluxes = combined.evaluate_sed(time, np.array([1000.0]), state)

# Plot the the flux for sample=0 and wavelength=1000.0.
plt.plot(time, fluxes[:, 0], "k-")
plt.xlabel("Time (days)")
plt.ylabel("Flux (nJy)")
plt.title("Flux for combined model")
plt.show()

../_images/notebooks_technical_overview_35_0.png