Creating Custom Function Nodes

In this tutorial we do a deep dive into FunctionNode objects and how users can create their own nodes for various functions. Users should already be familiar with the concepts covered in the Sampling Parameters notebook.

[1]:

from lightcurvelynx.base_models import FunctionNode
from lightcurvelynx.math_nodes.np_random import NumpyRandomFunc

Function Node Overview

Function nodes provide users the ability to wrap arbitrary computations during the parameter generation stage. The name FunctionNode is a bit of a misnomer as these nodes can wrap any Python callable object. For simplicity, we will use the term function throughout this notebook, but users should understand the more general behavior.

The basic flow of the FunctionNode is wrapped in the base class’s compute method:

Assemble the wrapped function’s input values from the GraphState object (model parameters) and keyword arguments,
Call the wrapped function with the assembled input values,
Capture the function’s output, and
Write those values to the GraphState object.

By default each function node stores its result in a parameter called function_node_result. Since model parameters are indexed by a combination of node name and parameter name, it will often be the case that multiple nodes in the model will generate function_node_result values. As we will see later in this notebook, we can override the name of the outputs to be more user friendly.

There are two ways to use the FunctionNode class: as a standalone wrapper or as a parent class.

FunctionNode as a Standalone Wrapper

Users can wrap a function directly by passing the function and its arguments into the FunctionNode constructor. This wraps the provided function and uses the functions returned value as its output.

As a concrete example, let’s create a FunctionNode that computes wraps an existing function that computes y = m * x + b. We need to pass this function and values for each of its parameters to the constructor.

[2]:

# This is the function we would like to wrap.
def linear_eq_function(x, m, b):
    """Compute y = m * x + b"""
    return m * x + b


# This is how we wrap linear_eq_function.
func_node = FunctionNode(
    linear_eq_function,  # First argument is the function to call.
    # The function's parameters are given as keyword arguments to the FunctionNode.
    x=NumpyRandomFunc("uniform", low=0.0, high=10.0),  # Random value
    m=5.0,  # Constant value
    b=-2.0,  # Constant value
)

The first parameter of the function node is the function to evaluate, such as our linear equation above (linear_eq_function). Each input into that function must be included as a named parameter during the FunctionNode definition, such as x, m, and b above. If any of the input parameters are missing, the code will give an error. The FunctionNode class handles all the internal book keeping of: determining the names of the function’s arguments, creating internal parameters, and assembling those arguments whenever the function is called.

Here we provide constants for m and b so we use the same linear formulation for each sample. Only the value of x changes. However, we could have also used a whole tree of function nodes, including sampling functions, to set m and b. In that case it is important to remember that each of our results is a consistent sampling and computation over all the parameters in the model.

[3]:

state = func_node.sample_parameters(num_samples=5)
print(state)

NumpyRandomFunc:uniform_1:
    low: [0. 0. 0. 0. 0.]
    high: [10. 10. 10. 10. 10.]
    function_node_result: [7.35998275 6.76951037 8.75518051 0.37035029 0.6607771 ]
FunctionNode:linear_eq_function_0:
    x: [7.35998275 6.76951037 8.75518051 0.37035029 0.6607771 ]
    m: [5. 5. 5. 5. 5.]
    b: [-2. -2. -2. -2. -2.]
    function_node_result: [34.79991377 31.84755187 41.77590255 -0.14824854  1.30388548]

As described above both of the nodes (the numpy sampler and the linear function) create function_node_result parameters to store their intermediate results.

The nodes can be chained by using one FunctionNode as the value for a parameter of another. When the a FunctionNode is passed as a parameter, LightCurveLynx will automatically link that parameter to the FunctionNode’s function_node_result value. Below you can see that the input (x) of our increment function corresponds directly to the output (function_node_result) of the linear equation function.

[4]:

def increment(x):
    """Increment x by 1."""
    return x + 1


# This is how we wrap increment function.
inc_node = FunctionNode(
    increment,  # First argument is the function to call.
    # The function's parameters are given as keyword arguments to the FunctionNode.
    x=func_node,  # Use the output of func_node as input to increment.
)

state = inc_node.sample_parameters(num_samples=5)
print(state)

NumpyRandomFunc:uniform_2:
    low: [0. 0. 0. 0. 0.]
    high: [10. 10. 10. 10. 10.]
    function_node_result: [0.32024658 2.49915656 6.53915135 9.594444   1.32761105]
FunctionNode:linear_eq_function_1:
    x: [0.32024658 2.49915656 6.53915135 9.594444   1.32761105]
    m: [5. 5. 5. 5. 5.]
    b: [-2. -2. -2. -2. -2.]
    function_node_result: [-0.3987671  10.49578282 30.69575673 45.97222002  4.63805526]
FunctionNode:increment_0:
    x: [-0.3987671  10.49578282 30.69575673 45.97222002  4.63805526]
    function_node_result: [ 0.6012329  11.49578282 31.69575673 46.97222002  5.63805526]

We could make the linking of parameters more explicit by using the dot notation and the parameter name. But the behavior is identical.

[5]:

# This is how we wrap increment function.
inc_node = FunctionNode(
    increment,  # First argument is the function to call.
    # The function's parameters are given as keyword arguments to the FunctionNode.
    x=func_node.function_node_result,  # named parameter
)

state = inc_node.sample_parameters(num_samples=5)
print(state)

NumpyRandomFunc:uniform_2:
    low: [0. 0. 0. 0. 0.]
    high: [10. 10. 10. 10. 10.]
    function_node_result: [2.40248392 7.82149529 3.21421008 7.48010069 3.75064068]
FunctionNode:linear_eq_function_1:
    x: [2.40248392 7.82149529 3.21421008 7.48010069 3.75064068]
    m: [5. 5. 5. 5. 5.]
    b: [-2. -2. -2. -2. -2.]
    function_node_result: [10.01241958 37.10747645 14.07105041 35.40050346 16.75320339]
FunctionNode:increment_0:
    x: [10.01241958 37.10747645 14.07105041 35.40050346 16.75320339]
    function_node_result: [11.01241958 38.10747645 15.07105041 36.40050346 17.75320339]

More realistically, users will want to wrap functions that perform complex astronomical calculations.

FunctionNode Subclasses

In the case where users will want to create function nodes that carry around additional data, users can create subclasses of the FunctionNode class. For example, when computing the distmod from the redshift, we need to load the cosmology. While we could load the cosmology each time the function is called, it would be more efficient to load it once and reuse it across computations.

[6]:

from astropy.cosmology import FlatLambdaCDM


class DistModFromRedshift(FunctionNode):
    """A wrapper class for the _distmod_from_redshift() function.

    Parameters
    ----------
    redshift : function or constant
        The function or constant providing the redshift value.
    H0 : constant
        The Hubble constant.
    Omega_m : constant
        The matter density Omega_m.
    **kwargs : dict, optional
        Any additional keyword arguments.
    """

    def __init__(self, redshift, H0=73.0, Omega_m=0.3, **kwargs):
        # Create the cosmology once for this node. This is constructed ONCE for all samples.
        if not isinstance(H0, float) or not isinstance(Omega_m, float):
            raise ValueError("H0 and Omega_m must be constants.")
        self.cosmo = FlatLambdaCDM(H0=H0, Om0=Omega_m)

        # Call the super class's constructor with the needed information.
        super().__init__(
            func=self._distmod_from_redshift,  # "Function" being wrapped
            redshift=redshift,
            **kwargs,
        )

    def _distmod_from_redshift(self, redshift):
        """Compute distance modulus given redshift and cosmology.

        Parameters
        ----------
        redshift : float or numpy.ndarray
            The redshift value(s).

        Returns
        -------
        distmod : float or numpy.ndarray
            The distance modulus (in mag)
        """
        return self.cosmo.distmod(redshift).value

There are a few things to note from the implementation above.

First, since the cosmology is created on a per-object basis, it will be the same for every evaluation. Its parameters, H0 and Omega_m are fixed for all samples. Only the input redshift is changing.

Second, the “function” being wrapped by the function node is actually an object method. As we noted earlier, the FunctionNode can actually wrap any Python callable object. By wrapping an internal method, the computation has access to the object’s attributes via self.

Supporting Multiple Outputs

If the wrapped function produces multiple outputs, the user can assign names to each output via the outputs constructor argument. This argument takes a list of strings that is same length as the number of outputs produced. Each result is separately stored in a corresponding named parameter (instead of the default function_node_result parameter). These parameters are added automatically to the object.

[7]:

# A function that returns two values.
def _linear_pair(x, m, b):
    """Compute y1 = m * x + b and y2 = -1/m * x - b"""
    return (m * x + b, -1.0 / m * x - b)


# A function node that returns two values. The outputs are named "y1" and "y2".
func_node2 = FunctionNode(
    _linear_pair,  # First parameter is the function to call.
    x=NumpyRandomFunc("uniform", low=0.0, high=10.0),
    m=5.0,
    b=-2.0,
    outputs=["y1", "y2"],  # The output names.
)

print(func_node2.sample_parameters(num_samples=5))

NumpyRandomFunc:uniform_1:
    low: [0. 0. 0. 0. 0.]
    high: [10. 10. 10. 10. 10.]
    function_node_result: [7.94138862 1.74918516 8.85296038 9.39347049 4.21513299]
FunctionNode:_linear_pair_0:
    x: [7.94138862 1.74918516 8.85296038 9.39347049 4.21513299]
    m: [5. 5. 5. 5. 5.]
    b: [-2. -2. -2. -2. -2.]
    y1: [37.7069431   6.74592578 42.26480188 44.96735243 19.07566493]
    y2: [0.41172228 1.65016297 0.22940792 0.1213059  1.1569734 ]

The outputs can be referenced individually using the dot notation with their given name. Below we reimplement the increment function using just the y2 output as the function’s input.

[8]:

# This is how we wrap increment function.
inc_node2 = FunctionNode(
    increment,  # First argument is the function to call.
    # The function's parameters are given as keyword arguments to the FunctionNode.
    x=func_node2.y2,  # Use the named output.
)

print(inc_node2.sample_parameters(num_samples=10))

NumpyRandomFunc:uniform_2:
    low: [0. 0. 0. 0. 0. 0. 0. 0. 0. 0.]
    high: [10. 10. 10. 10. 10. 10. 10. 10. 10. 10.]
    function_node_result: [1.33664306 9.70865948 3.87266986 5.69151466 6.72552844 2.7658977
 8.89420314 7.62809401 9.1126249  7.97040226]
FunctionNode:_linear_pair_1:
    x: [1.33664306 9.70865948 3.87266986 5.69151466 6.72552844 2.7658977
 8.89420314 7.62809401 9.1126249  7.97040226]
    m: [5. 5. 5. 5. 5. 5. 5. 5. 5. 5.]
    b: [-2. -2. -2. -2. -2. -2. -2. -2. -2. -2.]
    y1: [ 4.68321531 46.54329742 17.36334931 26.45757332 31.62764219 11.8294885
 42.47101569 36.14047006 43.56312449 37.85201132]
    y2: [1.73267139 0.0582681  1.22546603 0.86169707 0.65489431 1.44682046
 0.22115937 0.4743812  0.17747502 0.40591955]
FunctionNode:increment_0:
    x: [1.73267139 0.0582681  1.22546603 0.86169707 0.65489431 1.44682046
 0.22115937 0.4743812  0.17747502 0.40591955]
    function_node_result: [2.73267139 1.0582681  2.22546603 1.86169707 1.65489431 2.44682046
 1.22115937 1.4743812  1.17747502 1.40591955]

The named output is most often used in nodes that produce a combination of correlated values, such as (RA, Dec). See the Sampling Object Positions notebook for examples.

Randomization

Care must be taken when creating new function nodes that use randomization. To be consistent, users will want the nodes to be completely random by default, but have the ability to use a provided random number generator. The difficulty is that FunctionNode.compute() does not pass along the random number generator to the function. It can’t because not all wrapped functions can even take a random number generator parameter.

Instead there are two supported approaches to enable random behavior.

Use Random Parameters (RECOMMENDED)

Users can add new parameters in the their class’s constructor that correspond to the random values they would like to generate. For example, if we wanted to implement a noisy linear function: y = m * x + b, we could add a noise parameter. We set this parameter using a NumpyRandomFunc or other random node. This approach takes care of the internal bookkeeping

[9]:

class NoisyLinear(FunctionNode):
    """A noisy linear function node."""

    def __init__(self, x, m, b, **kwargs):
        # Create the noise function once that will be constructed once, but queried for each sample.
        self.noise_func = NumpyRandomFunc("normal", loc=0.0, scale=1.0)

        # Call the super class's constructor with the needed information.
        super().__init__(
            func=self._noisy_linear_eq,  # "Function" being wrapped
            x=x,
            m=m,
            b=b,
            noise=self.noise_func,
            **kwargs,
        )

    def _noisy_linear_eq(self, x, m, b, noise):
        """Compute y = m * x + b + noise."""
        return m * x + b + noise


my_node = NoisyLinear(
    x=NumpyRandomFunc("uniform", low=0.0, high=10.0),
    m=5.0,
    b=-2.0,
)
print(my_node.sample_parameters(num_samples=5))

NumpyRandomFunc:uniform_1:
    low: [0. 0. 0. 0. 0.]
    high: [10. 10. 10. 10. 10.]
    function_node_result: [7.11673131 3.75770742 4.82096012 6.21606195 1.38165153]
NoisyLinear:_noisy_linear_eq_0:
    x: [7.11673131 3.75770742 4.82096012 6.21606195 1.38165153]
    m: [5. 5. 5. 5. 5.]
    b: [-2. -2. -2. -2. -2.]
    noise: [ 0.57417548 -0.24974284 -0.61610297 -1.76139615  0.68453986]
    function_node_result: [34.15783201 16.53879427 21.48869762 27.31891361  5.59279749]
NumpyRandomFunc:normal_2:
    loc: [0. 0. 0. 0. 0.]
    scale: [1. 1. 1. 1. 1.]
    function_node_result: [ 0.57417548 -0.24974284 -0.61610297 -1.76139615  0.68453986]

As you can see, the noise parameter is sampled first and applied as though it was any other constant.

Custom Compute Function

If users need more control over how the randomness is used, they can override the compute function which does take a random number generator. However, the compute function contains other logic that will need to be replicated, including the assembly of the functions parameters and writing the results to the GraphState. We recommend this approach only for experienced users. For examples of this approach, see the code for the NumpyRandomFunc class itself.