Parameters

Bumps fitting is centered on Parameter objects. Parameters define the search space, the uncertainty analysis and even the user interface. Constraints within and between models are implemented through parameters. Prior probabilities are defined by for parameters.

Model classes for Bumps should make it easy to define the initial value of fitting parameters and tie parameters together. When creating a model, you should be able specify parameter=value for each of the model parameters. Later, you should be able to reference the parameter within the model using M.parameter. Parameters can also be tied together by assigning the same Parameter object to two different parameters. For example, a hollow cylinder can be created using:

solvent = Parameter("solvent", value=1.2)
shell = Parameter("shell", value=4.5)
M = CoreShellCylinder(core=solvent, shell=shell, solvent=solvent,
                      radius=95, thickness=10, length=100)

The model parameter can also be a derived value that is the result of a parameter expression. For example, the following creates a cylinder whose length is twice the radius:

radius = Parameter("radius", value=3)
M = Cylinder(radius=radius, length=2*radius)

Any time you ask for M.length.value it will compute the result as 2*radius.value and return that.

You can also tie parameters together after the fact. For example, you can create the constrained cylinder using:

M = Cylinder(radius=3, length=6)
M.length = 2*M.radius

The advantage of this method is that you can easily comment out the constraint when exploring the model space, and fit length and radius freely.

Once you have defined your models and constraints you can set up you fitting parameters. There are several parameter methods which are helpful:

All these methods set the bounds attribute on the parameter in one way or another. See bumps.bounds for details. Technically, setting the parameter to dev, soft_range or pdf is equivalent to creating a probability distribution model with a single data point and Fitness.nllf equal to the negative log likelihood of seeing the parameter value in the distribution. This PDF model would be fit simultaneously with your target model with the parameter shared between them. The result is statistically sound (it is just more prior information), and conveniently, it does not affect the number of degrees of freedom in the fit.

When defining new model classes, use the static method Parameter.default() to initialize the parameter. This will accept the input argument passed in by the user and depending on its type, either create a new parameter slot and set its initial value, or link the slot to another parameter.

Free Variables

When fitting multiple datasets, you will undoubtedly have models with many shared parameters, and some parameters that differ between the models. Common patterns include:

  • different measurements may use the same material but different contrast agents,

  • they may use the same contrast agent but different materials,

  • the same material and contrast, but different sizes, or

  • a cross product with several materials and several sizes.

Often with complex models the parameter of interest is buried within the model structure. One approach is to clone the models using a deep copy of the entire structure, then tie together parameters for the bits that are changing. This proves to be confusing and difficult for new python programmers, so instead FitProblem was extended to support FreeVariables. The FreeVariables class allows you to use the same model structure with different data sets, but have some parameters that vary between the models. Each varying parameter is a slot, and FreeVariables keeps an array of parameters (actually a ParameterSet) to fill that slot, one for each model.

To define the free variables, you need the names of the different models, a parameter slot to hold the values, and a list of the different parameter values for each model. You then define the free variables as follows:

free = FreeVariables(names=["model1", "model2", ...],
                 p1=model.p1, p2=model.p2, ...)
...
problem = FitProblem(experiments, freevars=free)

The slots can be referenced by name, with the underlying parameters referenced by variable number. In the above, free.p1[1] refers to the parameter p1 when fitting data2. You can also refer to the slots by name, such as free.p1[data2.name]. The parameters in the slots have the usual properties of parameters, such as values and fit ranges. Setting the fit range makes the parameter a fitted parameter, and the fit will give the uncertainty on each parameter independently. Parameters can be copied, so that a pair of models can share the same value.

The following examples shows a neutron scattering problems with two datasets, one measured with light water and the other measured with heavy water, you can share the same material object, but use the light water scattering factors in the first and the heavy water scattering factors in the second. The problem would be composed as follows:

material = SLD('silicon', rho=2.07)
solvent = SLD('solvent') # unspecified rho
model = Sphere(radius=10, material=material, solvent=solvent)
M1 = ScatteringFitness(model, hydrogenated_data)
M2 = ScatteringFitness(model, deuterated_data)
free = FreeVariables(names=['hydrogenated', 'deuterated'],
                     solvent=solvent.sld)
free.solvent.values = [-0.561, 6.402]
model.radius.range(1,35)
problem = FitProblem([M1, M2], freevars=free)

In this particular example, the solvent is fixed for each measurement, and the sphere radius is allowed to vary between 1 and 35. Since the radius is not a free variable, the fitted radius will be chosen such that it minimizes the combined fitness of both models. In a more complicated situation, we may not know either the sphere radius or the solvent densities, but still the radius is shared between the two models. In this case we could set:

fv.solvent.range(-1,7)

and the SLD of the solvent would be fitted independently in the two data sets. Notice that we did not refer to the individual model index when setting the range. This is a convenience—range, pm and pmp can be set on the entire set as above, or individually using, e.g.,

fv.solvent[0].range(-1,0)
fv.solvent[1].range(6,7)