pyerrors
What is pyerrors?
pyerrors
is a python package for error computation and propagation of Markov chain Monte Carlo data.
It is based on the gamma method arXiv:hep-lat/0306017. Some of its features are:
- automatic differentiation for exact linear error propagation as suggested in arXiv:1809.01289 (partly based on the autograd package).
- treatment of slow modes in the simulation as suggested in arXiv:1009.5228.
- coherent error propagation for data from different Markov chains.
- non-linear fits with x- and y-errors and exact linear error propagation based on automatic differentiation as introduced in arXiv:1809.01289.
- real and complex matrix operations and their error propagation based on automatic differentiation (Matrix inverse, Cholesky decomposition, calculation of eigenvalues and eigenvectors, singular value decomposition...).
More detailed examples can found in the GitHub repository .
If you use pyerrors
for research that leads to a publication please consider citing:
- Fabian Joswig, Simon Kuberski, Justus T. Kuhlmann, Jan Neuendorf, pyerrors: a python framework for error analysis of Monte Carlo data. Comput.Phys.Commun. 288 (2023) 108750.
- Ulli Wolff, Monte Carlo errors with less errors. Comput.Phys.Commun. 156 (2004) 143-153, Comput.Phys.Commun. 176 (2007) 383 (erratum).
- Alberto Ramos, Automatic differentiation for error analysis of Monte Carlo data. Comput.Phys.Commun. 238 (2019) 19-35.
and
- Stefan Schaefer, Rainer Sommer, Francesco Virotta, Critical slowing down and error analysis in lattice QCD simulations. Nucl.Phys.B 845 (2011) 93-119.
where applicable.
There exist similar publicly available implementations of gamma method error analysis suites in Fortran, Julia and Python.
Installation
Install the most recent release using pip and pypi:
python -m pip install pyerrors # Fresh install
python -m pip install -U pyerrors # Update
Install the most recent release using conda and conda-forge:
conda install -c conda-forge pyerrors # Fresh install
conda update -c conda-forge pyerrors # Update
Install the current develop
version:
python -m pip install -U --no-deps --force-reinstall git+https://github.com/fjosw/pyerrors.git@develop
(Also works for any feature branch).
Basic example
import numpy as np
import pyerrors as pe
my_obs = pe.Obs([samples], ['ensemble_name']) # Initialize an Obs object
my_new_obs = 2 * np.log(my_obs) / my_obs ** 2 # Construct derived Obs object
my_new_obs.gamma_method() # Estimate the statistical error
print(my_new_obs) # Print the result to stdout
> 0.31498(72)
The Obs
class
pyerrors
introduces a new datatype, Obs
, which simplifies error propagation and estimation for auto- and cross-correlated data.
An Obs
object can be initialized with two arguments, the first is a list containing the samples for an observable from a Monte Carlo chain.
The samples can either be provided as python list or as numpy array.
The second argument is a list containing the names of the respective Monte Carlo chains as strings. These strings uniquely identify a Monte Carlo chain/ensemble. It is crucial for the correct error propagation that observations from the same Monte Carlo history are labeled with the same name. See Multiple ensembles/replica for details.
import pyerrors as pe
my_obs = pe.Obs([samples], ['ensemble_name'])
Error propagation
When performing mathematical operations on Obs
objects the correct error propagation is intrinsically taken care of using a first order Taylor expansion
$$\delta_f^i=\sum_\alpha \bar{f}_\alpha \delta_\alpha^i\,,\quad \delta_\alpha^i=a_\alpha^i-\bar{a}_\alpha\,,$$
as introduced in arXiv:hep-lat/0306017.
The required derivatives $\bar{f}_\alpha$ are evaluated up to machine precision via automatic differentiation as suggested in arXiv:1809.01289.
The Obs
class is designed such that mathematical numpy functions can be used on Obs
just as for regular floats.
import numpy as np
import pyerrors as pe
my_obs1 = pe.Obs([samples1], ['ensemble_name'])
my_obs2 = pe.Obs([samples2], ['ensemble_name'])
my_sum = my_obs1 + my_obs2
my_m_eff = np.log(my_obs1 / my_obs2)
iamzero = my_m_eff - my_m_eff
# Check that value and fluctuations are zero within machine precision
print(iamzero == 0.0)
> True
Error estimation
The error estimation within pyerrors
is based on the gamma method introduced in arXiv:hep-lat/0306017.
After having arrived at the derived quantity of interest the gamma_method
can be called as detailed in the following example.
my_sum.gamma_method()
print(my_sum)
> 1.70(57)
my_sum.details()
> Result 1.70000000e+00 +/- 5.72046658e-01 +/- 7.56746598e-02 (33.650%)
> t_int 2.71422900e+00 +/- 6.40320983e-01 S = 2.00
> 1000 samples in 1 ensemble:
> · Ensemble 'ensemble_name' : 1000 configurations (from 1 to 1000)
The gamma_method
is not automatically called after every intermediate step in order to prevent computational overhead.
We use the following definition of the integrated autocorrelation time established in Madras & Sokal 1988
$$\tau_\mathrm{int}=\frac{1}{2}+\sum_{t=1}^{W}\rho(t)\geq \frac{1}{2}\,.$$
The window $W$ is determined via the automatic windowing procedure described in arXiv:hep-lat/0306017.
The standard value for the parameter $S$ of this automatic windowing procedure is $S=2$. Other values for $S$ can be passed to the gamma_method
as parameter.
my_sum.gamma_method(S=3.0)
my_sum.details()
> Result 1.70000000e+00 +/- 6.30675201e-01 +/- 1.04585650e-01 (37.099%)
> t_int 3.29909703e+00 +/- 9.77310102e-01 S = 3.00
> 1000 samples in 1 ensemble:
> · Ensemble 'ensemble_name' : 1000 configurations (from 1 to 1000)
The integrated autocorrelation time $\tau_\mathrm{int}$ and the autocorrelation function $\rho(W)$ can be monitored via the methods pyerrors.obs.Obs.plot_tauint
and pyerrors.obs.Obs.plot_rho
.
If the parameter $S$ is set to zero it is assumed that the dataset does not exhibit any autocorrelation and the window size is chosen to be zero. In this case the error estimate is identical to the sample standard error.
Exponential tails
Slow modes in the Monte Carlo history can be accounted for by attaching an exponential tail to the autocorrelation function $\rho$ as suggested in arXiv:1009.5228. The longest autocorrelation time in the history, $\tau_\mathrm{exp}$, can be passed to the gamma_method
as parameter. In this case the automatic windowing procedure is vacated and the parameter $S$ does not affect the error estimate.
my_sum.gamma_method(tau_exp=7.2)
my_sum.details()
> Result 1.70000000e+00 +/- 6.28097762e-01 +/- 5.79077524e-02 (36.947%)
> t_int 3.27218667e+00 +/- 7.99583654e-01 tau_exp = 7.20, N_sigma = 1
> 1000 samples in 1 ensemble:
> · Ensemble 'ensemble_name' : 1000 configurations (from 1 to 1000)
For the full API see pyerrors.obs.Obs.gamma_method
.
Multiple ensembles/replica
Error propagation for multiple ensembles (Markov chains with different simulation parameters) is handled automatically. Ensembles are uniquely identified by their name
.
obs1 = pe.Obs([samples1], ['ensemble1'])
obs2 = pe.Obs([samples2], ['ensemble2'])
my_sum = obs1 + obs2
my_sum.details()
> Result 2.00697958e+00
> 1500 samples in 2 ensembles:
> · Ensemble 'ensemble1' : 1000 configurations (from 1 to 1000)
> · Ensemble 'ensemble2' : 500 configurations (from 1 to 500)
Observables from the same Monte Carlo chain have to be initialized with the same name for correct error propagation. If different names were used in this case the data would be treated as statistically independent resulting in loss of relevant information and a potential over or under estimate of the statistical error.
pyerrors
identifies multiple replica (independent Markov chains with identical simulation parameters) by the vertical bar |
in the name of the data set.
obs1 = pe.Obs([samples1], ['ensemble1|r01'])
obs2 = pe.Obs([samples2], ['ensemble1|r02'])
> my_sum = obs1 + obs2
> my_sum.details()
> Result 2.00697958e+00
> 1500 samples in 1 ensemble:
> · Ensemble 'ensemble1'
> · Replicum 'r01' : 1000 configurations (from 1 to 1000)
> · Replicum 'r02' : 500 configurations (from 1 to 500)
Error estimation for multiple ensembles
In order to keep track of different error analysis parameters for different ensembles one can make use of global dictionaries as detailed in the following example.
pe.Obs.S_dict['ensemble1'] = 2.5
pe.Obs.tau_exp_dict['ensemble2'] = 8.0
pe.Obs.tau_exp_dict['ensemble3'] = 2.0
In case the gamma_method
is called without any parameters it will use the values specified in the dictionaries for the respective ensembles.
Passing arguments to the gamma_method
still dominates over the dictionaries.
Irregular Monte Carlo chains
Obs
objects defined on irregular Monte Carlo chains can be initialized with the parameter idl
.
# Observable defined on configurations 20 to 519
obs1 = pe.Obs([samples1], ['ensemble1'], idl=[range(20, 520)])
obs1.details()
> Result 9.98319881e-01
> 500 samples in 1 ensemble:
> · Ensemble 'ensemble1' : 500 configurations (from 20 to 519)
# Observable defined on every second configuration between 5 and 1003
obs2 = pe.Obs([samples2], ['ensemble1'], idl=[range(5, 1005, 2)])
obs2.details()
> Result 9.99100712e-01
> 500 samples in 1 ensemble:
> · Ensemble 'ensemble1' : 500 configurations (from 5 to 1003 in steps of 2)
# Observable defined on configurations 2, 9, 28, 29 and 501
obs3 = pe.Obs([samples3], ['ensemble1'], idl=[[2, 9, 28, 29, 501]])
obs3.details()
> Result 1.01718064e+00
> 5 samples in 1 ensemble:
> · Ensemble 'ensemble1' : 5 configurations (irregular range)
Obs
objects defined on regular and irregular histories of the same ensemble can be combined with each other and the correct error propagation and estimation is automatically taken care of.
Warning: Irregular Monte Carlo chains can result in odd patterns in the autocorrelation functions.
Make sure to check the autocorrelation time with e.g. pyerrors.obs.Obs.plot_rho
or pyerrors.obs.Obs.plot_tauint
.
For the full API see pyerrors.obs.Obs
.
Correlators
When one is not interested in single observables but correlation functions, pyerrors
offers the Corr
class which simplifies the corresponding error propagation and provides the user with a set of standard methods. In order to initialize a Corr
objects one needs to arrange the data as a list of Obs
my_corr = pe.Corr([obs_0, obs_1, obs_2, obs_3])
print(my_corr)
> x0/a Corr(x0/a)
> ------------------
> 0 0.7957(80)
> 1 0.5156(51)
> 2 0.3227(33)
> 3 0.2041(21)
In case the correlation functions are not defined on the outermost timeslices, for example because of fixed boundary conditions, a padding can be introduced.
my_corr = pe.Corr([obs_0, obs_1, obs_2, obs_3], padding=[1, 1])
print(my_corr)
> x0/a Corr(x0/a)
> ------------------
> 0
> 1 0.7957(80)
> 2 0.5156(51)
> 3 0.3227(33)
> 4 0.2041(21)
> 5
The individual entries of a correlator can be accessed via slicing
print(my_corr[3])
> 0.3227(33)
Error propagation with the Corr
class works very similar to Obs
objects. Mathematical operations are overloaded and Corr
objects can be computed together with other Corr
objects, Obs
objects or real numbers and integers.
my_new_corr = 0.3 * my_corr[2] * my_corr * my_corr + 12 / my_corr
pyerrors
provides the user with a set of regularly used methods for the manipulation of correlator objects:
Corr.gamma_method
applies the gamma method to all entries of the correlator.Corr.m_eff
to construct effective masses. Various variants for periodic and fixed temporal boundary conditions are available.Corr.deriv
returns the first derivative of the correlator asCorr
. Different discretizations of the numerical derivative are available.Corr.second_deriv
returns the second derivative of the correlator asCorr
. Different discretizations of the numerical derivative are available.Corr.symmetric
symmetrizes parity even correlations functions, assuming periodic boundary conditions.Corr.anti_symmetric
anti-symmetrizes parity odd correlations functions, assuming periodic boundary conditions.Corr.T_symmetry
averages a correlator with its time symmetry partner, assuming fixed boundary conditions.Corr.plateau
extracts a plateau value from the correlator in a given range.Corr.roll
periodically shifts the correlator.Corr.reverse
reverses the time ordering of the correlator.Corr.correlate
constructs a disconnected correlation function from the correlator and anotherCorr
orObs
object.Corr.reweight
reweights the correlator.
pyerrors
can also handle matrices of correlation functions and extract energy states from these matrices via a generalized eigenvalue problem (see pyerrors.correlators.Corr.GEVP
).
For the full API see pyerrors.correlators.Corr
.
Complex valued observables
pyerrors
can handle complex valued observables via the class pyerrors.obs.CObs
.
CObs
are initialized with a real and an imaginary part which both can be Obs
valued.
my_real_part = pe.Obs([samples1], ['ensemble1'])
my_imag_part = pe.Obs([samples2], ['ensemble1'])
my_cobs = pe.CObs(my_real_part, my_imag_part)
my_cobs.gamma_method()
print(my_cobs)
> (0.9959(91)+0.659(28)j)
Elementary mathematical operations are overloaded and samples are properly propagated as for the Obs
class.
my_derived_cobs = (my_cobs + my_cobs.conjugate()) / np.abs(my_cobs)
my_derived_cobs.gamma_method()
print(my_derived_cobs)
> (1.668(23)+0.0j)
The Covobs
class
In many projects, auxiliary data that is not based on Monte Carlo chains enters. Examples are experimentally determined mesons masses which are used to set the scale or renormalization constants. These numbers come with an error that has to be propagated through the analysis. The Covobs
class allows to define such quantities in pyerrors
. Furthermore, external input might consist of correlated quantities. An example are the parameters of an interpolation formula, which are defined via mean values and a covariance matrix between all parameters. The contribution of the interpolation formula to the error of a derived quantity therefore might depend on the complete covariance matrix.
This concept is built into the definition of Covobs
. In pyerrors
, external input is defined by $M$ mean values, a $M\times M$ covariance matrix, where $M=1$ is permissible, and a name that uniquely identifies the covariance matrix. Below, we define the pion mass, based on its mean value and error, 134.9768(5). Note, that the square of the error enters cov_Obs
, since the second argument of this function is the covariance matrix of the Covobs
.
import pyerrors.obs as pe
mpi = pe.cov_Obs(134.9768, 0.0005**2, 'pi^0 mass')
mpi.gamma_method()
mpi.details()
> Result 1.34976800e+02 +/- 5.00000000e-04 +/- 0.00000000e+00 (0.000%)
> pi^0 mass 5.00000000e-04
> 0 samples in 1 ensemble:
> · Covobs 'pi^0 mass'
The resulting object mpi
is an Obs
that contains a Covobs
. In the following, it may be handled as any other Obs
. The contribution of the covariance matrix to the error of an Obs
is determined from the $M \times M$ covariance matrix $\Sigma$ and the gradient of the Obs
with respect to the external quantities, which is the $1\times M$ Jacobian matrix $J$, via
$$s = \sqrt{J^T \Sigma J}\,,$$
where the Jacobian is computed for each derived quantity via automatic differentiation.
Correlated auxiliary data is defined similarly to above, e.g., via
RAP = pe.cov_Obs([16.7457, -19.0475], [[3.49591, -6.07560], [-6.07560, 10.5834]], 'R_AP, 1906.03445, (5.3a)')
print(RAP)
> [Obs[16.7(1.9)], Obs[-19.0(3.3)]]
where RAP
now is a list of two Obs
that contains the two correlated parameters.
Since the gradient of a derived observable with respect to an external covariance matrix is propagated through the entire analysis, the Covobs
class allows to quote the derivative of a result with respect to the external quantities. If these derivatives are published together with the result, small shifts in the definition of external quantities, e.g., the definition of the physical point, can be performed a posteriori based on the published information. This may help to compare results of different groups. The gradient of an Obs
o
with respect to a covariance matrix with the identifying string k
may be accessed via
o.covobs[k].grad
Error propagation in iterative algorithms
pyerrors
supports exact linear error propagation for iterative algorithms like various variants of non-linear least squares fits or root finding. The derivatives required for the error propagation are calculated as described in arXiv:1809.01289.
Least squares fits
Standard non-linear least square fits with errors on the dependent but not the independent variables can be performed with pyerrors.fits.least_squares
. As default solver the Levenberg-Marquardt algorithm implemented in scipy is used.
Fit functions have to be of the following form
import autograd.numpy as anp
def func(a, x):
return a[1] * anp.exp(-a[0] * x)
It is important that numerical functions refer to autograd.numpy
instead of numpy
for the automatic differentiation in iterative algorithms to work properly.
Fits can then be performed via
fit_result = pe.fits.least_squares(x, y, func)
print("\n", fit_result)
> Fit with 2 parameters
> Method: Levenberg-Marquardt
> `ftol` termination condition is satisfied.
> chisquare/d.o.f.: 0.9593035785160936
> Goodness of fit:
> χ²/d.o.f. = 0.959304
> p-value = 0.5673
> Fit parameters:
> 0 0.0548(28)
> 1 1.933(64)
where x is a list
or numpy.array
of floats
and y is a list
or numpy.array
of Obs
.
Data stored in Corr
objects can be fitted directly using the Corr.fit
method.
my_corr = pe.Corr(y)
fit_result = my_corr.fit(func, fitrange=[12, 25])
this can simplify working with absolute fit ranges and takes care of gaps in the data automatically.
For fit functions with multiple independent variables the fit function can be of the form
def func(a, x):
(x1, x2) = x
return a[0] * x1 ** 2 + a[1] * x2
pyerrors
also supports correlated fits which can be triggered via the parameter correlated_fit=True
.
Details about how the required covariance matrix is estimated can be found in pyerrors.obs.covariance
.
Direct visualizations of the performed fits can be triggered via resplot=True
or qqplot=True
.
For all available options including combined fits to multiple datasets see pyerrors.fits.least_squares
.
Total least squares fits
pyerrors
can also fit data with errors on both the dependent and independent variables using the total least squares method also referred to as orthogonal distance regression as implemented in scipy, see pyerrors.fits.least_squares
. The syntax is identical to the standard least squares case, the only difference being that x
also has to be a list
or numpy.array
of Obs
.
For the full API see pyerrors.fits
for fits and pyerrors.roots
for finding roots of functions.
Matrix operations
pyerrors
provides wrappers for Obs
- and CObs
-valued matrix operations based on numpy.linalg
. The supported functions include:
inv
for the matrix inverse.cholseky
for the Cholesky decomposition.det
for the matrix determinant.eigh
for eigenvalues and eigenvectors of hermitean matrices.eig
for eigenvalues of general matrices.pinv
for the Moore-Penrose pseudoinverse.svd
for the singular-value-decomposition.
For the full API see pyerrors.linalg
.
Export data
The preferred exported file format within pyerrors
is json.gz. Files written to this format are valid JSON files that have been compressed using gzip. The structure of the content is inspired by the dobs format of the ALPHA collaboration. The aim of the format is to facilitate the storage of data in a self-contained way such that, even years after the creation of the file, it is possible to extract all necessary information:
- What observables are stored? Possibly: How exactly are they defined.
- How does each single ensemble or external quantity contribute to the error of the observable?
- Who did write the file when and on which machine?
This can be achieved by storing all information in one single file. The export routines of pyerrors
are written such that as much information as possible is written automatically as described in the following example
my_obs = pe.Obs([samples], ["test_ensemble"])
my_obs.tag = "My observable"
pe.input.json.dump_to_json(my_obs, "test_output_file", description="This file contains a test observable")
# For a single observable one can equivalently use the class method dump
my_obs.dump("test_output_file", description="This file contains a test observable")
check = pe.input.json.load_json("test_output_file")
print(my_obs == check)
> True
The format also allows to directly write out the content of Corr
objects or lists and arrays of Obs
objects by passing the desired data to pyerrors.input.json.dump_to_json
.
json.gz format specification
The first entries of the file provide optional auxiliary information:
program
is a string that indicates which program was used to write the file.version
is a string that specifies the version of the format.who
is a string that specifies the user name of the creator of the file.date
is a string and contains the creation date of the file.host
is a string and contains the hostname of the machine where the file has been written.description
contains information on the content of the file. This field is not filled automatically inpyerrors
. The user is advised to provide as detailed information as possible in this field. Examples are: Input files of measurements or simulations, LaTeX formulae or references to publications to specify how the observables have been computed, details on the analysis strategy, ... This field may be any valid JSON type. Strings, arrays or objects (equivalent to dicts in python) are well suited to provide information.
The only necessary entry of the file is the field
-obsdata
, an array that contains the actual data.
Each entry of the array belongs to a single structure of observables. Currently, these structures can be either of Obs
, list
, numpy.ndarray
, Corr
. All Obs
inside a structure (with dimension > 0) have to be defined on the same set of configurations. Different structures, that are represented by entries of the array obsdata
, are treated independently. Each entry of the array obsdata
has the following required entries:
type
is a string that specifies the type of the structure. This allows to parse the content to the correct form after reading the file. It is always possible to interpret the content as list of Obs.value
is an array that contains the mean values of the Obs inside the structure. The following entries are optional:layout
is a string that specifies the layout of multi-dimensional structures. Examples are "2, 2" for a 2x2 dimensional matrix or "64, 4, 4" for a Corr with $T=64$ and 4x4 matrices on each time slices. "1" denotes a single Obs. Multi-dimensional structures are stored in row-major format (see below).tag
is any JSON type. It contains additional information concerning the structure. Thetag
of anObs
inpyerrors
is written here.reweighted
is a Bool that may be used to specify, whether theObs
in the structure have been reweighted.data
is an array that contains the data from MC chains. We will define it below.cdata
is an array that contains the data from external quantities with an error (Covobs
inpyerrors
). We will define it below.
The array data
contains the data from MC chains. Each entry of the array corresponds to one ensemble and contains:
id
, a string that contains the name of the ensemblereplica
, an array that contains an entry per replica of the ensemble.
Each entry of replica
contains
name
, a string that contains the name of the replica
deltas
, an array that contains the actual data.
Each entry in deltas
corresponds to one configuration of the replica and has $1+N$ many entries. The first entry is an integer that specifies the configuration number that, together with ensemble and replica name, may be used to uniquely identify the configuration on which the data has been obtained. The following N entries specify the deltas, i.e., the deviation of the observable from the mean value on this configuration, of each Obs
inside the structure. Multi-dimensional structures are stored in a row-major format. For primary observables, such as correlation functions, $value + delta_i$ matches the primary data obtained on the configuration.
The array cdata
contains information about the contribution of auxiliary observables, represented by Covobs
in pyerrors
, to the total error of the observables. Each entry of the array belongs to one auxiliary covariance matrix and contains:
id
, a string that identifies the covariance matrixlayout
, a string that defines the dimensions of the $M\times M$ covariance matrix (has to be "M, M" or "1").cov
, an array that contains the $M\times M$ many entries of the covariance matrix, stored in row-major format.grad
, an array that contains N entries, one for eachObs
inside the structure. Each entry itself is an array, that contains the M gradients of the Nth observable with respect to the quantity that corresponds to the Mth diagonal entry of the covariance matrix.
A JSON schema that may be used to verify the correctness of a file with respect to the format definition is stored in ./examples/json_schema.json. The schema is a self-descriptive format definition and contains an exemplary file.
Julia I/O routines for the json.gz format, compatible with ADerrors.jl, can be found here.
1r''' 2# What is pyerrors? 3`pyerrors` is a python package for error computation and propagation of Markov chain Monte Carlo data. 4It is based on the gamma method [arXiv:hep-lat/0306017](https://arxiv.org/abs/hep-lat/0306017). Some of its features are: 5- automatic differentiation for exact linear error propagation as suggested in [arXiv:1809.01289](https://arxiv.org/abs/1809.01289) (partly based on the [autograd](https://github.com/HIPS/autograd) package). 6- treatment of slow modes in the simulation as suggested in [arXiv:1009.5228](https://arxiv.org/abs/1009.5228). 7- coherent error propagation for data from different Markov chains. 8- non-linear fits with x- and y-errors and exact linear error propagation based on automatic differentiation as introduced in [arXiv:1809.01289](https://arxiv.org/abs/1809.01289). 9- real and complex matrix operations and their error propagation based on automatic differentiation (Matrix inverse, Cholesky decomposition, calculation of eigenvalues and eigenvectors, singular value decomposition...). 10 11More detailed examples can found in the [GitHub repository](https://github.com/fjosw/pyerrors/tree/develop/examples) [![badge](https://img.shields.io/badge/-try%20it%20out-579ACA.svg?logo=)](https://mybinder.org/v2/gh/fjosw/pyerrors/HEAD?labpath=examples). 12 13If you use `pyerrors` for research that leads to a publication please consider citing: 14- Fabian Joswig, Simon Kuberski, Justus T. Kuhlmann, Jan Neuendorf, *pyerrors: a python framework for error analysis of Monte Carlo data*. Comput.Phys.Commun. 288 (2023) 108750. 15- Ulli Wolff, *Monte Carlo errors with less errors*. Comput.Phys.Commun. 156 (2004) 143-153, Comput.Phys.Commun. 176 (2007) 383 (erratum). 16- Alberto Ramos, *Automatic differentiation for error analysis of Monte Carlo data*. Comput.Phys.Commun. 238 (2019) 19-35. 17 18and 19 20- Stefan Schaefer, Rainer Sommer, Francesco Virotta, *Critical slowing down and error analysis in lattice QCD simulations*. Nucl.Phys.B 845 (2011) 93-119. 21 22where applicable. 23 24There exist similar publicly available implementations of gamma method error analysis suites in [Fortran](https://gitlab.ift.uam-csic.es/alberto/aderrors), [Julia](https://gitlab.ift.uam-csic.es/alberto/aderrors.jl) and [Python](https://github.com/mbruno46/pyobs). 25 26## Installation 27 28Install the most recent release using pip and [pypi](https://pypi.org/project/pyerrors/): 29```bash 30python -m pip install pyerrors # Fresh install 31python -m pip install -U pyerrors # Update 32``` 33Install the most recent release using conda and [conda-forge](https://anaconda.org/conda-forge/pyerrors): 34```bash 35conda install -c conda-forge pyerrors # Fresh install 36conda update -c conda-forge pyerrors # Update 37``` 38Install the current `develop` version: 39```bash 40python -m pip install -U --no-deps --force-reinstall git+https://github.com/fjosw/pyerrors.git@develop 41``` 42(Also works for any feature branch). 43 44## Basic example 45 46```python 47import numpy as np 48import pyerrors as pe 49 50my_obs = pe.Obs([samples], ['ensemble_name']) # Initialize an Obs object 51my_new_obs = 2 * np.log(my_obs) / my_obs ** 2 # Construct derived Obs object 52my_new_obs.gamma_method() # Estimate the statistical error 53print(my_new_obs) # Print the result to stdout 54> 0.31498(72) 55``` 56 57# The `Obs` class 58 59`pyerrors` introduces a new datatype, `Obs`, which simplifies error propagation and estimation for auto- and cross-correlated data. 60An `Obs` object can be initialized with two arguments, the first is a list containing the samples for an observable from a Monte Carlo chain. 61The samples can either be provided as python list or as numpy array. 62The second argument is a list containing the names of the respective Monte Carlo chains as strings. These strings uniquely identify a Monte Carlo chain/ensemble. **It is crucial for the correct error propagation that observations from the same Monte Carlo history are labeled with the same name. See [Multiple ensembles/replica](#multiple-ensemblesreplica) for details.** 63 64```python 65import pyerrors as pe 66 67my_obs = pe.Obs([samples], ['ensemble_name']) 68``` 69 70## Error propagation 71 72When performing mathematical operations on `Obs` objects the correct error propagation is intrinsically taken care of using a first order Taylor expansion 73$$\delta_f^i=\sum_\alpha \bar{f}_\alpha \delta_\alpha^i\,,\quad \delta_\alpha^i=a_\alpha^i-\bar{a}_\alpha\,,$$ 74as introduced in [arXiv:hep-lat/0306017](https://arxiv.org/abs/hep-lat/0306017). 75The required derivatives $\bar{f}_\alpha$ are evaluated up to machine precision via automatic differentiation as suggested in [arXiv:1809.01289](https://arxiv.org/abs/1809.01289). 76 77The `Obs` class is designed such that mathematical numpy functions can be used on `Obs` just as for regular floats. 78 79```python 80import numpy as np 81import pyerrors as pe 82 83my_obs1 = pe.Obs([samples1], ['ensemble_name']) 84my_obs2 = pe.Obs([samples2], ['ensemble_name']) 85 86my_sum = my_obs1 + my_obs2 87 88my_m_eff = np.log(my_obs1 / my_obs2) 89 90iamzero = my_m_eff - my_m_eff 91# Check that value and fluctuations are zero within machine precision 92print(iamzero == 0.0) 93> True 94``` 95 96## Error estimation 97 98The error estimation within `pyerrors` is based on the gamma method introduced in [arXiv:hep-lat/0306017](https://arxiv.org/abs/hep-lat/0306017). 99After having arrived at the derived quantity of interest the `gamma_method` can be called as detailed in the following example. 100 101```python 102my_sum.gamma_method() 103print(my_sum) 104> 1.70(57) 105my_sum.details() 106> Result 1.70000000e+00 +/- 5.72046658e-01 +/- 7.56746598e-02 (33.650%) 107> t_int 2.71422900e+00 +/- 6.40320983e-01 S = 2.00 108> 1000 samples in 1 ensemble: 109> · Ensemble 'ensemble_name' : 1000 configurations (from 1 to 1000) 110 111``` 112The `gamma_method` is not automatically called after every intermediate step in order to prevent computational overhead. 113 114We use the following definition of the integrated autocorrelation time established in [Madras & Sokal 1988](https://link.springer.com/article/10.1007/BF01022990) 115$$\tau_\mathrm{int}=\frac{1}{2}+\sum_{t=1}^{W}\rho(t)\geq \frac{1}{2}\,.$$ 116The window $W$ is determined via the automatic windowing procedure described in [arXiv:hep-lat/0306017](https://arxiv.org/abs/hep-lat/0306017). 117The standard value for the parameter $S$ of this automatic windowing procedure is $S=2$. Other values for $S$ can be passed to the `gamma_method` as parameter. 118 119```python 120my_sum.gamma_method(S=3.0) 121my_sum.details() 122> Result 1.70000000e+00 +/- 6.30675201e-01 +/- 1.04585650e-01 (37.099%) 123> t_int 3.29909703e+00 +/- 9.77310102e-01 S = 3.00 124> 1000 samples in 1 ensemble: 125> · Ensemble 'ensemble_name' : 1000 configurations (from 1 to 1000) 126 127``` 128 129The integrated autocorrelation time $\tau_\mathrm{int}$ and the autocorrelation function $\rho(W)$ can be monitored via the methods `pyerrors.obs.Obs.plot_tauint` and `pyerrors.obs.Obs.plot_rho`. 130 131If the parameter $S$ is set to zero it is assumed that the dataset does not exhibit any autocorrelation and the window size is chosen to be zero. 132In this case the error estimate is identical to the sample standard error. 133 134### Exponential tails 135 136Slow modes in the Monte Carlo history can be accounted for by attaching an exponential tail to the autocorrelation function $\rho$ as suggested in [arXiv:1009.5228](https://arxiv.org/abs/1009.5228). The longest autocorrelation time in the history, $\tau_\mathrm{exp}$, can be passed to the `gamma_method` as parameter. In this case the automatic windowing procedure is vacated and the parameter $S$ does not affect the error estimate. 137 138```python 139my_sum.gamma_method(tau_exp=7.2) 140my_sum.details() 141> Result 1.70000000e+00 +/- 6.28097762e-01 +/- 5.79077524e-02 (36.947%) 142> t_int 3.27218667e+00 +/- 7.99583654e-01 tau_exp = 7.20, N_sigma = 1 143> 1000 samples in 1 ensemble: 144> · Ensemble 'ensemble_name' : 1000 configurations (from 1 to 1000) 145``` 146 147For the full API see `pyerrors.obs.Obs.gamma_method`. 148 149## Multiple ensembles/replica 150 151Error propagation for multiple ensembles (Markov chains with different simulation parameters) is handled automatically. Ensembles are uniquely identified by their `name`. 152 153```python 154obs1 = pe.Obs([samples1], ['ensemble1']) 155obs2 = pe.Obs([samples2], ['ensemble2']) 156 157my_sum = obs1 + obs2 158my_sum.details() 159> Result 2.00697958e+00 160> 1500 samples in 2 ensembles: 161> · Ensemble 'ensemble1' : 1000 configurations (from 1 to 1000) 162> · Ensemble 'ensemble2' : 500 configurations (from 1 to 500) 163``` 164Observables from the **same Monte Carlo chain** have to be initialized with the **same name** for correct error propagation. If different names were used in this case the data would be treated as statistically independent resulting in loss of relevant information and a potential over or under estimate of the statistical error. 165 166 167`pyerrors` identifies multiple replica (independent Markov chains with identical simulation parameters) by the vertical bar `|` in the name of the data set. 168 169```python 170obs1 = pe.Obs([samples1], ['ensemble1|r01']) 171obs2 = pe.Obs([samples2], ['ensemble1|r02']) 172 173> my_sum = obs1 + obs2 174> my_sum.details() 175> Result 2.00697958e+00 176> 1500 samples in 1 ensemble: 177> · Ensemble 'ensemble1' 178> · Replicum 'r01' : 1000 configurations (from 1 to 1000) 179> · Replicum 'r02' : 500 configurations (from 1 to 500) 180``` 181 182### Error estimation for multiple ensembles 183 184In order to keep track of different error analysis parameters for different ensembles one can make use of global dictionaries as detailed in the following example. 185 186```python 187pe.Obs.S_dict['ensemble1'] = 2.5 188pe.Obs.tau_exp_dict['ensemble2'] = 8.0 189pe.Obs.tau_exp_dict['ensemble3'] = 2.0 190``` 191 192In case the `gamma_method` is called without any parameters it will use the values specified in the dictionaries for the respective ensembles. 193Passing arguments to the `gamma_method` still dominates over the dictionaries. 194 195 196## Irregular Monte Carlo chains 197 198`Obs` objects defined on irregular Monte Carlo chains can be initialized with the parameter `idl`. 199 200```python 201# Observable defined on configurations 20 to 519 202obs1 = pe.Obs([samples1], ['ensemble1'], idl=[range(20, 520)]) 203obs1.details() 204> Result 9.98319881e-01 205> 500 samples in 1 ensemble: 206> · Ensemble 'ensemble1' : 500 configurations (from 20 to 519) 207 208# Observable defined on every second configuration between 5 and 1003 209obs2 = pe.Obs([samples2], ['ensemble1'], idl=[range(5, 1005, 2)]) 210obs2.details() 211> Result 9.99100712e-01 212> 500 samples in 1 ensemble: 213> · Ensemble 'ensemble1' : 500 configurations (from 5 to 1003 in steps of 2) 214 215# Observable defined on configurations 2, 9, 28, 29 and 501 216obs3 = pe.Obs([samples3], ['ensemble1'], idl=[[2, 9, 28, 29, 501]]) 217obs3.details() 218> Result 1.01718064e+00 219> 5 samples in 1 ensemble: 220> · Ensemble 'ensemble1' : 5 configurations (irregular range) 221 222``` 223 224`Obs` objects defined on regular and irregular histories of the same ensemble can be combined with each other and the correct error propagation and estimation is automatically taken care of. 225 226**Warning:** Irregular Monte Carlo chains can result in odd patterns in the autocorrelation functions. 227Make sure to check the autocorrelation time with e.g. `pyerrors.obs.Obs.plot_rho` or `pyerrors.obs.Obs.plot_tauint`. 228 229For the full API see `pyerrors.obs.Obs`. 230 231# Correlators 232When one is not interested in single observables but correlation functions, `pyerrors` offers the `Corr` class which simplifies the corresponding error propagation and provides the user with a set of standard methods. In order to initialize a `Corr` objects one needs to arrange the data as a list of `Obs` 233```python 234my_corr = pe.Corr([obs_0, obs_1, obs_2, obs_3]) 235print(my_corr) 236> x0/a Corr(x0/a) 237> ------------------ 238> 0 0.7957(80) 239> 1 0.5156(51) 240> 2 0.3227(33) 241> 3 0.2041(21) 242``` 243In case the correlation functions are not defined on the outermost timeslices, for example because of fixed boundary conditions, a padding can be introduced. 244```python 245my_corr = pe.Corr([obs_0, obs_1, obs_2, obs_3], padding=[1, 1]) 246print(my_corr) 247> x0/a Corr(x0/a) 248> ------------------ 249> 0 250> 1 0.7957(80) 251> 2 0.5156(51) 252> 3 0.3227(33) 253> 4 0.2041(21) 254> 5 255``` 256The individual entries of a correlator can be accessed via slicing 257```python 258print(my_corr[3]) 259> 0.3227(33) 260``` 261Error propagation with the `Corr` class works very similar to `Obs` objects. Mathematical operations are overloaded and `Corr` objects can be computed together with other `Corr` objects, `Obs` objects or real numbers and integers. 262```python 263my_new_corr = 0.3 * my_corr[2] * my_corr * my_corr + 12 / my_corr 264``` 265 266`pyerrors` provides the user with a set of regularly used methods for the manipulation of correlator objects: 267- `Corr.gamma_method` applies the gamma method to all entries of the correlator. 268- `Corr.m_eff` to construct effective masses. Various variants for periodic and fixed temporal boundary conditions are available. 269- `Corr.deriv` returns the first derivative of the correlator as `Corr`. Different discretizations of the numerical derivative are available. 270- `Corr.second_deriv` returns the second derivative of the correlator as `Corr`. Different discretizations of the numerical derivative are available. 271- `Corr.symmetric` symmetrizes parity even correlations functions, assuming periodic boundary conditions. 272- `Corr.anti_symmetric` anti-symmetrizes parity odd correlations functions, assuming periodic boundary conditions. 273- `Corr.T_symmetry` averages a correlator with its time symmetry partner, assuming fixed boundary conditions. 274- `Corr.plateau` extracts a plateau value from the correlator in a given range. 275- `Corr.roll` periodically shifts the correlator. 276- `Corr.reverse` reverses the time ordering of the correlator. 277- `Corr.correlate` constructs a disconnected correlation function from the correlator and another `Corr` or `Obs` object. 278- `Corr.reweight` reweights the correlator. 279 280`pyerrors` can also handle matrices of correlation functions and extract energy states from these matrices via a generalized eigenvalue problem (see `pyerrors.correlators.Corr.GEVP`). 281 282For the full API see `pyerrors.correlators.Corr`. 283 284# Complex valued observables 285 286`pyerrors` can handle complex valued observables via the class `pyerrors.obs.CObs`. 287`CObs` are initialized with a real and an imaginary part which both can be `Obs` valued. 288 289```python 290my_real_part = pe.Obs([samples1], ['ensemble1']) 291my_imag_part = pe.Obs([samples2], ['ensemble1']) 292 293my_cobs = pe.CObs(my_real_part, my_imag_part) 294my_cobs.gamma_method() 295print(my_cobs) 296> (0.9959(91)+0.659(28)j) 297``` 298 299Elementary mathematical operations are overloaded and samples are properly propagated as for the `Obs` class. 300```python 301my_derived_cobs = (my_cobs + my_cobs.conjugate()) / np.abs(my_cobs) 302my_derived_cobs.gamma_method() 303print(my_derived_cobs) 304> (1.668(23)+0.0j) 305``` 306 307# The `Covobs` class 308In many projects, auxiliary data that is not based on Monte Carlo chains enters. Examples are experimentally determined mesons masses which are used to set the scale or renormalization constants. These numbers come with an error that has to be propagated through the analysis. The `Covobs` class allows to define such quantities in `pyerrors`. Furthermore, external input might consist of correlated quantities. An example are the parameters of an interpolation formula, which are defined via mean values and a covariance matrix between all parameters. The contribution of the interpolation formula to the error of a derived quantity therefore might depend on the complete covariance matrix. 309 310This concept is built into the definition of `Covobs`. In `pyerrors`, external input is defined by $M$ mean values, a $M\times M$ covariance matrix, where $M=1$ is permissible, and a name that uniquely identifies the covariance matrix. Below, we define the pion mass, based on its mean value and error, 134.9768(5). **Note, that the square of the error enters `cov_Obs`**, since the second argument of this function is the covariance matrix of the `Covobs`. 311 312```python 313import pyerrors.obs as pe 314 315mpi = pe.cov_Obs(134.9768, 0.0005**2, 'pi^0 mass') 316mpi.gamma_method() 317mpi.details() 318> Result 1.34976800e+02 +/- 5.00000000e-04 +/- 0.00000000e+00 (0.000%) 319> pi^0 mass 5.00000000e-04 320> 0 samples in 1 ensemble: 321> · Covobs 'pi^0 mass' 322``` 323The resulting object `mpi` is an `Obs` that contains a `Covobs`. In the following, it may be handled as any other `Obs`. The contribution of the covariance matrix to the error of an `Obs` is determined from the $M \times M$ covariance matrix $\Sigma$ and the gradient of the `Obs` with respect to the external quantities, which is the $1\times M$ Jacobian matrix $J$, via 324$$s = \sqrt{J^T \Sigma J}\,,$$ 325where the Jacobian is computed for each derived quantity via automatic differentiation. 326 327Correlated auxiliary data is defined similarly to above, e.g., via 328```python 329RAP = pe.cov_Obs([16.7457, -19.0475], [[3.49591, -6.07560], [-6.07560, 10.5834]], 'R_AP, 1906.03445, (5.3a)') 330print(RAP) 331> [Obs[16.7(1.9)], Obs[-19.0(3.3)]] 332``` 333where `RAP` now is a list of two `Obs` that contains the two correlated parameters. 334 335Since the gradient of a derived observable with respect to an external covariance matrix is propagated through the entire analysis, the `Covobs` class allows to quote the derivative of a result with respect to the external quantities. If these derivatives are published together with the result, small shifts in the definition of external quantities, e.g., the definition of the physical point, can be performed a posteriori based on the published information. This may help to compare results of different groups. The gradient of an `Obs` `o` with respect to a covariance matrix with the identifying string `k` may be accessed via 336```python 337o.covobs[k].grad 338``` 339 340# Error propagation in iterative algorithms 341 342`pyerrors` supports exact linear error propagation for iterative algorithms like various variants of non-linear least squares fits or root finding. The derivatives required for the error propagation are calculated as described in [arXiv:1809.01289](https://arxiv.org/abs/1809.01289). 343 344## Least squares fits 345 346Standard non-linear least square fits with errors on the dependent but not the independent variables can be performed with `pyerrors.fits.least_squares`. As default solver the Levenberg-Marquardt algorithm implemented in [scipy](https://docs.scipy.org/doc/scipy/reference/generated/scipy.optimize.least_squares.html) is used. 347 348Fit functions have to be of the following form 349```python 350import autograd.numpy as anp 351 352def func(a, x): 353 return a[1] * anp.exp(-a[0] * x) 354``` 355**It is important that numerical functions refer to `autograd.numpy` instead of `numpy` for the automatic differentiation in iterative algorithms to work properly.** 356 357Fits can then be performed via 358```python 359fit_result = pe.fits.least_squares(x, y, func) 360print("\n", fit_result) 361> Fit with 2 parameters 362> Method: Levenberg-Marquardt 363> `ftol` termination condition is satisfied. 364> chisquare/d.o.f.: 0.9593035785160936 365 366> Goodness of fit: 367> χ²/d.o.f. = 0.959304 368> p-value = 0.5673 369> Fit parameters: 370> 0 0.0548(28) 371> 1 1.933(64) 372``` 373where x is a `list` or `numpy.array` of `floats` and y is a `list` or `numpy.array` of `Obs`. 374 375Data stored in `Corr` objects can be fitted directly using the `Corr.fit` method. 376```python 377my_corr = pe.Corr(y) 378fit_result = my_corr.fit(func, fitrange=[12, 25]) 379``` 380this can simplify working with absolute fit ranges and takes care of gaps in the data automatically. 381 382For fit functions with multiple independent variables the fit function can be of the form 383 384```python 385def func(a, x): 386 (x1, x2) = x 387 return a[0] * x1 ** 2 + a[1] * x2 388``` 389 390`pyerrors` also supports correlated fits which can be triggered via the parameter `correlated_fit=True`. 391Details about how the required covariance matrix is estimated can be found in `pyerrors.obs.covariance`. 392Direct visualizations of the performed fits can be triggered via `resplot=True` or `qqplot=True`. 393 394For all available options including combined fits to multiple datasets see `pyerrors.fits.least_squares`. 395 396## Total least squares fits 397`pyerrors` can also fit data with errors on both the dependent and independent variables using the total least squares method also referred to as orthogonal distance regression as implemented in [scipy](https://docs.scipy.org/doc/scipy/reference/odr.html), see `pyerrors.fits.least_squares`. The syntax is identical to the standard least squares case, the only difference being that `x` also has to be a `list` or `numpy.array` of `Obs`. 398 399For the full API see `pyerrors.fits` for fits and `pyerrors.roots` for finding roots of functions. 400 401# Matrix operations 402`pyerrors` provides wrappers for `Obs`- and `CObs`-valued matrix operations based on `numpy.linalg`. The supported functions include: 403- `inv` for the matrix inverse. 404- `cholseky` for the Cholesky decomposition. 405- `det` for the matrix determinant. 406- `eigh` for eigenvalues and eigenvectors of hermitean matrices. 407- `eig` for eigenvalues of general matrices. 408- `pinv` for the Moore-Penrose pseudoinverse. 409- `svd` for the singular-value-decomposition. 410 411For the full API see `pyerrors.linalg`. 412 413# Export data 414 415[<img src="https://imgs.xkcd.com/comics/standards_2x.png" width="75%" height="75%">](https://xkcd.com/927/) 416 417The preferred exported file format within `pyerrors` is json.gz. Files written to this format are valid JSON files that have been compressed using gzip. The structure of the content is inspired by the dobs format of the ALPHA collaboration. The aim of the format is to facilitate the storage of data in a self-contained way such that, even years after the creation of the file, it is possible to extract all necessary information: 418- What observables are stored? Possibly: How exactly are they defined. 419- How does each single ensemble or external quantity contribute to the error of the observable? 420- Who did write the file when and on which machine? 421 422This can be achieved by storing all information in one single file. The export routines of `pyerrors` are written such that as much information as possible is written automatically as described in the following example 423```python 424my_obs = pe.Obs([samples], ["test_ensemble"]) 425my_obs.tag = "My observable" 426 427pe.input.json.dump_to_json(my_obs, "test_output_file", description="This file contains a test observable") 428# For a single observable one can equivalently use the class method dump 429my_obs.dump("test_output_file", description="This file contains a test observable") 430 431check = pe.input.json.load_json("test_output_file") 432 433print(my_obs == check) 434> True 435``` 436The format also allows to directly write out the content of `Corr` objects or lists and arrays of `Obs` objects by passing the desired data to `pyerrors.input.json.dump_to_json`. 437 438## json.gz format specification 439The first entries of the file provide optional auxiliary information: 440- `program` is a string that indicates which program was used to write the file. 441- `version` is a string that specifies the version of the format. 442- `who` is a string that specifies the user name of the creator of the file. 443- `date` is a string and contains the creation date of the file. 444- `host` is a string and contains the hostname of the machine where the file has been written. 445- `description` contains information on the content of the file. This field is not filled automatically in `pyerrors`. The user is advised to provide as detailed information as possible in this field. Examples are: Input files of measurements or simulations, LaTeX formulae or references to publications to specify how the observables have been computed, details on the analysis strategy, ... This field may be any valid JSON type. Strings, arrays or objects (equivalent to dicts in python) are well suited to provide information. 446 447The only necessary entry of the file is the field 448-`obsdata`, an array that contains the actual data. 449 450Each entry of the array belongs to a single structure of observables. Currently, these structures can be either of `Obs`, `list`, `numpy.ndarray`, `Corr`. All `Obs` inside a structure (with dimension > 0) have to be defined on the same set of configurations. Different structures, that are represented by entries of the array `obsdata`, are treated independently. Each entry of the array `obsdata` has the following required entries: 451- `type` is a string that specifies the type of the structure. This allows to parse the content to the correct form after reading the file. It is always possible to interpret the content as list of Obs. 452- `value` is an array that contains the mean values of the Obs inside the structure. 453The following entries are optional: 454- `layout` is a string that specifies the layout of multi-dimensional structures. Examples are "2, 2" for a 2x2 dimensional matrix or "64, 4, 4" for a Corr with $T=64$ and 4x4 matrices on each time slices. "1" denotes a single Obs. Multi-dimensional structures are stored in row-major format (see below). 455- `tag` is any JSON type. It contains additional information concerning the structure. The `tag` of an `Obs` in `pyerrors` is written here. 456- `reweighted` is a Bool that may be used to specify, whether the `Obs` in the structure have been reweighted. 457- `data` is an array that contains the data from MC chains. We will define it below. 458- `cdata` is an array that contains the data from external quantities with an error (`Covobs` in `pyerrors`). We will define it below. 459 460The array `data` contains the data from MC chains. Each entry of the array corresponds to one ensemble and contains: 461- `id`, a string that contains the name of the ensemble 462- `replica`, an array that contains an entry per replica of the ensemble. 463 464Each entry of `replica` contains 465`name`, a string that contains the name of the replica 466`deltas`, an array that contains the actual data. 467 468Each entry in `deltas` corresponds to one configuration of the replica and has $1+N$ many entries. The first entry is an integer that specifies the configuration number that, together with ensemble and replica name, may be used to uniquely identify the configuration on which the data has been obtained. The following N entries specify the deltas, i.e., the deviation of the observable from the mean value on this configuration, of each `Obs` inside the structure. Multi-dimensional structures are stored in a row-major format. For primary observables, such as correlation functions, $value + delta_i$ matches the primary data obtained on the configuration. 469 470The array `cdata` contains information about the contribution of auxiliary observables, represented by `Covobs` in `pyerrors`, to the total error of the observables. Each entry of the array belongs to one auxiliary covariance matrix and contains: 471- `id`, a string that identifies the covariance matrix 472- `layout`, a string that defines the dimensions of the $M\times M$ covariance matrix (has to be "M, M" or "1"). 473- `cov`, an array that contains the $M\times M$ many entries of the covariance matrix, stored in row-major format. 474- `grad`, an array that contains N entries, one for each `Obs` inside the structure. Each entry itself is an array, that contains the M gradients of the Nth observable with respect to the quantity that corresponds to the Mth diagonal entry of the covariance matrix. 475 476A JSON schema that may be used to verify the correctness of a file with respect to the format definition is stored in ./examples/json_schema.json. The schema is a self-descriptive format definition and contains an exemplary file. 477 478Julia I/O routines for the json.gz format, compatible with [ADerrors.jl](https://gitlab.ift.uam-csic.es/alberto/aderrors.jl), can be found [here](https://github.com/fjosw/ADjson.jl). 479''' 480from .obs import * 481from .correlators import * 482from .fits import * 483from .misc import * 484from . import dirac as dirac 485from . import input as input 486from . import linalg as linalg 487from . import mpm as mpm 488from . import roots as roots 489from . import integrate as integrate 490from . import special as special 491 492from .version import __version__ as __version__