Updated documentation. Added README

9191f383 · Alberto Ramos · a54113fa · 9191f383 · 9191f383 · a54113fa
Commit 9191f383 authored Jul 14, 2020 by Alberto Ramos
7 changed files
--- a/README.md
+++ b/README.md
+# ADerrors.jl
+
+Error propagation and analysis of Monte Carlo data with the (``\Gamma``) method and automatic differentiation in `Julia`
+
+The full documentation of the package is available via the usual
+[Julia `REPL` help
+mode](https://docs.julialang.org/en/v1/stdlib/REPL/#Help-mode-1) and
+online in [HTML format](https://ific.uv.es/~alramos/docs/ADerrors/).
+
+This work is an implementation of several ideas in data analysis. If you use this package for your scientific work, please consider citing:
+- U. Wolff, "Monte Carlo errors with less errors".
+  Comput.Phys.Commun. 156 (2004) 143-153. DOI: 10.1016/S0010-4655(03)00467-3
+- F. Virotta, "Critical slowing down and error analysis of lattice QCD simulations." PhD thesis.
+- Stefan Schaefer, Rainer Sommer, Francesco Virotta, "Critical slowing
+  down and error analysis in lattice QCD simulations". Nucl.Phys.B 845 (2011) 93-119.
+- A. Ramos, "Automatic differentiation for error analysis of Monte Carlo data". Comput.Phys.Commun. 238 (2019) 19-35. DOI: 10.1016/j.cpc.2018.12.020. 
+- M. Bruno, R. Sommer, In preparation.
+
+## Installation
+
+The package in not in the general registry. Still one can use the package manager
+```julia
+julia> import Pkg
+(v1.1) pkg> add https://gitlab.ift.uam-csic.es/alberto/aderrors.jl
+```
+
+## Tutorial
+
+It is better to start with the [Getting started](https://ific.uv.es/~alramos/docs/ADerrors/tutorial/) guide.
+
+
+
--- a/docs/make.jl
+++ b/docs/make.jl
@@ -2,7 +2,8 @@ using Documenter, ADerrors

 makedocs(modules=[ADerrors], doctest=true,
         pages = [
-             "ADerrors" => "index.md",
+             "Getting Started" => "tutorial.md", 
+             "API" => "api.md",
             "Contents" => "toc.md"
             ], 
         sitename = "ADerrors.jl",

--- a/docs/src/index.md
+++ b/docs/src/index.md
-# ADerrors.jl
-
-This package implementes error analysis of Monte Carlo data with the (``\Gamma``) method and automatic differentiation for error propagation.
-```@contents
-Pages = ["index.md"]
-Depth = 3
-```
-
-## Getting started
-
-`ADerrors.jl` is a package for error propagation and analysis of Monte carlo data. At the core of the package is the `uwreal` data type, that is able to store variables with uncertainties
-```@repl gs
-using ADerrors
-a = uwreal([1.0, 0.1], 1) # 1.0 +/- 0.1
-```
-It can also store MC data
-```@repl gs
-# Generate some correlated data
-eta  = randn(1000);
-x    = Vector{Float64}(undef, 1000);
-x[1] = 0.0;
-for i in 2:1000
-    x[i] = x[i-1] + eta[i]
-    if abs(x[i]) > 1.0
-        x[i] = x[i-1]
-    end
-end
-
-b = uwreal(x.^2, 200)
-c = uwreal(x.^4, 200)
-```
-Correlations between variables are treated consistently. This requires that each variable that is defined with `uwreal` contains an ensemble `ID` tag. In the previous examples `a` has been measured on ensemble `ID` 1, while both `b` and `c` have been measured on ensemble `ID` 200. This will treat the measurements in `b` and `c` as statistically correlated, while `a` will be uncorrelated with both `b` and `c`.
-
-One can perform operations with `uwreal` variables as if there were normal floats
-```@repl gs
-d = 2.0 + sin(b)/a + 2.0*log(c)/(a+b+1.0)
-```
-Error propagation is done automatically, and correlations are consistently taken into account. Note that once complex derived `uwreal` variables are defined, as for example `d` above, they will in general depend on more than one ensemble `ID`. 
-
-In order to perform the error analysis of a variable, one should use the `uwerr` function
-```@repl gs
-uwerr(d);
-println("d: ", d)
-```
-One can get detailed information on the error of a variables 
-```@repl gs
-uwerr(d);
-println("Details on variable d: ")
-details(d)
-```
-where we can clearly see that there are two ensembles contributing to the uncertainty in `d`. We recognize this as `d` being a function of both `a` (measured on ensemble 1), and `b` and `c`, measured on ensemble 200.
-
-Note that one does not need to use `uwerr` on a variable unless one is interested in the error on that variable. For example
-```@repl gs
-global x = 1.0
-for i in 1:100
-	global x = x - (d*cos(x) - x)/(-d*sin(x) - 1.0)
-end
-uwerr(x)
-print("Root of d*cos(x) - x:" )
-details(x)
-```
-determines the root on ``d\cos(x) - x`` by using Newton's method, and propagates the error on `d` into an error on the root. The error on `x` is only determined once, after the 100 iterations are over. 
-
-### Advanced topics
-
-#### Errors in fit parameters
-
-`ADerrors.jl` does not provide an interface to perform fits, but once the minima of the ``\chi^2`` has been found, it can help propagating the error from the data to the fit parameters. 
-
-Here we are going to [repeat the example](https://github.com/JuliaNLSolvers/LsqFit.jl) of the package `LsqFit.jl` using `ADerrors.jl` for error propagation.
-```@repl fits
-using LsqFit, ADerrors
-
-# a two-parameter exponential model
-# x: array of independent variables
-# p: array of model parameters
-# model(x, p) will accept the full data set as the first argument `x`.
-# This means that we need to write our model function so it applies
-# the model to the full dataset. We use `@.` to apply the calculations
-# across all rows.
-@. model(x, p) = p[1]*exp(-x*p[2])
-
-# some example data
-# xdata: independent variables
-# ydata: dependent variable as uwreal
-xdata = range(0, stop=10, length=20);
-ydata = Vector{uwreal}(undef, length(xdata));
-for i in eachindex(ydata)
-	ydata[i] = uwreal([model(xdata[i], [1.0 2.0]) + 0.01*getindex(randn(1),1), 0.01], i)
-	uwerr(ydata[i]) # We will need the errors for the weights of the fit
-end
-p0 = [0.5, 0.5];
-```
-We are ready to fit `(xdata, ydata)` to our model using `LsqFit.jl`, but for error propagation using `ADerrors.jl` we need the ``\chi^2`` function.
-```@repl fits
-# ADerrors will need the chi^2 as a function of the fit parameters and 
-# the data. This can be constructed easily from the model above
-chisq(p, d) = sum( (d .- model(xdata, p)) .^2 ./ 0.01^2)
-```
-Now we can fit the data and compute the uncertainties in our fit parameters
-```@repl fits
-fit = curve_fit(model, xdata, value.(ydata), 1.0 ./ err.(ydata).^2, p0) # This is LsqFit.jl
-(fitp, cexp) = fit_error(chisq, coef(fit), ydata); # This is error propagation with ADerrors.jl
-uwerr.(fitp);
-
-println("chi^2 / chi^2_exp: ", sum(fit.resid .^2), " / ", cexp, " (dof: ", dof(fit), ")")
-for i in 1:2
-	println("Fit parameter: ", i, ": ", fitp[i])
-end
-```
-
-!!! note 
-    `ADerrors.jl` is completely agnostic about fitting the data. Error propagation is performed once the minimum of the ``\chi^2`` function is known, and it does not matter how this minima is found. One is not constrained to use `LsqFit.jl`, as there are many alternatives in the `Julia` ecosystem: `LeastSquaresOptim.jl`, `MINPACK.jl`, `Optim.jl`, ...
-
-
-#### Missing measurements in one ensemble
-
-`ADerrors.jl` can deal with observables that are not measured on every configuration, and still take correctly the correlations/autocorrelations into account. Here we show an extreme case where one observable is measured only in the even configurations, while the oher is measured on the odd coficurations.
-```@repl gaps
-using ADerrors, Plots
-pgfplotsx();
-
-# Generate some correlated data
-eta  = randn(10000);
-x    = Vector{Float64}(undef, 10000);
-x[1] = 0.0;
-for i in 2:10000
-    x[i] = x[i-1] + 0.2*eta[i]
-    if abs(x[i]) > 1.0
-        x[i] = x[i-1]
-    end
-end
-
-# x^2 only measured on odd configurations
-x2 = uwreal(x[1:2:9999].^2,  1001, collect(1:2:9999),  10000)
-# x^4 only measured on even configurations
-x4 = uwreal(x[2:2:10000].^4, 1001, collect(2:2:10000), 10000)
-
-rat = x2/x4
-uwerr(rat);
-```
-In this case `uwerr` complains because with the chosen window the variance for ensemble with ID 1001 is negative. Looking at the normalized autocorrelation function we can easily spot the problem
-```@repl gaps
-iw = window(rat, 1001)
-r  = rho(rat, 1001);
-dr = drho(rat, 1001);
-plot(collect(1:100), 
-	r[1:100], 
-	yerr = dr[1:100], 
-	seriestype = :scatter, title = "Chosen Window: " * string(iw))
-savefig("rat_cf.png") # hide
-```
-![rat plot](rat_cf.png)
-The normalized autocorrelation function is oscillating, and the chosen window is 2! We better fix the window to 50 for this case
-```@repl gaps
-wpm = Dict{Int64, Vector{Float64}}()
-wpm[1001] = [50.0, -1.0, -1.0, -1.0]
-uwerr(rat, wpm)
-println("Ratio: ", rat)
-```
-
-Note however that this is very observable dependent
-```@repl gaps
-prod = x2*x4
-uwerr(prod)
-iw = window(prod, 1001)
-r  = rho(prod, 1001);
-dr = drho(prod, 1001);
-plot(collect(1:2*iw), 
-	r[1:2*iw], 
-	yerr = dr[1:2*iw], 
-	seriestype = :scatter, title = "Chosen Window: " * string(iw))
-savefig("prod_cf.png") # hide
-```
-![pod plot](prod_cf.png)
-In general error analysis with data with arbitrary gaps is possible, and fully supported in `ADerrors.jl`, but it can be tricky, and certaiinly requires to examine the data carefully.
-
-## Creating `uwerr` data types
-
-```@docs
-uwreal
-cobs
-```
-
-## I/O
-
-```@docs
-write_uwreal
-read_uwreal
-```
-
-## Error analysis 
-
-```@docs
-uwerr
-cov
-trcov
-```
-
-## Information on error analysis
-
-There are several methods to get information about the error analysis of a `uwreal` variable. With the exception of `value`, these methods require that a proper error analysis has been performed via a call to the `uwerr` method.
-
-```@docs
-value
-err
-derror
-taui
-dtaui
-rho
-drho
-window
-details
-neid
-```
-
-## Error propagation in iterative algorithms
-
-### Finding root of functions
-
-```@docs
-root_error
-```
-
-### Fits
-
-`ADerrors.jl` is agnostic about how you approach a fit (i.e. minimize a ``\chi^2`` function), but once the central values of the fit parameters have been found (i.e. using `LsqFit.jl`, `LeastSquaresOptim.jl`, etc... ) `ADerrors.jl` is able to determine the error of the fit parameters, returning them as `uwreal` type. It can also determine the expected value of your ``\chi^2`` given the correlation in your fit parameters.
-
-```@docs
-fit_error
-chiexp
-```
-
-### Integrals
-
-```@docs
-int_error
-```
-
-## Index 
-
-```@index
-```
--- a/docs/src/toc.md
+++ b/docs/src/toc.md

+# Contents
+
+## Contents
+
 ```@contents
-Pages = ["index.md"]
+Pages = ["tutorial.md", "api.md"]
 Depth = 3
 ```
+
+## Index 
+
+```@index
+```
--- a/src/ADerrorsCF.jl
+++ b/src/ADerrorsCF.jl
@@ -838,6 +838,7 @@ M = [1.0 0.2 0.1
 mcov = cov(x)
 d = tr(mcov * M)
 println("Better be zero: ", d -trcov(M, x))
+```
 """
 trcov(M, a::Vector{uwreal}) = trcov(M, a::Vector{uwreal}, wsg, empt)
 trcov(M, a::Vector{uwreal}, wpm::Dict{Int64,Vector{Float64}}) = trcov(M, a::Vector{uwreal}, wsg, wpm::Dict{Int64,Vector{Float64}})

--- a/src/ADerrorsTypes.jl
+++ b/src/ADerrorsTypes.jl
@@ -90,13 +90,13 @@ uwreal(v::Float64, prop::Vector{Bool}, der::Vector{Float64}) = uwreal(v, 0.0, 0.
 function Base.show(io::IO, a::uwreal)
    
    if (length(a.prop) == 0)
-        print(a.mean)
+        print(io, a.mean)
        return
    end
    
    if (length(a.cfd) > 0) 
-        print(a.mean, " +/- ", a.err)
+        print(io, a.mean, " +/- ", a.err)
    else
-        print(a.mean, " (Error not available... maybe run uwerr)")
+        print(io, a.mean, " (Error not available... maybe run uwerr)")
    end
 end
--- a/src/ADerrorsUtils.jl
+++ b/src/ADerrorsUtils.jl
@@ -149,6 +149,7 @@ xp = [sum(value.(dt) ./ dx)/sum(1.0 ./ dx)]

 # Compare chi^2 and expected chi^2
 println("chi^2 / chi_exp^2: ", chisq(xp, value.(dt)), " / ", chiexp(chisq, xp, dt))
+```
 """
 function chiexp(chisq::Function,
                xp::Vector{Float64},