Probabilities

Probabilities API

The probabilities API is defined by

ProbabilitiesEstimator
probabilities
probabilities_and_outcomes

and related functions that you will find in the following documentation blocks:

Probabilitities

ComplexityMeasures.ProbabilitiesEstimator — Type

ProbabilitiesEstimator

The supertype for all probabilities estimators.

In ComplexityMeasures.jl, probability distributions are estimated from data by defining a set of possible outcomes $\Omega = \{\omega_1, \omega_2, \ldots, \omega_L \}$, and assigning to each outcome $\omega_i$ a probability $p(\omega_i)$, such that $\sum_{i=1}^N p(\omega_i) = 1$. It is the role of a ProbabilitiesEstimator to

Define $\Omega$, the "outcome space", which is the set of all possible outcomes over which probabilities are estimated. The cardinality of this set can be obtained using total_outcomes.
Define how probabilities $p_i = p(\omega_i)$ are assigned to outcomes $\omega_i$.

In practice, probability estimation is done by calling probabilities with some input data and one of the following probabilities estimators. The result is a Probabilities p (Vector-like), where each element p[i] is the probability of the outcome ω[i]. Use probabilities_and_outcomes if you need both the probabilities and the outcomes, and use outcome_space to obtain $\Omega$ alone. The element type of $\Omega$ varies between estimators, but it is guranteed to be hashable. This allows for conveniently tracking the probability of a specific event across experimental realizations, by using the outcome as a dictionary key and the probability as the value for that key (or, alternatively, the key remains the outcome and one has a vector of probabilities, one for each experimental realization).

Some estimators can deduce $\Omega$ without knowledge of the input, such as SymbolicPermutation. For others, knowledge of input is necessary for concretely specifying $\Omega$, such as ValueHistogram with RectangularBinning. This only matters for the functions outcome_space and total_outcomes.

All currently implemented probability estimators are listed in a nice table in the probabilities estimators section of the online documentation.

Estimator	Principle	Input data
`CountOccurrences`	Count of unique elements	`Any`
`ValueHistogram`	Binning (histogram)	`Vector`, `Dataset`
`TransferOperator`	Binning (transfer operator)	`Vector`, `Dataset`
`NaiveKernel`	Kernel density estimation	`Dataset`
`SymbolicPermutation`	Ordinal patterns	`Vector`, `Dataset`
`SymbolicWeightedPermutation`	Ordinal patterns	`Vector`, `Dataset`
`SymbolicAmplitudeAwarePermutation`	Ordinal patterns	`Vector`, `Dataset`
`SpatialSymbolicPermutation`	Ordinal patterns in space	`Array`
`Dispersion`	Dispersion patterns	`Vector`
`SpatialDispersion`	Dispersion patterns in space	`Array`
`Diversity`	Cosine similarity	`Vector`
`WaveletOverlap`	Wavelet transform	`Vector`
`PowerSpectrum`	Fourier transform	`Vector`