skpro.distributions.Mixture#
- class skpro.distributions.Mixture(distributions, weights=None, indep_rows=True, indep_cols=True, index=None, columns=None)[source]#
Mixture of distributions.
- Parameters:
- distributionslist of tuples (str, BaseDistribution) or BaseDistribution
list of mixture components
- weightslist of float, optional, default = None
list of mixture weights, will be normalized to sum to 1 if not provided, uniform mixture is assumed
- indep_rowsbool, optional, default = True
if True, rows are sampled independently from the mixture components. If False, the same component is used for all rows. Relevant only in
samplemethod and non-marginal outputs.- indep_colsbool, optional, default = True
if True, columns are sampled independently from the mixture components. If False, the same component is used for all columns. Relevant only in
samplemethod and non-marginal outputs.- indexpd.Index, optional, default = inferred from component distributions
- columnspd.Index, optional, default = inferred from component distributions
- Attributes:
atInteger location indexer, for single index.
iatInteger location indexer, for single index.
ilocInteger location indexer, for groups of indices.
locLocation indexer, for groups of indices.
nameReturn the name of the object or estimator.
ndimNumber of dimensions of self.
shapeShape of self, a pair (2-tuple).
Methods
cdf(x)Cumulative distribution function.
clone()Obtain a clone of the object with same hyper-parameters and config.
clone_tags(estimator[, tag_names])Clone tags from another object as dynamic override.
create_test_instance([parameter_set])Construct an instance of the class, using first test parameter set.
create_test_instances_and_names([parameter_set])Create list of all test instances and a list of names for them.
energy([x])Energy of self, w.r.t.
get_class_tag(tag_name[, tag_value_default])Get class tag value from class, with tag level inheritance from parents.
Get class tags from class, with tag level inheritance from parent classes.
Get config flags for self.
Get object's parameter defaults.
get_param_names([sort])Get object's parameter names.
get_params([deep])Get a dict of parameters values for this object.
Return distribution parameters in a dict of DataFrame.
get_tag(tag_name[, tag_value_default, ...])Get tag value from instance, with tag level inheritance and overrides.
get_tags()Get tags from instance, with tag level inheritance and overrides.
get_test_params([parameter_set])Return testing parameter settings for the estimator.
haz(x)Hazard function.
head([n])Return the first n rows.
Check if the object is composite.
log_pdf(x)Logarithmic probability density function.
log_pmf(x)Logarithmic probability mass function.
mean()Return expected value of the distribution.
pdf(x)Probability density function.
pdfnorm([a])a-norm of pdf, defaults to 2-norm.
plot([fun, ax])Plot the distribution.
pmf(x)Probability mass function.
ppf(p)Quantile function = percent point function = inverse cdf.
quantile(alpha)Return entry-wise quantiles, in Proba/pred_quantiles mtype format.
reset()Reset the object to a clean post-init state.
sample([n_samples])Sample from the distribution.
set_config(**config_dict)Set config flags to given values.
set_params(**kwargs)Set the object's direct parameters and the parameters of components.
set_random_state([random_state, deep, ...])Set random_state pseudo-random seed parameters for self.
set_tags(**tag_dict)Set instance level tag overrides to given values.
surv(x)Survival function.
tail([n])Return the last n rows.
to_df()Return distribution parameters as a single DataFrame.
to_str()Return string representation of self.
var()Return element/entry-wise variance of the distribution.
Examples
>>> from skpro.distributions.mixture import Mixture >>> from skpro.distributions.normal import Normal
>>> n1 = Normal(mu=[[0, 1], [2, 3], [4, 5]], sigma=1) >>> n2 = Normal(mu=3, sigma=2, index=n1.index, columns=n1.columns) >>> m = Mixture(distributions=[("n1", n1), ("n2", n2)], weights=[0.3, 0.7]) >>> mixture_sample = m.sample(n_samples=10)
- property at[source]#
Integer location indexer, for single index.
Use
my_distribution.at[index]forpandas-like row/column subsetting ofBaseDistributiondescendants.indexcan be anypandasatcompatible index subsetter.my_distribution.at[index]ormy_distribution.at[row_index, col_index]subsetmy_distributionto the row selected byrow_index, col bycol_index, to exactly the same col/rows aspandasatwould subset rows inmy_distribution.indexand columns inmy_distribution.columns.
- cdf(x)[source]#
Cumulative distribution function.
Let \(X\) be a random variables with the distribution of
self, taking values in(N, n)DataFrame-s Let \(x\in \mathbb{R}^{N\times n}\). By \(F_{X_{ij}}\), denote the marginal cdf of \(X\) at the \((i,j)\)-th entry, i.e., \(F_{X_{ij}}(t) = \mathbb{P}(X_{ij} \leq t)\).The output of this method, for input
xrepresenting \(x\), is aDataFramewith same columns and indices asself, and entries \(F_{X_{ij}}(x_{ij})\).- Parameters:
- x
pandas.DataFrameor 2Dnp.ndarray representing \(x\), as above
- x
- Returns:
pd.DataFramewith same columns and index asselfcontaining \(F_{X_{ij}}(x_{ij})\), as above
- clone()[source]#
Obtain a clone of the object with same hyper-parameters and config.
A clone is a different object without shared references, in post-init state. This function is equivalent to returning
sklearn.cloneofself.Equivalent to constructing a new instance of
type(self), with parameters ofself, that is,type(self)(**self.get_params(deep=False)).If configs were set on
self, the clone will also have the same configs as the original, equivalent to callingcloned_self.set_config(**self.get_config()).Also equivalent in value to a call of
self.reset, with the exception thatclonereturns a new object, instead of mutatingselflikereset.- Raises:
- RuntimeError if the clone is non-conforming, due to faulty
__init__.
- RuntimeError if the clone is non-conforming, due to faulty
- clone_tags(estimator, tag_names=None)[source]#
Clone tags from another object as dynamic override.
Every
scikit-basecompatible object has a dictionary of tags. Tags may be used to store metadata about the object, or to control behaviour of the object.Tags are key-value pairs specific to an instance
self, they are static flags that are not changed after construction of the object.clone_tagssets dynamic tag overrides from another object,estimator.The
clone_tagsmethod should be called only in the__init__method of an object, during construction, or directly after construction via__init__.The dynamic tags are set to the values of the tags in
estimator, with the names specified intag_names.The default of
tag_nameswrites all tags fromestimatortoself.Current tag values can be inspected by
get_tagsorget_tag.- Parameters:
- estimatorAn instance of :class:BaseObject or derived class
- tag_namesstr or list of str, default = None
Names of tags to clone. The default (
None) clones all tags fromestimator.
- Returns:
- self
Reference to
self.
- classmethod create_test_instance(parameter_set='default')[source]#
Construct an instance of the class, using first test parameter set.
- Parameters:
- parameter_setstr, default=”default”
Name of the set of test parameters to return, for use in tests. If no special parameters are defined for a value, will return “default” set.
- Returns:
- instanceinstance of the class with default parameters
- classmethod create_test_instances_and_names(parameter_set='default')[source]#
Create list of all test instances and a list of names for them.
- Parameters:
- parameter_setstr, default=”default”
Name of the set of test parameters to return, for use in tests. If no special parameters are defined for a value, will return “default” set.
- Returns:
- objslist of instances of cls
i-th instance is
cls(**cls.get_test_params()[i])- nameslist of str, same length as objs
i-th element is name of i-th instance of obj in tests. The naming convention is
{cls.__name__}-{i}if more than one instance, otherwise{cls.__name__}
- energy(x=None)[source]#
Energy of self, w.r.t. self or a constant frame x.
Let \(X, Y\) be i.i.d. random variables with the distribution of
self.If
xisNone, returns \(\mathbb{E}[|X-Y|]\) (per row), “self-energy”. Ifxis passed, returns \(\mathbb{E}[|X-x|]\) (per row), “energy wrt x”.The CRPS is related to energy: it holds that \(\mbox{CRPS}(\mbox{self}, y)\) = self.energy(y) - 0.5 * self.energy().
- Parameters:
- xNone or pd.DataFrame, optional, default=None
if
pd.DataFrame, must have same rows and columns asself
- Returns:
pd.DataFramewith same rows asself, single column"energy"each row contains one float, self-energy/energy as described above.
- classmethod get_class_tag(tag_name, tag_value_default=None)[source]#
Get class tag value from class, with tag level inheritance from parents.
Every
scikit-basecompatible object has a dictionary of tags. Tags may be used to store metadata about the object, or to control behaviour of the object.Tags are key-value pairs specific to an instance
self, they are static flags that are not changed after construction of the object.The
get_class_tagmethod is a class method, and retrieves the value of a tag taking into account only class-level tag values and overrides.It returns the value of the tag with name
tag_namefrom the object, taking into account tag overrides, in the following order of descending priority:Tags set in the
_tagsattribute of the class.Tags set in the
_tagsattribute of parent classes,
in order of inheritance.
Does not take into account dynamic tag overrides on instances, set via
set_tagsorclone_tags, that are defined on instances.To retrieve tag values with potential instance overrides, use the
get_tagmethod instead.- Parameters:
- tag_namestr
Name of tag value.
- tag_value_defaultany type
Default/fallback value if tag is not found.
- Returns:
- tag_value
Value of the
tag_nametag inself. If not found, returnstag_value_default.
- classmethod get_class_tags()[source]#
Get class tags from class, with tag level inheritance from parent classes.
Every
scikit-basecompatible object has a dictionary of tags. Tags may be used to store metadata about the object, or to control behaviour of the object.Tags are key-value pairs specific to an instance
self, they are static flags that are not changed after construction of the object.The
get_class_tagsmethod is a class method, and retrieves the value of a tag taking into account only class-level tag values and overrides.It returns a dictionary with keys being keys of any attribute of
_tagsset in the class or any of its parent classes.Values are the corresponding tag values, with overrides in the following order of descending priority:
Tags set in the
_tagsattribute of the class.Tags set in the
_tagsattribute of parent classes,
in order of inheritance.
Instances can override these tags depending on hyper-parameters.
To retrieve tags with potential instance overrides, use the
get_tagsmethod instead.Does not take into account dynamic tag overrides on instances, set via
set_tagsorclone_tags, that are defined on instances.For including overrides from dynamic tags, use
get_tags.- Returns:
- collected_tagsdict
Dictionary of tag name : tag value pairs. Collected from
_tagsclass attribute via nested inheritance. NOT overridden by dynamic tags set byset_tagsorclone_tags.
- get_config()[source]#
Get config flags for self.
Configs are key-value pairs of
self, typically used as transient flags for controlling behaviour.get_configreturns dynamic configs, which override the default configs.Default configs are set in the class attribute
_configof the class or its parent classes, and are overridden by dynamic configs set viaset_config.Configs are retained under
cloneorresetcalls.- Returns:
- config_dictdict
Dictionary of config name : config value pairs. Collected from _config class attribute via nested inheritance and then any overrides and new tags from _onfig_dynamic object attribute.
- classmethod get_param_defaults()[source]#
Get object’s parameter defaults.
- Returns:
- default_dict: dict[str, Any]
Keys are all parameters of
clsthat have a default defined in__init__. Values are the defaults, as defined in__init__.
- classmethod get_param_names(sort=True)[source]#
Get object’s parameter names.
- Parameters:
- sortbool, default=True
Whether to return the parameter names sorted in alphabetical order (True), or in the order they appear in the class
__init__(False).
- Returns:
- param_names: list[str]
List of parameter names of
cls. Ifsort=False, in same order as they appear in the class__init__. Ifsort=True, alphabetically ordered.
- get_params(deep=True)[source]#
Get a dict of parameters values for this object.
This expands on get_params of standard BaseObject by also retrieving components parameters when
deep=Truea component’s follows the named object API (either sequence of str, BaseObject tuples or dict[str, BaseObject]).- Parameters:
- deepbool, default=True
Whether to return parameters of components.
If True, will return a dict of parameter name : value for this object, including parameters of components.
If False, will return a dict of parameter name : value for this object, but not include parameters of components.
- Returns:
- dict[str, Any]
Dictionary of parameter name and value pairs. Includes direct parameters and indirect parameters whose values implement get_params or follow the named object API (either sequence of str, BaseObject tuples or dict[str, BaseObject]).
If
deep=Falsethe name-value pairs for this object’s direct parameters (you can see these via get_param_names) are returned.If
deep=Truethen the parameter name-value pairs are returned for direct and component (indirect) parameters.When a BaseObject’s direct parameter value implements get_params the component parameters are returned as [direct_param_name]__[component_param_name] for 1st level components. Arbitrary levels of component recursion are supported (if the component has parameter’s whose values are objects that implement get_params). In this case, return parameters follow [direct_param_name]__[component_param_name]__[param_name] format.
When a BaseObject’s direct parameter value is a sequence of (name, BaseObject) tuples or dict[str, BaseObject] the parameters name and value pairs of all component objects are returned. The parameter naming follows
scikit-learnconvention of treating named component objects like they are direct parameters; therefore, the names are assigned as [component_param_name]__[param_name].
- get_params_df()[source]#
Return distribution parameters in a dict of DataFrame.
Available only for simple parametric distributions, i.e., distributions with tag “distr:paramtype” having value “parametric”.
- Returns:
- dict of pd.DataFrame
Dictionary with all distribution parameters, as
pd.DataFrame. Keys are the parameter names, values are thepd.DataFrame. EachDataFramehas the same index asselfand columns asself. Entries are the values of the distribution parameters.
- get_tag(tag_name, tag_value_default=None, raise_error=True)[source]#
Get tag value from instance, with tag level inheritance and overrides.
Every
scikit-basecompatible object has a dictionary of tags. Tags may be used to store metadata about the object, or to control behaviour of the object.Tags are key-value pairs specific to an instance
self, they are static flags that are not changed after construction of the object.The
get_tagmethod retrieves the value of a single tag with nametag_namefrom the instance, taking into account tag overrides, in the following order of descending priority:Tags set via
set_tagsorclone_tagson the instance,
at construction of the instance.
Tags set in the
_tagsattribute of the class.Tags set in the
_tagsattribute of parent classes,
in order of inheritance.
- Parameters:
- tag_namestr
Name of tag to be retrieved
- tag_value_defaultany type, optional; default=None
Default/fallback value if tag is not found
- raise_errorbool
whether a
ValueErroris raised when the tag is not found
- Returns:
- tag_valueAny
Value of the
tag_nametag inself. If not found, raises an error ifraise_erroris True, otherwise it returnstag_value_default.
- Raises:
- ValueError, if
raise_errorisTrue. The
ValueErroris then raised iftag_nameis not inself.get_tags().keys().
- ValueError, if
- get_tags()[source]#
Get tags from instance, with tag level inheritance and overrides.
Every
scikit-basecompatible object has a dictionary of tags. Tags may be used to store metadata about the object, or to control behaviour of the object.Tags are key-value pairs specific to an instance
self, they are static flags that are not changed after construction of the object.The
get_tagsmethod returns a dictionary of tags, with keys being keys of any attribute of_tagsset in the class or any of its parent classes, or tags set viaset_tagsorclone_tags.Values are the corresponding tag values, with overrides in the following order of descending priority:
Tags set via
set_tagsorclone_tagson the instance,
at construction of the instance.
Tags set in the
_tagsattribute of the class.Tags set in the
_tagsattribute of parent classes,
in order of inheritance.
- Returns:
- collected_tagsdict
Dictionary of tag name : tag value pairs. Collected from
_tagsclass attribute via nested inheritance and then any overrides and new tags from_tags_dynamicobject attribute.
- classmethod get_test_params(parameter_set='default')[source]#
Return testing parameter settings for the estimator.
- haz(x)[source]#
Hazard function.
Let \(X\) be a random variables with the distribution of
self, taking values in(N, n)DataFrame-s Let \(x\in \mathbb{R}^{N\times n}\). By \(h_{X_{ij}}\), denote the marginal hazard of \(X\) at the \((i,j)\)-th entry, i.e., \(h_{X_{ij}}(t) = \frac{f_{X_{ij}}(t)}{S_{X_{ij}}(t)}\), where \(f_{X_{ij}}\) is the marginal pdf, and \(S_{X_{ij}}\) is the marginal survival function at the \((i,j)\)-th entry.The output of this method, for input
xrepresenting \(x\), is aDataFramewith same columns and indices asself, and entries \(h_{X_{ij}}(x_{ij})\).- Parameters:
- x
pandas.DataFrameor 2Dnp.ndarray representing \(x\), as above
- x
- Returns:
pd.DataFramewith same columns and index asselfcontaining \(h_{X_{ij}}(x_{ij})\), as above
- head(n=5)[source]#
Return the first n rows.
If there are less than n rows in
self, returns clone ofself.For negative n, returns all rows except the last n.
- Parameters:
- nint, default=5
Number of rows to return.
- Returns:
selfsubset to the first n rows, i.e.,self.iloc[0:min(n, len(self))]
- property iat[source]#
Integer location indexer, for single index.
Use
my_distribution.iat[index]forpandas-like row/column subsetting ofBaseDistributiondescendants.indexcan be anypandasiatcompatible index subsetter.my_distribution.iat[index]ormy_distribution.iat[row_index, col_index]subsetmy_distributionto the row selected byrow_index, col bycol_index, to exactly the same col/rows aspandasiatwould subset rows inmy_distribution.indexand columns inmy_distribution.columns.
- property iloc[source]#
Integer location indexer, for groups of indices.
Use
my_distribution.iloc[index]forpandas-like row/column subsetting ofBaseDistributiondescendants.indexcan be anypandasiloccompatible index subsetter.my_distribution.iloc[index]ormy_distribution.iloc[row_index, col_index]subsetmy_distributionto rows selected byrow_index, cols bycol_index, to exactly the same cols/rows aspandasilocwould subset rows inmy_distribution.indexand columns inmy_distribution.columns.
- is_composite()[source]#
Check if the object is composite.
A composite object is an object which contains objects as parameter values.
- Returns:
- bool
Whether self contains a parameter whose value is a BaseObject, list of (str, BaseObject) tuples or dict[str, BaseObject].
- property loc[source]#
Location indexer, for groups of indices.
Use
my_distribution.loc[index]forpandas-like row/column subsetting ofBaseDistributiondescendants.indexcan be anypandasiloccompatible index subsetter.my_distribution.loc[index]ormy_distribution.loc[row_index, col_index]subsetmy_distributionto rows selected byrow_index, cols bycol_index, to exactly the same cols/rows aspandaslocwould subset rows inmy_distribution.indexand columns inmy_distribution.columns.
- log_pdf(x)[source]#
Logarithmic probability density function.
Numerically more stable than calling pdf and then taking logartihms.
Let \(X\) be a random variables with the distribution of
self, taking values in (N, n)DataFrame-s Let \(x\in \mathbb{R}^{N\times n}\). By \(p_{X_{ij}}\), denote the marginal pdf of \(X\) at the \((i,j)\)-th entry.The output of this method, for input
xrepresenting \(x\), is aDataFramewith same columns and indices asself, and entries \(\log p_{X_{ij}}(x_{ij})\).If
selfhas a mixed or discrete distribution, this returns the weighted continuous part of self’s distribution instead of the pdf, i.e., the marginal pdf integrate to the weight of the continuous part.- Parameters:
- x
pandas.DataFrameor 2Dnp.ndarray representing \(x\), as above
- x
- Returns:
pd.DataFramewith same columns and index asselfcontaining \(\log p_{X_{ij}}(x_{ij})\), as above
- log_pmf(x)[source]#
Logarithmic probability mass function.
Numerically more stable than calling pmf and then taking logartihms.
Let \(X\) be a random variables with the distribution of
self, taking values in (N, n)DataFrame-s Let \(x\in \mathbb{R}^{N\times n}\). By \(m_{X_{ij}}\), denote the marginal pdf of \(X\) at the \((i,j)\)-th entry, i.e., \(m_{X_{ij}}(x_{ij}) = \mathbb{P}(X_{ij} = x_{ij})\).The output of this method, for input
xrepresenting \(x\), is aDataFramewith same columns and indices asself, and entries \(\log m_{X_{ij}}(x_{ij})\).If
selfhas a mixed or discrete distribution, this returns the weighted continuous part of self’s distribution instead of the pdf, i.e., the marginal pdf integrate to the weight of the continuous part.- Parameters:
- x
pandas.DataFrameor 2Dnp.ndarray representing \(x\), as above
- x
- Returns:
pd.DataFramewith same columns and index asselfcontaining \(\log m_{X_{ij}}(x_{ij})\), as above
- mean()[source]#
Return expected value of the distribution.
Let \(X\) be a random variable with the distribution of
self. Returns the expectation \(\mathbb{E}[X]\)- Returns:
pd.DataFramewith same rows, columns asselfexpected value of distribution (entry-wise)
- pdf(x)[source]#
Probability density function.
Let \(X\) be a random variables with the distribution of
self, taking values in(N, n)DataFrame-s Let \(x\in \mathbb{R}^{N\times n}\). By \(p_{X_{ij}}\), denote the marginal pdf of \(X\) at the \((i,j)\)-th entry.The output of this method, for input
xrepresenting \(x\), is aDataFramewith same columns and indices asself, and entries \(p_{X_{ij}}(x_{ij})\).If
selfhas a mixed or discrete distribution, this returns the weighted continuous part of self’s distribution instead of the pdf, i.e., the marginal pdf integrate to the weight of the continuous part.- Parameters:
- x
pandas.DataFrameor 2Dnp.ndarray representing \(x\), as above
- x
- Returns:
pd.DataFramewith same columns and index asselfcontaining \(p_{X_{ij}}(x_{ij})\), as above
- pdfnorm(a=2)[source]#
a-norm of pdf, defaults to 2-norm.
computes a-norm of the entry marginal pdf, i.e., \(\mathbb{E}[p_X(X)^{a-1}] = \int p(x)^a dx\), where \(X\) is a random variable distributed according to the entry marginal of self, and \(p_X\) is its pdf
- Parameters:
- a: int or float, optional, default=2
- Returns:
- pd.DataFrame with same rows and columns as self
- each entry is \(\mathbb{E}[p_X(X)^{a-1}] = \int p(x)^a dx\), see above
- plot(fun=None, ax=None, **kwargs)[source]#
Plot the distribution.
Different distribution defining functions can be selected for plotting via the
funparameter. The functions available are the same as the methods of the distribution class, e.g.,"pdf","cdf","ppf".For array distribution, the marginal distribution at each entry is plotted, as a separate subplot.
- Parameters:
- funstr, optional, default=”pdf” for continuous distributions, otherwise “cdf”
the function to plot, one of “pdf”, “cdf”, “ppf”
- axmatplotlib Axes object, optional
matplotlib Axes to plot in if not provided, defaults to current axes (
plot.gca)- kwargskeyword arguments
passed to the plotting function
- Returns:
- figmatplotlib.Figure, only returned if self is array distribution
matplotlig Figure object for subplots
- axmatplotlib.Axes
the axis or axes on which the plot is drawn
- pmf(x)[source]#
Probability mass function.
Let \(X\) be a random variables with the distribution of
self, taking values in(N, n)DataFrame-s Let \(x\in \mathbb{R}^{N\times n}\). By \(m_{X_{ij}}\), denote the marginal mass of \(X\) at the \((i,j)\)-th entry, i.e., \(m_{X_{ij}}(x_{ij}) = \mathbb{P}(X_{ij} = x_{ij})\).The output of this method, for input
xrepresenting \(x\), is aDataFramewith same columns and indices asself, and entries \(m_{X_{ij}}(x_{ij})\).If
selfhas a mixed or discrete distribution, this returns the weighted continuous part of self’s distribution instead of the pdf, i.e., the marginal pdf integrate to the weight of the continuous part.- Parameters:
- x
pandas.DataFrameor 2Dnp.ndarray representing \(x\), as above
- x
- Returns:
pd.DataFramewith same columns and index asselfcontaining \(p_{X_{ij}}(x_{ij})\), as above
- ppf(p)[source]#
Quantile function = percent point function = inverse cdf.
Let \(X\) be a random variables with the distribution of
self, taking values in(N, n)DataFrame-s Let \(x\in \mathbb{R}^{N\times n}\). By \(F_{X_{ij}}\), denote the marginal cdf of \(X\) at the \((i,j)\)-th entry.The output of this method, for input
prepresenting \(p\), is aDataFramewith same columns and indices asself, and entries \(F^{-1}_{X_{ij}}(p_{ij})\).- Parameters:
- p
pandas.DataFrameor 2D np.ndarray representing \(p\), as above
- p
- Returns:
pd.DataFramewith same columns and index asselfcontaining \(F_{X_{ij}}(x_{ij})\), as above
- quantile(alpha)[source]#
Return entry-wise quantiles, in Proba/pred_quantiles mtype format.
This method broadcasts as follows: for a scalar alpha, computes the alpha-quantile entry-wise, and returns as a pd.DataFrame with same index, and columns as in return. If alpha is iterable, multiple quantiles will be calculated, and the result will be concatenated column-wise (axis=1).
The ppf method also computes quantiles, but broadcasts differently, in numpy style closer to tensorflow. In contrast, this quantile method broadcasts as
sktimeforecaster predict_quantiles, i.e., columns first.- Parameters:
- alphafloat or list of float of unique values
A probability or list of, at which quantiles are computed.
- Returns:
- quantilespd.DataFrame
Column has multi-index: first level is variable name from self.columns, second level being the values of alpha passed to the function. Row index is self.index. Entries in the i-th row, (j, p)-the column is the p-th quantile of the marginal of self at index (i, j).
- reset()[source]#
Reset the object to a clean post-init state.
Results in setting
selfto the state it had directly after the constructor call, with the same hyper-parameters. Config values set byset_configare also retained.A
resetcall deletes any object attributes, except:hyper-parameters = arguments of
__init__written toself, e.g.,self.paramnamewhereparamnameis an argument of__init__object attributes containing double-underscores, i.e., the string “__”. For instance, an attribute named “__myattr” is retained.
config attributes, configs are retained without change. That is, results of
get_configbefore and afterresetare equal.
Class and object methods, and class attributes are also unaffected.
Equivalent to
clone, with the exception thatresetmutatesselfinstead of returning a new object.After a
self.reset()call,selfis equal in value and state, to the object obtained after a constructor call``type(self)(**self.get_params(deep=False))``.- Returns:
- self
Instance of class reset to a clean post-init state but retaining the current hyper-parameter values.
- sample(n_samples=None)[source]#
Sample from the distribution.
- Parameters:
- n_samplesint, optional, default = None
number of samples to draw from the distribution
- Returns:
- pd.DataFrame
samples from the distribution
if
n_samplesisNone:
returns a sample that contains a single sample from
self, inpd.DataFramemtype format convention, withindexandcolumnsasself* if n_samples isint: returns apd.DataFramethat containsn_samplesi.i.d. samples fromself, inpd-multiindexmtype format convention, with samecolumnsasself, and rowMultiIndexthat is product ofRangeIndex(n_samples)andself.index
- set_config(**config_dict)[source]#
Set config flags to given values.
Configs are key-value pairs of
self, typically used as transient flags for controlling behaviour.set_configsets dynamic configs, which override the default configs.Default configs are set in the class attribute
_configof the class or its parent classes, and are overridden by dynamic configs set viaset_config.Configs are retained under
cloneorresetcalls.- Parameters:
- config_dictdict
Dictionary of config name : config value pairs.
- Returns:
- selfreference to self.
Notes
Changes object state, copies configs in config_dict to self._config_dynamic.
- set_params(**kwargs)[source]#
Set the object’s direct parameters and the parameters of components.
Valid parameter keys can be listed with
get_params().Like BaseObject implementation it allows values of indirect parameters of a component to be set when a parameter’s value is an object that implements set_params. This also also expands the functionality to allow parameter to allow the indirect parameters of components to be set when a parameter’s values follow the named object API (either sequence of str, BaseObject tuples or dict[str, BaseObject]).
- Returns:
- Self
Instance of self.
- set_random_state(random_state=None, deep=True, self_policy='copy')[source]#
Set random_state pseudo-random seed parameters for self.
Finds
random_statenamed parameters viaself.get_params, and sets them to integers derived fromrandom_stateviaset_params. These integers are sampled from chain hashing viasample_dependent_seed, and guarantee pseudo-random independence of seeded random generators.Applies to
random_stateparameters inself, depending onself_policy, and remaining component objects if and only ifdeep=True.Note: calls
set_paramseven ifselfdoes not have arandom_state, or none of the components have arandom_stateparameter. Therefore,set_random_statewill reset anyscikit-baseobject, even those without arandom_stateparameter.- Parameters:
- random_stateint, RandomState instance or None, default=None
Pseudo-random number generator to control the generation of the random integers. Pass int for reproducible output across multiple function calls.
- deepbool, default=True
Whether to set the random state in skbase object valued parameters, i.e., component estimators.
If False, will set only
self’srandom_stateparameter, if exists.If True, will set
random_stateparameters in component objects as well.
- self_policystr, one of {“copy”, “keep”, “new”}, default=”copy”
“copy” :
self.random_stateis set to inputrandom_state“keep” :
self.random_stateis kept as is“new” :
self.random_stateis set to a new random state,
derived from input
random_state, and in general different from it
- Returns:
- selfreference to self
- set_tags(**tag_dict)[source]#
Set instance level tag overrides to given values.
Every
scikit-basecompatible object has a dictionary of tags. Tags may be used to store metadata about the object, or to control behaviour of the object.Tags are key-value pairs specific to an instance
self, they are static flags that are not changed after construction of the object.set_tagssets dynamic tag overrides to the values as specified intag_dict, with keys being the tag name, and dict values being the value to set the tag to.The
set_tagsmethod should be called only in the__init__method of an object, during construction, or directly after construction via__init__.Current tag values can be inspected by
get_tagsorget_tag.- Parameters:
- **tag_dictdict
Dictionary of tag name: tag value pairs.
- Returns:
- Self
Reference to self.
- surv(x)[source]#
Survival function.
Let \(X\) be a random variables with the distribution of
self, taking values in(N, n)DataFrame-s Let \(x\in \mathbb{R}^{N\times n}\). By \(S_{X_{ij}}\), denote the marginal survival of \(X\) at the \((i,j)\)-th entry, i.e., \(S_{X_{ij}}(t) = \mathbb{P}(X_{ij} \gneq t)\).The output of this method, for input
xrepresenting \(x\), is aDataFramewith same columns and indices asself, and entries \(F_{X_{ij}}(x_{ij})\).- Parameters:
- x
pandas.DataFrameor 2Dnp.ndarray representing \(x\), as above
- x
- Returns:
pd.DataFramewith same columns and index asselfcontaining \(S_{X_{ij}}(x_{ij})\), as above
- tail(n=5)[source]#
Return the last n rows.
If there are less than n rows in
self, returns clone ofself.For negative n, returns all rows except the first n.
- Parameters:
- nint, default=5
Number of rows to return.
- Returns:
selfsubset to the last n rows, i.e.,self.iloc[max(len(self) - n, 0):]
- to_df()[source]#
Return distribution parameters as a single DataFrame.
Available only for simple parametric distributions, i.e., distributions with tag “distr:paramtype” having value “parametric”.
- Returns:
- pd.DataFrame
DataFrame with all distribution parameters. column is a MultiIndex (paramname, varname). row index is the index of the distribution. Entries are the values of the distribution parameters.
- var()[source]#
Return element/entry-wise variance of the distribution.
Let \(X\) be a random variable with the distribution of
self. Returns \(\mathbb{V}[X] = \mathbb{E}\left(X - \mathbb{E}[X]\right)^2\), where the square is element-wise.- Returns:
pd.DataFramewith same rows, columns asselfvariance of distribution (entry-wise)