spateo.tdr.models.models_backbone.backbone_methods

Module Contents

Classes

NLPCA

This is a global solver for principal curves that uses neural networks.

Functions

orth_dist(y_true, y_pred)

Loss function for the NLPCA NN. Returns the sum of the orthogonal distance from the output tensor to the real tensor.

ElPiGraph_method(→ Tuple[numpy.ndarray, numpy.ndarray])

Generate a principal elastic tree.

SimplePPT_method(→ Tuple[numpy.ndarray, numpy.ndarray])

Generate a simple principal tree.

PrinCurve_method(→ Tuple[numpy.ndarray, numpy.ndarray])

This is the global module that contains principal curve and nonlinear principal component analysis algorithms that work to optimize a line over an entire dataset.

spateo.tdr.models.models_backbone.backbone_methods.orth_dist(y_true, y_pred)

Loss function for the NLPCA NN. Returns the sum of the orthogonal distance from the output tensor to the real tensor.
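The quantity described above can be illustrated with a small NumPy stand-in. This is a sketch only: `orth_dist_np` is a hypothetical name, and it assumes that "orthogonal distance" means the row-wise Euclidean distance between matching points. The library function itself operates on tensors during Keras training.

```python
import numpy as np

def orth_dist_np(y_true, y_pred):
    """Sum, over all points, of the Euclidean distance between matching rows.

    Assumption: rows are points and columns are dimensions.
    """
    y_true = np.asarray(y_true, dtype=float)
    y_pred = np.asarray(y_pred, dtype=float)
    # Per-point distance between the target row and the predicted row.
    per_point = np.sqrt(np.sum((y_true - y_pred) ** 2, axis=1))
    return float(per_point.sum())

y_true = np.array([[0.0, 0.0], [3.0, 4.0]])
y_pred = np.array([[0.0, 1.0], [0.0, 0.0]])
total = orth_dist_np(y_true, y_pred)  # distances 1.0 and 5.0 are summed
```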

class spateo.tdr.models.models_backbone.backbone_methods.NLPCA

Bases: object

This is a global solver for principal curves that uses neural networks.

fit(data: numpy.ndarray, epochs: int = 500, nodes: int = 25, lr: float = 0.01, verbose: int = 0)

Creates a model and fits it to the given data of shape (m, n).

Parameters:
data

A numpy array of shape (m,n), where m is the number of points and n is the number of dimensions.

epochs

Number of epochs to train the neural network. Defaults to 500.

nodes

Number of nodes for the construction layers. Defaults to 25. The more complex the curve, the higher this number should be.

lr

Learning rate for backpropagation. Defaults to 0.01.

verbose

Setting verbose to 0 mutes the training output from Keras. Defaults to 0.

project(data: numpy.ndarray) → Tuple[numpy.ndarray, numpy.ndarray]

Projects the points onto the curve generated by the fit function. Returns the projection index of each point in the original data and a copy of the original data sorted by that index.

Parameters:
data

An m x n array to project onto the curve.

Returns:

proj: A one-dimensional array that contains the projection index for each point in data.

all_sorted: An m x (n+1) array that contains data sorted by its projection index, along with the index.

Return type:

Tuple[numpy.ndarray, numpy.ndarray]
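The relationship between the two return values can be sketched in NumPy. This is an illustration under assumptions, not the library's code: `sort_by_projection` is a hypothetical helper, the projection indices are made-up values, and placing the index in the last column is an assumption (the description only says the index is included).

```python
import numpy as np

def sort_by_projection(data, proj):
    """Attach each point's projection index and sort the rows by it."""
    data = np.asarray(data, dtype=float)
    proj = np.asarray(proj, dtype=float).reshape(-1)
    # Assumption: the index is appended as the last column, giving (m, n + 1).
    stacked = np.column_stack([data, proj])
    order = np.argsort(proj, kind="stable")
    return stacked[order]

data = np.array([[2.0, 2.0], [0.0, 0.0], [1.0, 1.0]])
proj = np.array([0.9, 0.1, 0.5])  # made-up projection indices
all_sorted = sort_by_projection(data, proj)
```

After sorting, the rows follow the curve from its start to its end, which is the ordering a downstream step would need to trace the backbone.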

create_model(num_dim: int, nodes: int, lr: float)

Creates a TensorFlow (Keras) model.

Parameters:
num_dim

Number of dimensions of the input space.

nodes

Number of nodes for the construction layers.

lr

Learning rate for backpropagation.

Returns:

model: A Keras model.

Return type:

object

spateo.tdr.models.models_backbone.backbone_methods.ElPiGraph_method(X: numpy.ndarray, NumNodes: int = 50, topology: Literal['tree', 'circle', 'curve'] = 'curve', Lambda: float = 0.01, Mu: float = 0.1, alpha: float = 0.0, FinalEnergy: Literal['Base', 'Penalized'] = 'Penalized', **kwargs) → Tuple[numpy.ndarray, numpy.ndarray]

Generate a principal elastic tree. Reference: Albergante et al. (2020), Robust and Scalable Learning of Complex Intrinsic Dataset Geometry via ElPiGraph.

Parameters:
X

DxN, data matrix list.

NumNodes

The number of nodes of the principal graph. Use a value between 10 and 100 for the ElPiGraph approach.

topology

The appropriate topology used to fit a principal graph for each dataset.

Lambda

The attractive strength of edges between nodes (constrains edge lengths)

Mu

The repulsive strength of a node’s neighboring nodes (constrains angles to be close to harmonic)

alpha

Branching penalty (penalizes number of branches for the principal tree)

FinalEnergy

The final elastic energy associated with the configuration. Currently it can be “Base” or “Penalized”.

**kwargs

Other parameters used in elpigraph.computeElasticPrincipalTree. For details, please see: https://elpigraph-python.readthedocs.io/en/latest/basics.html

Returns:

nodes: The nodes in the principal tree.

edges: The edges between nodes in the principal tree.

Return type:

Tuple[numpy.ndarray, numpy.ndarray]
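To make the roles of Lambda and Mu concrete, the sketch below computes the two regularisation terms for a simple chain of nodes (the 'curve' topology). It is a simplified illustration based on the general form of the elastic energy in the ElPiGraph reference, not the elpigraph package's code: `chain_elastic_penalties` is a hypothetical name, the data-fit term is omitted, and alpha is left out on the assumption that a chain has no branching nodes to penalise.

```python
import numpy as np

def chain_elastic_penalties(nodes, Lambda=0.01, Mu=0.1):
    """Stretching and bending penalties for nodes connected as a chain."""
    nodes = np.asarray(nodes, dtype=float)
    # Stretching: Lambda times the sum of squared edge lengths.
    edge_vectors = nodes[1:] - nodes[:-1]
    stretch = Lambda * np.sum(edge_vectors ** 2)
    # Bending: Mu times the squared deviation of each interior node from
    # the mean of its two neighbours (zero when the chain is straight
    # and evenly spaced, i.e. "harmonic").
    deviation = nodes[1:-1] - 0.5 * (nodes[:-2] + nodes[2:])
    bend = Mu * np.sum(deviation ** 2)
    return float(stretch), float(bend)

straight = np.array([[0.0, 0.0], [1.0, 0.0], [2.0, 0.0]])
kinked = np.array([[0.0, 0.0], [1.0, 1.0], [2.0, 0.0]])
s_stretch, s_bend = chain_elastic_penalties(straight)
k_stretch, k_bend = chain_elastic_penalties(kinked)
```

In this toy case the kinked chain pays both a larger stretching penalty (longer edges) and a non-zero bending penalty, while the straight chain has no bending penalty at all. Raising Lambda therefore favours shorter graphs, and raising Mu favours smoother ones.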

spateo.tdr.models.models_backbone.backbone_methods.SimplePPT_method(X: numpy.ndarray, NumNodes: int = 50, sigma: float | int | None = 0.1, lam: float | int | None = 1, metric: str = 'euclidean', nsteps: int = 50, err_cut: float = 0.005, seed: int | None = 1, **kwargs) → Tuple[numpy.ndarray, numpy.ndarray]

Generate a simple principal tree. Reference: Mao et al. (2015), SimplePPT: A simple principal tree algorithm, SIAM International Conference on Data Mining.

Parameters:
X

DxN, data matrix list.

NumNodes

The number of nodes of the principal graph. Use a value between 100 and 2000 for the PPT approach.

sigma

Regularization parameter.

lam

Penalty for the tree length.

metric

The metric to use to compute distances in high dimensional space. For compatible metrics, check the documentation of sklearn.metrics.pairwise_distances.

nsteps

Number of steps for the optimisation process.

err_cut

Stop the algorithm if the proximity of principal points between iterations is less than this value.

seed

A numpy random seed.

**kwargs

Other parameters used in simpleppt.ppt. For details, please see: https://github.com/LouisFaure/simpleppt/blob/main/simpleppt/ppt.py

Returns:

nodes: The nodes in the principal tree.

edges: The edges between nodes in the principal tree.

Return type:

Tuple[numpy.ndarray, numpy.ndarray]
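The interplay between nsteps and err_cut can be sketched as a generic iterate-until-stable loop. This is an illustration only: `iterate_until_stable` and the `halfway` update are hypothetical, and measuring proximity as the largest node displacement between iterations is an assumption. The simpleppt package may use a different measure, such as a relative change.

```python
import numpy as np

def iterate_until_stable(nodes, update, nsteps=50, err_cut=0.005):
    """Apply `update` at most `nsteps` times, stopping early once the
    largest node displacement between iterations drops below `err_cut`."""
    nodes = np.asarray(nodes, dtype=float)
    steps_used = 0
    for steps_used in range(1, nsteps + 1):
        new_nodes = update(nodes)
        # Largest distance any single node moved in this iteration.
        shift = np.max(np.linalg.norm(new_nodes - nodes, axis=1))
        nodes = new_nodes
        if shift < err_cut:
            break
    return nodes, steps_used

# Toy update: every node moves halfway towards a fixed target position.
target = np.array([[1.0, 1.0], [2.0, 0.0]])
start = np.zeros_like(target)

def halfway(current):
    return current + 0.5 * (target - current)

final_nodes, steps_used = iterate_until_stable(start, halfway)
```

With this toy update the loop stops well before the nsteps limit, because the displacement halves on every iteration. A smaller err_cut gives a tighter fit at the cost of more iterations.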

spateo.tdr.models.models_backbone.backbone_methods.PrinCurve_method(X: numpy.ndarray, NumNodes: int = 50, epochs: int = 500, lr: float = 0.01, scale_factor: int | float = 1, **kwargs) → Tuple[numpy.ndarray, numpy.ndarray]

This is the global module that contains principal curve and nonlinear principal component analysis algorithms that work to optimize a line over an entire dataset. Reference: Chen et al. (2016), Constraint local principal curve: Concept, algorithms and applications.

Parameters:
X

DxN, data matrix list.

NumNodes

Number of nodes for the construction layers. Defaults to 50. The more complex the curve, the higher this number should be.

epochs

Number of epochs to train the neural network. Defaults to 500.

lr

Learning rate for backpropagation. Defaults to 0.01.

scale_factor

**kwargs

Other parameters used in global algorithms. For details, please see: https://github.com/artusoma/prinPy/blob/master/prinpy/glob.py

Returns:

nodes: The nodes in the principal tree.

edges: The edges between nodes in the principal tree.

Return type:

Tuple[numpy.ndarray, numpy.ndarray]