ot.datasets
Simple example datasets
Functions
- ot.datasets.make_1D_gauss(n, m, s)[source]
return a 1D histogram for a gaussian distribution (n bins, mean m and std s)
Examples using ot.datasets.make_1D_gauss
data:image/s3,"s3://crabby-images/e5574/e55740a35313406e6d149528f05721b655d9e4c0" alt=""
1D Wasserstein barycenter: exact LP vs entropic regularization
data:image/s3,"s3://crabby-images/ca919/ca919fc7298ef03dc4ded6362520f00f205429d9" alt=""
1D Wasserstein barycenter demo for Unbalanced distributions
- ot.datasets.make_2D_samples_gauss(n, m, sigma, random_state=None)[source]
Return n samples drawn from 2D gaussian \(\mathcal{N}(m, \sigma)\)
- Parameters:
n (int) – number of samples to make
m (ndarray, shape (2,)) – mean value of the gaussian distribution
sigma (ndarray, shape (2, 2)) – covariance matrix of the gaussian distribution
random_state (int, RandomState instance or None, optional (default=None)) – If int, random_state is the seed used by the random number generator; If RandomState instance, random_state is the random number generator; If None, the random number generator is the RandomState instance used by np.random.
- Returns:
X – n samples drawn from \(\mathcal{N}(m, \sigma)\).
- Return type:
ndarray, shape (n, 2)
Examples using ot.datasets.make_2D_samples_gauss
data:image/s3,"s3://crabby-images/77350/773502522226a2d852ce136d044ee8446901ee57" alt=""
Optimal Transport between 2D empirical distributions
data:image/s3,"s3://crabby-images/5f500/5f5006fdfb3ac8b47eaca14a2969e101b0681f27" alt=""
Translation Invariant Sinkhorn for Unbalanced Optimal Transport
data:image/s3,"s3://crabby-images/5cbfe/5cbfe03d2eb765b2c4911ee1948503c9d47239b7" alt=""
2D examples of exact and entropic unbalanced optimal transport
data:image/s3,"s3://crabby-images/7c88c/7c88c9d634bdc8bd6cc7881372b5ca0fb729146f" alt=""
Partial Wasserstein and Gromov-Wasserstein example
data:image/s3,"s3://crabby-images/585c3/585c325e1b3af9c4486c45b81265d84c82ec6e32" alt=""
Regularization path of l2-penalized unbalanced optimal transport
- ot.datasets.make_data_classif(dataset, n, nz=0.5, theta=0, p=0.5, random_state=None, **kwargs)[source]
Dataset generation for classification problems
- Parameters:
dataset (str) – type of classification problem (see code)
n (int) – number of training samples
nz (float) – noise level (>0)
p (float) – proportion of one class in the binary setting
random_state (int, RandomState instance or None, optional (default=None)) – If int, random_state is the seed used by the random number generator; If RandomState instance, random_state is the random number generator; If None, the random number generator is the RandomState instance used by np.random.
- Returns:
X (ndarray, shape (n, d)) – n observation of size d
y (ndarray, shape (n,)) – labels of the samples.
Examples using ot.datasets.make_data_classif
data:image/s3,"s3://crabby-images/3e512/3e5126908bf09e6562a71d8096b8600e01dccbb6" alt=""
Dual OT solvers for entropic and quadratic regularized OT with Pytorch
data:image/s3,"s3://crabby-images/0ba1a/0ba1af434907f7dc7b7498c022cbcc60913b02b3" alt=""
OT with Laplacian regularization for domain adaptation
data:image/s3,"s3://crabby-images/d8e4d/d8e4d2bd873de83a55141cc4d50ae8533c03524c" alt=""
OT for domain adaptation on empirical distributions
- ot.datasets.make_1D_gauss(n, m, s)[source]
return a 1D histogram for a gaussian distribution (n bins, mean m and std s)
- ot.datasets.make_2D_samples_gauss(n, m, sigma, random_state=None)[source]
Return n samples drawn from 2D gaussian \(\mathcal{N}(m, \sigma)\)
- Parameters:
n (int) – number of samples to make
m (ndarray, shape (2,)) – mean value of the gaussian distribution
sigma (ndarray, shape (2, 2)) – covariance matrix of the gaussian distribution
random_state (int, RandomState instance or None, optional (default=None)) – If int, random_state is the seed used by the random number generator; If RandomState instance, random_state is the random number generator; If None, the random number generator is the RandomState instance used by np.random.
- Returns:
X – n samples drawn from \(\mathcal{N}(m, \sigma)\).
- Return type:
ndarray, shape (n, 2)
- ot.datasets.make_data_classif(dataset, n, nz=0.5, theta=0, p=0.5, random_state=None, **kwargs)[source]
Dataset generation for classification problems
- Parameters:
dataset (str) – type of classification problem (see code)
n (int) – number of training samples
nz (float) – noise level (>0)
p (float) – proportion of one class in the binary setting
random_state (int, RandomState instance or None, optional (default=None)) – If int, random_state is the seed used by the random number generator; If RandomState instance, random_state is the random number generator; If None, the random number generator is the RandomState instance used by np.random.
- Returns:
X (ndarray, shape (n, d)) – n observation of size d
y (ndarray, shape (n,)) – labels of the samples.