orthogonal_mp#

sklearn.linear_model.orthogonal_mp(X, y, *, n_nonzero_coefs=None, tol=None, precompute=False, copy_X=True, return_path=False, return_n_iter=False)#

正交匹配追踪 (OMP)。

解决 n_targets 个正交匹配追踪问题。一个问题的实例形式如下：

当通过非零系数的数量参数化时，使用 n_nonzero_coefs ： argmin ||y - Xgamma||^2 subject to ||gamma||_0 <= n_{nonzero coefs}

当通过误差参数化时，使用参数 tol ： argmin ||gamma||_0 subject to ||y - Xgamma||^2 <= tol

更多信息请参阅用户指南。

Parameters:

X形状为 (n_samples, n_features) 的类数组: 输入数据。假设列具有单位范数。
y形状为 (n_samples,) 或 (n_samples, n_targets) 的 ndarray: 输入目标。
n_nonzero_coefsint, 默认=None: 解决方案中期望的非零条目数量。如果为 None（默认），此值设置为 n_features 的 10%。
tolfloat, 默认=None: 残差的平方范数最大值。如果非 None，则覆盖 n_nonzero_coefs。
precompute‘auto’ 或 bool, 默认=False: 是否进行预计算。当 n_targets 或 n_samples 非常大时，可以提高性能。
copy_Xbool, 默认=True: 算法是否必须复制设计矩阵 X。只有在 X 已经是 Fortran 顺序时，false 值才有帮助，否则仍然会进行复制。
return_pathbool, 默认=False: 是否返回沿前向路径的非零系数的每个值。用于交叉验证。
return_n_iterbool, 默认=False: 是否返回迭代次数。

Returns:

coef形状为 (n_features,) 或 (n_features, n_targets) 的 ndarray: OMP 解决方案的系数。如果 return_path=True ，这包含整个系数路径。在这种情况下，其形状为 (n_features, n_features) 或 (n_features, n_targets, n_features)，并且遍历最后一个轴会生成按活动特征增加顺序的系数。
n_iters类数组或 int: 每个目标的活动特征数量。仅当 return_n_iter 设置为 True 时返回。

See also

OrthogonalMatchingPursuit: 正交匹配追踪模型。
orthogonal_mp_gram: 使用 Gram 矩阵和乘积 X.T * y 解决 OMP 问题。
lars_path: 使用 LARS 算法计算最小角回归或 Lasso 路径。
sklearn.decomposition.sparse_encode: 稀疏编码。

Notes

正交匹配追踪由 S. Mallat, Z. Zhang 提出， Matching pursuits with time-frequency dictionaries, IEEE Transactions on Signal Processing, Vol. 41, No. 12. (December 1993), pp. 3397-3415. (https://www.di.ens.fr/~mallat/papiers/MallatPursuit93.pdf)

此实现基于 Rubinstein, R., Zibulevsky, M. 和 Elad, M., Efficient Implementation of the K-SVD Algorithm using Batch Orthogonal Matching Pursuit Technical Report - CS Technion, April 2008. https://www.cs.technion.ac.il/~ronrubin/Publications/KSVD-OMP-v2.pdf

Examples

>>> from sklearn.datasets import make_regression
>>> from sklearn.linear_model import orthogonal_mp
>>> X, y = make_regression(noise=4, random_state=0)
>>> coef = orthogonal_mp(X, y)
>>> coef.shape
(100,)
>>> X[:1,] @ coef
array([-78.68...])