ConvergenceWarning: lbfgs failed to converge (status=1): STOP: TOTAL NO. of ITERATIONS REACHED LIMIT


Asked by Anonymous, May 4, 2023


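I have a dataset with both numeric and categorical data, and I want to predict adverse outcomes from patients' medical characteristics. I defined a prediction pipeline for the dataset as follows: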

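from sklearn.compose import ColumnTransformer, make_column_selector as selector
from sklearn.impute import KNNImputer, SimpleImputer
from sklearn.linear_model import LogisticRegression
from sklearn.model_selection import train_test_split
from sklearn.pipeline import Pipeline
from sklearn.preprocessing import OneHotEncoder, StandardScaler

# `dataset` is a pandas DataFrame loaded earlier; it contains a 'target' column
X = dataset.drop(columns=['target'])
y = dataset['target']
# Define the numeric and categorical transformers
numeric_transformer = Pipeline(steps=[
    ('knnImputer', KNNImputer(n_neighbors=2, weights="uniform")),
    ('scaler', StandardScaler())])
categorical_transformer = Pipeline(steps=[
    ('imputer', SimpleImputer(strategy='constant', fill_value='missing')),
    ('onehot', OneHotEncoder(handle_unknown='ignore'))])
# Send object columns to the categorical transformer and all other columns to the numeric one
preprocessor = ColumnTransformer(transformers=[
    ('num', numeric_transformer, selector(dtype_exclude="object")),
    ('cat', categorical_transformer, selector(dtype_include="object"))
])
# Append the classifier to the preprocessing pipeline
# Now we have a full prediction pipeline
clf = Pipeline(steps=[('preprocessor', preprocessor),
                      ('classifier', LogisticRegression())])
X_train, X_test, y_train, y_test = train_test_split(X, y, test_size=0.2)
clf.fit(X_train, y_train)
print("Model score: %.3f" % clf.score(X_test, y_test))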

However, when I run this code, I get the following warning message:

ConvergenceWarning: lbfgs failed to converge (status=1):
STOP: TOTAL NO. of ITERATIONS REACHED LIMIT.
Increase the number of iterations (max_iter) or scale the data as shown in:
    https://scikit-learn.org/stable/modules/preprocessing.html
Please also refer to the documentation for alternative solver options:
    https://scikit-learn.org/stable/modules/linear_model.html#logistic-regression
  extra_warning_msg=_LOGISTIC_SOLVER_CONVERGENCE_MSG)
    Model score: 0.988

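Can someone explain what this warning means? I am new to machine learning and a bit lost about how to improve the prediction model. As you can see from numeric_transformer, I scaled the data through standardization. I am also confused about why the model score is so high, and whether that is a good or a bad thing.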
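For anyone hitting the same message, here is a minimal sketch of the two checks the question points at. It is not the asker's pipeline: it uses synthetic, imbalanced data from make_classification as a stand-in (an assumption, since the real dataset is not shown), and the value max_iter=1000 is an arbitrary illustrative choice. LogisticRegression uses the lbfgs solver with max_iter=100 by default, and the warning says the solver hit that iteration cap before meeting its stopping tolerance. The sketch raises the cap, checks whether the warning still fires, and compares accuracy against a majority-class baseline, which is one way to judge whether a high score is informative.

```python
import warnings

from sklearn.datasets import make_classification
from sklearn.dummy import DummyClassifier
from sklearn.exceptions import ConvergenceWarning
from sklearn.linear_model import LogisticRegression
from sklearn.model_selection import train_test_split
from sklearn.pipeline import Pipeline
from sklearn.preprocessing import StandardScaler

# Synthetic, imbalanced stand-in for the real dataset (an assumption).
X, y = make_classification(
    n_samples=2000, n_features=20, weights=[0.95, 0.05], random_state=0
)
X_train, X_test, y_train, y_test = train_test_split(
    X, y, test_size=0.2, stratify=y, random_state=0
)

# Same idea as the question's pipeline, but with a larger iteration budget.
clf = Pipeline(steps=[
    ("scaler", StandardScaler()),
    ("classifier", LogisticRegression(max_iter=1000)),  # default is 100
])

# Record warnings so we can tell whether the solver still failed to converge.
with warnings.catch_warnings(record=True) as caught:
    warnings.simplefilter("always")
    clf.fit(X_train, y_train)

converged = not any(issubclass(w.category, ConvergenceWarning) for w in caught)
n_iter = int(clf.named_steps["classifier"].n_iter_.max())
print("converged:", converged, "| iterations used:", n_iter)

# Baseline that always predicts the most frequent class in the training set.
baseline = DummyClassifier(strategy="most_frequent").fit(X_train, y_train)
baseline_score = baseline.score(X_test, y_test)
model_score = clf.score(X_test, y_test)
print("baseline accuracy: %.3f | model accuracy: %.3f" % (baseline_score, model_score))
```

If the baseline accuracy is already close to the model's, the classes are probably imbalanced and accuracy alone says little; in that case metrics such as recall or precision on the minority class may be more telling. If the warning persists even with a larger max_iter, the documentation linked in the warning lists alternative solvers.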
