sklearn.metrics.f1_score

sklearn.metrics.f1_score(y_true, y_pred, *, labels=None, pos_label=1, average='binary', sample_weight=None, zero_division='warn')

Compute the F1 score, also known as balanced F-score or F-measure.

The F1 score can be interpreted as a harmonic mean of the precision and recall, where an F1 score reaches its best value at 1 and worst score at 0. The relative contributions of precision and recall to the F1 score are equal. The formula for the F1 score is:

\[\text{F1} = \frac{2 * \text{TP}}{2 * \text{TP} + \text{FP} + \text{FN}}\]

Where \(\text{TP}\) is the number of true positives, \(\text{FN}\) is the number of false negatives, and \(\text{FP}\) is the number of false positives. F1 is by default calculated as 0.0 when there are no true positives, false negatives, or false positives.
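As a sanity check, the formula can be reproduced directly from confusion-matrix counts. A minimal sketch (the toy labels are made up for illustration):

import numpy as np
from sklearn.metrics import confusion_matrix, f1_score

y_true = [0, 1, 1, 0, 1, 1]
y_pred = [0, 1, 0, 0, 1, 1]

# For binary labels, confusion_matrix is laid out as [[TN, FP], [FN, TP]]
tn, fp, fn, tp = confusion_matrix(y_true, y_pred).ravel()

manual = 2 * tp / (2 * tp + fp + fn)          # 6 / 7 ≈ 0.857
assert np.isclose(manual, f1_score(y_true, y_pred))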

Support beyond binary targets is achieved by treating multiclass and multilabel data as a collection of binary problems, one for each label. For the binary case, setting average='binary' will return the F1 score for pos_label. If average is not 'binary', pos_label is ignored and the F1 scores for both classes are computed, then averaged or both returned (when average=None). Similarly, for multiclass and multilabel targets, the F1 scores for all labels are either returned or averaged depending on the average parameter. Use labels to specify the set of labels to calculate the F1 score for.
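For instance, on a small binary problem (data made up for illustration), pos_label selects which class is scored, while average=None returns both per-class scores in sorted label order:

from sklearn.metrics import f1_score

y_true = [0, 1, 1, 0]
y_pred = [0, 1, 0, 0]

f1_score(y_true, y_pred, pos_label=1)   # 0.666..., F1 of class 1 (the default)
f1_score(y_true, y_pred, pos_label=0)   # 0.8, F1 of class 0
f1_score(y_true, y_pred, average=None)  # array([0.8, 0.66666667]); pos_label ignored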

Read more in the User Guide.

Parameters:
y_true : 1d array-like, or label indicator array / sparse matrix

Ground truth (correct) target values.

y_pred : 1d array-like, or label indicator array / sparse matrix

Estimated targets as returned by a classifier.

labels : array-like, default=None

The set of labels to include when average != 'binary', and their order if average is None. Labels present in the data can be excluded, for example in multiclass classification to exclude a “negative class”. Labels not present in the data can be included and will be “assigned” 0 samples. For multilabel targets, labels are column indices. By default, all labels in y_true and y_pred are used in sorted order.

Changed in version 0.17: Parameter labels improved for multiclass problems.
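A brief illustration of labels, reusing the multiclass data from the Examples section below: restricting the average to a subset of classes, and including a label absent from the data:

from sklearn.metrics import f1_score

y_true = [0, 1, 2, 0, 1, 2]
y_pred = [0, 2, 1, 0, 0, 1]

# Macro-average over classes 1 and 2 only, excluding class 0
f1_score(y_true, y_pred, labels=[1, 2], average='macro')   # 0.0
# Label 3 never occurs, so it is "assigned" 0 samples and yields a zero division
f1_score(y_true, y_pred, labels=[0, 1, 2, 3], average=None, zero_division=0.0)
# array([0.8, 0. , 0. , 0. ])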

pos_label : int, float, bool or str, default=1

The class to report if average='binary' and the data is binary, otherwise this parameter is ignored. For multiclass or multilabel targets, set labels=[pos_label] and average != 'binary' to report metrics for one label only.
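The documented workaround looks like this on the same multiclass data, with class 0 playing the role of pos_label:

from sklearn.metrics import f1_score

y_true = [0, 1, 2, 0, 1, 2]
y_pred = [0, 2, 1, 0, 0, 1]

# pos_label itself is ignored for multiclass targets; restrict via labels instead
f1_score(y_true, y_pred, labels=[0], average='macro')   # 0.8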

average : {'micro', 'macro', 'samples', 'weighted', 'binary'} or None, default='binary'

This parameter is required for multiclass/multilabel targets. If None, the scores for each class are returned. Otherwise, this determines the type of averaging performed on the data:

'binary':

Only report results for the class specified by pos_label. This is applicable only if targets (y_{true,pred}) are binary.

'micro':

Calculate metrics globally by counting the total true positives, false negatives and false positives.

'macro':

Calculate metrics for each label, and find their unweighted mean. This does not take label imbalance into account.

'weighted':

Calculate metrics for each label, and find their average weighted by support (the number of true instances for each label). This alters 'macro' to account for label imbalance; it can result in an F-score that is not between precision and recall.

'samples':

Calculate metrics for each instance, and find their average (only meaningful for multilabel classification where this differs from accuracy_score); see the sketch below.
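To make the 'samples' option concrete, a small multilabel sketch (indicator data made up for illustration): each row is scored on its own, then the row scores are averaged, whereas accuracy_score demands an exact row match:

import numpy as np
from sklearn.metrics import f1_score, accuracy_score

y_true = np.array([[1, 1, 0],
                   [1, 0, 1]])
y_pred = np.array([[1, 0, 0],
                   [1, 0, 1]])

# Row 0: F1 = 2*1/(2*1+0+1) = 0.667; row 1: F1 = 1.0; mean = 0.833
f1_score(y_true, y_pred, average='samples')   # 0.8333...
# Subset accuracy counts row 0 as simply wrong
accuracy_score(y_true, y_pred)                # 0.5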

sample_weight : array-like of shape (n_samples,), default=None

Sample weights.
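Weights rescale each sample's contribution to the true-positive, false-positive, and false-negative counts. A quick sketch with made-up weights:

from sklearn.metrics import f1_score

y_true = [0, 1, 1, 0]
y_pred = [0, 1, 0, 0]

f1_score(y_true, y_pred)                              # 0.666...
# Up-weighting the one missed positive (index 2) lowers recall, and thus F1
f1_score(y_true, y_pred, sample_weight=[1, 1, 3, 1])  # 2*1/(2*1+0+3) = 0.4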

zero_division : {"warn", 0.0, 1.0, np.nan}, default="warn"

Sets the value to return when there is a zero division, i.e. when all predictions and labels are negative.

Notes:

- If set to "warn", this acts like 0, but a warning is also raised.
- If set to np.nan, such values will be excluded from the average.

New in version 1.3: np.nan option was added.
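One consequence worth seeing (assumes scikit-learn ≥ 1.3 for the np.nan option): an undefined per-class score drops out of a macro average instead of dragging it down. Illustrative data:

import numpy as np
from sklearn.metrics import f1_score

y_true = [0, 0, 1]
y_pred = [0, 0, 1]

# Class 2 appears in neither y_true nor y_pred, so its F1 is undefined
f1_score(y_true, y_pred, labels=[0, 1, 2], average='macro', zero_division=0.0)
# 0.666...  (the undefined class counts as 0)
f1_score(y_true, y_pred, labels=[0, 1, 2], average='macro', zero_division=np.nan)
# 1.0       (the undefined class is excluded from the mean)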

Returns:
f1_score : float or array of float, shape = [n_unique_labels]

F1 score of the positive class in binary classification or weighted average of the F1 scores of each class for the multiclass task.

See also

fbeta_score

Compute the F-beta score.

precision_recall_fscore_support

Compute the precision, recall, F-score, and support.

jaccard_score

Compute the Jaccard similarity coefficient score.

multilabel_confusion_matrix

Compute a confusion matrix for each class or sample.
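As a cross-check on these relationships: f1_score is the beta=1 special case of fbeta_score, and precision_recall_fscore_support returns the same per-class F-scores alongside precision, recall, and support:

import numpy as np
from sklearn.metrics import f1_score, fbeta_score, precision_recall_fscore_support

y_true = [0, 1, 2, 0, 1, 2]
y_pred = [0, 2, 1, 0, 0, 1]

assert np.isclose(f1_score(y_true, y_pred, average='macro'),
                  fbeta_score(y_true, y_pred, beta=1.0, average='macro'))

precision, recall, fscore, support = precision_recall_fscore_support(y_true, y_pred)
# fscore -> array([0.8, 0., 0.]), matching f1_score(y_true, y_pred, average=None)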

Notes

When true positive + false positive + false negative == 0 (i.e. a class is completely absent from both y_true and y_pred), the F-score is undefined. In such cases, by default the F-score will be set to 0.0 and UndefinedMetricWarning will be raised. This behavior can be modified by setting the zero_division parameter.
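If you prefer the undefined case to fail loudly during development, one option (standard warnings machinery, not specific to this function) is to escalate the warning to an error:

import warnings
from sklearn.exceptions import UndefinedMetricWarning
from sklearn.metrics import f1_score

with warnings.catch_warnings():
    warnings.simplefilter('error', category=UndefinedMetricWarning)
    try:
        f1_score([0, 0, 0], [0, 0, 0])   # no positives anywhere -> undefined
    except UndefinedMetricWarning as exc:
        print(f'caught: {exc}')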


Examples

>>> import numpy as np
>>> from sklearn.metrics import f1_score
>>> y_true = [0, 1, 2, 0, 1, 2]
>>> y_pred = [0, 2, 1, 0, 0, 1]
>>> f1_score(y_true, y_pred, average='macro')
0.26...
>>> f1_score(y_true, y_pred, average='micro')
0.33...
>>> f1_score(y_true, y_pred, average='weighted')
0.26...
>>> f1_score(y_true, y_pred, average=None)
array([0.8, 0. , 0. ])

>>> # binary classification
>>> y_true_empty = [0, 0, 0, 0, 0, 0]
>>> y_pred_empty = [0, 0, 0, 0, 0, 0]
>>> f1_score(y_true_empty, y_pred_empty)
0.0...
>>> f1_score(y_true_empty, y_pred_empty, zero_division=1.0)
1.0...
>>> f1_score(y_true_empty, y_pred_empty, zero_division=np.nan)
nan...

>>> # multilabel classification
>>> y_true = [[0, 0, 0], [1, 1, 1], [0, 1, 1]]
>>> y_pred = [[0, 0, 0], [1, 1, 1], [1, 1, 0]]
>>> f1_score(y_true, y_pred, average=None)
array([0.66666667, 1. , 0.66666667])

Examples using sklearn.metrics.f1_score

Probability Calibration curves

Precision-Recall

Semi-supervised Classification on a Text Dataset
