#!/usr/bin/env python # coding: utf-8 #

Please cite us if you use the software

# # PyCM Document # ### Version : 4.0 # ----- # ## Table of contents #

Overview
Installation

Source Code
PyPI
Easy Install
MATLAB

Usage

From Vector
Direct CM
Iterating And Casting
Activation Threshold
Load From File
Sample Weights
Transpose
Metrics Off
Relabel
Position
To Array
Combine
Plot
Distance/Similarity
Parameter Recommender
Compare
ROC Curve
Precision-Recall Curve
Multilabel Confusion Matrix
Online Help
Acceptable Data Types

Basic Parameters

True Positive
True Negative
False Positive
False Negative
Condition Positive
Condition Negative
Test Outcome Positive
Test Outcome Negative
Population

Class Statistics

True Positive Rate
True Negative Rate
Positive Predictive Value
Negative Predictive Value
False Negative Rate
False Positive Rate
False Discovery Rate
False Omission Rate
Accuracy
Error Rate
FBeta Score
Matthews Correlation Coefficient
Informedness
Markedness
Positive Likelihood Ratio
Negative Likelihood Ratio
Diagnostic Odds Ratio
Prevalence
G-Measure
Random Accuracy
Random Accuracy Unbiased
Jaccard Index
Information Score
Confusion Entropy
Modified Confusion Entropy
Area Under The ROC Curve
Distance Index
Similarity Index
Discriminant Power
Youden Index
Positive Likelihood Ratio Interpretation
Negative Likelihood Ratio Interpretation
Discriminant Power Interpretation
AUC Value Interpretation
Matthews Correlation Coefficient Interpretation
Yule's Q Interpretation
Gini Index
Lift Score
Automatic/Manual
Bray-Curtis Dissimilarity
Optimized Precision
Index of Balanced Accuracy
G-Mean
Yule's Q
Adjusted G-Mean
Adjusted F-Score
Overlap Coefficient
Braun-Blanquet Similarity
Otsuka Ochiai Coefficient
Tversky Index
Area Under The PR Curve
Individual Classification Success Index
Confidence Interval
Net Benefit
Average
Weighted Average
Sensitivity Index
Hamming Distance

Overall Statistics

Kappa
Kappa Unbiased
Kappa No Prevalence
Weighted Kappa
Kappa Standard Error
Kappa 95% CI
Chi Squared
Chi Squared DF
Phi Squared
Cramer's V
Standard Error
95% CI
Bennett's S
Scott's PI
Gwet's AC1
Reference Entropy
Response Entropy
Cross Entropy
Joint Entropy
Conditional Entropy
Kullback-Leibler Divergence
Mutual Information
Goodman-Kruskal's Lambda A
Goodman-Kruskal's Lambda B
Landis-Koch's Benchmark
Fleiss' Benchmark
Altman's Benchmark
Cicchetti's Benchmark
Cramer's Benchmark
Matthews's Benchmark
Goodman-Kruskal's Lambda A Benchmark
Goodman-Kruskal's Lambda B Benchmark
Krippendorff's Alpha Benchmark
Pearson's C Benchmark
Overall Accuracy
Overall Random Accuracy
Overall Random Accuracy Unbiased
Positive Predictive Value Micro
Negative Predictive Value Micro
True Positive Rate Micro
True Negative Rate Micro
False Positive Rate Micro
False Negative Rate Micro
F1 Score Micro
Positive Predictive Value Macro
Negative Predictive Value Macro
True Positive Rate Macro
True Negative Rate Macro
False Positive Rate Macro
False Negative Rate Macro
F1 Score Macro
Accuracy Macro
Overall Jaccard Index
Hamming Loss
Zero-one Loss
No Information Rate
P Value
Overall Confusion Entropy
Overall Modified Confusion Entropy
Overall Matthews Correlation Coefficient
Global Performance Index
Class Balance Accuracy
AUNU
AUNP
Relative Classifier Information
Pearson's C
Classification Success Index
Adjusted Rand Index
Bangdiwala's B
Krippendorff's Alpha
Weighted Alpha
Aickin's Alpha
Brier Score
Log Loss

Full
Matrix
Normalized Matrix
Stat
Compare Report

Save

pycm
HTML
CSV
object
comp

Input Errors
Examples
Cite
References

# PyCM is a multi-class confusion matrix library written in Python that supports both input data vectors and direct matrix, and a proper tool for post-classification model evaluation that supports most classes and overall statistics parameters. # PyCM is the swiss-army knife of confusion matrices, targeted mainly at data scientists that need a broad array of metrics for predictive models and accurate evaluation of a large variety of classifiers. #

Fig1. ConfusionMatrix Block Diagram

Notice : digit (the number of digits to the right of the decimal point in a number) is new in version 0.6 (default value : 5)
Only for print and save

Notice : cm.statistic_result prev versions (0.2 >)

Notice : new in version 0.3

Notice : _ removed from overall statistics names in version 1.6

Notice : matrix, normalized_matrix & normalized_table added in version 1.5 (changed from print style)

Notice : numpy.array support in versions > 0.7

Notice : classes added in version 3.2

Notice : __getitem__ and __contains__ methods added in version 3.8

Notice : new in version 0.8.1
In direct matrix mode actual_vector and predict_vector are empty

Notice : confusion matrices input in array format is new in version 3.6

Notice : new in version 3.5

Example 3

Notice : new in version 0.9

Example 4

Notice : new in version 0.9.5

Example 5

Notice : new in version 1.2

Notice : new in version 1.2

Notice : new in version 3.9

Notice : new in version 1.5

Notice : sort added in version 3.9

Notice : new in version 2.8

Notice : only works in vector mode

Notice : new in version 2.9

Notice : new in version 3.0

Notice : new in version 3.0

Notice : new in version 3.8

Fig2. Parameter Recommender Block Diagram

Notice : also available in HTML report

Notice : The recommender system assumes that the input is the result of classification over the whole data rather than just a part of it. If the confusion matrix is the result of test data classification, the recommendation is not valid.

Notice : is_imbalanced , new in version 3.3

Fig3. Compare Block Diagram

Notice : overall_benchmark_weight and class_benchmark_weight, new in version 3.3

Notice : From version 3.8, Goodman-Kruskal's Lambda A, Goodman-Kruskal's Lambda B, Krippendorff's Alpha, and Pearson's C benchmarks are considered in the overall score and default weights of the overall benchmarks are modified accordingly.

Notice : new in version 3.7

Notice : new in version 3.7

Notice : new in version 4.0

Notice : alt_link , new in version 2.4

ConfusionMatrix

Notice : metrics_off, new in version 3.9

Compare

Notice : weight renamed to class_weight in version 3.3

Notice : overall_benchmark_weight and class_benchmark_weight, new in version 3.3

ROCCurve

PRCurve

MultiLabelCM

PLR	Model contribution
1 >	Negligible
1 - 5	Poor
5 - 10	Fair
> 10	Good

NLR	Model contribution
0.5 - 1	Negligible
0.2 - 0.5	Poor
0.1 - 0.2	Fair
0.1 >	Good

AUC	Model performance
0.5 - 0.6	Poor
0.6 - 0.7	Fair
0.7 - 0.8	Good
0.8 - 0.9	Very Good
0.9 - 1.0	Excellent

MCC	Interpretation
0.3 >	Negligible
0.3 - 0.5	Weak
0.5 - 0.7	Moderate
0.7 - 0.9	Strong
0.9 - 1.0	Very Strong

Q	Interpretation
0.25 >	Negligible
0.25 - 0.5	Weak
0.5 - 0.75	Moderate
> 0.75	Strong

Kappa	Strength of Agreement
0 >	Poor
0 - 0.2	Slight
0.2 – 0.4	Fair
0.4 – 0.6	Moderate
0.6 – 0.8	Substantial
0.8 – 1.0	Almost perfect

Kappa	Strength of Agreement
0.40 >	Poor
0.40 - 0.75	Intermediate to Good
More than 0.75	Excellent

Kappa	Strength of Agreement
0.2 >	Poor
0.2 – 0.4	Fair
0.4 – 0.6	Moderate
0.6 – 0.8	Good
0.8 – 1.0	Very Good

Cramer's V	Strength of Association
0.1 >	Negligible
0.1 – 0.2	Weak
0.2 – 0.4	Moderate
0.4 – 0.6	Relatively Strong
0.6 – 0.8	Strong
0.8 – 1.0	Very Strong

Overall MCC	Strength of Association
0.3 >	Negligible
0.3 - 0.5	Weak
0.5 - 0.7	Moderate
0.7 - 0.9	Strong
0.9 - 1.0	Very Strong

Alpha	Strength of Agreement
0.667 >	Low
0.667 - 0.8	Tentative
0.8 <	High

Lambda A	Strength of Association
0 - 0.2	Very Weak
0.2 - 0.4	Weak
0.4 - 0.6	Moderate
0.6 - 0.8	Strong
0.8 - 1.0	Very Strong
1.0	Perfect

Lambda B	Strength of Association
0 - 0.2	Very Weak
0.2 - 0.4	Weak
0.4 - 0.6	Moderate
0.6 - 0.8	Strong
0.8 - 1.0	Very Strong
1.0	Perfect

C	Strength of Association
0 - 0.1	Not Appreciable
0.1 - 0.2	Weak
0.2 - 0.3	Medium
0.3 <	Strong