Metrics#

Regression#

RMSE and MAE#

There is apparently an ongoing debate about which of the two is the better metric.

Chai, T., & Draxler, R. R. (2014). Root mean square error (RMSE) or mean absolute error (MAE)? – Arguments against avoiding RMSE in the literature. Geoscientific Model Development, 7(3), 1247-1250.
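One concrete difference between the two metrics is how they react to a single large error. The sketch below is illustrative only: the helper names `rmse` and `mae` and the toy numbers are assumptions, not taken from the paper. It compares two sets of predictions with the same total absolute error.

```python
import numpy as np


def rmse(y_true, y_pred):
    """Root mean square error: sqrt(mean((y_true - y_pred)^2))."""
    diff = np.asarray(y_true, dtype=float) - np.asarray(y_pred, dtype=float)
    return float(np.sqrt(np.mean(diff ** 2)))


def mae(y_true, y_pred):
    """Mean absolute error: mean(|y_true - y_pred|)."""
    diff = np.asarray(y_true, dtype=float) - np.asarray(y_pred, dtype=float)
    return float(np.mean(np.abs(diff)))


# Toy data (hypothetical): the total absolute error is 8 in both cases
y_true = [0.0, 0.0, 0.0, 0.0]
even_errors = [2.0, 2.0, 2.0, 2.0]   # error spread evenly
one_outlier = [0.0, 0.0, 0.0, 8.0]   # error concentrated in one sample

print(mae(y_true, even_errors), rmse(y_true, even_errors))   # 2.0 2.0
print(mae(y_true, one_outlier), rmse(y_true, one_outlier))   # 2.0 4.0
```

MAE is identical in both cases, while RMSE doubles when the error is concentrated in one sample, because squaring weights large errors more heavily.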

Classification#

Recall#

Recall: the fraction of actually positive samples that are correctly predicted as positive. It measures how well false negatives are kept small.

$$
\text{Recall} = \frac{TP}{TP + FN}
$$

Precision#

Precision: the fraction of samples predicted as positive that are actually positive. It measures how well false positives are kept small.

$$
\text{Precision} = \frac{TP}{TP + FP}
$$

F1-score#

Precision and recall trade off against each other (https://datawokagaku.com/f1score/ ), so the F1-score, defined as their harmonic mean, is also used.

$$
F_1 = \frac{2}{\frac{1}{\text{Recall}} + \frac{1}{\text{Precision}}} = 2 \cdot \frac{\text{Recall} \times \text{Precision}}{\text{Recall} + \text{Precision}}
$$
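As a worked example of the three formulas above, the following sketch computes precision, recall, and F1 from confusion-matrix counts. The function name `precision_recall_f1`, the example counts, and the choice to return 0.0 when a denominator is zero are all assumptions made for illustration.

```python
def precision_recall_f1(tp, fp, fn):
    """Compute (precision, recall, F1) from confusion-matrix counts."""
    # Assumption: an undefined ratio (zero denominator) is reported as 0.0
    precision = tp / (tp + fp) if (tp + fp) > 0 else 0.0
    recall = tp / (tp + fn) if (tp + fn) > 0 else 0.0
    if precision + recall == 0:
        return precision, recall, 0.0
    # Harmonic mean of precision and recall
    f1 = 2 * precision * recall / (precision + recall)
    return precision, recall, f1


# Hypothetical counts: 8 true positives, 2 false positives, 8 false negatives
p, r, f1 = precision_recall_f1(tp=8, fp=2, fn=8)
print(round(p, 3), round(r, 3), round(f1, 3))  # 0.8 0.5 0.615
```

The harmonic mean sits closer to the smaller of the two values (0.615 here, versus an arithmetic mean of 0.65), so a model cannot reach a high F1 by excelling at only one of them.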

ROC curve#

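The ROC curve is built from two rates: the true positive rate $TPR$ (another name for recall, also called sensitivity: the fraction of actual positives correctly predicted as positive) and the false positive rate $FPR$ (the fraction of actual negatives wrongly predicted as positive).

$$
\begin{aligned}
TPR &= \frac{TP}{P} = \frac{TP}{TP + FN} = \frac{\text{positives predicted as positive}}{\text{actual positives}} \\
FPR &= \frac{FP}{N} = \frac{FP}{FP + TN} = \frac{\text{negatives predicted as positive}}{\text{actual negatives}}
\end{aligned}
$$

The curve obtained by plotting $TPR$ against $FPR$ while varying the decision threshold is called the receiver operating characteristic (ROC) curve.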

The area under the ROC curve (Area Under the Curve) is called ROC-AUC. For a random classifier (chance level), ROC-AUC is about 0.5.
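The threshold sweep described above can be sketched directly with NumPy. This is a minimal illustration, not a reference implementation: the helper names `roc_points` and `auc_trapezoid` and the toy data are assumptions, and in practice a library routine such as scikit-learn's `roc_curve` would normally be used.

```python
import numpy as np


def roc_points(y_true, scores):
    """Return (fpr, tpr) arrays obtained by lowering the threshold step by step."""
    y_true = np.asarray(y_true)
    scores = np.asarray(scores, dtype=float)
    n_pos = np.sum(y_true == 1)
    n_neg = np.sum(y_true == 0)
    # Start above the maximum score so the curve begins at (0, 0)
    thresholds = np.concatenate(([np.inf], np.sort(np.unique(scores))[::-1]))
    fpr, tpr = [], []
    for t in thresholds:
        pred = scores >= t  # predict positive when the score reaches the threshold
        tpr.append(np.sum(pred & (y_true == 1)) / n_pos)
        fpr.append(np.sum(pred & (y_true == 0)) / n_neg)
    return np.array(fpr), np.array(tpr)


def auc_trapezoid(x, y):
    """Area under a piecewise-linear curve via the trapezoid rule."""
    return float(np.sum((x[1:] - x[:-1]) * (y[1:] + y[:-1]) / 2.0))


# Small hand-checkable example (hypothetical scores)
fpr, tpr = roc_points([0, 0, 1, 1], [0.1, 0.4, 0.35, 0.8])
print(auc_trapezoid(fpr, tpr))  # 0.75

# Scores unrelated to the labels should give an AUC close to 0.5
rng = np.random.default_rng(0)
y_rand = rng.integers(0, 2, size=3000)
s_rand = rng.random(3000)
auc_rand = auc_trapezoid(*roc_points(y_rand, s_rand))
print(auc_rand)
```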

../../_images/eac7bbda2ce4fca48291e9f68442a2f35ae37aaae3f5575dd965d371ff1caa12.png

Precision-Recall Curve#

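A graph obtained by computing recall and precision at various thresholds and connecting the resulting points, with recall on the horizontal axis and precision on the vertical axis.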

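The area under the PR curve (Area Under the Curve) is called PR-AUC or Average Precision (AP). AP is computed as the following sum, where $P_k$ and $R_k$ are the precision and recall at the $k$-th threshold: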

$$
AP = \sum_{k=1}^{K} (R_k - R_{k-1}) P_k
$$

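Unlike ROC-AUC, PR-AUC has no fixed reference point such as "0.5 for a random algorithm": its scale depends on the class balance (the degree of imbalance). A random classifier scores roughly the fraction of positive samples.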

For that reason, it is said to be well suited to evaluating models on imbalanced data.
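The AP sum above can be sketched with NumPy as follows. The function name `average_precision`, the convention $R_0 = 0$, and the toy data are assumptions for illustration. The second half checks the class-balance dependence empirically: for scores unrelated to the labels, AP tends to land near the fraction of positives rather than near 0.5.

```python
import numpy as np


def average_precision(y_true, scores):
    """AP = sum_k (R_k - R_{k-1}) * P_k, sweeping thresholds from high to low."""
    y_true = np.asarray(y_true)
    scores = np.asarray(scores, dtype=float)
    n_pos = np.sum(y_true == 1)
    ap, prev_recall = 0.0, 0.0  # assumption: R_0 = 0
    for t in np.sort(np.unique(scores))[::-1]:
        pred = scores >= t
        tp = np.sum(pred & (y_true == 1))
        precision = tp / np.sum(pred)  # pred is never empty: t is an existing score
        recall = tp / n_pos
        ap += (recall - prev_recall) * precision
        prev_recall = recall
    return float(ap)


# Small hand-checkable example (hypothetical scores): 0.5 * 1 + 0.5 * (2/3)
ap_small = average_precision([0, 0, 1, 1], [0.1, 0.4, 0.35, 0.8])
print(round(ap_small, 4))  # 0.8333

# Random scores on data with about 5% positives
rng = np.random.default_rng(0)
y_rand = (rng.random(3000) < 0.05).astype(int)
prevalence = float(y_rand.mean())
ap_random = average_precision(y_rand, rng.random(3000))
print(prevalence, ap_random)  # both values should be small and of similar size
```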

../../_images/3cf7d57b5a66c4ae2d06540993aad64604ca638093cc4754172e0f17b8ccb30f.png

Imbalanced data and PR/ROC curves#

Suppose we have the following imbalanced dataset.

from sklearn.datasets import make_classification

# Two-class data in which about 95% of the samples are class 0 and 5% are class 1
size = 1000
X, y = make_classification(n_samples=size, n_features=2, n_informative=1, n_redundant=1, weights=[0.95, 0.05],
                           class_sep=0.5, n_classes=2, n_clusters_per_class=1, random_state=0)

# Count the samples in each class to confirm the imbalance
import pandas as pd
pd.Series(y).value_counts()

0    943
1     57
Name: count, dtype: int64

accuracy_score=0.94
recall_score=0.053
precision_score=0.43
f1_score=0.094
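The code that produced these scores is not shown. The values are consistent with TP = 3, FP = 4, FN = 54 over all 1000 samples (recall 3/57, precision 3/7, accuracy 942/1000), which means accuracy is high only because 943 of the 1000 samples are negative. The following is a minimal sketch of how such scores could be computed, assuming a logistic regression model and out-of-fold predictions from 5-fold cross-validation. Both choices are assumptions, so the printed numbers will not necessarily match those above.

```python
from sklearn.datasets import make_classification
from sklearn.linear_model import LogisticRegression
from sklearn.metrics import accuracy_score, f1_score, precision_score, recall_score
from sklearn.model_selection import cross_val_predict

# Same data-generating call as in the text
X, y = make_classification(n_samples=1000, n_features=2, n_informative=1, n_redundant=1,
                           weights=[0.95, 0.05], class_sep=0.5, n_classes=2,
                           n_clusters_per_class=1, random_state=0)

# Assumption: logistic regression with 5-fold out-of-fold predictions,
# so every sample gets a prediction from a model that did not see it
y_pred = cross_val_predict(LogisticRegression(), X, y, cv=5)

scores = {
    "accuracy_score": accuracy_score(y, y_pred),
    "recall_score": recall_score(y, y_pred, zero_division=0),
    "precision_score": precision_score(y, y_pred, zero_division=0),
    "f1_score": f1_score(y, y_pred, zero_division=0),
}
for name, value in scores.items():
    print(f"{name}={value:.2g}")
```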


../../_images/3d678622e20e8da305aaec0cb5d9fae622cd395cfb3ab53958e70a436bcb5fd6.png

../../_images/6af5e5d4077e21971601526d0a9faf60670a20b9368fd6247e5dcdca9bdcc261.png

Cosine similarity#

The idea: vectors that point in similar directions are considered similar.

Reference: コサイン類似度（Cosine Similarity）とは？：AI・機械学習の用語辞典 - ＠IT ("What is cosine similarity?", AI and machine learning glossary, ＠IT)

In other words, the cosine of the angle between two vectors can be used as a similarity measure.

$$
\begin{aligned}
a \cdot b &= \|a\| \|b\| \cos(a, b) \\
\implies \cos(a, b) &= \frac{a \cdot b}{\|a\| \|b\|}
\end{aligned}
$$
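The formula translates almost directly into NumPy. In this minimal sketch the function name `cosine_similarity` and the choice to raise an error for zero vectors are assumptions made for illustration.

```python
import numpy as np


def cosine_similarity(a, b):
    """cos(a, b) = (a . b) / (||a|| ||b||)."""
    a = np.asarray(a, dtype=float)
    b = np.asarray(b, dtype=float)
    denom = np.linalg.norm(a) * np.linalg.norm(b)
    if denom == 0:
        # Assumption: treat the zero vector as an error, since the angle is undefined
        raise ValueError("cosine similarity is undefined for a zero vector")
    return float(np.dot(a, b) / denom)


print(cosine_similarity([1, 0], [2, 0]))   # same direction: 1.0
print(cosine_similarity([1, 0], [0, 3]))   # orthogonal: 0.0
print(cosine_similarity([1, 0], [-1, 0]))  # opposite direction: -1.0
```

The first example shows that only direction matters: scaling a vector by a positive factor leaves the similarity unchanged.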

../../_images/15fee7f72b598cb5e25502a147f544cc9c49cf61c1dc5bf3bc7e9653561b4b0c.png

../../_images/3b291d1be72447c2a9c04d4af1bce4eb34f853e55f6dd293991974f077ea43d9.png