NO.PZ2021083101000007
问题如下:
Select performance data from the cross- validation set confusion matrices is presented in Exhibit 1:
Azarov and Bector evaluate the Dataset XYZ performance metrics for Confusion Matrices A, B, and C in Exhibit 1. Azarov says, “For Ganymede’s purposes, we should be most concerned with the cost of Type I errors. ”
Based on Exhibit 1, which confusion matrix demonstrates the most favorable value of the performance metric that best addresses Azarov’s concern?
选项:
A.Confusion Matrix A
Confusion Matrix B
Confusion Matrix C
解释:
A is correct.
Precision is the ratio of correctly predicted positive classes to all predicted positive classes and is useful in situations where the cost of false positives or Type I errors is high.
Confusion Matrix A has the highest precision and therefore demonstrates the most favorable value of the performance metric that best addresses Azarov’s concern about the cost of Type I errors.
Confusion Matrix A has a precision score of 0.95, which is higher than the precision scores of Confusion Matrix B (0.93) and Confusion Matrix C (0.86).
B is incorrect because precision, not accuracy, is the performance measure that best addresses Azarov’s concern about the cost of Type I errors.
Confusion Matrix B demonstrates the most favorable value for the accuracy score (0.92), which is higher than the accuracy scores of Confusion Matrix A (0.91) and Confusion Matrix C (0.91).
Accuracy is a performance measure that gives equal weight to false positives and false negatives and is considered an appropriate performance measure when the class distribution in the dataset is equal (a balanced dataset).
However, Azarov is most concerned with the cost of false positives, or Type I errors, and not with finding the equilibrium between precision and recall.
Furthermore, Dataset XYZ has an unequal (unbalanced) class distribution between positive sentiment and negative sentiment sentences.
C is incorrect because precision, not recall or F1 score, is the performance measure that best addresses Azarov’s concern about the cost of Type I errors.
Confusion Matrix C demonstrates the most favorable value for the recall score (0.97), which is higher than the recall scores of Confusion Matrix A (0.87) and Confusion Matrix B (0.90).
Recall is the ratio of correctly predicted positive classes to all actual positive classes and is useful in situations where the cost of false negatives, or Type II errors, is high.
However, Azarov is most concerned with the cost of Type I errors, not Type II errors.
F1 score is more appropriate (than accuracy) when there is unequal class distribution in the dataset and it is necessary to measure the equilibrium of precision and recall.
Confusion Matrix C demonstrates the most favorable value for the F1 score (0.92), which is higher than the F1 scores of Confusion Matrix A (0.91) and Confusion Matrix B (0.91).
Although Dataset XYZ has an unequal class distribution between positive sentiment and negative sentiment sentences, Azarov is most concerned with the cost of false positives, or Type I errors, and not with finding the equilibrium between precision and recall.
考点:Model Training - Performance Evaluation
Azarov and Bector evaluate the Dataset XYZ performance metrics for Confusion Matrices A, B, and C in Exhibit 1. Azarov says, “For Ganymede’s purposes, we should be most concerned with the cost of Type I errors. ”
Based on Exhibit 1, which confusion matrix demonstrates the most favorable value of the performance metric that best addresses Azarov’s concern?
就是最关心的是type I error。然后题目问的是:哪个confusion matrix能够最好的表现出Azarov的担忧,不过,显示出来的是最好的值?就是最让他可以不用担忧吗?感觉题目的表述好难理解。
不用担忧就是Precision越接近于1越好,所以是A,是这样吗?