NO.PZ202208300200000604
问题如下:
The method of scaling that Khan will use on Dataset 1 is most likely:选项:
A.winsorization. B.normalization. C.standardization.解释:
SolutionC is correct. Standardization is the process of both centering and scaling the variables. In this case, each temperature value will be standardized against the historical mean temperature for the geographic area. Standardization requires that the data be normally distributed, and the resulting standardized variable will have an arithmetic mean of 0 and a standard deviation of 1.
A is incorrect. Winsorization is a process of handling outliers that replaces extreme values and outliers with the maximum (for large-value outliers) and minimum (for small-value outliers) values of data points that are not outliers.
B is incorrect. Normalization is the process of rescaling numeric values in the rage of [0, 1], which is not the case for determining variation against historical temperature. Normalization can be used when the distribution of data is not known but treatment of outliers is necessary.
normalization.为什么不选?