海伦岛主 · August 14, 2021

Why is A incorrect?

NO.PZ2015120204000048

Question:

Steele performs exploratory data analysis. To assist in feature selection, she wants to create a visualization that shows the most informative words in the dataset based on their term frequency (TF) values.

Given her objective, the visualization that Steele should create in the exploratory data analysis step is a:

Options:

A. scatter plot.

B. word cloud.

C. document term matrix.

Explanation:

B is correct. Steele wants to create a visualization that shows the most informative words in the dataset based on their term frequency (TF, the ratio of the number of times a given token occurs in the dataset to the total number of tokens in the dataset) values. A word cloud is a common visualization when working with text data as it can be made to visualize the most informative words and their TF values. The most commonly occurring words in the dataset can be shown by varying font size, and color is used to add more dimensions, such as frequency and length of words.
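
As a minimal sketch of the TF definition in the explanation (the count of a token divided by the total number of tokens in the dataset), the calculation might look like the following. The function name `term_frequencies` and the toy token list are assumptions for illustration and are not part of the question.

```python
from collections import Counter


def term_frequencies(tokens):
    """Return {token: count / total number of tokens} for a tokenized dataset."""
    total = len(tokens)
    if total == 0:
        return {}
    counts = Counter(tokens)
    return {tok: n / total for tok, n in counts.items()}


# Hypothetical toy dataset, already tokenized (an assumption for illustration)
tokens = ["market", "risk", "market", "return", "market", "risk"]
tf = term_frequencies(tokens)

# Rank tokens from highest to lowest TF; the top entries are the ones
# a word cloud would draw most prominently
ranked = sorted(tf.items(), key=lambda kv: kv[1], reverse=True)
```

The ranked list is the input a word cloud needs: one value per word, which is then encoded visually.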


1 answer

星星_品职助教 · August 15, 2021

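Hello,

The question asks for "..... create a visualization that shows the most informative words in the dataset based on their term frequency (TF) values". That is essentially the definition of a word cloud: the words that occur most frequently are displayed most prominently. You can pick B directly.

In a scatter plot every point carries the same visual weight, so the plot cannot show which word occurs more frequently or which one is "informative".

That said, the most reliable approach is simply to know the definition of a word cloud.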
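
To make the contrast with a scatter plot concrete, here is a rough sketch of how a word cloud could turn TF values into font sizes, so that higher-TF words are drawn larger. The linear scaling, the function name `font_sizes`, the point-size range, and the TF values are all assumptions for illustration; actual word cloud tools may use a different scaling.

```python
def font_sizes(tf, min_pt=10.0, max_pt=48.0):
    """Linearly map TF values to font sizes between min_pt and max_pt."""
    if not tf:
        return {}
    lo, hi = min(tf.values()), max(tf.values())
    if hi == lo:
        # All words are equally frequent, so draw them all at the same size
        return {word: max_pt for word in tf}
    scale = (max_pt - min_pt) / (hi - lo)
    return {word: min_pt + (value - lo) * scale for word, value in tf.items()}


# Hypothetical TF values (an assumption for illustration)
tf = {"market": 0.5, "risk": 0.3, "return": 0.2}
sizes = font_sizes(tf)
```

Here the highest-TF word gets the largest font and the lowest-TF word the smallest, whereas a scatter plot would draw all three as identical markers.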