开发者:上海品职教育科技有限公司 隐私政策详情

应用版本:4.2.11(IOS)|3.2.5(安卓)APP下载

xiaobaiybz · 2022年03月25日

这题在强化班讲义的哪一页呢?中文如何翻译呢?谢谢老师

* 问题详情,请 查看题干

NO.PZ202108310100000101

问题如下:

Based on the text exploration method used for Dataset ABC, tokens that potentially carry important information useful for differentiating the sentiment embedded in the text are most likely to have values that are:

选项:

A.

low

B.

intermediate

C.

high

解释:

B is correct.

When analyzing term frequency at the corpus level, also known as collection frequency, tokens with intermediate term frequency (TF) values potentially carry important information useful for differentiating the sentiment embedded in the text.

A is incorrect because tokens with the lowest TF values are mostly proper nouns or sparse terms (noisy terms) that are not important to the meaning of the text.

C is incorrect because tokens with the highest TF values are mostly stop words (noisy terms) that do not contribute to differentiating the sentiment embedded in the text.

这题在强化班讲义的哪一页呢?中文如何翻译呢?谢谢老师

1 个答案

星星_品职助教 · 2022年03月26日

同学你好,

本题大意为什么样的token会携带有辨识度的信息,即需要保留。

排除高TF(排除stop words),和低TF(排除专有名词等sparse word)的词汇,选择不高不低的intermediate即可。

关于本题的考点,以及TF的讲解都可以参照强化班这一页的内容。