开发者:上海品职教育科技有限公司 隐私政策详情

应用版本:4.2.11(IOS)|3.2.5(安卓)APP下载

youtkr · 2022年07月13日

关于number是不是也不对

* 问题详情,请 查看题干

NO.PZ201512020300000607

问题如下:

Is Steele’s statement regarding Step 1 of the preprocessing of raw text data correct?

选项:

A.

Yes.

B.

No, because her suggested treatment of punctuation is incorrect.

C.

No, because her suggested treatment of extra white spaces is incorrect

解释:

B is correct. Although most punctuations are not necessary for text analysis and should be removed, some punctuations (e.g., percentage signs, currency symbols, and question marks) may be useful for ML model training. Such punctuations should be substituted with annotations (e.g., /percentSign/, /dollarSign/, and /questionMark/) to preserve their grammatical meaning in the text. Such annotations preserve the semantic meaning of important characters in the text for further text processing and analysis stages.

如题

1 个答案

星星_品职助教 · 2022年07月13日

同学你好,

是的,完全移除所有的numbers也属于太绝对了。

  • 1

    回答
  • 1

    关注
  • 436

    浏览
相关问题