开发者:上海品职教育科技有限公司 隐私政策详情

应用版本:4.2.11(IOS)|3.2.5(安卓)APP下载

小鱼 · 2020年01月20日

问一道题:NO.PZ2015120204000041

问题如下:

Steele begins building a model that combines the structured financial data and the sentiment data. She starts with cleansing and wrangling the raw structured financial data. Exhibit 1 presents a small sample of the raw dataset before cleansing: Each row represents data for a particular firm.

Exhibit 1 Sample of Raw Structured Data Before Cleansing


What type of error appears to be present in the IPO Date column of Exhibit 1?

选项:

A.

invalidity error.

B.

inconsistency error.

C.

non-uniformity error.

解释:

C is correct. A non-uniformity error occurs when the data are not presented in an identical format. The data in the “IPO Date” column represent the IPO date of each firm. While all rows are populated with valid dates in the IPO Date column, the dates are presented in different formats (e.g., mm/dd/yyyy, dd/mm/yyyy).

这道题我估计再做一遍还会错,怎么样去区分 inconsistency error 和 non-uniformity error

1 个答案

星星_品职助教 · 2020年01月20日

同学你好,

数据不一致(inconsistency error)。数据之间产生了明显的冲突和不一致现象。例如“某某女士”的性别却显示为男。

数据格式不统一(non-uniformity error)。数据前后的格式不相同。例如这道题,IPO date这一列的第一行的格式明显和后三行不同。但不存在冲突的现象