开发者:上海品职教育科技有限公司 隐私政策详情

应用版本:4.2.11(IOS)|3.2.5(安卓)APP下载

Jon · 2023年03月07日

K nearest neighbor 与k means clustering

* 问题详情,请 查看题干

NO.PZ201512020300000503

问题如下:

Which of the following machine learning techniques is most appropriate for executing Step 2:

选项:

A.

K-Means Clustering

B.

Principal Components Analysis (PCA)

C.

Classification and Regression Trees (CART)

解释:

A is correct. K-Means clustering is an unsupervised machine learning algorithm which repeatedly partitions observations into a fixed number, k, of nonoverlapping clusters (i.e., groups).
B is incorrect. Principal Components Analysis is a long-established statistical method for dimension reduction, not clustering. PCA aims to summarize or reduce highly correlated features of data into a few main, uncorrelated composite variables.
C is incorrect. CART is a supervised machine learning technique that is most commonly applied to binary cla
ssification or regression.

K nearest neighbor 与k means clustering老是弄混,如何记忆

1 个答案

星星_品职助教 · 2023年03月07日

同学你好,

这两种方法除了名字里都有个K以外,差别挺大的。

定义是不同的:K nearest neighbor是supervised learning,需要有Y变量即target variable;而k means clustering是unsupervised learning,不需要Y变量,这是最本质的区别。

做法是不同的:KNN需要先对已有数据进行分类标注,然后再将新的待分类的那一个数据点根据距离最近的“邻居”进行分类;k means clustering是先设几个中心点,然后把其他数据点分别根据中心点进行聚类。

总之就是这两种方法除了名字类似外,其余没什么联系。


  • 1

    回答
  • 0

    关注
  • 408

    浏览
相关问题

NO.PZ201512020300000503 问题如下 Whiof the following machine learning techniques is most appropriate for executing Step 2: A.K-Means Clustering B.PrincipComponents Analysis (PC C.Classification anRegression Trees (CART) A is correct. K-Means clustering is unsupervisemachine learning algorithm whirepeatey partitions observations into a fixenumber, k, of nonoverlapping clusters (i.e., groups).B is incorrect. PrincipComponents Analysis is a long-establishestatisticmethofor mension rection, not clustering. Paims to summarize or rehighly correlatefeatures of ta into a few main, uncorrelatecomposite variables.C is incorrect. CART is a supervisemachine learning technique this most commonly applieto binary classification or regression. 这题不是根据financiannon-financicharacteristics进行分类了么,这种应该是监督式学习啊,为什么是k-mean啊,不是很明白。非监督式不是不应该设定参数的特点么,

2023-05-29 15:20 1 · 回答

NO.PZ201512020300000503 PCA是哪里讲到的?可以解析一下吗?谢谢

2021-10-08 18:38 1 · 回答

NO.PZ201512020300000503 老师好,根据题目信息,比较容易排除B。然后我就不知道A和C两个选哪一个。A我觉得也对,那我讲讲为什么我觉得C也对吧。 题干说根据一些财务和非财务的特征分成20个小组,其实我们用树的形式也可以分成20个组呀。> <麻烦老师帮忙讲下,谢谢。

2021-05-22 14:24 1 · 回答

老师请问Step 2里有这样的描述“financiannon-financicharacteristics”, 这个描述是否属于贴了标签呢?

2020-02-22 18:22 1 · 回答