Question

I would like to run K-means on specific columns of my data set.
As these are categorical data, I plan to one-hot encode them. Is it possible to run K-means on specific columns only and then display the results (for one group, for example) with all the columns?

For example, I have col1, col2 and col3. I want to run K-means on col2 and col3, which are one-hot encoded, and display the results with col1, col2 and col3.
I hope I have expressed my question clearly.

Answer 1

Score: 4


This follows the basic scikit-learn documentation for KMeans:

from sklearn.cluster import KMeans
# Select the columns to cluster on here (they must be numeric, e.g. already one-hot encoded)
X = df[['col1', 'col2', 'col3']]
kmeans = KMeans(n_clusters=2, random_state=0).fit(X)
# predict() returns one cluster label per row
kmeans.predict(X)

So kmeans.predict gives you the group of each row, which you can then add as a new column to your original data.
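
For the case in the question (cluster on col2 and col3 only, then show col1 as well), a minimal sketch could look like the following. The toy DataFrame, the `cluster` column name and the choice of `pd.get_dummies` for the one-hot encoding are assumptions made for illustration; they are not taken from the question.

```python
import pandas as pd
from sklearn.cluster import KMeans

# Hypothetical toy data standing in for the asker's DataFrame
df = pd.DataFrame({
    "col1": [10, 20, 30, 40, 50, 60],
    "col2": ["a", "a", "b", "b", "a", "b"],
    "col3": ["x", "x", "y", "y", "x", "y"],
})

# One-hot encode only the columns used for clustering; col1 is left out
X = pd.get_dummies(df[["col2", "col3"]], dtype=float)

# n_init is set explicitly because its default differs between scikit-learn versions
kmeans = KMeans(n_clusters=2, n_init=10, random_state=0)

# Store the label on the original frame so all original columns stay visible
df["cluster"] = kmeans.fit_predict(X)

# Display every column for one group
group0 = df[df["cluster"] == 0]
print(group0)
```

Because the labels are written back to `df` rather than to the encoded frame `X`, filtering on `cluster` returns the original, non-encoded columns, including those that were not used for clustering.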


