2020年1月6日 15:11:39go评论104阅读模式

英文:

How to export cleaned data from a jupyter notebook, not the original data

问题

我刚刚开始学习使用Jupyter笔记本。我有一个名为“Diseases”的数据文件。

打开数据文件

import pandas as pd
df = pd.read_csv('Diseases.csv')

选择来自名为“DIABETES”的列的数据，即选择具有糖尿病的受试者ID，是1，否则是0。

df[df.DIABETES > 1]

现在我想导出这些经过清理的数据（行数较少）

df.to_csv('diabetes-filtered.csv')

这将导出原始数据文件，而不是具有较少行的已筛选df。
我在另一个问题中看到需要使用inplace参数。但我不知道该如何使用。

英文:

I have just started to learn to use Jupyter notebook. I have a data file called 'Diseases'.

Opening data file

import pandas as pd
df = pd.read_csv(&#39;Diseases.csv&#39;)

Choosing data from a column named 'DIABETES', i.e choosing subject IDs that have diabetes, yes is 1 and no is 0.

df[df.DIABETES &gt;1]

Now I want to export this cleaned data (that has fewer rows)

df.to_csv(&#39;diabetes-filtered.csv&#39;)

This exports the original data file, not the filtered df with fewer rows.
I saw in another question that the inplace argument needs to be used. But I don't know how.

答案1

得分: 4

你忘记将筛选后的 DataFrame 分配回 df1：

import pandas as pd 
df = pd.read_csv('Diseases.csv')
df1 = df[df.DIABETES > 1]
df1.to_csv('diabetes-filtered.csv')

或者你可以将筛选和导出到文件链接在一起：

import pandas as pd 
df = pd.read_csv('Diseases.csv')
df[df.DIABETES > 1].to_csv('diabetes-filtered.csv')

英文:

You forget assign back filtered DataFrame, here to df1:

import pandas as pd 
df = pd.read_csv(&#39;Diseases.csv&#39;)
df1 = df[df.DIABETES &gt;1]
df1.to_csv(&#39;diabetes-filtered.csv&#39;)

Or you can chain filtering and exporting to file:

import pandas as pd 
df = pd.read_csv(&#39;Diseases.csv&#39;)
df[df.DIABETES &gt;1].to_csv(&#39;diabetes-filtered.csv&#39;)

通过集体智慧和协作来改善编程学习和解决问题的方式。致力于成为全球开发者共同参与的知识库，让每个人都能够通过互相帮助和分享经验来进步。

如何从 Jupyter 笔记本中导出经过清理的数据，而不是原始数据。

问题

答案1

Python和临时文件移动到临时路径后，它们不再被视为文件。

如何使用Python从亚马逊的搜索结果中爬取所有页面。

递归乘法的时间复杂度

如何在使用特定语言环境的小数分隔符时格式化小数精度？

如何在Playwright视觉比较中屏蔽多个定位器？

在C++中，可以使用可变模板参数来检索类型的内部类型。

selenium.common.exceptions.StaleElementReferenceException: Message: stale element reference: stale element not found

Creating and opening a URL to log in to Website via Basic Auth with Robot Framework/Selenium (Python)

AG Grid 在上下文菜单中以大文本形式打开

What's the correct way to type hint an empty list as a literal in python?

如何在Highcharts Gantt中更改本地化的星期名称

如何在同一个流中使用多个过滤器和映射函数？

如何使用Map/Set来将代码优化到O(n)？

.NET MAUI Android在GitHub Actions上构建失败，错误代码为1。