2023年6月2日 03:25:52go评论62阅读模式

英文:

Index of .iloc API in Pandas

问题

如上面info()结果所示，我的数据集DF中有11列，索引从0到10。现在，我想提取前10列（即索引从0到9的列）。然而，当我尝试使用下面的代码：

DF.iloc[:, 0:9]

它只返回前9列（从CompPrice到Urban）。

在这种情况下，我需要将我的代码更改为：

DF.iloc[:, 0:10]

才能获得我真正想要的结果（即从CompPrice到US）。

我对iloc()的索引感到困惑。为什么它需要'10'而不是'9'，但起始索引却是'0'。起始和结束索引不一致。

英文:

&lt;class &#39;pandas.core.frame.DataFrame&#39;&gt;
RangeIndex: 400 entries, 0 to 399
Data columns (total 11 columns):
 #   Column       Non-Null Count  Dtype 
---  ------       --------------  ----- 
 0   CompPrice    400 non-null    int64 
 1   Income       400 non-null    int64 
 2   Advertising  400 non-null    int64 
 3   Population   400 non-null    int64 
 4   Price        400 non-null    int64 
 5   ShelveLoc    400 non-null    object
 6   Age          400 non-null    int64 
 7   Education    400 non-null    int64 
 8   Urban        400 non-null    object
 9   US           400 non-null    object
 10  HighSales    400 non-null    object
dtypes: int64(7), object(4)
memory usage: 34.5+ KB

As shown in the info() result above, there are 11 columns indexed from 0 to 10 in my dataset, DF. Now, I would like to extract only the first 10 columns (that are the columns with the indices 0 to 9). However, when I try to use the code below:

DF.iloc[:, 0:9]

It returns only the first 9 columns (that is, from CompPrice to Urban).

In this case, I need to change my code to:

DF.iloc[:, 0:10]

to get what I actually want (that is, from CompPrice to US).

I'm really confused by iloc() indices. Why it requires '10' instead '9' but starts with the index '0'. The starting and ending indices are not consistent.

答案1

得分: 2

短答案是因为这是 Python 索引的工作方式。Pandas 使用 iloc 进行行选择与 Python 索引保持一致。考虑以下列表：

lst = ['a', 'b', 'c', 'd', 'e', 'f']

lst[0:1] 返回索引 0 到 1-1：['a']
lst[0:2] 返回索引 0 到 2-1：['a', 'b']
lst[0:3] 返回索引 0 到 3-1：['a', 'b', 'c']
[0:n] 总是返回索引 0 到 n-1。

Pandas 行为方式相同。

英文:

The short answer is because this is how Python indexing works. Pandas row selection with iloc is consistent with Python indexing. Consider the list:

lst = [&#39;a&#39;, &#39;b&#39;, &#39;c&#39;, &#39;d&#39;, &#39;e&#39;, &#39;f&#39;]

lst[0:1] returns index 0 to 1-1: ['a']
lst[0:2] returns index 0 to 2-1: ['a', 'b']
lst[0:3] returns index 0 to 3-1: ['a', 'b', 'c']
[0:n] always returns index 0 to n-1.

Pandas behaves the same way.

答案2

得分: 1

你所观察到的是pandas的标准功能。如果您查看文档，可以找到定义。这是有意的和合乎逻辑的，因为Python列表的功能方式相同。根据文档：

>.iloc 主要基于整数位置（从轴的0到长度-1），但也可以与布尔数组一起使用。如果请求的索引器超出边界，.iloc 将引发 IndexError，但切片索引器允许超出边界的索引（这符合Python/NumPy切片语义）。

英文:

What you are observing is the standard functionality of pandas. If you look in the documentation, you can find the definition. This is intended and logical, as Python lists function the same way. As per the docs:

>.iloc is primarily integer position based (from 0 to length-1 of the axis), but may also be used with a boolean array. .iloc will raise IndexError if a requested indexer is out-of-bounds, except slice indexers which allow out-of-bounds indexing. (this conforms with Python/NumPy slice semantics).

答案3

得分: 1

按照惯例，索引范围为 [start, stop)，意味着起始索引被包括在内，但停止索引被排除在外。

range(1, 10) # 返回 1, 2, 3, 4, 5, 6, 7, 8, 9，但不包括 10

英文:

By convention, index ranges are [start, stop) meaning that the start index is included but the stop index is excluded.

range(1, 10) # returns 1, 2, 3, 4, 5, 6, 7, 8, 9 but not 10

通过集体智慧和协作来改善编程学习和解决问题的方式。致力于成为全球开发者共同参与的知识库，让每个人都能够通过互相帮助和分享经验来进步。

Pandas中.iloc API的索引

问题

答案1

答案2

答案3

在Matplotlib绘图上添加一个圆圈到特定日期。

Re: checking if triplets of integers occur in a list – why does my code give the wrong answer for one specific test case?

过滤最接近目标值的数字并消除重复观察。

我的Python导入不按教程所述工作。

What's the correct way to type hint an empty list as a literal in python?

如何在Highcharts Gantt中更改本地化的星期名称

如何在同一个流中使用多个过滤器和映射函数？

如何使用Map/Set来将代码优化到O(n)？

.NET MAUI Android在GitHub Actions上构建失败，错误代码为1。

如何在Playwright视觉比较中屏蔽多个定位器？

在C++中，可以使用可变模板参数来检索类型的内部类型。

selenium.common.exceptions.StaleElementReferenceException: Message: stale element reference: stale element not found

Creating and opening a URL to log in to Website via Basic Auth with Robot Framework/Selenium (Python)

AG Grid 在上下文菜单中以大文本形式打开

发表评论