2023年1月9日 17:18:30go评论113阅读模式

英文:

Creating new column with values that depends on other columns

问题

以下是您的代码的翻译部分：

这是我的数据框架
import pandas as pd
url = 'https://www.basketball-reference.com/boxscores/pbp/200911060GSW.html'
dfs = pd.read_html(url)
df = dfs[0]
df.columns = df.columns.droplevel()  # 删除数据框架的“1st Q”多级标题
df.rename(columns={'Unnamed: 2_level_1': 'PM1', 'Unnamed: 4_level_1': 'PM2'}, inplace=True)
然后，我对库里的数据进行了子集化，因为我关注他的动作。
df_curry = df.loc[df["Golden State"].str.contains("Curry", na=False)]
df_curry`
现在我尝试将投篮命中和未命中的投篮添加到一个新列中，以便稍后计算命中率，但我总是得到错误消息“str' object has no attribute 'str'”。也许有人可以帮助我或给我另一种方法。
# 计算命中率
field_throws_missed = 0
field_throws_hit = 0`
# 创建新列
df_curry["Field Goals Hit"] = 0
df_curry["Field Goals Missed"] = 0
df_curry["Field Goals Percentage"] = 0`
for row in range(len(df_curry["Golden State"])):
  if df_curry.iloc[row]["Golden State"].str.contains("misses 2|misses 3"): 
    field_throws_missed += 1
    df_curry.iloc[row]["Field Goals Missed"] = field_throws_missed
  elif df_curry.iloc[row]["Golden State"].str.contains("makes 2|makes 3"): 
    field_throws_hit += 1
    df_curry.iloc[row]["Field Goals Hit"] = field_throws_hit`

希望这能帮助您理解您的代码。如果您有其他问题，请随时提出。

英文:

This is my dataframe

import pandas as pd
url = &#39;https://www.basketball-reference.com/boxscores/pbp/200911060GSW.html&#39;
dfs = pd.read_html(url)
df = dfs[0] 
df.columns = df.columns.droplevel() # drops the &quot;1st Q&quot; Multilevel header of the dataframe
df.rename(columns={&#39;Unnamed: 2_level_1&#39;: &#39;PM1&#39;, &#39;Unnamed: 4_level_1&#39;: &#39;PM2&#39;}, inplace=True)

then i have made a subset of curry because I focus on his actions.

df_curry = df.loc[df[&quot;Golden State&quot;].str.contains(&quot;Curry&quot;, na=False)]
df_curry`

now i tried to insert the hit and not hit throws into a new column to calculate the quote later but i always get the error "str' object has no attribute 'str'.
Maybe someone can help me or give me another approach

# Calculating Hit Rate
field_throws_missed = 0
field_throws_hit = 0`
# Creating the new Columns
df_curry[&quot;Field Goals Hit&quot;] = 0
df_curry[&quot;Field Goals Missed&quot;] = 0
df_curry[&quot;Field Goals Percentage&quot;] = 0`
for row in range(len(df_curry[&quot;Golden State&quot;])):
  if df_curry.iloc[row][&quot;Golden State&quot;].str.contains(&quot;misses 2|misses 3&quot;): 
    field_throws_missed += 1
    df_curry.iloc[row][&quot;Field Goals Missed&quot;] = field_throws_missed
  elif df_curry.iloc[row][&quot;Golden State&quot;].str.contains(&quot;makes 2|makes 3&quot;): 
    field_throws_hit += 1
    df_curry.iloc[row][&quot;Field Goals Hit&quot;] = field_throws_hit`

答案1

得分: 0

不需要循环，要计算True值的数量，请使用Series.cumsum进行累积求和：

df_curry = df.loc[df["Golden State"].str.contains("Curry", na=False)].copy()
df_curry["Field Goals Hit"] = df_curry["Golden State"].str.contains("misses 2|misses 3").cumsum()
df_curry["Field Goals Missed"] = df_curry["Golden State"].str.contains("makes 2|makes 3").cumsum()

编辑：如果需要在下一行添加1，请使用以下代码：

df_curry["Field Goals Hit"] = df_curry["Golden State"].str.contains("misses 2|misses 3").shift(fill_value=0).cumsum()
df_curry["Field Goals Missed"] = df_curry["Golden State"].str.contains("makes 2|makes 3").shift(fill_value=0).cumsum()

英文:

No loops necessary here, for count Trues values use cumulative sum by Series.cumsum:

df_curry = df.loc[df[&quot;Golden State&quot;].str.contains(&quot;Curry&quot;, na=False)].copy()
df_curry[&quot;Field Goals Hit&quot;] = df_curry[&quot;Golden State&quot;].str.contains(&quot;misses 2|misses 3&quot;).cumsum()
df_curry[&quot;Field Goals Missed&quot;] = df_curry[&quot;Golden State&quot;].str.contains(&quot;makes 2|makes 3&quot;).cumsum()

EDIT: If need add 1 in next row use:

df_curry[&quot;Field Goals Hit&quot;] = df_curry[&quot;Golden State&quot;].str.contains(&quot;misses 2|misses 3&quot;).shift(fill_value=0).cumsum()
df_curry[&quot;Field Goals Missed&quot;] = df_curry[&quot;Golden State&quot;].str.contains(&quot;makes 2|makes 3&quot;).shift(fill_value=0).cumsum()

通过集体智慧和协作来改善编程学习和解决问题的方式。致力于成为全球开发者共同参与的知识库，让每个人都能够通过互相帮助和分享经验来进步。

创建一个新列，其值取决于其他列。

问题

答案1

关于通过套接字发送NumPy数组存在一些问题。

从Python项目中加载数据存储实体到Go语言会导致嵌套的结构体切片错误。

操作包含JSON字符串字典的JSON文件

TypeError: 对于 ‘pygame.time.Clock’ 对象，’tick’ 描述符不适用于 ‘int’ 对象

如何在Playwright视觉比较中屏蔽多个定位器？

在C++中，可以使用可变模板参数来检索类型的内部类型。

selenium.common.exceptions.StaleElementReferenceException: Message: stale element reference: stale element not found

Creating and opening a URL to log in to Website via Basic Auth with Robot Framework/Selenium (Python)

AG Grid 在上下文菜单中以大文本形式打开

What's the correct way to type hint an empty list as a literal in python?

如何在Highcharts Gantt中更改本地化的星期名称

如何在同一个流中使用多个过滤器和映射函数？

如何使用Map/Set来将代码优化到O(n)？

.NET MAUI Android在GitHub Actions上构建失败，错误代码为1。