2020年1月7日 00:19:12go评论119阅读模式

英文:

New column based on a filter and an index of multiples columns?

问题

I understand your request. Here's the translated code portion for your first scenario:

我明白你的要求。以下是你的第一个情景的翻译代码部分：

```python
for each row : 
    if (df['value type'] == 'value train'):
        #and (type,company) is the same
        df['train value'] = df['value']
        remove row

And here's the translated code portion for your second scenario:

以下是你的第二个情景的翻译代码部分：

```python
if df['value time'] == 'present' then add to new column

英文:

I've been trying to search/think about an answer, probably with a melt or stack, but still can't seem to do it.

Here's my DF :

d = {&#39;type&#39; : [1, 2, 3, 4, 5, 1, 2, 3, 4, 5],
 &#39;company&#39; : [&#39;A&#39;, &#39;B&#39;, &#39;C&#39;, &#39;D&#39;, &#39;E&#39;,&#39;A&#39;, &#39;B&#39;, &#39;C&#39;, &#39;D&#39;, &#39;E&#39;],
 &#39;value type&#39;: [&#39;value car&#39;,&#39;value car&#39;,&#39;value car&#39;,&#39;value car&#39;,&#39;value car&#39;, &#39;value train&#39;,&#39;value train&#39;,&#39;value train&#39;,&#39;value train&#39;,&#39;value train&#39;,],
 &#39;value&#39;: [0.1, 0.2, 0.3, 0.4, 0.5, 0.15, 0.25, 0.35, 0.45, 0.55] }

df = pd.DataFrame(d)

Here is what I want (I have the array on the left, I want the one on the right):

As you can see, I want a new column "train value" based on the combination (type,company)

Something like

for each row : 
    if (df[&#39;value type&#39;] == &#39;value train&#39;):
        #and (type,company) is the same
        df[&#39;train value&#39;] = df[&#39;value&#39;]
        remove row

For example, the company A from type 1 will have a new value in a new column for its train value.
Is there a way to do this properly ?

EDIT::: There was a good answer but I didn't explain myself clearly. I want only a new column with only "one value type". For example my new DF :

d = {&#39;type&#39; : [1, 2, 3, 4, 5, 1, 2, 3, 4, 5],
 &#39;company&#39; : [&#39;A&#39;, &#39;B&#39;, &#39;C&#39;, &#39;D&#39;, &#39;E&#39;,&#39;A&#39;, &#39;B&#39;, &#39;C&#39;, &#39;D&#39;, &#39;E&#39;],
 &#39;month&#39; : [&#39;jan&#39;, &#39;feb&#39;, &#39;marc&#39;, &#39;apr&#39;, &#39;may&#39;, &#39;jan&#39;, &#39;feb&#39;, &#39;marc&#39;, &#39;apr&#39;, &#39;sep&#39;],
 &#39;business&#39; : [&#39;business1&#39;, &#39;business2&#39;, &#39;business3&#39;, &#39;business4&#39;, &#39;business5&#39;, &#39;business6&#39;, &#39;business7&#39;, &#39;business8&#39;, &#39;business9&#39;, &#39;business10&#39;], 
 &#39;value time&#39;: [&#39;past&#39;, &#39;past&#39;, &#39;past&#39;, &#39;past&#39;, &#39;present&#39;, &#39;present&#39;, &#39;present&#39;, &#39;present&#39;, &#39;future&#39;, &#39;future&#39;],
 &#39;value&#39;: [0.1, 0.2, 0.3, 0.4, 0.11, 0.21, 0.31, 0.41, 0.45, 0.55] }

df = pd.DataFrame(d)

Heres what I want this time :

If possible, only the values with the "present" will be in the new column. Something like

if df[&#39;value time&#39;] == &#39;present&#39; then add to new column

答案1

得分: 2

你应该对你的数据框进行重塑：

company_to_type = df.set_index('company')['type'].to_dict()
df = df.pivot(index='company', columns='value type', values='value').reset_index()
df['type'] = df.company.map(company_to_type)
df = df.rename_axis(None, axis=1)
df = df[['type', 'company', 'value train', 'value car']]

你将得到：

   type company  value train  value car
0     1       A         0.15        0.1
1     2       B         0.25        0.2
2     3       C         0.35        0.3
3     4       D         0.45        0.4
4     5       E         0.55        0.5

英文:

You should pivot your dataframe:

company_to_type = df.set_index(&#39;company&#39;)[&#39;type&#39;].to_dict()
df = df.pivot(index=&#39;company&#39;, columns=&#39;value type&#39;, values=&#39;value&#39;).reset_index()
df[&#39;type&#39;] = df.company.map(company_to_type)
df = df.rename_axis(None, axis=1)
df = df[[&#39;type&#39;, &#39;company&#39;, &#39;value train&#39;, &#39;value car&#39;]]

and you'll get

   type company  value train  value car
0     1       A         0.15        0.1
1     2       B         0.25        0.2
2     3       C         0.35        0.3
3     4       D         0.45        0.4
4     5       E         0.55        0.5

通过集体智慧和协作来改善编程学习和解决问题的方式。致力于成为全球开发者共同参与的知识库，让每个人都能够通过互相帮助和分享经验来进步。

新列基于筛选和多列的索引？

问题

答案1

删除字符串开头的关键词，但不删除字符串中的所有关键词。

最好的方法是如何迭代每一行，针对以下情况？

可以在创建新数据集时使用if else函数吗？

I want to broadcast an pytorcc tensor of dimension (a,b,c) onto an array of dimension (b,c) to get an output of dimension (a,c) how do I do this?

What's the correct way to type hint an empty list as a literal in python?

如何在Highcharts Gantt中更改本地化的星期名称

如何在同一个流中使用多个过滤器和映射函数？

如何使用Map/Set来将代码优化到O(n)？

.NET MAUI Android在GitHub Actions上构建失败，错误代码为1。

如何在Playwright视觉比较中屏蔽多个定位器？

在C++中，可以使用可变模板参数来检索类型的内部类型。

selenium.common.exceptions.StaleElementReferenceException: Message: stale element reference: stale element not found

Creating and opening a URL to log in to Website via Basic Auth with Robot Framework/Selenium (Python)

AG Grid 在上下文菜单中以大文本形式打开

发表评论