Pyspark: Adding row/column with single value of row counts

Question


I have a pyspark dataframe that I'd like to get the row count for. Once I get the row count, I'd like to add it to the top left corner of the data frame, as shown below.

I've tried creating the row first and doing a union of the empty row and the dataframe, but the empty row gets overwritten. I've also tried adding the count as a literal in a column, but I'm having trouble nulling out the remainder of that column as well as the rest of the new row. Any advice?

dataframe:

```
col1    col2    col3    ...  col13
string  string  timest  ...  int
```

for a few rows.

desired output:

```
row_count  col1    col2    col3    ...  col13
numofrows
           string  string  timest  ...  int
```

So the row count would sit where an otherwise empty row and empty column meet.
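For illustration only, a minimal sketch of the "literal in a column" attempt described above (assuming an existing SparkSession `spark` and dataframe `df`); `lit()` fills the value on every row, which is why the rest of the column and an otherwise empty top row still have to be handled separately:

```python
from pyspark.sql import functions as F

# Hypothetical illustration of the attempt: lit() puts the count on
# every existing row, not just a single new top row, so the remainder
# of the column (and the empty row) still needs to be nulled out.
df_attempt = df.withColumn("row_count", F.lit(df.count()))
df_attempt.show()
```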

Answer 1

Score: 0

Assuming `df` is your dataframe:

```python
from pyspark.sql import functions as F

# Total number of rows in the dataframe
cnt = df.count()

columns_list = df.columns

# Add a row_count column filled with nulls to the existing rows
df = df.withColumn("row_count", F.lit(None).cast("int"))
schema = df.schema

# One-row dataframe: null in every original column, the count in row_count
cnt_line = spark.createDataFrame([[None for x in columns_list] + [cnt]], schema=schema)

# Append the count row to the data
df.unionAll(cnt_line).show()
```
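
Note that this appends the count row at the bottom, with `row_count` as the last column. If the count should instead sit in the top-left corner, as in the desired output, the columns can be reordered and the union order flipped; a minimal sketch, reusing `df`, `cnt_line`, and `columns_list` from above:

```python
# Put row_count first, then place the count row on top of the data rows
reordered = ["row_count"] + columns_list

cnt_line.select(*reordered).unionByName(df.select(*reordered)).show()
```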

