2023年8月4日 21:33:25go评论111阅读模式

英文:

How to print the result of current_date() in PySpark?

问题

这是在Python中非常简单的，但我目前正在学习在Databricks中使用PySpark。

我只想看看在PySpark中current_date()返回什么。

我尝试过的内容：

from pyspark.sql import functions as fn

print(fn.current_date())
# 结果：Column&lt;&#39;current_date()&#39;&gt;

fn.current_date()
# 结果：Out[35]: Column&lt;&#39;current_date()&#39;&gt;

fn.first(fn.current_date())
# 结果：Out[36]: Column&lt;&#39;first(current_date())&#39;&gt;

fn.current_date()[0]
# 结果：Out[37]: Column&lt;&#39;current_date()[0]&#39;&gt;

display(fn.current_date())
# 结果：Column&lt;&#39;current_date()&#39;&gt;

这是否完全不可能？

英文:

This is very simple in python, but I am currently learning PySpark in Databricks.

I just want to see what is returned by current_date() in PySpark.

What I have tried:

from pyspark.sql import functions as fn

print(fn.current_date())
# Result: Column&lt;&#39;current_date()&#39;&gt;

fn.current_date()
# Result: Out[35]: Column&lt;&#39;current_date()&#39;&gt;

fn.first(fn.current_date())
# Result: Out[36]: Column&lt;&#39;first(current_date())&#39;&gt;

fn.current_date()[0]
# Result: Out[37]: Column&lt;&#39;current_date()[0]&#39;&gt;

display(fn.current_date())
# Result: Column&lt;&#39;current_date()&#39;&gt;

Is it just not possible?

答案1

得分: 1

你可以使用 spark.sql() 来处理这个情况。

示例：

print(spark.sql("select string(current_date())").collect()[0][0])
#2023-08-04

英文:

You can use spark.sql() for this case.

Example:

print(spark.sql(&quot;select string(current_date())&quot;).collect()[0][0])
#2023-08-04

答案2

得分: 0

在Spark中，列表达式（例如current_date()）在将它们放入数据框作为列并要求显示数据框之前不会显示结果。

考虑以下示例：

spark.range(1) - 创建一个数据框
.select(F.current_date()) - 选择使用函数current_date创建的列
.show() - 打印数据框

from pyspark.sql import functions as F

spark.range(1).select(F.current_date()).show()
# +--------------+
# |current_date()|
# +--------------+
# |    2023-08-04|
# +--------------+

spark.sql("select current_date()") - 使用SQL表达式创建数据框和列
.show() - 打印数据框

spark.sql("select current_date()").show()
# +--------------+
# |current_date()|
# +--------------+
# |    2023-08-04|
# +--------------+

.head() - 访问数据框的第一行（作为pyspark.sql.types.Row对象）
[0] - 访问行的第一个元素（"列"）

spark.sql("select current_date()").head()[0]
# datetime.date(2023, 8, 4)

在Databricks中，display(df) 也应该有效，但您必须创建df，例如：

display(spark.sql("select current_date()"))

英文:

In Spark, column expressions (e.g. current_date()) do not show results until they are put into dataframes as columns and then the dataframe is instructed to be shown.

Consider the following examples:

spark.range(1) - creating a dataframe
.select(F.current_date()) - selecting a column created using function current_date
.show() - printing the dataframe

from pyspark.sql import functions as F

spark.range(1).select(F.current_date()).show()
# +--------------+
# |current_date()|
# +--------------+
# |    2023-08-04|
# +--------------+

spark.sql("select current_date()") - creating both dataframe and column using SQL expression
.show() - printing the dataframe

spark.sql(&quot;select current_date()&quot;).show()
# +--------------+
# |current_date()|
# +--------------+
# |    2023-08-04|
# +--------------+

.head() - accessing the dataframe's first row (as a pyspark.sql.types.Row object)
[0] - accessing the first element ("column") of the row

spark.sql(&quot;select current_date()&quot;).head()[0]
# datetime.date(2023, 8, 4)

In Databricks, display(df) should also work, but for this you must create the df, e.g.:

display(spark.sql(&quot;select current_date()&quot;))

通过集体智慧和协作来改善编程学习和解决问题的方式。致力于成为全球开发者共同参与的知识库，让每个人都能够通过互相帮助和分享经验来进步。

如何在PySpark中打印current_date()的结果？

问题

答案1

答案2

如何在另一个注释中使用给定日期创建和使用DateFormat？

将参数传递给使用 `spark.read.format(jdbc)` 格式的查询。

Pandas – 表格透视

Pandas 多重索引与多个条件

What's the correct way to type hint an empty list as a literal in python?

如何在Highcharts Gantt中更改本地化的星期名称

如何在同一个流中使用多个过滤器和映射函数？

如何使用Map/Set来将代码优化到O(n)？

.NET MAUI Android在GitHub Actions上构建失败，错误代码为1。

如何在Playwright视觉比较中屏蔽多个定位器？

在C++中，可以使用可变模板参数来检索类型的内部类型。

selenium.common.exceptions.StaleElementReferenceException: Message: stale element reference: stale element not found

Creating and opening a URL to log in to Website via Basic Auth with Robot Framework/Selenium (Python)

AG Grid 在上下文菜单中以大文本形式打开

发表评论