When running a For loop in Big Query the sql_expression_list is not being evaluated the same as if it was run alone

huangapple go评论61阅读模式
英文:

When running a For loop in Big Query the sql_expression_list is not being evaluated the same as if it was run alone

问题

以下是翻译好的部分:

我正在运行一个 For 循环,从数据集的信息模式中收集一个 table_expression,其中包含表中 ARRAY<STRING> 类型的列的名称。我能够按预期生成这个 table_expression。从这里,我尝试循环遍历这些列并评估每列的内容。所有这些列都包含一个字符串数组,其中"empty"项目具有一个单个空格。

当我运行这个查询时,我收到以下错误:

> 查询错误:函数 ARRAY_LENGTH 的参数类型不匹配:STRING。支持的签名:ARRAY_LENGTH(ARRAY)

如果我将 DO 后的查询孤立出来,查询将正常运行,如预期一样处理。

我曾运行过类似的循环以生成列上的空值计数,并取得了成功,因此似乎是评估数组时出现了问题,但我不确定是什么原因导致的。

我尝试以相同的方式运行相同的查询,将 field.field_path 视为字符串值,但我没有收到与单独运行 DO 查询时相同的结果,我认为它可能将 field.field_path 评估为字符串值而不是变量。
英文:

I am running a For loop to gather a table_expression from a dataset's information schema that contains the names of columns that are of ARRAY<STRING> type in a table. I am able to generate this table_expression as intended. From here I am trying to loop through the columns and evaluate each column's contents. The columns all contain an Array of Strings with "empty" items having a single whitespace.

 FOR field IN (
  SELECT
    field_path, data_type
  FROM
    `project.dataset.INFORMATION_SCHEMA.COLUMN_FIELD_PATHS`
  WHERE
    table_name = &#39;table_1&#39;
    AND data_type LIKE &#39;ARRAY%&#39; ) DO
  SELECT
    ARRAY_LENGTH(field.field_path) AS s3,
  IF
    (REGEXP_CONTAINS(ARRAY_TO_STRING(field.field_path, &#39;,&#39;),r&#39;.*?\,\s\,.*?&#39;),1,0) AS b1,
  IF
    (REGEXP_CONTAINS(ARRAY_TO_STRING(field.field_path, &#39;,&#39;),r&#39;^\s\,.*?&#39;),1,0) AS b2,
  IF
    (REGEXP_CONTAINS(ARRAY_TO_STRING(field.field_path, &#39;,&#39;),r&#39;.*?\,\s$&#39;),1,0) AS b3
  FROM
    `project.dataset.table_1`);
END
  FOR;

When I run this query I receive the error:

> Query error: No matching signature for function ARRAY_LENGTH for
> argument types: STRING. Supported signature: ARRAY_LENGTH(ARRAY)

If I isolate the query after the DO, the query runs without this error and processes as intended.

I have run a similar loop to produce null counts on columns and had success, so it seems to be an issue with evaluating Arrays but I am unsure of what is causing it.

I have tried running the same query, treating field.field_path as a String value, but I do not receive the same results as when I run the DO query stand-alone and I believe it might be evaluating the field.field_path as a string value and not a variable.

答案1

得分: 0

我认为它可能会将field.field_path评估为字符串值而不是变量。

由于field是一个变量,它不能作为列名使用。相反,您可以考虑像下面这样的动态SQL。

例如,

FOR field IN (
  SELECT
    field_path, data_type
  FROM
    `project.dataset.INFORMATION_SCHEMA.COLUMN_FIELD_PATHS`
  WHERE
    table_name = 'table_1'
    AND data_type LIKE 'ARRAY%'
) DO
EXECUTE IMMEDIATE FORMAT('
  SELECT
    ARRAY_LENGTH(%s) AS s3
  FROM
    `project.dataset.table_1`
', field.field_path);
END FOR;
英文:

>I believe it might be evaluating the field.field_path as a string value and not a variable.

Since field is a variable, it can't be used as a column name. Instead, you can consider a dynamic SQL like below.

For example,

FOR field IN (
  SELECT
    field_path, data_type
  FROM
    `project.dataset.INFORMATION_SCHEMA.COLUMN_FIELD_PATHS`
  WHERE
    table_name = &#39;table_1&#39;
    AND data_type LIKE &#39;ARRAY%&#39; 
) DO
EXECUTE IMMEDIATE FORMAT(&quot;&quot;&quot;
  SELECT
    ARRAY_LENGTH(%s) AS s3
  FROM
    `project.dataset.table_1`
&quot;&quot;&quot;, field.field_path);
END FOR;

</details>



huangapple
  • 本文由 发表于 2023年3月4日 05:19:36
  • 转载请务必保留本文链接:https://go.coder-hub.com/75631968.html
匿名

发表评论

匿名网友

:?: :razz: :sad: :evil: :!: :smile: :oops: :grin: :eek: :shock: :???: :cool: :lol: :mad: :twisted: :roll: :wink: :idea: :arrow: :neutral: :cry: :mrgreen:

确定