March 31, 2023, 16:15:55

Print line containing pattern preceded by different line containing a different pattern

Question

I'm on macOS 13.3 Ventura, hence the BSD versions of grep, awk, et al.

How do I search for and print a line containing a pattern where the line MUST be preceded by a different line containing a different pattern?

The text contains lines like these (the leading capital letters are for reference only; ... stands for irrelevant characters).

  ...                   # An indeterminate number of lines.
A ... "model" = < ...
  ...                   # An indeterminate number of lines.
B ... PXS4@0 ...
  ...                   # An indeterminate number of lines.
C ... "model" = < ...
  ...                   # An indeterminate number of lines.
D ... PXS2@0 ...
  ...                   # An indeterminate number of lines.
E ... "model" = < ...
  ...                   # An indeterminate number of lines.
F ... PXS1@0 ...
  ...                   # An indeterminate number of lines.
G ... "model" = < ...
  ...                   # An indeterminate number of lines.
H ... "model" = < ...
  ...                   # An indeterminate number of lines.

ONLY lines with "model" that are preceded by a line with PXS[[:digit:]]@0 should appear:

C ... "model" = < ...
E ... "model" = < ...
G ... "model" = < ...

AFAICT the regex engines in macOS's awk and grep do not support look-behind or look-ahead.

I thought this would find a match of PXS... and then find/print model... but it prints line "A":

awk '/(PXS[[:digit:]]@0 )+?model" = </ { print }'

This also comes close but prints line "A". Since it prints "A" I don't understand why it doesn't also print "H".

grep -e ".*PXS[[:digit:]]@0 " -e ".*model" = <"" | grep -v -e ".*PXS[[:digit:]]@0 "

Enlighten me please!

Answer 1

Score: 0

macOS

perl -n0e 'print $_ =~ /PXS[[:digit:]]@0.*\n.*\n/g' filename | perl -p -e 's/PXS[[:digit:]]@0[^\n]*\n//g'

First step: keep only the lines containing PXS[[:digit:]]@0 and the line that follows each of them.
Second step: remove the lines containing PXS[[:digit:]]@0.

Linux

grep -zoP 'PXS[[:digit:]]@0.*\n.*\n' filename | sed -z -E 's/PXS[[:digit:]]@0[^\n]*\n//g'

Grep to find lines with PXS[[:digit:]]@0 and the next lines, sed to remove lines containing PXS[[:digit:]]@0 from output.

Answer 2

Score: 0

You can try something like:

awk '/PXS[0-9][@]0/{getline;if(match($0,"model")){ print;}}'

/PXS[0-9][@]0/ will match the prefix line

getline; will read the next line (and populate $0)

match($0,"model") will see if that line contains the regexp 'model'
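
As a rough sketch of how the getline approach behaves, the example below runs a slightly more defensive variant on made-up input. The sample text and the `sample`/`result` variable names are assumptions for illustration only. It checks getline's return value so that a PXS line at the very end of the input is not tested against itself, and it shows that only the line immediately after a PXS line is ever inspected.

```shell
# Hypothetical input: C directly follows a PXS line, E does not.
sample='A ... "model" = < ...
B ... PXS4@0 ...
C ... "model" = < ...
D ... PXS2@0 ...
x ...
E ... "model" = < ...'

result=$(printf '%s\n' "$sample" | awk '
  /PXS[0-9]@0/ {
    # getline returns 1 on success, 0 at end of input, -1 on error
    if ((getline) > 0 && /model/) print
  }
')
printf '%s\n' "$result"
```

On this input only line C is printed. Line E is skipped because an unrelated line sits between it and the preceding PXS line, so this form fits only when the "model" line is known to follow directly.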

Answer 3

Score: 0

Thanks for steering me in the right direction. This gives me lines C, E, and G.

I used a loop in awk to find the first line with PXS[[:digit:]]@0 and a sub loop to find the second with "model" = <. The file is deterministic: if the first line is present, the second will be (but not directly after the first).

I also set awk's field separators, since the final value I want is after "model" = <" and before ">.

awk -F'<"|">' 'BEGIN {while ((getline) > 0) if ($0 ~ /PXS[[:digit:]]@0 /) {while ((getline) > 0) if ($0 ~ /"model" = </){print $2; break;}}}'

I like having the whole thing in one awk command; my other solution required 5 piped greps.
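
For comparison, the same idea can probably be written with ordinary pattern-action rules and a flag instead of explicit getline loops in BEGIN. This is only a sketch: the sample lines, the quoted values, and the `seen` flag name are made up for illustration, and the trailing space after @0 in the pattern is dropped so the short sample lines match.

```shell
# Hypothetical input; the quoted values are placeholders.
sample='A "model" = <"ignored">
B PXS4@0
x ...
C "model" = <"first">
D PXS2@0
E "model" = <"second">
H "model" = <"also ignored">'

# With -F set to the regex <"|"> the text between <" and "> becomes $2.
result=$(printf '%s\n' "$sample" | awk -F'<"|">' '
  /PXS[[:digit:]]@0/       { seen = 1; next }      # a PXS line arms the flag
  seen && /"model" = </    { print $2; seen = 0 }  # first model line after it
')
printf '%s\n' "$result"
```

On this input it prints first and second. Lines A and H are ignored because no PXS line has been seen since the start of the input or since the last printed "model" line.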

Answer 4

Score: 0

$ awk '/model/ && match(prevline,/PXS[0-9]@0/){print} {prevline=$0}' file
C ... "model" = < ...
E ... "model" = < ...
G ... "model" = < ...
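
Note that prevline only holds the line immediately before the current one. If an indeterminate number of unrelated lines can sit between the PXS line and the "model" line, as the question describes, a flag that stays set until the next "model" line may be a better fit. The following is a minimal sketch under that assumption; the sample text and the `seen` variable are hypothetical.

```shell
# Hypothetical sample; the "x ..." lines stand in for unrelated lines.
sample='x ...
A ... "model" = < ...
x ...
B ... PXS4@0 ...
x ...
C ... "model" = < ...
x ...
D ... PXS2@0 ...
x ...
E ... "model" = < ...
F ... PXS1@0 ...
x ...
G ... "model" = < ...
x ...
H ... "model" = < ...
x ...'

result=$(printf '%s\n' "$sample" | awk '
  /PXS[0-9]@0/           { seen = 1; next }     # remember that a PXS line was seen
  /"model" = </ && seen  { print; seen = 0 }    # print the next model line, then reset
')
printf '%s\n' "$result"
```

On this sample it prints lines C, E, and G. Line A is skipped because no PXS line precedes it, and H is skipped because the flag was reset when G was printed.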
