2023年4月17日 17:01:01go评论88阅读模式

英文:

Removing duplicate XML markup with awk

问题

寻找替换两行中的重复实例，例如：

    &lt;\section&gt;
         &lt;\section&gt;

用单个 </section> 条目替换。

输入文件中的空格数量可能有所不同。

如果可以使用sed完成，那就更好。但也许我需要使用awk。

英文:

Looking to replace duplicate instances over two lines such as:

&lt;\section&gt;
     &lt;\section&gt;

with a single </section> entry.

Amount of white space in input file may vary.

If this can be done with sed, all the better. But maybe I need to use awk.

答案1

得分: 1

使用GNU sed 的 -E、-z 和 \s：

$ sed -Ez 's:(&lt;\\section&gt;)\s*\n\s*:&lt;/section&gt;:g' 文件
&lt;/section&gt;

如果不希望在两个 <\section> 之间有多个空行或空白行，将每个 \s 替换为 [[:blank:]]。它还会一次性将整个输入读入内存。

英文:

Using GNU sed for -E, -z and \s:

$ sed -Ez &#39;s:(&lt;\\section&gt;)\s*\n\s*:&lt;/section&gt;:g&#39; file
&lt;/section&gt;

That would allow multiple empty lines or lines of blanks between 2 occurrences of <\section>, if that's undesirable then replace each \s with [[:blank:]]. It will also read the whole of the input into memory at once.

答案2

得分: 0

像这样可能会起作用（GNU sed）：

sed -Ez 's:(&lt;\\section&gt;)[[:space:]]+:&lt;/section&gt;:'

英文:

Something like this might work (GNU sed):

sed -Ez &#39;s:(&lt;\\section&gt;)[[:space:]]+:&lt;/section&gt;:&#39;

答案3

得分: 0

这可能适用于您（GNU sed）：

sed -E 'N;s/(<\\section>)\s*\n\s*/<\/section>/;P;D' file

打开一个两行窗口，并使用模式匹配替换所需的字符串。

英文:

This might work for you (GNU sed):

sed -E &#39;N;s/(&lt;\\section&gt;)\s*\n\s*/&lt;\/section&gt;/;P;D&#39; file

Open a two line window and using pattern matching substitute the required string.

通过集体智慧和协作来改善编程学习和解决问题的方式。致力于成为全球开发者共同参与的知识库，让每个人都能够通过互相帮助和分享经验来进步。

Removing duplicate XML markup with awk

问题

答案1

答案2

答案3

如何使用sed内联正确地将~替换为$HOME？

合并多个sed命令效果不佳。

你可以使用bash中的grep、awk和/或sed来过滤文本文件中的多行模式吗？

Conditional removing of text using sed

如何在Playwright视觉比较中屏蔽多个定位器？

在C++中，可以使用可变模板参数来检索类型的内部类型。

selenium.common.exceptions.StaleElementReferenceException: Message: stale element reference: stale element not found

Creating and opening a URL to log in to Website via Basic Auth with Robot Framework/Selenium (Python)

AG Grid 在上下文菜单中以大文本形式打开

What's the correct way to type hint an empty list as a literal in python?

如何在Highcharts Gantt中更改本地化的星期名称

如何在同一个流中使用多个过滤器和映射函数？

如何使用Map/Set来将代码优化到O(n)？

.NET MAUI Android在GitHub Actions上构建失败，错误代码为1。