2023-03-04 00:17:12

Problems using preg_replace

Question

Regex is not my thing.

I have a large file in which I want to replace entries like the following:

<g:id><![CDATA[131614-3XL]]></g:id>

It should be replaced with:

<g:id><![CDATA[131614-3XL]]></g:id><g:id2><![CDATA[131614]]></g:id2>

Note that "-3XL" is dropped in id2, and that -3XL could be many other combinations, e.g. -4XL, -32/32, -42,5 and so on. It always starts with "-", though.

I have tried using preg_replace, but I can't figure it out.

Answer 1

Score: 0

Here is some code you can start with. You may need to adjust it depending on how much the CDATA content varies.

$str = "<g:id><![CDATA[131614-3XL]]></g:id>";
$expected = "<g:id><![CDATA[131614-3XL]]></g:id><g:id2><![CDATA[131614]]></g:id2>";

# Get the CDATA section without the g:id wrapper and without the -3XL suffix
$post_replace = preg_replace('/^<g:id>(.*?CDATA\[\d+)-[^\]]+(.*?)<\/g:id>$/', '$1$2', $str);

# Append the stripped copy, wrapped in g:id2, to the original element
$output = "$str<g:id2>$post_replace</g:id2>";

if ($output === $expected) {
    print "Success\n";
}
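
The snippet above works on one anchored string and expects a numeric ID. As a rough sketch of the kind of adjustment it hints at, the version below assumes the base ID is everything before the first hyphen (so it may contain letters too; the question only shows digits). It uses preg_match rather than preg_replace, so an element without a suffix is returned unchanged. `buildId2` is a hypothetical helper name, not part of the original answer.

```php
<?php
// Hypothetical helper: appends a <g:id2> element to a single <g:id> element.
// Assumption: the base ID is everything before the first hyphen in the CDATA.
function buildId2(string $element): string
{
    // $m[1] captures the base ID; the hyphen and variant suffix are matched but not kept.
    $pattern = '/^<g:id><!\[CDATA\[([^\]-]+)-[^\]]+\]\]><\/g:id>$/';

    if (preg_match($pattern, $element, $m) !== 1) {
        // No "-suffix" found: leave the element untouched.
        return $element;
    }

    return $element . '<g:id2><![CDATA[' . $m[1] . ']]></g:id2>';
}

echo buildId2('<g:id><![CDATA[131614-32/32]]></g:id>'), "\n";
```

Because the pattern is anchored with ^ and $, this handles one element per call; it would need to be applied per line, or the anchors dropped, to process a whole file.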

Answer 2

Score: 0

Use a capture group to grab the number in the CDATA so you can copy it into the replacement without the -XXX suffix after it.

$result = preg_replace('#<g:id><!\[CDATA\[(\d+)-[^]]+\]\]></g:id>#', '$0<g:id2><![CDATA[$1]]></g:id2>', $string);

$0 is the entire match, $1 is the number in the CDATA.
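
To see how this behaves with the other suffix forms mentioned in the question, here is a small sketch that runs the same pattern over a three-line sample. The IDs 131615 and 131616 are made-up sample data; only 131614-3XL comes from the question.

```php
<?php
// Sample feed fragment using the suffix variants from the question.
// The second and third IDs are invented for illustration.
$string = <<<'XML'
<g:id><![CDATA[131614-3XL]]></g:id>
<g:id><![CDATA[131615-32/32]]></g:id>
<g:id><![CDATA[131616-42,5]]></g:id>
XML;

// $0 re-emits the whole matched <g:id> element; $1 is the numeric part before the hyphen.
$result = preg_replace(
    '#<g:id><!\[CDATA\[(\d+)-[^]]+\]\]></g:id>#',
    '$0<g:id2><![CDATA[$1]]></g:id2>',
    $string
);

echo $result, "\n";
```

preg_replace replaces every match in the subject, so the contents of the whole file can be passed as $string in one call.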
