Question

As stated in the title, I am looking for the safest Java regex to remove style attributes from HTML tags in text intended for a Jasper text field marked up as HTML, without touching the content or breaking the consistency of the tags. For example, for the following input received from the front end:

<p>This text contains <sub style="background-color:powderblue;">subscript</sub> text.</p>

Sorry for the unescaped quotes. I found that this code works fine:

String output = input.replaceAll("style=\"[^>]*\"","");

The output should then be:

<p>This text contains <sub>subscript</sub> text.</p>

Answer 1

Score: 1

First off, a regex does not remove anything by itself. It is purely a pattern that describes which text to match; the removal is done by the method that applies it.

Apart from that, this code using replaceAll should do the trick:

String output = input.replaceAll(
        "(<[^>]+?)\\s+style\\s*=\\s*['\"][^'\"]*['\"](.*?>)", "$1$2");
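
To see how the two patterns differ, here is a minimal sketch that runs both against the sample input from the question. The class name StyleAttributeStripper and the method stripStyles are made up for illustration; the regex itself is the one shown above, only precompiled with Pattern.compile.

```java
import java.util.regex.Pattern;

public class StyleAttributeStripper {

    // Same pattern as in the answer, compiled once instead of on every call.
    // Group 1: start of the tag up to the whitespace before "style".
    // Group 2: whatever follows the quoted value, up to the closing '>'.
    private static final Pattern STYLE_ATTR = Pattern.compile(
            "(<[^>]+?)\\s+style\\s*=\\s*['\"][^'\"]*['\"](.*?>)");

    // Hypothetical helper: drops a quoted style attribute from each matching tag.
    public static String stripStyles(String html) {
        return STYLE_ATTR.matcher(html).replaceAll("$1$2");
    }

    public static void main(String[] args) {
        String input = "<p>This text contains "
                + "<sub style=\"background-color:powderblue;\">subscript</sub> text.</p>";

        // Pattern from the answer: the whitespace before "style" is consumed too.
        System.out.println(stripStyles(input));
        // prints <p>This text contains <sub>subscript</sub> text.</p>

        // Pattern from the question: the space before the attribute is left behind.
        System.out.println(input.replaceAll("style=\"[^>]*\"", ""));
        // prints <p>This text contains <sub >subscript</sub> text.</p>
    }
}
```

Two limitations to keep in mind: each match consumes the rest of the tag, so a second style attribute in the same tag would survive, and a value that contains the other quote character (for example a quoted font name inside a double-quoted style) is not handled correctly by this pattern. If the input can be that irregular, an HTML parser is likely the safer choice.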

