2023年2月24日 02:05:09go评论106阅读模式

英文:

Replace backslash followed by double quotation in a text file in Python

问题

import re
file_path = "backslash_double_quotation.txt"
with open(file_path, "r") as input_file:
    raw_text = input_file.read()
processed_text = re.sub(r'&quot;', '', raw_text)
print(raw_text)
print(processed_text)

英文:

I have a text file, and its content is like this:

&quot;good to know it \&quot; so nice \&quot; &quot;

I use Python to read its contents and want to replace " with an empty string.

The code I am using is:

import re
file_path = &quot;backslash_double_quotation.txt&quot;
with open(file_path, &quot;r&quot;) as input_file:
    raw_text = input_file.read()
processed_text = re.sub(r&#39;\&quot;&#39;, &quot;&quot;, raw_text)
print(raw_text)
print(processed_text)

and I expect processed_text like this:

&quot;good to know it  so nice  &quot;

However, the actual output is:

good to know it \ so nice \

All the double quotations are replaced by empty strings.
How can I fix this?

答案1

得分: 1

使用字符串可以使用 .replace() 方法来替换字符串中的特定字符或单词。

例如：

text = "good to know it \" so nice \""
print(text.replace("\"", " "))

这将输出：

good to know it   so nice

对于你的代码：

import re
file_path = "backslash_double_quotation.txt"
with open(file_path, "r") as input_file:
    raw_text = input_file.read()
processed_text = raw_text.replace("\"", "")
print(raw_text)
print(processed_text)

如果你想使用 re，则可以使用以下方式：

processed_text = re.sub(r"\\\"", "", raw_text)

英文:

With strings you can use .replace() to replace specific characters or words in a string.

For example:

text = &quot;good to know it \&quot; so nice \&quot;&quot;
print(text.replace(&quot;\&quot;&quot;, &quot; &quot;))

The output for this is:

good to know it   so nice

With your code:

import re
file_path = &quot;backslash_double_quotation.txt&quot;
with open(file_path, &quot;r&quot;) as input_file:
    raw_text = input_file.read()
processed_text = raw_text.replace(&quot;\&quot;&quot;, &quot;&quot;)
print(raw_text)
print(processed_text)

If you want to use re then:

processed_text = re.sub(r&quot;\\&quot;, &quot;&quot;, raw_text)

答案2

得分: 1

你没有得到预期的结果，因为你的示例中使用了"raw-string"，即"r"。如果你添加了"r"，你应该指定你的正则表达式，而不包含任何转义字符。

只需在你的示例中移除"r"，它就会按预期工作：

processed_text = re.sub('"', '', raw_text)

参考链接：

Raw String Notation

英文:

You don't get the expected result because of "raw-string", "r" in your example. If you add "r" you should specify your regex expression without any escape characters.

Just remove "r" in your example and it will work as expected:

processed_text = re.sub(&#39;\&quot;&#39;, &quot;&quot;, raw_text)

Reference:

Raw String Notation

答案3

得分: 0

处理一个接一个

processed_text = raw_text.replace('\"', '')
processed_text = processed_text.replace('\\', '')

英文:

Eliminate one by one

processed_text = raw_text.replace(&#39;&quot;&#39;, &#39;&#39;)
processed_text = processed_text.replace(&#39;\&#39;, &#39;&#39;)

答案4

得分: 0

不含代码的翻译如下：

难以想象，一个转义的双引号 \" 表示的意思不同于将此引号包含在双引号分隔的字符串中。因此，很难想象不使用转义的转义符 \\ 来区分字符串中包含的转义与不将后续的双引号（如果有的话）视为字符串结束符。

这似乎是一种明确区分的方法 -

https://regex101.com/r/FH2Dfp/1

查找（原始上下文，用 r' ' 包裹）：

(?&lt;!\\)((?:\\\\)*)\\&quot;

替换为：

`\1`

英文:

It's hard to imagine that an escaped double quote \" means something else than include this quote in the delimited double quote string. Therefore it's impossible to imagine not using an escaped escape \\ to differentiate an included escape in the string from not treating a following double quote (if any) as the closing string delimiter.

This seems to be a nonambiguous way to tell the difference -

https://regex101.com/r/FH2Dfp/1

Find (raw context, wrap in r' '):

(?&lt;!\\)((?:\\\\)*)\\&quot;

Replace with:

\1

答案5

得分: 0

我发现这个有效:

processed_text = re.sub(r'\\&quot;', '', raw_text)

英文:

I found this works:

processed_text = re.sub(r&#39;\\&quot;&#39;, &quot;&quot;, raw_text)

通过集体智慧和协作来改善编程学习和解决问题的方式。致力于成为全球开发者共同参与的知识库，让每个人都能够通过互相帮助和分享经验来进步。

在Python中替换文本文件中的反斜杠后跟双引号。

问题

答案1

答案2

答案3

答案4

答案5

Pyautogui无法输入表情符号。

计算多个GeoDataFrame条目的面积。

如何获取我下载目录中特定文件的最新版本？

创建具有独立依赖关系的动态Airflow任务。

如何在Playwright视觉比较中屏蔽多个定位器？

在C++中，可以使用可变模板参数来检索类型的内部类型。

selenium.common.exceptions.StaleElementReferenceException: Message: stale element reference: stale element not found

Creating and opening a URL to log in to Website via Basic Auth with Robot Framework/Selenium (Python)

AG Grid 在上下文菜单中以大文本形式打开

What's the correct way to type hint an empty list as a literal in python?

如何在Highcharts Gantt中更改本地化的星期名称

如何在同一个流中使用多个过滤器和映射函数？

如何使用Map/Set来将代码优化到O(n)？

.NET MAUI Android在GitHub Actions上构建失败，错误代码为1。

如何在Playwright视觉比较中屏蔽多个定位器？

在C++中，可以使用可变模板参数来检索类型的内部类型。

selenium.common.exceptions.StaleElementReferenceException: Message: stale element reference: stale element not found

Creating and opening a URL to log in to Website via Basic Auth with Robot Framework/Selenium (Python)

AG Grid 在上下文菜单中以大文本形式打开

What's the correct way to type hint an empty list as a literal in python?

如何在Highcharts Gantt中更改本地化的星期名称

如何在同一个流中使用多个过滤器和映射函数？

如何使用Map/Set来将代码优化到O(n)？

.NET MAUI Android在GitHub Actions上构建失败，错误代码为1。

发表评论