2023-05-15 06:09:46

Why does the strip() method return a list from a list of words?

Question

I was trying to make a list with each char of a sentence.

Given the sentence "7h3 qu1ck brown fox jumps ov3r 7h3 lazy dog"

I used the .split() method to separate the words and afterwards another .split() method:

x = [i.split() for i in sentence]

and it returned:

[['7'], ['h'], ['3'], [], ['q'], ['u'], ['1'], ['c'], ['k'], [], ['b'], ['r'], ['o'], ['w'], ['n'], [], ['f'], ['o'], ['x'], [], ['j'], ['u'], ['m'], ['p'], ['s'], [], ['o'], ['v'], ['3'], ['r'], [], ['7'], ['h'], ['3'], [], ['l'], ['a'], ['z'], ['y'], [], ['d'], ['o'], ['g']]

So I was trying to figure out why this was happening and tried with:

[i.strip() for i in sentence if len(i) > 0]

and it returned:

['7', 'h', '3', '', 'q', 'u', '1', 'c', 'k', '', 'b', 'r', 'o', 'w', 'n', '', 'f', 'o', 'x', '', 'j', 'u', 'm', 'p', 's', '', 'o', 'v', '3', 'r', '', '7', 'h', '3', '', 'l', 'a', 'z', 'y', '', 'd', 'o', 'g']

The first issue I'm facing is that I don't know why it is returning empty strings.
The second issue is that I don't know why strip() behaves differently from .split(). I think it may be related to the return value of each method, but I would like to understand better why this happens.

Answer 1

Score: 1

.split(delim) takes a string and returns a list of the substrings separated by delim. .strip() removes any leading and trailing whitespace from the string and returns the resulting string.
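A minimal sketch of the difference in return types, using a made-up sample string (the names text, words and trimmed are illustrative and not from the question):

```python
# Illustrative sample; `text` is a hypothetical name, not from the question.
text = "  quick brown  "

words = text.split()    # split() returns a list of substrings
trimmed = text.strip()  # strip() returns a single string

print(type(words).__name__, words)            # list ['quick', 'brown']
print(type(trimmed).__name__, repr(trimmed))  # str 'quick brown'
```

So a comprehension built on split() produces a list of lists, while one built on strip() produces a list of strings.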

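The first issue you encountered was that you were splitting the string incorrectly. In x = [i.split() for i in sentence], you iterate over every character of sentence, not every word. So i.split() is splitting a single character on whitespace: a non-space character comes back as a one-element list, and a space comes back as an empty list, because nothing is left once the whitespace is removed. For the same reason, ' '.strip() returns an empty string, and the len(i) > 0 check does not filter the spaces out because a space has length 1. See this below: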

>>> sentence = '7h3 qu1ck brown fox jumps ov3r 7h3 lazy dog'
>>> [i for i in sentence]
['7', 'h', '3', ' ', 'q', 'u', '1', 'c', 'k', ' ', 'b', 'r', 'o', 'w', 'n', ' ', 'f', 'o', 'x', ' ', 'j', 'u', 'm', 'p', 's', ' ', 'o', 'v', '3', 'r', ' ', '7', 'h', '3', ' ', 'l', 'a', 'z', 'y', ' ', 'd', 'o', 'g']
>>> sentence.split(' ')
['7h3', 'qu1ck', 'brown', 'fox', 'jumps', 'ov3r', '7h3', 'lazy', 'dog']

Instead, you want to split by each space character (' ') to get a list of each word:

x = sentence.split(' ')
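The empty entries in the question's output can be traced to the space characters. A short sketch (the names space and spaced are illustrative, not from the question) that also shows how split() and split(' ') differ when spaces repeat:

```python
space = " "

print(space.split())        # [] : no non-whitespace text to return
print(repr(space.strip()))  # '' : the only character is stripped away
print(len(space) > 0)       # True : the question's filter keeps the space

# split() and split(' ') differ when spaces repeat:
spaced = "lazy  dog"        # two spaces between the words
print(spaced.split())       # ['lazy', 'dog']
print(spaced.split(" "))    # ['lazy', '', 'dog']
```

With no argument, split() treats any run of whitespace as one separator, which is often the safer choice for splitting a sentence into words.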

In response to what you were trying to do:
> I was trying to make a list with each char of a sentence.

All you need to do is convert the string to a list with list(). For example:

>>> list('7h3 qu1ck brown fox jumps ov3r 7h3 lazy dog')
['7', 'h', '3', ' ', 'q', 'u', '1', 'c', 'k', ' ', 'b', 'r', 'o', 'w', 'n', ' ', 'f', 'o', 'x', ' ', 'j', 'u', 'm', 'p', 's', ' ', 'o', 'v', '3', 'r', ' ', '7', 'h', '3', ' ', 'l', 'a', 'z', 'y', ' ', 'd', 'o', 'g']

If you first want to split into words and then get each character of each word, do this:

>>> [list(word) for word in '7h3 qu1ck brown fox jumps ov3r 7h3 lazy dog'.split()]
[['7', 'h', '3'], ['q', 'u', '1', 'c', 'k'], ['b', 'r', 'o', 'w', 'n'], ['f', 'o', 'x'], ['j', 'u', 'm', 'p', 's'], ['o', 'v', '3', 'r'], ['7', 'h', '3'], ['l', 'a', 'z', 'y'], ['d', 'o', 'g']]
