2023年2月16日 16:53:01go评论96阅读模式

英文:

How to extract span in re.finditer method in python?

问题

# 在 re.finditer 的结果中，我使用了 i.span 而不是 i，但我得到了如下的结果。
[i.span() for i in result]

# 我要从 re.finditer 中提取 span，像 (0,10), (12,18), ...
# 请帮帮我！

# 我定义了一个用于获取 re.finditer 结果的函数。
# 代码如下。
import re
def convert_ftn_to_token(seq):
    va = '[a-z]{1,}'
    ftn_lst = ['sin','cos','tan','log_', 'e ?\\^'] 
    ftn_lst = [ftn + ' ?\\{? ?' + va +' ?\\}?'+ ' ?\\(?' for ftn in ftn_lst]
    ftn_lst2  = [chr(i) for i in range(65,91)] + [chr(i) for i in range(97,123)]
    ftn_lst2 = [ftn + ' ?\\( ?' + va + ' ?\\)' for ftn in ftn_lst2]
    ftn_c = re.compile(
        '|'.join(ftn_lst2) +'|'+
        '|'.join(ftn_lst)
    )
    return re.finditer(ftn_c,seq)

# 对于 results 中的每一个 i，使用 i.span()。

英文:

the results of re.finditer is as below.

[i for i in result]
=[&lt;re.Match object; span=(0, 10), match=&#39;sin theta &#39;&gt;,
 &lt;re.Match object; span=(12, 18), match=&#39;cos x &#39;&gt;,
 &lt;re.Match object; span=(20, 26), match=&#39;e ^ x &#39;&gt;,
 &lt;re.Match object; span=(26, 32), match=&#39;f( x )&#39;&gt;,
 &lt;re.Match object; span=(37, 45), match=&#39;log_ {x}&#39;&gt;]

Here, I used the code i.span instead of i, but I just got something as below.

[&lt;function Match.span(group=0, /)&gt;,
 &lt;function Match.span(group=0, /)&gt;,
 &lt;function Match.span(group=0, /)&gt;,
 &lt;function Match.span(group=0, /)&gt;,
 &lt;function Match.span(group=0, /)&gt;]

I'm gonna extract span in re.finditer.
like (0,10), (12,18), ...

Help me please!

I defined the function for getting re.finditer
The code is as below.

import re
def convert_ftn_to_token(seq):
    va = &#39;[a-z]{1,}&#39;
    ftn_lst = [&#39;sin&#39;,&#39;cos&#39;,&#39;tan&#39;,&#39;log_&#39;, &#39;e ?\^&#39;] 
    ftn_lst = [ftn + &#39; ?\{? ?&#39; + va +&#39; ?\}?&#39; for ftn in ftn_lst]
    ftn_lst2  = [chr(i) for i in range(65,91)] + [chr(i) for i in range(97,123)]
    ftn_lst2 = [ftn + &#39; ?\( ?&#39; + va + &#39; ?\)&#39; for ftn in ftn_lst2]
    ftn_c = re.compile(
        &#39;|&#39;.join(ftn_lst2) +&#39;|&#39;+
        &#39;|&#39;.join(ftn_lst)
    )
    return re.finditer(ftn_c,seq)

i.span for i in results

答案1

得分: 0

You can use start() and end() in regex's Match object, documentation about it here. They correspond to the lower and upper bound of span respectively. As for the grouping stated in the docs, that only applies if you are intending to use the grouping functionality of Match. If you intend to get the span of the entire match, you can simply do match.start() and match.end(), where match is the match object returned by the regex.

Another option is using span() of the same Match object. Note this is different from just span which will give you the memory address rather than actually call the function. Doing match.span() will give you a tuple of the start and end. Taking your first match object as an example this would return (0,10)

英文:

答案2

得分: 0

.span 是一个方法，不是一个属性。你需要使用 .span()，它会返回开始和结束的元组。

英文:

.span is a method, not an attribute. You want .span() which will give the start, end tuple.

通过集体智慧和协作来改善编程学习和解决问题的方式。致力于成为全球开发者共同参与的知识库，让每个人都能够通过互相帮助和分享经验来进步。

在Python中使用re.finditer方法提取匹配的片段如何实现？

问题

答案1

答案2

二进制数组函数解决方案

将笨拙格式的Excel数据使用Python转换成表格格式。

防止Matplotlib删除坐标轴上的数字的方法

如何使用变量而不是数字在花括号内格式化字符串？

如何在Playwright视觉比较中屏蔽多个定位器？

在C++中，可以使用可变模板参数来检索类型的内部类型。

selenium.common.exceptions.StaleElementReferenceException: Message: stale element reference: stale element not found

Creating and opening a URL to log in to Website via Basic Auth with Robot Framework/Selenium (Python)

AG Grid 在上下文菜单中以大文本形式打开

What's the correct way to type hint an empty list as a literal in python?

如何在Highcharts Gantt中更改本地化的星期名称

如何在同一个流中使用多个过滤器和映射函数？

如何使用Map/Set来将代码优化到O(n)？

.NET MAUI Android在GitHub Actions上构建失败，错误代码为1。