问题

我对Rexpy感兴趣，因为我正在寻找一个能推断匹配字符串的正则表达式的工具。通过使用rexpy.extract和help来检查，它看起来可能是我想要的。

extract(examples, tag=False, encoding=None, as_object=False, extra_letters=None, full_escape=False, remove_empties=False, strip=False, variableLengthFrags=False, max_patterns=None, min_diff_strings_per_pattern=1, min_strings_per_pattern=1, size=None, seed=None, dialect='portable', verbose=0)
    从示例中提取正则表达式并返回。
    
    通常示例应该是Unicode（即Python3中的“str”和Python2中的“unicode”）。但是，如果指定了编码，可以传递编码的字符串。
    
    结果将始终为Unicode。
    
    如果设置了as_object，则会返回提取器对象，其中结果在.results.rex中；否则，将返回一列正则表达式，作为Unicode字符串。

所以我尝试了一个例子：

&gt;&gt;&gt; from tdda import rexpy
&gt;&gt;&gt; s = 'andrew.gelman@statistics.com'
&gt;&gt;&gt; rexpy.extract(s)
['^[.@]$', '^[a-z]$']

我期望得到类似于['^[a-z].[a-z]@[a-z].[a-z]$']而不是['^[.@]$', '^[a-z]$']。提取器只是告诉我特殊符号'.'和'@'在字符串中的某个位置被使用了吗？

英文:

I am interesting in Rexpy because I am looking for a tool which infers a regular expression that would match a string. Inspecting rexpy.extract with help it looked like it 'might' be what I want.

extract(examples, tag=False, encoding=None, as_object=False, extra_letters=None, full_escape=False, remove_empties=False, strip=False, variableLengthFrags=False, max_patterns=None, min_diff_strings_per_pattern=1, min_strings_per_pattern=1, size=None, seed=None, dialect=&#39;portable&#39;, verbose=0)
    Extract regular expression(s) from examples and return them.
    
    Normally, examples should be unicode (i.e. ``str`` in Python3,
    and ``unicode`` in Python2). However, encoded strings can be
    passed in provided the encoding is specified.
    
    Results will always be unicode.
    
    If as_object is set, the extractor object is returned,
    with results in .results.rex; otherwise, a list of regular
    expressions, as unicode strings is returned.

So I tried an example:

&gt;&gt;&gt; from tdda import rexpy
&gt;&gt;&gt; s = &#39;andrew.gelman@statistics.com&#39;
&gt;&gt;&gt; rexpy.extract(s)
[&#39;^[.@]$&#39;, &#39;^[a-z]$&#39;]

I expected something similar to ['^[a-z].[a-z]@[a-z].[a-z]$'] rather than ['^[.@]$', '^[a-z]$']. Is the extractor just telling me that special symbols '.' and '@' are used 'somewhere' in the string?

答案1

得分: 3

The examples parameter expects an iterable of strings, by providing a single string as the parameter the function iterates over each individual character and is outputting regular expressions to match those single character examples.

尝试提供一个字符串列表，例如 rexpy.extract(

展开收缩

).

英文:

Try providing a list of strings instead, e.g. rexpy.extract(

展开收缩

).

通过集体智慧和协作来改善编程学习和解决问题的方式。致力于成为全球开发者共同参与的知识库，让每个人都能够通过互相帮助和分享经验来进步。

你应该如何解释 tdda.rexpy.extract 的输出？

问题

答案1

使用正则表达式匹配两个逗号之间的字符串。

这个自定义日志记录器为什么不使用根格式化程序？

合并/连接Notepad++中基于分隔符的行。

How to schedule awaitables for sequential execution without awaiting, without prior knowing the number of awaitables?

What's the correct way to type hint an empty list as a literal in python?

如何在Highcharts Gantt中更改本地化的星期名称

如何在同一个流中使用多个过滤器和映射函数？

如何使用Map/Set来将代码优化到O(n)？

.NET MAUI Android在GitHub Actions上构建失败，错误代码为1。

如何在Playwright视觉比较中屏蔽多个定位器？

在C++中，可以使用可变模板参数来检索类型的内部类型。

selenium.common.exceptions.StaleElementReferenceException: Message: stale element reference: stale element not found

Creating and opening a URL to log in to Website via Basic Auth with Robot Framework/Selenium (Python)

AG Grid 在上下文菜单中以大文本形式打开

发表评论