2020年8月14日 01:48:04go评论127阅读模式

英文:

Antlr4/Java : how to make a semantic predicate that skips a token (lexer) according to the parser rule that calls it

问题

我想要使用我的词法规则
```antlr4
NEW_LINE : '\n' -> skip;

像普通规则一样。理解这一点：我希望忽略换行符，除非它们是必需的，以创建类似Python的语法。例如，在这里，换行符被忽略：

cook("banana",
     "potatoe)

但是在新语句中不可能跳过换行符，就像这样：

cook("banana", "potatoe") varA = 12.4

在cook()和赋值之间必须有一个换行符。这就是为什么有时我必须跳过换行符，但仍然需要在其他地方强制它们的原因。

这就是我想到的：

start
	: line*
	;
line
	: line_expression (NEW_LINE | EOF)
	;
line_expression
	: expression
	| assignment
	;
expression
	: Decimal
	| Integer
	| Text
	| Boolean
	;

并创建一个语义谓词，如“如果调用解析器规则不是line，则skip();它。”
现在我只需要帮助来实现这一点。

我希望我表达清楚！

PS：如果不清楚，我正在使用Java作为主要语言。

英文:

I would like to use my lexer rule

NEW_LINE : &#39;\n&#39; -&gt; skip;

Like a normal rule. Understanding by this: I want to ignore the new lines except when they are mandatory, to create a Python similar syntax. For example, here, new lines are ignored:

cook(&quot;banana&quot;,
     &quot;potatoe)

but it is impossible to skip the new line for a new statement, like this:

cook(&quot;banana&quot;, &quot;potatoe&quot;) varA = 12.4

, there must be a new line between cook() and the assignment. This is why I sometimes have to skip the new lines, but still force them somewhere else.

This is why I got this idea:

start
	: line*
	;
line
	: line_expression (NEW_LINE | EOF)
	;
line_expression
	: expression
	| assignment
	;
expression
	: Decimal
	| Integer
	| Text
	| Boolean
	;

And make a semantic predicate like "if the calling parser rule is not line, skip(); it."
Now I just need help to do that.

I hope I was clear !

PS: I'm using Java as main language if that wasn't clear

答案1

得分: 1

您可以跟踪遇到的(的数量（如果遇到)则减少此数量）。然后，只有在此数量等于零时才创建NL令牌。

这里是一个快速演示：

语法规则 T;
@lexer::members {
  int parensLevel = 0;
}
解析
 : .*? EOF
 ;
OPAR    : '(' {parensLevel++;};
CPAR    : ')' {parensLevel--;};
NUMBER  : [0-9]+ ('.' [0-9]+)?;
STRING  : '"' ~'"'* '"';
ASSIGN  : '=';
COMMA   : ',';
ID      : [a-zA-Z]+;
SPACES  : [ \t]+ -> skip;
NL      : {parensLevel == 0}? [\r\n]+;
NL_SKIP : [\r\n]+ -> skip;

如果您向词法分析器提供以下输入：

cook("banana",
     "potatoe")
  varA = 12.4

将创建以下标记：

ID                        `cook`
'('                       `(` 
STRING                    `"banana"`
','                       `,`
STRING                    `"potatoe"`
')'                       `)`
NL                        `\n`
ID                        `varA`
'='                       `=`
NUMBER                    `12.4`

正如您所看到的，括号内的NL被跳过，而在)之后的NL未被跳过。

英文:

You could keep track of the number of ( you encounter (and decrease this numbers if you encounter a )). Then you only create NL tokens if this number is equal to zero.

Here's a quick demo:

grammar T;
@lexer::members {
  int parensLevel = 0;
}
parse
 : .*? EOF
 ;
OPAR    : &#39;(&#39; {parensLevel++;};
CPAR    : &#39;)&#39; {parensLevel--;};
NUMBER  : [0-9]+ ( &#39;.&#39; [0-9]+)?;
STRING  : &#39;&quot;&#39; ~&#39;&quot;&#39;* &#39;&quot;&#39;;
ASSIGN  : &#39;=&#39;;
COMMA   : &#39;,&#39;;
ID      : [a-zA-Z]+;
SPACES  : [ \t]+ -&gt; skip;
NL      : {parensLevel == 0}? [\r\n]+;
NL_SKIP : [\r\n]+ -&gt; skip;

If you feed the lexer the following input:

cook(&quot;banana&quot;,
     &quot;potatoe&quot;)
  varA = 12.4

the following tokens will be created:

ID                        `cook`
&#39;(&#39;                       `(`
STRING                    `&quot;banana&quot;`
&#39;,&#39;                       `,`
STRING                    `&quot;potatoe&quot;`
&#39;)&#39;                       `)`
NL                        `\n`
ID                        `varA`
&#39;=&#39;                       `=`
NUMBER                    `12.4`

As you can see, the NL inside the parens is skipped, while the one after the ) is not.

通过集体智慧和协作来改善编程学习和解决问题的方式。致力于成为全球开发者共同参与的知识库，让每个人都能够通过互相帮助和分享经验来进步。

Antlr4/Java : how to make a semantic predicate that skips a token (lexer) according to the parser rule that calls it

问题

答案1

如何在Python中将存储在列表中的字典中的值从字符串更新为整数？

全栈网络托管服务

Java：如何获取ShortBuffer中的项目数？

返回Java对象从Mono

如何在Playwright视觉比较中屏蔽多个定位器？

在C++中，可以使用可变模板参数来检索类型的内部类型。

selenium.common.exceptions.StaleElementReferenceException: Message: stale element reference: stale element not found

Creating and opening a URL to log in to Website via Basic Auth with Robot Framework/Selenium (Python)

AG Grid 在上下文菜单中以大文本形式打开

What's the correct way to type hint an empty list as a literal in python?

如何在Highcharts Gantt中更改本地化的星期名称

如何在同一个流中使用多个过滤器和映射函数？

如何使用Map/Set来将代码优化到O(n)？

.NET MAUI Android在GitHub Actions上构建失败，错误代码为1。