2023年7月24日 15:41:35go评论95阅读模式

英文:

Analyzing execution of a Python program from another Python program

问题

我想编写一个Python程序，分析其他任意的Python程序的执行。

例如，假设我有一个名为main.py的Python脚本，它调用一个名为func的函数若干次。我想创建另一个名为analyzer.py的脚本，可以在main.py运行时“查看”它，并记录func被调用的次数。我还想记录传递给func的输入参数列表，以及每次调用时func的返回值。

我不能以任何方式修改main.py或func的源代码。理想情况下，analyzer.py应该适用于任何Python程序和任何函数。

我发现实现这一目标的最佳方法是让analyzer.py以子进程的方式运行main.py，并使用pdb进行调试。

script = "main.py"
process = subprocess.Popen(['python', '-m', 'pdb', script], stdin=subprocess.PIPE, stdout=subprocess.PIPE, stderr=subprocess.PIPE)

然后，我可以通过进程的stdin发送pdb命令给程序，然后通过stdout读取输出。

要检索func的输入参数和返回值，我需要：

通过分析其文件找到func的第一行的行号
发送该文件/行号的断点命令
发送继续命令
导入pickle，将locals()序列化，并打印到stdout（以获取输入参数）
发送返回命令（转到函数的末尾）
序列化__return__并打印到stdout
发送继续命令

我想知道是否有更好的方法来实现这个目标。

英文:

I want to write a Python program that analyzes the execution of other arbitrary Python programs.

For example, suppose I have a Python script called main.py that calls a function func a certain number of times. I want to create another script called analyzer.py that can "look inside" main.py while it's running and record how many times func was called. I also want to record the list of input arguments passed to func, and the return value of func each time it was called.

I cannot modify the source code of main.py or func in any way. Ideally analyzer.py would work for any python program, and for any function.

The best way I have found to accomplish this is to have analyzer.py run main.py as a subprocess using pdb.

script = &quot;main.py&quot;
process = subprocess.Popen([&#39;python&#39;, &#39;-m&#39;, &#39;pdb&#39;, script], stdin=subprocess.PIPE, stdout=subprocess.PIPE, stderr=subprocess.PIPE)

I can then send pdb commands to the program via the process' stdin and then read the output via stdout.

To retrieve the input parameters and return values of func, I need to

Find the line number of the first line of func by analyzing its file
Send a breakpoint command for this file/lineno
Send continue command
Import pickle, serialize locals(), and print to stdout (to get input parameters)
Send return command (go to end of function)
Serialize __return__ and print to stdout
Send continue command

I'm wondering if there is a better way to accomplish this

答案1

得分: 2

不使用管道控制pdb，你可以在执行import main之前使用sys.settrace配置自己的跟踪函数（文档链接）。当然，你也可以使用importlib.import_module("main")、runpy.run_module()或runpy.run_path()。

例如，

import sys

def trace(frame, event, args):
    if event == "call":
        print(frame.f_code.co_name, frame.f_locals)

sys.settrace(trace)

# (这里你可以执行`import main`来把控制权交给它)

def func(a, b, c):
    return a + b + c

func(1, 2, 3)
func("a", "b", "c")

输出结果为：

func {'a': 1, 'b': 2, 'c': 3}
func {'a': 'a', 'b': 'b', 'c': 'c'}

英文:

Instead of controlling pdb with pipes, you can just configure your own trace function using sys.settrace before doing import main. (Of course you can also do importlib.import_module("main") or runpy.run_module() or runpy.run_path().)

For instance,

import sys


def trace(frame, event, args):
    if event == &quot;call&quot;:
        print(frame.f_code.co_name, frame.f_locals)


sys.settrace(trace)

# (this is where you&#39;d `import main` to cede control to it)

def func(a, b, c):
    return a + b + c


func(1, 2, 3)
func(&quot;a&quot;, &quot;b&quot;, &quot;c&quot;)

prints out

func {&#39;a&#39;: 1, &#39;b&#39;: 2, &#39;c&#39;: 3}
func {&#39;a&#39;: &#39;a&#39;, &#39;b&#39;: &#39;b&#39;, &#39;c&#39;: &#39;c&#39;}

通过集体智慧和协作来改善编程学习和解决问题的方式。致力于成为全球开发者共同参与的知识库，让每个人都能够通过互相帮助和分享经验来进步。

分析一个Python程序从另一个Python程序中执行

问题

答案1

如何在将网格导入Fipy后从Python中访问gmsh代码？

描述数据何时使用 `value_counts`。

如何在继承内置集合类型时避免mypy的投诉？

无法使Tkinter正确显示。

What's the correct way to type hint an empty list as a literal in python?

如何在Highcharts Gantt中更改本地化的星期名称

如何在同一个流中使用多个过滤器和映射函数？

如何使用Map/Set来将代码优化到O(n)？

.NET MAUI Android在GitHub Actions上构建失败，错误代码为1。

如何在Playwright视觉比较中屏蔽多个定位器？

在C++中，可以使用可变模板参数来检索类型的内部类型。

selenium.common.exceptions.StaleElementReferenceException: Message: stale element reference: stale element not found

Creating and opening a URL to log in to Website via Basic Auth with Robot Framework/Selenium (Python)

AG Grid 在上下文菜单中以大文本形式打开

发表评论