问题

file_push_status (文件推送状态：成功或失败)
first_test_status (第一个测试状态：通过或失败)
second_test_status (第二个测试状态：通过或失败)
first_test_time_taken (第一个测试所需时间：多长时间)
second_test_time_taken (第二个测试所需时间：多长时间)

在Prometheus文档中查阅，但无法确定应该使用摘要（Summary）还是直方图（Histogram）。我了解Prometheus不支持布尔值（前三个情况），应该如何处理？

如有需要，可以附上现有的批处理作业代码。谢谢。

英文:

There is a Python batch job that pushes huge file(s) to a shared location, once the file(s) are pushed, couple of tests will be run against that/those file(s).
I'm trying to get some metrics around the batch job & planning to use Node exporter having below metrics or labels.

file_push_status (success or failure)
first_test_status (Pass or Fail)
second_test_status (Pass or Fail)
first_test_time_taken (How long)
second_test_time_taken (How long)

Gone thru prometheus documentation, but unable to get a clarity whether Summary or Histogram should be used here ? I understand, Prometheus doesnt support Boolean(1st 3 cases), how those should be handled ?

If needed will attach the existing batch job code, thank you.

答案1

得分: 1

以下是要翻译的内容：

"For small number of files you don't need histograms.

Make all three metrics gauges.

Something like

# HELP file_push_success A metric with 0/1 value showing result of file push job. 0 - failure.
# TYPE file_push_success gauge
file_push_success{file=&quot;filename.txt&quot;} 1

# HELP file_push_test_success A metric with 0/1 value showing result of corresponding test after file being pushed. 0 - failure.
# TYPE file_push_test_success gauge
file_push_test_success{file=&quot;filename.txt&quot;, test=&quot;1&quot;} 1
file_push_test_success{file=&quot;filename.txt&quot;, test=&quot;2&quot;} 0

# HELP file_push_test_duration_seconds Duration of corresponding test after file being pushed
# TYPE file_push_test_duration_seconds gauge
file_push_test_duration_seconds{file=&quot;filename.txt&quot;, test=&quot;1&quot;} 5
file_push_test_duration_seconds{file=&quot;filename.txt&quot;, test=&quot;2&quot;} 13

Here I grouped related metrics into one with different labels. It would be more easier to support (for example when you'll decide to add new tests), and is generally advised by Prometheus documentation."

英文:

For small number of files you don't need histograms.

Make all three metrics gauges.

Something like

# HELP file_push_success A metric with 0/1 value showing result of file push job. 0 - failure.
# TYPE file_push_success gauge
file_push_success{file=&quot;filename.txt&quot;} 1 

# HELP file_push_test_success A metric with 0/1 value showing result of corresponding test after file being pushed. 0 - failure.
# TYPE file_push_test_success gauge
file_push_test_success{file=&quot;filename.txt&quot;, test=&quot;1&quot;} 1
file_push_test_success{file=&quot;filename.txt&quot;, test=&quot;2&quot;} 0

# HELP file_push_test_duration_seconds Duration of corresponding test after file being pushed 
# TYPE file_push_test_duration_seconds gauge
file_push_test_duration_seconds{file=&quot;filename.txt&quot;, test=&quot;1&quot;} 5
file_push_test_duration_seconds{file=&quot;filename.txt&quot;, test=&quot;2&quot;} 13

通过集体智慧和协作来改善编程学习和解决问题的方式。致力于成为全球开发者共同参与的知识库，让每个人都能够通过互相帮助和分享经验来进步。

监控批处理作业由Prometheus。

问题

答案1

prometheus的ConstLabels取值

应用TA-Lib的KAMA到带有groupby的DataFrame。

如何将计数分配给一个新列，计算在当前行不属于被计算的组时的行数？

Python函数，我应该如何开始编程。

What's the correct way to type hint an empty list as a literal in python?

如何在Highcharts Gantt中更改本地化的星期名称

如何在同一个流中使用多个过滤器和映射函数？

如何使用Map/Set来将代码优化到O(n)？

.NET MAUI Android在GitHub Actions上构建失败，错误代码为1。

如何在Playwright视觉比较中屏蔽多个定位器？

在C++中，可以使用可变模板参数来检索类型的内部类型。

selenium.common.exceptions.StaleElementReferenceException: Message: stale element reference: stale element not found

Creating and opening a URL to log in to Website via Basic Auth with Robot Framework/Selenium (Python)

AG Grid 在上下文菜单中以大文本形式打开

发表评论