query-exporter in Docker container not working


Question

I am trying to get query-exporter to run in a Docker container. With advice from the developer I have enabled IPv6 in docker by putting:

    {
      "experimental": true,
      "ip6tables": true
    }

in my docker daemon.json and restarted the daemon.

I am using the following docker-compose file:

    version: "3.3"
    services:
      prometheus:
        container_name: prometheus
        image: prom/prometheus
        restart: always
        volumes:
          - ./prometheus:/etc/prometheus/
          - prometheus_data:/prometheus
        command:
          - '--config.file=/etc/prometheus/prometheus.yml'
          - '--storage.tsdb.path=/prometheus'
          - '--web.console.libraries=/usr/share/prometheus/console_libraries'
          - '--web.console.templates=/usr/share/prometheus/consoles'
        ports:
          - 9090:9090
        networks:
          - prom_app_net
      grafana:
        container_name: grafana
        image: grafana/grafana
        user: '472'
        restart: always
        environment:
          GF_INSTALL_PLUGINS: 'grafana-clock-panel,grafana-simple-json-datasource'
        volumes:
          - grafana_data:/var/lib/grafana
          - ./grafana/provisioning/:/etc/grafana/provisioning/
          - './grafana/grafana.ini:/etc/grafana/grafana.ini'
        env_file:
          - ./grafana/.env_grafana
        ports:
          - 3000:3000
        depends_on:
          - prometheus
        networks:
          - prom_app_net
      mysql:
        image: mariadb:10.10
        hostname: mysql
        container_name: mysql
        environment:
          MYSQL_RANDOM_ROOT_PASSWORD: "yes"
          MYSQL_DATABASE: slurm_acct_db
          MYSQL_USER: slurm
          MYSQL_PASSWORD: password
        volumes:
          - var_lib_mysql:/var/lib/mysql
        networks:
          - slurm
        # network_mode: host
      slurmdbd:
        image: prom-slurm-cluster:${IMAGE_TAG:-21.08.6}
        build:
          context: .
          args:
            SLURM_TAG: ${SLURM_TAG:-slurm-21-08-6-1}
        command: ["slurmdbd"]
        container_name: slurmdbd
        hostname: slurmdbd
        volumes:
          - etc_munge:/etc/munge
          - etc_slurm:/etc/slurm
          - var_log_slurm:/var/log/slurm
          - cgroups:/sys/fs/cgroup:ro
        expose:
          - "6819"
        ports:
          - "6819:6819"
        depends_on:
          - mysql
        privileged: true
        cgroup: host
        networks:
          - slurm
        #network_mode: host
      slurmctld:
        image: prom-slurm-cluster:${IMAGE_TAG:-21.08.6}
        command: ["slurmctld"]
        container_name: slurmctld
        hostname: slurmctld
        volumes:
          - etc_munge:/etc/munge
          - etc_slurm:/etc/slurm
          - slurm_jobdir:/data
          - var_log_slurm:/var/log/slurm
          - etc_prometheus:/etc/prometheus
          - /sys/fs/cgroup:/sys/fs/cgroup:rw
        expose:
          - "6817"
          - "8080"
          - "8081"
          - "8082/tcp"
        ports:
          - 8080:8080
          - 8081:8081
          - 8082:8082/tcp
        depends_on:
          - "slurmdbd"
        privileged: true
        cgroup: host
        #network_mode: host
        networks:
          - slurm
      c1:
        image: prom-slurm-cluster:${IMAGE_TAG:-21.08.6}
        command: ["slurmd"]
        hostname: c1
        container_name: c1
        volumes:
          - etc_munge:/etc/munge
          - etc_slurm:/etc/slurm
          - slurm_jobdir:/data
          - var_log_slurm:/var/log/slurm
          - cgroups:/sys/fs/cgroup:ro
        expose:
          - "6818"
        depends_on:
          - "slurmctld"
        privileged: true
        cgroup: host
        #network_mode: host
        networks:
          - slurm
      c2:
        image: prom-slurm-cluster:${IMAGE_TAG:-21.08.6}
        command: ["slurmd"]
        hostname: c2
        container_name: c2
        volumes:
          - etc_munge:/etc/munge
          - etc_slurm:/etc/slurm
          - slurm_jobdir:/data
          - var_log_slurm:/var/log/slurm
          - cgroups:/sys/fs/cgroup:ro
        expose:
          - "6818"
          - "22"
        depends_on:
          - "slurmctld"
        privileged: true
        cgroup: host
        networks:
          - slurm
        #network_mode: host
    volumes:
      etc_munge:
      etc_slurm:
      slurm_jobdir:
      var_lib_mysql:
      var_log_slurm:
      grafana_data:
      prometheus_data:
      cgroups:
      etc_prometheus:
    networks:
      prom_app_net:
      slurm:
        enable_ipv6: true
        ipam:
          config:
            - subnet: 2001:0DB8::/112

Then installed query-exporter on the slurmctld container and run it with the following config.yaml:

    databases:
      db1:
        dsn: sqlite:////test.db
        connect-sql:
          - PRAGMA application_id = 123
          - PRAGMA auto_vacuum = 1
        labels:
          region: us1
          app: app1
    metrics:
      metric1:
        type: gauge
        description: A sample gauge
    queries:
      query1:
        interval: 5
        databases: [db1]
        metrics: [metric1]
        sql: SELECT random() / 1000000000000000 AS metric1

But it is not working - Prometheus lists the target as down.

But the container set-up seems to be fine, because if I run the following test exporter:

    from prometheus_client import start_http_server, Summary
    import random
    import time

    # Create a metric to track time spent and requests made.
    REQUEST_TIME = Summary('request_processing_seconds', 'Time spent processing request')

    # Decorate function with metric.
    @REQUEST_TIME.time()
    def process_request(t):
        """A dummy function that takes some time."""
        time.sleep(t)

    if __name__ == '__main__':
        # Start up the server to expose the metrics.
        start_http_server(8082)
        # Generate some requests.
        while True:
            process_request(random.random())

Prometheus can connect to the target fine.

Can anyone see what the problem could be?

Thanks!

Update

I ran query-exporter by hand on the slurmctld container. There isn't anything in the container logs about query-exporter:

    2023-07-10 10:11:37 ---> Starting the MUNGE Authentication service (munged) ...
    2023-07-10 10:11:37 ---> Waiting for slurmdbd to become active before starting slurmctld ...
    2023-07-10 10:11:37 -- slurmdbd is not available. Sleeping ...
    2023-07-10 10:11:39 -- slurmdbd is now active ...
    2023-07-10 10:11:39 ---> starting systemd ...

I think the test_query.py that works is using IPv4 on port 8082, while query-exporter is trying to bind IPv6.
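To illustrate why that would make the target unreachable, here is a small sketch (not part of the original setup) showing that an IPv4 client can reach a listener bound to 0.0.0.0, as the test exporter does, but not one bound only to an IPv6 address:

```python
import socket

def can_connect_ipv4(port):
    """Try an IPv4 TCP connection to localhost:port."""
    try:
        with socket.create_connection(("127.0.0.1", port), timeout=1):
            return True
    except OSError:
        return False

# A listener bound to 0.0.0.0 accepts IPv4 clients (like test_query.py does).
v4 = socket.socket(socket.AF_INET, socket.SOCK_STREAM)
v4.bind(("0.0.0.0", 0))  # port 0 = pick any free port
v4.listen(1)
port_v4 = v4.getsockname()[1]
print(can_connect_ipv4(port_v4))  # True

# A listener bound only to an IPv6 address is invisible to IPv4 clients,
# which is the suspected situation with query-exporter's default bind.
v6 = socket.socket(socket.AF_INET6, socket.SOCK_STREAM)
v6.setsockopt(socket.IPPROTO_IPV6, socket.IPV6_V6ONLY, 1)
v6.bind(("::1", 0))
v6.listen(1)
port_v6 = v6.getsockname()[1]
print(can_connect_ipv4(port_v6))  # False
```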

docker port slurmctld gives:

    8080/tcp -> 0.0.0.0:8080
    8080/tcp -> [::]:8080
    8081/tcp -> 0.0.0.0:8081
    8081/tcp -> [::]:8081
    8082/tcp -> 0.0.0.0:8082
    8082/tcp -> [::]:8082

I guess I need to point Prometheus at 8082/tcp -> [::]:8082 when query-exporter runs, but I'm not sure how to do it.
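For reference, a Prometheus scrape job pointing at the exporter might look something like the fragment below. The job name is illustrative, and slurmctld:8082 assumes Prometheus can resolve the compose service name on a shared network (which is not the case with the compose file above, where prometheus and slurmctld sit on different networks - another thing worth checking):

```yaml
scrape_configs:
  - job_name: query-exporter        # hypothetical job name
    static_configs:
      - targets: ['slurmctld:8082'] # compose service name : exporter port
```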

Answer 1

Score: 0

Running with query-exporter config.yaml -H 0.0.0.0 -p 8082 gets it to work.

huangapple
  • Published on 2023-07-10 17:57:43
  • Original link: https://go.coder-hub.com/76652641.html