2023年5月28日 14:22:19go评论96阅读模式

英文:

Attach eBPF sockops program to a specific container cgroup

问题

我想将一个eBPF sockops程序附加到特定的Kubernetes Pod。我正在使用以下方式的bpf_prog_attach()辅助函数：

err = bpf_prog_attach(sockops_prog_fd, cgroup_fd, BPF_CGROUP_SOCK_OPS, 0);

以下是我附加到SOCKOPS挂钩的BPF程序：

#include &lt;linux/in.h&gt;
#include &lt;linux/tcp.h&gt;
#include &lt;linux/bpf.h&gt;
#include &lt;sys/socket.h&gt;
#include &lt;bpf/bpf_endian.h&gt;
#include &lt;bpf/bpf_helpers.h&gt;
char LICENSE[] SEC(&quot;license&quot;) = &quot;GPL&quot;;
// sock_ops_map将sock_ops键映射到套接字描述符
struct {
  __uint(type, BPF_MAP_TYPE_SOCKHASH);
  __uint(max_entries, 65535);
  __type(key, struct sock_key);
  __type(value, __u64);
} sock_ops_map SEC(&quot;.maps&quot;);
// `sock_key'是sockmap的键
struct sock_key {
  __u32 sip4;
  __u32 dip4;
  __u32 sport;
  __u32 dport;
} __attribute__((packed));
// `sk_extract_key'从`bpf_sock_ops'结构中提取键
static inline void sk_extract_key(struct bpf_sock_ops *ops,
                                  struct sock_key *key) {
  key-&gt;dip4 = ops-&gt;remote_ip4;
  key-&gt;sip4 = ops-&gt;local_ip4;
  key-&gt;sport = (bpf_htonl(ops-&gt;local_port) &gt;&gt; 16);
  key-&gt;dport = ops-&gt;remote_port &gt;&gt; 16;
}
SEC(&quot;sockops&quot;)
int bpf_add_to_sockhash(struct bpf_sock_ops *skops) {
  __u32 family, op;
  family = skops-&gt;family;
  op = skops-&gt;op;
  bpf_printk(&quot;Got new operation %d for socket.\n&quot;, op);
  switch (op) {
    case BPF_SOCK_OPS_PASSIVE_ESTABLISHED_CB:
    case BPF_SOCK_OPS_ACTIVE_ESTABLISHED_CB:
      if (family == AF_INET) {
        struct sock_key key = {};
        sk_extract_key(skops, &amp;key);
        int ret = bpf_sock_hash_update(skops, &amp;sock_ops_map, &amp;key, BPF_NOEXIST);
        if (ret != 0) {
          bpf_printk(&quot;Failed to update sockmap: %d\n&quot;, ret);
        } else {
          bpf_printk(&quot;Added new socket to sockmap\n&quot;);
        }
      }
      break;
    default:
      break;
  }
  return 0;
}

在上述代码中，当我提供cgroup_fd为/sys/fs/cgroup/unified cgroup时，程序可以正常工作 - eBPF程序被加载，并且打印语句正常工作。

然而，当我使用特定的Kubernetes Pod cgroup（使用cgroup_fd为/sys/fs/cgroup/unified/kubepods-burstable-podad4348c2_ac53_4c09_a9dc_c207a6c68dec.slice:cri-containerd:30a47e8e847277317a29ff7bdcf5bf03391ff79b847be647120d285f62a0f7e6）时，程序仍然成功附加，但我不会收到打印语句。

附加到子cgroup的SOCKOPS挂钩是否存在问题？还是特定Kubernetes Pod的cgroup与unified/中的不同？

英文:

I want to attach an eBPF sockops program to a specific kubernetes pod. I am using the bpf_prog_attach() helper as follows:

err = bpf_prog_attach(sockops_prog_fd, cgroup_fd, BPF_CGROUP_SOCK_OPS, 0);

And here is the BPF program that I attach to the SOCKOPS hook:

#include &lt;linux/in.h&gt;
#include &lt;linux/tcp.h&gt;
#include &lt;linux/bpf.h&gt;
#include &lt;sys/socket.h&gt;
#include &lt;bpf/bpf_endian.h&gt;
#include &lt;bpf/bpf_helpers.h&gt;
char LICENSE[] SEC(&quot;license&quot;) = &quot;GPL&quot;;
// sock_ops_map maps the sock_ops key to a socket descriptor
struct {
__uint(type, BPF_MAP_TYPE_SOCKHASH);
__uint(max_entries, 65535);
__type(key, struct sock_key);
__type(value, __u64);
} sock_ops_map SEC(&quot;.maps&quot;);
// `sock_key&#39; is a key for the sockmap
struct sock_key {
__u32 sip4;
__u32 dip4;
__u32 sport;
__u32 dport;
} __attribute__((packed));
// `sk_extract_key&#39; extracts the key from the `bpf_sock_ops&#39; struct
static inline void sk_extract_key(struct bpf_sock_ops *ops,
struct sock_key *key) {
key-&gt;dip4 = ops-&gt;remote_ip4;
key-&gt;sip4 = ops-&gt;local_ip4;
key-&gt;sport = (bpf_htonl(ops-&gt;local_port) &gt;&gt; 16);
key-&gt;dport = ops-&gt;remote_port &gt;&gt; 16;
}
SEC(&quot;sockops&quot;)
int bpf_add_to_sockhash(struct bpf_sock_ops *skops) {
__u32 family, op;
family = skops-&gt;family;
op = skops-&gt;op;
bpf_printk(&quot;Got new operation %d for socket.\n&quot;, op);
switch (op) {
case BPF_SOCK_OPS_PASSIVE_ESTABLISHED_CB:
case BPF_SOCK_OPS_ACTIVE_ESTABLISHED_CB:
if (family == AF_INET) {
struct sock_key key = {};
sk_extract_key(skops, &amp;key);
int ret = bpf_sock_hash_update(skops, &amp;sock_ops_map, &amp;key, BPF_NOEXIST);
if (ret != 0) {
bpf_printk(&quot;Failed to update sockmap: %d\n&quot;, ret);
} else {
bpf_printk(&quot;Added new socket to sockmap\n&quot;);
}
}
break;
default:
break;
}
return 0;
}

In above, when I provide the cgroup_fd for the /sys/fs/cgroup/unified cgroup, the program works - the eBPF program gets loaded, and the print statement works.

However, when I use the specific cgroup for a Kubernetes pod (using the cgroup_fd as /sys/fs/cgroup/unified/kubepods-burstable-podad4348c2_ac53_4c09_a9dc_c207a6c68dec.slice:cri-containerd:30a47e8e847277317a29ff7bdcf5bf03391ff79b847be647120d285f62a0f7e6, then the program still attaches successfully but I don't get the print statements.

Is there a problem in attaching to the SOCKOPS hook for a child cgroup? Or is the cgroup for a specific kubernetes pod different from the one in unified/?

答案1

得分: 0

问题似乎出在目录名称上。在我的系统上，每个Kubernetes Pod都有两个对应的目录。例如，在我的情况下，具有ID ad4348c2-ac53-4c09-a9dc-c207a6c68dec 的Kubernetes Pod具有以下两个cgroup目录：

正确的cgroup可以使用以下命令找到（或通过检查Pod的JSON输出找到）：

因此，正确的cgroup用于附加和检查套接字消息的Pod将是 kubepods-burstable-podad4348c2_ac53_4c09_a9dc_c207a6c68dec.slice:cri-containerd:3a4af0e09c0e7e506fef59b92cbeb008b0a3e66d442e54e5ca5ded642841a335。

英文:

It seems the issue was in the directory name. On my system, each kubernetes pod had two corresponding directories. For example, in my case, the kubernetes pod with ID ad4348c2-ac53-4c09-a9dc-c207a6c68dec had the following two cgroup directories:

$  ls | grep ad4348c2_ac53_4c09_a9dc_c207a6c68dec
kubepods-burstable-podad4348c2_ac53_4c09_a9dc_c207a6c68dec.slice:cri-containerd:30a47e8e847277317a29ff7bdcf5bf03391ff79b847be647120d285f62a0f7e6
kubepods-burstable-podad4348c2_ac53_4c09_a9dc_c207a6c68dec.slice:cri-containerd:3a4af0e09c0e7e506fef59b92cbeb008b0a3e66d442e54e5ca5ded642841a335

The correct cgroup can be found using following command (or by inspecting the pod's json output):

$ kubectl get pods -A -o custom-columns=PodName:.metadata.name,PodUID:.metadata.uid,ContainerID:.status.containerStatuses[0].containerID
PodName                                                            PodUID                                 ContainerID
frontend-b74f77687-sd8rf                                           ad4348c2-ac53-4c09-a9dc-c207a6c68dec   containerd://3a4af0e09c0e7e506fef59b92cbeb008b0a3e66d442e54e5ca5ded642841a335

Hence, the correct cgroup for the pod to attach and inspect socket messages would be kubepods-burstable-podad4348c2_ac53_4c09_a9dc_c207a6c68dec.slice:cri-containerd:3a4af0e09c0e7e506fef59b92cbeb008b0a3e66d442e54e5ca5ded642841a335

通过集体智慧和协作来改善编程学习和解决问题的方式。致力于成为全球开发者共同参与的知识库，让每个人都能够通过互相帮助和分享经验来进步。

将eBPF sockops程序附加到特定容器的cgroup

问题

答案1

在不同主机上代理GRPC请求

我的镜像没有被Kubernetes部署。

Golang：什么是etext？

持久卷不支持快照（Velero，MinIO，FSB）。

如何在Playwright视觉比较中屏蔽多个定位器？

在C++中，可以使用可变模板参数来检索类型的内部类型。

selenium.common.exceptions.StaleElementReferenceException: Message: stale element reference: stale element not found

Creating and opening a URL to log in to Website via Basic Auth with Robot Framework/Selenium (Python)

AG Grid 在上下文菜单中以大文本形式打开

What's the correct way to type hint an empty list as a literal in python?

如何在Highcharts Gantt中更改本地化的星期名称

如何在同一个流中使用多个过滤器和映射函数？

如何使用Map/Set来将代码优化到O(n)？

.NET MAUI Android在GitHub Actions上构建失败，错误代码为1。