2023年1月9日 19:26:20go评论127阅读模式

英文:

Gitlab User list API with Python and Amazon S3

问题

我正在尝试从我们的私有GitLab服务器实例中提取完整的用户列表，并将其输入到一个S3存储桶中，以供我们需要时进行引用。最终，我会使用Lambda/cfn的某种形式来定期删除它并重新运行它，以在每周更新它。我在Python方面不是很擅长，目前这是我的代码：

import json
import boto3
import re
import os
import sys
import botocore
import urllib3
from pprint import pprint
sess = boto3.Session(profile_name="sso-profile-here")
s3_client = sess.client("s3")
bucket_name = "user-statistics"
http = urllib3.PoolManager()
baseuri = "https://git.tools.dev.mycompany.net/api/v4/"
access_token = "access-token-code"
def get_gitlab_users(access_token=access_token, baseuri=baseuri):
    headers = {
        "Content-Type": "application/json",
        "Authorization": "Bearer {}".format(access_token),
    }
    url = "{}/users/?per_page=100&active=true&without_project_bots=true&next_page=x-next-page".format(
        baseuri
    )
    req = http.request(method="GET", url=url, headers=headers)
    result = json.loads(req.data)
    s3_client.put_object(
        Bucket=bucket_name, Key="get_users_gitlab.json", Body=json.dumps(result)
    )
if __name__ == "__main__":
    get_gitlab_users(access_token=access_token, baseuri=baseuri)

我希望能够在S3存储桶中获取每个页面上的所有用户，并且还希望在S3存储桶中对其进行一些格式化，当我从存储桶中下载时，格式看起来非常难以阅读，我不确定是否可以改进它，有人能提出我可以做的建议吗？

请忽略访问令牌直接存储在代码中的事实，这仅用于测试阶段，我将确保它不会直接存储在代码中。

提前感谢任何建议。

英文:

I'm trying to pull a complete user list from our private gitlab server instance and input it into an S3 Bucket to reference whenever we need. Eventually I will have some form of Lambda/cfn deleting it and running it again every week to update it. I'm not so great with Python and this is what I have so far..

import json
import boto3
import re
import os
import sys
import botocore
import urllib3
from pprint import pprint
sess = boto3.Session(profile_name=&quot;sso-profile-here&quot;)
s3_client = sess.client(&quot;s3&quot;)
bucket_name = &quot;user-statistics&quot;
http = urllib3.PoolManager()
baseuri = &quot;https://git.tools.dev.mycompany.net/api/v4/&quot;
access_token = &quot;access-token-code&quot;
def get_gitlab_users(access_token=access_token, baseuri=baseuri):
    headers = {
        &quot;Content-Type&quot;: &quot;application/json&quot;,
        &quot;Authorization&quot;: &quot;Bearer {}&quot;.format(access_token),
    }
    url = &quot;{}/users/?per_page=100&amp;active=true&amp;without_project_bots=true&amp;next_page=x-next-page&quot;.format(
        baseuri
    )
    req = http.request(method=&quot;GET&quot;, url=url, headers=headers)
    result = json.loads(req.data)
    s3_client.put_object(
        Bucket=bucket_name, Key=&quot;get_users_gitlab.json&quot;, Body=json.dumps(result)
    )
if __name__ == &quot;__main__&quot;:
    get_gitlab_users(access_token=access_token, baseuri=baseuri)

What I would like to be able to do is pull all the users on each page and also format it a bit neater in the S3 bucket, When I download it from the bucket the format is really unreadable and I'm not sure if I can improve it, can anyone suggest anything I can do?

Please also ignore the fact my access token is directly in the code here, it's for testing at this stage and I will make sure it's not stored directly in code.

Thanks in advance for any suggestions.

答案1

得分: 0

你可以尝试使用 python-gitlab 包来代替 requests。这样获取用户信息应该会更容易：

import gitlab
baseuri = "https://git.tools.dev.mycompany.net"
access_token = "access-token-code"
gl = gitlab.Gitlab(baseuri, private_token=access_token)
users = [user.asdict() for user in gl.users.list()]
users
# [{'id': 1,
#   'username': 'username1',
#   'name': 'name1',
#   'state': 'active',
#   'avatar_url': 'https://avatar.com/1',
#   'web_url': 'https://git.tools.dev.mycompany.net/username1'},
# ...]

英文:

You can try to use python-gitlab package instead of requests. It should be a lot easier to get user infos :

import gitlab
baseuri = &quot;https://git.tools.dev.mycompany.net&quot;
access_token = &quot;access-token-code&quot;
gl = gitlab.Gitlab(baseuri , private_token=access_token)
users = [user.asdict() for user in gl.users.list()]
users
# [{&#39;id&#39;: 1,
#  &#39;username&#39;: &#39;username1&#39;,
#  &#39;name&#39;: &#39;name1&#39;,
#  &#39;state&#39;: &#39;active&#39;,
#  &#39;avatar_url&#39;: &#39;https://avatar.com/1&#39;,
#  &#39;web_url&#39;: &#39;https://git.tools.dev.mycompany.net/username1&#39;},
# ...]

通过集体智慧和协作来改善编程学习和解决问题的方式。致力于成为全球开发者共同参与的知识库，让每个人都能够通过互相帮助和分享经验来进步。

Gitlab用户列表API使用Python和Amazon S3

问题

答案1

如何防止VSCode在语句间重新排列Python导入？

使用loop.run_in_executor从异步函数中调用同步函数。

用Pandas在Python中重塑和清理制表符分隔的数据文件

TypeError: 类型为Properties的对象不可JSON序列化 (Sagemaker管道)

如何在Playwright视觉比较中屏蔽多个定位器？

在C++中，可以使用可变模板参数来检索类型的内部类型。

selenium.common.exceptions.StaleElementReferenceException: Message: stale element reference: stale element not found

Creating and opening a URL to log in to Website via Basic Auth with Robot Framework/Selenium (Python)

AG Grid 在上下文菜单中以大文本形式打开

What's the correct way to type hint an empty list as a literal in python?

如何在Highcharts Gantt中更改本地化的星期名称

如何在同一个流中使用多个过滤器和映射函数？

如何使用Map/Set来将代码优化到O(n)？

.NET MAUI Android在GitHub Actions上构建失败，错误代码为1。