2023年6月26日 02:17:42go评论126阅读模式

英文:

Download a file from a GitHub repository using Python

问题

我想从GitHub存储库中下载一个单个文件。在bash中，您可以执行类似以下的操作：

curl -kLSs "https://github.com/mikolalysenko/lena/archive/master.tar.gz" | tar xz --wildcards '*lena.png' --strip-components=1

以在当前工作目录中下载并保存此文件。如何在Python中做到这一点（即不调用bash命令）？

英文:

I would like to download a single file from a GitHub repository. In bash, you could do something like this:

curl -kLSs &quot;https://github.com/mikolalysenko/lena/archive/master.tar.gz&quot; | tar xz --wildcards &#39;*lena.png&#39; --strip-components=1`

to download and save this file in the current working directory. How would one do this using only Python (aka. not calling a bash command)?

答案1

得分: 2

这是要翻译的内容：

"Possible duplicate of https://stackoverflow.com/questions/14120502/how-to-download-and-write-a-file-from-github-using-requests

But if you want to avoid using external modules, the following code can work:

import urllib.request

url = &quot;&quot;
file_name = &quot;file_name&quot;

with urllib.request.urlopen(url) as file, open(file_name, &#39;w&#39;) as f:
    f.write(file.read().decode())

You can change open function parameters to save the file in your desired place"

英文:

Possible duplicate of https://stackoverflow.com/questions/14120502/how-to-download-and-write-a-file-from-github-using-requests

But if you want to avoid using external modules, the following code can work:

import urllib.request

url = &quot;&quot;
file_name = &quot;file_name&quot;

with urllib.request.urlopen(url) as file, open(file_name, &#39;w&#39;) as f:
    f.write(file.read().decode())

You can change open function parameters to save the file in your desired place

答案2

得分: 1

I am not sure if with "pure python" you mean without modules, if not using the method urlretrive from the urllib module could be a solution:

import urllib.request


def main():
    urllib.request.urlretrieve("https://raw.githubusercontent.com/mikolalysenko/lena/master/lena.png", "test.png")


if __name__ == "__main__":
    main()

To download files from github you have to use the raw.githubusercontent.com domain, because the files of github repositories get stored there, but this answer explains it better.

英文:

I am not sure if with "pure python" you mean without modules, if not using the method urlretrive from the urllib module could be a solution:

import urllib.request


def main():
    urllib.request.urlretrieve(&quot;https://raw.githubusercontent.com/mikolalysenko/lena/master/lena.png&quot;, &quot;test.png&quot;)


if __name__ == &quot;__main__&quot;:
    main()

To download files from github you have to use the raw.githubusercontent.com domain, because the files of github repositories get stored there, but this answer explains it better.

答案3

得分: 0

以下是代码部分的翻译：

import requests
from io import BytesIO
from pathlib import Path
from zipfile import ZipFile

file_name = 'lena.png'
timeout = 10
url = 'https://github.com/mikolalysenko/lena/archive/master.zip'

r = requests.get(url, timeout=timeout)
if r.ok:
    print('found:', Path(url).name)
    with ZipFile(BytesIO(r.content)) as f:
        for file in f.namelist():
            if file.endswith(file_name):
                print('found:', file)
                data = f.read(file)
                Path(Path(file).name).write_bytes(data)
else:
    print(r)
    print(r.text)

请注意，代码中的注释和字符串没有被翻译。

英文:

I use this do download the zip embedded changlog.txt file from a project I follow on github.

import requests
from io import BytesIO
from pathlib import Path
from zipfile import ZipFile


file_name = &#39;lena.png&#39;
timeout = 10
url = &#39;https://github.com/mikolalysenko/lena/archive/master.zip&#39;

r = requests.get(url, timeout=timeout)
if r.ok:
	print(&#39;found:&#39;, Path(url).name)
	with ZipFile(BytesIO(r.content)) as f:
		for file in f.namelist():
			if file.endswith(file_name):
				print(&#39;found:&#39;, file)
				data = f.read(file)
				Path(Path(file).name).write_bytes(data)
else:
	print(r)
	print(r.text)

通过集体智慧和协作来改善编程学习和解决问题的方式。致力于成为全球开发者共同参与的知识库，让每个人都能够通过互相帮助和分享经验来进步。

使用Python从GitHub存储库下载文件。

问题

答案1

答案2

答案3

如何在使用Scrapy中的get_project_settings()时指定代理列表的路径？

Kivy: 将 BoxLayout 对齐到另一个 BoxLayout 的右侧

决策树分类器 // 准确性分数

有没有一个函数可以返回所有行并排除在Python数据帧中不符合条件的行？

What's the correct way to type hint an empty list as a literal in python?

如何在Highcharts Gantt中更改本地化的星期名称

如何在同一个流中使用多个过滤器和映射函数？

如何使用Map/Set来将代码优化到O(n)？

.NET MAUI Android在GitHub Actions上构建失败，错误代码为1。

如何在Playwright视觉比较中屏蔽多个定位器？

在C++中，可以使用可变模板参数来检索类型的内部类型。

selenium.common.exceptions.StaleElementReferenceException: Message: stale element reference: stale element not found

Creating and opening a URL to log in to Website via Basic Auth with Robot Framework/Selenium (Python)

AG Grid 在上下文菜单中以大文本形式打开

发表评论