问题

I can help you with the translation. Here is the translated content:

所以我有一个网站，名为example.com，我试图发送请求，当您在浏览器上加载该网站时，会有一个加载屏幕，可能持续2-4秒（它不是验证码，也不是Cloudflare），然后转到主页。在Python中，我试图发送HTTP GET请求到该页面，但遇到了加载屏幕，有没有办法完全避免这个屏幕？或者至少等待屏幕消失，然后获取响应？

import requests
response = requests.get("example.com")
print(response.text)

期望的响应应该是主页，而不是加载屏幕。我想避免使用Selenium、Playwright或任何浏览器仿真包，只有作为最后的手段，如果没有可能的方法可以仅使用requests库来绕过它。

英文:

So I have a website, named example.com I am trying to send a requests too, when you load up the site on the browser there is a loading screen for maybe, 2 - 4 seconds (it is not captcha nor cloudflare) and then goes to the main page. In python I am trying to send a http get requests to that page but am met with the loading screen, is there anyway I can compelstely avoid that screen? Or atleast wait for the screen to pass then retrieve the response?

import requests 
response = requests.get(“example.com”)
print(response.text)

The desired response would be the main page, not the loading screen. I would like to refrain from using selenium, playwright or any browser emulation package, only as a last resort if there is NO possible way to bypass this simply with requests library.

答案1

得分: 1

这应该可以运行。

import pyppeteer
import asyncio
pageurl = ""
async def main():
    # 启动 Chromium 浏览器，也可以使用 Chrome 代替 Chromium。
    browser = await pyppeteer.launch(headless=False)
    # 创建一个空白页面
    page = await browser.newPage()
    # 转到请求的页面并在网站上运行动态代码。
    await page.goto(pageurl)
    # 提供页面的 HTML 内容
    cont = await page.content()
    return cont
# 打印用户配置文件的 HTML 代码：tupac
print(asyncio.get_event_loop().run_until_complete(main()))

英文:

this should work.

import pyppeteer
import asyncio
pageurl = &quot;&quot;
async def main():
    # launches a chromium browser, can use chrome instead of chromium as well.
    browser = await pyppeteer.launch(headless=False)
    # creates a blank page
    page = await browser.newPage()
    # follows to the requested page and runs the dynamic code on the site.
    await page.goto(pageurl)
    # provides the html content of the page
    cont = await page.content()
    return cont
# prints the html code of the user profiel: tupac
print(asyncio.get_event_loop().run_until_complete(main()))

通过集体智慧和协作来改善编程学习和解决问题的方式。致力于成为全球开发者共同参与的知识库，让每个人都能够通过互相帮助和分享经验来进步。

如何在Python请求中绕过加载屏幕？

问题

答案1

获取嵌套TypedDicts的键

从 pandas 透视表复制到数据框以计算小计

How to modify JSON files inside a zip file using shell script/ ANT script/ java code/ python code? which of these is efficient and easy?

如何为 pandas 数据透视表 DataFrame 添加样式？

如何在Playwright视觉比较中屏蔽多个定位器？

在C++中，可以使用可变模板参数来检索类型的内部类型。

selenium.common.exceptions.StaleElementReferenceException: Message: stale element reference: stale element not found

Creating and opening a URL to log in to Website via Basic Auth with Robot Framework/Selenium (Python)

AG Grid 在上下文菜单中以大文本形式打开

What's the correct way to type hint an empty list as a literal in python?

如何在Highcharts Gantt中更改本地化的星期名称

如何在同一个流中使用多个过滤器和映射函数？

如何使用Map/Set来将代码优化到O(n)？

.NET MAUI Android在GitHub Actions上构建失败，错误代码为1。