问题

我正在尝试使用Go语言从一个需要用户/密码登录的网站上爬取数据。使用Python的requests库可以很简单地实现这一点：

import requests

session = requests.Session()
session.post("https://site.com/login", data={'username': 'user', 'password': '123456'})

# 访问需要身份验证的URL
resp = session.get('https://site.com/restricted/url')

请问如何用Go语言实现相同的功能呢？谢谢。

英文:

I'm trying to scrape data from a website that requires user/password login using go. With python this is simple using requests lib:

import requests

session = requests.Session()
session.post(&quot;https://site.com/login&quot;, data={ &#39;username&#39;: &#39;user&#39;, &#39;password&#39;: &#39;123456&#39; })

# access URL that requires authentication
resp = session.get(&#39;https://site.com/restricted/url&#39;)

What is a simple way to accomplish the same thing with golang? thanks.

答案1

得分: 6

创建一个自定义的HTTP Client 实例，并将一个 cookie jar 附加到它上面。

英文:

Create a custom HTTP Client instance and attach a cookie jar to it.

答案2

得分: 2

我写了一个名为Colly的爬虫框架，它可以直接处理HTTP会话。你可以通过以下方式实现上述功能：

c := colly.NewCollector()
c.Post("https://example.com/login", map[string]string{"user": "x", "pass": "y"})

你可以在GitHub上找到这段代码。还有一个完整的处理身份验证的示例也是可用的。

英文:

I wrote a scraping framework called Colly which handles HTTP sessions out of the box. You can achieve the mentioned functionality similarly:

c := colly.NewCollector()
c.Post(&quot;https://example.com/login&quot;, map[string]string{&quot;user&quot;: &quot;x&quot;, &quot;pass&quot;: &quot;y&quot;})

The code can be found on GitHub.
A complete example of handling authentications is also available.

通过集体智慧和协作来改善编程学习和解决问题的方式。致力于成为全球开发者共同参与的知识库，让每个人都能够通过互相帮助和分享经验来进步。

How do I maintain logged in session with golang for scraping?

问题

答案1

答案2

在golang/mongodb聚合查询中，复合字面量中缺少类型。

How to use an alternate go.mod file for local development?

如何在Golang的单元测试中测试net.Conn？

Golang的jsonrpc2服务器在哪里监听？

What's the correct way to type hint an empty list as a literal in python?

如何在Highcharts Gantt中更改本地化的星期名称

如何在同一个流中使用多个过滤器和映射函数？

如何使用Map/Set来将代码优化到O(n)？

.NET MAUI Android在GitHub Actions上构建失败，错误代码为1。

如何在Playwright视觉比较中屏蔽多个定位器？

在C++中，可以使用可变模板参数来检索类型的内部类型。

selenium.common.exceptions.StaleElementReferenceException: Message: stale element reference: stale element not found

Creating and opening a URL to log in to Website via Basic Auth with Robot Framework/Selenium (Python)

AG Grid 在上下文菜单中以大文本形式打开

发表评论