2023年2月14日 02:33:27go评论88阅读模式

英文:

Finding an element on website using Selenium for Python that has newlines inside its class name

问题

I'm trying to scrape some data from LinkedIn but I noticed that the elements id change each time I load the page with Selenium. So I tried using class name to find all the elements but the class names have newline inside of them, preventing me from scraping the website.

example of class with newlines here

Website link example

I tried doing the below:

job_test = "ember-view   jobs-search-results__list-item occludable-update p0 relative scaffold-layout__list-item\n              
              
              "
job_list = driver.find_elements(By.CLASS_NAME, job_test)

I even tried this:

job_test = '''ember-view   jobs-search-results__list-item occludable-update p0 relative scaffold-layout__list-item
              
              
              '''
job_list = driver.find_elements(By.CLASS_NAME, job_test)

But it does not show me any elements when I print job_list. What do I do here?

英文:

example of class with newlines here

Website link example

I tried doing the below:

job_test = &quot;ember-view   jobs-search-results__list-item occludable-update p0 relative scaffold-layout__list-item\n              \n              \n              &quot;
job_list = driver.find_elements(By.CLASS_NAME, job_test)

I even tried this:

job_test = &#39;&#39;&#39;ember-view   jobs-search-results__list-item occludable-update p0 relative scaffold-layout__list-item
              
              
              &#39;&#39;&#39;
job_list = driver.find_elements(By.CLASS_NAME, job_test)

But it does not show me any elements when I print job_list. What do I do here?

答案1

得分: 2

By.CLASS_NAME 只接受一个类名，所以你不能传递多个类名。请参考：使用Selenium时出现无效选择器：不允许复合类名错误

解决方案

要创建作业列表，你需要使用WebDriverWait 来诱发 visibility_of_all_elements_located()，你可以使用以下任一定位策略：

使用 CLASS_NAME:

driver.get('https://www.linkedin.com/jobs/search/?currentJobId=3425809260&amp;keywords=python')
job_list = WebDriverWait(driver, 20).until(EC.visibility_of_all_elements_located((By.CLASS_NAME, "jobs-search-results__list-item")))

使用 CSS_SELECTOR:

driver.get('https://www.linkedin.com/jobs/search/?currentJobId=3425809260&amp;keywords=python')
job_list = WebDriverWait(driver, 20).until(EC.visibility_of_all_elements_located((By.CSS_SELECTOR, "li.jobs-search-results__list-item")))

使用 XPATH:

driver.get('https://www.linkedin.com/jobs/search/?currentJobId=3425809260&amp;keywords=python')
job_list = WebDriverWait(driver, 20).until(EC.visibility_of_all_elements_located((By.XPATH, "//li[contains(@class, 'jobs-search-results__list-item')]")))

英文:

By.CLASS_NAME accepts only one classname, so you can't pass multiple. See: Invalid selector: Compound class names not permitted error using Selenium

Solution

To create the job list you have to induce WebDriverWait for visibility_of_all_elements_located() and you can use either of the following locator strategies:

Using CLASS_NAME:

driver.get(&#39;https://www.linkedin.com/jobs/search/?currentJobId=3425809260&amp;keywords=python&#39;)
job_list = WebDriverWait(driver, 20).until(EC.visibility_of_all_elements_located((By.CLASS_NAME, &quot;jobs-search-results__list-item&quot;)))

Using CSS_SELECTOR:

driver.get(&#39;https://www.linkedin.com/jobs/search/?currentJobId=3425809260&amp;keywords=python&#39;)
job_list = WebDriverWait(driver, 20).until(EC.visibility_of_all_elements_located((By.CSS_SELECTOR, &quot;li.jobs-search-results__list-item&quot;)))

Using XPATH:

driver.get(&#39;https://www.linkedin.com/jobs/search/?currentJobId=3425809260&amp;keywords=python&#39;)
job_list = WebDriverWait(driver, 20).until(EC.visibility_of_all_elements_located((By.XPATH, &quot;//li[contains(@class, &#39;jobs-search-results__list-item&#39;)]&quot;)))

通过集体智慧和协作来改善编程学习和解决问题的方式。致力于成为全球开发者共同参与的知识库，让每个人都能够通过互相帮助和分享经验来进步。

使用Selenium for Python在网站上查找具有换行符的类名的元素

问题

答案1

解决方案

Solution

用纯JS从左到右调整框的宽度大小。

Python Tkinter的grid方法因某种原因未按预期工作。

你可以使用golang如何获取特定网站上卖家的名称？

如何为在[0,1]和[0,255]范围内归一化的图像添加图像保存功能？

如何在Playwright视觉比较中屏蔽多个定位器？

在C++中，可以使用可变模板参数来检索类型的内部类型。

selenium.common.exceptions.StaleElementReferenceException: Message: stale element reference: stale element not found

Creating and opening a URL to log in to Website via Basic Auth with Robot Framework/Selenium (Python)

AG Grid 在上下文菜单中以大文本形式打开

What's the correct way to type hint an empty list as a literal in python?

如何在Highcharts Gantt中更改本地化的星期名称

如何在同一个流中使用多个过滤器和映射函数？

如何使用Map/Set来将代码优化到O(n)？

.NET MAUI Android在GitHub Actions上构建失败，错误代码为1。