2020年1月3日 14:17:46go评论130阅读模式

英文:

How click the parent class if the child href matches the requirement

问题

I've tired to find the right selenium code to get click the main parent class if the following requirements exist in the class :

Parent Class
<div class ="col-xs-2-4 shopee-search-item-result__item" data-sqe="item">
Child class
<a data-sqe="link" href= all urls that is printed in python.>
Child class contains this element
<div class = "_1gkBDw _2O43P5">
<div class = "_1HvBLA">
<div class = "_3ao649" data-sqe="ad"> Ad</div>

Here is the code bellow

from selenium import webdriver
from selenium.webdriver.common.action_chains import ActionChains
import time
from selenium.webdriver.common.action_chains import ActionChains
from selenium.webdriver.support.ui import WebDriverWait
from selenium.webdriver.common.by import By
from selenium.webdriver.support import expected_conditions as EC
from selenium.webdriver.common.keys import Keys
import csv
import time
url = 'https://shopee.com.my/search?keyword=mattress'
driver = webdriver.Chrome(executable_path=r'E:/users/Francabicon/Desktop/Bots/others/chromedriver.exe')
driver.get(url)
time.sleep(0.8)
# select language
driver.find_element_by_xpath('//div[@class="language-selection__list"]/button').click()
time.sleep(3)
# scroll few times to load all items 
for x in range(10):
    driver.execute_script("window.scrollBy(0,300)")
    time.sleep(0.1)
# get all links (without clicking)
all_items = driver.find_elements_by_xpath('//a[@data-sqe="link"]')
print('len:', len(all_items))
all_urls = []
j = 0
k = 45
for item in all_items:
    url = item.get_attribute('href')
    all_urls.append(url)
print(all_urls)
a= len(all_urls)
# now use links
i = 0
while i <= 4 :
    WebDriverWait(driver, 20).until(EC.element_to_be_clickable((By.XPATH, "//div[@class='col-xs-2-4 shopee-search-item-result__item' and @data-sqe='item']//a[@class='link' and @href= all_urls[i]]"))).click()
    i+=1

I've tried to locate:
-Div the whole class
-locate classes and the href individually
-click the first five columns
but it all always fails.

Traceback (most recent call last):
  File "E:/Users/Asashin/Desktop/Bots/click test 7.py", line 52, in <module>
    WebDriverWait(driver, 20).until(EC.element_to_be_clickable((By.XPATH, "//div[@class='col-xs-2-4 shopee-search-item-result__item' and @data-sqe='item']//a[@class='link' and @href= all_urls[i]]"))).click()
  File "C:\Users\User\AppData\Local\Programs\Python\Python37-32\lib\site-packages\selenium\webdriver\support\wait.py", line 80, in until
    raise TimeoutException(message, screen, stacktrace)
selenium.common.exceptions.TimeoutException: Message:

Can I be solved?

英文:

Hi I've tired to find the right selenium code to get click the main parent class if the following requirements exist in the class :

Parent Class
&lt;div class =&quot;col-xs-2-4 shopee-search-item-result__item&quot; data-sqe=&quot;item&quot;&gt;
Child class
&lt;a data-sqe=&quot;link&quot; href= all urls that is printed in python.&gt;
Child class contains this element
&lt;div class = &quot;_1gkBDw _2O43P5&quot;&gt;
&lt;div class = &quot;_1HvBLA&quot;&gt;
&lt;div class = &quot;_3ao649&quot; data-sqe=&quot;ad&quot;&gt; Ad&lt;/div&gt;

Here is the code bellow

from selenium import webdriver
from selenium.webdriver.common.action_chains import ActionChains
import time
from selenium.webdriver.common.action_chains import ActionChains
from selenium.webdriver.support.ui import WebDriverWait
from selenium.webdriver.common.by import By
from selenium.webdriver.support import expected_conditions as EC
from selenium.webdriver.common.keys import Keys
import csv
import time
url = &#39;https://shopee.com.my/search?keyword=mattress&#39;
driver = webdriver.Chrome(executable_path=r&#39;E:/users/Francabicon/Desktop/Bots/others/chromedriver.exe&#39;)
driver.get(url)
time.sleep(0.8)
# select language
driver.find_element_by_xpath(&#39;//div[@class=&quot;language-selection__list&quot;]/button&#39;).click()
time.sleep(3)
# scroll few times to load all items 
for x in range(10):
    driver.execute_script(&quot;window.scrollBy(0,300)&quot;)
    time.sleep(0.1)
# get all links (without clicking)
all_items = driver.find_elements_by_xpath(&#39;//a[@data-sqe=&quot;link&quot;]&#39;)
print(&#39;len:&#39;, len(all_items))
all_urls = []
j = 0
k = 45
for item in all_items:
    url = item.get_attribute(&#39;href&#39;)
    all_urls.append(url)
print(all_urls)
a= len(all_urls)
# now use links
i = 0
while i &lt;= 4 :
    WebDriverWait(driver, 20).until(EC.element_to_be_clickable((By.XPATH, &quot;//div[@class=&#39;col-xs-2-4 shopee-search-item-result__item&#39; and @data-sqe=&#39;item&#39;]//a[@class=&#39;link&#39; and @href= all_urls[i]]&quot;))).click()
    i+=1

I've tried to locate:
-Div the whole class
-locate classes and the href individualy
-click the first five columns
but it all always fails.

Traceback (most recent call last):
  File &quot;E:/Users/Asashin/Desktop/Bots/click test 7.py&quot;, line 52, in &lt;module&gt;
    WebDriverWait(driver, 20).until(EC.element_to_be_clickable((By.XPATH, &quot;//div[@class=&#39;col-xs-2-4 shopee-search-item-result__item&#39; and @data-sqe=&#39;item&#39;]//a[@class=&#39;link&#39; and @href= all_urls[i]]&quot;))).click()
  File &quot;C:\Users\User\AppData\Local\Programs\Python\Python37-32\lib\site-packages\selenium\webdriver\support\wait.py&quot;, line 80, in until
    raise TimeoutException(message, screen, stacktrace)
selenium.common.exceptions.TimeoutException: Message:

Can I be solved?

答案1

得分: 1

我已经做了一些更改。

当您获取href值时，您获取的是完整的URL，而不是在DOM中看到的URL，所以您需要删除前面的部分以便以后验证。
在最后的while循环中，all_urls[i]是一个变量，您需要将它传递为变量而不是字符串。
一旦您点击每个链接，您需要使用driver.back()返回到父页面。

from selenium import webdriver
from selenium.webdriver.support.ui import WebDriverWait
from selenium.webdriver.common.by import By
from selenium.webdriver.support import expected_conditions as EC
import time
url = 'https://shopee.com.my/search?keyword=mattress'
driver = webdriver.Chrome(executable_path=r'E:/users/Francabicon/Desktop/Bots/others/chromedriver.exe')
driver.get(url)
# 选择语言
WebDriverWait(driver, 5).until(EC.element_to_be_clickable((By.XPATH, '//div[@class="language-selection__list"]/button'))).click()
time.sleep(3)
# 滚动几次以加载所有项目
for x in range(10):
    driver.execute_script("window.scrollBy(0,300)")
    time.sleep(0.1)
# 获取所有链接（不点击）
all_items = driver.find_elements_by_xpath('//a[@data-sqe="link"]')
print('len:', len(all_items))
all_urls = []
j = 0
k = 45
for item in all_items:
    # 这将给您锚标签的完整URL
    url = item.get_attribute('href')
    # 为了以后点击验证href，您需要删除前面的部分
    urlfinal = url.split('https://shopee.com.my')[1]
    all_urls.append(urlfinal)
print(all_urls)
a = len(all_urls)
# 现在使用链接
i = 0
while i <= 4:
    # 通过子标签识别父标签，使用以下XPath。
    WebDriverWait(driver, 20).until(EC.element_to_be_clickable((By.XPATH, "//div[@class='col-xs-2-4 shopee-search-item-result__item' and @data-sqe='item'][.//a[@data-sqe='link' and @href='" + all_urls[i] + "']]"))).click()
    driver.back()
    i += 1

英文:

I have made couple of changes.

>1. When you are fetching href values you are getting complete url and not the url you are seeing in DOM so you need to remove the preceding values in order to verify later.
>
>2. In the last while loop all_urls[i] is variable you need passed it as variable not string.
>
>3. Once you click each link you need to come back to the parent page again by using driver.back()

from selenium import webdriver
from selenium.webdriver.support.ui import WebDriverWait
from selenium.webdriver.common.by import By
from selenium.webdriver.support import expected_conditions as EC
import time
url = &#39;https://shopee.com.my/search?keyword=mattress&#39;
driver = webdriver.Chrome(executable_path=r&#39;E:/users/Francabicon/Desktop/Bots/others/chromedriver.exe&#39;)
driver.get(url)
# select language
WebDriverWait(driver,5).until(EC.element_to_be_clickable((By.XPATH,&#39;//div[@class=&quot;language-selection__list&quot;]/button&#39;))).click()
time.sleep(3)
# scroll few times to load all items
for x in range(10):
    driver.execute_script(&quot;window.scrollBy(0,300)&quot;)
    time.sleep(0.1)
# get all links (without clicking)
all_items = driver.find_elements_by_xpath(&#39;//a[@data-sqe=&quot;link&quot;]&#39;)
print(&#39;len:&#39;, len(all_items))
all_urls = []
j = 0
k = 45
for item in all_items:
    # This give you whole url of the anchor tag
    url = item.get_attribute(&#39;href&#39;)
    # You need to remove the preceding values in order to verify href later for clicking
    urlfinal=url.split(&#39;https://shopee.com.my&#39;)[1]
    all_urls.append(urlfinal)
print(all_urls)
a= len(all_urls)
# now use links
i = 0
while i &lt;= 4 :
    #Identify the parent tag by child tag use following Xpath.
    WebDriverWait(driver, 20).until(EC.element_to_be_clickable((By.XPATH, &quot;//div[@class=&#39;col-xs-2-4 shopee-search-item-result__item&#39; and @data-sqe=&#39;item&#39;][.//a[@data-sqe=&#39;link&#39; and @href=&#39;&quot; + all_urls[i] +&quot;&#39;]]&quot;))).click()
    driver.back()
    i+=1

答案2

得分: 0

如何获取父元素：

child_element = driver.find_element_by_xpath('//a[@data-sqe="link"]')
parent_element = child_element.find_element_by_xpath('./ancestor::divcontains(@class, "shopee-search-item-result__item")')

如何获取具有特定子元素的元素：

element = driver.find_element_by_xpath('div[contains(@class, "shopee-search-item-result__item") and .//a[@data-sqe="link"]]')

英文:

How to get parent element:

child_element = driver.find_element_by_xpath(&#39;//a[@data-sqe=&quot;link&quot;]&#39;)
parent_element = child_element.find_element_by_xpath(&#39;./ancestor::div[contains(@class, &quot;shopee-search-item-result__item&quot;)][1]&#39;)

How to get element with specific child:

element = driver.find_element_by_xpath(&#39;div[contains(@class, &quot;shopee-search-item-result__item&quot;) and .//a[@data-sqe=&quot;link&quot;]]&#39;)

通过集体智慧和协作来改善编程学习和解决问题的方式。致力于成为全球开发者共同参与的知识库，让每个人都能够通过互相帮助和分享经验来进步。

如何点击父类如果子类的href与要求匹配

问题

答案1

答案2

Is it possible to specify a foreign key for a table field in Django models.py to any table from the database?

HTML: 使用一个字符串变量来改变文本块中的数值

将一个列表拆分成具有重叠元素的元组列表。

如何在Python中绘制netCDF海冰范围数据集？

如何在Playwright视觉比较中屏蔽多个定位器？

在C++中，可以使用可变模板参数来检索类型的内部类型。

selenium.common.exceptions.StaleElementReferenceException: Message: stale element reference: stale element not found

Creating and opening a URL to log in to Website via Basic Auth with Robot Framework/Selenium (Python)

AG Grid 在上下文菜单中以大文本形式打开

What's the correct way to type hint an empty list as a literal in python?

如何在Highcharts Gantt中更改本地化的星期名称

如何在同一个流中使用多个过滤器和映射函数？

如何使用Map/Set来将代码优化到O(n)？

.NET MAUI Android在GitHub Actions上构建失败，错误代码为1。