Web scraping is a widely used technique that has been around for a couple of decades. We typically scrape with a programming language such as Python, using libraries like Beautiful Soup, Selenium, or Scrapy, or with R and the rvest package. Other web scraping tools are also available. The article Web Scraping using Beautiful Soup gives a detailed explanation of how to scrape with Python. Building on that, this article looks at XPath and how to use it with Selenium in Python to navigate an HTML document and reach the target nodes.

Our target website: The Telegraph, a very popular Indian English-language daily newspaper, at https://www.telegraphindia.com/

What library are we using: We are using the robust Selenium WebDriver to navigate the Telegraph website with XPath.

What is XPath: XPath stands for XML Path Language. It is a widely used web technology and a W3C standard. Concise yet powerful XPath expressions let us access precise information in an XML/HTML document.

Understanding the structure of XPath: First of all, let us see how we can find the XPath of […]
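To give a rough feel for what an XPath expression looks like, here is a minimal sketch that runs one against a small, made-up snippet of markup. It uses Python's built-in xml.etree.ElementTree module, which supports only a limited subset of XPath, so that it runs without a browser. The markup, the class name "headlines", and the helper headline_links are assumptions made for illustration; they do not reflect the actual structure of the Telegraph website.

```python
import xml.etree.ElementTree as ET

# Hypothetical, well-formed markup standing in for a news page.
# It is NOT the real structure of telegraphindia.com.
SAMPLE_PAGE = """
<html>
  <body>
    <div class="headlines">
      <a href="/india/story-1">First headline</a>
      <a href="/world/story-2">Second headline</a>
    </div>
    <div class="footer">
      <a href="/about">About us</a>
    </div>
  </body>
</html>
"""


def headline_links(markup):
    """Return (text, href) pairs for links inside the 'headlines' div."""
    root = ET.fromstring(markup.strip())
    # Reading the expression left to right:
    #   .//div                 any <div> at any depth below the root
    #   [@class='headlines']   keep only those whose class attribute matches
    #   /a                     then take their direct <a> children
    nodes = root.findall(".//div[@class='headlines']/a")
    return [(node.text, node.get("href")) for node in nodes]


links = headline_links(SAMPLE_PAGE)
for text, href in links:
    print(text, href)
```

With Selenium, a similar expression (usually written with a leading // rather than .//) would be passed to driver.find_element(By.XPATH, ...) or driver.find_elements(By.XPATH, ...), which evaluate it against the live page in the browser and support the full XPath syntax.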
The post How to use XPath for Web Scraping with Selenium appeared first on Ivy Professional School | Official Blog.