How to get data from a webpage in Python

1 min

One way to get data from a webpage in Python is to use the requests library to send an HTTP request to the URL of the webpage you want to access, and then use the beautifulsoup4 library to parse and extract the data from the HTML or XML that the webpage returns. Here is an example of how you might use these libraries to get the title of a webpage:

import requests
from bs4 import BeautifulSoup

url = 'https://www.example.com'
response = requests.get(url)
soup = BeautifulSoup(response.text, 'html.parser')
title = soup.find('title').text
print(title)

Another way is to use the pandas library which has a read_html() method that can scrape tables from html pages and returns a list of dataframe.

import pandas as pd
tables = pd.read_html("https://www.example.com")

You could also use a headless browser like Selenium to scrape dynamic webpages which are rendered by JavaScript.