Beautiful Soup 4 Tutorial #1 - Web Scraping With Python

0h 17m video Transcribed Jun 30, 2026 Watch on YouTube ↗

Beginner 12 min read For: Python beginners interested in learning web scraping with Beautiful Soup.

557.8K

Views

12.3K

Likes

363

Comments

145

Dislikes

2.3%

📈 Moderate

AI Summary

This tutorial introduces Beautiful Soup 4, a Python library for web scraping and HTML parsing. It covers installation, reading local HTML files, modifying tags, and fetching web pages using the requests library. The video also demonstrates a practical example of extracting GPU prices from a website.

Chapters

1 Introduction and Installation 00:00 2 Reading Local HTML 02:40 3 Finding and Modifying Tags 05:07 4 Sponsor Break 09:13 5 Reading HTML from the Web 09:42 6 Extracting Price Data 12:40 7 Conclusion 16:23

[00:12]

What Beautiful Soup Does

Beautiful Soup allows extracting information from HTML documents and modifying them programmatically using Python.

[01:55]

Installation via pip

Install Beautiful Soup 4 using 'pip install beautifulsoup4'. Alternative commands include 'pip3 install beautifulsoup4' or 'python -m pip install beautifulsoup4'.

[03:08]

Reading a Local HTML File

Use 'with open("index.html", "r") as f: soup = BeautifulSoup(f, "html.parser")' to read and parse a local HTML file.

[05:07]

Accessing Tags by Name

Access the first occurrence of a tag using 'soup.tagname' (e.g., 'soup.title'). Use '.string' to get or modify the text inside a tag.

[07:29]

Finding Multiple Tags

Use 'soup.find_all("tagname")' to get all tags of a given type. Use 'soup.find("tagname")' to get only the first match.

[09:42]

Installing the Requests Library

Install the requests library with 'pip install requests' to fetch web pages.

[10:58]

Fetching HTML from a Website

Send a GET request with 'requests.get(url)' and access the HTML content via 'result.text'. Then parse it with BeautifulSoup.

[13:08]

Searching for Specific Text

Use 'soup.find_all(text="$")' to find all occurrences of a dollar sign. Then navigate to the parent tag to extract the full price.

[15:22]

Extracting Nested Data

Use '.parent' to move up the parse tree, then '.find("strong")' to locate the price tag, and '.string' to get the numeric value.

Clickbait Check

95% Legit

"The title accurately describes the tutorial content; it's a genuine introduction to Beautiful Soup 4 for web scraping."

Mentioned in this Video

Beautiful Soup 4

tool

requests

tool

AlgoExpert

service

Beautiful Soup Documentation

link

GitHub Repository with Code

link

Tutorial Checklist

1 01:55 Install Beautiful Soup 4: pip install beautifulsoup4

2 02:55 Import BeautifulSoup: from bs4 import BeautifulSoup

3 03:08 Open local HTML file: with open('index.html', 'r') as f: soup = BeautifulSoup(f, 'html.parser')

4 04:39 Print prettified HTML: print(soup.prettify())

5 05:50 Access first tag by name: tag = soup.title

6 06:29 Get text inside tag: tag.string

7 06:46 Modify text inside tag: tag.string = 'new text'

8 07:47 Find all tags of a type: soup.find_all('p')

9 09:42 Install requests: pip install requests

10 10:58 Fetch webpage: result = requests.get(url); html = result.text

11 11:16 Parse HTML from string: soup = BeautifulSoup(html, 'html.parser')

12 13:22 Find text occurrences: soup.find_all(text='$')

13 15:05 Get parent tag: parent = prices[0].parent

14 16:03 Find nested tag and get string: strong = parent.find('strong'); price = strong.string

Study Flashcards (12)

How do you install Beautiful Soup 4?

easy Click to reveal answer

pip install beautifulsoup4