
Python Yellow Page Scraper: How to scrape Yellow Pages

Cem Dilmegani
updated on Sep 17, 2025

Yellow pages provide easy access to a variety of services and businesses that may not all show up in a Google search. Search engines rank results by relevance to the search term, whereas online yellow pages list results by geographic area. If you want better local reach for your business, it can be helpful to advertise in the online yellow pages for your region.

This article will guide you through the process of building your own web scraper to collect data from Yellow Pages directories.

How to scrape data from Yellow Pages with Python

Step 1: Install required libraries

Before we begin, you need to install the necessary Python libraries. Open your terminal or command prompt and run the following command:

pip install requests beautifulsoup4 pandas

This command installs requests for making HTTP requests, BeautifulSoup for parsing HTML, and pandas for data manipulation and export.

Step 2: Define the target URL

The first step in any scraping project is to identify the target URL. For YellowPages.az, the URLs are quite straightforward.

For example, searching for “OTEL” (the Azerbaijani word for “hotel”) and sorting the results by rating would use this URL:

https://yellowpages.az/results/?search-title=OTEL&sort-by=ratings

Here:

  • search-title=OTEL defines the keyword we are searching for.
  • sort-by=ratings sorts the results based on their ratings.
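
You can also let requests build the query string from a dictionary instead of assembling the URL by hand. A minimal sketch (parameter names taken from the URL above):

import requests

# requests encodes the params dict into the query string for us
params = {"search-title": "OTEL", "sort-by": "ratings"}
r = requests.get("https://yellowpages.az/results/", params=params)
print(r.url)  # -> https://yellowpages.az/results/?search-title=OTEL&sort-by=ratings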

Step 3: Fetch HTML content

Once you have your target URL, the next step is to download its HTML content. The requests library in Python is perfect for this. To ensure our request mimics a real browser and avoids potential blocking, we’ll also include a User-Agent header.

import requests
from bs4 import BeautifulSoup

url = "https://yellowpages.az/results/?search-title=OTEL&sort-by=ratings"
headers = {"User-Agent": "Mozilla/5.0"}
r = requests.get(url, headers=headers)
soup = BeautifulSoup(r.text, "html.parser")

  • r.text contains the full HTML source code of the fetched page.
  • BeautifulSoup(r.text, "html.parser") transforms this raw HTML into a structured, navigable object, allowing us to easily search for and extract specific elements like business names, addresses, or phone numbers.
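
In practice, it is worth guarding this request against slow or failed responses. A hedged variant of the fetch above that adds a timeout and an HTTP status check, both standard requests features:

import requests
from bs4 import BeautifulSoup

url = "https://yellowpages.az/results/?search-title=OTEL&sort-by=ratings"
headers = {"User-Agent": "Mozilla/5.0"}

# timeout stops the request from hanging indefinitely;
# raise_for_status() turns 4xx/5xx responses into exceptions
r = requests.get(url, headers=headers, timeout=10)
r.raise_for_status()
soup = BeautifulSoup(r.text, "html.parser")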

Step 4: Extract business details

4.1 Locating data with the inspect tool

Before writing any extraction code, we need to understand how the data is organized within the website’s HTML. This crucial step is performed using your browser’s “Inspect” tool (e.g., Chrome DevTools, Firefox Developer Tools).

Right-click a business listing on the results page and choose “Inspect” to see exactly which elements hold the name, address, and contact details you want to extract.

4.2 Extracting the Data with BeautifulSoup

With the knowledge of where each data point resides, we can now programmatically extract it. On YellowPages.az, each business listing is neatly encapsulated within a <div> element carrying the card-recording class.

We will iterate through these card-recording elements and, for each card, collect the following information:

  • Business name
  • Address
  • Phone numbers
  • Email
  • Website
  • Rating and number of votes

data = []
for card in soup.select(".card-recording"):
    name_tag = card.select_one("h4 a")
    name = name_tag.get_text(strip=True) if name_tag else None

    address = None
    address_tag = card.select_one(".fa-map-marker")
    if address_tag:
        address_block = address_tag.find_parent("div", class_="contact-block")
        if address_block:
            address = address_block.get_text(" ", strip=True)

    phones = []
    phone_tag = card.select_one(".fa-phone")
    if phone_tag:
        phone_block = phone_tag.find_parent("div", class_="contact-block")
        if phone_block:
            phones = [p.get_text(strip=True) for p in phone_block.select("a")]

    email = None
    email_tag = card.select_one(".fa-envelope")
    if email_tag:
        email_block = email_tag.find_parent("div", class_="contact-block")
        if email_block:
            link = email_block.select_one("a[href^='mailto:']")
            if link:
                email = link.get_text(strip=True)

    website = None
    web_tag = card.select_one(".fa-desktop")
    if web_tag:
        web_block = web_tag.find_parent("div", class_="contact-block")
        if web_block:
            link = web_block.select_one("a[href^='http']")
            if link:
                website = link["href"]

    rating_val = card.select_one('[itemprop="ratingValue"]')
    rating_count = card.select_one('[itemprop="ratingCount"]')

    data.append({
        "name": name,
        "address": address,
        "phones": "; ".join(phones) if phones else None,
        "email": email,
        "website": website,
        "rating": rating_val.get_text(strip=True) if rating_val else None,
        "votes": rating_count.get_text(strip=True) if rating_count else None
    })
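
The four contact lookups above repeat the same icon-then-parent pattern, so they can be collapsed into a small helper. A sketch of that refactor, using the same selectors as the loop above (behavior unchanged):

def contact_block(card, icon_selector):
    """Return the div.contact-block containing the given icon, or None."""
    icon = card.select_one(icon_selector)
    return icon.find_parent("div", class_="contact-block") if icon else None

# Inside the loop above, the address lookup then becomes:
block = contact_block(card, ".fa-map-marker")
address = block.get_text(" ", strip=True) if block else None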

Step 5: Save data to CSV

After the data is extracted and stored in our data list (a list of dictionaries), the next logical step is to convert it into a structured format and export the data.

We’ll use the pandas library for this, which excels at handling tabular data and can easily export it to a CSV file.

import pandas as pd

df = pd.DataFrame(data)
df.to_csv("hotels.csv", index=False, encoding="utf-8-sig")
print("✅ hotels.csv created")
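
The utf-8-sig encoding adds a byte-order mark so that Excel displays non-ASCII characters (common in Azerbaijani business names) correctly. If you need another format, the same DataFrame exports just as easily; for example:

df.to_json("hotels.json", orient="records", force_ascii=False, indent=2)
df.to_excel("hotels.xlsx", index=False)  # requires the openpyxl package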

Step 6: Filtering with URL parameters

YellowPages.az, like many other websites, uses URL query parameters to control how search results are displayed. By strategically adjusting these parameters, you can customize your search and data extraction.

Sorting Options (sort-by)

  • asc: Sorts results in alphabetical order (A → Z).
  • desc: Sorts results in reverse alphabetical order (Z → A).
  • ratings: Sorts results by their rating value.

Search Options (search-by)

  • brand: Searches specifically by business or brand name.
  • category: Searches within defined business categories.
  • numbers: Searches by phone numbers.

For example:

https://yellowpages.az/results/?search-title=OTEL&search-by=brand&sort-by=asc

This specific query will search for hotels by their brand name and then sort the results alphabetically.
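
These parameters combine freely, so you can sweep several variants in one run. An illustrative sketch that fetches the brand search under each sort order (reusing the headers from Step 3):

import requests

headers = {"User-Agent": "Mozilla/5.0"}
for sort_by in ("asc", "desc", "ratings"):
    params = {"search-title": "OTEL", "search-by": "brand", "sort-by": sort_by}
    r = requests.get("https://yellowpages.az/results/", headers=headers,
                     params=params, timeout=10)
    print(sort_by, r.status_code)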

Step 7: Pagination

One of the most common challenges in web scraping is dealing with pagination. YellowPages.az, like most directories, splits its search results across multiple pages.

If you only scrape the first page, you’ll miss a significant portion of the data. Proper pagination handling ensures that your scraping tool collects results from all available pages.

There are two primary methods to handle pagination on YellowPages.az:

7.1 Easy method (URL pattern)

The most straightforward approach is to observe how the page number changes in the URL.

  • Page 1: https://yellowpages.az/results/?search-title=OTEL&sort-by=ratings
  • Page 2: https://yellowpages.az/results/page/2/?search-title=OTEL&sort-by=ratings
  • Page 3: https://yellowpages.az/results/page/3/?search-title=OTEL&sort-by=ratings

This clear pattern allows your data scraper to loop through pages by simply inserting the page number into the URL. With this method, you control how many pages to scrape by adjusting the range in the for loop.

Example snippet:

for page in range(1, 6):  # scrapes pages 1 through 5
    # note: /page/1/ typically redirects to the base results URL on WordPress sites
    url = f"https://yellowpages.az/results/page/{page}/?search-title=OTEL&sort-by=ratings"
    r = requests.get(url, headers=headers)
    soup = BeautifulSoup(r.text, "html.parser")
    cards = soup.select(".card-recording")
    if not cards:
        break  # Exit loop if no more business cards are found (end of results)
    # Extract data here (integrate Step 4.2 code)
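
Putting the pieces together, here is a hedged end-to-end sketch of the URL-pattern approach. It adds a short delay between requests to stay polite; the cards list stands in for the full Step 4.2 field extraction:

import time
import requests
from bs4 import BeautifulSoup

headers = {"User-Agent": "Mozilla/5.0"}
all_cards = []

for page in range(1, 6):  # pages 1 through 5
    url = f"https://yellowpages.az/results/page/{page}/?search-title=OTEL&sort-by=ratings"
    r = requests.get(url, headers=headers, timeout=10)
    soup = BeautifulSoup(r.text, "html.parser")
    cards = soup.select(".card-recording")
    if not cards:
        break  # no listings means we walked past the last page
    all_cards.extend(cards)  # run the Step 4.2 extraction on each card here
    time.sleep(1)  # pause between pages to avoid hammering the server

print(f"Collected {len(all_cards)} listings")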

7.2 DOM navigation

YellowPages.az also includes a navigation bar in its HTML, typically structured like this:

<div class="wp-pagenavi" role="navigation">
    <a class="page larger" href=".../page/3/?search-title=OTEL&sort-by=ratings">3</a>
    <a class="page larger" href=".../page/4/?search-title=OTEL&sort-by=ratings">4</a>
</div>

Example snippet with a limit:

# Initial scrape to get the first page and its pagination links
# (Assuming 'soup' is already populated from the first page)
pages = [a["href"] for a in soup.select(".wp-pagenavi a.page")]

# Process first page data here before looping through discovered pages
# ... (integrate Step 4.2 code for the initial 'soup') ...

for url in pages[:10]: # Scrapes up to 10 discovered additional pages
    r = requests.get(url, headers=headers)
    soup = BeautifulSoup(r.text, "html.parser")
    # Extract data here (integrate Step 4.2 code)
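
One caveat with DOM-discovered links: wp-pagenavi can repeat the same href across several anchors, so it is worth deduplicating while preserving order before looping:

pages = list(dict.fromkeys(pages))  # drop duplicate hrefs, keep first-seen order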

💡 Conclusion

Scraping YellowPages.az with Python offers a flexible approach to systematically collecting structured business data. By combining:

  • requests to efficiently fetch HTML content,
  • BeautifulSoup to parse and extract specific details,
  • pandas to organize the extracted information into a tabular format and export it,
  • URL parameters (search-by, sort-by) to filter and sort results according to your needs,
  • and pagination handling to capture data from all available pages,

you can build a repeatable pipeline that turns raw listing pages into a clean CSV dataset.


