Python web crawler

Run the program

Mar 20,22

To run the program, set the PROJECT_NAME and HOMEPAGE variables in the main.py file. For example, to …

Create workers 2

Mar 20,22

In this section we will write the code for our workers. We will create worker threads that will be configured …
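As a rough illustration of the worker-thread idea, here is a minimal sketch that assumes a shared queue.Queue of URLs and a callback that crawls a single page. The names create_workers, handle, and NUMBER_OF_THREADS are hypothetical and are not taken from the tutorial's code.

```python
import threading
from queue import Queue

NUMBER_OF_THREADS = 4  # hypothetical default, not from the tutorial


def create_workers(job_queue, handle, count=NUMBER_OF_THREADS):
    """Start `count` daemon threads that pull URLs from job_queue."""

    def work():
        while True:
            url = job_queue.get()      # blocks until a job is available
            try:
                handle(url)            # e.g. crawl one page
            finally:
                job_queue.task_done()  # lets job_queue.join() return

    threads = []
    for _ in range(count):
        thread = threading.Thread(target=work, daemon=True)
        thread.start()
        threads.append(thread)
    return threads
```

Daemon threads are used here so that the workers do not keep the program alive once the main thread finishes; calling job_queue.join() waits until every queued URL has been handled.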

Create workers 1

Mar 20,22

In this section we will write the code that will enable multiple spiders to run at the same time and …

Creating the spider

Mar 20,22

In this section we will write code that will ask a user to enter two parameters: the name of …

Parse domain names

Mar 20,22

In this chapter we are going to write two functions in a new .py file. The functions will extract the …
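One possible shape for such a pair of functions is sketched below, using the standard library's urllib.parse. The function names and the "last two labels" rule are assumptions made for illustration; that rule is naive and gives the wrong answer for suffixes such as .co.uk.

```python
from urllib.parse import urlparse


def get_sub_domain_name(url):
    """Return the full host name (e.g. 'mail.example.com'), or '' if none."""
    try:
        return urlparse(url).hostname or ''
    except ValueError:
        # urlparse raises ValueError for some malformed URLs
        return ''


def get_domain_name(url):
    """Return the last two labels of the host (e.g. 'example.com')."""
    labels = get_sub_domain_name(url).split('.')
    if len(labels) < 2:
        return ''
    return labels[-2] + '.' + labels[-1]
```

A crawler can compare the result of get_domain_name against the domain of the homepage to decide whether a link belongs to the site being crawled.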

Add links to queue

Mar 20,22

After we gather the links from a webpage, we need to add them to the queue so they can be …
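The queueing step might look roughly like the sketch below. It assumes the queue and the list of already crawled pages are kept as Python sets, and that links outside the target domain are skipped; the function name, the parameters, and the domain filter are hypothetical choices, not the tutorial's own code.

```python
from urllib.parse import urlparse


def add_links_to_queue(links, queue, crawled, domain_name):
    """Add new, same-domain links to the queue set (modified in place)."""
    for url in links:
        # Skip links that are already waiting or already crawled
        if url in queue or url in crawled:
            continue
        # Assumption: stay on the target site and ignore other domains
        host = urlparse(url).hostname or ''
        if host != domain_name and not host.endswith('.' + domain_name):
            continue
        queue.add(url)
```

Using sets keeps the membership checks cheap and guarantees that a URL is never queued twice.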

Geek University 2022