Skip to content

dorlevi28/URLscraper

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

4 Commits
 
 
 
 
 
 

Repository files navigation

URLscraper

This is a simple, preety basic URL scrapper using BeatifulSoup package in Python.

The scraper extract and parses all HTML elements from all given URL that are websites.txt.

We use depth argument to indicate depth for the parsing process.

We extract the anchor element() with an existing href.

Save all URL's to a dataframe and write it to txt file.

About

No description, website, or topics provided.

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

 
 
 

Contributors

Languages