PYTHON WEB SCRAPER
A powerful and user-friendly web scraping tool built with Python beautiful soup and PyQt5.
The Web Scraper Application is a versatile and intuitive tool designed to make web scraping tasks effortless. Built using Python and PyQt5, it offers a sleek interface coupled with powerful functionality, allowing users to easily scrape content from websites and save it for further analysis.
- User-Friendly Interface: The application features an intuitive user interface, making it easy for users of all skill levels to navigate and utilize its features.
- Scraping Capabilities: Users can enter a URL and scrape the content of the corresponding webpage with a single click. The scraped content is displayed in real-time within the application.
- HTML Content Saving: The application allows users to save the scraped HTML content to a text file for future reference or analysis.
- Task Bar Navigation: With a built-in task bar, users can seamlessly switch between different functionalities such as scraping, viewing, and accessing the about section.
- Python 3.x
- PyQt5 library
- BeautifulSoup4 library
- Requests library
- Clone the repository:
- Navigate to the project directory:
- Install the required dependencies:
git clone https://github.com/NathanCordeiro/WEB-SCRAPER.git
cd WEB-SCRAPER
pip install -r requirements.txt
- Run the application by executing the
main.py
file. - Enter the URL of the website you want to scrape in the designated input field.
- Click on the "Scrape" button to initiate the scraping process.
- The scraped HTML content will be displayed in the application's view section. Additionally, it will be saved to a text file named
scraped_content.html
.
python main.py
Contributions are what make the open-source community such an amazing place to learn, inspire, and create. Any contributions you make are greatly appreciated.
- Fork the Project
- Create your Feature Branch (
git checkout -b feature/AmazingFeature
) - Commit your Changes (
git commit -m 'Add some AmazingFeature'
) - Push to the Branch (
git push origin feature/AmazingFeature
) - Open a Pull Request
Distributed under the MIT License. See LICENSE for more information.
Nathan Cordeiro - nathanjohncordeiro@gmail.com
Project Link: https://github.com/NathanCordeiro/WEB-SCRAPER