Skip to content

A python GUI web scraper made with beautiful soup and PyQt5.

License

Notifications You must be signed in to change notification settings

NathanCordeiro/WEB-SCRAPER

Repository files navigation


PYTHON WEB SCRAPER
A powerful and user-friendly web scraping tool built with Python beautiful soup and PyQt5.

About the Project

The Web Scraper Application is a versatile and intuitive tool designed to make web scraping tasks effortless. Built using Python and PyQt5, it offers a sleek interface coupled with powerful functionality, allowing users to easily scrape content from websites and save it for further analysis.


Features

  • User-Friendly Interface: The application features an intuitive user interface, making it easy for users of all skill levels to navigate and utilize its features.
  • Scraping Capabilities: Users can enter a URL and scrape the content of the corresponding webpage with a single click. The scraped content is displayed in real-time within the application.
  • HTML Content Saving: The application allows users to save the scraped HTML content to a text file for future reference or analysis.
  • Task Bar Navigation: With a built-in task bar, users can seamlessly switch between different functionalities such as scraping, viewing, and accessing the about section.

Getting Started

Prerequisites

  • Python 3.x
  • PyQt5 library
  • BeautifulSoup4 library
  • Requests library

Installation

  1. Clone the repository:
  2. git clone https://github.com/NathanCordeiro/WEB-SCRAPER.git
  3. Navigate to the project directory:
  4. cd WEB-SCRAPER
  5. Install the required dependencies:
  6. pip install -r requirements.txt

Usage

  1. Run the application by executing the main.py file.
  2. python main.py
  3. Enter the URL of the website you want to scrape in the designated input field.
  4. Click on the "Scrape" button to initiate the scraping process.
  5. The scraped HTML content will be displayed in the application's view section. Additionally, it will be saved to a text file named scraped_content.html.

Contributing

Contributions are what make the open-source community such an amazing place to learn, inspire, and create. Any contributions you make are greatly appreciated.

  1. Fork the Project
  2. Create your Feature Branch (git checkout -b feature/AmazingFeature)
  3. Commit your Changes (git commit -m 'Add some AmazingFeature')
  4. Push to the Branch (git push origin feature/AmazingFeature)
  5. Open a Pull Request

License

Distributed under the MIT License. See LICENSE for more information.


Contact

Nathan Cordeiro - nathanjohncordeiro@gmail.com

Project Link: https://github.com/NathanCordeiro/WEB-SCRAPER

About

A python GUI web scraper made with beautiful soup and PyQt5.

Topics

Resources

License

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published

Languages