Fix error in Python-based web scraper wit GUI

300.0 USD

300.0 USD peopleperhour 技术与编程 海外
23天前

详细信息

Hello Freelancers,I'm searching for a developer familiar with web scraping and Python to fix an existing web scraper which scrapes product data from products from category links from italian ecommerce-website www.yeppon.it
The script basically works, but it gives an error on certain points when scraping, I think because of a light change in the structure of the website which causes an error when the script tries to scrape a products text description.
Goal of this project is to fix the errors so the script works like it used to again, scraping data of products from given category-URLs from the website and giving out the data in csv-files. I think this won't be too much of an effort because it is basically this one error which needs to be located and fixed, everything else still seems to work fine. Price can be discussed.
Some facts:
1. The web scraper is based on Python with a GUI. It's final version comes as an exe file (therefore I can't attach it in the project description, I will send it in the messages or work stream).2. It scrapes certain product data (like product name, price, description, image links) by category links which can be entered into the GUI. The GUI also has some input fields, these are just for fixed strings which can be entered into the fields and will be given out in the CSV files the script gives the product data in.3. The scraper technically still works, however, it gives an error when scraping certain categories. You can check this by running the tool, filling out the given input fields with the data explained in the "Instructions" tab of the tool and then start scraping. It will produce this error (can be found in the log file):
--------------------------------------------------------------------------------------------------------------2024-04-24 12:44:24,765:ERROR:'descriptionHtml'Traceback (most recent call last): File "async_scraper.py", line 739, in scrape description_html = pdata["pageProps"]["product"][ ^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^KeyError: 'descriptionHtml'2024-04-24 12:44:24,765:INFO: Finally2024-04-24 12:44:24,765:INFO:
No data found!
------------------------------------------------------------------------------------------------------------------4. The error seems to occur when scraping a products text description. The text description consists of three possible elements:- bulletpoints formatted into an ul element- a text description which is cleaned/has HTML code removed/replaced- scraping data from a table on the website and putting it into a given HTML structure

It was developed by a freelancer from PPH for a colleague of mine, unfortunately I can't reach my colleague for quite some time now to ask for all the details or the freelancers name, so I will post this to the public.
Scraping some categories will result in the error mentioned above, for example:https://www.yeppon.it/c/elettrodomestici/grandi-elettrodomestici/asciugabiancheriaorhttps://www.yeppon.it/c/elettrodomestici/grandi-elettrodomestici/frigoriferi
Others work just fine, like:https://www.yeppon.it/c/telefonia/smartphone/smart-phone

I will attach the files I have about this project from my colleague. As I can't attach exe or rar files, I attached:
- a first version of the Python code (it is a beta version which will give another error which is solved in the final exe file and not the final code, just to give you an impression), as well as the code of the GUI and the requirements. These are async_scraper.txt, gui.txt and requirements.txt

免责声明

该外包需求信息来源于站外平台,本站仅提供公开信息部分字段展示与订阅服务,更多请查看免责声明

关注公众号,不定期副业成功案例分享
关注公众号

不定期副业成功案例分享

领先一步获取最新的外包任务吗?

立即订阅

类似推荐

I aim to retain the services of an experienced programmer to develop a unique Expert Advisor capable of integrating data from TradingView into MetaTrader 5. The Expert Advisor will need to scrape values displayed on a particular indicator loaded within TradingView and utilize this information to place trades through MetaTrader 5's built-in automation functionality. Specifically, the indicator will provide entry price points, stop losses, take profit levels one and two, and the Expert Advisor must be engineered to recognize these parameters, executing trades directly or through pending orders once triggers are reached. An input-controlled trailing stop mechanism should also be incorporated to lock in profits after the first take profit level is hit. Additionally, flexibility is desired through inputs defining trade size either as a fixed monetary value or percentage of equity. I also want a news filter added so that I can select to stop trading for a period of time (manual input by value in minutes) before and after red news events. I also want to be able to select the hours (by value input) that the EA trades The ideal candidate will possess extensive familiarity with both MetaTrader 5 and TradingView platforms, allowing seamless integration between the two. A robust understanding of MQL5 and experience architecting custom automated trading systems within MetaTrader 5 is essential. Capability in web-scraping, indicator interpretation, and order placement directly from third-party data are equally important qualifications. The programmer undertaking this project will design a sophisticated yet user-friendly Expert Advisor bridging the gap between TradingView's analysis capabilities and MetaTrader 5's execution functionality.
500.0 GBP 技术与编程 peopleperhour 海外
1天前
*Please no copy and paste replies to this job offer, all replies without reading it thoroughly will be declined straight away. I am looking for 10,000 Emails that i can contact Music artists directly or their team, im looking for independent artist not (adele,taylor swift level.) Id like the focus to be on music artist in the UK & USA it can be any genre but not Hip Hop ideally looking for artists in the following genres - Metal, Rock, Indie, Pop, funk, country anything that falls into this category and its sub genres would be awesome. we are looking to sell a service that helps push their music online, so artists with some music already released is ideal not any music artist without music and 0 following atleast 500 followers. Websites like - PlaylistPush, Groover, Sound Campaign, SubmitHub, Boost Collective. *these website have the exact customers we are looking for if we can somehow scrape their users and get their emails would be perfect. You can also look at websote for example Spotify which artist leave their email in the bio and you can scrape them from their also, this is the same for social media and youtube which music artist have channels wih their email available. i needs the excel to show ( Artist Name, Genre of music, email, social media link, follower count if possible (estimated) and anythign else you believe would add to the objective of the project. I understand the not all information required may not be present or not every company may not have a marketing contact etc, but we need to try and focus on the Music Artist that do. Flexible Budget. Warm Regards, Helder
250.0 GBP 技术与编程 peopleperhour 海外
1天前