Extract text from an Italian PDF file into plaintext

121.0 USD

peopleperhour Writing & Translation Overseas
401 days ago

Description

I have a PDF file of a book in Italian (748 pages) with 50 chapters (an average of 14 pages per chapter). I need some, but not all, of the text on each page extracted (copied and pasted) into 50 plaintext files: one .txt file per chapter, one sentence per line.
I will provide you with one PDF file in Italian, and you will deliver 50 plaintext files (.txt) containing the text of the 50 chapters, one sentence per line.
Important: pay close attention to the accents, as they matter in Italian: "é" is not the same as "e". All accents from the original PDF must be preserved in the plaintext files.
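As an illustration of the accent requirement, here is a minimal Python sketch of one way to check that accented letters survive the copy. Text copied from a PDF sometimes stores "é" as a plain "e" followed by a separate combining accent, so the sketch first normalizes to Unicode NFC. The function names `normalize_accents` and `accents_preserved` are made up for this example and are not part of the job specification.

```python
import unicodedata


def normalize_accents(text: str) -> str:
    """Return text in NFC form, so 'e' + combining accent becomes one 'é'."""
    return unicodedata.normalize("NFC", text)


def accents_preserved(original: str, extracted: str) -> bool:
    """Check that both strings contain the same accented letters, in order.

    A letter counts as accented here if Unicode defines a decomposition
    for it (for example 'é' decomposes into 'e' plus a combining accent).
    """
    def accented_letters(s: str) -> list:
        return [
            ch for ch in normalize_accents(s)
            if ch.isalpha() and unicodedata.decomposition(ch)
        ]

    return accented_letters(original) == accented_letters(extracted)
```

A check like this only compares accented letters, so it is a quick sanity test rather than a full proofread of the extracted text.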
A sample page is attached. The parts highlighted in yellow need to be extracted, while the rest of the text (the phonetic spelling underneath every line of Italian) is to be ignored. Not all pages have the phonetic spelling; on pages without it, all of the text needs to be extracted.
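The required output format (one sentence per line, one .txt file per chapter) could be produced along the lines of the following Python sketch. It assumes the Italian text of a chapter has already been copied out of the PDF, with the phonetic lines left out by hand. The names `split_sentences` and `write_chapter`, and the file naming pattern `chapter_01.txt`, are assumptions for this example. The splitting rule is a simple heuristic and will wrongly split after abbreviations such as "Sig.", so the result still needs a manual check.

```python
import re
from pathlib import Path

# Heuristic: a sentence ends at . ! ? or … when followed by whitespace and
# then an uppercase letter (including accented capitals) or an opening quote.
_SENTENCE_END = re.compile(r'(?<=[.!?…])\s+(?=[A-ZÀÈÉÌÒÙ«"“])')


def split_sentences(text: str) -> list[str]:
    """Collapse line breaks and runs of spaces, then split into sentences."""
    flat = " ".join(text.split())
    return [s.strip() for s in _SENTENCE_END.split(flat) if s.strip()]


def write_chapter(text: str, chapter_number: int, out_dir: str) -> Path:
    """Write one chapter as a UTF-8 .txt file, one sentence per line."""
    folder = Path(out_dir)
    folder.mkdir(parents=True, exist_ok=True)
    path = folder / f"chapter_{chapter_number:02d}.txt"
    # Explicit UTF-8 keeps accented characters intact on every platform.
    path.write_text("\n".join(split_sentences(text)) + "\n", encoding="utf-8")
    return path
```

Writing the files with an explicit UTF-8 encoding avoids depending on the operating system's default encoding, which is a common way for accented characters to get corrupted.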
