I have a PDF file of a book in Italian (748 pages) with 50 chapters (average of 14 pages per chapter), from which I need some (not all) text on the page to be extracted (copied & pasted) into 50 plaintext files, one sentence per line, one txt file per chapter.
I will provide you with 1 PDF file in Italian, and you will need to provide me with 50 plaintext files (.txt) containing the text from the 50 chapters, one sentence per line.
Important: you will need to pay attention to the accents as they are important in Italian - "é" is not the same as "e"; all accents from the original PDF need to be preserved in the plaintext files.
A sample page is attached - the parts highlighted in yellow need to be extracted, while the rest of the text (the phonetic spelling underneath every line of Italian) is to be ignored. Not all pages have the phonetic spelling, in which case all of the text needs to be extracted.