PDF -> ePUB: deleting <br>s Best Practices

Dear All,

I'm new to Calibre, but those of you who are not surely know the problem of broken lines when converting PDF to ePUB: <br> codes appear wherever they like and split the text into thousands of fragments, which looks weird.

This article (https://dearauthor.com/ebooks/calibr...nversion-tips/) suggests using Heuristic Processing during conversion to get rid of the <br>s, but it didn't work for me: I tried values from 0.4 to 0.6 with absolutely no result.

The same article proposes using the Search & Replace function, and that was the solution in my case! I used the following expression: \. +<br>(*SKIP)(*FAIL)|\<br>|\d +<br>

I assumed that <br>s after a dot (".") mark an author-defined start of a new passage, so I didn't touch them (\. +<br>(*SKIP)), while standalone <br>s (\<br>) and <br>s that follow any word (\d +<br>) were replaced with nothing (= deleted), as they almost always broke a sentence into useless fragments.
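To make the keep-or-delete logic concrete, here is a minimal sketch in Python. It assumes the line breaks are literal `<br>` tags, and it uses a different technique from the expression above: Python's standard `re` module has no `(*SKIP)(*FAIL)`, so the protected case is captured in a group and put back, while every other `<br>` is replaced with nothing. The function name is made up for the example.

```python
import re

# Assumption: line breaks appear as literal "<br>" tags in the converted HTML.
# Alternative 1 captures a protected ".<br>"; alternative 2 is any other <br>.
KEEP_OR_DROP = re.compile(r"(\.<br>)|<br>")


def drop_breaks(html: str) -> str:
    """Delete every <br> except one that directly follows a full stop."""
    # If group 1 matched, restore it unchanged; otherwise substitute nothing.
    return KEEP_OR_DROP.sub(lambda m: m.group(1) or "", html)


if __name__ == "__main__":
    print(drop_breaks("One sen<br>tence.<br>Next<br> one"))
```

Because the protected alternative comes first, a `<br>` after a dot is consumed together with the dot and can never be reached by the second alternative, which is the same effect the skip construct is meant to achieve.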

Everything would have been perfectly fine except for one thing: the above algorithm also deletes the "useful" <br>s after headlines, which are usually marked with a formatting tag of their own (THIS IS HEADLINE<br>), and after paragraphs (chapters?), which are marked with an <a id> anchor (<a id="p8"></a><br>).

So what I need is to add exceptions to my algorithm so that <br>s are not deleted when they follow </a> or the headline's closing tag. I have played around with quite a number of variants but still can't find my Grail. Possibly the (*SKIP)(*FAIL) construct does not allow multiple skip conditions: I already skip one pattern from the very beginning and want to add two more, so three in total.
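One possible way to express several exceptions at once is a chain of negative lookbehinds, one per protected ending, instead of the skip construct. The sketch below is in Python and rests on two assumptions: the line-break tag is `<br>`, and `</b>` is only a stand-in for whatever closing tag the headlines really use. The names are hypothetical.

```python
import re

# Assumptions: "<br>" is the line-break tag, and "</b>" is a placeholder
# for the headline's real closing tag, which may differ in your file.
PROTECTED_ENDINGS = [".", "</a>", "</b>"]

# Build one fixed-width negative lookbehind per protected ending, giving:
# (?<!\.)(?<!</a>)(?<!</b>)<br>
GUARDS = "".join(f"(?<!{re.escape(ending)})" for ending in PROTECTED_ENDINGS)
UNWANTED_BREAK = re.compile(GUARDS + "<br>")


def drop_unwanted_breaks(html: str) -> str:
    """Delete each <br> unless it directly follows a protected ending."""
    return UNWANTED_BREAK.sub("", html)


if __name__ == "__main__":
    sample = 'THIS IS HEADLINE</b><br><a id="p8"></a><br>A broken<br> sentence.<br>Next'
    print(drop_unwanted_breaks(sample))
```

Adding a fourth exception only means adding one more entry to the list. The resulting pattern is plain lookbehind syntax, so it should also be usable directly in a Search & Replace field with an empty replacement, though that has not been checked against Calibre here.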

Any thoughts?
