Channel: MobileRead Forums - Conversion

soft hyphens in docx conversion output

Soft hyphens (U+00AD characters, or the entities &#173; or &shy;) that exist in the source HTML are exported to docx unchanged, as literal shy characters (code 00AD).

This is not quite the desired behaviour, because MS Word implements optional word breaks differently, and the 00AD characters themselves are simply displayed (visually similar to standard hyphens).

An exported docx document containing shy characters can be repaired by searching for them (using the code ^0173) and replacing them, either with Word's "optional hyphen" (^-) or (mostly, in my case) by deleting them, i.e. replacing them with nothing.
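
The same repair could probably be scripted outside Word. Below is a minimal Python sketch, not anything calibre provides: the function name `fix_soft_hyphens` and its `use_word_hyphen` flag are made up for illustration. It assumes the XML parts under word/ are UTF-8 encoded and that the soft hyphens are stored as literal U+00AD characters inside `<w:t>` text elements (not as numeric entities, and not in attribute values). By default it deletes them; optionally it swaps each one for the WordprocessingML `<w:softHyphen/>` element, which is how an optional hyphen is stored in a docx.

```python
import zipfile

SHY = "\u00ad"  # soft hyphen character
# Closes the current text element, emits an optional-hyphen element and
# reopens a text element. Assumption: the character sits inside <w:t>.
SOFT_HYPHEN_XML = '</w:t><w:softHyphen/><w:t xml:space="preserve">'


def fix_soft_hyphens(src_path, dst_path, use_word_hyphen=False):
    """Copy a docx, removing or converting U+00AD in the word/*.xml parts."""
    replacement = SOFT_HYPHEN_XML if use_word_hyphen else ""
    with zipfile.ZipFile(src_path) as src, \
         zipfile.ZipFile(dst_path, "w", zipfile.ZIP_DEFLATED) as dst:
        for item in src.infolist():
            data = src.read(item.filename)
            if item.filename.startswith("word/") and item.filename.endswith(".xml"):
                # Assumption: these parts are UTF-8 encoded.
                text = data.decode("utf-8")
                data = text.replace(SHY, replacement).encode("utf-8")
            # All other parts (images, fonts, ...) are copied unchanged.
            dst.writestr(item, data)
```

Usage would be something like `fix_soft_hyphens("book.docx", "book-fixed.docx")` to delete the characters, or the same call with `use_word_hyphen=True` to convert them. It writes a new file rather than modifying the original, so the result can be checked in Word first.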

Anyway: is this export behaviour intentional? Or is it, maybe, for some reason inevitable? Is there any way to have the shy characters replaced with MS Word's "optional hyphen" as part of the conversion?
