Details
-
Type:
Bug
-
Status: Closed
-
Priority:
Major
-
Resolution: Fixed
-
Affects Version/s: 6.2.5
-
Fix Version/s: 6.3
-
Component/s: Core/Parsing
-
Labels: None
-
Environment: any
-
Support Case References: Support Case 14183 - https://icesoft.my.salesforce.com/5000g00001ljXSF?srPos=0&srKp=500
Description
The PDF in question plots out text one character at a time, which seems to be giving our auto space algorithm some grief.
Disabling auto space insertion with org.icepdf.core.views.page.text.autoSpace=false makes the text selection feel a little more natural, but the output has no spaces.
The test case is fairly simple from an encoding point of view, so it should be relatively easy to debug and will hopefully let us improve our auto space calculation.
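For anyone reproducing this, here is a minimal sketch of setting the workaround property from code. It assumes ICEpdf reads the property as a standard JVM system property before the document is parsed; the class name AutoSpaceConfig is hypothetical and not part of ICEpdf.

```java
public class AutoSpaceConfig {
    // Property name taken from this issue's description.
    public static final String AUTO_SPACE_PROPERTY =
            "org.icepdf.core.views.page.text.autoSpace";

    /**
     * Turns auto space insertion off. Assumption: this must run before
     * any ICEpdf class reads the property, e.g. first thing in main().
     */
    public static void disableAutoSpace() {
        System.setProperty(AUTO_SPACE_PROPERTY, "false");
    }

    public static void main(String[] args) {
        disableAutoSpace();
        // Echo the effective value so the setting can be confirmed in logs.
        System.out.println(AUTO_SPACE_PROPERTY + "="
                + System.getProperty(AUTO_SPACE_PROPERTY));
    }
}
```

The same value can be passed on the command line with -Dorg.icepdf.core.views.page.text.autoSpace=false, which avoids any ordering concerns about when the property is read.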
When the '6' before "TECC_.." is parsed and sorted, it is getting a bunch of trailing spaces for some reason. When "_ISC" is selected, we are actually selecting the spaces that sit on top of the text. When the whole line is selected, all of the text is selected.
[6, , TECC_ISC_-, , hTECC, , ISC, , , Re]
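To make the failure mode easier to reason about, here is a rough sketch of a gap-based space heuristic for text plotted one character at a time. It is not ICEpdf's actual algorithm: the Glyph class, the joinWithSpaces method and the gapFactor parameter are all hypothetical. It emits at most one space per gap, and only when the horizontal gap between two neighbouring glyphs exceeds a fraction of the average glyph width.

```java
import java.util.ArrayList;
import java.util.Comparator;
import java.util.List;

public class GapSpaceSketch {
    /** Hypothetical glyph: one character, its left edge and its advance width. */
    public static final class Glyph {
        final String text;
        final double x;
        final double width;

        public Glyph(String text, double x, double width) {
            this.text = text;
            this.x = x;
            this.width = width;
        }
    }

    /**
     * Joins the glyphs of a single line in left-to-right order, inserting one
     * space wherever the gap to the previous glyph is larger than
     * gapFactor * average glyph width.
     */
    public static String joinWithSpaces(List<Glyph> glyphs, double gapFactor) {
        if (glyphs.isEmpty()) {
            return "";
        }
        // Sort a copy by x so the caller's list is left untouched.
        List<Glyph> sorted = new ArrayList<>(glyphs);
        sorted.sort(Comparator.comparingDouble(g -> g.x));

        double avgWidth = sorted.stream()
                .mapToDouble(g -> g.width)
                .average()
                .orElse(0.0);

        StringBuilder out = new StringBuilder(sorted.get(0).text);
        for (int i = 1; i < sorted.size(); i++) {
            Glyph prev = sorted.get(i - 1);
            Glyph cur = sorted.get(i);
            double gap = cur.x - (prev.x + prev.width);
            if (gap > gapFactor * avgWidth) {
                out.append(' '); // a single space per gap, never a run of them
            }
            out.append(cur.text);
        }
        return out.toString();
    }

    public static void main(String[] args) {
        List<Glyph> line = List.of(
                new Glyph("T", 12, 5),
                new Glyph("6", 0, 5),
                new Glyph("E", 17, 5),
                new Glyph("C", 22, 5));
        System.out.println(joinWithSpaces(line, 0.5)); // prints "6 TEC"
    }
}
```

Because the space is decided per glyph pair, it never becomes a separate positioned word, so a heuristic of this shape could not produce the stacked empty entries seen in the word list above.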