Details
Type: Bug
Status: Closed
Priority: Major
Resolution: Fixed
Affects Version/s: 6.0.1, 6.0.2
Fix Version/s: 6.1.1
Component/s: Core/Parsing
Labels: None
Environment: All
Support Case References: Support Case #13657 - https://icesoft.my.salesforce.com/5007000001YEpXG
Description
When searching for text in the provided PDF file with the ICEpdf Viewer, the search highlight covers a larger area than the matched text.
In this selection case the content stream looks as follows:
/F1 12.84 Tf 0 1 -1 0 0 0 Tm 240.1 -698.6 TD[(R)]TJ
9.12 0 TD[(3)]TJ
-9.12 14.4 TD[(R)]TJ
9.12 0 TD[(4)]TJ
-9.12 14.4 TD[(R)]TJ
9.12 0 TD[(5)]TJ
-9.12 14.52 TD[(R)]TJ
9.12 0 TD[(6)]TJ
-9.12 14.4 TD[(R)]TJ
9.12 0 TD[(7)]TJ
-9.12 -72.24 TD[(R)]TJ
9.12 0 TD[(2)]TJ
-25.2 -28.08 TD[(R)]TJ
9.12 0 TD[(8)]TJ
6.96 129.2 TD[(R)]TJ
9.12 0 TD[(9)]TJ
We have code that tries to properly detect vertical writing, but in this case I think the issue is in new-line detection: the letters are plotted out one by one, so we need to look at Y to figure out whether a word break is needed.
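As a rough illustration of the Y-based check, here is a minimal sketch in Java. It is not the ICEpdf implementation: the class `WordBreakSketch`, the `ShowOp` type, the `groupWords` method, and the 0.25 × font size tolerance are all hypothetical names and values chosen for the example. It compares the TD offsets in text space, because with the text matrix `0 1 -1 0 0 0` a text-space point (x, y) maps to (-y, x) on the page, so a page-space Y comparison would see every glyph on a different "line".

```java
import java.util.ArrayList;
import java.util.Arrays;
import java.util.List;

// Hypothetical sketch; none of these names come from the ICEpdf code base.
final class WordBreakSketch {

    /** One "tx ty TD [(text)] TJ" pair, with offsets in text space. */
    static final class ShowOp {
        final double tx;
        final double ty;
        final String text;

        ShowOp(double tx, double ty, String text) {
            this.tx = tx;
            this.ty = ty;
            this.text = text;
        }
    }

    /**
     * Groups glyph runs into words. A new word starts whenever the
     * text-space Y offset of a TD exceeds a tolerance.
     */
    static List<String> groupWords(List<ShowOp> ops, double fontSize) {
        // Assumed threshold for illustration only, not a tuned value.
        double tolerance = 0.25 * fontSize;
        List<String> words = new ArrayList<>();
        StringBuilder current = new StringBuilder();
        for (ShowOp op : ops) {
            boolean lineChanged = Math.abs(op.ty) > tolerance;
            if (lineChanged && current.length() > 0) {
                // Y moved: close the word collected so far.
                words.add(current.toString());
                current.setLength(0);
            }
            current.append(op.text);
        }
        if (current.length() > 0) {
            words.add(current.toString());
        }
        return words;
    }

    /** The TD/TJ pairs from the content stream quoted in this issue. */
    static List<ShowOp> sampleOps() {
        return Arrays.asList(
            new ShowOp(240.1, -698.6, "R"), new ShowOp(9.12, 0, "3"),
            new ShowOp(-9.12, 14.4, "R"),   new ShowOp(9.12, 0, "4"),
            new ShowOp(-9.12, 14.4, "R"),   new ShowOp(9.12, 0, "5"),
            new ShowOp(-9.12, 14.52, "R"),  new ShowOp(9.12, 0, "6"),
            new ShowOp(-9.12, 14.4, "R"),   new ShowOp(9.12, 0, "7"),
            new ShowOp(-9.12, -72.24, "R"), new ShowOp(9.12, 0, "2"),
            new ShowOp(-25.2, -28.08, "R"), new ShowOp(9.12, 0, "8"),
            new ShowOp(6.96, 129.2, "R"),   new ShowOp(9.12, 0, "9"));
    }

    public static void main(String[] args) {
        // Prints the grouped words for the sample stream.
        System.out.println(groupWords(sampleOps(), 12.84));
    }
}
```

For the stream above this groups the glyphs as R3, R4, R5, R6, R7, R2, R8, R9, so a search hit on one label would only need to highlight that label's two glyphs. A real fix would likely also need a horizontal gap check for word breaks within a line, which this sketch leaves out.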