ICEpdf
  1. ICEpdf
  2. PDF-1305

Viewer RI 6.3.2 fails to find text with specially crafted whitespaces

    Details

    • Type: Bug Bug
    • Status: Open
    • Priority: Major Major
    • Resolution: Unresolved
    • Affects Version/s: 6.3.2
    • Fix Version/s: None
    • Component/s: Viewer RI
    • Labels:
      None
    • Environment:
      Windows 7 x64, Oracle JDK 8u60

      Description

      I observed many PDFs in our application, which contain WordText objects consisting of several whitespaces. The problem with those whitespaces is they prevent a text from being hit.

      Unfortunately, I cannot upload those PDFs because of copyright restrictions, but I managed to find equivalent PDF in the Internet: https://ipres2017.jp/wp-content/uploads/35Michelle-Lindlar.pdf
      I attached it to this ticket, see 35Michelle-Lindlar.pdf.

      This looks like a regression since 6.2.2, see comparison screenshot attached.

      Steps to reproduce:
      1) Open attached PDF in Viewer RI.
      2) Navigate to Search pane (Ctrl + F).
      3) Type "test-set for PDF validation", click "Search" button and observe results.

      Actual behavior:
      a) nothing is highlighted on page 1 (see comparison.png attached).
      b) "Searched 11 pages (0 matches)" text appears at the bottom of search pane

      Expected behavior:
      a) "test-set for PDF validation" should be highlighted on page 1
      b) "Searched 11 pages (1 match)" text should appear at the bottom of search pane
      1. 35Michelle-Lindlar.pdf
        153 kB
        Yauheni Sidarenka
      1. comparison.png
        462 kB

        Activity

        There are no comments yet on this issue.

          People

          • Assignee:
            Patrick Corless
            Reporter:
            Yauheni Sidarenka
          • Votes:
            0 Vote for this issue
            Watchers:
            1 Start watching this issue

            Dates

            • Created:
              Updated: