ICEpdf
  1. ICEpdf
  2. PDF-438

Extracting text from document doesn't work properly.

    Details

    • Type: Bug Bug
    • Status: Closed
    • Priority: Major Major
    • Resolution: Fixed
    • Affects Version/s: 4.3.2
    • Fix Version/s: 4.3.4
    • Component/s: Core/Parsing
    • Labels:
      None
    • Environment:
      ICEpdf PRO 4.3.2, ICEpdf Viewer

      Description

      While extracting text from attached document I have found that line:

      "last flight (if one was defined for that flight). Regardless of the data,"

      consists of 2 LineText objects:
      1. "last flight (if one was" and
      2. "defined for that flight). Regardless of the data,".

      It looks like space between words "was" and "defined" is missing so if I would search for the word "defined" you will not find it.

      Adding space manualy between LineText objects causes problem in different line:

      "airport reference point latitude/longitude position shows adjacent to the".

      It consists of:
      1. "airport reference p" and
      2. "oint latitude/longitude position shows adjacent to the".

      If I put space between them I will get "airport reference p oint latitude/longitude position shows adjacent to the" and searching for a word "point" fails.
      1. example.pdf
        47 kB
        Evgheni Sadovoi

        Activity

        Evgheni Sadovoi created issue -
        Evgheni Sadovoi made changes -
        Field Original Value New Value
        Attachment example.pdf [ 14459 ]
        Evgheni Sadovoi made changes -
        Salesforce Case [5007000000MGD1l]
        Patrick Corless made changes -
        Fix Version/s 5.0 [ 10314 ]
        Patrick Corless made changes -
        Fix Version/s 4.3.3 [ 10333 ]
        Fix Version/s 5.0 [ 10314 ]
        Patrick Corless made changes -
        Fix Version/s 4.3.4 [ 10341 ]
        Fix Version/s 4.3.3 [ 10333 ]
        Patrick Corless made changes -
        Status Open [ 1 ] Resolved [ 5 ]
        Resolution Fixed [ 1 ]
        Patrick Corless made changes -
        Status Resolved [ 5 ] Closed [ 6 ]

          People

          • Assignee:
            Patrick Corless
            Reporter:
            Evgheni Sadovoi
          • Votes:
            0 Vote for this issue
            Watchers:
            0 Start watching this issue

            Dates

            • Created:
              Updated:
              Resolved: