ICEpdf / PDF-384

PDF to PNG conversion results in text misplaced and overlapping

    Details

    • Type: Improvement
    • Status: Closed
    • Priority: Major
    • Resolution: Fixed
    • Affects Version/s: 4.3
    • Fix Version/s: 4.3.4
    • Component/s: Core/Parsing
    • Labels:
      None
    • Environment:
      Linux, Java 6

      Description

      Converting the attached PDF results in text that looks like the screenshot. This issue actually exists in earlier versions of ICE (4.2). I should have logged it last year.

      This is not a high priority for us but I'm logging it so you are aware of it. I think we can work around it by restructuring the document but I'm not certain.
      1. Dental+Enrollment+Declining-January+2011.pdf
        155 kB
        David Martineau
      2. MNDA Ciriello Investments.pdf
        25 kB
        Contractpal Inc.
      1. screenshot-1.jpg
        140 kB

        Activity

        Patrick Corless added a comment -

        Definitely some sort of text layout interpretation issue.

        There is a strange (()0.0000 between the and P. I suspect this 0.000 offset is being handled incorrectly. It shouldn't be too difficult to correct the issue.

        Contractpal Inc. added a comment -

        Here is another PDF that results in the same problem: skewed text.

        Evgheni Sadovoi added a comment - - edited

        Looks like PDF-474 to me

        Patrick Corless added a comment - - edited

        Very rare corner case where the content stream has empty strings defined within a TJ string draw operation. When an empty string was encountered, we were jumping to a zero advance instead of keeping the previous advance.
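
        To illustrate the corner case described above, here is a minimal, self-contained sketch of laying out a TJ array. It is not ICEpdf's actual code: the class name `TjAdvanceSketch`, the `layout` method, the `resetOnEmpty` flag, the fixed glyph width, and the sample array are all assumptions made for illustration. With the flag set, an empty string resets the running advance to zero, so the next string is drawn on top of the earlier text. With the flag cleared, the empty string is skipped and the previous advance is kept.

```java
import java.util.ArrayList;
import java.util.Arrays;
import java.util.List;

public class TjAdvanceSketch {

    // Hypothetical fixed glyph width in text-space units; real fonts vary per glyph.
    static final double GLYPH_WIDTH = 0.5;

    /**
     * Walks a simplified TJ array and returns the x offset at which each
     * non-empty string starts. Elements are either a String (text to show)
     * or a Number (adjustment in thousandths of a text-space unit, which is
     * subtracted from the current position).
     */
    static List<Double> layout(List<Object> tjArray, double fontSize, boolean resetOnEmpty) {
        List<Double> starts = new ArrayList<Double>();
        double advance = 0.0;
        for (Object element : tjArray) {
            if (element instanceof Number) {
                advance -= ((Number) element).doubleValue() / 1000.0 * fontSize;
            } else if (element instanceof String) {
                String text = (String) element;
                if (text.isEmpty()) {
                    if (resetOnEmpty) {
                        advance = 0.0; // faulty variant: jump to a zero advance
                    }
                    continue; // otherwise the previous advance is left untouched
                }
                starts.add(advance);
                advance += text.length() * GLYPH_WIDTH * fontSize;
            }
        }
        return starts;
    }

    public static void main(String[] args) {
        // Hypothetical array: a string, an empty string, a zero adjustment, a string.
        List<Object> tj = Arrays.<Object>asList("the", "", 0.0, "P");
        System.out.println(layout(tj, 10.0, false)); // prints [0.0, 15.0]
        System.out.println(layout(tj, 10.0, true));  // prints [0.0, 0.0]
    }
}
```

        In the second call both strings start at offset 0.0, which matches the overlapping text seen in the screenshot.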


          People

          • Assignee:
            Patrick Corless
            Reporter:
            David Martineau
          • Votes:
            0
            Watchers:
            1
