ICEpdf / PDF-886

Text selection/extraction geometric space issue

    Details

    • Type: Bug
    • Status: Closed
    • Priority: Major
    • Resolution: Fixed
    • Affects Version/s: 5.1.2
    • Fix Version/s: 6.0.1
    • Component/s: Core/Parsing
    • Labels:
      None
    • Environment:
      any

      Description

      This support request came in through the forums. The PDF in question uses an unusual coordinate space both for glyph space and for the glyph-space-to-user-space transform. As a result, the computed glyph bounds are not in the correct coordinate space, and our text sorting feature breaks each letter into its own word, which breaks text search and selection in general.
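
      To illustrate the failure mode described above, here is a minimal sketch. It is not ICEpdf's actual code: the class `GlyphBoundsSketch`, its methods, and the gap threshold are all hypothetical. It assumes a word-grouping heuristic that compares the horizontal gap between consecutive glyph bounds. When the text matrix maps the glyph advance onto the user-space y axis, every glyph ends up stacked at the same x range, so such a heuristic sees a break after every letter.

```java
import java.awt.geom.AffineTransform;
import java.awt.geom.Rectangle2D;
import java.util.ArrayList;
import java.util.List;

// Hypothetical illustration only; none of these names exist in ICEpdf.
class GlyphBoundsSketch {

    /** Maps a glyph-space box to user space and returns its axis-aligned bounds. */
    public static Rectangle2D toUserSpace(Rectangle2D glyphBox, AffineTransform textMatrix) {
        return textMatrix.createTransformedShape(glyphBox).getBounds2D();
    }

    /** Lays out n glyphs side by side along the glyph-space x axis, then transforms them. */
    public static List<Rectangle2D> layout(int n, double advance, double height,
                                           AffineTransform textMatrix) {
        List<Rectangle2D> bounds = new ArrayList<>();
        for (int i = 0; i < n; i++) {
            Rectangle2D glyphBox = new Rectangle2D.Double(i * advance, 0, advance, height);
            bounds.add(toUserSpace(glyphBox, textMatrix));
        }
        return bounds;
    }

    /** Naive word grouping that assumes the baseline runs along user-space x. */
    public static int countWords(List<Rectangle2D> bounds, double maxGap) {
        int words = bounds.isEmpty() ? 0 : 1;
        for (int i = 1; i < bounds.size(); i++) {
            double gap = bounds.get(i).getMinX() - bounds.get(i - 1).getMaxX();
            if (Math.abs(gap) > maxGap) {
                words++; // treated as a word break
            }
        }
        return words;
    }

    public static void main(String[] args) {
        AffineTransform horizontal = new AffineTransform(); // identity
        // Constructor order is (m00, m10, m01, m11, m02, m12): x' = -y, y' = x.
        AffineTransform alongY = new AffineTransform(0, 1, -1, 0, 0, 0);

        System.out.println(countWords(layout(4, 5, 10, horizontal), 1.0));
        System.out.println(countWords(layout(4, 5, 10, alongY), 1.0));
    }
}
```

      With the identity matrix the four glyphs form a single word; with the rotated matrix the same four glyphs are counted as four separate words.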

        Activity

        Patrick Corless created issue -
        Patrick Corless made changes -
        Field Original Value New Value
        Fix Version/s 5.2.1 [ 12071 ]
        Patrick Corless added a comment -

        The circumstances are very similar to PDF-854: the text is written along the y axis instead of the x axis, which is what our text extraction and text selection code assumes.

        Adjusted the calculations to account for the -y shear.
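
        One way such an adjustment could look is sketched below. This is an assumption for illustration, not the change committed to LineText.java or GlyphText.java: the class `BaselineNormalizer` and its methods are hypothetical. The idea is to read the baseline direction from the text matrix and, when it runs along y, rotate the user-space bounds back so that code assuming an x-axis baseline sees the layout it expects.

```java
import java.awt.geom.AffineTransform;
import java.awt.geom.Point2D;
import java.awt.geom.Rectangle2D;

// Hypothetical illustration only; not the actual ICEpdf fix.
class BaselineNormalizer {

    /** Baseline direction in user space: the image of the glyph-space unit x vector. */
    public static Point2D baselineDirection(AffineTransform textMatrix) {
        return textMatrix.deltaTransform(new Point2D.Double(1, 0), null);
    }

    /** True when the baseline is closer to the user-space y axis than to the x axis. */
    public static boolean runsAlongY(AffineTransform textMatrix) {
        Point2D d = baselineDirection(textMatrix);
        return Math.abs(d.getY()) > Math.abs(d.getX());
    }

    /** Rotates user-space bounds so that the baseline points along +x. */
    public static Rectangle2D normalize(Rectangle2D userBounds, AffineTransform textMatrix) {
        Point2D d = baselineDirection(textMatrix);
        double angle = Math.atan2(d.getY(), d.getX());
        AffineTransform undo = AffineTransform.getRotateInstance(-angle);
        return undo.createTransformedShape(userBounds).getBounds2D();
    }

    public static void main(String[] args) {
        // Constructor order is (m00, m10, m01, m11, m02, m12): x' = y, y' = -x,
        // so the glyph advance runs down the user-space y axis.
        AffineTransform negativeY = new AffineTransform(0, -1, 1, 0, 0, 0);
        // User-space bounds of the second glyph (advance 5, height 10).
        Rectangle2D second = new Rectangle2D.Double(0, -10, 10, 5);

        System.out.println(runsAlongY(negativeY));
        System.out.println(normalize(second, negativeY));
    }
}
```

        After normalization the glyph bounds are again laid out left to right, so comparisons of horizontal gaps between neighbouring glyphs behave as they do for ordinary horizontal text.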

        Repository Revision Date User Message
        ICEsoft Public SVN Repository #46215 Thu Nov 12 10:34:13 MST 2015 patrick.corless PDF-886 adjustments for text extraction coordinates system that are none standard.
        Files Changed
        MODIFY /icepdf/branches/icepdf-6.0.0_P01/icepdf/core/src/org/icepdf/core/pobjects/graphics/text/LineText.java
        MODIFY /icepdf/branches/icepdf-6.0.0_P01/icepdf/core/src/org/icepdf/core/pobjects/graphics/text/GlyphText.java
        Repository Revision Date User Message
        ICEsoft Public SVN Repository #46216 Thu Nov 12 10:34:25 MST 2015 patrick.corless PDF-886 adjustments for text extraction coordinates system that are none standard.
        Files Changed
        MODIFY /icepdf/trunk/icepdf/core/src/org/icepdf/core/pobjects/graphics/text/GlyphText.java
        MODIFY /icepdf/trunk/icepdf/core/src/org/icepdf/core/pobjects/graphics/text/LineText.java
        Patrick Corless added a comment -

        Marking as fixed.

        Patrick Corless made changes -
        Status Open [ 1 ] Resolved [ 5 ]
        Resolution Fixed [ 1 ]
        Patrick Corless made changes -
        Status Resolved [ 5 ] Closed [ 6 ]

          People

          • Assignee:
            Patrick Corless
            Reporter:
            Patrick Corless
          • Votes:
            0
            Watchers:
            1

            Dates

            • Created:
              Updated:
              Resolved: