ICEpdf / PDF-745

Filter/remove duplicate text in files generated by Crystal Reports

    Details

    • Type: Bug
    • Status: Closed
    • Priority: Major
    • Resolution: Fixed
    • Affects Version/s: 5.0.6
    • Fix Version/s: 5.0.6_P01, 5.0.7
    • Component/s: Core/Parsing
    • Labels:
      None
    • Environment:
      any

      Description

      We've been given a few files from a client that were generated with Crystal Reports. The files contain duplicated text; essentially, the page is printed to PostScript twice.

      The client has asked us to detect this duplication so that selecting words and then copying them to the clipboard doesn't produce duplicate text.
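
      To illustrate the kind of detection being asked for, here is a minimal sketch. It is not the code committed for this issue: the Word class, the removeDuplicates method and the tolerance value are all hypothetical. It assumes a duplicate is a word whose text matches an earlier word and whose bounds coincide with it within a small tolerance.

```java
import java.util.ArrayList;
import java.util.List;

final class DuplicateWordFilter {

    /** Hypothetical stand-in for an extracted word: its text plus page-space bounds. */
    static final class Word {
        final String text;
        final double x, y, width, height;

        Word(String text, double x, double y, double width, double height) {
            this.text = text;
            this.x = x;
            this.y = y;
            this.width = width;
            this.height = height;
        }
    }

    /** Two words match when the text is equal and the bounds agree within tol. */
    static boolean sameWord(Word a, Word b, double tol) {
        return a.text.equals(b.text)
                && Math.abs(a.x - b.x) <= tol
                && Math.abs(a.y - b.y) <= tol
                && Math.abs(a.width - b.width) <= tol
                && Math.abs(a.height - b.height) <= tol;
    }

    /** Keeps the first occurrence of each word and drops later overlapping copies. */
    static List<Word> removeDuplicates(List<Word> words, double tol) {
        List<Word> kept = new ArrayList<>();
        for (Word candidate : words) {
            boolean duplicate = false;
            for (Word earlier : kept) {
                if (sameWord(candidate, earlier, tol)) {
                    duplicate = true;
                    break;
                }
            }
            if (!duplicate) {
                kept.add(candidate);
            }
        }
        return kept;
    }

    public static void main(String[] args) {
        List<Word> words = new ArrayList<>();
        words.add(new Word("Invoice", 72, 700, 40, 12));
        words.add(new Word("Total", 120, 700, 28, 12));
        words.add(new Word("Invoice", 72.2, 700.1, 40, 12)); // second print pass, dropped
        words.add(new Word("Total", 120, 650, 28, 12));      // same text on another line, kept
        for (Word w : removeDuplicates(words, 0.5)) {
            System.out.println(w.text + " @ " + w.x + "," + w.y);
        }
    }
}
```

      Comparing bounds as well as text is what keeps a word that legitimately repeats elsewhere on the page. The pairwise scan is quadratic in the number of words, which is acceptable for a sketch but would likely need a spatial index or sorting for large pages.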

        Activity

        Patrick Corless created issue -
        Patrick Corless made changes -
        Field Original Value New Value
        Fix Version/s 5.0.7 [ 11470 ]
        Repository Revision Date User Message
        ICEsoft Public SVN Repository #40862 Wed Apr 23 08:17:30 MDT 2014 patrick.corless PDF-745 addition of configurable auto duplicate text detection also fix for searching for numbers.
        Files Changed
        MODIFY /icepdf/branches/icepdf-5.0.1/icepdf/viewer/src/org/icepdf/ri/common/search/DocumentSearchControllerImpl.java
        MODIFY /icepdf/branches/icepdf-5.0.1/icepdf/core/src/org/icepdf/core/pobjects/graphics/text/WordText.java
        MODIFY /icepdf/branches/icepdf-5.0.1/icepdf/core/src/org/icepdf/core/pobjects/graphics/text/LineText.java
        MODIFY /icepdf/branches/icepdf-5.0.1/icepdf/viewer/src/org/icepdf/ri/common/tools/TextSelectionPageHandler.java
        Repository Revision Date User Message
        ICEsoft Public SVN Repository #40863 Wed Apr 23 08:20:54 MDT 2014 patrick.corless PDF-745 addition of configurable auto duplicate text detection also fix for searching for numbers.
        Files Changed
        MODIFY /icepdf/branches/icepdf-5.0.1/icepdf/core/src/org/icepdf/core/pobjects/graphics/text/PageText.java
        Patrick Corless made changes -
        Fix Version/s 5.0.6_P01 [ 11471 ]
        Fix Version/s 5.0.7 [ 11470 ]
        Patrick Corless added a comment -

        It should be noted that the following system properties must be set.

        -Dorg.icepdf.core.views.page.text.autoSpace=false
        -Dorg.icepdf.core.views.page.text.trim.duplicates=true
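
        If passing -D flags on the command line is not practical, the same two properties can be set from code with the standard System.setProperty call. This is a sketch under one assumption: the properties have to be set before ICEpdf reads them, and this issue does not say when that happens, so the earliest point in application startup is the safe choice. The class name IcePdfTextProperties is hypothetical.

```java
final class IcePdfTextProperties {

    // Property names are taken verbatim from the comment above.
    static final String AUTO_SPACE = "org.icepdf.core.views.page.text.autoSpace";
    static final String TRIM_DUPLICATES = "org.icepdf.core.views.page.text.trim.duplicates";

    /** Applies the values recommended in this issue; call before any ICEpdf class is used. */
    static void apply() {
        System.setProperty(AUTO_SPACE, "false");
        System.setProperty(TRIM_DUPLICATES, "true");
    }

    public static void main(String[] args) {
        apply();
        System.out.println(AUTO_SPACE + "=" + System.getProperty(AUTO_SPACE));
        System.out.println(TRIM_DUPLICATES + "=" + System.getProperty(TRIM_DUPLICATES));
    }
}
```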

        Patrick Corless added a comment -

        There are still some corner cases where the detection code will fail, but overall the improvement is considerable.

        Patrick Corless made changes -
        Status Open [ 1 ] Resolved [ 5 ]
        Resolution Fixed [ 1 ]
        Patrick Corless made changes -
        Fix Version/s 5.0.7 [ 11470 ]
        Repository Revision Date User Message
        ICEsoft Public SVN Repository #42063 Fri Aug 01 13:44:59 MDT 2014 patrick.corless PDF-745 applied customer patch for duplicate word detection.
        Files Changed
        MODIFY /icepdf/branches/icepdf-5.0.1/icepdf/core/src/org/icepdf/core/pobjects/graphics/text/PageText.java
        Patrick Corless made changes -
        Status Resolved [ 5 ] Closed [ 6 ]

          People

          • Assignee:
            Patrick Corless
            Reporter:
            Patrick Corless
          • Votes:
            0
            Watchers:
            1
