[PDF-624] Text extraction is not correctly mapping GID to valid unicode value. - ICEsoft JIRA Issue Tracker

Details

Type: Bug
Status: Closed
Priority: Major
Resolution: Fixed
Affects Version/s: 5.0.2
Fix Version/s: 5.0.3
Component/s: Core/Parsing
Labels:
None
Environment:
ny

Description

The PDF in question (support drive) produces garbage when the text is extracted from the second page. I've verified that there is valid Unicode information available in the file. For some reason the the PRO version is having difficulty getting at this information.

Further investigation is needed.

Activity

Ascending order - Click to sort in descending order

Hide

Permalink

Patrick Corless added a comment - 29/Jul/13 1:57 PM

I've fixed a bug in the Encoding class for NFont that insures the encoding differences array is properly parsed and stored.

Show

Patrick Corless added a comment - 29/Jul/13 1:57 PM I've fixed a bug in the Encoding class for NFont that insures the encoding differences array is properly parsed and stored.

Hide

Permalink

Patrick Corless added a comment - 29/Jul/13 2:01 PM

Update 5.0.1 branch and trunk.

Show

Patrick Corless added a comment - 29/Jul/13 2:01 PM Update 5.0.1 branch and trunk.

People

Assignee:

Patrick Corless

Reporter:

Patrick Corless

Votes:

0 Vote for this issue

Watchers:

1 Start watching this issue

Dates

Created:

29/Jul/13 10:16 AM

Updated:

01/Apr/15 3:00 PM

Resolved:

29/Jul/13 2:01 PM