Hi Vern,
   Thank you for your responses.
   PDFBox was suggested, a java solution.
   I have downloaded the jar files, modified my classpath and woohoo I have a
   text output from the PDF.
   Interestingly, I needed to add the -sort=true command line option to get
   usable text out of the PDF but this took less than an hour to install and
   test.
   Obviously, I have only tested with one pdf but the results are promising.
   And for an experiment I passed the text to an AI API to return the
   required information from the PDF text file in JSON and that worked very
   nicely so now I have a PDF extracted to text and the text converted to
   JSON providing only the requested fields.
   Cheers
   Don
    
   Don Brown
   Senior Consultant
    
   [1]OneTeam IT Pty Ltd
   P: 1300 088 400
   -----Original Message-----
   From: MIDRANGE-L <midrange-l-bounces@xxxxxxxxxxxxxxxxxx> On Behalf Of Vern
   Hamberg via MIDRANGE-L
   Sent: Tuesday, 17 June 2025 8:59 AM
   To: midrange-l@xxxxxxxxxxxxxxxxxx
   Cc: Vern Hamberg <vhamberg@xxxxxxxxxxxxxxx>
   Subject: Re: Convert PDF to text
   Hi Don
   Probably I was the one who said Poppler will not install with yum - I
   looked at the list of installed and available packages in Open Source
   Package Management (OSPM) when connected with a 7.6 machine - Poppler does
   not seem to be there, unless it uses a different name.
   Someone else mentioned Ghostscript, as did I - it can do some of what you
   need, I believe. We used it to generate PCL from PDF files, it did fine,
   we didn't have any weird PDF-ish stuff. That person mentioned another
   tool, I can't say if it's in the OSPM.
   Jack Woehr mentioned pypdf - some Python component, I assume - I did not
   see it listed separately in OSPM, maybe it's part of another python
   package.
   *Regards*
   *Vern Hamberg*
   IBM Champion 2025 <cid:part1.cW8DDgxI.raSwjvav@centurylink.net> CAAC
   (COMMON Americas Advisory Council) IBM Influencer 2023
   On 6/16/2025 4:26 PM, Don Brown via MIDRANGE-L wrote:
   > Thanks Patrik,
   >
   > I thought I read somewhere that the Poppler tools would not install with
   > yum on IBMi ?
   >
   > Thanks for the link, I will give it a go.
   >
   > Have you, or anyone installed and used these tools ?
   >
   > Thanks
   > Don
   >
   > Â
   >
   > Don Brown
   >
   > Senior Consultant
   > Â
   > [1]OneTeam IT Pty Ltd
   > P: 1300 088 400
   >
   > -----Original Message-----
   > From: MIDRANGE-L<midrange-l-bounces@xxxxxxxxxxxxxxxxxx> On Behalf Of
   > Patrik Schindler
   > Sent: Monday, 16 June 2025 6:38 PM
   > To: Midrange Systems Technical Discussion<midrange-l@xxxxxxxxxxxxxxxxxx>
   > Subject: Re: Convert PDF to text
   >
   > Hello Don,
   >
   > Am 16.06.2025 um 09:12 schrieb Don Brown via MIDRANGE-L
   > <midrange-l@xxxxxxxxxxxxxxxxxx>:
   >
   > > 1. Does anyone have a recommended solution to achieve converting a pdf
   > to text. I am after a php or native rpg ish solution. Not python please.
   >
   > I'd use the pdftotext command from the poppler-utils package in PASE. I
   > assume the poppler-utils package is available for installation via yum.
   >
   > [2][2]
https://en.wikipedia.org/wiki/Poppler_(software)
   >
   > :wq! PoC
   >
   > --
   > This is the Midrange Systems Technical Discussion (MIDRANGE-L) mailing
   > list To post a message email:MIDRANGE-L@xxxxxxxxxxxxxxxxxx To subscribe,
   > unsubscribe, or change list options,
   > visit: [3][3]
https://lists.midrange.com/mailman/listinfo/midrange-l
   > or email:MIDRANGE-L-request@xxxxxxxxxxxxxxxxxx
   > Before posting, please take a moment to review the archives at
   > [4][4]
https://archive.midrange.com/midrange-l.
   >
   > Please contactsupport@xxxxxxxxxxxxxxxxxxxx for any subscription related
   > questions.
   >
   > --
   > Message protected by MailGuard: e-mail anti-virus, anti-spam and content
   > filtering.
   > [5][5]
https://www.mailguard.com.au
   >
   > References
   >
   > Visible links
   > 1.[6]
https://www.oneteamit.com.au/
   > 2.[7]
https://en.wikipedia.org/wiki/Poppler_(software)
   > 3.[8]
https://lists.midrange.com/mailman/listinfo/midrange-l
   > 4.[9]
https://archive.midrange.com/midrange-l.
   > 5.[10]
https://www.mailguard.com.au/
   >
   --
   This is the Midrange Systems Technical Discussion (MIDRANGE-L) mailing
   list To post a message email: MIDRANGE-L@xxxxxxxxxxxxxxxxxx To subscribe,
   unsubscribe, or change list options,
   visit: [11]
https://lists.midrange.com/mailman/listinfo/midrange-l
   or email: MIDRANGE-L-request@xxxxxxxxxxxxxxxxxx
   Before posting, please take a moment to review the archives at
   [12]
https://archive.midrange.com/midrange-l.
   Please contact support@xxxxxxxxxxxxxxxxxxxx for any subscription related
   questions.
   --
   Message protected by MailGuard: e-mail anti-virus, anti-spam and content
   filtering.
   [13]
https://www.mailguard.com.au
References
   Visible links
   1. 
https://www.oneteamit.com.au/
   2. 
https://en.wikipedia.org/wiki/Poppler_(software)
   3. 
https://lists.midrange.com/mailman/listinfo/midrange-l
   4. 
https://archive.midrange.com/midrange-l.
   5. 
https://www.mailguard.com.au/
   6. 
https://www.oneteamit.com.au/
   7. 
https://en.wikipedia.org/wiki/Poppler_(software)
   8. 
https://lists.midrange.com/mailman/listinfo/midrange-l
   9. 
https://archive.midrange.com/midrange-l.
  10. 
https://www.mailguard.com.au/
  11. 
https://lists.midrange.com/mailman/listinfo/midrange-l
  12. 
https://archive.midrange.com/midrange-l.
  13. 
https://www.mailguard.com.au/
As an Amazon Associate we earn from qualifying purchases.