Author Topic: Loading with PDF indexer, but getting invalid generic index error  (Read 2680 times)

Jason_B

  • Guest
We are using the PDF indexer on a windows server and are occasionally, about once or twice a week, getting an
ARS1197E  Invalid generic index file format sequence    error when we are not using the generic indexer.  This only started to occur after we upgraded to CMOD 9.5
The file gets indexed successfully, and the error will occur once the loading of the pdf is started.  Any ideas!?!
thanks!

Lars Bencze

  • Full Member
  • ***
  • Posts: 116
  • CMOD Expert at Skandia
    • View Profile
    • INACTIVE - Bezland Consulting
Re: Loading with PDF indexer, but getting invalid generic index error
« Reply #1 on: March 08, 2016, 07:40:36 AM »
Hi Jason,

Are you by any chance using the "graphical indexer" feature of PDF Indexer?
If so, it is possible that sometimes you get data that is partially outside of the indexing "rubber rectangle" you have specified.
I believe the PDF Indexer builds a "generic" indexer file based on the fields it finds in the PDF file. (Other forum experts will probably laugh at me and let you know if I'm wrong here :) - that's OK, I can take it.  ;D )

If not, I would guess that this may be due to invalid data being snuck into the document, such as a linefeed or other non-printable character in the middle of a field. This would invalidate the format of the generic indexer file that is auto-created and would cause such an error message.

When you have this message, have you looked in the arstmp folder, or whatever folder is used during indexing? You should be able to find the faulty *.ind file there, and you can manually look through it to check for errors. (You can find the valid format for indexing files in the CMOD docs online)

Hope this helps!

/Lars
OnDemand for MP expert. #Multiplatforms #Admin #Scripts #Performance #Support #Architecture #PDFIndexing #TSM/SP #DB2 #CustomSolutions #Integration #UserExits #Migrations #Workflow #ECM #Cloud #ODApi

Justin Derrick

  • IBM Content Manager OnDemand Consultant
  • Administrator
  • Hero Member
  • *****
  • Posts: 2231
  • CMOD Guru for hire...
    • View Profile
    • Tenacious Consulting
Re: Loading with PDF indexer, but getting invalid generic index error
« Reply #2 on: March 08, 2016, 07:51:33 AM »
With the old indexer, you had to extend the 'box' around text by 0.1" in order to make sure that the box was large enough to encompass the entire word.  Otherwise you'd get chopped off index values.  Maybe you're getting bad data and CMOD is complaining?
IBM CMOD Professional Services: http://TenaciousConsulting.com
Call:  +1-866-533-7742  or  eMail:  jd@justinderrick.com
IBM CMOD Wiki:  https://CMOD.wiki/
FREE IBM CMOD Education & Webinars:  https://CMOD.Training/

Interests: #AIX #Linux #Multiplatforms #DB2 #TSM #SP #Performance #Security #Audits #Customizing #Availability #HA #DR