Author Topic: Download issue for PDF files as Damage file uploaded to CMOD  (Read 110 times)

MD MUSLEHUDDIN

  • Newbie
  • *
  • Posts: 2
    • View Profile
Download issue for PDF files as Damage file uploaded to CMOD
« on: November 30, 2020, 03:27:17 AM »
Hi,


We are facing issue to download pdf files from CMOD and We are getting below error message in System log.

Error Message:
1. This file is Damaged
2. File does not begin with '%PDF

Cause:
Issue happening because Damage file uploading to CMOD. We are creating batch files with 100 pdf files and uploading to CMOD. If one file is damage in a batch, full batch getting corrupted and uploading corrupted files. So we are unable to download files from CMOD as files is corrupted.

We are not getting any error logs at the time of Loading documents.

We want to restrict upload for damage files. Is there any option is available in CMOD to restrict upload damage file.



Thanks in advance for helping

Justin Derrick

  • IBM Content Manager OnDemand Consultant
  • Administrator
  • Hero Member
  • *****
  • Posts: 1964
  • CMOD Guru for hire...
    • View Profile
    • Tenacious Consulting
Re: Download issue for PDF files as Damage file uploaded to CMOD
« Reply #1 on: November 30, 2020, 09:52:04 AM »
In cases like this, whoever produces the files needs to fix their process.  Content Manager OnDemand is an archive -- we shouldn't modify anything, ever.

If you want to detect the errors so you can reject them, then you need to build some custom code around arsload to do that testing first. 

-JD.
IBM CMOD Professional Services: http://TenaciousConsulting.com
Call:  +1-866-533-7742  or  eMail:  jd@justinderrick.com
IBM CMOD Wiki:  https://CMOD.wiki/
FREE IBM CMOD Education & Webinars:  https://CMOD.Training/

Interests: #AIX #Linux #Multiplatforms #DB2 #TSM #SP #Performance #Security #Audits #Customizing #Availability #HA #DR

MD MUSLEHUDDIN

  • Newbie
  • *
  • Posts: 2
    • View Profile
Re: Download issue for PDF files as Damage file uploaded to CMOD
« Reply #2 on: December 09, 2020, 07:12:26 AM »
Hi Justin,


Thank you for your advise.

We are unable to detect any error log at the time of upload files.

Could you please advise to enable error log to detect any damage files is uploading to CMOD. So we can generate custom code to restrict damage files upload.

Please advice if there any Technote to to build custom code for  arsload and detect any error for damage files.



Thanks in advance for helping


Ed_Arnold

  • Hero Member
  • *****
  • Posts: 1007
    • View Profile
Re: Download issue for PDF files as Damage file uploaded to CMOD
« Reply #3 on: December 09, 2020, 09:52:35 AM »
Let me just take a wild guess.

Are you loading using the generic indexer?  That would mean that no PDF processing is getting invoked during loading.

You mention the error message about "%PDF".  I just browsed one of my PDFs and sure enough the first line starts with "%PDF-1.5".

So could a clever programmer (that's not me) come up with something that scans the input file, ?perhaps with the generic indexer input?, to verify that the expected start of each PDF starts with the proper characters?

? Perhaps the problem is a generic indexer not having the right offsets ?

Ed Arnold
#zOS #ODF

Justin Derrick

  • IBM Content Manager OnDemand Consultant
  • Administrator
  • Hero Member
  • *****
  • Posts: 1964
  • CMOD Guru for hire...
    • View Profile
    • Tenacious Consulting
Re: Download issue for PDF files as Damage file uploaded to CMOD
« Reply #4 on: December 09, 2020, 11:41:29 AM »
Hi MD.

There's nothing in CMOD's code that will help you validate the contents of a PDF *before* loading -- you'll have to do that yourself with some validation tool, and there are lots of open-source PDF libraries that will do a basic check.

You may want to switch your process to use the PDF Indexer, so that CMOD will try to open the PDF files and read them in order to identify and separate the resources.

-JD.
IBM CMOD Professional Services: http://TenaciousConsulting.com
Call:  +1-866-533-7742  or  eMail:  jd@justinderrick.com
IBM CMOD Wiki:  https://CMOD.wiki/
FREE IBM CMOD Education & Webinars:  https://CMOD.Training/

Interests: #AIX #Linux #Multiplatforms #DB2 #TSM #SP #Performance #Security #Audits #Customizing #Availability #HA #DR