Author Topic: PDF Indexer of extremely large files - 1.2 million pages  (Read 2111 times)

Steve Bechtolt

  • Jr. Member
  • **
  • Posts: 56
    • View Profile
PDF Indexer of extremely large files - 1.2 million pages
« on: February 13, 2018, 11:50:04 AM »
Does anyone have metrics of PDF indexing of extremely large PDF files?
We're talking about files 1-2.4 GB and 800,000 to 1,200,000 pages per file.
Steve Bechtolt
IBM Certified Solutions Expert - IBM Content Management - OnDemand Multiplatform
ERM as a Service - DXC Technology

Stephen McNulty

  • Jr. Member
  • **
  • Posts: 57
    • View Profile
Re: PDF Indexer of extremely large files - 1.2 million pages
« Reply #1 on: February 20, 2018, 03:11:42 PM »
the largest PDF files I have to deal with are 700MB with 8300 pages, 4 fields. usually 5 minutes to index, 1 minute to load. 
#ISERIES #ODWEK #XML

jsquizz

  • Global Moderator
  • Hero Member
  • *****
  • Posts: 576
    • View Profile
Re: PDF Indexer of extremely large files - 1.2 million pages
« Reply #2 on: March 26, 2018, 07:51:12 AM »
Does anyone have metrics of PDF indexing of extremely large PDF files?
We're talking about files 1-2.4 GB and 800,000 to 1,200,000 pages per file.

Done this before, but it was a process

Infoprint XT to take Xerox metacode and convert to afp -- AFP2PDF -- Massive PDF.   Hours to load
#CMOD #DB2 #AFP2PDF #TSM #AIX #RHEL #AWS #AZURE #GCP #EVERYTHING