Author Topic: Retrieval i/o performance (Read 4348 times)

wwwalton · « **on:** April 25, 2013, 05:43:03 AM »

So, we run CMOD MP 8.5.6 under AIX 7 with a local cache only storage and DB2 9.5. We generically index certain PDFs. Currently a single retrieval of a 100k PDF (compressed when stored) takes .15 seconds using the 66 record. New PDFs being designed, the black and white version is about 1 Mb in size and takes about 1.0 seconds to retreive. The color version is much larger, around 4 Mb, and retrieval time jumps to 25 seconds. Since these are used by web (I used the gui client to test), retrieval times are critical. Any ideas as to how this time could be reduced for the color version?
Thanks,
-Walt

Paul · « **Reply #1 on:** May 03, 2013, 09:28:31 AM »

I have found some issues with retrieval and the java console version. Particularly java 1.6.0_14 when using ODWEK 8.4.1 and 8.5.0.6.

Trambak · « **Reply #2 on:** May 19, 2013, 09:02:50 PM »

How do you load these pdf documents - generic indexer or pdf indexer? If there is a resource portion that gets pulled while pulling these documents using ODWEK, think about caching those resource files externally.

wwwalton · « **Reply #3 on:** May 20, 2013, 04:54:53 AM »

These are all loaded generic so no opportunity that I know of to deal with resources separately. I used the the Windows installed client which I do not believe uses java, but is a natively compiled application.
Thanks,
-ww

Justin Derrick · « **Reply #4 on:** May 20, 2013, 05:34:37 AM »

Do you have compression set to 'disable'? Also, try increasing your object size. At 4MB each, you're only getting 2.5 PDFs per CMOD object. Up to 100MB should be fine.

Also, try running a retrieval with 'arsdoc get' on the server. That will eliminate the possibility of any client PC / network issues.

-JD.

wwwalton · « **Reply #5 on:** May 21, 2013, 05:38:09 AM »

Thanks for your response Derek, just a couple of questions. Since I was using the times off the '66' record, the client shouldn't matter right?

Also, the doc on compression says:
Disable
OnDemand does not compress the input data. Choose this option when the input data is already compressed, such as a compressed TIFF. The documents are uncompressed by the appropriate viewer on the client, for example, Acrobat Reader.
None
OnDemand does not compress the input data when loading it into the system. When the user selects a document for viewing, OnDemand compresses the document before transmitting it over the network and will uncompress the document on the client.
So, if I read this correctly, I still pay a penalty storing uncompressed as time will be taken to compress/decompress anyway?
Thanks again for your input.
-ww

wwwalton · « **Reply #6 on:** May 21, 2013, 05:39:12 AM »

Ooops, meant Justin.

Paul · « **Reply #7 on:** May 21, 2013, 08:54:13 AM »

Shouldn't you use OD77 since these are documents and not TIFFs?

Justin Derrick · « **Reply #8 on:** May 21, 2013, 09:08:26 AM »

Yup, WW is definitely right -- set compression to 'none', and not 'disable'. (My bad, sorry!)

Paul: The overwhelming majority of PDFs use compression by default, and trying to compress them is a waste of time -- the savings would be negligible, and some files actually grow in size due to the overhead of the compression method.

Paul · « **Reply #9 on:** May 21, 2013, 09:24:07 AM »

Thanks, Justin!

OnDemand User Group

News:

Author Topic: Retrieval i/o performance (Read 4348 times)

wwwalton

Retrieval i/o performance

Paul

Re: Retrieval i/o performance

Trambak

Re: Retrieval i/o performance

wwwalton

Re: Retrieval i/o performance

Justin Derrick

Re: Retrieval i/o performance

wwwalton

Re: Retrieval i/o performance

wwwalton

Re: Retrieval i/o performance

Paul

Re: Retrieval i/o performance

Justin Derrick

Re: Retrieval i/o performance

Paul

Re: Retrieval i/o performance