Author Topic: ACIF lineData Arabic  (Read 2479 times)

MHassan

  • Jr. Member
  • **
  • Posts: 64
    • View Profile
ACIF lineData Arabic
« on: July 13, 2020, 06:35:35 AM »
Hello ,

I am trying to index a .txt file with line data that have arabic content but when i try to set my triggers and fields with report wizard , the arabic content is appeared like the screenshot i provide below , so anyone have an idea if i want to index .txt files with arabic content how to appear normal like the option in the pdf indexer 9output hexadecimal strings).

Justin Derrick

  • IBM Content Manager OnDemand Consultant
  • Administrator
  • Hero Member
  • *****
  • Posts: 2229
  • CMOD Guru for hire...
    • View Profile
    • Tenacious Consulting
Re: ACIF lineData Arabic
« Reply #1 on: July 13, 2020, 09:27:40 AM »
Which codepage are you using for processing / displaying this?  I'd suspect you'd need the file produced in Unicode, and then use the corresponding Unicode codepage to match it when working with ACIF.  Otherwise, I suspect you're mixing codepages, which I don't think would work at all.

-JD.
IBM CMOD Professional Services: http://TenaciousConsulting.com
Call:  +1-866-533-7742  or  eMail:  jd@justinderrick.com
IBM CMOD Wiki:  https://CMOD.wiki/
FREE IBM CMOD Education & Webinars:  https://CMOD.Training/

Interests: #AIX #Linux #Multiplatforms #DB2 #TSM #SP #Performance #Security #Audits #Customizing #Availability #HA #DR

MHassan

  • Jr. Member
  • **
  • Posts: 64
    • View Profile
Re: ACIF lineData Arabic
« Reply #2 on: July 13, 2020, 11:21:42 AM »
Which codepage are you using for processing / displaying this?  I'd suspect you'd need the file produced in Unicode, and then use the corresponding Unicode codepage to match it when working with ACIF.  Otherwise, I suspect you're mixing codepages, which I don't think would work at all.

-JD.

Hello JD,
Thanks for reply , i am using 850 codepage for processing , the file is produced in UTF-8 , but it contain english and arabic mixed content, so i can't found a codepage that will generate the file as it is , and the one i found only was Codepage=(1208) but it shows a line fine and the next line the characters are separated from each other  like the screenshot below

Justin Derrick

  • IBM Content Manager OnDemand Consultant
  • Administrator
  • Hero Member
  • *****
  • Posts: 2229
  • CMOD Guru for hire...
    • View Profile
    • Tenacious Consulting
Re: ACIF lineData Arabic
« Reply #3 on: July 13, 2020, 02:01:16 PM »
Looks like you need to work with the team that produced the file to ensure it's properly formatted for Unicode. 

-JD.
IBM CMOD Professional Services: http://TenaciousConsulting.com
Call:  +1-866-533-7742  or  eMail:  jd@justinderrick.com
IBM CMOD Wiki:  https://CMOD.wiki/
FREE IBM CMOD Education & Webinars:  https://CMOD.Training/

Interests: #AIX #Linux #Multiplatforms #DB2 #TSM #SP #Performance #Security #Audits #Customizing #Availability #HA #DR

MHassan

  • Jr. Member
  • **
  • Posts: 64
    • View Profile
Re: ACIF lineData Arabic
« Reply #4 on: July 14, 2020, 04:27:13 AM »
Hello JD,

It seems it works fine on the navigator with the codepage (1208 )like the screenshot below , but on the CMOD client it appear  that it has an issue as the screenshot i provided before ,
Thank you JD for your effort
Modify message

Justin Derrick

  • IBM Content Manager OnDemand Consultant
  • Administrator
  • Hero Member
  • *****
  • Posts: 2229
  • CMOD Guru for hire...
    • View Profile
    • Tenacious Consulting
Re: ACIF lineData Arabic
« Reply #5 on: July 14, 2020, 02:37:24 PM »
Make sure you've got the right codepage set in the Application definition so that the client knows how to display it.

At this point, it may sound like a CMOD Client bug with displaying arabic characters.

-JD.
IBM CMOD Professional Services: http://TenaciousConsulting.com
Call:  +1-866-533-7742  or  eMail:  jd@justinderrick.com
IBM CMOD Wiki:  https://CMOD.wiki/
FREE IBM CMOD Education & Webinars:  https://CMOD.Training/

Interests: #AIX #Linux #Multiplatforms #DB2 #TSM #SP #Performance #Security #Audits #Customizing #Availability #HA #DR

Darrell Bryant

  • Full Member
  • ***
  • Posts: 104
  • Sed fugit interea fugit inreparabile tempus-Virgil
    • View Profile
Re: ACIF lineData Arabic
« Reply #6 on: July 15, 2020, 05:46:05 AM »
Code page 850 is for Western European languages. It looks like code page 864 or 1256 would be correct for ASCII Arabic data.
#IBMi #iSeries #PDF #XML #400 Indexer #ASM

MHassan

  • Jr. Member
  • **
  • Posts: 64
    • View Profile
Re: ACIF lineData Arabic
« Reply #7 on: July 15, 2020, 05:56:48 AM »
Code page 850 is for Western European languages. It looks like code page 864 or 1256 would be correct for ASCII Arabic data.

Both codepage didn't work for me , the only codepage that worked for me is 1208 that appeared fine in navigator but in CMOD client appeared as the screenshot in my previous replies some are fine and some are messy