Author Topic: Character removal question  (Read 731 times)

DDP021

  • Sr. Member
  • ****
  • Posts: 343
    • View Profile
Character removal question
« on: February 22, 2023, 09:14:51 AM »
Hi there...Stupid question...We see on the Application setup under the LOAD INFORMATION tab options for Character Removal (Leading, imbedded, trailing) for App Grp names.  We also see these options under the APP GRP setup under the FIELD INFORMATION tab.   If you wanted to incorporate these, where would set them, the APP setup, AppGrp setup or both?

Darrell Bryant

  • Full Member
  • ***
  • Posts: 104
  • Sed fugit interea fugit inreparabile tempus-Virgil
    • View Profile
Re: Character removal question
« Reply #1 on: February 22, 2023, 09:49:44 AM »
Character removal on the Application Group > Field Information tab is used to 'cleanup' search values entered by the end user before querying the database.
From the Administrator Client help text:

String Embedded. You can define from one to eight characters that Content Manager OnDemand removes from search values typed by the user before querying the database. Content Manager OnDemand can remove embedded, leading, and trailing characters. For example, suppose you define a field to store index values that contain phone numbers. The values in the database contain only numbers. That is, the values do not include the dash character. However, when users query the database with a phone number, they typically type search strings such as 303-555-1212. Since the database values do not include the dash character, Content Manager OnDemand would not find a document to match the query. However, you can use the Embedded field to enable users to type values that include the dash character. Simply type a dash character in the Embedded field. That way, the user can type 303-555-1212 (or 3035551212) in the search field. Content Manager OnDemand removes all occurrences of the character specified in the Embedded field from the search string typed by the user before querying the database.

#IBMi #iSeries #PDF #XML #400 Indexer #ASM

Justin Derrick

  • IBM Content Manager OnDemand Consultant
  • Administrator
  • Hero Member
  • *****
  • Posts: 2228
  • CMOD Guru for hire...
    • View Profile
    • Tenacious Consulting
Re: Character removal question
« Reply #2 on: February 22, 2023, 10:19:20 AM »
You may also find this interesting if you're loading CMOD Generic Index files, and doing character removal at load time to normalize data:  https://cmod.wiki/index.php?title=Default_Values_and_the_Generic_Indexer

-JD.
IBM CMOD Professional Services: http://TenaciousConsulting.com
Call:  +1-866-533-7742  or  eMail:  jd@justinderrick.com
IBM CMOD Wiki:  https://CMOD.wiki/
FREE IBM CMOD Education & Webinars:  https://CMOD.Training/

Interests: #AIX #Linux #Multiplatforms #DB2 #TSM #SP #Performance #Security #Audits #Customizing #Availability #HA #DR

DDP021

  • Sr. Member
  • ****
  • Posts: 343
    • View Profile
Re: Character removal question
« Reply #3 on: February 23, 2023, 02:59:38 AM »
Thanks everyone for the response.  I guess my question is, with the option to set these parameters on both the Application and Application Group setups,  which option is the better place to set turn them on (ie Application or Appliction group definition).

Justin Derrick

  • IBM Content Manager OnDemand Consultant
  • Administrator
  • Hero Member
  • *****
  • Posts: 2228
  • CMOD Guru for hire...
    • View Profile
    • Tenacious Consulting
Re: Character removal question
« Reply #4 on: February 23, 2023, 07:43:43 AM »
Hrm.  Interesting question.

It depends on how you want to 'normalize' the document metadata, so that you can ensure the maximum level of success in FINDING a document you're searching for. 

In the vanilla thick client & ICN, since there's not a lot of options for modifying queries entered by end users to match specific formats (like a US Social Security Number or phone number) it's probably best to just store them as a series of digits in a string.  It doesn't make sense to treat them as numbers -- because you won't be doing comparisons like greater-than or less-than.

If you ensure that SSNs are loaded with spaces, dashes, or dots removed, then ensure that spaces, dashes, or dots are removed at query time, you're dramatically increasing the chance that someone can find what they're looking for.

Hope that helps.

-JD.
« Last Edit: February 23, 2023, 09:21:00 AM by Justin Derrick »
IBM CMOD Professional Services: http://TenaciousConsulting.com
Call:  +1-866-533-7742  or  eMail:  jd@justinderrick.com
IBM CMOD Wiki:  https://CMOD.wiki/
FREE IBM CMOD Education & Webinars:  https://CMOD.Training/

Interests: #AIX #Linux #Multiplatforms #DB2 #TSM #SP #Performance #Security #Audits #Customizing #Availability #HA #DR

DDP021

  • Sr. Member
  • ****
  • Posts: 343
    • View Profile
Re: Character removal question
« Reply #5 on: February 23, 2023, 08:02:46 AM »
Thanks Justin.  We just noticed when setting a new definition up, both APP and APP Grp setups had the Chraracter Removal option and weren't sure where we would want to set that up.  For this particular setup, we just want to removed trailing spaces on the Account number index.  So werent sure where to set that on, the App or App Grp definition.  Take care, Dave.