Match special characters with ControlPoint Eduction

ISSUE:

ControlPoint lets you match patterns written as regular expressions. For example, the following regular expression matches Turkish phone numbers in documents:
(([\ ]90)|([0][0-9]?)|(\ \(90\))|([\ ]([ ]?)\([0-9]{2}\))|(\(\ 90\))|([90]*))([ ]?)((\([0-9]{3}\))|([0-9]{3}))([ ]?)([0-9]{3})(\s*[\-]?)([0-9]{2})(\s*[\-]?)([0-9]{2})
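As a quick sanity check, the pattern can be tried outside ControlPoint. The sketch below uses Python's `re` module as a stand-in for Eduction, so it only illustrates the expression itself; Eduction's own handling of special characters may differ. The `find_phone` helper and the sample text are hypothetical and not part of ControlPoint.

```python
import re

# Pattern copied verbatim from the article. Python's re module is only a
# stand-in here; it is NOT the Eduction engine.
TURKISH_PHONE = re.compile(
    r"(([\ ]90)|([0][0-9]?)|(\ \(90\))|([\ ]([ ]?)\([0-9]{2}\))|(\(\ 90\))|([90]*))"
    r"([ ]?)((\([0-9]{3}\))|([0-9]{3}))([ ]?)([0-9]{3})"
    r"(\s*[\-]?)([0-9]{2})(\s*[\-]?)([0-9]{2})"
)


def find_phone(text):
    """Return the first substring matched by the pattern, or None."""
    match = TURKISH_PHONE.search(text)
    return match.group(0) if match else None


# Hypothetical sample text, used for illustration only.
print(find_phone("0212 555 12 34"))
```

Testing the expression this way helps separate problems in the pattern from problems caused by the engine's configuration.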

Note that this regular expression contains a “ ” sign. By default, the ControlPoint pattern matching engine, also known as Eduction, excludes such special characters.

HOW TO STEPS:

To match them, add a parameter to the section that describes your “repository” in the Connector Framework Service (CFS) configuration file.

For example, if you created “MyRepository” as a FileSystem source, edit the Indexer/FileSystem Connector Framework/ControlPointFileSystem Connector Framework.cfg file and add
TangibleCharacter=  to the parameters section of the repository:

[MyRepositoryEductionSettings]
MaxMatchesPerDoc=10000
TangibleCharacter=
SearchFields=DRECONTENT
Entity0=number/phone_tk
EntityField0=CPED_NUMBER_PHONE_TK
ResourceFiles=eduction\number_phone_tk.ecr
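After editing the file, it can be useful to confirm that the parameter really sits in the intended section. The following is a minimal sketch that assumes the configuration uses plain `[Section]` headers and `key=value` lines, as in the excerpt above. The `section_parameters` helper and the `SAMPLE_CFG` text are hypothetical; in practice you would read the text from your own CFS configuration file.

```python
def section_parameters(cfg_text, section):
    """Collect key=value pairs found under [section] in cfg-style text."""
    params = {}
    in_section = False
    for raw_line in cfg_text.splitlines():
        line = raw_line.strip()
        if line.startswith("[") and line.endswith("]"):
            # A new section header starts; track whether it is the one we want.
            in_section = line[1:-1] == section
        elif in_section and "=" in line:
            key, _, value = line.partition("=")
            params[key.strip()] = value.strip()
    return params


# Hypothetical excerpt mirroring the settings shown in the article.
SAMPLE_CFG = """\
[MyRepositoryEductionSettings]
MaxMatchesPerDoc=10000
TangibleCharacter=
SearchFields=DRECONTENT
"""

params = section_parameters(SAMPLE_CFG, "MyRepositoryEductionSettings")
print("TangibleCharacter" in params)
```

The check only reports whether the key is present in the section; it does not validate the value or any other Eduction setting.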
