Thread: Indexing comma delimited Info.-

This question is not answered. Helpful answers available: 5. Correct answers available: 1.


Permlink Replies: 4 - Pages: 1 - Last Post: Jan 28, 2009 3:25 PM Last Post By: redonis Threads: [ Previous | Next ]
redonis

Posts: 4
Registered: 03/28/08
Indexing comma delimited Info.-
Posted: Jan 28, 2009 8:38 AM
 
Click to report abuse...   Click to reply to this thread Reply
Hello, i'm working on a solution based on oracle text to index names of people and companies, but the problem is how to handle a delimiter that is part of the data, this is the sintax:

[1st last name],[2nd last name], [married last name], [1st first name],[2nd first name]

Ej.:(note: on a missing married last name it omits the string and you get ,,) like :
ABBOTT,HAIM,,JULIE,JEAN
ABDELHAFIZ,IBRAHIM,,MAHFOUZ,IBRAHIM
ABULARACH,GARCIA,,JORGE,ALFREDO

Using all fields:
ABDO,GARCIA,WILLIAMS,ROSA,MARIA
BRAN,PINZON,POZUELOS,MARIA,MAGDALENA
BUSTAMANTE,GOMAR,PEREIRA,SANDRA,IRENE

In other hand you have in the same table and field, names of companies like:
AGROMUNDO, S.A.
AMERICANAS, S.A.
BOXUS, S.A.

As you see some times there's a visible white space( ) between words:
ADSE, MORRISON,, ELVIA, HANS
but in regular cases like:
BRAN,PINZON,POZUELOS,MARIA,MAGDALENA
There's not.

In the index table(XXX$I) i get a full name token it doesnt take logic tokens like:
BRAN
PINZON
POZUELOS
MARIA
MAGDALENA

Is there a way to configure context to set the , as a break char(word) and get this right(already tried to set the character(,) in the break words with no luck)?

Or do you suggest any strategy on this?

Thanks for your comments

Edited by: user628087 on Jan 28, 2009 8:41 AM
Roger Ford

Posts: 960
Registered: 01/10/01
Re: Indexing comma delimited Info.-
Posted: Jan 28, 2009 8:47 AM   in response to: redonis in response to: redonis
 
Click to report abuse...   Click to reply to this thread Reply
The comma will be a break character, and will split words unless you've explicitly set it as a PRINTJOINS or SPLITJOINS character.
redonis

Posts: 4
Registered: 03/28/08
Re: Indexing comma delimited Info.-
Posted: Jan 28, 2009 8:52 AM   in response to: Roger Ford in response to: Roger Ford
 
Click to report abuse...   Click to reply to this thread Reply
The only setting on this matter, was:

exec CTX_DDL.ADD_STOPWORD('CTXSYS.DEFAULT_STOPLIST', ',');
exec CTX_DDL.ADD_STOPWORD('CTXSYS.DEFAULT_STOPLIST', 'S.A.');
exec CTX_DDL.ADD_STOPWORD('CTXSYS.DEFAULT_STOPLIST', ',S.A.');
exec CTX_DDL.ADD_STOPWORD('CTXSYS.DEFAULT_STOPLIST', 'S,A,');
exec CTX_DDL.ADD_STOPWORD('CTXSYS.DEFAULT_STOPLIST', ',S,A,');

How do i confirm about the printjoins?

Edited by: user628087 on Jan 28, 2009 8:53 AM
Barbara Boehmer

Posts: 2,950
Registered: 04/28/00
Re: Indexing comma delimited Info.-
Posted: Jan 28, 2009 12:02 PM   in response to: redonis in response to: redonis
 
Click to report abuse...   Click to reply to this thread Reply
How do i confirm about the printjoins?

select ctx_report.create_index_script ('your_index_name_goes_here') from dual;
redonis

Posts: 4
Registered: 03/28/08
Re: Indexing comma delimited Info.-
Posted: Jan 28, 2009 3:25 PM   in response to: Barbara Boehmer in response to: Barbara Boehmer
 
Click to report abuse...   Click to reply to this thread Reply
Thanks a LOT, it is solved now!

Edited by: redonis on Jan 28, 2009 3:25 PM
Legend
Guru Guru : 2500 - 1000000 pts
Expert Expert : 1000 - 2499 pts
Pro Pro : 500 - 999 pts
Journeyman Journeyman : 200 - 499 pts
Newbie Newbie : 0 - 199 pts
Oracle ACE Director
Oracle ACE Member
Oracle Employee ACE
Helpful Answer (5 pts)
Correct Answer (10 pts)

Point your RSS reader here for a feed of the latest messages in all forums