|
Hello, i'm working on a solution based on oracle text to index names of people and companies, but the problem is how to handle a delimiter that is part of the data, this is the sintax:
[1st last name],[2nd last name], [married last name], [1st first name],[2nd first name]
Ej.:(note: on a missing married last name it omits the string and you get ,,) like :
ABBOTT,HAIM,,JULIE,JEAN
ABDELHAFIZ,IBRAHIM,,MAHFOUZ,IBRAHIM
ABULARACH,GARCIA,,JORGE,ALFREDO
Using all fields:
ABDO,GARCIA,WILLIAMS,ROSA,MARIA
BRAN,PINZON,POZUELOS,MARIA,MAGDALENA
BUSTAMANTE,GOMAR,PEREIRA,SANDRA,IRENE
In other hand you have in the same table and field, names of companies like:
AGROMUNDO, S.A.
AMERICANAS, S.A.
BOXUS, S.A.
As you see some times there's a visible white space( ) between words:
ADSE, MORRISON,, ELVIA, HANS
but in regular cases like:
BRAN,PINZON,POZUELOS,MARIA,MAGDALENA
There's not.
In the index table(XXX$I) i get a full name token it doesnt take logic tokens like:
BRAN
PINZON
POZUELOS
MARIA
MAGDALENA
Is there a way to configure context to set the , as a break char(word) and get this right(already tried to set the character(,) in the break words with no luck)?
Or do you suggest any strategy on this?
Thanks for your comments
Edited by: user628087 on Jan 28, 2009 8:41 AM
|