Hi all,
I am trying to figure out a problem I am facing in identifying how to correctly see data in EDQ. Considering the table below, I have a set of customer profiles that are linked to a certain classifications all in a column unfortunately. There are 10 unique classifications only. Therefore, I know there are a 100 combinations available for my profile data. I want to create reference data of these 100 combinations and show the count of associated profiles with each and every combination.
I have used the GROUP processor and grouped by the classifications. However, what this does is shows the count associated with every unique classification. What I want is the count of profiles pertaining to different classification combinations. For example customer number 1 and 6 have the same combination of classifications. So for this particular combination the count of profiles will be 2. Also please be aware I have an excess of 10 million profiles with unique customer number.
Please help me identifying a process that will allow me to see this in OEDQ.
Thank you for the support!
Customer Number | Classifications |
1 | Yellow |
1 | Orange |
1 | White |
1 | Black |
2 | Yellow |
2 | Orange |
3 | Yellow |
3 | Orange |
3 | Black |
5 | Yellow |
4 | Orange |
4 | Black |
6 | Yellow |
6 | Orange |
6 | White |
6 | Black |