Data360 Analyze

  • 1.  I want to find records that uniquely match from given datasets

    Posted 08-26-2020 04:14

    I have two datasets

    Dataset1:

    COL1
    1
    2

     

    Dataset2:

    col1    col2
    1       2
    1       3
    2       3

     

    Here, when I match both datasets on 'COL1' and 'col1', I get three rows:

    1  2

    1  3

    2  3

    My expectation is that the join should not return anything for value 1, because it has two matches.

    Please suggest a solution that does this in a single node, rather than multiple nodes (x-ref + remove duplicates), so that the output contains only the unique matches and ignores values with two or more matches.

     



  • 2.  RE: I want to find records that uniquely match from given datasets

    Employee
    Posted 08-26-2020 10:20

    The node is operating correctly, as there are multiple instances of the value '1' in the Left data set. Records are processed one at a time, and the results of the join for the current input record are written to the result set at that time.

    The correlation nodes do not offer an option to post-process the join results to identify distinct value combinations. A separate node is needed to obtain the distinct combinations.
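
    To illustrate why a second pass is needed, here is a minimal sketch in plain Python. It is not Data360 Analyze node code; the function names inner_join and keep_unique_matches are hypothetical, and it assumes "unique match" means a key that appears exactly once in the join output. The join emits rows one input record at a time, so it cannot know how many matches a key has until all rows have been produced.

```python
from collections import Counter

# Sample data from the original question.
dataset1 = [{"COL1": 1}, {"COL1": 2}]
dataset2 = [
    {"col1": 1, "col2": 2},
    {"col1": 1, "col2": 3},
    {"col1": 2, "col2": 3},
]


def inner_join(left, right):
    """Emit one (key, col2) row per matching pair, as a plain join does."""
    return [
        (l["COL1"], r["col2"])
        for l in left
        for r in right
        if l["COL1"] == r["col1"]
    ]


def keep_unique_matches(joined):
    """Second pass: keep rows whose key occurs exactly once in the join output."""
    counts = Counter(key for key, _ in joined)
    return [row for row in joined if counts[row[0]] == 1]


joined = inner_join(dataset1, dataset2)
unique_only = keep_unique_matches(joined)

print(joined)       # all three joined rows, including both matches for key 1
print(unique_only)  # only the row for key 2, which matched exactly once
```

    The counting step needs the complete join output, which is why it cannot happen inside a record-at-a-time join.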



  • 3.  RE: I want to find records that uniquely match from given datasets

    Posted 08-26-2020 13:43

    Dear Adrian,

    Could you please provide the code for this?

    Thanks :)



  • 4.  RE: I want to find records that uniquely match from given datasets

    Employee
    Posted 08-27-2020 01:15

    There is no code as such; the logic was built by configuring code-free nodes. An example data flow is attached.

    Note that in the example, the field in Dataset1 is renamed just to clarify the operation of the node.

     

     

    Attached files

    Merge_Node_Join_Distinct_Example_342 - 27 Aug 2020.lna