Quantcast
Channel: Data Preparation & Blending discussions
Viewing all articles
Browse latest Browse all 4999

Help on "Distance matrix", created from 8 different input files

$
0
0

Hi, 

Need some expertise, hope someone has a good idea.

 

I have a client that supplies 8 different source files, that I need to analyze for redundant data, ie. which records exist across all files. I imagine an output similar to a distance matrix, showing how the data intertwine with each other across the files.

 

Sources contains: unique_id, source_reference

 

Any good ideas.

 

Cell B2, would hold all data in source file A, Cell B3, Would show all records contained in both source A and source B.
Matrix.jpg


Viewing all articles
Browse latest Browse all 4999

Trending Articles