Hi,
Need some expertise, hope someone has a good idea.
I have a client that supplies 8 different source files, that I need to analyze for redundant data, ie. which records exist across all files. I imagine an output similar to a distance matrix, showing how the data intertwine with each other across the files.
Sources contains: unique_id, source_reference
Any good ideas.
Cell B2, would hold all data in source file A, Cell B3, Would show all records contained in both source A and source B.