Quantcast
Channel: Data Preparation & Blending discussions
Viewing all articles
Browse latest Browse all 4999

Reading XML files into table while maintaining relations

$
0
0

I want to read the tags marked with text from XML files that follow this pattern:

 XML_Alteryx.PNG

I have created a Talend Open Studio job that does this very well. However I obviously can't use the XPath-structure in Alteryx. I tried a few things, but nothing seems to work.

My biggest concern is that at the lowest level of AAAAB and AAAAC the field types can vary, so AAAAB has field 1 and field 2 types, but AAAAC has field 1 and field 3 types. These can't mix, I need to know exactly which field 1 is aligned with field 2 or field 3.

And I can't depict that relation in Alteryx.

 

My first try was this, where every parser reads a single child element. It keeps all the fields I want, unfortunately it creates lots of duplicates and no correct relations. Not to mention that it is inredibly slow because of the duplicates.

XML_Alteryx2.PNG

My second try was to parse every field individually, but I can't join these correctly, because I have no key fields with which to join the records. Matching by record order doesn't work because of the field 1/2 separation at the lowest level.

XML_Alteryx3.PNG

 

I hope my problem is understandable. I feel like I'm tackling this whole issue from the wrong perspective.

 


Viewing all articles
Browse latest Browse all 4999

Trending Articles