I want to read the tags marked with text from XML files that follow this pattern:
I have created a Talend Open Studio job that does this very well. However I obviously can't use the XPath-structure in Alteryx. I tried a few things, but nothing seems to work.
My biggest concern is that at the lowest level of AAAAB and AAAAC the field types can vary, so AAAAB has field 1 and field 2 types, but AAAAC has field 1 and field 3 types. These can't mix, I need to know exactly which field 1 is aligned with field 2 or field 3.
And I can't depict that relation in Alteryx.
My first try was this, where every parser reads a single child element. It keeps all the fields I want, unfortunately it creates lots of duplicates and no correct relations. Not to mention that it is inredibly slow because of the duplicates.
My second try was to parse every field individually, but I can't join these correctly, because I have no key fields with which to join the records. Matching by record order doesn't work because of the field 1/2 separation at the lowest level.
I hope my problem is understandable. I feel like I'm tackling this whole issue from the wrong perspective.