Quantcast
Channel: Data Preparation & Blending discussions
Viewing all articles
Browse latest Browse all 4999

Identifying and counting occurrence of a string

$
0
0

Hey, this is my first post to the forum and I haven't reviewed the rules too thoroughly, so I hope my post does not violate any. I have a dataset (pictured below) with 100+categories that just have letters. I would like to be able to identify strings that appear less than 5 times and categorize rows that contain that value as "Rare". I can't use a summarize node with a group by and a count, because I would have to add all 100+ of them and the count would just count the exact match of the combination of letters instead of the individual letter. Any thoughts or ideas?

What the dataset looks like


Viewing all articles
Browse latest Browse all 4999

Trending Articles