Data Manager Tips & Tricks: using 'LIKE' and wildcards in dataset filters

Avatar image
Dec 01, 2020
Chan Bulgin wrote
Hi:

Recently, I was working with a very large data set where I only needed a small subset of the data it contained. One way to accomplish this rather easily is to use SQL LIKE and wildcards to include only data that matches the pattern you need. To do this, you must know the characters or values you want to include or exclude.

In the Define Incoming Data set wizard, you can apply this code technique in the Filter tab. The syntax is:

[Field Name] LIKE '%VALUE%'

This will include only rows where the [Field Name] contains the string 'VALUE'. You can also use '%VALUE' or 'VALUE%' to include only rows where VALUE is at the end or start (respectively). Redshift also allows the use of the wildcard _ to search for a single character.

Full use syntax is here: https://docs.aws.amazon.com/redshift/latest/dg/r_patternmatching_condition_like.html

See attached picture for LityxIQ Data Manager Example.
1 Answer
Avatar image
Dec 01, 2020
Chan Bulgin wrote
Note this also allows the use of NOT before LIKE.