A reader asked how to extract the bond type "CD" from the following input string: "CD Corporation du 20/12/2010 4.5% Ã 26 semaines". Although it's easy to grab the first two characters in a tMap using a substring function, there is an off-the-shelf component tExtractRegexFields that can handle varying lengths. This job uses a tFixedFlowInput to provide data. The tFixedFlowInput is run into a tExtractRegexFields which breaks the input into two strings: investment type and remainder. The tExtractRegexFields is connected to a tMap which filters the columns. The result is output to tLogRow. Input Data for Regex-parsing Job The test data consists of four records representing four French investment types: CD, OAT, BTF, BTAN. Types vary in character length: 2, 3, 4. While a simple substring() in a tMap is a quick way to pluck the first two characters off of a string, that solution won't work for the varying characters. Regular expressions can be ...