In a previous commit, processFile.rb was peppered with #gsub calls to convert newlines in extracted text to spaces. The possibility of newlines in the extracts has two implications:
- regexps processing the text must allow for patterns that span more than one line
- the output comma-separated value (CSV) file is ill-formed for uses that expect one record per line
An example of a multi-line extract is the xref field of the second grant extracted from ipg140107 (see lines 2509-2530 of ipg140107.extract), though others are likely to exist.
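
Both implications can be reproduced in isolation. The following is a minimal sketch, not code taken from processFile.rb: the sample string, the pattern, and the variable names are all hypothetical, and the use of Ruby's csv library is an assumption about how the output could be written rather than a description of how it is written.

```ruby
require "csv"

# Hypothetical extract containing an embedded newline
# (illustrative only, not copied from ipg140107.extract).
extract = "see application\nSer. No. 12/345,678"

# Without the /m flag, "." does not match a newline, so a pattern
# that has to cross the line break finds nothing.
single_line = extract.match(/application.*Ser/)

# With /m, "." also matches newlines and the same pattern succeeds.
multi_line = extract.match(/application.*Ser/m)

# The #gsub approach: collapse each newline (plus adjacent whitespace)
# into a single space.
flattened = extract.gsub(/\s*\n\s*/, " ")

# A naively joined row puts one record on two physical lines.
naive_row = ["1", extract].join(",")

# CSV.generate_line quotes the field, so a CSV parser can still
# recover a single record with the newline preserved.
quoted_row = CSV.generate_line(["1", extract])
parsed = CSV.parse(quoted_row)
```

Flattening with #gsub keeps every record on one physical line, which suits line-oriented tools. Quoting preserves the original text but requires a reader that understands quoted CSV fields.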