I am ingesting data into MarkLogic using MLCP, but many records are skipped because of invalid characters in the file.
Is there any way to ignore the invalid characters and ingest every record in the CSV without skipping any?
Below is the error message in the logs:
WARN Skipped record: abc.csv at line 1414, reason: invalid char between encapsulated token and delimiter
It would help if you provided an example of the records that cause the warning. However, the most common reason is that you use a comma (,) as the delimiter and the value contains quotes that do not encapsulate the entire value. For instance:
In this case,
"bar" Y is invalid. You could fix that by escaping the quotes: https://www.marklogic.com/blog/delimited_text_mlcp