I got a spreadsheet which looks like this. I will like to keep the file column, but extract only the sentences with the word "India". Is there a way to do that? Prefer to use KNIME or R, but happy with any solution.
Only the sentences with "India" is extracted, but the file column is kept.


This can be achieved using the
dplyrandstr_detect()from thestringrpackage. Note that "India | india" in the following code will capture both "India" and the grammatically incorrect "india" in case it exists: