In the dataset are cases like this: `einem realen Umfeld einzusetzen. \n\nJohn liebte`. Should we remove the space before the `\n`? With code like this: `t.replace(" < br > ", "\n").replace("< br > ", "\n")`