One common situation is that you generate data for a topic with an initial set of articles, then notice some articles that ought to be excluded. (This is especially the case if the articles list comes from the Dashboard, representing a set of articles edited for some course or program... which may include some edits that aren't closely related to the focus area of that course/program.)
If you upload a new CSV of articles that removes some unwanted ones, then re-import and re-generate data, the removed articles still show up.
The simplest solution to this might be to add a routine to destroy all the generated data for a topic when relevant inputs change, forcing re-generation from a clean state.
A more user-friendly option would be to allow the list of articles to be edited with additions and removals via a UI, and only regenerate or prune data related to added/removed articles.
One common situation is that you generate data for a topic with an initial set of articles, then notice some articles that ought to be excluded. (This is especially the case if the articles list comes from the Dashboard, representing a set of articles edited for some course or program... which may include some edits that aren't closely related to the focus area of that course/program.)
If you upload a new CSV of articles that removes some unwanted ones, then re-import and re-generate data, the removed articles still show up.
The simplest solution to this might be to add a routine to destroy all the generated data for a topic when relevant inputs change, forcing re-generation from a clean state.
A more user-friendly option would be to allow the list of articles to be edited with additions and removals via a UI, and only regenerate or prune data related to added/removed articles.