I have several projects on Python I am currently versioning with Git. I also have the input-output data that is gradually changing over the course of the project. I need to be able to re-run my pipeline later exactly in a way it was executed at a given time.
What would be the best way of doing it without inserting the source data into the git repository?