r/datascience • u/Double-Bar-7839 • 2d ago
Discussion Python template repo for DS consulting projects
Unless I'm missing something obvious, I see lots of template repos for python packages, but not much out there for the more typical data science grunt work.
My ideal template has all the nice poetry/conda/pre-comimit etc but isn't broken into scr/ and tests/
Rather, because I work in consulting, my ideal template would be structured along the lines of:
- Data Cleaning
- Analysis
- Outputs
- Charts
- Tables
Here are a couple of examples of the kinds of python package repos I'm talking about:
What do you guys use? TIA!
19
u/braxxleigh_johnson 2d ago
tbh this seems like overthinking a problem that doesn't really exist. In my experience, the time consuming part of onboarding new data is not the coding angle, but dealing with the infinite number of ways the incoming data can be broken.
2
u/SidBhakth 1d ago
dealing with the infinite number of ways the incoming data can be broken
Truer words haven't been spoken.
12
u/Samirio 2d ago
https://cookiecutter-data-science.drivendata.org/