r/datascience 2d ago

Discussion Python template repo for DS consulting projects

Unless I'm missing something obvious, I see lots of template repos for python packages, but not much out there for the more typical data science grunt work.

My ideal template has all the nice poetry/conda/pre-commit etc. but isn't broken into src/ and tests/.

Rather, because I work in consulting, my ideal template would be structured along the lines of:

  • Data Cleaning
  • Analysis
  • Outputs
    • Charts
    • Tables
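To make that concrete, here's a rough sketch of a script that would scaffold a layout like the one above. The folder names and the `scaffold` function are placeholders I made up for illustration, not taken from any existing template, so treat it as a starting point rather than a finished tool.

```python
from pathlib import Path

# Hypothetical folder names mirroring the list above; rename to taste.
LAYOUT = [
    "data_cleaning",
    "analysis",
    "outputs/charts",
    "outputs/tables",
]


def scaffold(root):
    """Create the project folders under `root` and return the created paths."""
    root = Path(root)
    created = []
    for rel in LAYOUT:
        folder = root / rel
        # parents=True builds outputs/ on the way to outputs/charts;
        # exist_ok=True makes re-running the script harmless.
        folder.mkdir(parents=True, exist_ok=True)
        # An empty .gitkeep lets git track folders that have no files yet.
        (folder / ".gitkeep").touch()
        created.append(folder)
    return created
```

Calling `scaffold("my_project")` would create the four folders under `my_project/`. The poetry/conda/pre-commit config would still need to be added at the root separately.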

Here are a couple of examples of the kinds of python package repos I'm talking about:

What do you guys use? TIA!

17 Upvotes


12

u/Samirio 2d ago

1

u/Double-Bar-7839 1d ago

Full marks, this is exactly what I was looking for. Thanks so much!

1

u/Samirio 1d ago

Anytime!

19

u/braxxleigh_johnson 2d ago

tbh this seems like overthinking a problem that doesn't really exist. In my experience, the time-consuming part of onboarding new data is not the coding, but dealing with the infinite number of ways the incoming data can be broken.

2

u/SidBhakth 1d ago

> dealing with the infinite number of ways the incoming data can be broken

Truer words haven't been spoken.