Skip to content
@datarootsio

dataroots

Supporting your data driven strategy.

🖖 Welcome to Dataroots' GitHub org

youtube meetup web blog hugginface instagram linkedin twitter email

stars

Dataroots was founded out of a strong belief that AI & data-driven solutions can be used by companies to gain a competitive edge in terms of company processes, customer interactions and legal compliance. Our mission is to deliver data-driven solutions with unrivalled longevity and business impact for our clients.

ℹ️ Feel free to browse around, below are some quick starting points.

Terraform

Tutorials

Templates

  • ml-skeleton-py: An opinionated project template that allows you to get started on a new machine learning project
  • python-minimal-boilerplate: A minimal-yet-opinionated project template to kickstart a new Python project
  • skeleton-pyspark: An opinionated project template that allows you to get started on an ETL job with PySpark

Models

Rootsacademy Projects:

Open source packages

  • artyfarty: ggplot2 theme + palette presets
  • cheek: Crontab-like scHeduler for Effective Execution of tasKs, cheek for short
  • databooks: for sharing and caring about Jupyter notebooks ❤️
  • dbt-fabric: dbt adapter for Microsoft Fabric Data Warehouses
  • expiring-lru-cache: LRU caching with expiration period
  • github-stats-card: ⭐️ a minimal but inclusive github stats badge ⭐️
  • nbdefs2py: extract functions and classes from notebooks
  • phonehome: KISS telemetry for FOSS packages
  • rootsstyle: a dataroots inspired style for Matplotlib
  • tf-profile: CLI tool to profile Terraform runs, written in Go

Our events 🍻

Check out all our events at dataroots.io/events/ or sign up to our weekly digest 👈

Our blog ✍️

Our latest posts:

Check out all our posts at dataroots.io/blog/ 👈

Join our team! ❤️

Our open positions:

For more info check out dataroots.io/careers 👈

Popular repositories Loading

  1. tf-profile tf-profile Public

    CLI tool to profile Terraform runs, written in Go

    Go 163 4

  2. ml-skeleton-py ml-skeleton-py Public template

    A best-practices first project template that allows you to get started on a new machine learning project.

    Python 145 19

  3. databooks databooks Public

    A CLI tool to reduce the friction between data scientists by reducing git conflicts removing notebook metadata and gracefully resolving git conflicts.

    Python 113 5

  4. artyfarty artyfarty Public

    ggplot2 theme + palette presets

    R 96 7

  5. tutorial-face-mask-detection tutorial-face-mask-detection Public

    In this project, we develop a pipeline to detect unmasked faces in images. This can, for example, be used to alert people that do not wear a mask when entering a building.

    Jupyter Notebook 89 18

  6. tutorial-great-expectations tutorial-great-expectations Public

    A tutorial for the Great Expectations library.

    Jupyter Notebook 73 18

Repositories

Showing 10 of 79 repositories
  • .github Public

    🚀 Get started in our repos

    datarootsio/.github’s past year of commit activity
    Python 12 1 2 0 Updated Dec 10, 2025
  • genai-2025-lt Public
    datarootsio/genai-2025-lt’s past year of commit activity
    Vue 0 0 0 7 Updated Dec 2, 2025
  • langgraph-template-travel-planner Public template

    A LangGraph template for building AI agents with human-in-the-loop, conditional routing, observability (Langfuse), and a Reflex UI demonstrated through a travel planner use case.

    datarootsio/langgraph-template-travel-planner’s past year of commit activity
    Python 17 MIT 2 0 0 Updated Sep 18, 2025
  • datarootsio/dbt_core_initiative’s past year of commit activity
    0 0 0 1 Updated Sep 9, 2025
  • datarootsio/airflow-workshop’s past year of commit activity
    Python 0 3 0 0 Updated Sep 7, 2025
  • crime-committed Public

    Can you solver our murder mystery?

    datarootsio/crime-committed’s past year of commit activity
    0 0 0 0 Updated Sep 4, 2025
  • datarootsio/rootsacademy-2024-docker-101’s past year of commit activity
    Python 0 0 0 1 Updated Jul 22, 2025
  • terraform-provider-dagster Public

    Terraform provider to manage dagster cloud resources.

    datarootsio/terraform-provider-dagster’s past year of commit activity
    Go 8 MIT 1 14 5 Updated Jul 14, 2025
  • your-best-bet Public

    MLOps with dbt + python to orchestrate a ML pipeline beating bookies odds

    datarootsio/your-best-bet’s past year of commit activity
    Jupyter Notebook 6 1 0 4 Updated Jun 20, 2025
  • unity_catalog_template Public

    This Repo is used to develop a template for https://www.unitycatalog.io/ that can be used for a quick client setup.

    datarootsio/unity_catalog_template’s past year of commit activity
    Bicep 3 0 0 0 Updated Jun 13, 2025

Most used topics

Loading…