Senior DevOps Engineer – Azure Databricks & Kubernetes

About RENEWCAST

Founded in 2020, RENEWCAST is a leader in precision forecasting for wind energy. We harness cutting-edge machine learning to deliver highly accurate power production forecasts. Our mission is to advance renewable energy through innovation, collaboration, and technical excellence.

Recently, RENEWCAST secured a €2 million funding round led by South Western Power Group (SWPG) and CDP Venture Capital’s Green Transition Fund, strengthening our position as a global leader in renewable energy forecasting. This investment enables us to expand our tech team, enhance our forecasting models, and scale operations across Europe and the U.S.

Backed by Beamline Accelerator, Helen Ventures, and Tech4Planet, we are assembling a world-class team of engineers, data scientists, and energy experts to redefine wind and solar power forecasting.

We're building a flexible, hands-on data science platform using Azure Databricks and AKS. One day, you'll roll up your sleeves to write Terraform modules; the next, you'll be optimizing Databricks SQL queries or orchestrating complex workflows on Kubernetes, and everything in between.

You'll work closely with our Data Platform Manager, MLOps Engineer, and Data Engineer, bringing deep best-practice knowledge to every technical challenge. Your contributions will have both architectural impact and hands-on implementation.

We're looking for a core team member who can own projects end-to-end, proactively and collaboratively plan all stages, and drive every initiative through to completion.

Responsibilities

Terraform-Driven Infrastructure

  • Write, test, and maintain Terraform modules for Azure resources (AKS, Databricks workspaces, VNETs, storage, networking) and aim to embed policy-as-code (OPA/Kyverno) to enforce guardrails.

Azure Databricks Operations

  • Provision, secure, and tune clusters; optimize Delta Lake tables and notebook jobs; minimize data egress costs.

Kubernetes Orchestration & Management

  • Design, deploy, and operate AKS clusters at scale for batch, workflow, and model-serving workloads; configure autoscaling, CNI networking, private endpoints, and peering gateways; orchestrate varied processes via Helm/Kustomize and custom operators.

ML Lifecycle Automation & Model Serving

  • Embed MLflow into CI/CD workflows: track experiments, package models, and automate deployments and model-serving endpoints in collaboration with our MLOps Engineer.

GitHub-Driven CI/CD & GitOps

  • Build and maintain GitHub Actions workflows and Argo CD configurations for infrastructure, services, and ML models.

Platform Reliability, Monitoring & FinOps

  • Define SLOs/SLAs; instrument with Prometheus/Grafana/OpenTelemetry; drive proactive cloud monitoring, tagging, rightsizing, and chargeback for cost optimization.

Cross-Team Collaboration & Architectural Guidance

  • Partner across backend, frontend, product, and data teams to tackle diverse technical challenges; leverage your prior experience to shape our decision-making, codify proven practices, and lead architectural discussions.

Required Expertise 

  • 5+ years of hands-on experience in DevOps/MLOps roles, with a track record of championing best practices and driving design and code reviews

  • Terraform & Policy-as-Code: Modules, workspaces, remote state, drift detection (all Azure); OPA/Kyverno guardrails

  • Azure Databricks: Workspace ops, Delta Lake optimization, job orchestration, and data ops

  • Kubernetes (AKS): Cluster design and orchestration, autoscaling, CNI & network policies, private VNETs, peering gateways

  • CI/CD & GitOps: GitHub Actions, Argo CD or equivalent 

  • Observability & Monitoring: Prometheus, Grafana, and OpenTelemetry for metrics, alerting and cloud monitoring

  • Cloud Cost Optimization: Tagging, rightsizing, chargeback, Azure Cost Management

Preferred Skills
  • Modern Orchestrators: Dagster (or equivalent) for ELT pipeline design

  • Tensorial Data Pipelines: Zarr, Dask, and large-scale batch processing

  • Certifications: CKA/CKAD, Terraform Associate, Databricks Data Engineer, Azure DevOps Engineer, FinOps Practitioner

Bonus
  • Cloud-Agnostic Compute Provisioning: spot/preemptible strategies across clouds

  • Streaming/Event Ingestion: Kafka or Pulsar fundamentals

  • Unit Testing & QA: pytest, automated infra testing (terratest, kitchen-terraform)

  • FinOps Tooling: Azure Cost Management, chargeback implementations

What We Offer
  • Strategic Ownership & Architectural Impact – Lead design and ops of our next-generation ML platform with direct CTO/CEO visibility

  • Competitive Compensation + ESOP

  • Flexible Hybrid Remote – Offices in Tallinn and Rome, work where you're most productive

  • Career Growth – Mentor track and architect track, with hands-on delivery expected

How to Apply

Ready to dive into code, reshape our platform and collaborate across teams to deliver high‑velocity ML services? Let's talk.

Send your resume to recruiting@renewcast.com
