Resume
WORK EXPERIENCE
Hivebrite – Staff Site Reliability Engineer (March 2024 - Present)
Drove cross-team initiatives to improve reliability, scalability and developer productivity across a multi-tenant SaaS platform. Built internal tools to automate infrastructure and incident workflows, defined standards for CI/CD and mentored engineers across the organization.
CI/CD & Infrastructure
- Deployed self-hosted GitHub Actions runners on GKE with autoscaling.
- Designed and operated Terraflow, an internal CI/CD orchestrator for Terraform to reduce deployment toil and prevent infrastructure drift. (article)
- Migrated legacy Buildkite pipelines to GitHub Workflows for faster, more reliable builds.
- Added automated security and quality checks using pre-commit hooks across all repositories.
- Integrated Temporal for workflow orchestration and Qdrant as the vector database for the content-recommendation service.
- Led the AI tooling transition within the team, covering training, Cursor rules creation and a comparison of AI pull request review tools (Cursor Bugbot vs Claude Code vs Graphite Diamond).
Reliability & Observability
- Redefined cross-team incident management workflows to improve runbook and postmortem quality and enhance production support and on-call experience.
- Migrated the Istio Operator to Helm-based Istio deployments to eliminate downtime, simplify upgrades and reduce tech debt.
- Created an agentic Slack Incident Response Bot to automate Datadog monitors and vendor status updates while integrating with Claude Code and MCP servers to retrieve related incidents, recovery steps and documentation.
- Led an observability platform comparison across Datadog, Grafana Cloud, Coralogix, SigNoz and a self-hosted OSS stack.
Performance & Optimization
- Drove the Ruby on Rails web server migration from Puma to Falcon; benchmarks showed ~36% lower latency, ~67% higher throughput and ~86% fewer 5xx responses with flat memory usage, at the cost of ~58% more CPU usage.
Leadership & Org Impact
- Contributed to cross-team initiatives, including architecture reviews and the engineering roadmap.
- Founded the company Engineering Blog and established publishing standards.
- Mentored engineers through pairing sessions and internal technical workshops.
- Terraflow was recognized in Hivebrite’s Crédit Impôt Recherche & Innovation 2024 (French research and innovation tax credits).
Stack : GCP, GKE, Kubernetes, Istio, Helm, Keda, Terraform, Terragrunt, ArgoCD, Kargo, Wiz, Datadog, PostgreSQL, Temporal, Qdrant, GitHub, CI/CD, Cursor, MCP, Dust, Claude Code, Graphite Diamond
Languages : Python, Bash, Golang
3D Systems – Senior Site Reliability Engineer (March 2023 - March 2024)
As a Senior DevOps Engineer and Observability Specialist within 3D Systems’ cloud software division, I improved the company observability stack, optimized system performance and trained teams in advanced monitoring and troubleshooting techniques.
Observability & Monitoring
- Investigated and resolved Prometheus high cardinality and OOM issues, improving cluster stability. (article)
- Optimized Prometheus performance, reducing average CPU usage by 119% and RAM usage by 139%. (article)
- Rebuilt and enhanced the Thanos stack, including StoreGateway reliability fixes and Redis optimizations.
- Implemented Service Level Objectives (SLOs) using Pyrra, following a comparative study against Sloth. (article)
- Implemented continuous profiling with Polar Signals to gain deeper insight into system performance.
- Delivered a major Grafana update and refactor using Infrastructure-as-Code for dashboards, plugins and data sources.
Infrastructure & Automation
- Implemented Harbor for secure and efficient container image management.
- Developed GitHub repository templates with automatic child synchronization, improving development consistency. (article)
- Provided ongoing production operations support and incident response.
Technical Leadership & Advocacy
- Authored documentation, comparative studies and internal training sessions.
- Led community initiatives around observability and monitoring practices:
  - Speaker at PromCon EU 2023: Finding useless and resource-hungry Prometheus metrics
  - Speaker at Geekle DevOps Global Summit 2023: Modern Grafana dashboard design
- Joined the Grafana Champion program, engaging further with the Grafana community.
Stack : GCP, GKE, Kubernetes, Terraform, Helm, Keda, Grafana, Prometheus, Thanos, Polar Signals, ELK, Redis, Harbor, GitHub, Copilot, CI/CD
Languages : Bash, Golang
Powder – DevOps Engineer / Technical Lead (December 2021 - March 2023)
As a Technical Lead at an AI-powered startup building a gaming clips platform on AWS, I was responsible for the platform’s architecture and end-to-end implementation. My work covered cloud infrastructure management, automation, observability, CI/CD, Kubernetes operations and platform security.
Key Projects
- Designed and deployed a complete observability stack using Grafana, Prometheus, Loki, OpenTelemetry and Tempo. Added black-box monitoring, status pages and alerting to ensure service availability and reliability. (article)
- Migrated all workloads to a GitOps model with ArgoCD, improving deployment consistency and rollback safety.
Infrastructure & Automation
- Managed a multi-account AWS environment using Infrastructure-as-Code (Terraform with custom modules).
- Standardized CI/CD with GitHub Actions and reusable workflows across all repositories.
- Developed internal tools, including a custom CLI to simplify AWS- and Kubernetes-related operations.
- Authored extensive technical documentation and led internal training sessions.
Security & Reliability
- Built a bastion architecture to secure access to production systems. (article)
- Implemented secure secret management using AWS Secrets Manager.
- Applied security best practices across the platform with Trivy, tfsec and Kyverno.
- Provided ongoing production operations support and incident response.
Stack : AWS, EKS, Kubernetes, Linux, Ubuntu, Ansible, Terraform, ArgoCD, Helm, Keda, Grafana, Prometheus, Grafana Loki, Grafana Tempo, GitHub, PostgreSQL
Languages : Bash, Golang
Acoss – DevOps Engineer (May 2020 - December 2021)
Worked on an on-premises Java application platform built upon OpenStack and Kubernetes. Using Prometheus, Grafana, and the EFK stack, I was responsible for the platform’s observability, reliability and performance.
Stack : OpenStack, Kubernetes, Cilium, Terraform, Flatcar Linux, ArgoCD, Prometheus, Grafana, Elastic Stack (ELK), Fluent Bit, Gitlab, CI/CD
Languages : Bash, Python
Airbus – DevOps Engineer (June 2018 - April 2020)
Contributed to the Airbus OneAtlas program, a military-grade satellite image processing solution built on Kubernetes, within a SAFe/Scrum framework. In a highly secure environment, I was responsible for the solution’s architecture, provisioning, performance and quality assurance on Google Cloud Platform.
Stack : GCP, GKE, Kubernetes, Terraform, MongoDB, Gitlab, CI/CD, Prometheus, Grafana, Thanos, JMeter
Languages : Golang, Python, Bash
Orange – System Engineer & Technical Lead (January 2015 - June 2018)
As a Technical Lead, I managed the on-premises infrastructure of Orange’s Digital Factory, leading multiple technical platforms and projects with a strong focus on infrastructure services, automation and security. I played a key role in three major projects: building a new Linux container-based cloud platform, developing a hardened hosting platform for the internal PKI and secret management service and implementing the Cassandra NoSQL database.
Stack : Data Centers, Ubuntu, CentOS, ITIL, Gitlab, Ansible, Cassandra, Vault, LAMP, MariaDB (Galera), Docker, Docker Swarm, Rancher, Xymon, OpenSCAP, Bareos
Languages : Python, Bash, Golang
Air France – System Engineer (August 2013 - January 2015)
In a business-critical environment, collaborated closely with all IT departments to maintain data center reliability, operating Unix and Linux systems without impacting strict uptime and performance SLAs.
Stack : Data Centers, Red Hat Enterprise Linux (RHEL), Solaris, Solaris Zones, ZFS, VMware, ITIL, SAN, CFEngine, Veritas Cluster Server (VCS), Veritas Volume Manager (VxVM), IBM Tivoli Storage Manager (TSM), IBM Tivoli Workload Scheduler (TWS), HP OpenView Operations (OVO)
Languages : Bash
i2N – System Administrator (February 2011 - June 2013)
Built and operated a customer-facing web and mail hosting platform using open-source technologies, handling system administration and custom front-end development for client websites.
Stack : OVH, Linux, Debian, LAMP, Postfix, Plesk, SVN
Languages : HTML5, CSS3, JS, jQuery, PHP, Bash
Computacenter – System Administrator (October 2010 - December 2010)
Updated the FAED system (the French national automated fingerprint database) across the Provence-Alpes-Côte d'Azur (PACA) region for the National Forensic Police Service (SNPS).
KEY SKILLS
- Unix & Linux Systems
- CI/CD Automation & Scripting
- Kubernetes & CNCF Ecosystem
- Infrastructure as Code
- Monitoring / Observability
- Cloud platforms (AWS / GCP)
- DevOps & Platform Engineering
- AI Tooling (Cursor, Dust, MCP)
PROGRAMMING SKILLS
- Bash / Shell scripting
- Go (Golang)
- Python
SOFT SKILLS
- Teamwork
- Analytical Thinking
- Problem Solving
- Effective Communication
- Versatility
CERTIFICATIONS
- Certified Kubernetes Security Specialist (CKS)
- Certified Kubernetes Administrator (CKA)
- Linux Professional Institute (LPIC-1)
- Professional Scrum Master (PSM I)
- ITIL V3-2011 Foundation (ITILF)
EDUCATION
CNAM – Bachelor’s Degree in Computer Science (BAC +3)
LANGUAGES
- French - Native speaker
- English - Native speaker
INTERESTS
- Rock climbing
- Old adventure games
- Books
- FOSS