About Our Client:
Fast-growing Technology company is seeking a Director of DevOps Engineering for permanent position.
Responsibilities:
· Proven track record of managing and mentoring a team of SREs and DBREs, driving reliability, scalability, and operational excellence across systems, and fostering a collaborative
and high-performance culture.
· Strong ability to influence cross-functional teams and drive initiatives that improve system reliability and scalability.
· Database Reliability: Manage MySQL (replication, sharding, partitioning) and PostgreSQL databases for high availability and performance in a high-traffic SaaS environment.
· Scalable Infrastructure: Experience designing and scaling highly available, fault-tolerant systems, especially for high-traffic environments (e.g., SaaS platforms, microservices).
· Cloud Platforms: Expertise with Google Cloud Platform (GCP), Amazon Web Services (AWS), or other cloud platforms for building, deploying, and managing applications and infrastructure.
· Distributed Systems: Solid understanding of distributed computing principles, including load balancing, high availability (HA), auto-scaling, and failover mechanisms.
· CI/CD & Automation: Optimize CI/CD pipelines using GitHub, Jenkins, and other tools to streamline deployment and infrastructure management.
· Collaboration: Work closely with engineering teams to design reliable, scalable, and cost-effective solutions.
Requirements:
· Experience: 8+ years in SRE, DBRE, or infrastructure engineering with 4+ years in a leadership role.
· Strong understanding of MySQL replication technologies. (eg. Tungsten, Galera, Group Replication)
· Deep knowledge of database sharding and partitioning strategies, particularly in high-traffic, high-availability SaaS environments.
· Solid understanding of cloud infrastructure (Google Cloud Platform and AWS) and best practices for deployment, scaling, and cost management.
· Proficiency in Python for automation, scripting, and system integrations.
· Experience with PostgreSQL and managing both relational and NoSQL databases.
· Expertise in Terraform, CI/CD pipelines (GitHub, Jenkins), and configuration management tools (Ansible, Puppet, Chef).
· Familiarity with containerization (Docker, Kubernetes) and orchestration in cloud environments.
#RIMCA