Mid/Senior DevOps Engineer
About the Company
Intelmatix is a pioneering deep-tech AI company founded by MIT technologists, with a global presence in Riyadh, London, and Boston. We are on a mission to transform businesses into cognitive enterprises through Decision Intelligence. Our technology blends cutting-edge AI with real-world applications, delivering scalable, secure, and data-driven solutions to global clients.
At Intelmatix, our culture is built around innovation, collaboration, and continuous learning. We believe in fostering a work environment that is inclusive, dynamic, and supportive, allowing our team members to grow both professionally and personally. With a strong foundation in AI and cloud technologies, we are committed to making a significant impact in the industry.
This role is central to our growth as we expand our platform engineering team, which develops and maintains the infrastructure behind our AI-driven platforms. The successful candidate will play a critical part in architecting, shaping, and supporting our cloud infrastructure, ensuring it is reliable, scalable, and cost-efficient.
Key Responsibilities
We are looking for a highly motivated Mid/Senior DevOps Engineer with a specialized focus on Google Cloud Platform (GCP) to join our Platform Engineering team. The ideal candidate will have a strong background in infrastructure automation, cloud-native solutions, and the deployment of complex applications, including AI/ML models.
- Architect, maintain, and optimize cloud infrastructure on GCP with a focus on reliability, scalability, and cost-efficiency.
- Automate global infrastructure provisioning using Terraform to ensure consistent environments across the lifecycle.
- Build and maintain robust CI/CD pipelines in GitHub Actions for application, data, and model deployment workflows.
- Work closely with Data Science teams to deploy and monitor machine learning models and analytical services.
- Implement and enforce security best practices, including IAM, VPC Service Controls, and Zero Trust architectures.
- Set up and maintain modern observability stacks (Prometheus, Grafana, Loki) for proactive monitoring and alerting.
- Assist in managing cross-cloud integrations, leveraging knowledge of AWS and Azure where applicable.
- Collaborate with the development team to ensure smooth integration of applications with the cloud infrastructure.
- Participate in on-call rotations to ensure 24/7 support for our cloud infrastructure.
- Stay up-to-date with the latest developments in cloud technologies and apply this knowledge to improve our infrastructure and processes.
Requirements & Qualifications
Must-Have
- Bachelor’s degree in Computer Science, Engineering, or a related field.
- 3–5 years of dedicated experience in DevOps, Site Reliability Engineering (SRE), or Cloud Engineering roles.
- Deep hands-on experience with Google Cloud services (e.g., GCE, GKE, GCS, Cloud SQL, IAM, and VPC).
- Strong proficiency with Docker and with container orchestration on Google Kubernetes Engine (GKE) or similar services from other cloud providers.
- Proven experience managing infrastructure through Terraform.
- Experience building automation pipelines using GitHub Actions or similar tools.
- Proficiency in Python and Bash for task automation and tooling.
Nice-to-Have
- Google Professional Cloud DevOps Engineer or Professional Cloud Architect certification, or an equivalent certification from another cloud provider.
- Familiarity with AWS (EC2, S3, EKS) or Azure (AKS, App Service).
- Familiarity with GCP-specific data tools (BigQuery, Vertex AI, Dataflow) or MLOps tools (MLflow, Airflow).
- Understanding of Zero Trust Network (ZTN) concepts and Cloudflare.
- Basic experience with PostgreSQL administration.
Technical Skills
Cloud & Infrastructure
We utilize a range of cloud services to support our infrastructure, including Google Cloud Platform (GCP), Amazon Web Services (AWS), and Microsoft Azure. Our team is responsible for managing and optimizing these resources to ensure high availability and scalability.
- GCP (GKE, GCE, BigQuery)
- AWS (EC2, S3, EKS)
- Azure (AKS, App Service)
CI/CD & Automation
We leverage various tools to automate our deployment pipelines and infrastructure provisioning, including Terraform, GitHub Actions, and Docker. These tools enable us to streamline our processes, reduce manual errors, and improve overall efficiency.
- Terraform
- GitHub Actions
- Docker
Security & Monitoring
Security and monitoring are critical components of our infrastructure. We use Cloudflare to help secure our systems and data, and Prometheus, Grafana, and Loki for monitoring, alerting, and log aggregation.
- Cloudflare
- Prometheus
- Grafana
- Loki
What We Offer
At Intelmatix, we offer a competitive annual salary ranging from 110,000 to 145,000, depending on experience. We also provide a range of benefits, including:
- Remote flexibility and the opportunity to work with a global team.
- Equity/stock options, allowing you to share in the company's success.
- A learning budget to support your professional development and continuous learning.
- Comprehensive health, dental, and vision insurance to ensure your well-being.
- A generous PTO policy, providing you with the time you need to relax and recharge.
- An equipment stipend to ensure you have the tools you need to perform your job effectively.
- Opportunities for growth and advancement within the company.
Our team culture is built around collaboration, innovation, and mutual respect.
Frequently Asked Questions
What is the remote work setup like?
We offer a flexible remote work arrangement, allowing you to work from anywhere in the world. We use a range of tools, including Slack, Zoom, and GitHub, to facilitate communication and collaboration among team members.
What is the hiring process and timeline?
Our hiring process typically involves an initial screening, followed by a series of interviews with the engineering team. We aim to complete the hiring process within 2–3 weeks, depending on the complexity of the role and the availability of candidates.
What is the team size and tech stack?
Our platform engineering team is relatively small, with a mix of experienced engineers and junior talent. We utilize a range of technologies, including GCP, AWS, Azure, Terraform, GitHub Actions, and Docker. We are committed to staying up-to-date with the latest developments in cloud technologies and applying this knowledge to improve our infrastructure and processes.