Senior/Lead AI DevOps/SRE

EPAM Systems

  • Казахстан
  • Постоянная работа
  • Полная занятость
  • 1 мес. назад
We are currently seeking an experienced Lead AI DevOps/SRE to join our team.In this pivotal role, you will collaborate closely with data scientists and software developers to ensure seamless integration and optimize the operational efficiency of our AI deployments. Your expertise will be pivotal in deploying, maintaining, and scaling our cutting-edge AI solutions, encompassing LLMs and RAG systems.As a key team member, you will spearhead both traditional DevOps responsibilities and innovative approaches to MLOps. Your proactive involvement will be essential in driving the success of our AI initiatives and maximizing their impact across the organization.This position offers remote setup with the flexibility to work from any location in Kazakhstan, whether it's your home or well-equipped offices in Astana, Almaty or Karaganda.ResponsibilitiesImplement and maintain CI/CD pipelines for AI and machine learning projects, ensuring robust deployment strategies and continuous integrationMonitor and ensure the reliability, availability, and performance of AI applications, particularly those involving LLMs and RAGCollaborate with AI research teams to operationalize machine learning models and systems efficientlyDevelop and enforce best practices for version control, configuration management, and testing of AI-driven software solutionsUtilize MLOps tools such as Kubeflow, MLflow, or TensorFlow Extended (TFX) to streamline the machine learning lifecycle from experimentation to productionImplement monitoring solutions that track both system metrics and model performance to facilitate proactive issue resolutionParticipate in on-call rotations to support the operational health of critical systems, employing SRE principles to meet service-level objectives (SLOs) and reduce downtimeRequirementsBachelor's degree in Computer Science, Engineering, or a related fieldProven experience as a DevOps Engineer or SRE, with a strong background in software development and automationExpertise in deployment and management of LLMs, including technologies like RAGProficient in CI/CD tools (Jenkins, GitLab CI, CircleCI) and infrastructure as code (Terraform, Ansible)Solid knowledge of container orchestration technologies (Kubernetes, Docker)Familiarity with MLOps tools and practices to support machine learning lifecycle managementNice to haveExperience with cloud services (AWS, GCP, Azure), particularly in AI/ML deploymentsBackground in monitoring tools like Prometheus, Grafana, and ELK stackUnderstanding of Python, particularly in data science and machine learning contextsCertification in Kubernetes, AWS/GCP/Azure, or similar technologiesWe offer/BenefitsWe connect like-minded people:
  • Delivering innovative solutions to industry leaders, making a global impact
  • Enjoyable working environment, whether it is the vibrant office or the comfort of your own home
  • Opportunity to work abroad for up to two months per year
  • Relocation opportunities within our offices in 55+ countries
  • Corporate and social events
We invest in your growth:
  • Leadership development, career advising, soft skills and well-being programs
  • Certifications, including GCP, Azure and AWS
  • Unlimited access to LinkedIn Learning and Get Abstract
  • Free English classes with certified teachers
  • Discounts in local language schools, including online courses for the Kazakh language
We cover it all:
  • Participation in the Employee Stock Purchase Plan
  • Monetary bonuses for engaging in the referral program
  • Medical & family care package
  • Six trust days per year (sick leave without a medical certificate)
  • Coverage of psychology sessions of your choice
  • Benefits package (sports activities, a variety of stores and services)
EPAM is a team of technologists and innovators united by a passion for technology. In Kazakhstan, we operate across all cities with offices in Astana, Almaty, and Karaganda and work with the world's leading companies from different industries. In 2023, EPAM received the Export Excellence Award at the esteemed Digital Bridge Awards, showcasing our commitment to excellence and innovation.

EPAM Systems

Похожие вакансии

  • Senior Backend-разработчик Node.js

    Моторная компания Астана-Моторс

    • Алматы
    MyStartups – дочерняя IT-компания Astana Motors, специализирующаяся на создании технологичных решений для оптимизации бизнес-процессов в разных сферах. Сейчас мы активно расширяе…
    • 1 д. назад

    Просмотреть похожие вакансии:

  • Senior Backend Developer (Python)

    Моторная компания Астана-Моторс

    • Алматы
    MyStartups – дочерняя IT-компания Astana Motors, специализирующаяся на создании технологичных решений для оптимизации бизнес-процессов в разных сферах. Сейчас мы активно расширяе…
    • 1 д. назад

    Просмотреть похожие вакансии:

  • Senior Frontend Vuejs developer

    Bilim Land (Bilim Group)

    • Алматы
    Bilim Group - это продуктовая EdTech-компания в Казахстане. Мы работаем с огромным масштабом изменений - со всей системой образования страны. В нашей экосистеме сегодня более 20 …
    • 1 д. назад

    Просмотреть похожие вакансии: