We are seeking a skilled engineer who excels in both software development and infrastructure administration to join our team. The role spans the full infrastructure lifecycle, from provisioning and maintenance to developing automation tools for our Kubernetes/Istio/Linux-based platform.
Key Responsibilities
Operations Management
Provision and maintain nodes and clusters in our production environment
Respond to alerts, handle incidents, and implement effective countermeasures
Perform OS and middleware updates with minimal service disruption
Ensure all security requirements are met according to company standards
Participate in midnight releases and monitoring rotations
Create comprehensive operation manuals and obtain necessary approvals for production changes
Development Initiatives
Define requirements and create technical design documentation including diagrams and pros/cons analyses
Develop automation solutions for cluster/node provisioning
Create self-healing systems and automated OS/middleware upgrade processes
Implement infrastructure-as-code practices to improve scalability and reliability
User Support
Provide expert-level support for complex cases beyond the capacity of standard user support teams
Guide teams during migration from legacy platforms to our new private cloud infrastructure
Required Qualifications
3+ years of software development experience covering the full SDLC (requirements definition, design, implementation, testing, release) using Go and/or Python
3+ years of hands-on infrastructure administration experience in production environments with Linux, Kubernetes, Istio, Nginx, Mesos, or Marathon
Deep understanding of Kubernetes/container/Linux provisioning and troubleshooting at scale
Experience operating large-scale systems (100+ servers)
Strong knowledge of networking fundamentals and TCP/IP
Solid understanding of distributed systems and high-availability architectures
Certified Kubernetes Administrator (CKA) certification, or the ability to obtain it within 3 months of joining
Ability to follow strict documentation and approval processes required for infrastructure changes
Strong sense of responsibility for system stability and for meeting deadlines
Preferred Qualifications
Experience with Mesos/Marathon orchestration systems
Proficiency in the Go programming language
Bachelor's or Master's degree in Computer Science, Engineering, or related technical field
Working Environment
Collaboration: Confluence, JIRA, Slack, MS Teams, Viber
Process: Scrumban methodology
Languages: Go, Python, Shell Script
What We Offer
Opportunity to join as a regular full-time employee
Work with cutting-edge technologies in a large-scale production environment
Collaborative team focused on infrastructure innovation and reliability
Career growth in both operational excellence and software development
Qualified candidates should demonstrate exceptional attention to detail, strong problem-solving abilities, and the capacity to work effectively under pressure in a mission-critical environment.