Senior Engineer - Cloud Ops & Server
Bangalore, IN, 560022
Role Summary
Senior Engineer – CloudOps & Server will be responsible for the end-to-end operations of the enterprise CloudOps and Server. The role is responsible for ensuring high availability, resilience, security, and performance across on‑premises and multicloud environments (Azure, AWS, GCP as applicable). This involves driving operational excellence, leading cloud migration initiatives, establishing Infrastructure-as-Code practices, strengthening DevSecOps maturity, and ensuring governance, compliance, and cost optimization across the landscape. The Engineer will collaborate closely with Process, Security, and Architecture teams, and drive continuous improvement to enhance operational efficiency, automation, and service reliability.
Main Tasks
Leadership
- Lead and mentor Server administrators, and platform engineers
- Build competency development plans aligned to modern cloud and DevSecOps skills.
- Foster a culture of collaboration, innovation, accountability, and continuous learning.
- Drive performance evaluations, workload planning, and resource allocation.
CloudOps & Server Infrastructure Management
- Handle L3 operations of on-premises and cloud infrastructure across Windows, Linux, virtualization platforms (VMware/Hyper‑V), and containerized workloads.
- Manage identity, core infrastructure services (AD, DNS, DHCP), and hybrid integrations.
- Ensure patching, compliance, hardening, vulnerability remediation, and baseline security standards across all assets.
- Ensure end-to-end observability using cloud-native and enterprise monitoring tools.
Multicloud Engineering & Migration Support
- Lead and support cloud migration activities to Azure, AWS, or GCP.
- Implement and govern Azure Landing Zones, subscription governance, RBAC, cost controls, and resource consistency.
- Ensure adoption of Infrastructure as Code using Terraform.
- Drive reusable IaC module creation, policy-as-code, and automated deployments.
- Oversee network, identity, security, and platform readiness for cloud-hosted workloads.
DevSecOps & Automation
- Work with DevSecOps teams to improve CI/CD pipelines, security scanning, governance, and automation.
- Ensure integration of security controls, vulnerability scanning, and secrets management.
- Promote shift-left practices and automation first mindset.
- Develop operational runbooks, automation scripts, and GitOps workflows.
Operations Excellence & Incident/Problem/Change Management
- Serve as the point of contact for major incidents related to cloud and server operations.
- Closely coordinate with Process teams for: Change Management (impact assessment, scheduling, risk mitigation) & Problem Management (root cause analysis and permanent fixes)
- Drive operational efficiency through automation, elimination of manual tasks, and standardization.
- Ensure SLA, KPI, and audit compliance across all domains.
Security, Governance & Compliance
- Act as SPOC for vulnerability remediation in Cloud & Server domains.
- Drive compliance with InfoSec standards, Zero Trust principles, and regulatory frameworks.
- Collaborate with security and governance teams on assessments and audits.
Stakeholder Management
- Collaborate with internal departments, global IT teams, and external partners
Value Initiatives
- Identify and propose cost-saving and efficiency‑driven initiatives (optimization of idle resources, automation-driven time savings).
Skills
Experience
- 10–12 years of IT Infrastructure experience with at least 3 years in L3 roles.
- Proven experience managing Azure, AWS, or GCP environments at enterprise scale.
- Hands-on experience with Terraform IaC
- Experience supporting cloud migrations and modernization programs.
- Familiarity with DevSecOps toolchains.
- Strong background in server operations (Windows/Linux), virtualization, hybrid infrastructure.
- Strong communication, stakeholder management, and leadership
- Analytical thinking and decision-making
- Ability to manage cross-functional teams and global collaborations
- Strategic thinking with a focus on efficiency and scalability