We are USA HQ well-funded Startup. In a process of setting up SRE Practice in Bangalore, India.
Position : SRE Architect
Domain : Data Security & Cyber Security
Experience : 8+ Yrs
Work Location : Bangalore, India
Compensation : 40 to 60 LPA
What we are looking:
- Excellent dealing with high-availability, fault-tolerant, scalable, resilient and distributed systems.
- Expert working at AWS cloud computing infrastructure and its components.
- Hands-on with containerization, container & cluster management - Kubernetes, Docker, EKS etc.
- Hands-on experience with configuration management tools (Ansible, Terraform etc)
- Proven and Hands-on experience in handling large scale infrastructure like Package management, EC2, SQS, S3, MongoDB and Distributed systems like Kafka, Yarn, Elastic Search etc..
- Familiarity with container orchestration tools (K8's, ECS, swarm) build, artifacts, packaging, service discovery management tools.
- Good at any of the following languages - Python, Java, Go.
- Source code management and Implementation of security best practices.
- Good at analysing App bottlenecks, performance degradation and implementing automated process/tools to detect such anomalies.
- Design, architect and implement best in class CI/CD pipelines
- Accountable for infrastructure design, Automation, stability, resilience, performance, monitoring, security, and implementation of right practices.
- Build and manage infrastructure as a code and experience with Terraform
- Collaborate with engineering teams to improve the development/production environment.
- Containerizing and orchestrating with K8S and driving the micro-services adoption across multiple engineering functions.
- Owning/Building functional KPIs for services, incident, and infrastructure metrics.
- Identify and track metrics such as MTTR (mean time to recovery, repair, respond or resolve) in order to exceed SLA expectations
- Build services and Maintain once they are online by measuring and monitoring availability, latency and overall system reliability.
- Building solutions and Monitoring at scale with Prometheus and TICK stack.
- Participate in Defining cloud data strategy, including designing multi-phased implementation roadmaps