Job Description
Hiver is looking for an engineer to join our DevOps/Site Reliability Engineering (SRE) team, which ensures that all our user-facing services and production systems keep running smoothly. You will also work closely with the Product teams to improve the complete lifecycle of services, from inception and design through deployment, operation, and optimisation. We are an engineering-focussed team, so we keep investing in improving our tools, tests, processes, and technology. We consider our people to be our biggest asset, and we strive to build a culture where everyone is continuously learning and growing.
Responsibilities:
- Own and maintain the uptime of infrastructure hosted on AWS & GCP
- Provide on-call support for Production Infrastructure and deliver lasting solutions to recurring problems
- Work with various Engineering Teams and help them implement solutions for Performance, Scalability, and Security
- Troubleshoot and improve the deployment and release process for our infrastructure based on AWS (EC2), Kubernetes, and Jenkins
- Build and maintain Hiver's core infrastructure components that allow Hiver to scale and support thousands of concurrent users
Requirements:
- 2+ years of experience in an SRE Operations / System Administrator / DevOps role
- Strong working knowledge of Linux and operating system fundamentals
- Experience with any Configuration Management Tool (Ansible preferred)
- Good knowledge of networking (HTTP/HTTPS, SSH, TCP, UDP)
- Experience with containerisation (Docker, Kubernetes)
- Experience working with Cloud platforms (AWS)
- Scripting knowledge (Python / Bash)
- Experience with monitoring tools like DataDog / CloudWatch / Grafana is a must
- Experience working with a CI/CD setup is a must (Jenkins)
- Experience with the ELK stack will be an advantage