Site Reliability Engineer

Standort: Sweden Gehalt: Up to €1 per hour
Bereich: IT Service Provider Bereich: Freiberufler
Reference #: CR/065560_1573469182

My Client is urgently looking for an SRE Consultant to evolve how they work with container deployment and orchestration scale. They currently have a strong focus on monitoring side and are eager to move proactive approach with predictive analytics and behavior analysis.

Monitoring, controlling and scaling micro service's based connectivity platform deployed in IBM cloud. Platform is connected with multiple systems, which are mostly hosted by IT or other suppliers.

Their Infrastructure is based on:
* Distributed microservices architecture
* Java and Node.js backend applications
* Orchestration: Kubernetes, Terraform
* Messaging: Kafka, MQTT
* Database: DB2, MongoDB, Redis
* Evolve how we work with container deployment and orchestration at scale
* Maintain the Kubernetes clusters in different regions
* Build automated infrastructure to deliver metrics from production environments
* Monitoring, alerting, and incident resolution, provide root cause analysis for incidents
* Identify performance bottlenecks
* Infrastructure as code
* Automation as much as possible
* Continuous improvement of the infrastructure
A successful candidate should have:
* Mcs/Bcs or higher degree in relevant area
* 3+ years of experience in hosting and operating microservices based systems
* 3+ years of experience in running Cloud based production environments
* Experience with Docker, Kubernetes, Linux
* Monitoring systems like Prometheus, Grafana, Opsgenie, etc
* Log management systems like Splunk, Stackdriver, etc
* Experience with CI/CD pipelines
* Good programming skills
* Proficient in English

Good to have
* IoT platform experience
* Building scalable platform experience
* Experience with disaster recovery
* Experience with Chaos Engineering
* Strong troubleshooting skills