ITTConnect is seeking an SRE / DevOps engineer, with experience in Java applications to work remotely for a client in the US. This is a position with a global leader in consulting, digital transformation, technology and engineering services present in nearly 50 countries. The end client is in the Telecom area (internet, mobile, cable TV provider). Initial assignment until December.
Job location: Remote, work from home anywhere in Brazil, following US Eastern time.
All interviews will be in English only (must be professionally fluent).
Responsibilities:
* Production support: Triage of production systems and understanding of analysis and how to drive toward resolution while participating in large group discussions.
* Perform impact analysis on infrastructure and applications.
* Identify, prepare, execute mitigation plans.
* Perform production deployments either manually or through automation.
* Perform required deployment verifications after application or services post deployments.
* Take deep dives in Java code to identify possible fixes for production issues.
* Troubleshooting production issues and driving the bridges
* Work as a contributing team member together with other team members in other states and countries.
Requirements:
* At least 5 years’ experience in supporting JAVA Application / services hosted in Linux environments.
Kubernetes:
* Ability to perform cluster level administration on K8s platform.
* Creating and maintaining scripts to maintain, monitor and alerts on K8s platform.
* Comfortable with kubectl / YAML
Docker:
* Understanding of docker.
* Good experience in writing docker files.
* Creating images, maintaining docker registry.
Ansible:
* Playbook creation for repeatable tasks
* Perform installation of software (platform and code deployment)
* Take documentation and create roles to install software.
* Create reusable roles and playbooks
* CloudFormation and Terraform
Monitoring Tools:
* New Relic - APM, Insights, Infrastructure
* Ability to create alerts and dashboards.
* Splunk - Querying and dashboard creation
Application Performance Tuning:
* Participate in load testing and resolve testing bottlenecks.
* Java heap and thread dump analysis
Jenkins:
* Knowledge and understanding of Jenkins Pipelines
* Ability to analyze console job log for errors.
OS Support and Troubleshooting:
* Redhat Enterprise Linux
* Amazon Linux
* Alpine Linux
* Windows 2012/2016
Nice to have tools & technologies:
* Kubernetes Platform: Rancher, GCP
* Hashicorp Tools: Vault, Consul
* Middleware: JBoss EAP, JBoss Fuse
* Elastic Search – implementation and troubleshooting
* AWS Services: EC2, Lambda, RDS, VPC, IAM Roles, DynamoDB