We are looking for a highly skilled DevSecOps, and we are delighted that you want to be part of our team!
We're consistently choosing to help customers overcome their IT challenges providing consulting expertise to support IT strategy, outsourced operations, staff augmentation and digital transformation for companies such as ArcelorMittal, Air Liquide, Volvo Group, MLSE and many more. Take a look at our website, you can see some of the exciting work we are doing: https://www.metait.ca/
Why build your carrer at Meta?
We offer autonomy, clear goals and a dynamic and challenging environment, where professionals have the opportunity to interact with different technologies, participate in all types of projects, bring new ideas and work from anywhere in Brazil and (why not?) anywhere in the world. In addition, we are one of the best companies to work for in Brazil according to Great Place to Work and one of the 10 fastest growing technology companies in the country for 3 consecutive years, according to Anuário Informático Hoje.
Key Responsibilities:
- Ensure the high availability and performance of our services.
- Respond to and resolve incidents swiftly and conduct post-incident reviews.
- Monitor and analyze system capacity and plan for future growth.
- Automate repetitive tasks and processes to improve operational efficiency.
- Set up and maintain monitoring, logging, and alerting systems.
- Collaborate with software engineers to design reliable and scalable systems.
- Maintain comprehensive documentation for systems and processes.
- Implement security measures and conduct security reviews.
- Optimize system and application performance and resolve bottlenecks.
- Participate in retrospectives and continuous improvement initiatives.
- Define and maintain Service Level Objectives (SLOs).
Required Skills and Experience:
- Strong knowledge of Linux/Unix systems and systems administration.
- Understanding of network protocols (TCP/IP, DNS, HTTP).
- Experience with CI/CD pipelines and infrastructure as code (IaC) tools (e.g., Terraform, Ansible).
- Expertise in monitoring tools (e.g., Prometheus, Grafana) and logging systems.
- Experience with cloud platforms (e.g., AWS, GCP, Azure) and container orchestration (e.g., Docker, Kubernetes).
- Awareness of security best practices and ability to implement security measures.
- Strong analytical and troubleshooting skills.
- Excellent written and verbal communication skills.
- Ability to work effectively in a collaborative environment.
Preferred Qualifications:
- Familiarity with SRE principles and practices.
- Experience conducting blameless postmortems and driving systemic improvements.
- Knowledge of performance optimization techniques and tools.