Experience: 6+ years
Location: [Add Location / Remote/Hybrid if applicable]
Department: Engineering / Infrastructure
We are committed to building software that solves real-world problems. Our Site Reliability Engineers (SREs) play a crucial role in ensuring that our systems remain reliable, scalable, and efficient. We are seeking an experienced SRE to join our team and drive continuous improvement in our infrastructure and operations.
System Monitoring & Maintenance – Ensure high availability, reliability, and performance of production environments by monitoring system health and responding to incidents.
Automation – Develop, implement, and improve automation tools to reduce manual tasks and enhance efficiency.
Collaboration – Partner with development teams to design, build, and operate scalable and resilient systems.
Performance Tuning – Analyze system metrics, identify bottlenecks, and optimize overall performance.
Incident Management – Lead incident response, perform root cause analysis, and implement preventive measures.
Documentation – Maintain clear documentation of system architecture, processes, and best practices.
Capacity Planning – Forecast growth in demand and ensure infrastructure can scale to meet it.
Experience: 6+ years in Site Reliability Engineering, Operations, or Software Engineering.
Education: Bachelor’s degree in Computer Science, Engineering, or a related field.
Technical Skills: