at Marktine Technology Solutions Pvt. Ltd. - posted by Sonal Jain
5-8 Years
Salary Not Disclosed
Full Time
Remote
Site Reliability Engineer - Data Center Storage
As a Site Reliability Engineer - Data Center Storage, you will be responsible for:
Must haves:
● Hands-on working knowledge of the command line in Linux systems
● Understanding of networking, data center infrastructure, and server provisioning and booting
● Managing your Jira ticket queue and troubleshooting based on logs and alerts
● Identifying code-related issues in the validation process and creating tickets for the appropriate team to implement a fix
● Communicating extensively with data center operations teams, working hand-in-hand to resolve both hardware and software issues
● Proactively monitoring data storage utilization, I/O capacity, and alerts
● Understanding storage use cases for common virtualization platforms such as VMware or OpenStack
● Being on call during business hours for storage-related alerts and escalations
● Maintaining and contributing to technical documentation, troubleshooting manuals, and runbooks
● Continuously reviewing, learning, and understanding internal services and tools relevant to our workflows
● Being familiar with SLI/SLO/SLA concepts, error budgets, and other SRE terminology, and knowing how to design them
● Experience with continuous deployment using tools like Jenkins, GitHub Actions, Puppet, Ansible, etc.
● Ability to write automation scripts using Shell or Python
Good to have:
● Basic knowledge of Pure Storage products such as FlashArray and FlashBlade
● Experience with hardware from vendors such as Cisco, Brocade, and Supermicro
● Scripting experience in Python or Ansible
● Familiarity with automated booting in a Linux environment
The IDEAL Data Center Site Reliability Engineer will also have:
● Education in Computer Science, Information Systems, or Computer Hardware Engineering
● Excellent interpersonal and teamwork skills
● Strong written and verbal communication skills
● Detail-oriented and a well-organized self-starter
● Open to constructive feedback
● Strong problem-solving skills, particularly related to server hardware
● Ability to take ownership of hardware and software issues and see them through to resolution
● Proven experience as an SRE, DevOps Engineer, or in a similar role