Experience : 10+ Years
No Of Position : 2
Notice Period : 2 Weeks
The Operations, Middleware and Systems Support Specialist will be the go to person for troubleshooting time sensitive and critical issues. In addition to hands-on troubleshooting, the specialist will work directly with L1, L2 and middleware support to deep dive into issues with a full stack perspective of the impacted application/system. The specialist should be able to quickly understand the application eco system and investigate from multiple approaches.
Lead and Drive implementations that span across ITOPS, DevOPS, SecOPS and CloudOPS
Design and Build application infrastructure leveraging cloud technologies, focusing on automation, high availability and scalability
Provide support for application level performance troubleshooting, investigating memory, cpu, network and storage issues etc
Analyze, Troubleshoot and Drive to resolution issues and incidents working with SME�s in each domain
Quickly onboard and support multiple applications with complex integrations and architecture.
Investigate issues with a full stack view taking into account all tiers and interconnected systems.
Identify and troubleshoot system performance, uptime and availability issues including network and connectivity.
Capture diagnostic data, open cases with product/platform vendors and work with vendor support to resolve issues.
Continuously review operational readiness of applications through monitoring tools, scripting and automation.
Identify and resolve security risks/vulnerabilities from a System, Network and Middleware perspective while establishing measures that proactively find vulnerabilities.
Continuously review maintenance procedures, run books for accuracy, efficiency and optimizations.
Review, refine and tune monitoring and alerting thresholds.
Directly work with incident managers, L1 and L2 support in triaging, resolving issues thru RCA.
Work closely with DevOps and Applications engineering teams in rolling out new applications/systems and establish operational procedures and guidelines.
Bachelors degree in Computer Science or similar field
Masters degree in Computer Science or similar is a plus
8+ years of systems engineering experience
6+ years of Operations and Systems Support experience
5+ years of systems administration experience
5+ years of DevOps/CloudOps experience
Must have hands-on experience with systems administration (Linux and its flavors)
Must be experienced with AWS cloud automation tools and technologies like chef, CFT's, Terraform, BeanStalk etc.
Must have hands-on experience with container technologies like Docker etc.
Experience with container application platform like OpenShift is a plus
Must have hands-on experience with provisioning infrastructure in AWS VPC's( ec2, RDS, ELB's, Cloud Front, EFS, GlusterFS, S3, CloudWatch, API Gateway, Route 53 e.t.c)
Must have excellent understanding of VPC setup, subnet, routing, Direct connect and VPC Peering
Must have hands-on experience with setting up security groups, IAM roles and policies, WAF etc.
Must have hands-on expertise with cloud watch integration with paging systems (PagerDuty) and management tools (Cloud Health)
Must have hands-on experience with implementing scalable infrastructure (Auto scaling groups etc.)
Must have hands-on experience with cloud to application integrations (SNS etc.)
Must have hands-on experience with setup, configuration and tuning of monitoring and alerting tools like AppDynamics, New Relic, Apica, sensu, etc.
Must have hands-on experience with setup, configuration and tuning of log collection/aggregation tools like SUMOLogic, Splunk etc.
Must have hands-on experience with Apache Web Server, WebSphere or Web Logic or JBoss or Tomcat or similar
Must have hands-on experience with service discovery applications like consul or similar
Must have hands-on experience with Akamai or Cloudfront or Cloudflare CDN's