Deployment of new releases, environments for applications.
Responding to emails and incident tickets, maintaining issue ownership.
Build and maintain highly scalable, large scale deployments globally
Practice sustainable incident response/management and blameless post-mortems.
Monitor and maintain production environment stability.
Perform production support activities that involve the assignment of issues and issue analysis and resolution within the specified SLAs.
Coordinate with the Application Development Team to resolve issues on production.
Suggest fixes to complex issues by doing a thorough analysis of the root cause and impact of the defect.
Provide daily support with a resolution of escalated tickets and act as a liaison to business and technical leads to ensure issues are resolved in a timely manner.
Technical hands-on troubleshooting, including parsing logs and following stack traces.
Experience and Skills:
Basic Linux OS knowledge
Hands-on experience on Shell/UNIX commands.
Experience in Monitoring tools like Grafana, Logging tool like ELK.
Rest API working experience to execute curl, Analysing request and response, HTTP codes etc.
Database queries execution experience in any of DB (MySQL,Postgres /Mongo)
Knowledge on Incidents and escalation practices.
Ability to troubleshoot issues and able to handle different types of customer inquiries.
Should have worked in incident management tools like service now