Job Description:
Responsibilities:
- Involve in whole process of Development, Production System Operation including system maintenance, monitoring, automation, backend operation, ensuring high availability, regular application release, troubleshooting, middleware performance tuning and collaborate with functional , technical team members to provide high quality services.
- Involve in automation of routine manual production/non-production operation using technologies like Ansible, Docker etc. Will be the key person to propose, implement automation to increase productivity with High Quality.Β
- Always improve the system performance, stability to provide 100% availability.
- Should have service ownership mind & proactively able to react to the production issues.
- Propose new technologies, tools etc to improve whole process of development, testing and production operations. Strong self-learning ability, motivation to work on new Technologies.
- Work closely with developers, product manager, project manager, team lead, security and QA team members in different location (Singapore, Japan, India etc).
Qualifications:
- Must-have
- Over 5 years of experience on DevOps, handling high traffic production system independently, troubleshooting (middleware, infra etc), automation, regular operation etc. Hands on with the large and scalable system.
- Good knowledge in CICD pipeline using tools such as Jenkins/Bamboo and VCS such as GIT/SVN
- Experience in production automation using Docker, Kubernetes, Ansible etc. Should have real experience in implementation of automation using industry standard best practice, safety.
- Strong knowledge in LINUX based system operation and extensive skills in Linux commands.
- Strong scripting knowledge like Perl, shell, Python, etc.
- Good knowledge in System, application monitoring and alert notification.
- Understanding of web application and related specification (security, HTTP protocol, RFC, etc)
- Identify process gaps and recommend on best practices based on industry standards.
- Provide technical expertise on complexΒ automationΒ and functional issues.
- Flexible for emergency support timing based on the business requirement. Must adapt to business needs in terms of working hours.
- Non-Business hour emergency support for critical issues.
- Nice-to-have
- System development experience in PHP, Java etc.
- Experience in Microsoft Azure
- Big Data technologies such as Hadoop, NoSQL, text mining
- Team management experience