Chi tiết thông tin tuyển dụng "Expert, Site Reliability Engineering"
Mức lương
Thỏa thuận
Địa điểm
- Số 06 Quang Trung, Hoàn Kiếm, Hà Nội.
- Perform 24/7 monitoring and handle alerts of services of the entire IT infrastructure/application/services. In case encounter difficulties, escalate to L3 for coordinated processing.
- Ensure projects/specialized operations departments provide adequate alert/incident handling instructions for new services before golive and periodically review and update existing alert/incident handling instructions.
- Responsible for periodically reviewing issues/vulnerabilities in IT infrastructure/applications/services within scope of responsibility
- Provide in-depth transfer skills in monitoring and handling alerts and critical IT service incidents
- Participate Lead the standardizing and developing relevant processes and regulations to ensure effective monitoring and handling of alerts/incidents.
- Coordinate with relevant units to promptly restore services/systems, investigate root causes, propose solutions and implement solutions.
- Participate in implementing changes across the software development environment, including on Prem and cloud.
- Implement the development and promulgation of standards and operate centralized monitoring tools (Dynatrace, Grafana, Splunk...)
- Implement monitoring tool integration and support building monitoring dashboards for new IT infrastructure/applications/services
- Ensure projects/specialized operations departments provide adequate monitoring indicators/monitoring thresholds for new services before golive.
- Manage the lifecycle of IT incidents, including identifying, classifying, coordinating and resolving incidents according to SLAs
- Be the contact point during troubleshooting, ensuring effective communication among technical, operations and business departments
- Root cause analysis (RCA) after each incident, recommending preventive measures and process improvements. Coordinate with relevant teams to minimize downtime and improve system availability.
- Participate in developing and maintaining incident management processes according to standards and best practices
- Support integration of automated security scanning into deployment pipelines: SAST, DAST, container scanning
- Collaborate with DevOps teams on secrets management and secure configuration practices (HashiCorp Vault, AWS Secrets Manager)
- Monitor and improve key delivery metrics: MTTR, Deployment frequency, Change failure rate
- Support security tools integration: Snyk, Aqua Security, Prisma Cloud, SonarQube
- Deploy and manage infrastructure on AWS (EC2, EKS, Lambda, RDS, CloudWatch, IAM, etc)
- Operate and troubleshoot Kubernetes clusters (EKS, on-prem) including monitoring, scaling, and incident response
- Work with Infrastructure as Code tools: Terraform
- Support containerized workloads and microservices monitoring and troubleshooting
- Participate in ensuring high availability, disaster recovery, and backup strategies
- Bachelor's degree or higher in finance, economics, banking, business administration, or computer science.
Experience
- At least 8 years in IT development and operations at a large enterprise.
Language Proficiency
- English, Level 3 (TOEIC = 550) / or as per company regulations from time to time.
Other Requirements
- International certification in Systems is an advantage.
Cách thức ứng tuyển
Ứng viên nộp hồ sơ trực tuyến bằng cách bấm "Ứng tuyển" ngay dưới đây.
Thông tin công ty
Giới thiệu
Ngân hàng Thương mại Cổ phần Kỹ Thương Việt Nam (Techcombank)
Trong những năm trở lại đây, Techcombank liên tiếp được vinh danh tại các giải thưởng được trao bởi các tổ chức quốc tế uy tín như: EuroMoney, Global Finance, Wells Fargo, Bank of New York Mellon, AsiaRisk, Finance Asia, Global Banking and Finance Review, vv….Bên cạnh đó, ngân hàng còn được vinh danh tại các giải thưởng Nhân sự uy tín như: Nơi làm việc tốt nhất châu Á; Top 2 Nơi làm việc tốt nhất Việt Nam ngành Ngân hàng 5 năm liên tiếp (2016-2020); Vietnam HR Awards; Thương hiệu Nhà Tuyển dụng hấp dẫn nhất với sinh viên Việt Nam....
Với định vị thương hiệu “Vượt trội hơn mỗi ngày”, Techcombank cam kết tạo điều kiện để khách hàng, đối tác và chính cán bộ nhân viên có thể tiến tới phiên bản vượt trội của riêng mình.
Quy mô
Địa chỉ
