Job Description:
Implement integrations requested by customers
Design procedures for system troubleshooting and maintenance
Investigate and resolve technical issues
Use GitLab to auto-deploy our code branches to a Kubernetes cluster
Perform root cause analysis for production errors
Develop software to integrate with internal back-end systems
Deploy updates and fixes
Build tools to reduce occurrences of errors and improve customer experience
Add security monitoring, such as an intrusion detection system (IDS), to the cluster
Develop scripts to automate visualization
Compensation & Benefits:
Gifts for Men’s Day, Women’s Day, Children’s Day, and the Mid-Autumn Festival, plus other benefits under company policy;
Flat, open and sharing culture with friendly management team; outsourcing company with product mindset;
01 day of remote work per month; a flexitime allowance of 90-180 minutes per month for employees;
Work performance review 2 times/year (in April and October);
Social insurance, health insurance, unemployment insurance and Bao Viet care insurance;
Training courses and opportunities to work with technical gurus who have built and operated world-class applications with millions of users. This is a good chance for recent graduates to learn cutting-edge technologies and how to build scalable systems from scratch;
Saturdays and Sundays off; overtime pay is 150%, 200%, or 300% as per labor law;
Minimum 14 days of paid leave per annum for all employees after probation;
Yearly company trip and year-end party, quarterly team building, and weekly team meals; English-Japanese Club, Sports Clubs;
Nice, modern working space with young, dynamic, and friendly colleagues, plus free coffee, tea, and drinks;
01 hour of paid leave per day for women with children under 12 months old;
Performance bonus, 13th-month salary, public holiday bonuses (2/9, 30/4, 1/5, 1/1); bonuses for Excellent Employee and Excellent Team;
Requirements:
Must have
Experience with logging, monitoring, and tracing tools such as ELK, Datadog, OpenTelemetry, Jaeger …
Experience setting up Kubernetes clusters on-premises.
2+ years of experience with networking and Linux administration.
Strong problem-solving skills and ability to learn new technologies (passionate about technology).
Experience configuring open-source software on Linux (proficiency in Linux).
Hands-on experience with monitoring and tracing systems (familiarity with Prometheus-Grafana, OpenTelemetry-Jaeger).
Experience building CI/CD and GitOps pipelines for applications on Kubernetes (proficiency in Helm).
Nice to have
Strong Ansible scripting skills.
Proficiency in English is preferred.