Job description
Responsibilities for Data Engineer at GGG:
Design and manage data pipelines in accordance with the data engineering lifecycle.
Develop and maintain robust data platform architecture.
Collaborate with stakeholders across Executive, Marketing, Accounting, Finance, and other teams to address data-related technical issues and support their data infrastructure needs.
Develop analytics tools that leverage the data pipeline to provide actionable insights into customer acquisition, operational efficiency, and other key business performance metrics.
Build the infrastructure needed for optimal ETL pipelines from diverse data sources using Python, SQL, and big data technologies; Java is a plus.
Identify, design, and implement internal process enhancements, including automating manual tasks, optimizing data delivery, and redesigning infrastructure for improved scalability.
Extract large, complex data sets that fulfill both functional and non-functional business requirements.
Partner with data scientists and analysts to enhance the functionality of our data systems.
Create data tools for analytics and data science teams to support the development and optimization of our product, aiming to position it as an innovative industry leader.
Qualifications for Data Engineer at GGG:
3+ years of experience in a Data Engineer role.
A degree in Computer Science, Information Systems, Software Engineering, or a related field.
Broad knowledge of software products used in the F&B sector is a plus.
A degree in Statistics or Informatics is a plus.
Candidates should also have experience with most of the following software, tools, and platforms:
Skilled in data build and workflow orchestration tools such as Airflow and dbt.
Extensive SQL expertise, with hands-on experience in both relational and NoSQL databases, including SQL Server, MySQL, PostgreSQL, and MongoDB, among others.
Proficient in programming languages such as Python, Scala, and Java.
Familiar with reporting and BI tools such as Zeppelin, Superset, and Metabase; proficiency in Power BI is a plus.
Experienced with big data technologies, including Hadoop YARN, Hive, Spark, Presto (Trino), Kafka, and others.
A Data Engineer certification from a major cloud provider is a plus.
Skilled in designing dimensional data models and optimizing OLAP and OLTP databases.