Data Engineer

数据工程师

Job Summary

职位概述

As a data engineer, you will be responsible for designing and setting up data infrastructure, developing and maintaining efficient data pipelines, and integrating data from various sources. Your role will also involve ensuring data quality and governance, optimizing performance, and implementing security measures. Additionally, you will collaborate with different teams and stay updated on emerging technologies.

作为一名数据工程师,你将负责设计和建立数据基础设施,开发和维护高效的数据管道,以及集成来自不同来源的数据。你的角色还包括确保数据质量和治理、优化性能以及实现安全措施。此外,您将与不同的团队合作,并随时了解新兴技术。


Job Responsibilities

工作职责

1. Perform data cleaning, data processing & ETL (Extract, Transform, Load) from various sources into data storage systems.

执行从各种来源到数据存储系统的数据清理、数据处理和ETL(提取、转换、加载)。

2. Ensure data quality through careful planning, development, and implementation.

通过仔细规划、开发和实施确保数据质量。

3. Maintain documentation of data pipelines, processes, and infrastructure.

维护数据管道、流程和基础架构的文档。

4. Regularly monitor and troubleshoot data pipelines or data-related issues and resolve them in a timely manner.

定期对数据管道或数据相关问题进行监控和故障排除,并及时解决。

5. Collaborate actively with other departments to identify areas of improvement and to understand their needs and requirements.

积极与其他部门合作,确定需要改进的领域,并了解他们的需求和要求。

6. Provide comprehensive support and assistance to colleagues in automating data-related processes or system operations with the aim of enhancing productivity and data accuracy.

在自动化数据相关流程或系统操作方面为同事提供全面的支持和帮助,以提高生产力和数据准确性。

7. Continuously aiding colleagues in navigating the transition to automated workflows, including providing training on how to effectively utilize and leverage the automated processes.

帮助同事过渡到自动化的工作流程,并提供如何有效利用和利用自动化流程的培训。

8. Engage in internal data evaluation and analysis, and generate reports using Power BI.

参与内部数据评估和分析,并使用Power BI生成报告。


Skills & Qualifications

技能与任职资格

1. Bachelor's degree preferably in Computer Science/Engineering/Data Science/Mathematics or any other related field;

本科以上学历,计算机科学/工程/数据科学/数学或其他相关专业优先;

2. Experience in working with diverse databases, including SQL and MongoDB, with a good understanding of both Relational and Non-Relational Database concepts, querying, and database structure;

有使用各种数据库的经验,包括SQL和MongoDB,对关系数据库和非关系数据库的概念、查询和数据库结构有充分的了解;

3. Possess a proficient command of Python and demonstrate adaptability in working with other scripting languages based on a solid understanding;

熟练掌握Python,并在扎实理解的基础上表现出与其他脚本语言合作的适应性;

4. Proficiency in Microsoft Excel, PowerPoint, Word & other relevant tools for basic data analysis, reporting, & documentation;

熟练使用Microsoft Excel、PowerPoint、Word等相关工具进行基础数据分析、报告和文档编制;

5. Proficiency in Microsoft Power Tools: Power BI for data visualization & reporting purposes, Power Automate for creating automated workflows;

熟练使用Microsoft Power Tools:用于数据可视化和报告的Power BI,用于创建自动化工作流的Power automation;

6. Basic understanding of HTML & CSS;

对HTML和CSS有基本的了解;

7. Knowledge and experience on AWS database management is an added value;

具备AWS数据库管理方面的知识和经验优先考虑;

8. Knowledge of data pre-processing, feature engineering & model training techniques with exposure to at least one machine learning framework, such as TensorFlow, Scikit-Learn, or PyTorch is an added value;

了解数据预处理,特征工程和模型训练技术,至少接触过一个机器学习框架,如TensorFlow, Scikit-Learn或PyTorch优先考虑;

9. Adequate knowledge & interest/passion in automotive is an added advantage;

对汽车行业充分了解、感兴趣、拥有热情优先考虑;

10. Proactive, hardworking and persistent;

有积极主动、勤奋、坚持不懈的精神;

11. Self-discipline, self-starter, and honest;

自律、主动、诚实

12. Good team player, good coordination and learning ability, and able to work under pressure;

具有良好的团队合作精神,良好的协调和学习能力,能承受工作压力;

13. Possesses a logical mind and work carefully;

具有清晰的逻辑思维,和工作认真的态度;

14. Excellent communication and written skills in English;

优秀的英文沟通和书写能力

15. Two years experience.

拥有两年工作经验。