DATA ENGINEER
- Ghana/Rwanda
- Permanent
- TBC
- Urgent
Effortless Data Mastery: Introducing our Data Engineers, your architects of efficiency. Proficient in data architecture, integration, and modelling, they create streamlined data ecosystems. With expertise in big data technologies, cloud platforms, and data security, they ensure data quality and accessibility. Collaborating with stakeholders, they transform raw data into actionable insights. Remoteli's experts ensure your data landscape thrives with precision and innovation.
KEY RESPONSIBILITIES
Data Architecture: Design, build, and maintain data architecture, including databases, data warehouses, and data lakes, to ensure efficient storage and retrieval of data.
Data Integration: Develop data pipelines and ETL (Extract, Transform, Load) processes to collect, transform, and load data from various sources into structured formats; a minimal ETL sketch follows this list.
Data Modelling: Create and manage data models, schemas, and data dictionaries for optimal data organisation and efficient querying.
Database Management: Manage databases using relational database management systems (e.g., SQL Server, PostgreSQL) or NoSQL databases (e.g., MongoDB, Cassandra).
Data Quality and Governance: Ensure data quality by implementing validation, cleaning, and enrichment processes, and adhere to data governance policies; see the validation sketch after this list.
Big Data Technologies: Proficiency in big data technologies like Hadoop, Spark, and Hive for processing and analysing large datasets.
Cloud Platforms: Familiarity with cloud platforms like AWS, Azure, or Google Cloud for data storage, processing, and analytics.
Streaming Data: Experience with real-time data streaming technologies like Apache Kafka or Amazon Kinesis for processing streaming data; see the Kafka sketch after this list.
Data Warehousing: Knowledge of data warehousing concepts and tools like Amazon Redshift, Google BigQuery, or Snowflake for data storage and analytics.
Data Visualisation: Collaborate with data analysts and business stakeholders to design and develop data visualisations using tools like Tableau, Power BI, or Looker.
Version Control and Collaboration Tools: Proficiency in using version control systems like Git and collaboration tools like Jira for tracking and managing data engineering tasks.
Scripting and Programming: Skill in scripting languages like Python, along with programming languages like Java or Scala, for data manipulation and processing.
SQL and NoSQL: Expertise in SQL for querying and manipulating relational databases and familiarity with NoSQL databases for unstructured data storage.
API Integration: Ability to integrate with external APIs to collect data from various sources and systems; the ETL sketch below shows a simple API pull.
Data Security: Implement data security measures to protect sensitive information, including encryption, access controls, and data masking.
Data Migration: Experience with data migration projects, transferring data between systems while ensuring accuracy and integrity.
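For illustration, here is a minimal sketch of the kind of ETL pipeline described above, written in Python with requests, pandas, and SQLAlchemy. The API endpoint, connection string, table name, and column names are all hypothetical placeholders, not part of the role description; it also doubles as a simple example of pulling data from an external API.

```python
# Minimal ETL sketch: pull JSON from an API, clean it, load it into PostgreSQL.
# All names (API_URL, DB_URI, stg_orders, order_id, created_at) are placeholders.
import requests
import pandas as pd
from sqlalchemy import create_engine

API_URL = "https://api.example.com/v1/orders"  # hypothetical source endpoint
DB_URI = "postgresql://user:password@localhost:5432/warehouse"  # placeholder

def extract() -> pd.DataFrame:
    """Collect raw records from the source API (assumes a JSON array response)."""
    response = requests.get(API_URL, timeout=30)
    response.raise_for_status()
    return pd.DataFrame(response.json())

def transform(df: pd.DataFrame) -> pd.DataFrame:
    """Apply light cleaning: normalise column names, parse dates, drop duplicates."""
    df.columns = [c.strip().lower() for c in df.columns]
    df["created_at"] = pd.to_datetime(df["created_at"])
    return df.drop_duplicates(subset="order_id")

def load(df: pd.DataFrame) -> None:
    """Append the cleaned batch to a staging table in the warehouse."""
    engine = create_engine(DB_URI)
    df.to_sql("stg_orders", engine, if_exists="append", index=False)

if __name__ == "__main__":
    load(transform(extract()))
```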
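In the same vein, a small data-quality sketch: batch validation rules applied before load. The rules and column names are illustrative assumptions, not requirements from the posting.

```python
# Data-quality sketch: reject rows that break basic rules and report the drop count.
# Column names (order_id, amount) are illustrative only.
import pandas as pd

def validate(df: pd.DataFrame) -> pd.DataFrame:
    before = len(df)
    df = df.dropna(subset=["order_id", "amount"])  # completeness check
    df = df[df["amount"] >= 0]                     # plausibility check
    df = df.drop_duplicates(subset="order_id")     # uniqueness check
    print(f"Validation dropped {before - len(df)} of {before} rows")
    return df
```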
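Finally, a minimal streaming sketch using the kafka-python client, assuming a local broker and a hypothetical "events" topic; a production consumer would add error handling, offset management, and a real downstream sink.

```python
# Streaming sketch: consume JSON events from Kafka and hand them to a processor.
# Broker address and topic name are placeholders.
import json
from kafka import KafkaConsumer

consumer = KafkaConsumer(
    "events",                           # hypothetical topic
    bootstrap_servers="localhost:9092",
    value_deserializer=lambda v: json.loads(v.decode("utf-8")),
    auto_offset_reset="earliest",
)

for message in consumer:
    event = message.value
    # A real pipeline would transform and load this record downstream.
    print(event.get("event_type"), event.get("timestamp"))
```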
SKILLS AND TOOLS
Big Data Technologies: Mastery of technologies like Hadoop, Spark, and Flink for processing and analysing large-scale datasets efficiently; see the Spark sketch after this list.
Database Management Systems (DBMS): Proficiency in both relational DBMS (e.g., SQL Server, PostgreSQL) and NoSQL databases (e.g., MongoDB, Cassandra) for data storage.
Data Warehousing Platforms: Familiarity with data warehousing platforms like Amazon Redshift, Google BigQuery, Snowflake, or Microsoft Azure Synapse Analytics.
Cloud Platforms: Skill in using cloud platforms like AWS, Azure, or Google Cloud for scalable data storage, processing, and analytics.
ETL Tools: Proficiency in ETL tools like Apache NiFi, Talend, Informatica, or Apache Airflow for designing and automating data pipelines; see the Airflow sketch after this list.
Streaming Platforms: Understanding of streaming platforms like Apache Kafka, Amazon Kinesis, or Google Cloud Pub/Sub for real-time data processing.
Data Integration Tools: Familiarity with tools like Apache Camel or MuleSoft for integrating data from various sources and systems.
Version Control Systems: Mastery of using Git and platforms like GitHub or GitLab for version control and collaborative development.
Scripting and Programming Languages: Proficiency in scripting languages like Python and programming languages like Java or Scala for data manipulation.
SQL and Query Languages: Expertise in SQL for querying and manipulating relational databases, and familiarity with query languages like GraphQL.
Data Modelling Tools: Skill in using data modelling tools like ER/Studio, ERwin, or dbForge Studio for designing database schemas.
Data Visualisation Tools: Knowledge of data visualisation tools like Tableau, Power BI, or Looker for creating meaningful insights from data.
Containerization and Orchestration: Familiarity with containerization platforms like Docker and orchestration tools like Kubernetes for managing data applications.
Workflow Automation Tools: Understanding of tools like Apache NiFi or Apache Airflow for orchestrating and automating data workflows.
Data Security Tools: Knowledge of data security tools and practices for implementing encryption, access controls, and data masking; a small masking sketch follows this list.
API Integration: Ability to work with RESTful APIs and other integration methods for collecting and sending data to external systems.
Document Management Systems: Familiarity with document management systems like SharePoint or Confluence for storing and sharing data-related documentation.
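To make the skills above concrete, a few short Python sketches follow; every name, path, and schedule in them is a hypothetical placeholder. First, a minimal PySpark aggregation of the kind used for large-scale batch processing:

```python
# PySpark sketch: daily revenue rollup over a parquet dataset.
# Input/output paths and column names are placeholders.
from pyspark.sql import SparkSession
from pyspark.sql import functions as F

spark = SparkSession.builder.appName("daily_revenue").getOrCreate()

orders = spark.read.parquet("/data/lake/orders/")  # placeholder path

daily_revenue = (
    orders
    .withColumn("order_date", F.to_date("created_at"))
    .groupBy("order_date")
    .agg(F.sum("amount").alias("revenue"))
)

daily_revenue.write.mode("overwrite").parquet("/data/marts/daily_revenue/")
```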
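Next, a minimal Airflow DAG sketch (assuming Airflow 2.4+); the task bodies are stubs standing in for real extract and load logic:

```python
# Airflow sketch: a two-task daily pipeline where load waits on extract.
# DAG id, schedule, and task bodies are illustrative stubs.
from datetime import datetime
from airflow import DAG
from airflow.operators.python import PythonOperator

def extract():
    print("pulling from source")

def load():
    print("writing to warehouse")

with DAG(
    dag_id="daily_orders_pipeline",  # hypothetical name
    start_date=datetime(2024, 1, 1),
    schedule="@daily",
    catchup=False,
) as dag:
    extract_task = PythonOperator(task_id="extract", python_callable=extract)
    load_task = PythonOperator(task_id="load", python_callable=load)
    extract_task >> load_task  # load runs only after extract succeeds
```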
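Last, one common data-masking approach, sketched with Python's standard hashlib: salted hashing keeps identifiers joinable across tables without exposing the raw values. The salt handling here is deliberately simplified.

```python
# Masking sketch: replace a direct identifier with a deterministic,
# non-reversible token.
import hashlib

SALT = b"rotate-me"  # placeholder; store and rotate via a secrets manager in practice

def mask(value: str) -> str:
    return hashlib.sha256(SALT + value.encode("utf-8")).hexdigest()

print(mask("ama.mensah@example.com"))  # same input always yields the same token
```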