Soum Logo

Soum

Data Engineer

Reposted An Hour Ago
In-Office or Remote
2 Locations
Junior
In-Office or Remote
2 Locations
Junior
As a Data Engineer, you'll build and maintain data pipelines, ensure data quality, and collaborate with teams to optimize data processes for analytics and ML applications.
The summary above was generated by AI
Role: Data Engineer
Location: Egypt, Uzbekistan, and Pakistan (Remote)
Work Week: Sunday – Thursday
Work Timings: 9:00 AM – 6:00 PM (Saudi Arabian Time Zone)

Overview:
We’re seeking a Data Engineer to design, build, and maintain the data infrastructure that underpins our analytics, ML models, and decision-making processes. You’ll be responsible for building scalable data pipelines, integrating diverse data sources, and ensuring data quality, reliability, and accessibility across the organization. Working closely with data scientists, analysts, and product teams, you’ll enable data-driven insights while optimizing for performance and scalability. This is a great opportunity to have a direct impact on how data is leveraged across a fast-growing company.

Role & Responsibilities:

  • Data Pipeline Development & Optimization:
  • Design, build, and maintain scalable and reliable data pipelines to support analytics, ML models, and business reporting.
  • Collaborate with data scientists and analysts to ensure data is available, clean, and optimized for downstream use.
  • Implement data quality checks, monitoring, and validation processes.
  • Data Architecture & Integration:
  • Work with cross-functional teams to design efficient ETL/ELT workflows using modern data tools.
  • Integrate data from multiple sources (databases, APIs, third-party tools) into centralized storage solutions (data lakes/warehouses).
  • Support cloud-based infrastructure for data storage and retrieval.
  • Performance & Scalability:
  • Monitor, troubleshoot, and optimize existing data pipelines to handle large-scale, real-time data flows.
  • Implement best practices for query optimization and cost-efficient data storage.
  • Ensure data is available and accessible for business-critical operations.
  • Collaboration & Documentation:
  • Partner with product, engineering, and business stakeholders to understand data requirements.
  • Document data workflows, schemas, and best practices.
  • Support a culture of data reliability, governance, and security.

Requirements:

  • Proficiency in Python and SQL for data engineering tasks.
  • Strong understanding of ETL/ELT processes, data warehousing, and data modeling.
  • Hands-on experience with cloud platforms (AWS, GCP, or Azure) and data storage solutions (BigQuery, Redshift, Snowflake, etc.).
  • Familiarity with data orchestration tools Airflow, Airbyte is a must.
  • Experience with containerization & deployment tools (Docker, Kubernetes) is a plus.
  • Knowledge of data governance, security, and best practices for handling sensitive data.
  • Familiarity to work with Git and GitHub.
  • Dataform is a must
  • Strong skills in eliciting requirements from cross-functional stakeholders and translating them into actionable data engineering tasks.

Experience:

  • 2+ years in data engineering, building and maintaining data pipelines.
  • 2+ years in SQL and Python development for production environments.
  • Experience working in fast-growing startup environments is a plus.
  • Exposure to real-time data processing frameworks (Kafka, Spark, Flink) is a plus.

Top Skills

Airbyte
Airflow
AWS
Azure
BigQuery
Dataform
Docker
GCP
Git
Git
Kubernetes
Python
Redshift
Snowflake
SQL

Similar Jobs

16 Days Ago
Remote
Cairo, EGY
Senior level
Senior level
Artificial Intelligence • Information Technology • Software • Analytics
The Senior Data Engineer is responsible for creating scalable data pipelines, maintaining ETL processes, and developing a semantic layer for analytics, collaborating with various teams to ensure data quality and accessibility.
Top Skills: Data VaultETLRelational Data WarehousesSemantic LayerSQLSsis
19 Days Ago
Remote or Hybrid
8 Locations
20K-200K Annually
Senior level
20K-200K Annually
Senior level
Information Technology • Mobile • Consulting
The role involves building a centralized data lake on GCP, developing SPARK-powered data pipelines, ensuring data quality, and collaborating with cross-functional teams for advanced analytics and data models.
Top Skills: AirflowDbtDockerGCPKubernetesLookerLuigiNoSQLPrefectPysparkPythonScalaSQLTerraform
2 Days Ago
Remote
2 Locations
Senior level
Senior level
Information Technology • Software • Cybersecurity
As a Senior Data Engineer, you will architect security data ecosystems by designing data lakehouse architectures, implementing real-time streaming pipelines, and enabling AI/ML features. You will manage data ingestion patterns and ensure system integrity through automation and observability.
Top Skills: Apache BeamApache FlinkDbtGoGoogle Cloud PlatformKubernetesPythonScalaSQLTerraform

What you need to know about the NYC Tech Scene

As the undisputed financial capital of the world, New York City is an epicenter of startup funding activity. The city has a thriving fintech scene and is a major player in verticals ranging from AI to biotech, cybersecurity and digital media. It also has universities like NYU, Columbia and Cornell Tech attracting students and researchers from across the globe, providing the ecosystem with a constant influx of world-class talent. And its East Coast location and three international airports make it a perfect spot for European companies establishing a foothold in the United States.

Key Facts About NYC Tech

  • Number of Tech Workers: 549,200; 6% of overall workforce (2024 CompTIA survey)
  • Major Tech Employers: Capgemini, Bloomberg, IBM, Spotify
  • Key Industries: Artificial intelligence, Fintech
  • Funding Landscape: $25.5 billion in venture capital funding in 2024 (Pitchbook)
  • Notable Investors: Greycroft, Thrive Capital, Union Square Ventures, FirstMark Capital, Tiger Global Management, Tribeca Venture Partners, Insight Partners, Two Sigma Ventures
  • Research Centers and Universities: Columbia University, New York University, Fordham University, CUNY, AI Now Institute, Flatiron Institute, C.N. Yang Institute for Theoretical Physics, NASA Space Radiation Laboratory

Sign up now Access later

Create Free Account

Please log in or sign up to report this job.

Create Free Account