
Somto A. Mbah
Senior Data Engineer & Developer
Senior Data Engineer with 6+ years of expertise in Big Data Architecture and Full-Stack Development. Proven track record in designing scalable ETL/ELT pipelines using Apache Spark and Airflow, orchestrating containerized workloads with Kubernetes, and provisioning AWS Cloud Infrastructure (Redshift, EC2) via Terraform. Expert in Python, SQL, and React-based visualization to drive operational efficiency and business intelligence.
Skills
Data Engineering & Analytics
Web Development
Cloud & DevOps
Methodologies
Education
B.Sc. Computer Science
Minor in Statistics
University of Manitoba
2014 - 2018
Experience
Full Stack Developer & Senior Data Engineer
RK Publishing
Jun 2021 β Present
- βΉCustom CRM Engineering: Architected a scalable Python CRM, implementing Redis caching to slash query latency by 20% and drive a 35% increase in sales team operational throughput.
- βΉData Migration & Architecture: Containerized Apache Spark (PySpark) ETL pipelines using Docker and orchestrated CI/CD workflows via GitHub Actions and Airflow to migrate TB-scale datasets into AWS Redshift, ensuring GDPR compliance and 98% data integrity.
- βΉStrategic Technical Leadership: Directed technical strategy for system enhancements, serving as the subject matter expert on software architecture; drove a 40% improvement in user adoption and a 25% reduction in reported bugs through proactive root-cause analysis.
- βΉPerformance Optimization: Established a robust CI/CD pipeline using GitHub Actions to trigger automated PyTest and Jest suites, replacing manual QA and reducing production issues by 30% while adhering to GDPR data privacy standards.
- βΉClient Solutions: Partnered with key accounts to resolve complex technical challenges, resulting in a 15% increase in client satisfaction scores.
Lead Real-time Data Analyst
24-7 Intouch
Jan 2019 β Apr 2021
- βΉAnalytics Operations: Directed real-time data analytics operations, leveraging Power BI and SQL to optimize resource allocation, driving a 20% surge in team productivity and 5% reduction in operational downtime.
- βΉProcess Automation: Engineered automated ETL ingestion workflows to deprecate manual reporting, increasing data accuracy by 25% and ensuring 99.9% uptime for C-suite executive dashboards.
- βΉPredictive Modeling: Optimized workforce planning by strategically analyzing historical data trends, resulting in a 30% increase in Service Level Objective (SLO) attainment.
- βΉTeam Leadership: Led technical training and onboarding initiatives, reducing time-to-productivity for new analysts by 40%.
Projects

WE Properties App
A luxury real-estate application concept featuring a reactive 'alive' interface, advanced filtering, and a premium golden aesthetic.

OpenMetric ETL Platform
End-to-end data pipeline built with Python and Airflow, featuring automated quality validation and interactive React/D3.js dashboards for cryptocurrency sentiment analysis.

Bonjour Book
An interactive bilingual digital book for children featuring custom SVG illustrations, text-to-speech narration, and word highlighting.

Bellabeat Case Study
Data analysis of Fitbit fitness trackers to identify trends and inform marketing strategies for a health-focused tech company.