What I do
With 4+ years of proven experience transforming complex data into actionable insights that drive real business impact across industries. I specialize in building scalable data pipelines, uncovering hidden patterns, and delivering analytics solutions that empower decision-makers. Below is a quick overview of my core data analytics and engineering capabilities. Want to explore further? Check out my resume and data projects portfolio to see how I turn data challenges into competitive advantages.
Data Analysis & Visualization
Transform raw data into compelling visual stories. Expert in statistical analysis, trend identification, and executive-ready dashboards for decision-making.
Business Intelligence & Reporting
Create executive dashboards and KPI tracking systems. Transform complex data into clear, actionable insights that stakeholders understand and use.
Machine Learning & Predictive Analytics
Develop predictive models and statistical solutions. From loan approval predictions to market trend analysis, build ML solutions that drive business decisions.
Data Engineering & ETL
Build robust data pipelines and automated workflows. Deliver data extraction, transformation, and scalable architectures that process thousands of records daily.
- All
- Data Engineering
- Data Analysis
- Visualization
- Machine Learning
- SQL Projects
Financial Data Engineering Pipeline
Production-grade stock data pipeline merging Polygon.io and Yahoo Finance. Handles 50K+ daily records with sub-30s latency, fault-tolerant design, and comprehensive monitoring for backtesting and portfolio analysis.
- Multi-source fusion (Polygon.io + Yahoo Finance)
- Market-aware automation
- JSONB metadata & batch inserts
- Comprehensive monitoring & metrics
Python Financial Analytics Collection
Tech stock performance analysis for Apple, Microsoft, Netflix, Google, Amazon, and Meta. Reveals Netflix's highest volatility and strong Apple-Microsoft correlation using moving averages and technical indicators.
- Moving-average crossover logic
- Volatility comparisons
- Correlation findings
Tableau Portfolio (Netflix & UK Jobs)
Interactive Tableau dashboards analyzing Netflix content trends and UK job demographics. Features time-series analysis, KPI tracking, and comparative visualizations published to Tableau Public.
- Netflix content analysis dashboard
- UK job demographics dashboard
- Published to Tableau Public
Metabase BI Case Studies (Piespace)
Comprehensive Metabase BI dashboards for Piespace case study covering revenue analysis, product performance, customer metrics, invoice tracking, and feedback insights with executive-level KPI visualizations.
- Executive-friendly KPI design
- Practical funnel/retention views
- Invoice & feedback analysis
US Consumer Complaints (SQL)
SQL analysis of Consumer Financial Protection Bureau complaints. Data cleaning with date formatting, NULL handling, state-based filtering, and automated ID generation for complaint tracking and pattern analysis.
- Robust cleaning steps
- Targeted slice-and-dice queries
- Date handling & ID generation
Forbes Global 2000 (SQL)
SQL analysis of Forbes Global 2000 companies. Data cleaning, type conversion, performance ranking, and comparative financial metrics analyzing asset base versus market capitalization from Kaggle dataset.
- Consistent typing & cleaning
- Comparative metrics analysis
- Performance ranking
Loan Approval Predictor (Django + ML)
Django web app with Random Forest ML model predicting loan approval probability (80%+ accuracy). Dockerized deployment with PostgreSQL, analyzes income and credit history for instant loan assessment feedback.
- End-to-end ML to web deployment
- Local & Docker runs
- Basic security practices
Martin Kilombe
Data Analyst ● Data Engineer- +254713342013
- martin@martinkilombe.dev
- www.martinkilombe.dev
Lead Data Analyst and Data Engineer with 4+ years of progressive experience transforming data into actionable business insights. Expertise in end-to-end data solutions: from building robust ETL pipelines processing 10M+ records monthly to creating executive dashboards that improve decision-making by 40%. Proven track record in data quality frameworks, real-time analytics, and leveraging emerging technologies like local LLMs for automated reporting. Strong background in stakeholder collaboration, process optimization, and driving data-driven culture across organizations.
Work Experiences
Lead Data Analyst driving data-driven decision making and organizational insights. Specializing in stakeholder collaboration, dashboard development, and data quality frameworks. Expert in SQL optimization, real-time analytics, and emerging AI technologies for automated reporting and enhanced privacy in data processing.
- Collaborated closely with stakeholders to deliver dashboards and insights, improving decision-making effectiveness by 40%.
- Designed and implemented a real-time data quality framework monitoring 100+ metrics, improving data health visibility across the organization.
- Optimized SQL queries and PostgreSQL processes, reducing execution times by 30% and ensuring reliable insights delivery.
- Documented reporting processes and governance guidelines, aligning analysis outputs with compliance standards.
- Pioneered integration of local LLMs (Google MedGemma) to automate summarization and enhance reporting privacy.
Data Engineer focused on building scalable ETL pipelines and data infrastructure. Specialized in Python and SQL development, data warehouse architecture, and implementing reliable data orchestration workflows with high uptime requirements.
- Designed and deployed ETL pipelines in Python and SQL, automating ingestion of 10M+ financial and operational records monthly, cutting manual data preparation by 60%.
- Built and maintained data warehouse structures (PostgreSQL & Cloud SQL) that supported real-time analytics for 5 business units, reducing reporting delays by 40%.
- Implemented data orchestration workflows with Docker, improving deployment reliability and achieving 99.5% pipeline uptime across environments.
- Developed monitoring and alerting systems for pipelines, enabling 30% faster incident resolution and proactive issue detection.
Data Analyst focused on process automation, data quality initiatives, and cross-functional collaboration. Specialized in Python and SQL automation, dashboard design, and building organizational data literacy.
- Automated key reporting processes using Python and SQL, reducing reporting turnaround by 50% and improving accessibility for operational teams.
- Implemented data quality monitoring initiatives, cutting data errors by 20% and strengthening confidence in insights.
- Collaborated across teams to define KPIs and design dashboards, improving campaign targeting efficiency.
- Trained non-technical users in data interpretation, enhancing organizational data literacy and resilience in analytics adoption.
Business Intelligence Analyst specializing in market research, competitive analysis, and data modeling. Focused on delivering strategic insights for business growth and maintaining robust data infrastructure for actuarial and business analysis.
- Conducted in-depth market research and competitive analysis to identify market trends, customer preferences, and potential opportunities, providing valuable insights for strategic planning and business growth.
- Developed and maintained data models, cubes, and metadata documentation, enabling efficient ad-hoc analysis.
- Produced market research and competitive analysis reports that informed strategic planning and product development.
Projects
Dual-source OHLC + real-time market data pipeline with Python, PostgreSQL, and SQLAlchemy. Features market-aware scheduling, structured logging, and performance tracking with automated validation and JSONB metadata storage.
Comprehensive financial data analysis toolkit using Python, Pandas, NumPy, and Matplotlib. Automated data collection, statistical analysis, and interactive visualizations for financial market insights.
Interactive dashboards exploring content patterns and labor market trends. Features KPI tiles, time-series analysis, and comparative views published to Tableau Public.
End-to-end ML deployment using Random Forest algorithm in Django application. Containerized with Docker, featuring web interface for loan approval predictions and model performance tracking.
Comprehensive SQL analysis of US consumer financial complaints data. Data cleaning, trend analysis, and insights extraction using advanced SQL techniques and aggregation functions.
Skills
Data Tools
- SQL, Power BI, Tableau, Metabase, Excel
Data Engineering & Pipelines
- ETL development, Data Warehousing (PostgreSQL, Cloud SQL, BigQuery)
- Orchestration (Airflow, dbt), Streaming (Kafka, Spark Streaming)
- CI/CD (GitHub)
Programming & Automation
- Python (Pandas, NumPy, Scikit-learn), Bash, Docker
Business Intelligence
- Dashboard creation, KPI development, stakeholder reporting
Data Governance & Quality
- Data validation, profiling, and monitoring
AI & Emerging Tech
- AI data preparation, ethical AI practices, local LLMs for summarization & insights
Education
MSc in Finance and Accounting
2021 - 2023
Bachelor of Science in Actuarial Science and Statistics
Second Class - Upper Division
2015 - 2019
Certification
Interests
- Hiking
- Go Karting
- Bowling
- Travelling
Get In Touch
Whether you're curious about my recent work, interested in a collaboration, or just want to share your thoughts, I'd love to hear from you. Let's create something impactful together.