Currently, I am a data engineer at Solita where I design and develop data pipelines and workflows.
✦ Analyzed user journeys and removed friction points, cutting drop-offs by about 20%.
✦ Built and orchestrated complex, end-to-end data pipelines, from API ingestion to transformations.
✦ Owned data governance, defining retention rules and masking PII.
- Solita | Data Engineer | Mar 2025 - Current
- Braive, Mental Healthcare App | Data Scientist | Aug 2024 - Mar 2025
✦ Analyzed app usage data to optimize user funnels, resulting in effective onboarding and improved user retention.
✦ Provided user and data insights during sprints for AI features that ease the clinician workflow.
✦ Acted as a bridge between leadership and technical staff by translating business needs into event data tracking and data pipelines.
- RISE Research Institutes of Sweden | Master Thesis Intern | Jan 2024 - Jun 2024
✦ Worked as an ML engineer in a cross-functional project between RISE and Smart Eye.
✦ Designed and built state-of-the-art data synthesis methods.
✦ Assisted developers at Smart Eye with the implementation, which resulted in improved accuracy of autonomous-driving systems.
- KBLab, The National Library of Sweden | Data Scientist | Jan 2023 - Dec 2023
✦ Curated large open-source datasets, including parliamentary proceedings, increasing transparency into the Swedish government.
✦ Helped train Swedish large language models, including KBWhisper.
✦ Built end-to-end pipelines for unstructured digitised materials, from OCR on PDFs, through text segmentation with Hugging Face models, to structured datasets.
- Uppsala University | MSc - Machine Learning (Aug 2022 - Jun 2024) | BSc - Mathematics and Statistics (Aug 2018 - Jun 2022)
While pursuing my degree, I delivered lectures and led workshops on experimental design and predictive modelling for data science undergrads. That taught me a lot about explaining complex ideas clearly.
I also built several projects during that time:
✦ A real-time data pipeline built with Kafka and Spark Streaming on UPPMAX clusters.
✦ An implementation of a Bayesian sports ranking algorithm in Python.
✦ Benchmarking and tuning of clustering algorithms on real datasets.