Work
  • Solita
    Data Engineer
    Mar 2025 - Current

    Currently, I am a data engineer at Solita where I design and develop data pipelines and workflows.

    ✦ Analyzed user journeys and removed friction points, cutting drop-offs by about 20%.
    ✦ Built and orchestrated complex, end-to-end data pipelines, from ingesting APIs to transformations.
    ✦ Owned data governance, defining retention rules and masking PII.

  • Braive, Mental Healthcare App
    Data Scientist
    Aug 2024 - Mar 2025

    ✦ Analyzed app usage data to optimize user funnels, resulting in effective onboarding and improved user retention.
    ✦ Assisted with user and data insights during AI feature sprints that eases the clinician workflow.
    ✦ Acted as a bridge between leadership and technical staff by translating business needs to events data tracking and data pipelines.

  • RISE Research Institutes of Sweden
    Master Thesis Intern
    Jan 2024 - Jun 2024

    ✦ Worked as an ML engineer in cross-functional project between RISE and Smart Eye.
    ✦ Designed and built state of the art data synthesis methods.
    ✦ Assisted developers at Smart Eye with the implementation, which resulted in improved accuracy of autonomous‑driving systems.

  • KBLab, The National Library of Sweden
    Data Scientist
    Jan 2023 - Dec 2023

    ✦ Curated large open source datasets, including the parliamentary proceedings, increasing transparency into the Swedish government.
    ✦ Helped with the training of Swedish Large Language models including KBWhisper.
    ✦ Built end-to-end pipelines for unstructured digitised materials, with OCR on PDFs, to text segmentation with Huggingface models, to structured datasets.

Education
  • Uppsala University
    MSc - Machine Learning (Aug 2022 - Jun 2024)
    BSc - Mathematics and Statistics (Aug 2018 - Jun 2022)

    While pursuing my degree, I delivered lectures and led workshops on experimental design and predictive modelling for data science undergrads. That taught me alot about explaining complex ideas clearly.

    I developed quite a few projects during that time.

    ✦ A real-time data pipeline in Kafka + Spark Streaming on UPPMAX clusters.
    ✦ An implementation of a Bayesian sports ranking algorithm in Python.
    ✦ A benchmark and tuning of clustering algorithms on real datasets.