I'm a Data Scientist/Data Engineer at Solita where I design and develop data pipelines and workflows.
✦ Built and orchestrated end-to-end data pipelines from ingestion to dbt transformations.
✦ Implemented cloud infra with Terraform and CI/CD pipelines.
✦ Developed Flask APIs to serve data across internal apps.
- SolitaData Scientist/Data EngineerMar 2025 - Current
- Braive, digital therapy platformData ScientistAug 2024 - Feb 2025
✦ Analyzed app usage data to improve onboarding funnel and retention of users.
✦ Mapped business needs to events tracking in the Braive app, enabling PM and leadership visibility into the product.
✦ Owned data in AI feature sprints to automate the clinicians work. - RISE Research Institutes of SwedenMaster Thesis InternJan 2024 - Jun 2024
✦ Designed, built, and trained data synthesis systems for autonomous driving.
✦ Built data generation pipeline for rare driving scenarios.
✦ Worked as an ML engineer in a project between RISE and Smart Eye. - KBLab, National Library of SwedenData ScientistJan 2023 - Dec 2023
✦ Built Evals for the curation of data used for training Swedish LLMs such as KBWhisper.
✦ Constructed workflows for digitizing text materials, with OCR on PDFs and text segmentation using pre-trained models on Huggingface.
✦ Curated large national datasets, including the parliamentary proceedings, helping with transparency to the Swedish government.
- Uppsala UniversityMSc - Machine Learning (Aug 2022 - Jun 2024)BSc - Mathematics and Statistics (Aug 2018 - Jun 2022)
In parallel to my studies, I also served as a lecturer and held workshops in Statistics within experimental design and predictive models for data science undergrads.
I also developed a few projects:
✦ A real-time pipeline in Kafka/Spark on UPPMAX clusters.
✦ Implemented a Bayesian sports ranking algorithm in Python.
✦ Benchmarked and tuned clustering algorithms on real-world datasets.