Maria Oros

Data Science Consulting · Statistics · Machine Learning · AI

I help research groups and organizations turn complex data into clear, defensible decisions. Drawing on a decade of experience across industry and academia — and my current role at the UW–Madison Data Science Institute — I deliver statistical modeling, machine learning, and AI solutions for high-stakes scientific and business problems.

My work centers on translating rigorous methods — causal inference, Bayesian statistics, and foundation models — into production-ready systems, reproducible tools, and results that hold up to scrutiny. I partner with research labs, industry teams, and open-source communities on projects spanning agriculture, biopharma, finance, and beyond.

I trained in mathematics at the University of Guanajuato and CIMAT under Dr. Carlos Valero and Dr. Rafael Herrera Guzmán, and now work alongside Dr. Kyle Cranmer and faculty at UW–Madison. As a Latina in STEM and first-generation college graduate, I lead engagements with care — building the kind of collaborative space where rigorous work and good partnership reinforce each other.

Professional Experience

UW–Madison Data Science Institute
Powered by American Family Insurance

Data Scientist

2023 – current
  • Managed a portfolio of data science research projects with UW–Madison, partner universities, and industry stakeholders.
  • Designed and developed statistical, machine learning, and AI-driven solutions.
  • Reported results and insights directly to academic and industry stakeholders.

Game Coder Studios

Data Analyst

2022 – 2023
  • Core data analyst for Runiverse, an MMORPG video game.
  • Performed statistical analysis and large-scale simulations to evaluate game design scenarios.
  • Provided data-driven recommendations to support strategic game design decisions.

BBVA (Mexico & Spain)

Analyst Data Scientist

2021 – 2022
  • Led the development of a national-scale credit risk assessment model for individual customers.
  • Built and validated statistical models used in production decision systems.
  • Collaborated directly with stakeholders across Mexico and Spain.

Aprende Institute

Data Analyst Mid

2021
  • Designed A/B testing principles to improve user experience and learning outcomes.
  • Member of the Insights team supporting data-driven decisions for the front office.
  • Applied statistical analysis to evaluate E-learning platform performance and user behavior.

True Home

Junior Data Scientist

2019 – 2021
  • Developed and deployed a national real estate pricing model used in production.
  • Led data curation and extraction pipelines across multiple data sources.
  • Applied NLP techniques to detect legal issues in property listings and improve automation reliability.

CIMAT & University of Guanajuato

Mathematics Degree — Graduate Studies

2018
  • Graduated By thesis on Hamiltonian Systems and Gaussian Processes, with coursework in Dynamical Systems, Fourier Analysis, Stochastic Processes, and Mathematical Physics.
  • Awarded the CONACYT scholarship to pursue thesis research in theoretical mathematics and physics.
  • Thesis dissertation approved unanimously upon defense.

Publications & Research

Selected Publications

↗ Full list on Google Scholar

Presentations & Talks

Talk
Poster
Lightning
Video
A Hybrid model for Protein Purification
DSI Reading Group · UW–Madison
April 2026
UW-Madison Reseach Bazaar · YouTube
March 2026
Meta-analytical application and open source tools in agriculture
Midwest Machine Learning Symposium · University of Chicago
Jun 2025
Can LLMs be anomaly detectors?
Data Science Institute · UW–Madison
May 2025
An open-source crop disease forecasting tool
Research Bazaar · UW–Madison
Feb 2025
UW-Madison Research Bazaar · YouTube
March 2025
UNAM, Ciencias.TV · Facebook Live
2018

Services

Engagements typically combine the practices below — scoped to the problem, the data, and the decision at stake.

Statistical Modeling

  • Bayesian inference & hierarchical models
  • Causal inference & experimental design
  • Meta-analysis & evidence synthesis
  • A/B testing & decision analysis

Machine Learning & AI

  • Predictive & forecasting models
  • Foundation models & LLM applications
  • Anomaly detection & risk scoring
  • Model validation & monitoring

Research Collaboration & Advising

  • Applied research partnerships
  • Method development & technical review
  • Reproducible open-source tools
  • Strategic advising for data teams

Tooling

Languages

  • Python
  • R
  • SQL
  • SAS

Cloud & DevOps

  • AWS
  • GCP
  • Docker
  • Git / GitHub

Featured Projects

HIC Modeling

Mechanistic Modeling of Hydrophobic Interaction Chromatography

Boehringer Ingelheim · Industry Collaboration

Developed mechanistic models to characterize protein behavior in hydrophobic interaction chromatography (HIC), a key purification step in biopharmaceutical manufacturing. The models capture adsorption dynamics and elution profiles to support process development and reduce experimental burden.

Mechanistic Modeling Chromatography Bioprocessing
Open Lambda

Open Lambda

Contributed to the design and development of the Open Lambda website, shaping the visual identity of this open-source serverless platform.

Agricultural Forecasting Tool

Agricultural Forecasting System

Open-source tool for predicting crop disease risks using ML models.

ROI Calculator

Economic ROI Calculator

Economic models for fungicide profitability assessment.

Team & Collaborators

What Collaborators Say

Themes that emerge across recommendations:

Moments

A few moments with my team and collaborators across projects in data science, agriculture, and open-source tools.

Get In Touch