I'm Data Scientist and former Astrophysicist working in Fintech.
π Data Science Advisor and Educator @ fintech startup
π Automate cash forecasting, foreign exchange (FX) hedging, and cash reporting with Python
π©πΌβπ« Teach corporate treasury teams (including Fortune 500 companies) about AI, ML, and LLMs
π PhD in Astronomy & Astrophysics + HBSc in Astronomy & Physics from the University of Toronto
π¬ Taught 500+ technical and non-technical students over 17 classes, including on the use of Python
π Published several quantitative research papers in high-impact journals
π₯ Mentored a student on a year-long data project through to publication
π§΅ Fun fact: I cross-stitch realistic astronomy observations on Etsy
Languages: Python (Scikit-Learn, Pandas, NumPy, Matplotlib, Seaborn, SciPy), SQL (BigQuery, MySQL)
Tools: Git/GitHub, Jupyter, Streamlit, Docker, APIs (Claude, OpenAI), Tableau
Email me at jessicacampbell.astro@gmail.com
Connect with me on LinkedIn at linkedin.com/astrosica
This repository contains my data science portfolio projects, implemented primarily in Python.
End-to-end ML projects in Python including standard ML workflows with an emphasis on comparing multiple algorithms, evaluating tradeoffs, and model interpretability.
- Predicting Credit Card Approvals: Modelled credit card application approvals using demographic and financial features. Trained logistic regression, KNN, and random forest models to compare performance and interpret key drivers of approval decisions.
Implementation of foundational ML algorithms in Python, including standard pre-processing, feature engineering, hyperparameter tuning, and model evaluation.
- Predicting Loan Defaults with Random Forest: Built a random forest model to predict loan default likelihood using financial data. Addressed class imbalance (16% default rate) with hyperparameter tuning (optimizing average precision), threshold optimization (optimizing F2-score), and SMOTE resampling.
- Classifying anonymized data with KNN: Built a KNN model to classify anonymized data into two categories, demonstrating the impact of feature scaling and k-value tuning.
- Predicting Ad Clicks with Logistic Regression: Modeled ad-click likelihood using demographic and behaviour information with a logistic regression model. Implemented feature engineering (including cyclical temporal feature mapping), multicollinearity reduction, threshold optimization, and model performance testing.
- Predicting Synthetic Credit Scores with Linear Regression: Modeled synthetic credit scores based on financial and demographic features using linear regression. Implemented feature engineering, correlation analysis, multicollinearity reduction, and statistical significance testing to evaluate feature importance and improve model interpretability.
Data analytics projects using SQL, Tableau, and Excel, focusing on data storytelling through dashboards and reports.
- Insurance Analysis: Developed an interactive Tableau dashboard to report and analyze 70K insurance claims to support marketing and budget decisions.
- Marketing Analysis: Analyzed 100K e-commerce sales records using SQL (Google BigQuery) and Excel to uncover trends in customer behaviour, reporting sales and marketing metrics using an interactive Tableau dashboard.
- TTC Delay Analysis: Cleaned and analyzed 40K subway delay records for 2022-2023 using SQL and Tableau, assessing YoY KPIs and delay causes and providing performance improvement recommendations.

