Will Jobs

Data scientist and software engineer with 7+ years of consulting experience who can bridge the communication gap with decision-makers

EXPERIENCE

Travelers Insurance
Senior Associate Data Scientist ← Associate Data Scientist ← Data Science Consultant
Jun. 2020 – Present
  • Lead the migration of a business-critical modeling codebase from on-premises SAS to Python in the cloud (AWS). Technical requirements include the ability to process over 100 million auto policies representing billions of dollars in revenue, runtime performance that meets or exceeds the current solution, ease of use for less technical users, and thorough testing
  • Use two-stage generalized linear models to appropriately price auto insurance policies worth over $10 billion of written premium
  • Trained a propensity model in Python using XGboost (a gradient-boosting library) and used a matching algorithm in R to understand the change in customer profile after the introduction of a new insurance product
  • Built a model to identify cross-selling opportunities as part of a data science competition. Out of a field of nearly 30 teams, came in second place.
ROI Solutions
Database Developer II ← PL/SQL Developer
Feb. 2018 – Jan. 2020
  • Added functionality and improved performance (in some cases over 2X) and maintainability of an Oracle back-end that powers a CRM application serving dozens of non-profit clients with a combined donor base in the millions
  • Developed a model to improve the performance of a client’s direct mail campaign, increasing response rates over 20%
  • Worked with client-facing teams to address issues, identify bugs, and better understand customers’ needs
The Cadmus Group
Associate ← Senior Analyst ← Analyst ← Research Analyst
Aug. 2011 – Feb. 2018
  • Conducted analyses and created visualizations using R, Python, SQL, and Tableau in projects related to climate change, hydraulic fracturing, drinking water contaminants, greenhouse gas emissions, and home energy use
  • Developed database applications in Microsoft Access in support of a variety of EPA programs
  • Created a web application in a highly secured, locked-down environment for the Army Corps of Engineers to prioritize assets most vulnerable to climate change

EDUCATION

University of Massachusetts, Amherst
Master of Science in Statistics (3.98 GPA)
Aug. 2019 – May 2021

Certificate in Statistical and Computational Data Science. Coursework included regression, Bayesian statistics, machine learning, neural networks, natural language processing (NLP), survival analysis, design of experiments, and visualization

Vassar College
Bachelor of Arts in Chemistry, Minor in Computer Science (3.99 GPA)
Aug. 2006 – May 2010

General Honors, Departmental Honors, Phi Beta Kappa Society. Publications in ACS Omega and Biophysical Journal

PROJECTS

Neural Network Dog Breed Classifier | Python, fastai, Streamlit, deep learning
  • Created a user-friendly Streamlit application to classify photos of dogs into one of 150 AKC-recognized breeds
  • Scraped approximately 20,000 images of dogs from the internet for training and scoring
  • Used fastai and transfer learning to fine-tune an ImageNet-based convolutional neural network (CNN), achieving a test set accuracy of 53.8%
Python Wrapper for Regulations.gov API | Python
  • Developed a high-level API to download public comments from Regulations.gov
  • Used Python to greatly simplify the download process by handling the complex pagination scheme, API request limits, and layers of requests needed to access comment data

TECHNICAL SKILLS

Languages: Python, R, SAS, SQL, PL/SQL (Oracle), Java, C++, JavaScript, HTML, CSS, VBA

Cloud: Amazon Web Services (AWS) (Certified Cloud Practitioner), Google Cloud Platform (GCP)

Machine Learning: scikit-learn, PyTorch, fastai, H2O, DataRobot

Visualization: D3.js, ggplot2, Matplotlib, Tableau

Other: Git, Linux, Excel, Access