About

About Me

I’m a data engineer and scientist with experience in managing projects, building machine learning models, and designing real-time dashboards. I enjoy analyzing data and creating systems that improve business performance.

I’ve applied my skills in various arenas, from manufacturing to machine learning competitions. Whether it’s blending predictions using my custom hill-climbing algorithm or streamlining data pipelines, I’m always excited to find the story hidden in the data.

Let’s connect and explore the ways I can contribute to your team!

Career

Career

My work has included large-scale project management, Python module development for machine learning/data analysis, and high-level Kaggle competitions. I bring both technical skills and a problem-solving mindset to every project I work on.

05/2023 – 04/2024

Application / Data Engineer

iNOEX, Lancaster, PA

I led the iDM 4.0 project (iNOEX Data Management), building machine learning models, real-time dashboards, and automated reporting for global manufacturers, while ensuring seamless data integration and client support.

01/2020 – 01/2022

Machinist

Acero Precision, West Chester, PA

At Acero Precision, I sharpened my problem-solving skills by programming CNC machines and optimizing tool paths for surgical medical parts, building the technical precision I now bring to data engineering and machine learning.

05/2022 – Present

Kaggle Master

Kaggle

As an experienced participant on Kaggle, Google’s data science platform, I’ve uploaded custom datasets, shared code showcasing my data analysis, and developed machine learning models to compete in various challenges.

08/2017 – 11/2019

Cofounder

Active Impulse, Scottsdale, AZ

Active Impulse, an e-commerce platform for Polaris vehicle parts, was my first entrepreneurial challenge. I wore many hats as the business grew: database design, front-end development, and optimizing data handling and inventory.

Projects

Personal Projects

In my quest for efficiency, I’ve developed tools and solutions for data analysis and machine learning. I've had a great deal of fun and gained valuable knowledge doing these. This collection of projects demonstrates my dedication to continuously improving my skills.

December 5th, 2022

fasteda: A Way to Accelerate My Exploratory Data Analysis

I built fasteda, a Python module for rapid exploratory data analysis of DataFrames. It offers key statistics, correlation plots, and visualizations like histograms and pairplots, with enhanced support for binary classification datasets.

April 13th, 2023

hillclimbers: An Adaptable Hill Climbing Algorithm

hillclimbers is a Python module I developed that blends machine learning model predictions using hill climbing to optimize performance. It selects diverse models and adjusts weights to improve the desired evaluation metric, and helped me achieve two 4th place finishes in Kaggle competitions.

September 12th, 2024

How I Created the Largest Geoguessr Dataset in the World

Geoguessr is a game where you are dropped into google street view in a random part of the world and your objective is to guess exactly where you are. Check out my YouTube video where I demonstrate how I used the Geoguessr API to build this dataset, perform analysis, and create visualizations.

Competitions

Competitions

I’ve achieved top rankings in various machine learning competitions, including high placements in regression, classification, and feature imputation tasks. These contests have covered a wide range of topics, such as predicting region flooding, wild blueberry yield, academic risk of students, the probability of an instance being a highly magnetized rotating neutron star, feature imputation in a heat flux dataset, and employee attrition.

Hover over and click through the pictures below to view solution details.

0 Completed Competitions
0 Datasets Created
0 Jupyter Notebooks
0 Discussion Contributions

My Kaggle Stats

My Kaggle statistics reflect my active engagement in the community and my commitment to collaboration and knowledge sharing on Google's data science platform.

Skills

Skills

These are the skills I have developed the most expertise in for working with data in a diverse set of contexts.

Contact

Contact Me

I’m actively seeking opportunities in data analysis, data engineering, or machine learning. If you're looking for someone who can bring strong analytical skills and technical expertise to your team, feel free to reach out!

Contact Number

+1 610-757-7862

Email Address

matthewhill.op@gmail.com