SKILLS SUMMARY:

  • Expertise in architecting, developing, and deploying large-scale applications with a focus on reliability, stability, and maintainability
  • Introducing DevOps, monitoring, observability, and reliability best practices into existing Engineering organizations
  • Experience introducing, migrating to, and maintaining Kubernetes systems across multiple environments in multiple corporations
  • Invested in keeping up-to-date with new languages, tools, frameworks, and relevant technologies and methodologies
  • Ability to work productively with individuals at all levels and roles within an organization

TECHNOLOGIES:

  • Kubernetes (CNCF ecosystem)
  • Cloud Architecture (AWS/GCP)
  • Terraform
  • CI/CD Processes and Tooling
  • Python/Flask (and related ecosystem)
  • Docker
  • PostgreSQL
  • Non-relational Data Stores
  • Apache Kafka/Redpanda
  • Node.js/Golang
  • Ansible (and alternatives)

WORK EXPERIENCE:

Senior Software Engineer, Platform

Bond Financial Technologies (acq. FIS), USA (Remote)

06/2021 - Current

  • Responsible for the operation and maintenance of our Cloud infrastructure, reliability, observability, and monitoring
  • Introduced and educated the engineering team on the use of Datadog for observability and monitoring
  • Coordinated with external security consultants on maintaining SOC2 Type 2 certification
  • Led efforts to remove redundant systems and duplicated functionality to reduce operational complexity
  • Reconciled existing infrastructure and tooling into appropriate Terraform projects
  • Assisted Product Engineering with new feature development based on priorities and required timelines
  • Mentored junior engineers to effectively manage projects themselves with appropriate prioritization and communication

Senior Software Engineer - Site Reliability Engineering

Fullstory, Austin, TX, USA (Remote)

02/2019 - 06/2021

  • Migrated node scheduling from job-based taints to an attribute-based model
  • Managed production and pre-production Kubernetes environments
  • Managed and scaled our internal Prometheus-based monitoring systems
  • Evangelized the introduction of a Service Mesh across the engineering organization

Senior Software Engineer - Site Reliability Engineering

Yonder (formerly New Knowledge), Austin, Texas, USA

02/2019 - 02/2020

  • Owned and prioritized all DevOps, Infrastructure, and Site Reliability requirements
  • Executed a Cloud Migration strategy to migrate a live workload from Azure to AWS
  • Maintained multiple Kubernetes clusters for both production and staging workloads
  • Worked with individual Product Engineering leads to reduce operational complexity and streamline our engineering process
  • Actively worked to reduce existing overengineered solutions and improve engineering productivity

Senior Software Engineer - Infrastructure

Pixlee, Austin, Texas, USA

11/2018 - 02/2019

  • Updated the development workflow of core applications to include modern and professional software engineering practices
  • Designed and developed reproducible, automated developer environments based on Kubernetes
  • Identified and communicated fundamental issues in the existing configuration management, and developed a safe migration plan to correct the issues
  • Identified and communicated issues in the production infrastructure that negatively impacted system cost, reliability, and operational insight
  • Developed a safe, long-term plan to migrate to Kubernetes in order to reduce infrastructure bloat, consolidate services, improve reliability, and ease operational burden

Staff Software Engineer

Cratejoy, Austin, Texas, USA

01/2018 - 10/2018

  • Managed our production Kubernetes infrastructure, staging environments, and CI/CD pipelines
  • Interfaced with individual product teams in order to plan for upcoming deployment, monitoring, and tooling needs
  • Architected and managed the development of the Cratejoy Custom Domain SSL feature (with Let's Encrypt)
  • Migrated our central application deployments to team-specific automated deployments
  • Developed internal services to ease the development of user-facing products

Senior Software Engineer

Cratejoy, Austin, Texas, USA

02/2015 - 01/2018

  • Formed and led our Site Reliability Engineering team in order to prioritize stability, reliability, performance, and ease of development
  • Developed internal support for, and implemented, an Engineering On-Call rotation and emergency response playbook
  • Identified, investigated, and resolved platform-wide performance and reliability issues
  • Developed and released a reliable internal Traffic Analysis system (with full granularity), used throughout the company to make business-critical decisions
  • Led a migration of our internal infrastructure from Ansible managed machines to Kubernetes
  • Developed and maintained features for the Merchant Tools section of the Cratejoy Platform
  • Designed, implemented, and rolled out PayPal support for all storefronts, which is used by 1,000+ merchants and accounts for ~15% of platform purchases
  • Received multiple internal awards, including company-wide 'Impact of the Quarter' (Q4 2016) and 'Engineering Values' (Q1 2017)

EDUCATION:

Bachelor of Computer Science

Honours Computer Science Co-op, Psychology Minor

University of Waterloo, Waterloo, Ontario

09/2009 - 12/2014