WORK EXPERIENCE:
Senior Software Engineer, Platform
Bond Financial Technologies (acq. FIS), USA (Remote)
06/2021 - Current
- Responsible for the operation and maintenace of our Cloud infrastructure, reliability, observability, and monitoring
- Introduced and educated the engineering team on the use of Datadog for observability and monitoring
- Coordinated with external security consultants on maintaining SOC2 Type 2 certification
- Led efforts to reduce remove redundent systems and duplicated functionality to reduce operational complexity
- Reconciled existing infrastructure and tooling into appropriate Terraform projects
- Assisted Product Engineering with new feature development based on priorities and required timelines
- Mentored junior engineers to effectively manage projects themselves with appropriate prioritization and communication
Senior Software Engineer - Site Reliability Engineering
Fullstory, Austin, TX, USA (Remote)
02/2019 - 06/2021
- Migrated node scheduling from a job-based taints, to an attribute based model
- Managed production and pre-production Kubernetes environments
- Managed and scaled our internal Prometheus-based monitoring systems
- Evangelized the introduction of a Service Mesh across the engineering organization
Senior Software Engineer - Site Reliability Engineering
Yonder (formally New Knowledge), Austin, Texas, USA
02/2019 - 02-2020
- Owned and prioritized all Devops, Infrastructure, and Site Reliability requirements
- Executed a Cloud Migration strategy to migrate a live workload from Azure to AWS
- Maintained multiple Kubernetes clusters for both production and staging workloads
- Worked with individual Product Engineering leads to reduce operational complexity and streamline our engineering process
- Actively worked to reduce existing overengineered solutions and improve engineering productivity
Senior Software Engineer - Infrastructure
Pixlee, Austin, Texas, USA
11/2018 - 02/2019
- Updated the development workflow of core applications to include modern and professional software engineering practices
- Designed and developed reproducable and automated developer environments based in a Kubernetes environment
- Identified and communicated fundamental issues in the existing configuration management, and developed a safe migration plan to correct the issues
- Identified and communicated issues in the current production infrastructure which negatively impact system cost, reliabilty, and operational insight
- Developed a safe, long term plan to migrate to Kubernetes in order to reduce the infrastructure bloat, consolidate services, improve reliability, and ease operational burden
Staff Software Engineer
Cratejoy, Austin, Texas, USA
01/2018 - 10/2018
- Managed our production Kubernetes infrastructure, staging environments, and CI/CD pipelines
- Interfaced with individual product teams in order to plan for upcoming deployment, monitoring, and tooling needs
- Architected and managed the development of the Cratejoy Custom Domain SSL feature (with Lets Encrypt)
- Migrated our central application deployments to team-specific automated deployments
- Developed internal services to aid in the ease of development of user facing products
Senior Software Engineer
Cratejoy, Austin, Texas, USA
02/2015 - 01/2018
- Formed and led our Site Reliability Engineering team in order to prioritize stability, reliability, performance, and ease of development
- Developed internal support for, and implemented, an Engineering On-Call rotation and emergency response playbook
- Identify, investigate, and resolve platform-wide performance and reliability issues
- Developed and release a reliable internal Traffic Analysis system (with full grainularity), used throughout the company to make business critical decisions
- Led a migration of our internal infrastructure from Ansible managed machines to Kubernetes
- Developed and maintained features for the Merchant Tools section of the Cratejoy Platform
- Designed, implemented, and rolled out PayPal support for all storefronts, which is used by 1000+ Merchants, and accounts for ~15% of platform purchases
- Received multiple internal awards, including company wide 'Impact of the Quarter' (Q4 2016) and 'Engineering Values' (Q1 2017)