We are seeking a Senior Platform / Backend Recovery Engineer to take ownership of stabilizing and recovering a complex microservices platform during a critical transition phase. This role is ideal for a senior engineer with deep backend and platform expertise who can quickly assess risk, restore control over distributed systems, and define a path for long-term scalability and modernization. You will work hands-on across architecture, infrastructure, and application layers, supporting system recovery while helping drive ongoing development and migration initiatives. This is a high-impact role requiring strong technical judgment, autonomy, and experience operating in fast-moving, ambiguous environments.
Key Responsibilities
- Lead stabilization and recovery efforts for a partially handed-over microservices platform
- Assess system health, identify risks, and establish a roadmap for remediation and ongoing development
- Troubleshoot and improve distributed backend services, messaging systems, and data flows
- Support and contribute to framework modernization, service migration, and architectural improvements
- Work across infrastructure, CI/CD pipelines, and cloud environments to improve reliability and operational resilience
- Strengthen production security practices, including credential rotation, IAM audits, and incident response readiness
- Collaborate with cross-functional stakeholders while operating with high ownership in a lean engineering environment
- Contribute to technical decision-making and help create sustainable engineering practices for the platform moving forward
Core Requirements
- 8+ years of backend and platform engineering experience
- Strong hands-on experience with Node.js and TypeScript
- Deep understanding of distributed systems and microservices architectures
- Experience using AI-assisted development tools (such as Claude Code) at an architectural level
- Production experience with CQRS and event sourcing, including event stores, aggregate roots, projections, and replay
- Experience with messaging systems such as RabbitMQ, Kafka, or similar technologies
- Strong PostgreSQL and data architecture experience
- Solid understanding of CI/CD, Docker (Docker Swarm preferred), and cloud infrastructure (GCP preferred)
- Ability to work effectively in incomplete or partially documented systems
- Production security experience, including IAM, credential management, and incident response
- Excellent written and verbal communication skills in English
- Ability to work independently in a remote, fast-paced environment
Nice-to-Have
- Experience supporting framework migrations, dependency rebuilds, or IP-sensitive engineering work
- Experience leading strangler pattern migrations
- Hands-on production experience with Auth0, SAML, and SCIM 2.0
- Experience using AI-assisted development tools (such as Claude Code) at an architectural level
- Exposure to no-code platform migrations (Xano, Bubble, or similar)
- Experience working in small, high-ownership engineering teams