A
Alpaca
Backend
Operations Reliability Engineer - Automations New
GoTypeScriptReactSqlPostgreSQLDockerKubernetesCi/CdMicroservicesApis
About the Position
Alpaca is a US-headquartered self-clearing broker-dealer and brokerage infrastructure for stocks, ETFs, options, crypto, fixed income, 24/5 trading, and more. They are seeking an Operations Reliability Engineer to build automations and tooling to optimize operational workflows and reduce manual processes.
Responsibilities
- Design, build, test, deploy, and monitor production automations and UIs that remove manual steps and reduce operation time.
- Partner with frontend engineers to productize ops tooling so global teams can run functions with predictable staffing.
- Execute operational procedures to surface painful manual processes prior to automation.
- Instrument and report baseline and outcome metrics (MTTC, manual-steps removed, queue sizes, ops satisfaction) and iterate based on measured impact.
- Produce Platform Opportunity Briefs / RFCs for higher-level platform tooling and automations.
- Collaborate with licensed BD leadership, Compliance, and Security to build auditable, safe automations with role-based access and clear runbooks.
- Own the full lifecycle of the systems you build, including automated deployment (CI/CD with tools like ArgoCD and Terraform), proactive monitoring, On-call support rotations and incident response, following a "you build it, you run it" philosophy.
- Build systems with auditability, traceability, and data lineage as a first-class concern to ensure transparency for our auditors and regulators.
Requirements
- 5+ years of professional software engineering experience, with a proven track record of shipping and operating complex, large-scale systems in production.
- Strong business sense and understanding of operations.
- Deep, hands-on expertise in Golang, including a strong command of its concurrency models (goroutines, channels), memory management, and standard library.
- Proven track record of building user-facing features end-to-end with Typescript/React.
- Proficient with SQL and relational databases, preferably PostgreSQL.
- Demonstrated ability to reason about human workflows as systems, not just software services.
- Experience with observability, tracing, continuous profiling.
- Exceptional analytical and problem-solving skills, with the ability to deconstruct complex requirements into clear technical components and excellent communication skills for working in a cross-functional environment.
- High ownership mindset with bias toward durable, structural fixes over tactical patches.
Benefits
- Competitive Salary & Stock Options
- Health Benefits
- New Hire Home-Office Setup: One-time USD $500
- Monthly Stipend: USD $150 per month via a Brex Card
Operations Reliability Engineer - Automations New