Senior Production Engineer
Company: CoreWeave
Location: Sunnyvale
Posted on: February 18, 2026
|
|
|
Job Description:
Job Description Job Description CoreWeave is The Essential Cloud
for AI™. Built for pioneers by pioneers, CoreWeave delivers a
platform of technology, tools, and teams that enables innovators to
build and scale AI with confidence. Trusted by leading AI labs,
startups, and global enterprises, CoreWeave combines superior
infrastructure performance with deep technical expertise to
accelerate breakthroughs and turn compute into capability. Founded
in 2017, CoreWeave became a publicly traded company (Nasdaq: CRWV)
in March 2025. Learn more at www.coreweave.com. About CoreWeave
CoreWeave is the AI Hyperscaler™, delivering a cloud platform of
cutting-edge services powering the next wave of AI. Our technology
provides enterprises and leading AI labs with performant,
efficient, and resilient accelerated-compute infrastructure. As one
of TIME's 100 Most Influential Companies of 2024, CoreWeave
operates a rapidly expanding footprint of data centers across the
US and Europe. We thrive in environments where adaptability,
resilience, and curiosity drive innovation. If you enjoy building
and improving distributed systems, solving hard production
problems, and operating cloud infrastructure at meaningful scale —
you'll feel right at home here. About the Role Production
Engineering ensures CoreWeave's cloud delivers world-class
reliability, performance, and operational excellence. We are hiring
a Senior Production Engineer to take direct, hands-on ownership of
critical tooling that drives reliability and delivery success. In
this role, you will work broadly across the cloud stack designing,
implementing, deploying, and operating systems that improve
delivery velocity, service availability, and operational safety.
You'll be responsible for leading end-to-end technical projects,
maintaining long-lived systems the team owns, and strengthening our
operational foundations through durable engineering investments.
This is a role for someone who enjoys building , debugging , and
operating production systems. You will collaborate closely with
service owners, but your primary impact comes from the reliability,
quality, and maturity of the systems you deliver and maintain over
time. What You'll Do Take hands-on ownership of critical systems
and frameworks, driving their architecture, implementation, and
long-term evolution. Lead end-to-end delivery of engineering
projects that improve availability, scalability, operational
automation, and failure recovery. Build and maintain observability,
alerting, automated remediation, and resilience testing for the
systems you support. Participate in incident response as a
subject-matter expert; drive deep root-cause investigations and
implement lasting fixes. Improve runbooks, sources of truth,
deployment workflows, and operational tooling to harden production
readiness. Eliminate single points of failure and reduce
operational toil through automation, refactors, and system
redesigns. Ship production code regularly in Python, Go, or similar
languages, and participate in on-call rotations. Maintain and
mature long-term projects and frameworks owned by the team,
ensuring they remain reliable, well-instrumented, and easy to
operate. Collaborate with platform teams to ensure new features and
services integrate cleanly with our reliability best-practices and
tooling. What You've Worked On (Minimum Qualifications) 7 years of
engineering experience building and operating distributed systems
or cloud platforms. Demonstrated ability to debug complex
production issues end-to-end, across services, infrastructure
layers, and automation. Strong programming or scripting ability
(Python, Go, or similar), with experience shipping and operating
production services and tools. Deep knowledge of cloud-native
technologies and distributed system patterns, particularly
Kubernetes. Experience with modern observability stacks: metrics,
tracing, structured logs, SLOs/SLIs, and incident lifecycle
practices. A track record of successfully delivering hands-on
reliability improvements through engineering execution. Preferred
Qualifications Experience building internal tooling, frameworks, or
automation that supports high-availability cloud operations.
Familiarity with DR/BCP, service tiering, capacity planning, or
chaos engineering. Background operating or building large-scale AI
or GPU-accelerated infrastructure. Experience maintaining
multi-year ownership of foundational production systems. Why
CoreWeave At CoreWeave, we work hard, have fun, and move fast.
You'll join a team that values curiosity, ownership, and creative
problem-solving. Production Engineering sits at the intersection of
reliability and AI infrastructure, building systems that enable the
world's most powerful AI cloud. Core Values Be Curious at Your Core
Act Like an Owner Empower Employees Deliver Best-in-Class Client
Experiences Achieve More Together Benefits 100% employer-paid
medical, dental, and vision coverage Life, short- and long-term
disability insurance 401(k) with generous employer match Flexible
PTO and childcare support through Kinside Catered lunch daily (for
office-based employees), weekly massages (NY/NJ) Dynamic,
collaborative culture focused on innovation and learning California
Consumer Privacy Act — California applicants only CoreWeave is an
equal opportunity employer committed to diversity and
inclusiveness. We consider all qualified applicants without regard
to race, color, nationality, gender identity or expression, sexual
orientation, religion, disability, or age. What We Offer The range
we've posted represents the typical compensation range for this
role. To determine actual compensation, we review the market rate
for each candidate which can include a variety of factors. These
include qualifications, experience, interview performance, and
location. In addition to a competitive salary, we offer a variety
of benefits to support your needs, including: Medical, dental, and
vision insurance - 100% paid for by CoreWeave Company-paid Life
Insurance Voluntary supplemental life insurance Short and long-term
disability insurance Flexible Spending Account Health Savings
Account Tuition Reimbursement Ability to Participate in Employee
Stock Purchase Program (ESPP) Mental Wellness Benefits through
Spring Health Family-Forming support provided by Carrot Paid
Parental Leave Flexible, full-service childcare support with
Kinside 401(k) with a generous employer match Flexible PTO Catered
lunch each day in our office and data center locations A casual
work environment A work culture focused on innovative disruption
Our Workplace While we prioritize a hybrid work environment, remote
work may be considered for candidates located more than 30 miles
from an office, based on role requirements for specialized skill
sets. New hires will be invited to attend onboarding at one of our
hubs within their first month. Teams also gather quarterly to
support collaboration California Consumer Privacy Act - California
applicants only CoreWeave is an equal opportunity employer,
committed to fostering an inclusive and supportive workplace. All
qualified applicants and candidates will receive consideration for
employment without regard to race, color, religion, sex,
disability, age, sexual orientation, gender identity, national
origin, veteran status, or genetic information. As part of this
commitment and consistent with the Americans with Disabilities Act
(ADA) , CoreWeave will ensure that qualified applicants and
candidates with disabilities are provided reasonable accommodations
for the hiring process, unless such accommodation would cause an
undue hardship. If reasonable accommodation is needed, please
contact: careers@coreweave.com. Export Control Compliance This
position requires access to export controlled information. To
conform to U.S. Government export regulations applicable to that
information, applicant must either be (A) a U.S. person, defined as
a (i) U.S. citizen or national, (ii) U.S. lawful permanent resident
(green card holder), (iii) refugee under 8 U.S.C. § 1157, or (iv)
asylee under 8 U.S.C. § 1158, (B) eligible to access the export
controlled information without a required export authorization, or
(C) eligible and reasonably likely to obtain the required export
authorization from the applicable U.S. government agency. CoreWeave
may, for legitimate business reasons, decline to pursue any export
licensing process.
Keywords: CoreWeave, Concord , Senior Production Engineer, IT / Software / Systems , Sunnyvale, California