Principal MLOps Engineer, AI Inference
Company: Red Hat
Location: Boston
Posted on: March 22, 2025
Job Description:
Job Summary
At Red Hat we believe the future of AI is open, and we are on a
mission to bring the power of open-source LLMs and vLLM to every
enterprise. The Red Hat Inference team accelerates AI for the
enterprise and brings operational simplicity to GenAI deployments.
As leading developers, maintainers of the vLLM project, and
inventors of state-of-the-art techniques for model compression, our
team provides a stable platform for enterprises to build, optimize,
and scale LLM deployments.

We are seeking an experienced MLOps engineer to work closely with
our product and research teams to scale SOTA deep learning products
and software. As an MLOps engineer, you will work closely with our
technical and research teams to manage training and deployment
pipelines, create DevOps and CI/CD infrastructure, and scale our
current technology stack. If you are someone who wants to
contribute to solving challenging technical problems at the
forefront of deep learning, this is the role for you!

In this role, your primary responsibility will be to build and
release the Red Hat AI Inference runtimes, continuously improve the
processes and tooling used by the DevOps team, and find
opportunities to automate procedures and tasks.

What you will do
- Collaborate with research and product development teams to
scale machine learning products for internal and external
applications
- Create and manage training and deployment pipelines
- Test to ensure correctness, responsiveness, and efficiency
- Troubleshoot, debug and upgrade Dev & Test pipelines
- Identify and deploy cybersecurity measures by continuously
performing vulnerability assessments and risk management
- Collaborate with a cross-functional team on market
requirements and best practices
- Keep abreast of the latest technologies and standards in the
field

What you will bring
- 5+ years of experience in MLOps, DevOps, automation, and modern
software deployment practices
- Strong experience with Git, GitHub Actions (including
self-hosted runners), Terraform, Jenkins, and common technologies
for automation and monitoring
- Experience with Kubernetes/OpenShift
- Experience with Agile methodology
- Experience with cloud computing on at least one of the
following cloud platforms: AWS, GCP, Azure, or IBM Cloud
- Strong programming skills with proven experience implementing
Python-based machine learning solutions
- Solid troubleshooting skills
- Ability to interact comfortably with the other members of a
large, geographically dispersed team
- Experience maintaining an infrastructure and ensuring stability
while adding new features
- Familiarity with contributing to the vLLM community is a
plus
- While a Bachelor's degree or higher in computer science,
mathematics, or a related discipline is valued, we prioritize
technical prowess, initiative, problem solving, and practical
experience

The salary range for this position is $170,770.00 -
$281,770.00. Actual offer will be based on your qualifications.

Pay Transparency
Red Hat determines compensation based on several
factors including but not limited to job location, experience,
applicable skills and training, external market value, and internal
pay equity. Annual salary is one component of Red Hat's
compensation package. This position may also be eligible for bonus,
commission, and/or equity. For positions with Remote-US locations,
the actual salary range for the position may differ based on
location but will be commensurate with job duties and relevant work
experience.

About Red Hat
Red Hat is the world's leading provider of
enterprise open source software solutions, using a
community-powered approach to deliver high-performing Linux, cloud,
container, and Kubernetes technologies. Spread across 40+
countries, our associates work flexibly across work environments,
from in-office, to office-flex, to fully remote, depending on the
requirements of their role. Red Hatters are encouraged to bring
their best ideas, no matter their title or tenure. We're a leader
in open source because of our open and inclusive environment. We
hire creative, passionate people ready to contribute their ideas,
help solve complex problems, and make an impact.

Benefits
- Comprehensive medical, dental, and vision coverage
- Flexible Spending Account - healthcare and dependent care
- Health Savings Account - high deductible medical plan
- Retirement 401(k) with employer match
- Paid time off and holidays
- Paid parental leave plans for all new parents
- Leave benefits including disability, paid family medical leave,
and paid military leave
- Additional benefits including employee stock purchase plan,
family planning reimbursement, tuition reimbursement,
transportation expense account, employee assistance program, and
more!

Note: These benefits are only applicable to full time, permanent
associates at Red Hat located in the United States.

Diversity, Equity & Inclusion at Red Hat
Red Hat's culture is built on the open source principles of
transparency, collaboration, and inclusion, where the best ideas
can come from anywhere and anyone. When this is realized, it
empowers people from diverse backgrounds, perspectives, and
experiences to come together to share ideas, challenge the status
quo, and drive innovation. Our aspiration is that everyone
experiences this culture with equal opportunity and access, and
that all voices are not only heard but also celebrated. We hope you
will join our celebration, and we welcome and encourage applicants
from all the beautiful dimensions of diversity that compose our
global village.

Equal Opportunity Policy (EEO)
Red Hat is proud to be
an equal opportunity workplace and an affirmative action employer.
We review applications for employment without regard to their race,
color, religion, sex, sexual orientation, gender identity, national
origin, ancestry, citizenship, age, veteran status, genetic
information, physical or mental disability, medical condition,
marital status, or any other basis prohibited by law.