Senior ML Infrastructure Engineer
Company: Hippocratic AI
Location: Palo Alto
Posted on: May 24, 2025
Job Description:
About Us:
Hippocratic AI has developed a safety-focused Large
Language Model (LLM) for healthcare. The company believes that a
safe LLM can dramatically improve healthcare accessibility and
health outcomes in the world by bringing deep healthcare expertise
to every human. No other technology has the potential to have this
level of global impact on health.
Why Join Our Team:
- Innovative Mission: We are developing a safe,
healthcare-focused large language model (LLM) designed to
revolutionize health outcomes on a global scale.
- Visionary Leadership: Hippocratic AI was co-founded by CEO
Munjal Shah, alongside a group of physicians, hospital
administrators, healthcare professionals, and artificial
intelligence researchers from leading institutions, including El
Camino Health, Johns Hopkins, Stanford, Microsoft, Google, and
NVIDIA.
- Strategic Investors: We have raised a total of $278 million in
funding, backed by top investors such as Andreessen Horowitz,
General Catalyst, Kleiner Perkins, NVIDIA's NVentures, Premji
Invest, SV Angel, and six health systems.
- World-Class Team: Our team is composed of leading experts in
healthcare and artificial intelligence, ensuring our technology is
safe, effective, and capable of delivering meaningful improvements
to healthcare delivery and outcomes.
Position Overview:
We are seeking a skilled ML Infrastructure Engineer to help design, build,
and maintain a robust orchestration platform for managing a diverse
set of Large Language Models (LLMs). The ideal candidate will have
hands-on experience with infrastructure orchestration tools such as
Kubernetes and Terraform, as well as a strong understanding of
multi-cloud environments. This role offers the opportunity to work
on cutting-edge technologies and play a key part in scaling our AI
infrastructure.
Key Responsibilities:
Infrastructure Development & Maintenance:
- Build and maintain infrastructure for deploying and managing
LLMs at scale.
- Implement automated processes using Kubernetes and
Infrastructure as Code (IAC) tools like Terraform.
Orchestration Platform Support:
- Contribute to the development and optimization of an
orchestration platform for managing a heterogeneous set of
LLMs.
- Monitor and troubleshoot issues in the platform to ensure high
availability and performance.
Cloud Integration:
- Deploy and manage resources across multiple cloud platforms
(e.g., AWS, Azure, Google Cloud).
- Optimize cloud resource usage for cost efficiency and
scalability.
Collaboration:
- Work closely with ML engineers and DevOps teams to ensure
smooth deployment and operation of AI models.
- Provide feedback on system designs and recommend improvements
to infrastructure workflows.
Performance Monitoring:
- Implement tools and processes to monitor system health,
identify bottlenecks, and improve model lifecycle management.
- Perform capacity planning to support growing infrastructure
needs.
Qualifications:
Technical Skills:
- 3-5 years of experience in infrastructure engineering, DevOps,
or a related field.
- Experience with enterprise GPUs such as H200, H100, A100.
- Proficiency with Kubernetes, Terraform, and other IAC
tools.
- Familiarity with multi-cloud environments and cloud-native
services (e.g., AWS Lambda, Google Cloud Run, Azure
Functions).
- Programming skills in Python, Bash, or a similar language for
automation and scripting.
- Basic understanding of ML workflows and frameworks like
TensorFlow, PyTorch, or Hugging Face is a plus.
Soft Skills:
- Strong problem-solving skills and attention to detail.
- Good communication and collaboration abilities to work
effectively with cross-functional teams.
- Eagerness to learn new technologies and improve existing
systems.
Education & Experience:
- Bachelor's degree in Computer Science, Engineering, or a
related field (or equivalent work experience).