Machine Learning Engineer, Infrastructure
Job Description
Anthropic is seeking a skilled Machine Learning Engineer to focus on infrastructure. You will play a critical role in building and scaling the systems that power our advanced AI models, including Claude. The work involves developing efficient training pipelines, building robust deployment mechanisms, and optimizing hardware utilization. You'll collaborate closely with ML researchers and other engineers to ensure our infrastructure can support the next generation of AI.
**Responsibilities:**
- Design, implement, and maintain scalable ML infrastructure for training and inference.
- Optimize model performance and resource utilization on diverse hardware.
- Develop tools and frameworks to streamline the ML development lifecycle.
- Troubleshoot and resolve issues in production ML systems.
**Qualifications:**
- Bachelor's or Master's degree in Computer Science or a related quantitative field.
- Proven experience with cloud platforms (AWS, GCP, Azure).
- Strong proficiency in Python and ML frameworks (PyTorch, JAX).
- Experience with distributed systems and containerization technologies (Docker, Kubernetes).
- Familiarity with MLOps practices.
**Benefits:**
- Competitive salary and stock options.
- Excellent health, dental, and vision coverage.
- Generous PTO and paid holidays.
- Support for continuous learning and professional growth.
- Opportunity to work on highly impactful AI safety and research projects.
Skills & Tags
mlops, infrastructure, python, pytorch, jax