AN
Senior Software Engineer, Machine Learning Platform
Job Description
Anthropic is building reliable, interpretable, and steerable AI systems. We are looking for experienced Software Engineers to join our ML Platform team. You will play a critical role in building and scaling the infrastructure that enables our researchers and engineers to develop, train, and deploy state-of-the-art AI models efficiently and safely. This includes MLOps, data pipelines, distributed training, and inference optimization.
Responsibilities:
- Design, build, and maintain robust and scalable ML infrastructure and tooling.
- Develop and optimize data pipelines for training and evaluating large AI models.
- Implement and improve systems for distributed training and efficient inference.
- Collaborate closely with ML researchers and engineers to understand their needs and provide effective solutions.
- Ensure the reliability, performance, and security of our ML platforms.
Minimum Qualifications:
- Bachelor's degree in Computer Science, Engineering, or a related field, or equivalent practical experience.
- 5+ years of experience in software engineering, with a focus on backend systems or infrastructure.
- Strong programming skills in Python or Go.
Preferred Qualifications:
- Experience with MLOps best practices and tools (e.g., Kubeflow, MLflow).
- Familiarity with cloud platforms (AWS, GCP, Azure) and containerization (Docker, Kubernetes).
- Experience with distributed computing frameworks (e.g., Ray, Spark).
- Understanding of ML workflows and common challenges in large-scale model development.
Skills & Tags
ml platformmlopsinfrastructurepythongo