Aleph Alpha is hiring a Senior AI Researcher to work on the core technical problems of large-scale pre-training. The role involves designing inference-efficient architectures, optimising training recipes, and training models on a large scale cluster. The team culture is built on ownership, autonomy, and empowerment, with a flat organisational structure and efficient management.
Requirements
- Proficiency in Python and PyTorch-based training workflows
- Strong track record in machine learning research and software engineering
- Strong mathematical foundation and ability to reason formally about optimisation, scaling behaviour, and training dynamics
- Experience with transformer training dynamics, optimisation, and large distributed training jobs
- Ability to design rigorous experiments, reason clearly from noisy results, and translate empirical observations into robust training decisions
- Strong software engineering practices, including writing maintainable, well-tested code and supporting reproducible experimentation workflows
- Ability to implement complex model architectures efficiently and reliably and to debug complex issues across model code, training dynamics, and distributed systems
- Effective collaboration within a research and engineering team and clear communication about work across Pre-training and the broader AAR/AA organization
Benefits
- 30 days of paid vacation
- Access to a variety of fitness & wellness offerings via Wellhub
- Mental health support through nilo.health
- Substantially subsidized company pension plan for your future security
- Subsidized Germany-wide transportation ticket
- Budget for additional technical equipment
- Flexible working hours for better work-life balance and hybrid working model
- Virtual Stock Option Plan
- JobRad® Bike Lease