Aleph Alpha is hiring a Senior AI Researcher to work on the core technical problems of large-scale pre-training. The role involves designing inference-efficient architectures, optimising training recipes, and training models on a large scale cluster. The team culture is built on ownership, autonomy, and empowerment, with a flat organisational structure and efficient management.

Requirements

  • Proficiency in Python and PyTorch-based training workflows
  • Strong track record in machine learning research and software engineering
  • Strong mathematical foundation and ability to reason formally about optimisation, scaling behaviour, and training dynamics
  • Experience with transformer training dynamics, optimisation, and large distributed training jobs
  • Ability to design rigorous experiments, reason clearly from noisy results, and translate empirical observations into robust training decisions
  • Strong software engineering practices, including writing maintainable, well-tested code and supporting reproducible experimentation workflows
  • Ability to implement complex model architectures efficiently and reliably and to debug complex issues across model code, training dynamics, and distributed systems
  • Effective collaboration within a research and engineering team and clear communication about work across Pre-training and the broader AAR/AA organization

Benefits

  • 30 days of paid vacation
  • Access to a variety of fitness & wellness offerings via Wellhub
  • Mental health support through nilo.health
  • Substantially subsidized company pension plan for your future security
  • Subsidized Germany-wide transportation ticket
  • Budget for additional technical equipment
  • Flexible working hours for better work-life balance and hybrid working model
  • Virtual Stock Option Plan
  • JobRad® Bike Lease