Open call · № 04

Member of Technical Staff, Mid-training.

Munich · full-time · remote

Instruction-tuned models are not directly suited to raw computer-use trajectories. Mid-training adapts the base model to our data before downstream RL.

This role owns domain adaptation to human screencasts, supervised fine-tuning on hindsight-enriched trajectories, catastrophic-forgetting mitigation, optimizer choices, and data mixtures.

You will design the SFT corpus, maintain the training recipe, build the eval loop, and decide when an initialization is ready for downstream training.

Apply

Email franz@pdoom.org with five bullet points about evidence of exceptional ability.