Together AI Interview Question

Code multi-head attention, explain how to implement speculative decoding, etc.
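
For the multi-head attention part, a minimal NumPy sketch of one possible answer is below. It assumes square projection matrices of shape (d_model, d_model), no biases, no dropout, and an optional causal mask. The function name `multi_head_attention` and its parameters are illustrative, not taken from the interview.

```python
import numpy as np

def softmax(x, axis=-1):
    # Subtract the row max for numerical stability before exponentiating.
    x = x - x.max(axis=axis, keepdims=True)
    e = np.exp(x)
    return e / e.sum(axis=axis, keepdims=True)

def multi_head_attention(x, w_q, w_k, w_v, w_o, num_heads, causal=False):
    """x: (batch, seq, d_model); each weight matrix: (d_model, d_model)."""
    batch, seq, d_model = x.shape
    assert d_model % num_heads == 0, "d_model must be divisible by num_heads"
    d_head = d_model // num_heads

    def split_heads(t):
        # (batch, seq, d_model) -> (batch, heads, seq, d_head)
        return t.reshape(batch, seq, num_heads, d_head).transpose(0, 2, 1, 3)

    q, k, v = split_heads(x @ w_q), split_heads(x @ w_k), split_heads(x @ w_v)

    # Scaled dot-product scores: (batch, heads, seq, seq)
    scores = q @ k.transpose(0, 1, 3, 2) / np.sqrt(d_head)
    if causal:
        # Mask out positions to the right of the diagonal (future tokens).
        future = np.triu(np.ones((seq, seq), dtype=bool), k=1)
        scores = np.where(future, -np.inf, scores)

    weights = softmax(scores, axis=-1)
    context = weights @ v  # (batch, heads, seq, d_head)

    # Merge heads back: (batch, seq, d_model), then apply the output projection.
    merged = context.transpose(0, 2, 1, 3).reshape(batch, seq, d_model)
    return merged @ w_o, weights

rng = np.random.default_rng(0)
d_model, heads = 8, 2
x = rng.normal(size=(2, 5, d_model))
w_q, w_k, w_v, w_o = (rng.normal(size=(d_model, d_model)) * 0.1 for _ in range(4))
out, attn = multi_head_attention(x, w_q, w_k, w_v, w_o, heads, causal=True)
print(out.shape, attn.shape)  # (2, 5, 8) (2, 2, 5, 5)
```

Points an interviewer is likely to probe: why the scores are divided by sqrt(d_head), why the heads are split with a reshape plus transpose instead of separate per-head matrices, and how the causal mask is applied before the softmax.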
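
For the speculative decoding part, the sketch below shows one round of the standard draft-then-verify scheme with rejection sampling: a small draft model proposes k tokens, the target model scores them, each token is accepted with probability min(1, p/q), and the first rejected token is resampled from the normalized residual max(0, p - q). The toy models `draft_probs` and `target_probs`, the vocabulary size, and the function names are hypothetical stand-ins; a real implementation would score all k drafted positions in a single batched forward pass of the target model.

```python
import random

VOCAB = 4  # hypothetical toy vocabulary size

def target_probs(ctx):
    # Toy "large" model: strongly prefers the token after the last one.
    last = ctx[-1] if ctx else 0
    probs = [0.1] * VOCAB
    probs[(last + 1) % VOCAB] = 0.7
    return probs

def draft_probs(ctx):
    # Toy "small" model: same preference, but less confident.
    last = ctx[-1] if ctx else 0
    probs = [0.2] * VOCAB
    probs[(last + 1) % VOCAB] = 0.4
    return probs

def sample(weights, rng):
    # random.choices accepts unnormalized, non-negative weights.
    return rng.choices(range(len(weights)), weights=weights, k=1)[0]

def speculative_step(prefix, draft_probs, target_probs, k, rng):
    """One round: returns between 1 and k + 1 new tokens."""
    # 1. The draft model proposes k tokens autoregressively.
    drafted, q_dists, ctx = [], [], list(prefix)
    for _ in range(k):
        q = draft_probs(ctx)
        tok = sample(q, rng)
        drafted.append(tok)
        q_dists.append(q)
        ctx.append(tok)

    # 2. The target model scores every drafted position plus one extra.
    p_dists = [target_probs(list(prefix) + drafted[:i]) for i in range(k + 1)]

    # 3. Accept or reject left to right.
    accepted = []
    for i, tok in enumerate(drafted):
        p, q = p_dists[i], q_dists[i]
        if rng.random() < min(1.0, p[tok] / q[tok]):
            accepted.append(tok)
            continue
        # Rejected: resample from the residual distribution and stop.
        residual = [max(0.0, pi - qi) for pi, qi in zip(p, q)]
        if sum(residual) == 0.0:  # guard against floating-point edge cases
            residual = p
        accepted.append(sample(residual, rng))
        return accepted

    # All k drafts accepted: take one bonus token from the target model.
    accepted.append(sample(p_dists[k], rng))
    return accepted

def generate(prefix, max_new_tokens, k=4, seed=0):
    rng = random.Random(seed)
    out = list(prefix)
    while len(out) - len(prefix) < max_new_tokens:
        out.extend(speculative_step(out, draft_probs, target_probs, k, rng))
    return out[: len(prefix) + max_new_tokens]

print(generate([0], max_new_tokens=12))
```

The accept/resample rule is what makes the output distribution match sampling from the target model alone; the draft model only affects how many tokens are accepted per round, and therefore the speedup.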