d-Matrix Interview Question

LLM Quantization methods. Flash Attention