Featured posts

MLA Architecture
Research 04.27.25
The RoPE Compatibility Problem in DeepSeek's Multi Head Latent Attention
Read More →
Matrix Multiplications
Analysis 01.01.25
Analysis of Matrix Multiplications in Transformer Architectures
Read More →

Less readworthy posts

- Aakash Varma