Accelerating Large Language Model Decoding with Speculative Sampling
As a Staff Engineer at Kinara Ai, I specialise in building advanced compilers for our edge AI chip. My primary focus is optimising AI models. I love math, deep learning and optimisations.