Nevin Baiju’s Post

This week I came across an opportunity to make the Llama model run faster using the beauty of AVX SIMD programming. Sometimes rethinking even simple operations like matrix multiplication can bring about a lot of improvement. I have written a detailed journal of how I modified the matmul function to achieve that. #HighPerformanceComputing #HPC #AVX #SIMDProgramming #LLAMA2 #Optimization #LLMModel #CProgramming #DeepLearning #MachineLearning #ParallelComputing #Vectorization #PerformanceOptimization #ComputationalScience #ScientificComputing

Kannan K D P

SDE-II at Amazon | Ex-Mercedes Benz | Hacking for FinTech

1y

Inspiring! Nice optimisation you have done there. I hope you submitted a pull request for this! 😊

Gireesan Namboothiri P

Works on Data Science | Ex Nissan | Ex Mahindra | Bosch | MS IIT Madras | 5 Intl. Publications | 5 Patents

1y

Nice find. Did you submit a PR?

Risad Kaipurath Puthiyapurayil

Software Developer | Looking for SDE role

1y

Inspiring work!!

Bhartendu TK

Staff Scientist @ Paypal 🛰 IIST 🌎

1y

Good work Nevin!
