Ravi Shankar’s Post

Manager II - Machine Learning Product Discovery @ Bed Bath & Beyond | Machine Learning, Computer Vision, NLP, LLMs

Deploying Llama 2 models can be expensive, but AWS offers purpose-built hardware in its Inferentia family that supports cost-effective deployment. The blog post covers running Llama 2 on Amazon EC2 Inf2 instances, powered by AWS Inferentia2, for both training and inference. It gives detailed steps for creating, compiling, and deploying the Llama 2 model with the latest AWS Neuron SDK release, achieving high performance at low cost. https://lnkd.in/gDib5znW

PyTorch

pytorch.org

Kaushik Tummalapalli

Machine Learning Engineer @ CVS Health | NYU Alum | AI Engineering | Data Science Enthusiast |

8mo

Thanks for sharing Ravi Shankar!
