Kili Technology’s Post

Kili Technology reposted this

View profile for Paul G., graphic

AI Alignment and Safety @Kili

In light of the huge AI announcements during Apple's #WWDC, it's a great time to explore on-device language models, starting with Apple's open-sourced #OpenELM model shared in April. 🔹 Training Overview: Apple's OpenELM model is designed to run efficiently on personal devices, offering robust natural language understanding and generation capabilities. 🔹 Datasets: The model used a mix of diverse and open-sourced datasets: RefinedWeb, PILE, RedPajama, and Dolma v1.6. 🔹 Training Methods: They did on-the-fly tokenization where they dynamically tokenized and filtered text during training so they can quickly experiment and iterate faster. 🔹 Instruction Tuning: For fine-tuning they used UltraFeedback, specifically designed to improve language models through #RLHF. Dive deeper here: https://lnkd.in/e7K6-eka #wwdc #AI #LLM

  • No alternative text description for this image

To view or add a comment, sign in

Explore topics