Co-founder at Voliom | Empowering Startups by Scaling Software Development Teams | Custom Software Development AI ML DevOps Cloud Big Data
Check out Google DeepMind's Gemini 1.5 Pro model. With its 1 million token context window, it lets helper robots navigate busy environments by recalling details from a video tour and following complex instructions. Whether it's finding a specific desk or remembering a favorite drink, this technology is pushing the boundaries of what's possible. #google #ai #robotic
“Hey robot, take me somewhere I can draw?” 🤖 We challenged our helper robots to navigate their way around a busy space - using Gemini 1.5 Pro. With the model’s 1 million token context window, it’s able to recall an environment after watching a video tour, and successfully followed a range of instructions - from finding a specific desk to remembering a favorite drink. Find out more in our latest paper → https://dpmd.ai/4bUobbj