Fireworks AI

Software Development

Redwood City, CA 15,003 followers

Generative AI platform empowering developers and businesses to scale at high speeds

About us

Fireworks.ai offers a generative AI platform as a service. We optimize for rapid product iteration on top of generative AI while minimizing cost to serve. https://fireworks.ai/careers

Website
http://fireworks.ai
Industry
Software Development
Company size
11-50 employees
Headquarters
Redwood City, CA
Type
Privately Held
Founded
2022
Specialties
LLMs and Generative AI


Updates

  • Fireworks AI reposted this

    Lin Qiao, CEO and cofounder of Fireworks AI:

    🔥 Announcing FireOptimizer/Multi-LoRA 🔥 I didn't expect that what I considered a small feature when we launched it last year would deliver such a powerful impact for our customers. I'm excited to announce Multi-LoRA, an important component of FireOptimizer. Personalized experiences are critical to driving greater usage, retention, and customer satisfaction for your product. Without Multi-LoRA, deploying hundreds of fine-tuned models on separate GPUs would be prohibitively expensive. With Multi-LoRA, you can now deliver personalized experiences across thousands of users and use cases without scaling your costs! More specifically, Multi-LoRA delivers the following benefits:
    -- Fine-tune and serve hundreds of personalized LoRA models at the same cost as a single base model, which is just $0.2/1M tokens for Llama3.1 8B
    -- 100x cost-efficiency compared to serving 100 fine-tuned models without Multi-LoRA on other platforms with per-GPU pricing
    -- Convenient deployment on Fireworks Serverless with per-token pricing and competitive inference speeds, or on Fireworks On-Demand and Reserved for larger workloads
    Multi-LoRA is part of FireOptimizer, our adaptation engine designed to customize and enhance AI model performance for your unique use cases and workloads. FireOptimizer capabilities include Adaptive Speculative Execution (https://lnkd.in/ejdD-wGG), which enables up to 3x latency improvements; Customizable Quantization (https://lnkd.in/dwpTU233), to precisely balance speed and quality; and LoRA Fine-Tuning (https://lnkd.in/et2UFzDy), to customize and improve model performance.
    ⚡ Cresta uses Multi-LoRA to personalize its Knowledge Assist feature for each individual customer on the Fireworks enterprise platform. "Fireworks' Multi-LoRA capabilities align with Cresta's strategy to deploy custom AI through fine-tuning cutting-edge base models. It helps unleash the potential of AI on private enterprise data." - Tim Shi, Co-Founder and CTO of Cresta
    ⚡ Brainiac Labs helps businesses leverage their proprietary data to fine-tune and deploy models using Multi-LoRA on the Fireworks self-serve platform. "Using Fireworks, clients with limited AI expertise can successfully maintain and improve the solutions I provide. Additionally, students in my course are able to complete real-world fine-tuning projects, dedicating just a few hours per week to the process." - Scott Kramer, CEO of Brainiac Labs
    👉 Read more in our blog post: https://lnkd.in/d3_HGRqy
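    The cost model above (hundreds of adapters served at base-model per-token pricing) shows up in practice as little more than a model-id switch per request. A minimal Python sketch, assuming Fireworks' OpenAI-compatible chat-completions endpoint; the account and adapter names below are placeholders, not real deployments:

    ```python
    import json

    # Fireworks exposes an OpenAI-compatible chat-completions API; with Multi-LoRA,
    # each request simply names its adapter while all adapters share one base model.
    FIREWORKS_CHAT_URL = "https://api.fireworks.ai/inference/v1/chat/completions"

    def lora_request(adapter_model_id: str, user_message: str) -> dict:
        """Build a chat-completions payload targeting one LoRA adapter."""
        return {
            "model": adapter_model_id,  # placeholder id, e.g. per-customer adapter
            "messages": [{"role": "user", "content": user_message}],
            "max_tokens": 256,
        }

    # Per-customer personalization: one adapter per customer, same serving cost.
    payloads = [
        lora_request(f"accounts/my-team/models/customer-{i}-lora", "Summarize my ticket")
        for i in range(3)
    ]
    print(json.dumps(payloads[0], indent=2))
    ```

    Each payload would be POSTed to `FIREWORKS_CHAT_URL` with an API key; only the `model` field varies across users.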

  • View organization page for Fireworks AI

    We’re beyond thrilled to share that Fireworks AI has been named #10 on Fast Company’s list of the world’s most innovative AI companies! This recognition highlights our core mission: empowering developers to easily build GenAI applications on state-of-the-art open models. We are honored to be mentioned alongside industry leaders and our partners NVIDIA, Amazon, Google, and Mistral AI. Join our exceptional team of innovators. Find your perfect role on our careers page and become part of our rapidly growing success story 🚀 https://lnkd.in/geVW6EFk

  • Fireworks AI reposted this

    Lin Qiao, CEO and cofounder of Fireworks AI:

    🔥 Fireworks AI matches DeepSeek AI pricing 🔥 After significant performance optimization over the past two months, we are excited to pass our efficiency improvements back to our users. We have launched two additional DeepSeek R1 tiers alongside the current one:
    1. Base: matches the price of the original DeepSeek API on the self-serve platform.
    2. Ultra-fast: optimized through FireOptimizer for specific workloads to maximize speed. Enterprise only.
    Since R1 is an important logical-reasoning model, we have been continuously adding features to make development easy on the Fireworks AI developer cloud: https://lnkd.in/eX5cy7D6 Tell us what other features are on your mind. We will build what you need.

    Lin Qiao, CEO and cofounder of Fireworks AI:

    🔥 Fireworks AI Developer Cloud for DeepSeek AI models 🐳 Fireworks' mission is to provide the best developer toolchain built on open models, for transparency, steerability, control, privacy, low latency, and low cost. Within one month, Fireworks launched a comprehensive AI developer cloud for DeepSeek models. Here is a list of launched product features:
    👉 Launched DeepSeek models on the same day as the open-weights release!
    👉 Latency and cost optimization:
    🪄 Fireworks continuously pushes top performance and cost efficiency with a special version of FireAttention and a distributed inference engine optimized for DeepSeek's unique MLA, MTP, and wide MoE architecture.
    🪄 Controllable reasoning effort: we added shorter and better CoT for R1 via reasoning_effort = low
    👉 Agentic development:
    🪄 Agentic multi-modal workflow: we added vision capability to DeepSeek V3 and R1
    🪄 Agentic tool use: we added function calling to DeepSeek V3 so it can integrate easily with other tools and APIs for agent development
    🪄 Constrained generation: we added JSON mode and Grammar mode to DeepSeek V3 and R1
    👉 Model quality enhancement:
    🪄 Additional DeepSeek derivative models: launched Perplexity R1-1776, with higher accuracy for deep research, and many tuned DeepSeek models in production
    👉 Research reproduction:
    🪄 Reinforcement learning with verifiable rewards using minimal labels
    🪄 Distillation: R1 doing better than human
    We have many more features to launch soon, including a DeepSeek SFT and RL tuning platform as part of the FireOptimizer [https://lnkd.in/ejdD-wGG] stack. We will release many real-world demos to get you up to speed on the DeepSeek developer platform. Stay tuned! Please comment below with your wish list. I would love to hear from you.
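    Two of the features above, controllable reasoning effort and JSON-mode constrained generation, can be sketched as a single request payload. This is a minimal sketch assuming the OpenAI-compatible convention Fireworks follows; the exact spellings of `reasoning_effort` and `response_format` are taken from the post and should be verified against the Fireworks docs:

    ```python
    import json

    # Illustrative chat-completions payload combining the post's R1 features:
    # shorter chain-of-thought (reasoning_effort) plus JSON-constrained output.
    payload = {
        "model": "accounts/fireworks/models/deepseek-r1",  # model id form is an assumption
        "messages": [
            {"role": "user", "content": "Extract the invoice total as JSON."}
        ],
        # Controllable reasoning effort: shorter, tighter CoT for R1.
        "reasoning_effort": "low",
        # Constrained generation (JSON mode): force well-formed JSON output.
        "response_format": {"type": "json_object"},
    }
    print(json.dumps(payload, indent=2))
    ```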

  • View organization page for Fireworks AI

    Fireworks AI matches DeepSeek pricing for R1, with secure deployments in the EU and US. Excited to share the latest enhancements to our DeepSeek R1 offerings:
    💡 Base DeepSeek R1: Cost-effective, high-quality throughput for real-time applications (endpoint: deepseek-r1-basic)
    🚀 Ultra-Fast DeepSeek R1: Up to 130 tokens/sec for lightning-fast interactions on Fireworks Enterprise.
    ⚡ Fast DeepSeek R1: Balanced performance at 90 tokens/sec, optimized for interactive applications on Fireworks Serverless (endpoint: deepseek-r1)
    With specialized versions of FireAttention and tailored distributed inference, we're pushing the envelope for speed, efficiency, and cost-effectiveness in agentic products. More innovations coming soon with Blackwell GPUs! Explore our optimized DeepSeek deployments here: https://lnkd.in/edJh9MXb
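    Switching tiers is a matter of which endpoint id a request names. A small sketch using the endpoint ids from the post; the `accounts/fireworks/models/` prefix is the usual Fireworks model-id form and is an assumption to check against the docs (the ultra-fast tier is an Enterprise deployment rather than a public serverless endpoint, so it is omitted):

    ```python
    # Map a desired tier to the R1 endpoint id named in the announcement.
    def r1_model_id(tier: str) -> str:
        endpoints = {
            "basic": "deepseek-r1-basic",  # cost-effective throughput tier
            "fast": "deepseek-r1",         # ~90 tokens/sec on Fireworks Serverless
        }
        return f"accounts/fireworks/models/{endpoints[tier]}"

    print(r1_model_id("basic"))
    print(r1_model_id("fast"))
    ```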

  • Fireworks AI reposted this

    We're thrilled to announce a groundbreaking integration: Fireworks AI now seamlessly integrates with NVIDIA NIM microservices, powered by NVIDIA AI Enterprise. This means enterprises can rapidly deploy advanced AI models, accelerating innovation and driving competitive advantage like never before. Here's why this is game-changing:
    → Unmatched performance: Supercharge your AI capabilities with industry-leading open-source models like DeepSeek and Llama.
    → Expanded possibilities: Instantly access NVIDIA NIM's extensive AI offerings, including embeddings, video processing, and 3D modeling.
    → Effortless integration: Use powerful NVIDIA Llama Nemotron Reasoning models within Fireworks AI.
    Learn more about how Fireworks AI supports NVIDIA NIM deployments for blazing AI inference: https://lnkd.in/dtndSbhv


  • View organization page for Fireworks AI

    🚀 Announcing DeepSeek R1 & V3 Fine-Tuning on Fireworks AI. Fine-tuning state-of-the-art open models has never been easier. With DeepSeek R1 & V3 fine-tuning now available on Fireworks, you can tailor model behavior to your specific use case, with a seamless path to dedicated deployment. Key benefits of DeepSeek fine-tuning on Fireworks:
    ✅ Quantization-Aware Tuning (QAT): Ensures high accuracy, efficiency, and training speed.
    ✅ Seamless Model Alignment: QAT minimizes discrepancies between training and deployment performance.
    ✅ Optimized for Large-Scale Models: Efficiently manages memory and complexity in Mixture of Experts architectures.
    ✅ Effortless Deployment: Fine-tuned models require dedicated deployments, fully supported on Fireworks.
    👉 With just three lines of code, you can fine-tune and deploy your model with ease. Check out this blog to read more: https://lnkd.in/dfKmRtWq
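    As a rough illustration of what such a fine-tuning job specifies, here is a hypothetical job spec capturing the points above (base model, dataset, LoRA method, QAT). Every field name is illustrative rather than the actual Fireworks fine-tuning API; consult the linked blog for the real three lines:

    ```python
    import json

    # Hypothetical supervised fine-tuning job spec -- field names are illustrative only.
    job_spec = {
        "base_model": "accounts/fireworks/models/deepseek-v3",  # placeholder model id
        "dataset": "my-support-transcripts",       # placeholder dataset id
        "method": "lora",                          # LoRA adapters keep serving cheap
        "quantization_aware_training": True,       # QAT: train in deployed precision
    }
    print(json.dumps(job_spec, indent=2))
    ```

    QAT is the key alignment step: by training in the same precision the model is later served at, the tuned weights see no quality cliff at deployment time.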

  • A smart reasoning LLM is good, but a smart reasoning VLM is better! We're thrilled to share a demo showcasing an "AI Research Assistant" built with DeepSeek R1, which can now reason over both text and images thanks to Fireworks AI's new Document Inlining feature! Check out the demo below and experience the future of multimodal AI reasoning! 👇 Read the detailed blog here: https://lnkd.in/dZbi9ETD Get started building your use cases with the DeepSeek R1 model on Fireworks AI: https://lnkd.in/g9Xt4grp
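    Document Inlining, as described in the Fireworks blog, works by appending `#transform=inline` to a document URL inside a standard multimodal message: the platform transcribes the image or PDF into text that a text-only reasoning model like R1 can consume. A sketch of such a request payload, with placeholder URL and an assumed model-id form:

    ```python
    import json

    # Multimodal R1 request using Document Inlining: the "#transform=inline"
    # fragment on the document URL asks Fireworks to inline the document as text.
    payload = {
        "model": "accounts/fireworks/models/deepseek-r1",  # assumed model id form
        "messages": [{
            "role": "user",
            "content": [
                {"type": "text", "text": "Summarize the key findings in this paper."},
                {"type": "image_url",
                 "image_url": {"url": "https://example.com/paper.pdf#transform=inline"}},
            ],
        }],
    }
    print(json.dumps(payload, indent=2))
    ```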

  • Building an AI agent with reasoning capability is as simple as 👇 We are excited to share this demo built by Shane Thomas using Mastra and Fireworks AI with the DeepSeek R1 model. Mastra is a TypeScript agentic AI framework that lets you build intelligent agents with persistent memory, robust state management, contextual data integration, and transparent tracking. Check out the project here: https://lnkd.in/dvfk-yqe
