Fireworks AI


Founded in 2022 and valued at over $500 million, Fireworks AI provides generative AI services for enterprises and developers, including large model fine-tuning, inference, and deployment.

Language: en
Collection time: 2025-06-14

Fireworks AI Company Profile

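Fireworks AI is a US generative AI startup founded in 2022 that provides large model fine-tuning, inference, and deployment services for enterprises and developers. Most of its founding team come from large technology companies such as Meta and Google. Founder and CEO Lin Qiao graduated from Fudan University, holds a PhD in computer science from the University of California, Santa Barbara, previously led PyTorch at Meta, and also has technical experience at LinkedIn and IBM. Drawing on the experience and technical strength built up over years at Meta, the team aims to help enterprises adopt AI quickly. Fireworks AI developed a custom inference engine, FireAttention, which it says cuts inference time by 12x compared with the open-source vLLM and lowers usage costs. The company has attracted numerous investors and was valued at $552 million after its Series B round in 2024.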

Core business and technology

  • Large model service: Provides over 100 advanced text, image, audio, and multimodal models, covering large language models, image generation models, audio generation models, video generation models, embedding models, and more, all optimized for latency, throughput, and cost.
  • Model fine-tuning: Helps developers quickly customize models through fast LoRA fine-tuning. Going from dataset preparation to querying the fine-tuned model reportedly takes only minutes, and the fine-tuned model can be deployed directly into existing business processes.
  • Inference optimization: Uses semantic caching to avoid duplicate computation, and captures application workload patterns and builds them into an inference stack that adapts automatically to developer or enterprise workloads.
  • Cost and efficiency: The company cites a 12x reduction in inference time compared to traditional methods and a 40x reduction compared to GPT-4; 140 billion tokens processed per day with 99.99% API uptime; RAG speed 9x higher than Groq; SDXL image generation 6x faster than the average of other providers; speculative decoding speeds of up to 1,000 tokens/sec; and costs five times lower than the original open-source model, or thirty times lower with further fine-tuning.
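
The semantic caching idea mentioned in the list above can be illustrated with a minimal Python sketch. This is not Fireworks AI's implementation, which the profile does not describe. The names `SemanticCache`, `embed`, and `fake_model`, the bag-of-words embedding, and the 0.8 similarity threshold are all assumptions made for illustration; a real system would presumably use a learned embedding model and a vector index.

```python
import math
import re
from collections import Counter


def embed(text):
    """Toy embedding (assumption): lowercase bag-of-words counts.
    A real system would call an embedding model here."""
    return Counter(re.findall(r"[a-z0-9]+", text.lower()))


def cosine(a, b):
    """Cosine similarity between two sparse count vectors."""
    dot = sum(a[t] * b[t] for t in a if t in b)
    norm_a = math.sqrt(sum(v * v for v in a.values()))
    norm_b = math.sqrt(sum(v * v for v in b.values()))
    if norm_a == 0 or norm_b == 0:
        return 0.0
    return dot / (norm_a * norm_b)


class SemanticCache:
    """Hypothetical cache: reuse a stored response when a new prompt
    is similar enough to one already answered."""

    def __init__(self, model_fn, threshold=0.8):
        self.model_fn = model_fn      # the expensive call we want to avoid
        self.threshold = threshold    # illustrative value, not a tuned one
        self.entries = []             # list of (embedding, response)
        self.hits = 0
        self.misses = 0

    def query(self, prompt):
        vec = embed(prompt)
        best_score, best_response = 0.0, None
        # Linear scan for the most similar cached prompt.
        for cached_vec, response in self.entries:
            score = cosine(vec, cached_vec)
            if score > best_score:
                best_score, best_response = score, response
        if best_response is not None and best_score >= self.threshold:
            self.hits += 1
            return best_response
        # Cache miss: run the model and remember the result.
        self.misses += 1
        response = self.model_fn(prompt)
        self.entries.append((vec, response))
        return response


def fake_model(prompt):
    """Stand-in for a real inference call (assumption)."""
    return f"answer to: {prompt}"


if __name__ == "__main__":
    cache = SemanticCache(fake_model)
    cache.query("What is LoRA fine-tuning?")
    cache.query("what is lora fine-tuning")  # near-duplicate, served from cache
    print(cache.hits, cache.misses)
```

The threshold trades reuse against the risk of returning an answer to a question that only looks similar, so in practice it would need tuning per workload.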

Market Performance and Customers

  • Financing: In July 2024, the company closed a $52 million Series B round led by Sequoia Capital, with participation from NVIDIA, AMD, MongoDB, and others. The round valued the company at $552 million and brought total funding to $77 million.
  • Client base: Enterprise customers include Cresta, Cursor, Liner, DoorDash, Quora, and Upwork, along with clients such as Superhuman; the company builds customized quantization solutions for specific use cases.

Team & Background

  • Founder's background: Lin Qiao, the founder and CEO, graduated from Fudan University, holds a PhD in computer science from the University of California, Santa Barbara, previously led PyTorch at Meta, and has technical experience at LinkedIn and IBM. Most team members come from Meta, Google, and other large technology companies, and more than one-third of the team is Chinese.
  • Technical expertise: The team aims to help enterprises adopt AI quickly, drawing on the experience and technical strength built up over years at Meta.

Vision for the future

  • Building a comprehensive knowledge access API: Working on a powerful API that accurately calls different models and APIs to provide access to the full range of knowledge.
  • Expanding teams and partnerships: Plans to use the new funding to grow the team and to expand partnerships with AI companies, driving the industry's shift to compound AI systems.
