Braintrust

braintrust.dev

Founded Year

2023

Stage

Series A | Alive

Total Raised

$44.3M

Valuation

$0000

Last Raised

$36M | 1 yr ago

Mosaic Score
The Mosaic Score is an algorithm that measures the overall financial health and market potential of private companies.

+289 points in the past 30 days

About Braintrust

Braintrust is a technology company that builds a platform for developing AI applications within the artificial intelligence sector. The company provides tools for evaluating and managing large linguistic models, including prompt management, performance tracking, and dataset management. Braintrust's solutions include features such as real-time execution trace visualization, monitoring, and the option for self-hosting to meet data control and compliance needs. It was founded in 2023 and is based in San Francisco, California.

Headquarters Location

548 Market Street

San Francisco, California, 94104,

United States

707-682-7588

ESPs containing Braintrust

The ESP matrix leverages data and analyst insight to identify and rank leading companies in a given technology landscape.

AI agent observability, evaluation, & governance

Enterprise Tech / Enterprise Applications

The AI agent observability, evaluation, & governance market provides platforms and tools to monitor, test, and ensure the quality of AI agent systems in production environments. These solutions offer real-time tracking, benchmarking against industry standards, automated testing, and comprehensive analytics to identify reliability issues and risks. The market serves AI development teams, IT operati…

Braintrust named as Challenger among 15 other companies, including Arize, Weights & Biases, and Credo AI.

Research containing Braintrust

Get data-driven expert analysis from the CB Insights Intelligence Unit.

CB Insights Intelligence Analysts have mentioned Braintrust in 3 CB Insights research briefs, most recently on Sep 5, 2025.

Sep 5, 2025 report

Book of Scouting Reports: The AI Agent Tech Stack

May 16, 2025 report

Book of Scouting Reports: 2025’s AI 100

Apr 24, 2025 report

AI 100: The most promising artificial intelligence startups of 2025

Expert Collections containing Braintrust

Expert Collections are analyst-curated lists that highlight the companies you need to know in the most important technology spaces.

Braintrust is included in 4 Expert Collections, including Artificial Intelligence.

Artificial Intelligence

10,195 items

Generative AI

2,793 items

Companies working on generative AI applications and infrastructure.

AI 100 (2025)

100 items

AI 100 (All Winners 2018-2025)

100 items

Latest Braintrust News

Evals Reimagined: Braintrust’s Engineering Approach to AI Development

Aug 24, 2025

August 24, 2025, 8:50 am IDT Ankur Goyal, CEO of Braintrust, recently addressed attendees at the AI Engineer World’s Fair, outlining five hard-earned lessons for developing successful AI applications. His insights emphasized the indispensable role of robust evaluation systems, moving beyond superficial metrics to genuinely engineered approaches. The core message was clear: building impactful AI demands a sophisticated engineering mindset, particularly in how we assess and refine model performance. Effective evaluations are not incidental; they are deliberately constructed to reflect real-world performance. Goyal noted, “The most important property of a good dataset is that you can reconcile it with reality.” This means moving past purely synthetic data to continuously incorporate genuine user feedback, transforming complaints into actionable evaluation metrics. He stressed that evaluations should be proactive, used “to play offense” by identifying new use cases and predicting performance, rather than merely for regression testing. A mature evaluation system, for instance, should enable a product team to roll out an update incorporating a new model within 24 hours. The era of simple prompt engineering is waning, replaced by “context engineering.” This involves optimizing the entire informational context provided to a large language model (LLM), including meticulously defined tools and their outputs. Braintrust’s analysis reveals that a vast majority of the tokens in a typical prompt are not from the system prompt itself, but from tool definitions and, predominantly, tool responses—a significant 67.6%. This demands precision in how tools are structured and how their outputs are presented to the model, as even subtle changes, like shifting from JSON to YAML, can dramatically impact LLM comprehension and performance. Agility is paramount in the rapidly evolving AI landscape, where a new model can fundamentally alter product viability. Goyal highlighted how a feature that previously yielded only 10% performance with GPT 4o became 58% viable with Claude 4 Sonnet. Such dramatic shifts underscore the need for model-agnostic systems, allowing developers to quickly integrate and test new models without extensive code changes. This proactive approach ensures organizations are prepared to capitalize on sudden leaps in model capabilities. Braintrust’s new “Loop” feature directly addresses this need for holistic optimization, empowering developers to optimize the entire evaluation system, not just isolated prompts. The Loop feature allows users to auto-optimize prompts, datasets, and scorers directly within the platform. This comprehensive approach yields dramatically better results, as demonstrated by a benchmark showing an improvement from 8.9% (prompt only) to 39.14% when the dataset, prompt, and scorers are optimized together. This enables rapid, intentional iteration, ensuring AI applications are continuously aligned with evolving model capabilities and user needs.

Aug 15, 2025

6 startups con posibilidades de romper la barrera de los mil millones de dólares en 2025

Aug 9, 2025

Braintrust Unveils Loop, Automating AI Model Evaluation

Feb 21, 2025

Braintrust's seed round: $5m to build infrastructure for AI products - Blog - Braintrust

Nov 14, 2024

How custom evals get consistent results from LLM applications

Braintrust Frequently Asked Questions (FAQ)

When was Braintrust founded?
Braintrust was founded in 2023.
Where is Braintrust's headquarters?
Braintrust's headquarters is located at 548 Market Street, San Francisco.
What is Braintrust's latest funding round?
Braintrust's latest funding round is Series A.
How much did Braintrust raise?
Braintrust raised a total of $44.3M.
Who are the investors of Braintrust?
Investors of Braintrust include Andreessen Horowitz, Datadog, Databricks Ventures, Saam Motamedi, Elad Gil and 17 more.
Who are Braintrust's competitors?
Competitors of Braintrust include LatticeFlow AI, LangWatch, Arize, Align AI, Orq and 7 more.

Compare Braintrust to Competitors

Manot

Manot focuses on the reliability and optimization of large language models (LLMs) within the AI and machine learning sector. The company provides a platform that centralizes user interaction data, offers insights and analytics, and enables optimizations such as synthetic data generation and automated prompt adjustments to improve LLM performance. Manot serves enterprise clients aiming to personalize and optimize their AI systems to meet user needs. It was founded in 2021 and is based in Glendale, California.

Mesh HQ

Mesh HQ provides artificial intelligence (AI) driven developer tools within the technology sector. The company has an AI companion that captures live context from various development environments and collaboration tools, manages code snippets, and supports multiple large language models, while ensuring local data processing for control. Its products integrate with tools such as Chrome, VS Code, Visual Studio, JetBrains, and Obsidian, to assist in the coding process for developers. It was founded in 2020 and is based in Cincinnati, Ohio.

CrewAI

CrewAI develops technology related to multi-agent automation within the artificial intelligence sector. The company provides a platform for building, deploying, and managing AI agents that automate workflows across various industries. Its services include tools, templates for development, and tracking and optimization of AI agent performance. The company was founded in 2024 and is based in Middletown, Delaware.

Vellum

Vellum is a developer platform that provides tools for defining, evaluating, and monitoring AI solutions through a test-driven development approach, addressing the needs of engineers, product managers, and domain experts. Vellum serves sectors that require AI integration, including healthcare, finance, and customer service industries. It was founded in 2023 and is based in New York, New York.

LlamaIndex

LlamaIndex specializes in building artificial intelligence knowledge assistants. The company provides a framework and cloud services for developing context-augmented AI agents, which can parse complex documents, configure retrieval-augmented generation (RAG) pipelines, and integrate with various data sources. Its solutions apply to sectors such as finance, manufacturing, and information technology by offering tools for deploying AI agents and managing knowledge. LlamaIndex was formerly known as GPT Index. It was founded in 2023 and is based in Mountain View, California.

Arize

Arize provides tools for AI observability and LLM evaluation within the machine learning and artificial intelligence sectors. The company offers a platform for monitoring, diagnosing, and improving the performance of AI models and applications in production. Arize's tools are based on open-source standards and can integrate with existing AI infrastructure. It was founded in 2020 and is based in Mill Valley, California.

CBI websites generally use certain cookies to enable better interactions with our sites and services. Use of these cookies, which may be stored on your device, permits us to improve and customize your experience. You can read more about your cookie choices at our privacy policy here. By continuing to use this site you are consenting to these choices.

How VCs Use CB Insights

Professional Services

Platform Overview

Braintrust

Founded Year

Stage

Total Raised

Valuation

Last Raised

Mosaic Score
The Mosaic Score is an algorithm that measures the overall financial health and market potential of private companies.

About Braintrust

Headquarters Location

ESPs containing Braintrust

Research containing Braintrust

Expert Collections containing Braintrust

Artificial Intelligence

Generative AI

AI 100 (2025)

AI 100 (All Winners 2018-2025)

Latest Braintrust News

Braintrust Frequently Asked Questions (FAQ)

Compare Braintrust to Competitors

How VCs Use CB Insights

Professional Services

Platform Overview

Founded Year

Stage

Total Raised

Valuation

Last Raised

Mosaic Score The Mosaic Score is an algorithm that measures the overall financial health and market potential of private companies.

About Braintrust

Headquarters Location

ESPs containing Braintrust

Research containing Braintrust

Expert Collections containing Braintrust

Artificial Intelligence

Generative AI

AI 100 (2025)

AI 100 (All Winners 2018-2025)

Latest Braintrust News

Braintrust Frequently Asked Questions (FAQ)

Compare Braintrust to Competitors

Mosaic Score
The Mosaic Score is an algorithm that measures the overall financial health and market potential of private companies.