TabPFN-3 just released: a pre-trained tabular foundation model for up to 1M rows [R][N]

Our take

Released today, TabPFN-3 is a pre-trained tabular foundation model that handles up to 1 million rows and predicts in a single forward pass, with no training or hyperparameter tuning. Compared with its predecessors it improves scale, speed, and accuracy, and it reports a 93% win rate over classical machine learning on TabArena. It is available through an API, enterprise licensing, and open-source weights.

The release of TabPFN-3 is a significant advance in tabular data processing. The model predicts on tabular data in a single forward pass, without training or hyperparameter tuning, which makes it attractive to data analysts and scientists who would otherwise spend time fitting and tuning models. As organizations increasingly turn to AI-native solutions, models like TabPFN-3 offer an alternative to traditional tools that struggle with modern data complexity. This connects with themes in our coverage, such as the problems with traditional workflows highlighted in Excel Crashes w/ ODBC Query After Copilot Integration and the potential of AI-native workflows discussed in I Let CodeSpeak Take Over My Repository.

TabPFN-3 raises the supported scale to 1 million rows, a tenfold increase over its predecessor. A reduced KV cache and row-chunked inference make this practical on a single GPU, so practitioners can work with larger datasets without the cost and technical barriers that previously limited their analysis. Inference is reported to be 10x to 1000x faster than in previous versions, which makes the model a more viable option for real-time data processing scenarios. These gains matter more as organizations integrate AI into their workflows.
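To make the memory claim concrete, here is a rough Python sketch. It takes the figure of about 8 GB of KV cache per million rows per estimator from the release notes and assumes, purely for illustration, that the cache grows linearly with the number of training rows. The names `kv_cache_gb`, `row_chunks`, and `predict_in_chunks` are hypothetical and are not part of the TabPFN API. The chunking helpers show the general idea behind row-chunked inference: rows to predict are scored in fixed-size slices, so peak memory depends on the chunk size and not on the total number of rows.

```python
from typing import Callable, Iterator, List, Sequence, TypeVar

T = TypeVar("T")
R = TypeVar("R")

# Approximate figure quoted in the release notes (GB per million rows per estimator).
GB_PER_MILLION_ROWS = 8.0


def kv_cache_gb(train_rows: int, n_estimators: int = 1) -> float:
    """Estimate KV-cache size in GB, assuming linear scaling with rows (an assumption)."""
    return GB_PER_MILLION_ROWS * (train_rows / 1_000_000) * n_estimators


def row_chunks(rows: Sequence[T], chunk_size: int) -> Iterator[Sequence[T]]:
    """Yield consecutive slices of at most chunk_size rows."""
    if chunk_size <= 0:
        raise ValueError("chunk_size must be positive")
    for start in range(0, len(rows), chunk_size):
        yield rows[start:start + chunk_size]


def predict_in_chunks(
    predict_fn: Callable[[Sequence[T]], Sequence[R]],
    rows: Sequence[T],
    chunk_size: int,
) -> List[R]:
    """Apply predict_fn to each chunk and concatenate the results in row order."""
    out: List[R] = []
    for chunk in row_chunks(rows, chunk_size):
        out.extend(predict_fn(chunk))
    return out
```

Under the linear-scaling assumption, `kv_cache_gb(1_000_000, n_estimators=4)` gives 32.0 GB. The real memory footprint also depends on factors not covered here, such as feature count and activation memory.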

TabPFN-3 also introduces Thinking Mode, which uses additional test-time compute to improve predictions. Alongside this, the model reports a 93% win rate over classical machine learning on the TabArena benchmark. The design invites users to explore and experiment with the model instead of consuming fixed outputs. This emphasis on adaptability matches broader industry trends, as seen in the funding raised by Wirestock to supply multimodal data to AI labs, covered in Wirestock raises $23M to supply creative multimodal data to AI labs.

Looking ahead, the introduction of TabPFN-3 not only sets a new standard for tabular data processing but also raises important questions about the future of data management technologies. As users embrace these innovative solutions, it will be critical to observe how they adapt their workflows and strategies in response to such advancements. Will traditional tools fade into obsolescence, or can they evolve to meet the new demands of an increasingly data-driven world? As organizations continue to navigate this shifting landscape, it will be fascinating to witness which paths they choose to explore and how these decisions will shape the future of data analytics.

TabPFN-3, the next iteration of the tabular foundation model originally published in Nature, was released today.

Quick recap for anyone new to TabPFN: TabPFN predicts on tabular data in a single forward pass, with no training, no hyperparameter search, and no tuning. TabPFN-3 builds on TabPFN-2.5 (Nov 2025) and TabPFNv2 (Nature, Jan 2025), which together crossed 3M downloads and 200+ published applications.

What's new:

Scale: 1M rows on a single H100 (10x larger than 2.5). A reduced KV cache (~8GB per million rows per estimator) and row-chunked inference make this practical on a single GPU.
Speed: 10x-1000x faster inference than previous versions. 120x on SHAP via KV caching.
Thinking Mode (API only): test-time compute pushes predictions further via one-time extra fitting at inference. Beats every non-TabPFN method on TabArena by over 200 Elo, including 4-hour-tuned AutoGluon 1.5 extreme. Gap more than doubles to 420 Elo on the larger-data slice.
Accuracy: 93% win rate over classical ML on TabArena.
Many-class: native non-parametric retrieval decoder supporting up to 160 classes.
Calibrated quantile regression: bar-distribution regression head produces calibrated quantile predictions in a single forward pass.
Lifts adjacent tasks: time-series, interpretability, and new SOTA on relational benchmarks.
3 deployment paths: API, enterprise licensing, and open-source weights (permissive for research and academic evaluation).
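
For readers less familiar with Elo, the gaps quoted for Thinking Mode can be translated into expected head-to-head scores. The sketch below uses the standard logistic Elo formula with a 400-point scale. Whether the TabArena leaderboard uses exactly this calibration is an assumption, so treat the numbers as a rough guide. The function name `elo_expected_score` is illustrative.

```python
def elo_expected_score(elo_gap: float, scale: float = 400.0) -> float:
    """Expected score of the higher-rated method under the standard Elo model.

    elo_gap is the rating difference (higher minus lower). The 400-point
    scale is the conventional default and is assumed here, not confirmed
    for TabArena.
    """
    return 1.0 / (1.0 + 10.0 ** (-elo_gap / scale))


# Gaps quoted in the announcement: over 200 Elo overall, 420 on larger data.
for gap in (200, 420):
    print(f"{gap} Elo gap -> expected score {elo_expected_score(gap):.2f}")
```

Under this assumption, a 200-point gap corresponds to an expected score of about 0.76 and a 420-point gap to about 0.92. This is a different quantity from the 93% win rate over classical ML, which is reported separately.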

You can try it here or read the model report here. Happy to answer questions in the comments.

submitted by /u/rsesrsfh